SlideShare ist ein Scribd-Unternehmen logo
1 von 39
Mobile Network Failure Event Detection and
Forecasting with Multiple User Activity Data Sets
Yukio UEMATSU
02/05/2018 IAAI 2018
Koh TAKEUCHIMotoyuki OKI
Providing stable and high quality service is a critical
issue for mobile network service providers
Introduction 2
Providing stable and high quality service is a critical
issue for mobile network service providers
Network failures …
– L service outage
– L degrade user satisfaction
Introduction 3
Alert
Users
L Error ?
LI can’t
connect. Outage ?
Providing stable and high quality service is a critical
issue for mobile network service providers
Network failures …
– L service outage
– L degrade user satisfaction
Introduction
Alert
Users
L Error ?
LI can’t
connect. Outage ?
Tweet Search Call
Web AccessRSS
News
Publish user’s impressions
on various data sources
4
Providing stable and high quality service is a critical
issue for mobile network service providers
Network failures …
– L service outage
– L degrade user satisfaction
Introduction
Alert
Users
L Error ?
LI can’t
connect. Outage ?
Tweet Search Call
Web AccessRSS
News
Publish user’s impressions
on various data sources
5
User activity-based
failure detection
and forecasting
• Monitoring network traffic and logs [Gill+2011,
Brutlag2000]
Failure Detection and Forecasting
Failure Detection
6
• Monitoring network traffic and logs [Gill+2011,
Brutlag2000]
• Using tweet data
– Users detect outages before providers does it [Qiu+2010]
– A keyword filter and SVM [Takeshita+2015]
Failure Detection and Forecasting
Failure Detection
7
• Monitoring network traffic and logs [Gill+2011,
Brutlag2000]
• Using tweet data
– Users detect outages before providers does it [Qiu+2010]
– A keyword filter and SVM [Takeshita+2015]
- Influenza, airport threats, and civil unrest, etc.
- Represent different aspects of the user activity
Failure Detection and Forecasting
Failure Detection
+
8
Event Detection based on multiple data sets
• Monitoring network traffic and logs [Gill+2011,
Brutlag2000]
• Using tweet data
– Users detect outages before providers does it [Qiu+2010]
– A keyword filter and SVM [Takeshita+2015]
- Influenza, airport threats, and civil unrest, etc.
- Represent different aspects of the user activity
Failure Detection and Forecasting
Focus network failure detection and forecasting
using multiple user activity data sets
Failure Detection
+
Event Detection based on multiple data sets
9
• Number of observations in each data
– Social (Tweets) and Telecom (Web Access Logs and Search
Queries) data sets
User Activity Data Sets 10
• Number of observations in each data
– Social (Tweets) and Telecom (Web Access Logs and Search
Queries) data sets
User Activity Data Sets
Zoom
Change ?
11
• Framework for failure detection and forecasting
from multiple user activity data sets
– Feature construction methods and model ensemble
method
• Extensive experiments using real-world multiple
data sets
Research Contributions 12
• To estimate a prediction model that predict a
failure event from feature vectors
– J Simple binary classification problem
• Two variables
–𝒙" = (𝑥&,",, … , 𝑥),",) ∈ ℝ) is a feature vector of user
activity data on the timestamp 𝑡	 ∈ 1, … , 𝑇
–𝑦" ∈ 1, −1 is a label
• whether an event occurs at timestamp 𝑡	 or not
Problem Formulation 13
𝒙" : Three data sets
Data Sets and Failure Events 14
𝒙" : Three data sets
Data Sets and Failure Events
𝒚" : Nine failure events
15
𝒙" : Three data sets
Data Sets and Failure Events
Five training and test data sets
𝒚" : Nine failure events
+
16
𝒙" : Three data sets
Data Sets and Failure Events
Five training and test data sets
𝒚" : Nine failure events
Special characteristics
L Imbalanced labels (see Ratio)
L Very sparse (see Sparsity)
+
17
𝒙" : Three data sets
Data Sets and Failure Events
Five training and test data sets
𝒚" : Nine failure events
Special characteristics
L Imbalanced labels (see Ratio) ⇒ Bi-normal Separator*
L Very sparse (see Sparsity) ⇒ Simple Moving Average*
[*] See our paper
+
18
• (1) Entire period (EP) detection : Entire periods of Test
Three Key Tasks for Failure Detection
Training : 210 days Test : 15 days
:Failure Event
start end
(1) EP
19
duration 60 mins
• (1) Entire period (EP) detection : Entire periods of Test
• (2) Early detection (ED) : Only 60 mins after a failure event
and the rest interval
Three Key Tasks for Failure Detection
Training : 210 days Test : 15 days
:Failure Event
(2) Early Detection (60 min) start end
(1) EP
20
• (1) Entire period (EP) detection : Entire periods of Test
• (2) Early detection (ED) : Only 60 mins after a failure event
and the rest interval
• (3) Forecast : failures in α minutes
Three Key Tasks for Failure Detection
duration 60 mins
Training : 210 days Test : 15 days
:Failure Event
(2) Early Detection (60 min)
(3) Forecast
start end
(1) EP
t = 1 t = T
duration α min
t =T +1 t =T +T’
21
• Experiment 1 (Two tasks : EP and ED)
– Comparison of multiple classification and anomaly detection
models in respect to AUC
– LR(Logistic Regression) / ADA (Adaboost of decision
stamps) / RF (Random Forest) / NN (Neural Network) /
OCS (One Class SVM) / AE (Auto Encoder)
• Experiment 2 (Two tasks : EP and ED)
– Effective approach to combine multiple models and data
sets
• Experiment 3 (One task : Forecast)
– Forecasting performance
Experiment Outline 22
Single data set is not sufficient
• Experiment 1
– Tasks : EP and ED
- Comparison methods : LR(Logistic Regression) / ADA
(Adaboost of decision stamps) / RF (Random Forest) / NN
(Neural Network) / OCS (One Class SVM) / AE (Auto Encoder)
Results : Optimal combinations of models and data are different
23
Single data set is not sufficient
• Experiment 1
– Tasks : EP and ED
- Comparison methods : LR(Logistic Regression) / ADA
(Adaboost of decision stamps) / RF (Random Forest) / NN
(Neural Network) / OCS (One Class SVM) / AE (Auto Encoder)
Results : Optimal combinations of models and data are different
24
Tweet data detects failures early
• Experiment 1
– Tasks : EP and ED
Assumption : Post tweets, then access web pages
25
…
Tweet data detects failures early
• Experiment 1
– Tasks : EP and ED
Assumption : Post tweets, then access web pages
26
…
Tweet data detects failures early
• Experiment 1
– Tasks : EP and ED
Assumption : Post tweets, then access web pages
27
…
• A model ensemble approach to effective
combinations of data sets and models
Combine multiple user activity data sets
LR 𝒑"
LR
RF
AE
AE
{(𝒙", 𝑦")}"7&:9
{(𝒙", 𝑦")}"7&:9
{𝒙"}"7&:9
:
{(𝒑", 𝑦")}"7&:9
^
𝑓(;)
𝚳 𝟏
𝚳 𝟐
𝚳 𝟑
𝚳 𝟑
𝑫 𝟏
𝑫 𝟐
𝑫 𝟑
Level 1 Level 2
𝒑"		: Prediction scores of failure events of model 𝚳𝒊	and
data sets 𝑫𝒋
28
• Experiment 2
– Tasks : EP and 60min
– Model : ME
– Comparison method : Data Ensemble (DE)
• Results
Model ensemble method achieved best scores
LR 𝒑"
29
concatenation
Prediction scores (Test ID : 9) 30
ME suppress cases of false negatives 31
Effective combination of Tweets and Web access logs
Rapid rise
Delayed rise
32
• Experiment 3
– Task:Forecast
– Model : ME
– Comparison methods : DE and LR on each dataset
User activity can forecast future failures 33
• Experiment 3
– Task:Forecast
– Model : ME
– Comparison methods : DE and LR on each dataset
• Results : ME showed best performance
User activity can forecast future failures 34
• Summary
– Proposed a multiple user activity-based failure detection
and forecasting framework
– Demonstrated that our proposed methods improved AUC
scores through extensive experiments
• Future Works
– Utilizing additional data sets
– Deep neural network (e.g., LSTM model)
– Feature Analysis
– Deployment application to real-time monitoring system
Conclusions
Thank you !
35
Note
• Framework for failure detection and forecasting
• Extensive experiments using real-world multiple data sets
Research Contributions
Overview of our framework
Users
Service Provider
Failure Event
𝒙 𝒕 : Multiple User Activity Data
Error ?
I can’t
connect.
Tweet Search Call
Web Access
𝑦" : Failure
Reports
Detection and Forecasting !
Social Data Telecom Data
Alert
Outage ?
RSS
News
Reporting
37
• First method : Simple Moving Average (SMA)
• Second method : BNS (Bi-Normal Separator [Forman2003])
Method : Feature Construction
L Sparseness of user activities
à Aggregate each feature observed from the past time
(t − S) to an average value in a time t
L Imbalanced labels
→ Feature scaling for imbalanced dataset
38
𝐹E&
; : inverse cumulative
normal distribution
tpr : True positive rate
fpr : False positive rate
𝑡𝑓𝑏𝑛𝑠 𝑥I," = 𝑡𝑓 𝑥I," ×𝑏𝑛𝑠 𝑑I
𝑡𝑓 𝑥I," =
𝑥I,"
∑ 𝑥I,"
)
I7&
𝑏𝑛𝑠 𝑑I = 𝐹E&
𝑡𝑝𝑟 − 𝐹E&
𝑓𝑝𝑟
• Experimental Setup
– Evaluation Set : EP
– Model : Logistic regression with ridge regularization
– Comparison methods
• td(original features)/ tf-idf ([Salton+1986]) / tf-bns / +sma
(simple moving average)
• Results : J Improved AUC by Average 31%.
– SMA for time series features and BNS Scaling for imbalanced
dataset are effectiveness.
Experiment :Effects of Feature Construction
* The values are Average and standard deviation of AUC.
+ Average 31%
39

Weitere ähnliche Inhalte

Ähnlich wie 18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting with Multiple User Activity Data Sets

Performance Evaluation of a Network Using Simulation Tools or Packet Tracer
Performance Evaluation of a Network Using Simulation Tools or Packet TracerPerformance Evaluation of a Network Using Simulation Tools or Packet Tracer
Performance Evaluation of a Network Using Simulation Tools or Packet Tracer
IOSRjournaljce
 
Real-time Classification of Malicious URLs on Twitter using Machine Activity ...
Real-time Classification of Malicious URLs on Twitter using Machine Activity ...Real-time Classification of Malicious URLs on Twitter using Machine Activity ...
Real-time Classification of Malicious URLs on Twitter using Machine Activity ...
Pete Burnap
 
A Hybrid Theory Of Power Theft Detection
A Hybrid Theory Of Power Theft DetectionA Hybrid Theory Of Power Theft Detection
A Hybrid Theory Of Power Theft Detection
Camella Taylor
 
OORPT Dynamic Analysis
OORPT Dynamic AnalysisOORPT Dynamic Analysis
OORPT Dynamic Analysis
lienhard
 
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...
Sc Huang
 
Cikm 2013 - Beyond Data From User Information to Business Value
Cikm 2013 - Beyond Data From User Information to Business ValueCikm 2013 - Beyond Data From User Information to Business Value
Cikm 2013 - Beyond Data From User Information to Business Value
Xavier Amatriain
 
A practical guide for using Statistical Tests to assess Randomized Algorithms...
A practical guide for using Statistical Tests to assess Randomized Algorithms...A practical guide for using Statistical Tests to assess Randomized Algorithms...
A practical guide for using Statistical Tests to assess Randomized Algorithms...
Lionel Briand
 
NetBrain-in-Action
NetBrain-in-ActionNetBrain-in-Action
NetBrain-in-Action
Ken Reiff
 

Ähnlich wie 18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting with Multiple User Activity Data Sets (20)

Performance Evaluation of a Network Using Simulation Tools or Packet Tracer
Performance Evaluation of a Network Using Simulation Tools or Packet TracerPerformance Evaluation of a Network Using Simulation Tools or Packet Tracer
Performance Evaluation of a Network Using Simulation Tools or Packet Tracer
 
Automated Discovery of Performance Regressions in Enterprise Applications
Automated Discovery of Performance Regressions in Enterprise ApplicationsAutomated Discovery of Performance Regressions in Enterprise Applications
Automated Discovery of Performance Regressions in Enterprise Applications
 
Real-time Classification of Malicious URLs on Twitter using Machine Activity ...
Real-time Classification of Malicious URLs on Twitter using Machine Activity ...Real-time Classification of Malicious URLs on Twitter using Machine Activity ...
Real-time Classification of Malicious URLs on Twitter using Machine Activity ...
 
Mining System Logs to Learn Error Predictors, Universität Stuttgart, Stuttgar...
Mining System Logs to Learn Error Predictors, Universität Stuttgart, Stuttgar...Mining System Logs to Learn Error Predictors, Universität Stuttgart, Stuttgar...
Mining System Logs to Learn Error Predictors, Universität Stuttgart, Stuttgar...
 
Test Tool for Industrial Ethernet Network Performance (June 2009)
Test Tool for Industrial Ethernet Network Performance (June 2009)Test Tool for Industrial Ethernet Network Performance (June 2009)
Test Tool for Industrial Ethernet Network Performance (June 2009)
 
Detecting Hacks: Anomaly Detection on Networking Data
Detecting Hacks: Anomaly Detection on Networking DataDetecting Hacks: Anomaly Detection on Networking Data
Detecting Hacks: Anomaly Detection on Networking Data
 
Workshop BI/DWH AGILE TESTING SNS Bank English
Workshop BI/DWH AGILE TESTING SNS Bank EnglishWorkshop BI/DWH AGILE TESTING SNS Bank English
Workshop BI/DWH AGILE TESTING SNS Bank English
 
A Hybrid Theory Of Power Theft Detection
A Hybrid Theory Of Power Theft DetectionA Hybrid Theory Of Power Theft Detection
A Hybrid Theory Of Power Theft Detection
 
Ch15 software reliability
Ch15 software reliabilityCh15 software reliability
Ch15 software reliability
 
OORPT Dynamic Analysis
OORPT Dynamic AnalysisOORPT Dynamic Analysis
OORPT Dynamic Analysis
 
Three Laws of Trusted Data Sharing: (Building a Better Business Case for Dat...
Three Laws of Trusted Data Sharing:(Building a Better Business Case for Dat...Three Laws of Trusted Data Sharing:(Building a Better Business Case for Dat...
Three Laws of Trusted Data Sharing: (Building a Better Business Case for Dat...
 
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...
Random Walk by User Trust and Temporal Issues toward Sparsity Problem in Soci...
 
Automatic Analyzing System for Packet Testing and Fault Mapping
Automatic Analyzing System for Packet Testing and Fault MappingAutomatic Analyzing System for Packet Testing and Fault Mapping
Automatic Analyzing System for Packet Testing and Fault Mapping
 
Cikm 2013 - Beyond Data From User Information to Business Value
Cikm 2013 - Beyond Data From User Information to Business ValueCikm 2013 - Beyond Data From User Information to Business Value
Cikm 2013 - Beyond Data From User Information to Business Value
 
A practical guide for using Statistical Tests to assess Randomized Algorithms...
A practical guide for using Statistical Tests to assess Randomized Algorithms...A practical guide for using Statistical Tests to assess Randomized Algorithms...
A practical guide for using Statistical Tests to assess Randomized Algorithms...
 
NetBrain-in-Action
NetBrain-in-ActionNetBrain-in-Action
NetBrain-in-Action
 
Observability in highly distributed systems
Observability in highly distributed systemsObservability in highly distributed systems
Observability in highly distributed systems
 
Applications of Machine Learning and Metaheuristic Search to Security Testing
Applications of Machine Learning and Metaheuristic Search to Security TestingApplications of Machine Learning and Metaheuristic Search to Security Testing
Applications of Machine Learning and Metaheuristic Search to Security Testing
 
Test Data, Information, Knowledge, Wisdom: past, present & future of standing...
Test Data, Information, Knowledge, Wisdom: past, present & future of standing...Test Data, Information, Knowledge, Wisdom: past, present & future of standing...
Test Data, Information, Knowledge, Wisdom: past, present & future of standing...
 
Metrics
MetricsMetrics
Metrics
 

Mehr von LINE Corp.

14.09.12_インターネット測定システムの開発と運用
14.09.12_インターネット測定システムの開発と運用14.09.12_インターネット測定システムの開発と運用
14.09.12_インターネット測定システムの開発と運用
LINE Corp.
 
13.11.15_野村総研マーケティング分析コンテスト2013(佳作賞)_Facebookとtwitterの利用者に対する消費行動分析とそれに基づく広告...
13.11.15_野村総研マーケティング分析コンテスト2013(佳作賞)_Facebookとtwitterの利用者に対する消費行動分析とそれに基づく広告...13.11.15_野村総研マーケティング分析コンテスト2013(佳作賞)_Facebookとtwitterの利用者に対する消費行動分析とそれに基づく広告...
13.11.15_野村総研マーケティング分析コンテスト2013(佳作賞)_Facebookとtwitterの利用者に対する消費行動分析とそれに基づく広告...
LINE Corp.
 
Rによるテキストマイニングの一例
Rによるテキストマイニングの一例Rによるテキストマイニングの一例
Rによるテキストマイニングの一例
LINE Corp.
 
Rによる決定木解析の一例
Rによる決定木解析の一例Rによる決定木解析の一例
Rによる決定木解析の一例
LINE Corp.
 
11.12.06_論文紹介_Classification systems based on rough sets under the belief fun...
11.12.06_論文紹介_Classification systems based on rough sets under the belief fun...11.12.06_論文紹介_Classification systems based on rough sets under the belief fun...
11.12.06_論文紹介_Classification systems based on rough sets under the belief fun...
LINE Corp.
 
13.12.21_大ヒットの方程式数理モデル解説
13.12.21_大ヒットの方程式数理モデル解説13.12.21_大ヒットの方程式数理モデル解説
13.12.21_大ヒットの方程式数理モデル解説
LINE Corp.
 
13.12.07 CIKM2013読み会
13.12.07 CIKM2013読み会13.12.07 CIKM2013読み会
13.12.07 CIKM2013読み会
LINE Corp.
 
11.05.10_論文紹介_Can Bayesian Confirmation Measures be Useful for Rough Set Deci...
11.05.10_論文紹介_Can Bayesian Confirmation Measures be Useful for Rough Set Deci...11.05.10_論文紹介_Can Bayesian Confirmation Measures be Useful for Rough Set Deci...
11.05.10_論文紹介_Can Bayesian Confirmation Measures be Useful for Rough Set Deci...
LINE Corp.
 
11.11.27_2011年度数理システム学生研究奨励賞「佳作賞」_ニュースサイトにおける大学生の嗜好要因の特定とサイト制作への応用_
11.11.27_2011年度数理システム学生研究奨励賞「佳作賞」_ニュースサイトにおける大学生の嗜好要因の特定とサイト制作への応用_11.11.27_2011年度数理システム学生研究奨励賞「佳作賞」_ニュースサイトにおける大学生の嗜好要因の特定とサイト制作への応用_
11.11.27_2011年度数理システム学生研究奨励賞「佳作賞」_ニュースサイトにおける大学生の嗜好要因の特定とサイト制作への応用_
LINE Corp.
 
12.11.06_2012年度S-PLUS学生研究奨励賞「佳作賞」_数量化理論第Ⅱ類とアソシエーションルール解析による自動車バナーデザインの分析
12.11.06_2012年度S-PLUS学生研究奨励賞「佳作賞」_数量化理論第Ⅱ類とアソシエーションルール解析による自動車バナーデザインの分析12.11.06_2012年度S-PLUS学生研究奨励賞「佳作賞」_数量化理論第Ⅱ類とアソシエーションルール解析による自動車バナーデザインの分析
12.11.06_2012年度S-PLUS学生研究奨励賞「佳作賞」_数量化理論第Ⅱ類とアソシエーションルール解析による自動車バナーデザインの分析
LINE Corp.
 
12.10.22_論文紹介_Measuring expected effects of interventions based on decision r...
12.10.22_論文紹介_Measuring expected effects of interventions based on decision r...12.10.22_論文紹介_Measuring expected effects of interventions based on decision r...
12.10.22_論文紹介_Measuring expected effects of interventions based on decision r...
LINE Corp.
 

Mehr von LINE Corp. (20)

18.07.11_useR2018 Poster_Time Series Digger : Automatic time series analysis ...
18.07.11_useR2018 Poster_Time Series Digger : Automatic time series analysis ...18.07.11_useR2018 Poster_Time Series Digger : Automatic time series analysis ...
18.07.11_useR2018 Poster_Time Series Digger : Automatic time series analysis ...
 
17.04.27_JSAI Cup 2017_5th winner's solution
17.04.27_JSAI Cup 2017_5th winner's solution17.04.27_JSAI Cup 2017_5th winner's solution
17.04.27_JSAI Cup 2017_5th winner's solution
 
17.11.29_論文紹介_BRACID a comprehensive approach to learning rules from imbalanc...
17.11.29_論文紹介_BRACID a comprehensive approach to learning rules from imbalanc...17.11.29_論文紹介_BRACID a comprehensive approach to learning rules from imbalanc...
17.11.29_論文紹介_BRACID a comprehensive approach to learning rules from imbalanc...
 
17.04.21_論文紹介_Decision bireducts and decision reducts - a comparison
17.04.21_論文紹介_Decision bireducts and decision reducts - a comparison17.04.21_論文紹介_Decision bireducts and decision reducts - a comparison
17.04.21_論文紹介_Decision bireducts and decision reducts - a comparison
 
17.01.18_論文紹介_Discrimination- and privacy-aware patterns
17.01.18_論文紹介_Discrimination- and privacy-aware patterns17.01.18_論文紹介_Discrimination- and privacy-aware patterns
17.01.18_論文紹介_Discrimination- and privacy-aware patterns
 
16.02.08_Hadoop Conferece Japan 2016_データサイエンスにおける一次可視化からのSpark on Elasticsear...
16.02.08_Hadoop Conferece Japan 2016_データサイエンスにおける一次可視化からのSpark on Elasticsear...16.02.08_Hadoop Conferece Japan 2016_データサイエンスにおける一次可視化からのSpark on Elasticsear...
16.02.08_Hadoop Conferece Japan 2016_データサイエンスにおける一次可視化からのSpark on Elasticsear...
 
15.05.21_ビッグデータ分析基盤Sparkの最新動向とその活用-Spark SUMMIT EAST 2015-
15.05.21_ビッグデータ分析基盤Sparkの最新動向とその活用-Spark SUMMIT EAST 2015-15.05.21_ビッグデータ分析基盤Sparkの最新動向とその活用-Spark SUMMIT EAST 2015-
15.05.21_ビッグデータ分析基盤Sparkの最新動向とその活用-Spark SUMMIT EAST 2015-
 
13.03.09_決定ルール解析のための頑健性指標
13.03.09_決定ルール解析のための頑健性指標13.03.09_決定ルール解析のための頑健性指標
13.03.09_決定ルール解析のための頑健性指標
 
14.09.12_インターネット測定システムの開発と運用
14.09.12_インターネット測定システムの開発と運用14.09.12_インターネット測定システムの開発と運用
14.09.12_インターネット測定システムの開発と運用
 
13.11.15_野村総研マーケティング分析コンテスト2013(佳作賞)_Facebookとtwitterの利用者に対する消費行動分析とそれに基づく広告...
13.11.15_野村総研マーケティング分析コンテスト2013(佳作賞)_Facebookとtwitterの利用者に対する消費行動分析とそれに基づく広告...13.11.15_野村総研マーケティング分析コンテスト2013(佳作賞)_Facebookとtwitterの利用者に対する消費行動分析とそれに基づく広告...
13.11.15_野村総研マーケティング分析コンテスト2013(佳作賞)_Facebookとtwitterの利用者に対する消費行動分析とそれに基づく広告...
 
14.01.31_アドベンチャー杯2013_情報工学分野と他の研究分野間の関連度分析
14.01.31_アドベンチャー杯2013_情報工学分野と他の研究分野間の関連度分析14.01.31_アドベンチャー杯2013_情報工学分野と他の研究分野間の関連度分析
14.01.31_アドベンチャー杯2013_情報工学分野と他の研究分野間の関連度分析
 
Rによるテキストマイニングの一例
Rによるテキストマイニングの一例Rによるテキストマイニングの一例
Rによるテキストマイニングの一例
 
Rによる決定木解析の一例
Rによる決定木解析の一例Rによる決定木解析の一例
Rによる決定木解析の一例
 
11.12.06_論文紹介_Classification systems based on rough sets under the belief fun...
11.12.06_論文紹介_Classification systems based on rough sets under the belief fun...11.12.06_論文紹介_Classification systems based on rough sets under the belief fun...
11.12.06_論文紹介_Classification systems based on rough sets under the belief fun...
 
13.12.21_大ヒットの方程式数理モデル解説
13.12.21_大ヒットの方程式数理モデル解説13.12.21_大ヒットの方程式数理モデル解説
13.12.21_大ヒットの方程式数理モデル解説
 
13.12.07 CIKM2013読み会
13.12.07 CIKM2013読み会13.12.07 CIKM2013読み会
13.12.07 CIKM2013読み会
 
11.05.10_論文紹介_Can Bayesian Confirmation Measures be Useful for Rough Set Deci...
11.05.10_論文紹介_Can Bayesian Confirmation Measures be Useful for Rough Set Deci...11.05.10_論文紹介_Can Bayesian Confirmation Measures be Useful for Rough Set Deci...
11.05.10_論文紹介_Can Bayesian Confirmation Measures be Useful for Rough Set Deci...
 
11.11.27_2011年度数理システム学生研究奨励賞「佳作賞」_ニュースサイトにおける大学生の嗜好要因の特定とサイト制作への応用_
11.11.27_2011年度数理システム学生研究奨励賞「佳作賞」_ニュースサイトにおける大学生の嗜好要因の特定とサイト制作への応用_11.11.27_2011年度数理システム学生研究奨励賞「佳作賞」_ニュースサイトにおける大学生の嗜好要因の特定とサイト制作への応用_
11.11.27_2011年度数理システム学生研究奨励賞「佳作賞」_ニュースサイトにおける大学生の嗜好要因の特定とサイト制作への応用_
 
12.11.06_2012年度S-PLUS学生研究奨励賞「佳作賞」_数量化理論第Ⅱ類とアソシエーションルール解析による自動車バナーデザインの分析
12.11.06_2012年度S-PLUS学生研究奨励賞「佳作賞」_数量化理論第Ⅱ類とアソシエーションルール解析による自動車バナーデザインの分析12.11.06_2012年度S-PLUS学生研究奨励賞「佳作賞」_数量化理論第Ⅱ類とアソシエーションルール解析による自動車バナーデザインの分析
12.11.06_2012年度S-PLUS学生研究奨励賞「佳作賞」_数量化理論第Ⅱ類とアソシエーションルール解析による自動車バナーデザインの分析
 
12.10.22_論文紹介_Measuring expected effects of interventions based on decision r...
12.10.22_論文紹介_Measuring expected effects of interventions based on decision r...12.10.22_論文紹介_Measuring expected effects of interventions based on decision r...
12.10.22_論文紹介_Measuring expected effects of interventions based on decision r...
 

Kürzlich hochgeladen

Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
amitlee9823
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
amitlee9823
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
amitlee9823
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
amitlee9823
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
amitlee9823
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
amitlee9823
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
JoseMangaJr1
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
karishmasinghjnh
 
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
amitlee9823
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
amitlee9823
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
amitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
only4webmaster01
 

Kürzlich hochgeladen (20)

Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
Call Girls Jalahalli Just Call 👗 7737669865 👗 Top Class Call Girl Service Ban...
 
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men  🔝malwa🔝   Escorts Ser...
➥🔝 7737669865 🔝▻ malwa Call-girls in Women Seeking Men 🔝malwa🔝 Escorts Ser...
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men  🔝Mathura🔝   Escorts...
➥🔝 7737669865 🔝▻ Mathura Call-girls in Women Seeking Men 🔝Mathura🔝 Escorts...
 
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Marol Naka Call On 9920725232 With Body to body massage...
 
Detecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning ApproachDetecting Credit Card Fraud: A Machine Learning Approach
Detecting Credit Card Fraud: A Machine Learning Approach
 
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get CytotecAbortion pills in Doha Qatar (+966572737505 ! Get Cytotec
Abortion pills in Doha Qatar (+966572737505 ! Get Cytotec
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
Call Girls Hsr Layout Just Call 👗 7737669865 👗 Top Class Call Girl Service Ba...
 
Probability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter LessonsProbability Grade 10 Third Quarter Lessons
Probability Grade 10 Third Quarter Lessons
 
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
👉 Amritsar Call Girl 👉📞 6367187148 👉📞 Just📲 Call Ruhi Call Girl Phone No Amri...
 
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
VIP Model Call Girls Hinjewadi ( Pune ) Call ON 8005736733 Starting From 5K t...
 
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men  🔝Thrissur🔝   Escor...
➥🔝 7737669865 🔝▻ Thrissur Call-girls in Women Seeking Men 🔝Thrissur🔝 Escor...
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
Escorts Service Kumaraswamy Layout ☎ 7737669865☎ Book Your One night Stand (B...
 
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men  🔝Dindigul🔝   Escor...
➥🔝 7737669865 🔝▻ Dindigul Call-girls in Women Seeking Men 🔝Dindigul🔝 Escor...
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 

18.02.05_IAAI2018_Mobille Network Failure Event Detection and Forecasting with Multiple User Activity Data Sets

  • 1. Mobile Network Failure Event Detection and Forecasting with Multiple User Activity Data Sets Yukio UEMATSU 02/05/2018 IAAI 2018 Koh TAKEUCHIMotoyuki OKI
  • 2. Providing stable and high quality service is a critical issue for mobile network service providers Introduction 2
  • 3. Providing stable and high quality service is a critical issue for mobile network service providers Network failures … – L service outage – L degrade user satisfaction Introduction 3 Alert Users L Error ? LI can’t connect. Outage ?
  • 4. Providing stable and high quality service is a critical issue for mobile network service providers Network failures … – L service outage – L degrade user satisfaction Introduction Alert Users L Error ? LI can’t connect. Outage ? Tweet Search Call Web AccessRSS News Publish user’s impressions on various data sources 4
  • 5. Providing stable and high quality service is a critical issue for mobile network service providers Network failures … – L service outage – L degrade user satisfaction Introduction Alert Users L Error ? LI can’t connect. Outage ? Tweet Search Call Web AccessRSS News Publish user’s impressions on various data sources 5 User activity-based failure detection and forecasting
  • 6. • Monitoring network traffic and logs [Gill+2011, Brutlag2000] Failure Detection and Forecasting Failure Detection 6
  • 7. • Monitoring network traffic and logs [Gill+2011, Brutlag2000] • Using tweet data – Users detect outages before providers does it [Qiu+2010] – A keyword filter and SVM [Takeshita+2015] Failure Detection and Forecasting Failure Detection 7
  • 8. • Monitoring network traffic and logs [Gill+2011, Brutlag2000] • Using tweet data – Users detect outages before providers does it [Qiu+2010] – A keyword filter and SVM [Takeshita+2015] - Influenza, airport threats, and civil unrest, etc. - Represent different aspects of the user activity Failure Detection and Forecasting Failure Detection + 8 Event Detection based on multiple data sets
  • 9. • Monitoring network traffic and logs [Gill+2011, Brutlag2000] • Using tweet data – Users detect outages before providers does it [Qiu+2010] – A keyword filter and SVM [Takeshita+2015] - Influenza, airport threats, and civil unrest, etc. - Represent different aspects of the user activity Failure Detection and Forecasting Focus network failure detection and forecasting using multiple user activity data sets Failure Detection + Event Detection based on multiple data sets 9
  • 10. • Number of observations in each data – Social (Tweets) and Telecom (Web Access Logs and Search Queries) data sets User Activity Data Sets 10
  • 11. • Number of observations in each data – Social (Tweets) and Telecom (Web Access Logs and Search Queries) data sets User Activity Data Sets Zoom Change ? 11
  • 12. • Framework for failure detection and forecasting from multiple user activity data sets – Feature construction methods and model ensemble method • Extensive experiments using real-world multiple data sets Research Contributions 12
  • 13. • To estimate a prediction model that predict a failure event from feature vectors – J Simple binary classification problem • Two variables –𝒙" = (𝑥&,",, … , 𝑥),",) ∈ ℝ) is a feature vector of user activity data on the timestamp 𝑡 ∈ 1, … , 𝑇 –𝑦" ∈ 1, −1 is a label • whether an event occurs at timestamp 𝑡 or not Problem Formulation 13
  • 14. 𝒙" : Three data sets Data Sets and Failure Events 14
  • 15. 𝒙" : Three data sets Data Sets and Failure Events 𝒚" : Nine failure events 15
  • 16. 𝒙" : Three data sets Data Sets and Failure Events Five training and test data sets 𝒚" : Nine failure events + 16
  • 17. 𝒙" : Three data sets Data Sets and Failure Events Five training and test data sets 𝒚" : Nine failure events Special characteristics L Imbalanced labels (see Ratio) L Very sparse (see Sparsity) + 17
  • 18. 𝒙" : Three data sets Data Sets and Failure Events Five training and test data sets 𝒚" : Nine failure events Special characteristics L Imbalanced labels (see Ratio) ⇒ Bi-normal Separator* L Very sparse (see Sparsity) ⇒ Simple Moving Average* [*] See our paper + 18
  • 19. • (1) Entire period (EP) detection : Entire periods of Test Three Key Tasks for Failure Detection Training : 210 days Test : 15 days :Failure Event start end (1) EP 19
  • 20. duration 60 mins • (1) Entire period (EP) detection : Entire periods of Test • (2) Early detection (ED) : Only 60 mins after a failure event and the rest interval Three Key Tasks for Failure Detection Training : 210 days Test : 15 days :Failure Event (2) Early Detection (60 min) start end (1) EP 20
  • 21. • (1) Entire period (EP) detection : Entire periods of Test • (2) Early detection (ED) : Only 60 mins after a failure event and the rest interval • (3) Forecast : failures in α minutes Three Key Tasks for Failure Detection duration 60 mins Training : 210 days Test : 15 days :Failure Event (2) Early Detection (60 min) (3) Forecast start end (1) EP t = 1 t = T duration α min t =T +1 t =T +T’ 21
  • 22. • Experiment 1 (Two tasks : EP and ED) – Comparison of multiple classification and anomaly detection models in respect to AUC – LR(Logistic Regression) / ADA (Adaboost of decision stamps) / RF (Random Forest) / NN (Neural Network) / OCS (One Class SVM) / AE (Auto Encoder) • Experiment 2 (Two tasks : EP and ED) – Effective approach to combine multiple models and data sets • Experiment 3 (One task : Forecast) – Forecasting performance Experiment Outline 22
  • 23. Single data set is not sufficient • Experiment 1 – Tasks : EP and ED - Comparison methods : LR(Logistic Regression) / ADA (Adaboost of decision stamps) / RF (Random Forest) / NN (Neural Network) / OCS (One Class SVM) / AE (Auto Encoder) Results : Optimal combinations of models and data are different 23
  • 24. Single data set is not sufficient • Experiment 1 – Tasks : EP and ED - Comparison methods : LR(Logistic Regression) / ADA (Adaboost of decision stamps) / RF (Random Forest) / NN (Neural Network) / OCS (One Class SVM) / AE (Auto Encoder) Results : Optimal combinations of models and data are different 24
  • 25. Tweet data detects failures early • Experiment 1 – Tasks : EP and ED Assumption : Post tweets, then access web pages 25 …
  • 26. Tweet data detects failures early • Experiment 1 – Tasks : EP and ED Assumption : Post tweets, then access web pages 26 …
  • 27. Tweet data detects failures early • Experiment 1 – Tasks : EP and ED Assumption : Post tweets, then access web pages 27 …
  • 28. • A model ensemble approach to effective combinations of data sets and models Combine multiple user activity data sets LR 𝒑" LR RF AE AE {(𝒙", 𝑦")}"7&:9 {(𝒙", 𝑦")}"7&:9 {𝒙"}"7&:9 : {(𝒑", 𝑦")}"7&:9 ^ 𝑓(;) 𝚳 𝟏 𝚳 𝟐 𝚳 𝟑 𝚳 𝟑 𝑫 𝟏 𝑫 𝟐 𝑫 𝟑 Level 1 Level 2 𝒑" : Prediction scores of failure events of model 𝚳𝒊 and data sets 𝑫𝒋 28
  • 29. • Experiment 2 – Tasks : EP and 60min – Model : ME – Comparison method : Data Ensemble (DE) • Results Model ensemble method achieved best scores LR 𝒑" 29 concatenation
  • 31. ME suppress cases of false negatives 31
  • 32. Effective combination of Tweets and Web access logs Rapid rise Delayed rise 32
  • 33. • Experiment 3 – Task:Forecast – Model : ME – Comparison methods : DE and LR on each dataset User activity can forecast future failures 33
  • 34. • Experiment 3 – Task:Forecast – Model : ME – Comparison methods : DE and LR on each dataset • Results : ME showed best performance User activity can forecast future failures 34
  • 35. • Summary – Proposed a multiple user activity-based failure detection and forecasting framework – Demonstrated that our proposed methods improved AUC scores through extensive experiments • Future Works – Utilizing additional data sets – Deep neural network (e.g., LSTM model) – Feature Analysis – Deployment application to real-time monitoring system Conclusions Thank you ! 35
  • 36. Note
  • 37. • Framework for failure detection and forecasting • Extensive experiments using real-world multiple data sets Research Contributions Overview of our framework Users Service Provider Failure Event 𝒙 𝒕 : Multiple User Activity Data Error ? I can’t connect. Tweet Search Call Web Access 𝑦" : Failure Reports Detection and Forecasting ! Social Data Telecom Data Alert Outage ? RSS News Reporting 37
  • 38. • First method : Simple Moving Average (SMA) • Second method : BNS (Bi-Normal Separator [Forman2003]) Method : Feature Construction L Sparseness of user activities à Aggregate each feature observed from the past time (t − S) to an average value in a time t L Imbalanced labels → Feature scaling for imbalanced dataset 38 𝐹E& ; : inverse cumulative normal distribution tpr : True positive rate fpr : False positive rate 𝑡𝑓𝑏𝑛𝑠 𝑥I," = 𝑡𝑓 𝑥I," ×𝑏𝑛𝑠 𝑑I 𝑡𝑓 𝑥I," = 𝑥I," ∑ 𝑥I," ) I7& 𝑏𝑛𝑠 𝑑I = 𝐹E& 𝑡𝑝𝑟 − 𝐹E& 𝑓𝑝𝑟
  • 39. • Experimental Setup – Evaluation Set : EP – Model : Logistic regression with ridge regularization – Comparison methods • td(original features)/ tf-idf ([Salton+1986]) / tf-bns / +sma (simple moving average) • Results : J Improved AUC by Average 31%. – SMA for time series features and BNS Scaling for imbalanced dataset are effectiveness. Experiment :Effects of Feature Construction * The values are Average and standard deviation of AUC. + Average 31% 39