SlideShare ist ein Scribd-Unternehmen logo
1 von 24
We make ICT strategies work
Prof. Dr.-Ing. Thomas Bauschert , Dr. Mathias Schweigel, Oleksandr Kryvoshapka
Technische Universität Chemnitz, Detecon International GmbH
Feb 2016
A Framework for Telecommunication
Traffic Demand Forecasting
Motivation & Goals
©Detecon
– 2 –
 Extend the forecasting abilities of Detecon
NetWork products
 Create a theoretical and practical basis for
further developments
 Provide implementation of the forecasting
methods
 Learn the basics of forecasting process
 Classify the existing forecasting methods
 Implement some forecasting methods (if
necessary)
 Identify restrictions and conditions of
forecasting for mobile networks
 Perform a research with real operator’s data
Tasks Goals
©Detecon
– 3 –
AuthorQuote
Introduction
"Prediction is very difficult,
especially if it's about the
future.“
“A planning tool that helps management in its attempts to
cope with the uncertainty of the future, relying mainly on
data from the past and present and analysis of trends.”
Niels Bohr (1885-1962)
Content
1. Basics of forecasting
2. Forecasting methods
3. Applications constrains
4. Evaluation of results
5. Summary & Outlook
Overview:
©Detecon
– 4 –
Basics
©Detecon
– 5 –
 Timing
 Frequency
 Duration
 Short-term (<1 year)
 Medium-term (1-2 years)
 Long-term (>2 years)
 Operation management
 Marketing
 Finance and Risk
management
 Economics
 Industrial process control
 Demography
PropertiesAreas Forecast horizon
Forecasting introduction
Basics
©Detecon
– 6 –
ComponentsTime-series
Time-series data
 Usually historical data
consist of a sequence of
observations over time.
 It is called a time-series
data: set of observations x,
recorded at a specific time t.
 There are two types of such
series:
 Discrete time-series
 Continuous time-series
Basics
©Detecon
– 7 –
ComponentsTime-series
Time-series components
 Time-series can contain next
components or patterns:
 Trend (T)
 Seasonal variations (S)
 Cyclical variation (C)
 Irregular fluctuations (I)
 Can be combined in different
ways
 Not all components are
present in the time-series
simultaneously
Proposed Classification
Forecasting methods
Forecasting methods classification
©Detecon
– 8 –
* - implemented
©Detecon
– 9 –
Pros & ConsMethod overview
Periodic DES and Periodic LinReg
 Most of the methods are
able to handle only trend
component of the time series
data, but not seasonal (e.g.
DES, LinReg).
 Modification –
deseasonalization:
1. Split the data into
unseasonal slices
2. Perform forecasting for
each slice separately
3. Combine the results of
forecasting in the correct
order
 Forecasting equations
remains unchanged
 Only data pre- and post-
processing is changed
Methods overview
Application constrains
Aggregation levels
©Detecon
– 10 –
Data scale Forecast horizon (forecast next…)
Hourly
Daily
Weekly
Monthly
Quarterly
Yearly
Hour
Day
Week
Month
Quarter
Year
w3
©Detecon
– 11 –
DescriptionIllustration
Data problem
 Practical forecasting task is
different from theoretical
 Data corruption problems:
 Missing data
 Outliers
 There are two parameters of
data corruption:
 Position
 Size
 Different recovery
techniques proposed:
 Naïve (Copy)
 Averaging
 Interpolation
 “Small” forecast
Application constrains
©Detecon
– 12 –
DescriptionIllustration
Test methodology
 MODEL part used as an
input.
 Period count starts from the
earliest data.
 FORECAST START line
separates present from
future
 FORECAST part used as a
reference for output.
 FH specifies the range of
forecast (e.g. days, weeks)
 MSE value is used as an
accuracy measurement
 Implementation -
Python+Excel
Evaluation of results
FORECAST START
MODEL FORECAST
1 2 3 4 5 6 7
Forecast Horizon (FH)
©Detecon
– 13 –
List of the tests
Performed tests
Evaluation of results
№ Objective Methods
1 Influence of different amount of input data
TES, Periodic DES, Periodic
LinReg
2 Influence of different FH size
3
1 Influence of the position of missing data (recovered)
2 Influence of different recovery techniques
4
1 Hourly data based forecast (FH -168 values)
2 Daily data based forecast (FH - 7 values)
TES, Periodic DES, Periodic
LinReg, DES, LinReg
3 Weekly data based forecast (FH - 1 values) DES, LinReg
©Detecon
– 14 –
ParametersData
TEST 1: Different amount of input periods used
 Input data
 Input - 48-168 hourly
measurements
 Output - 48 hourly values
 Periodicity
 d1 – 24
 Tested methods
 TES
 Periodic DES
 Periodic LinReg
Evaluation of results
©Detecon
– 15 –
ResultsOutput
TEST 1: Different amount of input periods used
 MSEforecast compared.
 An overall slowdown of the
MSEforecast values with the
increment of input data amount.
 Most dramatic decrease –
Periodic DES
 Smallest MSEforecast value –
TES
 7 periods of input data will be
used for the next test scenario.
Evaluation of results
©Detecon
– 16 –
ParametersData
TEST 3.1: Missing data position
 Input data
 Input - 168 hourly
measurements
 Positions of the gap:
– 1st period
– 4th period
– 7th period
 Gap recovered as average of
full input
 Output - 48 hourly values
 Periodicity
 d1 – 24
 Tested methods
 TES
 Periodic DES
 Periodic LinReg
Evaluation of results
©Detecon
– 17 –
ResultsOutput
TEST 3.1: Missing data position
 MSEforecast compared.
 The position of the gap does
matter.
 All three forecasting methods
reacts on the recovered gap with
the growth of MSEforecast value.
 Gap in the last period is the most
corrupting for the MSEforecast
values.
 Gap in the middle is the most
safe for TES and Periodic DES
 Gap in the beginning safe for
Periodic LinReg.
Evaluation of results
©Detecon
– 18 –
ParametersData
TEST 4.1: Hourly forecast
 Input data
 Input - 336 hourly
measurements
 Output - 168 hourly values
 Periodicity
 d1 – 24
 w1 - 168
 Tested methods
 TES
 Periodic DES
 Periodic LinReg
Evaluation of results
©Detecon
– 19 –
ResultsOutput
TEST 4.1: Hourly forecast
 MSEforecast and evaluation
speed was compared.
 TES performing worse in w1 than
in d1
 Periodic DES was improved in
case w1
 Periodic LinReg almost does not
change. The fastest method
(x1000)
Evaluation of results
©Detecon
– 20 –
ParametersData
TEST 4.2: Daily forecast
 Input data
 Input - 14 daily
measurements
 Output - 7 daily values
 Periodicity
 w2 - 7
 Tested methods
 TES
 Periodic DES
 Periodic LinReg
 DES
 LinReg
Evaluation of results
©Detecon
– 21 –
ResultsOutput
TEST 4.2: Daily forecast
 MSEforecast and evaluation
speed was compared.
 Cases d1 and w1 are just
aggregated to the busy-hour-per-
day scale results of previous test
 MSEforecast value for TES and
Periodic DES is noticeably
reduced.
 LinReg – the smallest
MSEforecast value. The fastest
method.
Evaluation of results
©Detecon
– 22 –
DecisionsTests
TESTs Results
 Conclusion out of all test are
relevant only for used test data
set.
 TES – the most universal method
 Periodic LinReg – the fastest
method
 Combination of the methods can
increase the chances for making
the correct decision.
Evaluation of results
TEST 1 More data - better
the forecast
Test leader: TES
TEST 2
Forecast reliability
decreases with FH
duration
Test leader: TES
TEST 3
Edges of the data
is the most
influencing for
forecasting
accuracy
Test leader:
TES +
Recover by the
neighboring
average
TEST 4
If enough data
available, the same
aggragation level
as FH should be
used.
Test leader:
Periodic LinReg
Summary & Outlook
Summary & Outlook
©Detecon
– 23 –
The work can be extended in a several directions:
 Integration of the methods into the NetWorks Forecast
 New forecasting methods investigation
 Automatic constrains (data problem) solving algorithms
 Additional test scenarios to generalize the knowledge
 The basics of forecasting process were studied
 The criteria for classifying forecasting methods were
illustrated.
 New classification were proposed, focusing on
quantitive methods grouped by internal used statistical
mechanism.
 The modifications of two existing methods were
proposed: Periodic DES and Periodic LinReg.
 Four important factors for selecting appropriate
forecasting methods were highlighted.
 A guideline for data recovery after missing data and
outliers was proposed.
 Different constrains related to the telecom industry
(and not only) were mentioned: different aggregation
levels, data problems and industry requirements.
 Several test scenarios were created, to check the
performance of three implemented forecasting (TES,
Periodic DES, and Periodic LinReg) and compare
them.
Thesis summary Outlook
Questions?

Weitere ähnliche Inhalte

Was ist angesagt?

ESTIMATING HANDLING TIME OF SOFTWARE DEFECTS
ESTIMATING HANDLING TIME OF SOFTWARE DEFECTSESTIMATING HANDLING TIME OF SOFTWARE DEFECTS
ESTIMATING HANDLING TIME OF SOFTWARE DEFECTScsandit
 
Guidelines to Understanding Design of Experiment and Reliability Prediction
Guidelines to Understanding Design of Experiment and Reliability PredictionGuidelines to Understanding Design of Experiment and Reliability Prediction
Guidelines to Understanding Design of Experiment and Reliability Predictionijsrd.com
 
An Effective Strategy of Firewall Based Matching Algorithm
An Effective Strategy of Firewall Based Matching AlgorithmAn Effective Strategy of Firewall Based Matching Algorithm
An Effective Strategy of Firewall Based Matching AlgorithmIJMER
 
Parameter tuning or default values
Parameter tuning or default valuesParameter tuning or default values
Parameter tuning or default valuesVivek Nair
 
IRJET- Software Bug Prediction using Machine Learning Approach
IRJET- Software Bug Prediction using Machine Learning ApproachIRJET- Software Bug Prediction using Machine Learning Approach
IRJET- Software Bug Prediction using Machine Learning ApproachIRJET Journal
 
The End User Requirement for Project Management Software Accuracy
The End User Requirement for Project Management Software Accuracy The End User Requirement for Project Management Software Accuracy
The End User Requirement for Project Management Software Accuracy IJECEIAES
 
Benchmarking machine learning techniques
Benchmarking machine learning techniquesBenchmarking machine learning techniques
Benchmarking machine learning techniquesijseajournal
 
Applicability of Hooke’s and Jeeves Direct Search Solution Method to Metal c...
 Applicability of Hooke’s and Jeeves Direct Search Solution Method to Metal c... Applicability of Hooke’s and Jeeves Direct Search Solution Method to Metal c...
Applicability of Hooke’s and Jeeves Direct Search Solution Method to Metal c...ijiert bestjournal
 
Computational optimization, modelling and simulation: Recent advances and ove...
Computational optimization, modelling and simulation: Recent advances and ove...Computational optimization, modelling and simulation: Recent advances and ove...
Computational optimization, modelling and simulation: Recent advances and ove...Xin-She Yang
 
One–day wave forecasts based on artificial neural networks
One–day wave forecasts based on artificial neural networksOne–day wave forecasts based on artificial neural networks
One–day wave forecasts based on artificial neural networksJonathan D'Cruz
 
Application of the analytic hierarchy process (AHP) for selection of forecast...
Application of the analytic hierarchy process (AHP) for selection of forecast...Application of the analytic hierarchy process (AHP) for selection of forecast...
Application of the analytic hierarchy process (AHP) for selection of forecast...Gurdal Ertek
 
IRJET - Neural Network based Leaf Disease Detection and Remedy Recommenda...
IRJET -  	  Neural Network based Leaf Disease Detection and Remedy Recommenda...IRJET -  	  Neural Network based Leaf Disease Detection and Remedy Recommenda...
IRJET - Neural Network based Leaf Disease Detection and Remedy Recommenda...IRJET Journal
 
PM Estimation techniques
PM Estimation techniquesPM Estimation techniques
PM Estimation techniquesChander Parkash
 

Was ist angesagt? (18)

ESTIMATING HANDLING TIME OF SOFTWARE DEFECTS
ESTIMATING HANDLING TIME OF SOFTWARE DEFECTSESTIMATING HANDLING TIME OF SOFTWARE DEFECTS
ESTIMATING HANDLING TIME OF SOFTWARE DEFECTS
 
Guidelines to Understanding Design of Experiment and Reliability Prediction
Guidelines to Understanding Design of Experiment and Reliability PredictionGuidelines to Understanding Design of Experiment and Reliability Prediction
Guidelines to Understanding Design of Experiment and Reliability Prediction
 
Bachelor Thesis
Bachelor ThesisBachelor Thesis
Bachelor Thesis
 
An Effective Strategy of Firewall Based Matching Algorithm
An Effective Strategy of Firewall Based Matching AlgorithmAn Effective Strategy of Firewall Based Matching Algorithm
An Effective Strategy of Firewall Based Matching Algorithm
 
Parameter tuning or default values
Parameter tuning or default valuesParameter tuning or default values
Parameter tuning or default values
 
IRJET- Software Bug Prediction using Machine Learning Approach
IRJET- Software Bug Prediction using Machine Learning ApproachIRJET- Software Bug Prediction using Machine Learning Approach
IRJET- Software Bug Prediction using Machine Learning Approach
 
The End User Requirement for Project Management Software Accuracy
The End User Requirement for Project Management Software Accuracy The End User Requirement for Project Management Software Accuracy
The End User Requirement for Project Management Software Accuracy
 
Benchmarking machine learning techniques
Benchmarking machine learning techniquesBenchmarking machine learning techniques
Benchmarking machine learning techniques
 
Applicability of Hooke’s and Jeeves Direct Search Solution Method to Metal c...
 Applicability of Hooke’s and Jeeves Direct Search Solution Method to Metal c... Applicability of Hooke’s and Jeeves Direct Search Solution Method to Metal c...
Applicability of Hooke’s and Jeeves Direct Search Solution Method to Metal c...
 
Computational optimization, modelling and simulation: Recent advances and ove...
Computational optimization, modelling and simulation: Recent advances and ove...Computational optimization, modelling and simulation: Recent advances and ove...
Computational optimization, modelling and simulation: Recent advances and ove...
 
Dimensional analysis
Dimensional analysisDimensional analysis
Dimensional analysis
 
Program 2013
Program 2013Program 2013
Program 2013
 
One–day wave forecasts based on artificial neural networks
One–day wave forecasts based on artificial neural networksOne–day wave forecasts based on artificial neural networks
One–day wave forecasts based on artificial neural networks
 
Application of the analytic hierarchy process (AHP) for selection of forecast...
Application of the analytic hierarchy process (AHP) for selection of forecast...Application of the analytic hierarchy process (AHP) for selection of forecast...
Application of the analytic hierarchy process (AHP) for selection of forecast...
 
Rsc 04
Rsc 04Rsc 04
Rsc 04
 
IRJET - Neural Network based Leaf Disease Detection and Remedy Recommenda...
IRJET -  	  Neural Network based Leaf Disease Detection and Remedy Recommenda...IRJET -  	  Neural Network based Leaf Disease Detection and Remedy Recommenda...
IRJET - Neural Network based Leaf Disease Detection and Remedy Recommenda...
 
Shewhart
ShewhartShewhart
Shewhart
 
PM Estimation techniques
PM Estimation techniquesPM Estimation techniques
PM Estimation techniques
 

Andere mochten auch

Designing the grid of the future by Stefan Kuppers
Designing the grid of the future by Stefan KuppersDesigning the grid of the future by Stefan Kuppers
Designing the grid of the future by Stefan KuppersMaRS Discovery District
 
Description_ROTTWEILER.docx
Description_ROTTWEILER.docxDescription_ROTTWEILER.docx
Description_ROTTWEILER.docxLipika Kanojia
 
Internship letter
Internship letterInternship letter
Internship letterAhsan Nawaz
 
Future work@detecon teaser 2016 incl. references
Future work@detecon teaser 2016 incl. referencesFuture work@detecon teaser 2016 incl. references
Future work@detecon teaser 2016 incl. referencesMarc Wagner
 
High Scalability by Example – How can Web-Architecture scale like Facebook, T...
High Scalability by Example – How can Web-Architecture scale like Facebook, T...High Scalability by Example – How can Web-Architecture scale like Facebook, T...
High Scalability by Example – How can Web-Architecture scale like Facebook, T...Robert Mederer
 

Andere mochten auch (10)

Work Experience
Work ExperienceWork Experience
Work Experience
 
Poster FR open day
Poster FR open dayPoster FR open day
Poster FR open day
 
Designing the grid of the future by Stefan Kuppers
Designing the grid of the future by Stefan KuppersDesigning the grid of the future by Stefan Kuppers
Designing the grid of the future by Stefan Kuppers
 
Description_ROTTWEILER.docx
Description_ROTTWEILER.docxDescription_ROTTWEILER.docx
Description_ROTTWEILER.docx
 
PROJECT_ Kryvoshapka_ FINAL RELEASE
PROJECT_ Kryvoshapka_ FINAL RELEASEPROJECT_ Kryvoshapka_ FINAL RELEASE
PROJECT_ Kryvoshapka_ FINAL RELEASE
 
letter-of-recommendation
letter-of-recommendationletter-of-recommendation
letter-of-recommendation
 
Internship letter
Internship letterInternship letter
Internship letter
 
Lebenslauf
LebenslaufLebenslauf
Lebenslauf
 
Future work@detecon teaser 2016 incl. references
Future work@detecon teaser 2016 incl. referencesFuture work@detecon teaser 2016 incl. references
Future work@detecon teaser 2016 incl. references
 
High Scalability by Example – How can Web-Architecture scale like Facebook, T...
High Scalability by Example – How can Web-Architecture scale like Facebook, T...High Scalability by Example – How can Web-Architecture scale like Facebook, T...
High Scalability by Example – How can Web-Architecture scale like Facebook, T...
 

Ähnlich wie Presentation_THESIS_Kryvoshapka_v1.4

3122999029.pptx
3122999029.pptx3122999029.pptx
3122999029.pptxMyoZawOo10
 
IRJET - Intelligent Weather Forecasting using Machine Learning Techniques
IRJET -  	  Intelligent Weather Forecasting using Machine Learning TechniquesIRJET -  	  Intelligent Weather Forecasting using Machine Learning Techniques
IRJET - Intelligent Weather Forecasting using Machine Learning TechniquesIRJET Journal
 
Service Management: Forecasting Hydrogen Demand
Service Management: Forecasting Hydrogen DemandService Management: Forecasting Hydrogen Demand
Service Management: Forecasting Hydrogen Demandirrosennen
 
IRJET- Overview of Forecasting Techniques
IRJET- Overview of Forecasting TechniquesIRJET- Overview of Forecasting Techniques
IRJET- Overview of Forecasting TechniquesIRJET Journal
 
Improving the cosmic approximate sizing using the fuzzy logic epcu model al...
Improving the cosmic approximate sizing using the fuzzy logic epcu model   al...Improving the cosmic approximate sizing using the fuzzy logic epcu model   al...
Improving the cosmic approximate sizing using the fuzzy logic epcu model al...IWSM Mensura
 
Forecasting cost and schedule performance
Forecasting cost and schedule performanceForecasting cost and schedule performance
Forecasting cost and schedule performanceGlen Alleman
 
IRJET- Error Reduction in Data Prediction using Least Square Regression Method
IRJET- Error Reduction in Data Prediction using Least Square Regression MethodIRJET- Error Reduction in Data Prediction using Least Square Regression Method
IRJET- Error Reduction in Data Prediction using Least Square Regression MethodIRJET Journal
 
High dimensionality reduction on graphical data
High dimensionality reduction on graphical dataHigh dimensionality reduction on graphical data
High dimensionality reduction on graphical dataeSAT Journals
 
Neural networks for the prediction and forecasting of water resources variables
Neural networks for the prediction and forecasting of water resources variablesNeural networks for the prediction and forecasting of water resources variables
Neural networks for the prediction and forecasting of water resources variablesJonathan D'Cruz
 
EMPIRICAL APPLICATION OF SIMULATED ANNEALING USING OBJECT-ORIENTED METRICS TO...
EMPIRICAL APPLICATION OF SIMULATED ANNEALING USING OBJECT-ORIENTED METRICS TO...EMPIRICAL APPLICATION OF SIMULATED ANNEALING USING OBJECT-ORIENTED METRICS TO...
EMPIRICAL APPLICATION OF SIMULATED ANNEALING USING OBJECT-ORIENTED METRICS TO...ijcsa
 
Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013Pedro Lopes
 
Wind meteodyn WT cfd micro scale modeling combined statistical learning for s...
Wind meteodyn WT cfd micro scale modeling combined statistical learning for s...Wind meteodyn WT cfd micro scale modeling combined statistical learning for s...
Wind meteodyn WT cfd micro scale modeling combined statistical learning for s...Jean-Claude Meteodyn
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecastsinside-BigData.com
 
Atmospheric Pollutant Concentration Prediction Based on KPCA BP
Atmospheric Pollutant Concentration Prediction Based on KPCA BPAtmospheric Pollutant Concentration Prediction Based on KPCA BP
Atmospheric Pollutant Concentration Prediction Based on KPCA BPijtsrd
 
Parkinson Disease Detection Using XGBoost and SVM
Parkinson Disease Detection Using XGBoost and SVMParkinson Disease Detection Using XGBoost and SVM
Parkinson Disease Detection Using XGBoost and SVMIRJET Journal
 
Automatically Estimating Software Effort and Cost using Computing Intelligenc...
Automatically Estimating Software Effort and Cost using Computing Intelligenc...Automatically Estimating Software Effort and Cost using Computing Intelligenc...
Automatically Estimating Software Effort and Cost using Computing Intelligenc...cscpconf
 
A Hierarchical Feature Set optimization for effective code change based Defec...
A Hierarchical Feature Set optimization for effective code change based Defec...A Hierarchical Feature Set optimization for effective code change based Defec...
A Hierarchical Feature Set optimization for effective code change based Defec...IOSR Journals
 

Ähnlich wie Presentation_THESIS_Kryvoshapka_v1.4 (20)

3122999029.pptx
3122999029.pptx3122999029.pptx
3122999029.pptx
 
IRJET - Intelligent Weather Forecasting using Machine Learning Techniques
IRJET -  	  Intelligent Weather Forecasting using Machine Learning TechniquesIRJET -  	  Intelligent Weather Forecasting using Machine Learning Techniques
IRJET - Intelligent Weather Forecasting using Machine Learning Techniques
 
Service Management: Forecasting Hydrogen Demand
Service Management: Forecasting Hydrogen DemandService Management: Forecasting Hydrogen Demand
Service Management: Forecasting Hydrogen Demand
 
IRJET- Overview of Forecasting Techniques
IRJET- Overview of Forecasting TechniquesIRJET- Overview of Forecasting Techniques
IRJET- Overview of Forecasting Techniques
 
Improving the cosmic approximate sizing using the fuzzy logic epcu model al...
Improving the cosmic approximate sizing using the fuzzy logic epcu model   al...Improving the cosmic approximate sizing using the fuzzy logic epcu model   al...
Improving the cosmic approximate sizing using the fuzzy logic epcu model al...
 
Forecasting cost and schedule performance
Forecasting cost and schedule performanceForecasting cost and schedule performance
Forecasting cost and schedule performance
 
IRJET- Error Reduction in Data Prediction using Least Square Regression Method
IRJET- Error Reduction in Data Prediction using Least Square Regression MethodIRJET- Error Reduction in Data Prediction using Least Square Regression Method
IRJET- Error Reduction in Data Prediction using Least Square Regression Method
 
ONE HIDDEN LAYER ANFIS MODEL FOR OOS DEVELOPMENT EFFORT ESTIMATION
ONE HIDDEN LAYER ANFIS MODEL FOR OOS DEVELOPMENT EFFORT ESTIMATIONONE HIDDEN LAYER ANFIS MODEL FOR OOS DEVELOPMENT EFFORT ESTIMATION
ONE HIDDEN LAYER ANFIS MODEL FOR OOS DEVELOPMENT EFFORT ESTIMATION
 
High dimensionality reduction on graphical data
High dimensionality reduction on graphical dataHigh dimensionality reduction on graphical data
High dimensionality reduction on graphical data
 
A1802050102
A1802050102A1802050102
A1802050102
 
Neural networks for the prediction and forecasting of water resources variables
Neural networks for the prediction and forecasting of water resources variablesNeural networks for the prediction and forecasting of water resources variables
Neural networks for the prediction and forecasting of water resources variables
 
EMPIRICAL APPLICATION OF SIMULATED ANNEALING USING OBJECT-ORIENTED METRICS TO...
EMPIRICAL APPLICATION OF SIMULATED ANNEALING USING OBJECT-ORIENTED METRICS TO...EMPIRICAL APPLICATION OF SIMULATED ANNEALING USING OBJECT-ORIENTED METRICS TO...
EMPIRICAL APPLICATION OF SIMULATED ANNEALING USING OBJECT-ORIENTED METRICS TO...
 
Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013Poster_Reseau_Neurones_Journees_2013
Poster_Reseau_Neurones_Journees_2013
 
Wind meteodyn WT cfd micro scale modeling combined statistical learning for s...
Wind meteodyn WT cfd micro scale modeling combined statistical learning for s...Wind meteodyn WT cfd micro scale modeling combined statistical learning for s...
Wind meteodyn WT cfd micro scale modeling combined statistical learning for s...
 
Machine Learning for Weather Forecasts
Machine Learning for Weather ForecastsMachine Learning for Weather Forecasts
Machine Learning for Weather Forecasts
 
Atmospheric Pollutant Concentration Prediction Based on KPCA BP
Atmospheric Pollutant Concentration Prediction Based on KPCA BPAtmospheric Pollutant Concentration Prediction Based on KPCA BP
Atmospheric Pollutant Concentration Prediction Based on KPCA BP
 
Parkinson Disease Detection Using XGBoost and SVM
Parkinson Disease Detection Using XGBoost and SVMParkinson Disease Detection Using XGBoost and SVM
Parkinson Disease Detection Using XGBoost and SVM
 
Automatically Estimating Software Effort and Cost using Computing Intelligenc...
Automatically Estimating Software Effort and Cost using Computing Intelligenc...Automatically Estimating Software Effort and Cost using Computing Intelligenc...
Automatically Estimating Software Effort and Cost using Computing Intelligenc...
 
A Hierarchical Feature Set optimization for effective code change based Defec...
A Hierarchical Feature Set optimization for effective code change based Defec...A Hierarchical Feature Set optimization for effective code change based Defec...
A Hierarchical Feature Set optimization for effective code change based Defec...
 
Time series
Time seriesTime series
Time series
 

Presentation_THESIS_Kryvoshapka_v1.4

  • 1. We make ICT strategies work Prof. Dr.-Ing. Thomas Bauschert , Dr. Mathias Schweigel, Oleksandr Kryvoshapka Technische Universität Chemnitz, Detecon International GmbH Feb 2016 A Framework for Telecommunication Traffic Demand Forecasting
  • 2. Motivation & Goals ©Detecon – 2 –  Extend the forecasting abilities of Detecon NetWork products  Create a theoretical and practical basis for further developments  Provide implementation of the forecasting methods  Learn the basics of forecasting process  Classify the existing forecasting methods  Implement some forecasting methods (if necessary)  Identify restrictions and conditions of forecasting for mobile networks  Perform a research with real operator’s data Tasks Goals
  • 3. ©Detecon – 3 – AuthorQuote Introduction "Prediction is very difficult, especially if it's about the future.“ “A planning tool that helps management in its attempts to cope with the uncertainty of the future, relying mainly on data from the past and present and analysis of trends.” Niels Bohr (1885-1962)
  • 4. Content 1. Basics of forecasting 2. Forecasting methods 3. Applications constrains 4. Evaluation of results 5. Summary & Outlook Overview: ©Detecon – 4 –
  • 5. Basics ©Detecon – 5 –  Timing  Frequency  Duration  Short-term (<1 year)  Medium-term (1-2 years)  Long-term (>2 years)  Operation management  Marketing  Finance and Risk management  Economics  Industrial process control  Demography PropertiesAreas Forecast horizon Forecasting introduction
  • 6. Basics ©Detecon – 6 – ComponentsTime-series Time-series data  Usually historical data consist of a sequence of observations over time.  It is called a time-series data: set of observations x, recorded at a specific time t.  There are two types of such series:  Discrete time-series  Continuous time-series
  • 7. Basics ©Detecon – 7 – ComponentsTime-series Time-series components  Time-series can contain next components or patterns:  Trend (T)  Seasonal variations (S)  Cyclical variation (C)  Irregular fluctuations (I)  Can be combined in different ways  Not all components are present in the time-series simultaneously
  • 8. Proposed Classification Forecasting methods Forecasting methods classification ©Detecon – 8 – * - implemented
  • 9. ©Detecon – 9 – Pros & ConsMethod overview Periodic DES and Periodic LinReg  Most of the methods are able to handle only trend component of the time series data, but not seasonal (e.g. DES, LinReg).  Modification – deseasonalization: 1. Split the data into unseasonal slices 2. Perform forecasting for each slice separately 3. Combine the results of forecasting in the correct order  Forecasting equations remains unchanged  Only data pre- and post- processing is changed Methods overview
  • 10. Application constrains Aggregation levels ©Detecon – 10 – Data scale Forecast horizon (forecast next…) Hourly Daily Weekly Monthly Quarterly Yearly Hour Day Week Month Quarter Year w3
  • 11. ©Detecon – 11 – DescriptionIllustration Data problem  Practical forecasting task is different from theoretical  Data corruption problems:  Missing data  Outliers  There are two parameters of data corruption:  Position  Size  Different recovery techniques proposed:  Naïve (Copy)  Averaging  Interpolation  “Small” forecast Application constrains
  • 12. ©Detecon – 12 – DescriptionIllustration Test methodology  MODEL part used as an input.  Period count starts from the earliest data.  FORECAST START line separates present from future  FORECAST part used as a reference for output.  FH specifies the range of forecast (e.g. days, weeks)  MSE value is used as an accuracy measurement  Implementation - Python+Excel Evaluation of results FORECAST START MODEL FORECAST 1 2 3 4 5 6 7 Forecast Horizon (FH)
  • 13. ©Detecon – 13 – List of the tests Performed tests Evaluation of results № Objective Methods 1 Influence of different amount of input data TES, Periodic DES, Periodic LinReg 2 Influence of different FH size 3 1 Influence of the position of missing data (recovered) 2 Influence of different recovery techniques 4 1 Hourly data based forecast (FH -168 values) 2 Daily data based forecast (FH - 7 values) TES, Periodic DES, Periodic LinReg, DES, LinReg 3 Weekly data based forecast (FH - 1 values) DES, LinReg
  • 14. ©Detecon – 14 – ParametersData TEST 1: Different amount of input periods used  Input data  Input - 48-168 hourly measurements  Output - 48 hourly values  Periodicity  d1 – 24  Tested methods  TES  Periodic DES  Periodic LinReg Evaluation of results
  • 15. ©Detecon – 15 – ResultsOutput TEST 1: Different amount of input periods used  MSEforecast compared.  An overall slowdown of the MSEforecast values with the increment of input data amount.  Most dramatic decrease – Periodic DES  Smallest MSEforecast value – TES  7 periods of input data will be used for the next test scenario. Evaluation of results
  • 16. ©Detecon – 16 – ParametersData TEST 3.1: Missing data position  Input data  Input - 168 hourly measurements  Positions of the gap: – 1st period – 4th period – 7th period  Gap recovered as average of full input  Output - 48 hourly values  Periodicity  d1 – 24  Tested methods  TES  Periodic DES  Periodic LinReg Evaluation of results
  • 17. ©Detecon – 17 – ResultsOutput TEST 3.1: Missing data position  MSEforecast compared.  The position of the gap does matter.  All three forecasting methods reacts on the recovered gap with the growth of MSEforecast value.  Gap in the last period is the most corrupting for the MSEforecast values.  Gap in the middle is the most safe for TES and Periodic DES  Gap in the beginning safe for Periodic LinReg. Evaluation of results
  • 18. ©Detecon – 18 – ParametersData TEST 4.1: Hourly forecast  Input data  Input - 336 hourly measurements  Output - 168 hourly values  Periodicity  d1 – 24  w1 - 168  Tested methods  TES  Periodic DES  Periodic LinReg Evaluation of results
  • 19. ©Detecon – 19 – ResultsOutput TEST 4.1: Hourly forecast  MSEforecast and evaluation speed was compared.  TES performing worse in w1 than in d1  Periodic DES was improved in case w1  Periodic LinReg almost does not change. The fastest method (x1000) Evaluation of results
  • 20. ©Detecon – 20 – ParametersData TEST 4.2: Daily forecast  Input data  Input - 14 daily measurements  Output - 7 daily values  Periodicity  w2 - 7  Tested methods  TES  Periodic DES  Periodic LinReg  DES  LinReg Evaluation of results
  • 21. ©Detecon – 21 – ResultsOutput TEST 4.2: Daily forecast  MSEforecast and evaluation speed was compared.  Cases d1 and w1 are just aggregated to the busy-hour-per- day scale results of previous test  MSEforecast value for TES and Periodic DES is noticeably reduced.  LinReg – the smallest MSEforecast value. The fastest method. Evaluation of results
  • 22. ©Detecon – 22 – DecisionsTests TESTs Results  Conclusion out of all test are relevant only for used test data set.  TES – the most universal method  Periodic LinReg – the fastest method  Combination of the methods can increase the chances for making the correct decision. Evaluation of results TEST 1 More data - better the forecast Test leader: TES TEST 2 Forecast reliability decreases with FH duration Test leader: TES TEST 3 Edges of the data is the most influencing for forecasting accuracy Test leader: TES + Recover by the neighboring average TEST 4 If enough data available, the same aggragation level as FH should be used. Test leader: Periodic LinReg
  • 23. Summary & Outlook Summary & Outlook ©Detecon – 23 – The work can be extended in a several directions:  Integration of the methods into the NetWorks Forecast  New forecasting methods investigation  Automatic constrains (data problem) solving algorithms  Additional test scenarios to generalize the knowledge  The basics of forecasting process were studied  The criteria for classifying forecasting methods were illustrated.  New classification were proposed, focusing on quantitive methods grouped by internal used statistical mechanism.  The modifications of two existing methods were proposed: Periodic DES and Periodic LinReg.  Four important factors for selecting appropriate forecasting methods were highlighted.  A guideline for data recovery after missing data and outliers was proposed.  Different constrains related to the telecom industry (and not only) were mentioned: different aggregation levels, data problems and industry requirements.  Several test scenarios were created, to check the performance of three implemented forecasting (TES, Periodic DES, and Periodic LinReg) and compare them. Thesis summary Outlook

Hinweis der Redaktion

  1. 1
  2. According to the business dictionary…
  3. Short- and medium-term forecasts are required for activities that range from operations management to budgeting and selecting new research and development projects. Long-term forecasts affect issues such as strategic planning. Short- and medium-term forecasting is typically based on identifying, modeling, and extrapolating the patterns found in historical data. The task of timing forecasts is to determine the time when an event will happen. Frequency forecasts are aiming to determine quantities of events that will occur at certain period. The continuance of an event is the reason for duration forecasts.
  4. Monthly sales for the souvenir shop at a beach resort town in Queensland, Australia [a-little-book-of-r-for-time-series.readthedocs.org]
  5. Periodicity is IMPORTANT!!! Trend The horizontal direction of a smoothed time-series. Trends can be long-term pattern or dynamic in relatively short-term duration. Trend reflects the underlying growth or decline in the value of the variable. This variation pattern is present at least over several successive periods. Perception of the trend depends on the length of the observed series. If a time series does not show an increasing or decreasing pattern then such time-series called “stationary”. Seasonal variations Patterns of change in a time-series within a period of no more than a year. These patterns tend to repeat themselves. It refers to short-term, relatively frequent variations, which are identified by the differences between the actual results and the trend line. In real life, this pattern can repeat hourly, daily, weekly, monthly, yearly, etc. Seasonal variations is always has a known period sometimes called periodic variations. Generally related to factors such as weather, holidays and vacations and so on. Cyclical variations The variations of a time-series over periods longer than one year. They are not having a fixed period and often related to the current economic conditions. As a rule, the length of cycles is longer than the length of a season, and the magnitude of cycles usually much higher than the magnitude of seasonal patterns. Usually cyclical variations are not present in the typical time-series. Irregular variations Unpredictable component of every time-series that makes it a random variable. Irregular variations in the data caused by unusual circumstances. In general, the duration if such variations is short. There are two types of irregular variations can be specified: episodic and residual. Episodic fluctuations can be identified by nature of emergence. The residual fluctuations (chance fluctuations) cannot be identified. Of course, neither episodic nor residual variation can be projected into the future.
  6. The advantages of Periodic DES and Periodic LinReg over normal DES and Linear Regression methods are: The individual simplicity if original methods is kept. The new periodic algorithms are able to forecast seasonal time series with (local or global) trend. The disadvantages are next: Overall complexity of the method is dependent from the periodicity L of the input data. More input periods is required to produce adequate forecast.
  7. Operators collect data usually on hourly or sub-hourly basis. To save storage space – aggregation to the higher level. To forecast next hour can be interesting but not useful. Most used cases in short term forecasting. Forecast MAX to know if we need to increase the capacity of the cell.
  8. Due to the lack of time, only some of the tests will be explained in the details
  9. The main purpose of this test is to find out the impact of different amount of input data on the accuracy of implemented forecasting methods. The preliminary consideration is that more input data used – more accurate (in terms of MSEforecast) will be the forecasted values.
  10. Accuracy measure problem: no distinguish between above or below certain value.