SlideShare a Scribd company logo
1 of 11
A Study of Data Quality and Analytics
1
Experimental work
• Predictive Modeling - Linear vs. Nonlinear
• GARCH (Generalized Autoregressive Conditional
Heteroskedasticity ) model application onTime
series data
• GARCH vs. ANN with Heteroskedasticity
• Deployment of Predictive Model using PMML
(Predictive Model Markup Language)
 Areas of focus:
▪ Predictive Analytics – VariousApplications of Predictive Models
▪ PMML
 Resources
▪ IEEEComputer Society,Transaction publications
▪ International Journal for Research andApplication
▪ International Institute of Forecasters
▪ ACM Journals /transactions
 Status
▪ Literature survey - about 85% completion
▪ Relevant publications extracted : 75+
▪ Further survey – Deployment of Model using PMML
2
 Application of R Programming for Forecasting Day-ahead
electricity demand - Internal Journal of Computer Science
Issues, Vol 9, Issue 6, no 1, Nov 2012
 Mining ofTime series data for forecasting Day and Night
variances in electricity demand - National Conference on
Business Analytics and Business Intelligence , Institute of
Public Enterprise , Jan 2013
 Forecasting of Electricity Demand using SARIMA and Feed
Forward Neural Network Models, Accepted for publication
in International Journal of Research in Computer
Application and Management
3
 Evaluate GARCH and ARIMA model for forecasting
Day ahead electricity demand
 Data - Daily Power consumption data
 DevelopTesting Procedure for GARCH using R
programming
4
Data collection, Data cleaning, Setup the environment, Evaluate Predictive Models
,Analysis
 Evaluate GARCH and SARIMA model for forecasting
day and night variances in electricity demand
 Data - Hourly Power consumption data
 GARCH forecasting has lower RMSE (Root Mean
Square Error) than that of SARIMA forecasting
5
Data collection, Data cleaning, Setup the environment, Evaluate Predictive Models ,
Analysis
 Evaluate SARIMA and Neural Networks model for
forecasting monthly electricity demand
 Data - Monthly Power consumption data
 RMSE of SARIMA fitted model is smaller than that of
NN whereas NN forecasting has smaller RMSE (Root
Mean Square Error) than that of SARIMA forecasting
6
Data collection, Data cleaning, Setup the environment, Evaluate Predictive Models
,Analysis
 Predictive methods and techniques –
▪ Linear Regression – ARMA, ARIMA, SARIMA
▪ Non-linear - Neural Networks, GARCH
 Tools
▪ R Project, IBM SPSS
 Data - Power Consumption , Stock exchange data
 PMML - Predictive Model Markup Language
 Model Deployment using PMML
7
 Evaluate the GARCH model for comparing the
share price performance of 3 companies
 Prototype Development for the Deployment of
Predictive model using PMML
8
 Autoregressive Conditional
Heteroskedasticity
 Predictive (conditional)
 Uncertainty (heteroskedasticity)
 That fluctuates over time (autoregressive)
 GENERALIZEDARCH (Bollerslev) a most
important extension
 Tomorrow’s variance is predicted to be a
weighted average of the
 Long run average variance
 Today’s variance forecast
 The news (today’s squared return)
11

More Related Content

What's hot

Sigmaplot 13 PPT
Sigmaplot 13 PPTSigmaplot 13 PPT
Sigmaplot 13 PPT
Siriyak Cr
 
Railroad Modeling at Hadoop Scale
Railroad Modeling at Hadoop ScaleRailroad Modeling at Hadoop Scale
Railroad Modeling at Hadoop Scale
DataWorks Summit
 
Building Electricity Demand Forecasting
Building Electricity Demand ForecastingBuilding Electricity Demand Forecasting
Building Electricity Demand Forecasting
Shubham Saini
 
Analysis_of_Remote_Sensing_Quantitative_Inversion_in_Cloud_Computing.ppt
Analysis_of_Remote_Sensing_Quantitative_Inversion_in_Cloud_Computing.pptAnalysis_of_Remote_Sensing_Quantitative_Inversion_in_Cloud_Computing.ppt
Analysis_of_Remote_Sensing_Quantitative_Inversion_in_Cloud_Computing.ppt
grssieee
 

What's hot (12)

Design and analysis of inexact floating point adders
Design and analysis of inexact floating point addersDesign and analysis of inexact floating point adders
Design and analysis of inexact floating point adders
 
Spark for Behavioral Analytics Research: Spark Summit East talk by John W u
Spark for Behavioral Analytics Research: Spark Summit East talk by John W uSpark for Behavioral Analytics Research: Spark Summit East talk by John W u
Spark for Behavioral Analytics Research: Spark Summit East talk by John W u
 
Poster
PosterPoster
Poster
 
AN ADABOOST-BASED FACE DETECTION SYSTEM USING PARALLEL CONFIGURABLE ARCHITECT...
AN ADABOOST-BASED FACE DETECTION SYSTEM USING PARALLEL CONFIGURABLE ARCHITECT...AN ADABOOST-BASED FACE DETECTION SYSTEM USING PARALLEL CONFIGURABLE ARCHITECT...
AN ADABOOST-BASED FACE DETECTION SYSTEM USING PARALLEL CONFIGURABLE ARCHITECT...
 
Leveraging big data to maximize value from rail and power infrastructure assets.
Leveraging big data to maximize value from rail and power infrastructure assets.Leveraging big data to maximize value from rail and power infrastructure assets.
Leveraging big data to maximize value from rail and power infrastructure assets.
 
Graph based transistor network generation
Graph based transistor network generationGraph based transistor network generation
Graph based transistor network generation
 
Sigmaplot 13 PPT
Sigmaplot 13 PPTSigmaplot 13 PPT
Sigmaplot 13 PPT
 
Railroad Modeling at Hadoop Scale
Railroad Modeling at Hadoop ScaleRailroad Modeling at Hadoop Scale
Railroad Modeling at Hadoop Scale
 
Building Electricity Demand Forecasting
Building Electricity Demand ForecastingBuilding Electricity Demand Forecasting
Building Electricity Demand Forecasting
 
Small MATLAB Projects Research Help
Small MATLAB Projects Research HelpSmall MATLAB Projects Research Help
Small MATLAB Projects Research Help
 
Analysis_of_Remote_Sensing_Quantitative_Inversion_in_Cloud_Computing.ppt
Analysis_of_Remote_Sensing_Quantitative_Inversion_in_Cloud_Computing.pptAnalysis_of_Remote_Sensing_Quantitative_Inversion_in_Cloud_Computing.ppt
Analysis_of_Remote_Sensing_Quantitative_Inversion_in_Cloud_Computing.ppt
 
MATLAB Project Ideas Engineering Research Assistance
MATLAB Project Ideas Engineering Research AssistanceMATLAB Project Ideas Engineering Research Assistance
MATLAB Project Ideas Engineering Research Assistance
 

Viewers also liked

Cost & benefits of business analytics marshall sponder
Cost & benefits of business analytics marshall sponderCost & benefits of business analytics marshall sponder
Cost & benefits of business analytics marshall sponder
Marshall Sponder
 
syngenta procurement prices july 2015
syngenta procurement prices july 2015syngenta procurement prices july 2015
syngenta procurement prices july 2015
Kranthi Kumar
 

Viewers also liked (16)

Data Wrangling
Data WranglingData Wrangling
Data Wrangling
 
Transform Legacy Enterprise into Data-Driven Digital Business
Transform Legacy Enterprise into Data-Driven Digital BusinessTransform Legacy Enterprise into Data-Driven Digital Business
Transform Legacy Enterprise into Data-Driven Digital Business
 
Cost & benefits of business analytics marshall sponder
Cost & benefits of business analytics marshall sponderCost & benefits of business analytics marshall sponder
Cost & benefits of business analytics marshall sponder
 
syngenta procurement prices july 2015
syngenta procurement prices july 2015syngenta procurement prices july 2015
syngenta procurement prices july 2015
 
"Being creative with data" 25th November - SAS presentation
"Being creative with data" 25th November - SAS presentation"Being creative with data" 25th November - SAS presentation
"Being creative with data" 25th November - SAS presentation
 
Data Wrangling
Data WranglingData Wrangling
Data Wrangling
 
SAS Syngenta Creative Process Integration Case Study
SAS Syngenta Creative Process Integration Case StudySAS Syngenta Creative Process Integration Case Study
SAS Syngenta Creative Process Integration Case Study
 
Health IT Summit San Diego 2015 - Case Study "Analytics Strategy: Enablement,...
Health IT Summit San Diego 2015 - Case Study "Analytics Strategy: Enablement,...Health IT Summit San Diego 2015 - Case Study "Analytics Strategy: Enablement,...
Health IT Summit San Diego 2015 - Case Study "Analytics Strategy: Enablement,...
 
6 steps to richer visualizations using alteryx for microsoft power bi updated
6 steps to richer visualizations using alteryx for microsoft power bi updated6 steps to richer visualizations using alteryx for microsoft power bi updated
6 steps to richer visualizations using alteryx for microsoft power bi updated
 
Cleveland clinic
Cleveland clinicCleveland clinic
Cleveland clinic
 
Align Business Data & Analytics for Digital Transformation
Align Business Data & Analytics for Digital TransformationAlign Business Data & Analytics for Digital Transformation
Align Business Data & Analytics for Digital Transformation
 
Marketing Analytics: Data Quality, Data Matching & Marketing Metrics
Marketing Analytics: Data Quality, Data Matching & Marketing MetricsMarketing Analytics: Data Quality, Data Matching & Marketing Metrics
Marketing Analytics: Data Quality, Data Matching & Marketing Metrics
 
data warehouse , data mart, etl
data warehouse , data mart, etldata warehouse , data mart, etl
data warehouse , data mart, etl
 
Data mart
Data martData mart
Data mart
 
Data mart
Data martData mart
Data mart
 
Customer Insight Analysis
Customer Insight AnalysisCustomer Insight Analysis
Customer Insight Analysis
 

Similar to A study of Data Quality and Analytics

Improving Traffic Prediction Using Weather Data with Ramya Raghavendra
Improving Traffic Prediction Using Weather Data  with Ramya RaghavendraImproving Traffic Prediction Using Weather Data  with Ramya Raghavendra
Improving Traffic Prediction Using Weather Data with Ramya Raghavendra
Spark Summit
 
20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns
20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns
20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns
Allen Day, PhD
 
Improving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Improving Traffic Prediction Using Weather Datawith Ramya RaghavendraImproving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Improving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Spark Summit
 

Similar to A study of Data Quality and Analytics (20)

4.1_Simulation & Analysis Tools for Microgrids_Weng and Cortes_EPRI/SNL Micro...
4.1_Simulation & Analysis Tools for Microgrids_Weng and Cortes_EPRI/SNL Micro...4.1_Simulation & Analysis Tools for Microgrids_Weng and Cortes_EPRI/SNL Micro...
4.1_Simulation & Analysis Tools for Microgrids_Weng and Cortes_EPRI/SNL Micro...
 
Energy resource management
Energy resource managementEnergy resource management
Energy resource management
 
How to do accurate RE forecasting & scheduling
How to do accurate RE forecasting & scheduling How to do accurate RE forecasting & scheduling
How to do accurate RE forecasting & scheduling
 
Scalable AutoML for Time Series Forecasting using Ray
Scalable AutoML for Time Series Forecasting using RayScalable AutoML for Time Series Forecasting using Ray
Scalable AutoML for Time Series Forecasting using Ray
 
High Resolution Energy Modeling that Scales with Apache Spark 2.0 Spark Summi...
High Resolution Energy Modeling that Scales with Apache Spark 2.0 Spark Summi...High Resolution Energy Modeling that Scales with Apache Spark 2.0 Spark Summi...
High Resolution Energy Modeling that Scales with Apache Spark 2.0 Spark Summi...
 
Improving Traffic Prediction Using Weather Data with Ramya Raghavendra
Improving Traffic Prediction Using Weather Data  with Ramya RaghavendraImproving Traffic Prediction Using Weather Data  with Ramya Raghavendra
Improving Traffic Prediction Using Weather Data with Ramya Raghavendra
 
Optimizing Numerical Weather Prediction Model Performance Using Machine Learn...
Optimizing Numerical Weather Prediction Model Performance Using Machine Learn...Optimizing Numerical Weather Prediction Model Performance Using Machine Learn...
Optimizing Numerical Weather Prediction Model Performance Using Machine Learn...
 
Recent and Planned Improvements to the System Advisor Model
Recent and Planned Improvements to the System Advisor ModelRecent and Planned Improvements to the System Advisor Model
Recent and Planned Improvements to the System Advisor Model
 
20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns
20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns
20131111 - Santa Monica - BigDataCamp - Big Data Design Patterns
 
6.3_DOE-OE Microgrid Cost Study_Pratt_EPRI/SNL Microgrid Symposium
6.3_DOE-OE Microgrid Cost Study_Pratt_EPRI/SNL Microgrid Symposium6.3_DOE-OE Microgrid Cost Study_Pratt_EPRI/SNL Microgrid Symposium
6.3_DOE-OE Microgrid Cost Study_Pratt_EPRI/SNL Microgrid Symposium
 
Predictive Analytics - Big Data Warehousing Meetup, Zementis
Predictive Analytics - Big Data Warehousing Meetup, ZementisPredictive Analytics - Big Data Warehousing Meetup, Zementis
Predictive Analytics - Big Data Warehousing Meetup, Zementis
 
Personalized power saving profiles generation analyzing smart device usage pa...
Personalized power saving profiles generation analyzing smart device usage pa...Personalized power saving profiles generation analyzing smart device usage pa...
Personalized power saving profiles generation analyzing smart device usage pa...
 
M3AT: Monitoring Agents Assignment Model for the Data-Intensive Applications
M3AT: Monitoring Agents Assignment Model for the Data-Intensive ApplicationsM3AT: Monitoring Agents Assignment Model for the Data-Intensive Applications
M3AT: Monitoring Agents Assignment Model for the Data-Intensive Applications
 
Improving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Improving Traffic Prediction Using Weather Datawith Ramya RaghavendraImproving Traffic Prediction Using Weather Datawith Ramya Raghavendra
Improving Traffic Prediction Using Weather Datawith Ramya Raghavendra
 
Energy efficient computing & computational services
Energy efficient computing & computational services Energy efficient computing & computational services
Energy efficient computing & computational services
 
Load_Forecastinglfviuguuyihonrekgdbgr.pptx
Load_Forecastinglfviuguuyihonrekgdbgr.pptxLoad_Forecastinglfviuguuyihonrekgdbgr.pptx
Load_Forecastinglfviuguuyihonrekgdbgr.pptx
 
Is Spark the right choice for data analysis ?
Is Spark the right choice for data analysis ?Is Spark the right choice for data analysis ?
Is Spark the right choice for data analysis ?
 
Presentation: Wind Speed Prediction using Radial Basis Function Neural Network
Presentation: Wind Speed Prediction using Radial Basis Function Neural NetworkPresentation: Wind Speed Prediction using Radial Basis Function Neural Network
Presentation: Wind Speed Prediction using Radial Basis Function Neural Network
 
cv_Md_Ariful_Islam
cv_Md_Ariful_Islamcv_Md_Ariful_Islam
cv_Md_Ariful_Islam
 
XMPLR Data Analytics in Power Generation
XMPLR Data Analytics in  Power GenerationXMPLR Data Analytics in  Power Generation
XMPLR Data Analytics in Power Generation
 

More from Ali Habeeb

Anonymous Connections And Onion Routing
Anonymous Connections And Onion RoutingAnonymous Connections And Onion Routing
Anonymous Connections And Onion Routing
Ali Habeeb
 
Opinion Mining
Opinion MiningOpinion Mining
Opinion Mining
Ali Habeeb
 
Cloud Security
Cloud SecurityCloud Security
Cloud Security
Ali Habeeb
 
Secure erasure code based distributed storage system with secure data forwarding
Secure erasure code based distributed storage system with secure data forwardingSecure erasure code based distributed storage system with secure data forwarding
Secure erasure code based distributed storage system with secure data forwarding
Ali Habeeb
 
Organizing User Search Histories
Organizing User Search HistoriesOrganizing User Search Histories
Organizing User Search Histories
Ali Habeeb
 
Detecting and Resolving Firewall Policy Anomalies
Detecting and Resolving Firewall Policy AnomaliesDetecting and Resolving Firewall Policy Anomalies
Detecting and Resolving Firewall Policy Anomalies
Ali Habeeb
 
Bit Torrent Protocol
Bit Torrent ProtocolBit Torrent Protocol
Bit Torrent Protocol
Ali Habeeb
 

More from Ali Habeeb (20)

Anonymous Connections And Onion Routing
Anonymous Connections And Onion RoutingAnonymous Connections And Onion Routing
Anonymous Connections And Onion Routing
 
Opinion Mining
Opinion MiningOpinion Mining
Opinion Mining
 
WAP
WAPWAP
WAP
 
USB 3.0
USB 3.0USB 3.0
USB 3.0
 
Blue Eyes
Blue EyesBlue Eyes
Blue Eyes
 
Cloud Security
Cloud SecurityCloud Security
Cloud Security
 
Data-Centric Routing Protocols in Wireless Sensor Network: A survey
Data-Centric Routing Protocols in Wireless Sensor Network: A surveyData-Centric Routing Protocols in Wireless Sensor Network: A survey
Data-Centric Routing Protocols in Wireless Sensor Network: A survey
 
Web Security
Web SecurityWeb Security
Web Security
 
Secure erasure code based distributed storage system with secure data forwarding
Secure erasure code based distributed storage system with secure data forwardingSecure erasure code based distributed storage system with secure data forwarding
Secure erasure code based distributed storage system with secure data forwarding
 
Organizing User Search Histories
Organizing User Search HistoriesOrganizing User Search Histories
Organizing User Search Histories
 
Detecting and Resolving Firewall Policy Anomalies
Detecting and Resolving Firewall Policy AnomaliesDetecting and Resolving Firewall Policy Anomalies
Detecting and Resolving Firewall Policy Anomalies
 
Bit Torrent Protocol
Bit Torrent ProtocolBit Torrent Protocol
Bit Torrent Protocol
 
Adhoc and Sensor Networks - Chapter 10
Adhoc and Sensor Networks - Chapter 10Adhoc and Sensor Networks - Chapter 10
Adhoc and Sensor Networks - Chapter 10
 
Adhoc and Sensor Networks - Chapter 09
Adhoc and Sensor Networks - Chapter 09Adhoc and Sensor Networks - Chapter 09
Adhoc and Sensor Networks - Chapter 09
 
Adhoc and Sensor Networks - Chapter 08
Adhoc and Sensor Networks - Chapter 08Adhoc and Sensor Networks - Chapter 08
Adhoc and Sensor Networks - Chapter 08
 
Adhoc and Sensor Networks - Chapter 07
Adhoc and Sensor Networks - Chapter 07Adhoc and Sensor Networks - Chapter 07
Adhoc and Sensor Networks - Chapter 07
 
Adhoc and Sensor Networks - Chapter 06
Adhoc and Sensor Networks - Chapter 06Adhoc and Sensor Networks - Chapter 06
Adhoc and Sensor Networks - Chapter 06
 
Adhoc and Sensor Networks - Chapter 05
Adhoc and Sensor Networks - Chapter 05Adhoc and Sensor Networks - Chapter 05
Adhoc and Sensor Networks - Chapter 05
 
Adhoc and Sensor Networks - Chapter 04
Adhoc and Sensor Networks - Chapter 04Adhoc and Sensor Networks - Chapter 04
Adhoc and Sensor Networks - Chapter 04
 
Adhoc and Sensor Networks - Chapter 03
Adhoc and Sensor Networks - Chapter 03Adhoc and Sensor Networks - Chapter 03
Adhoc and Sensor Networks - Chapter 03
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
WSO2
 

Recently uploaded (20)

Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 

A study of Data Quality and Analytics

  • 1. A Study of Data Quality and Analytics 1 Experimental work • Predictive Modeling - Linear vs. Nonlinear • GARCH (Generalized Autoregressive Conditional Heteroskedasticity ) model application onTime series data • GARCH vs. ANN with Heteroskedasticity • Deployment of Predictive Model using PMML (Predictive Model Markup Language)
  • 2.  Areas of focus: ▪ Predictive Analytics – VariousApplications of Predictive Models ▪ PMML  Resources ▪ IEEEComputer Society,Transaction publications ▪ International Journal for Research andApplication ▪ International Institute of Forecasters ▪ ACM Journals /transactions  Status ▪ Literature survey - about 85% completion ▪ Relevant publications extracted : 75+ ▪ Further survey – Deployment of Model using PMML 2
  • 3.  Application of R Programming for Forecasting Day-ahead electricity demand - Internal Journal of Computer Science Issues, Vol 9, Issue 6, no 1, Nov 2012  Mining ofTime series data for forecasting Day and Night variances in electricity demand - National Conference on Business Analytics and Business Intelligence , Institute of Public Enterprise , Jan 2013  Forecasting of Electricity Demand using SARIMA and Feed Forward Neural Network Models, Accepted for publication in International Journal of Research in Computer Application and Management 3
  • 4.  Evaluate GARCH and ARIMA model for forecasting Day ahead electricity demand  Data - Daily Power consumption data  DevelopTesting Procedure for GARCH using R programming 4 Data collection, Data cleaning, Setup the environment, Evaluate Predictive Models ,Analysis
  • 5.  Evaluate GARCH and SARIMA model for forecasting day and night variances in electricity demand  Data - Hourly Power consumption data  GARCH forecasting has lower RMSE (Root Mean Square Error) than that of SARIMA forecasting 5 Data collection, Data cleaning, Setup the environment, Evaluate Predictive Models , Analysis
  • 6.  Evaluate SARIMA and Neural Networks model for forecasting monthly electricity demand  Data - Monthly Power consumption data  RMSE of SARIMA fitted model is smaller than that of NN whereas NN forecasting has smaller RMSE (Root Mean Square Error) than that of SARIMA forecasting 6 Data collection, Data cleaning, Setup the environment, Evaluate Predictive Models ,Analysis
  • 7.  Predictive methods and techniques – ▪ Linear Regression – ARMA, ARIMA, SARIMA ▪ Non-linear - Neural Networks, GARCH  Tools ▪ R Project, IBM SPSS  Data - Power Consumption , Stock exchange data  PMML - Predictive Model Markup Language  Model Deployment using PMML 7
  • 8.  Evaluate the GARCH model for comparing the share price performance of 3 companies  Prototype Development for the Deployment of Predictive model using PMML 8
  • 9.  Autoregressive Conditional Heteroskedasticity  Predictive (conditional)  Uncertainty (heteroskedasticity)  That fluctuates over time (autoregressive)
  • 10.  GENERALIZEDARCH (Bollerslev) a most important extension  Tomorrow’s variance is predicted to be a weighted average of the  Long run average variance  Today’s variance forecast  The news (today’s squared return)
  • 11. 11