SlideShare ist ein Scribd-Unternehmen logo
1 von 22
Predictive Analytics
Advanced Techniques in Data Mining

Sara Venturina



                      Copyright © 2011, SAS Institute Inc. All rights reserved.
Agenda
• What is predictive analytics?

• Predictive Analytics Process

• Data Preparation techniques

• Modeling Techniques

• Model Monitoring techniques




                                                                                      2



                          Copyright © 2011, SAS Institute Inc. All rights reserved.
What is Predictive Analytics?
Different levels of analytics


                                                                      Forecasting               Predictive
                                                                                                modeling     Optimization
                                           Statistical
                                           analysis
                     Query drilldown Alerts
                     (or OLAP)
           Ad hoc
           reports
Standard
reports




                                                                                                                            3



                                    Copyright © 2011, SAS Institute Inc. All rights reserved.
What is Predictive Analytics?
Unfortunately, there is no “magic” involved!

• Use of data from different source tables
• Utilizing various data transformation techniques
• Employing statistical theories as foundation
• Will need software to manage this



Focus on business/commercial (as opposed to
 research) analytics is trickier as you need to
 balance the theories with realistic application


                                                                                    4



                        Copyright © 2011, SAS Institute Inc. All rights reserved.
Predictive Analytics Process


                                                Defining
                                               Objectives




             Model                                                                     Data
           Monitoring                                                               Preparation
                                              Predictive
                                              Analytics
                                               Process




                  Deployment                                                Modeling




                                                                                                  5



                        Copyright © 2011, SAS Institute Inc. All rights reserved.
Data Preparation Techniques
• Possible data sources
• Data transformation techniques
• Deriving “behavioral” information
• Data quality check before modeling




                                                                                  6



                      Copyright © 2011, SAS Institute Inc. All rights reserved.
Data Preparation Techniques
Possible data sources
• Data warehouse/ data marts
• Operational systems i.e. transaction systems, billing,
  call center data, etc
• External data i.e. survey data, campaign, data from
  external agencies, etc

For external data make sure information is consistently available




                                                                                      7



                          Copyright © 2011, SAS Institute Inc. All rights reserved.
Data Preparation Techniques
Data transformation techniques
• Entity-level information
• Indicator variables
   • Are values skewed towards 1 level?

• Categorization/grouping of values
   • Is there too many levels of values?
   • Are there values that rarely occur?

• Binning of continuous variables
• Benchmarking information, i.e. industry benchmarking

                                                                                     8



                         Copyright © 2011, SAS Institute Inc. All rights reserved.
Data Preparation Techniques
Deriving “behavioral” information using several time
 periods
• Average behavior over the last X time periods
• Measures of variation
   • Standard deviation
   • Coefficient of Variation
   • Deviation from the Mean

• Measures of trend information
   • Ratio of 1 vs 3, 3 vs 6 time periods
   • Proportion of Current vs Average of last X time periods
   • Slope of regression line                                                         9



                          Copyright © 2011, SAS Institute Inc. All rights reserved.
Data Preparation Techniques
Data quality check before modeling
• Generation of summary statistics of derived variables
• Random checking
• Correct imputation of missing values




                                                                                 10



                     Copyright © 2011, SAS Institute Inc. All rights reserved.
Modeling Techniques
• Use of SAS Enterprise Miner
• Ensemble modeling outside of SAS
• Base SAS modeling i.e. for categorical target, survival
 analysis, etc




                                                                                 11



                     Copyright © 2011, SAS Institute Inc. All rights reserved.
Modeling Techniques
Use of SAS Enterprise Miner




     For initial /basic modeling, use Decision Tree, Regression.
      Neural networks can be used to provide diagnostic insights
                                                                                   12



                       Copyright © 2011, SAS Institute Inc. All rights reserved.
Modeling Techniques
Ensemble modeling in and out of SAS EM
                                         Ensemble Models based on the
                                                                      Weightage
                                               following models
                                             Model 1        Decision     0.4
                                             Model 2       Regression    0.6
                                             Model 3       Regression    0.4




                                                                                  13



                  Copyright © 2011, SAS Institute Inc. All rights reserved.
Modeling Techniques
Base SAS modeling
• Categorical data modeling i.e.
    • PROC CATMOD/GENMOD
    • PROC SURVEYLOGISTIC
• Survival analysis:
    • PROC LIFEREG
    • PROC LIFETEST
    • PROC PHREG

Base SAS modeling requires more familiarity with underlying statistical
 concepts
                                                                                     14



                         Copyright © 2011, SAS Institute Inc. All rights reserved.
Model Monitoring Techniques
• Comparing actual vs predicted
• Scored base analysis:
   • Variable distribution analysis
   • Predicted Score distribution




                                                                                  15



                      Copyright © 2011, SAS Institute Inc. All rights reserved.
Model Monitoring
Monitoring of model assessment charts i.e.
                                                                                measures what percentage of all churners
 Compares the effectiveness of running a                                        are in the scoring list (i.e. top 10% scores
    model versus selecting randomly                                                 captured 40% of actual churners)




Other model assessment statistics can be computed such as hit rate,
 Gini coefficient, etc
                                                                                                                               16



                                  Copyright © 2011, SAS Institute Inc. All rights reserved.
Model Monitoring (cont’d)
Scored base analysis i.e.
• Variable distribution analysis




                                                                                   17



                       Copyright © 2011, SAS Institute Inc. All rights reserved.
Model Monitoring (cont’d)
Scored base analysis i.e.
• Predicted Score distribution




                                                                                  18



                      Copyright © 2011, SAS Institute Inc. All rights reserved.
Predictive Analytics as an Iterative Process


                                                 Defining
                                                Objectives




              Model                                                                     Data
            Monitoring                                                               Preparation
                                               Predictive
                                               Analytics
                                                Process




                   Deployment                                                Modeling




                                                                                                   19



                         Copyright © 2011, SAS Institute Inc. All rights reserved.
Questions?




                                                                              20

                                                                         20
             Copyright © 2011, SAS Institute Inc. All rights reserved.
21

                                                            21
Copyright © 2011, SAS Institute Inc. All rights reserved.
Copyright © 2011, SAS Institute Inc. All rights reserved.

Weitere ähnliche Inhalte

Was ist angesagt?

Introduction To Predictive Analytics Part I
Introduction To Predictive Analytics   Part IIntroduction To Predictive Analytics   Part I
Introduction To Predictive Analytics Part Ijayroy
 
The 8 Best Examples Of Real-Time Data Analytics
The 8 Best Examples Of Real-Time Data AnalyticsThe 8 Best Examples Of Real-Time Data Analytics
The 8 Best Examples Of Real-Time Data AnalyticsBernard Marr
 
Stock market prediction technique:
Stock market prediction technique:Stock market prediction technique:
Stock market prediction technique:Paladion Networks
 
The future of business intelligence
The future of business intelligence The future of business intelligence
The future of business intelligence Phocas Software
 
AI and the Financial Service Segment
AI and the Financial Service SegmentAI and the Financial Service Segment
AI and the Financial Service SegmentGraeme Wood
 
Become a Data Analyst
Become a Data Analyst Become a Data Analyst
Become a Data Analyst Aaron Lamphere
 
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...Edureka!
 
Leveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesDianaGray10
 
Big Data Management: What's New, What's Different, and What You Need To Know
Big Data Management: What's New, What's Different, and What You Need To KnowBig Data Management: What's New, What's Different, and What You Need To Know
Big Data Management: What's New, What's Different, and What You Need To KnowSnapLogic
 
Artificial Intelligence (AI) for Financial Services
Artificial Intelligence (AI) for Financial Services Artificial Intelligence (AI) for Financial Services
Artificial Intelligence (AI) for Financial Services NVIDIA
 
Business analytics
Business analyticsBusiness analytics
Business analyticsSilla Rupesh
 
Data Monetization
Data MonetizationData Monetization
Data MonetizationDATAVERSITY
 
Using Generative AI
Using Generative AIUsing Generative AI
Using Generative AIMark DeLoura
 
Exploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfExploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfDung Hoang
 
CRISP-DM: a data science project methodology
CRISP-DM: a data science project methodologyCRISP-DM: a data science project methodology
CRISP-DM: a data science project methodologySergey Shelpuk
 

Was ist angesagt? (20)

Maisa Penha - Art of Possible.pdf
Maisa Penha - Art of Possible.pdfMaisa Penha - Art of Possible.pdf
Maisa Penha - Art of Possible.pdf
 
Data Science
Data ScienceData Science
Data Science
 
Introduction To Predictive Analytics Part I
Introduction To Predictive Analytics   Part IIntroduction To Predictive Analytics   Part I
Introduction To Predictive Analytics Part I
 
The 8 Best Examples Of Real-Time Data Analytics
The 8 Best Examples Of Real-Time Data AnalyticsThe 8 Best Examples Of Real-Time Data Analytics
The 8 Best Examples Of Real-Time Data Analytics
 
Stock market prediction technique:
Stock market prediction technique:Stock market prediction technique:
Stock market prediction technique:
 
Big data
Big dataBig data
Big data
 
The future of business intelligence
The future of business intelligence The future of business intelligence
The future of business intelligence
 
AI and the Financial Service Segment
AI and the Financial Service SegmentAI and the Financial Service Segment
AI and the Financial Service Segment
 
Become a Data Analyst
Become a Data Analyst Become a Data Analyst
Become a Data Analyst
 
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
Data Science Tutorial | What is Data Science? | Data Science For Beginners | ...
 
Leveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practices
 
Big Data Management: What's New, What's Different, and What You Need To Know
Big Data Management: What's New, What's Different, and What You Need To KnowBig Data Management: What's New, What's Different, and What You Need To Know
Big Data Management: What's New, What's Different, and What You Need To Know
 
Artificial Intelligence (AI) for Financial Services
Artificial Intelligence (AI) for Financial Services Artificial Intelligence (AI) for Financial Services
Artificial Intelligence (AI) for Financial Services
 
Business analytics
Business analyticsBusiness analytics
Business analytics
 
Presentation on Big Data
Presentation on Big DataPresentation on Big Data
Presentation on Big Data
 
Data Monetization
Data MonetizationData Monetization
Data Monetization
 
Using Generative AI
Using Generative AIUsing Generative AI
Using Generative AI
 
Data Science
Data ScienceData Science
Data Science
 
Exploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfExploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdf
 
CRISP-DM: a data science project methodology
CRISP-DM: a data science project methodologyCRISP-DM: a data science project methodology
CRISP-DM: a data science project methodology
 

Ähnlich wie Predictive Analytics: Advanced techniques in data mining

Big Data Needs Big Analytics
Big Data Needs Big AnalyticsBig Data Needs Big Analytics
Big Data Needs Big AnalyticsDeepak Ramanathan
 
Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...Kun Le
 
Asian Bankers Association, Manila Conference
Asian Bankers Association, Manila ConferenceAsian Bankers Association, Manila Conference
Asian Bankers Association, Manila ConferenceDeepak Ramanathan
 
Big data meets big analytics
Big data meets big analyticsBig data meets big analytics
Big data meets big analyticsDeepak Ramanathan
 
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data European Data Forum
 
Evaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics PlatformsEvaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics PlatformsTeradata Aster
 
What is the Value of SAS Analytics?
What is the Value of SAS Analytics?What is the Value of SAS Analytics?
What is the Value of SAS Analytics?SAS Canada
 
Zakipoint Introduction
Zakipoint IntroductionZakipoint Introduction
Zakipoint Introductionrameshkbudhani
 
Streaming Cloud Analytics: Enabling Dynamic Product Innovation From User Expe...
Streaming Cloud Analytics: Enabling Dynamic Product Innovation From User Expe...Streaming Cloud Analytics: Enabling Dynamic Product Innovation From User Expe...
Streaming Cloud Analytics: Enabling Dynamic Product Innovation From User Expe...Pivotal Analytics (Cetas Analytics)
 
Introduction to SAS Forecasting
Introduction to SAS ForecastingIntroduction to SAS Forecasting
Introduction to SAS ForecastingSAS Canada
 
Data Management for High Performance Analytics
Data Management for High Performance AnalyticsData Management for High Performance Analytics
Data Management for High Performance AnalyticsMary Snyder
 
Sybase Complex Event Processing
Sybase Complex Event ProcessingSybase Complex Event Processing
Sybase Complex Event ProcessingSybase Türkiye
 
Real-time Big Data Analytics: From Deployment to Production
Real-time Big Data Analytics: From Deployment to ProductionReal-time Big Data Analytics: From Deployment to Production
Real-time Big Data Analytics: From Deployment to ProductionRevolution Analytics
 

Ähnlich wie Predictive Analytics: Advanced techniques in data mining (20)

Big Data Needs Big Analytics
Big Data Needs Big AnalyticsBig Data Needs Big Analytics
Big Data Needs Big Analytics
 
Big Data Needs Big Analytics
Big Data Needs Big AnalyticsBig Data Needs Big Analytics
Big Data Needs Big Analytics
 
101 ab 1345-1415
101 ab 1345-1415101 ab 1345-1415
101 ab 1345-1415
 
101 ab 1345-1415
101 ab 1345-1415101 ab 1345-1415
101 ab 1345-1415
 
Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...Best practices for building and deploying predictive models over big data pre...
Best practices for building and deploying predictive models over big data pre...
 
Future of Analytics is here
Future of Analytics is hereFuture of Analytics is here
Future of Analytics is here
 
Asian Bankers Association, Manila Conference
Asian Bankers Association, Manila ConferenceAsian Bankers Association, Manila Conference
Asian Bankers Association, Manila Conference
 
Big data meets big analytics
Big data meets big analyticsBig data meets big analytics
Big data meets big analytics
 
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
EDF2013: Selected Talk: Bryan Drexler: The 80/20 Rule and Big Data
 
Evaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics PlatformsEvaluating Big Data Predictive Analytics Platforms
Evaluating Big Data Predictive Analytics Platforms
 
What is the Value of SAS Analytics?
What is the Value of SAS Analytics?What is the Value of SAS Analytics?
What is the Value of SAS Analytics?
 
Zakipoint Introduction
Zakipoint IntroductionZakipoint Introduction
Zakipoint Introduction
 
Streaming Cloud Analytics: Enabling Dynamic Product Innovation From User Expe...
Streaming Cloud Analytics: Enabling Dynamic Product Innovation From User Expe...Streaming Cloud Analytics: Enabling Dynamic Product Innovation From User Expe...
Streaming Cloud Analytics: Enabling Dynamic Product Innovation From User Expe...
 
Introduction to SAS Forecasting
Introduction to SAS ForecastingIntroduction to SAS Forecasting
Introduction to SAS Forecasting
 
Data Management for High Performance Analytics
Data Management for High Performance AnalyticsData Management for High Performance Analytics
Data Management for High Performance Analytics
 
Sybase Complex Event Processing
Sybase Complex Event ProcessingSybase Complex Event Processing
Sybase Complex Event Processing
 
Clinical approach to technical upgrade
Clinical approach to technical upgradeClinical approach to technical upgrade
Clinical approach to technical upgrade
 
Technology update
Technology update   Technology update
Technology update
 
Technology Update
Technology UpdateTechnology Update
Technology Update
 
Real-time Big Data Analytics: From Deployment to Production
Real-time Big Data Analytics: From Deployment to ProductionReal-time Big Data Analytics: From Deployment to Production
Real-time Big Data Analytics: From Deployment to Production
 

Mehr von SAS Asia Pacific

Improving the Model’s Predictive Power with Ensemble Approaches
Improving the Model’s Predictive Power with Ensemble ApproachesImproving the Model’s Predictive Power with Ensemble Approaches
Improving the Model’s Predictive Power with Ensemble ApproachesSAS Asia Pacific
 
Instantly & Visually Explore Big Data with Powerful Analytics
Instantly & Visually Explore Big Data with Powerful AnalyticsInstantly & Visually Explore Big Data with Powerful Analytics
Instantly & Visually Explore Big Data with Powerful AnalyticsSAS Asia Pacific
 
Produce Analytical Talent to Meet the Industry Needs
Produce Analytical Talent to Meet the Industry NeedsProduce Analytical Talent to Meet the Industry Needs
Produce Analytical Talent to Meet the Industry NeedsSAS Asia Pacific
 
Better decisions through analytics in healthcare industry. Our journey so far
Better decisions through analytics in healthcare industry.  Our journey so farBetter decisions through analytics in healthcare industry.  Our journey so far
Better decisions through analytics in healthcare industry. Our journey so farSAS Asia Pacific
 
How can Analytics Drive Customer Values?
How can Analytics Drive Customer Values?How can Analytics Drive Customer Values?
How can Analytics Drive Customer Values?SAS Asia Pacific
 
Developing an Analytical Mindset – Becoming an Analytical Competitor
Developing an Analytical Mindset – Becoming an Analytical CompetitorDeveloping an Analytical Mindset – Becoming an Analytical Competitor
Developing an Analytical Mindset – Becoming an Analytical CompetitorSAS Asia Pacific
 
Gaining New Insights into Usage Log Data
Gaining New Insights into Usage Log Data Gaining New Insights into Usage Log Data
Gaining New Insights into Usage Log Data SAS Asia Pacific
 
A Journey through the Spatial Data Mining and Geographic Knowledge Discover J...
A Journey through the Spatial Data Mining and Geographic Knowledge Discover J...A Journey through the Spatial Data Mining and Geographic Knowledge Discover J...
A Journey through the Spatial Data Mining and Geographic Knowledge Discover J...SAS Asia Pacific
 
A journey through the spatial data mining and geographic knowledge discovery ...
A journey through the spatial data mining and geographic knowledge discovery ...A journey through the spatial data mining and geographic knowledge discovery ...
A journey through the spatial data mining and geographic knowledge discovery ...SAS Asia Pacific
 

Mehr von SAS Asia Pacific (9)

Improving the Model’s Predictive Power with Ensemble Approaches
Improving the Model’s Predictive Power with Ensemble ApproachesImproving the Model’s Predictive Power with Ensemble Approaches
Improving the Model’s Predictive Power with Ensemble Approaches
 
Instantly & Visually Explore Big Data with Powerful Analytics
Instantly & Visually Explore Big Data with Powerful AnalyticsInstantly & Visually Explore Big Data with Powerful Analytics
Instantly & Visually Explore Big Data with Powerful Analytics
 
Produce Analytical Talent to Meet the Industry Needs
Produce Analytical Talent to Meet the Industry NeedsProduce Analytical Talent to Meet the Industry Needs
Produce Analytical Talent to Meet the Industry Needs
 
Better decisions through analytics in healthcare industry. Our journey so far
Better decisions through analytics in healthcare industry.  Our journey so farBetter decisions through analytics in healthcare industry.  Our journey so far
Better decisions through analytics in healthcare industry. Our journey so far
 
How can Analytics Drive Customer Values?
How can Analytics Drive Customer Values?How can Analytics Drive Customer Values?
How can Analytics Drive Customer Values?
 
Developing an Analytical Mindset – Becoming an Analytical Competitor
Developing an Analytical Mindset – Becoming an Analytical CompetitorDeveloping an Analytical Mindset – Becoming an Analytical Competitor
Developing an Analytical Mindset – Becoming an Analytical Competitor
 
Gaining New Insights into Usage Log Data
Gaining New Insights into Usage Log Data Gaining New Insights into Usage Log Data
Gaining New Insights into Usage Log Data
 
A Journey through the Spatial Data Mining and Geographic Knowledge Discover J...
A Journey through the Spatial Data Mining and Geographic Knowledge Discover J...A Journey through the Spatial Data Mining and Geographic Knowledge Discover J...
A Journey through the Spatial Data Mining and Geographic Knowledge Discover J...
 
A journey through the spatial data mining and geographic knowledge discovery ...
A journey through the spatial data mining and geographic knowledge discovery ...A journey through the spatial data mining and geographic knowledge discovery ...
A journey through the spatial data mining and geographic knowledge discovery ...
 

Kürzlich hochgeladen

2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 

Kürzlich hochgeladen (20)

2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 

Predictive Analytics: Advanced techniques in data mining

  • 1. Predictive Analytics Advanced Techniques in Data Mining Sara Venturina Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 2. Agenda • What is predictive analytics? • Predictive Analytics Process • Data Preparation techniques • Modeling Techniques • Model Monitoring techniques 2 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 3. What is Predictive Analytics? Different levels of analytics Forecasting Predictive modeling Optimization Statistical analysis Query drilldown Alerts (or OLAP) Ad hoc reports Standard reports 3 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 4. What is Predictive Analytics? Unfortunately, there is no “magic” involved! • Use of data from different source tables • Utilizing various data transformation techniques • Employing statistical theories as foundation • Will need software to manage this Focus on business/commercial (as opposed to research) analytics is trickier as you need to balance the theories with realistic application 4 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 5. Predictive Analytics Process Defining Objectives Model Data Monitoring Preparation Predictive Analytics Process Deployment Modeling 5 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 6. Data Preparation Techniques • Possible data sources • Data transformation techniques • Deriving “behavioral” information • Data quality check before modeling 6 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 7. Data Preparation Techniques Possible data sources • Data warehouse/ data marts • Operational systems i.e. transaction systems, billing, call center data, etc • External data i.e. survey data, campaign, data from external agencies, etc For external data make sure information is consistently available 7 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 8. Data Preparation Techniques Data transformation techniques • Entity-level information • Indicator variables • Are values skewed towards 1 level? • Categorization/grouping of values • Is there too many levels of values? • Are there values that rarely occur? • Binning of continuous variables • Benchmarking information, i.e. industry benchmarking 8 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 9. Data Preparation Techniques Deriving “behavioral” information using several time periods • Average behavior over the last X time periods • Measures of variation • Standard deviation • Coefficient of Variation • Deviation from the Mean • Measures of trend information • Ratio of 1 vs 3, 3 vs 6 time periods • Proportion of Current vs Average of last X time periods • Slope of regression line 9 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 10. Data Preparation Techniques Data quality check before modeling • Generation of summary statistics of derived variables • Random checking • Correct imputation of missing values 10 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 11. Modeling Techniques • Use of SAS Enterprise Miner • Ensemble modeling outside of SAS • Base SAS modeling i.e. for categorical target, survival analysis, etc 11 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 12. Modeling Techniques Use of SAS Enterprise Miner For initial /basic modeling, use Decision Tree, Regression. Neural networks can be used to provide diagnostic insights 12 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 13. Modeling Techniques Ensemble modeling in and out of SAS EM Ensemble Models based on the Weightage following models Model 1 Decision 0.4 Model 2 Regression 0.6 Model 3 Regression 0.4 13 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 14. Modeling Techniques Base SAS modeling • Categorical data modeling i.e. • PROC CATMOD/GENMOD • PROC SURVEYLOGISTIC • Survival analysis: • PROC LIFEREG • PROC LIFETEST • PROC PHREG Base SAS modeling requires more familiarity with underlying statistical concepts 14 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 15. Model Monitoring Techniques • Comparing actual vs predicted • Scored base analysis: • Variable distribution analysis • Predicted Score distribution 15 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 16. Model Monitoring Monitoring of model assessment charts i.e. measures what percentage of all churners Compares the effectiveness of running a are in the scoring list (i.e. top 10% scores model versus selecting randomly captured 40% of actual churners) Other model assessment statistics can be computed such as hit rate, Gini coefficient, etc 16 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 17. Model Monitoring (cont’d) Scored base analysis i.e. • Variable distribution analysis 17 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 18. Model Monitoring (cont’d) Scored base analysis i.e. • Predicted Score distribution 18 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 19. Predictive Analytics as an Iterative Process Defining Objectives Model Data Monitoring Preparation Predictive Analytics Process Deployment Modeling 19 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 20. Questions? 20 20 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 21. 21 21 Copyright © 2011, SAS Institute Inc. All rights reserved.
  • 22. Copyright © 2011, SAS Institute Inc. All rights reserved.