SlideShare ist ein Scribd-Unternehmen logo
1 von 35
Predictive Analytics
Context and Use Cases
Kimberley Mitchell
kim@who-knows.com
http://mitchki.com
@mitchki
3/5/2015
Contents
• Evolution of Analytics
• Predictive Analytics: Definition
– Predictive Analytics: Process
– Traditional Predictive Analytics
• 21st Century Transition: Big Data Content
– Who is using big data?
– Analytic process changes
– Big Data Tools
– Big Data Techniques
• Predictive Analytics Use Cases
• Use Cases: Optimizing Outcomes
– Improve risk calculations
– Optimize Energy Usage
– Improve Customer Recommendations
– Optimize policy interventions
• Business Use Case: Increase Revenue
– Kraft: Digital Marketing
– Walgreens: Using data from loyalty program
– Legendary Pictures: Applied Analytics Division
– Mashable: Data Driven Digital Publishing
• Business Use Case: Reduce Cost
– Human Resources: Reducing Attrition
– Manufacturing Process Improvement
– Improve Customer Retention
– Predictive Maintenance
• Predictive Analytics Trends Forecast
• Questions?
2
Evolution of Analytics
3/5/2015 3
Seat of the pants:
Intuition, domain
knowledge
BI (data driven):
Visualizations,
KPI’s, OLAP,
Dashboards
Predictive
Analytics:
Modeling,
Experimental
verification
Real-time
streaming
analytics:
Deep learning, A.I.
Complexity
Insights
Future looking
Past looking
Predictive Analytics - Definition
Predictive analytics is an area of data mining that deals with
extracting information from data and using it to predict trends
and behavior patterns. Often the unknown event of interest is
in the future, but predictive analytics can be applied to any
type of unknown whether it be in the past, present or future.
-- Wikipedia
Predictive Analytics Process
Model
Verification
Theoretical
Model
Treatments +
Controls + Error =
Response
Domain
knowledge,
1st principles,
experience
Pre-existing
data
Traditional Predictive Analytics
• Applications
– Research and development in agriculture and industry
• Designed experiments for product and process improvement
– Verifying the effectiveness of healthcare treatments
• Clinical trials (randomized double blind are best)
– Predicting the weather
– Polls: opinions, election politics, Nielson ratings.
– Standardized tests: e.g. SAT’s to predict college success
– Actuarial Science: life expectancy, etc.
• Limitations
– Each data point must be planned and collected intentionally
– Expensive to design, collect the data
21st Century Transformation: Big Data Content
1. Internet behavior and logs
– Every click and keystroke stored, server logs
– Automatic byproduct of the Internet
2. Internet content
– Text, emails, statuses, images, audio, video, etc.
– Voluntary product of Internet use, automatically harvested
3. Internet of things
– Sensors of all types: temperature, pressure, kinematics, images, audio, etc.
– Must be consciously set up and designed
4. Access to thousands of sources of traditionally collected data
– Open data movement (gov’t, academia)
Who is using big data?
“Big data is like teenage sex: everyone talks about it, nobody really knows
how to do it, everyone thinks everyone else is doing it, so everyone claims
they are doing it…”—Dan Ariely (2012)
Even if this was true in 2012, it is no longer true; many organizations are
successfully implementing modern predictive analytics with big data.
Analytic process changes
• Huge quantity and variety of data available with little effort/cost
• Substantial extra effort needed to extract and process data that have been
collected without a specific end use in mind (ETL)
– Unstructured data images, videos, audio, etc.
– Combination of a variety of data sources
– Some creativity required to compose relevant metrics
• Data availability leads to increased data mining, exploratory data analysis and
data visualization requirements
– Hundreds or even thousands of variables are “thrown in” to analyses
– More “statistically significant” relationships found
• Some new insights that are real
• Some mistakes due to over-fitting, random clusters (some algorithms exist to help filter out false
“hits”)
– Domain knowledge / insights still crucial to distinguish real effects from random chance or
allied effects
Traditional Tools and Techniques
• Traditional Tools
– Mostly commercial – SAS, SPSS, STATA, Minitab, Excel
– Open Source – R, Python
• Traditional Techniques
– Various flavors of regression analyses
• Linear, multinomial, logistic, general linear models, regression discontinuity,
instrumental variables, panel models, time series, etc.
• Designed experiments, ANOVA
– Controls built in for known covariates
– Randomization to accommodate unknown covariates
– Sometimes designed to extract maximum possible info from fewest observations
10
Big Data Tools
• Big Data Predictive Analytics Tools
– Open Source
• Apache Spark / MLlib
• Python scripts interfaced to Hadoop or other distributed data source
• Sometimes R or other traditional tools if data can be extracted and reduced to a size to fit on
a single processor.
• Mondrian (OLAP – more BI than predictive analytics)
– Commercial / Proprietary
• RapidMiner (Radoop for Hadoop analytics)
• Pentaho – compare effectiveness of various models: operates against many data sources
(being acquired by Hitachi)
• HP Vertica Distributed R – ML algorithms pre-loaded
• BeyondCore - 1st through 4th order interactions automatically calculated
• SAS Enterprise Miner
• SPSS adding big data analytics modules
• IBM Predictive Maintenance and Quality
• Homegrown solutions internal to companies (hand-coded Python or Java algorithms)
Big Data Techniques
• Big Data Predictive Analytics Techniques
– Numerical responses / Categorical responses
– Traditional Analytical Techniques – Linear, logistic regressions, GLM regressions, simulations
– Distributed Analytics – i.e. MapReduce
– Machine Learning Algorithms - Bayes network, decision trees, rule engines
– A/B Testing – Designed randomized experiments to confirm model’s predictive power
– More use of natural experiments (due to increased sharing of data sets)
• Reference Books
– Kuhn, M. and Johnson, K., Applied Predictive Modeling, Springer, 2013. (1)
• Many worked examples in R with data available
• Guide to understanding various models and how to apply them.
• Johns Hopkins/Coursera MOOC on Practical Machine Learning uses this book
– Leskovec, Rajaraman, and Ullman, Mining of Massive Datasets, Cambridge University Press, 2nd edition, 2014.
(2)
• Available commercially and also as a free pdf download from http://www.mmds.org/index2.html with associated materials.
• Teaches the mathematical models and logical constructs (algorithms) underlying data mining and machine learning algorithms,
including many exercises .
• Stanford/Coursera MOOC on Mining Massive Datasets uses this book
Predictive Analytics:
Current Use Cases
Directly related to the sponsoring organization’s goals
13
Use Cases
• What determines actual usage of analytics?
– As always: organizational goals
• Business / organizational goals drive initiatives
1. Improve outcomes (not obviously dollar related)
• For non-profits: improve client outcomes
– Fulfill the non-market based goal
• Improve products, customer satisfaction
– Provide the best product/experience and the customer will come
2. Increase profits by increasing revenue
3. Increase profits by reducing costs
14
Use Cases: Optimize Products or Outcomes
• More precise risk calculations in Auto Insurance (IoT)
– Progressive - Snapshot measures risk of driving habits to offer lower rates
• http://www.dailyfinance.com/2012/08/27/the-hidden-costs-of-cheap-car-insurance/ (3)
• Optimize comfort and energy usage in commercial buildings
– BuildingIQ - Predictive Energy Optimization™ uses advanced algorithms to automatically fine tune and control
HVAC systems resulting in savings while maintaining or improving comfort.
• http://www.buildingiq.com/ (4)
• Improve Customer Recommendations (many examples)
– Nordstrom - Scoring customer to product segment interaction. Output: Affinity scores of each customer to the
product segment, For each segment, count of customers who have above average affinity scores compared to
the general population in that segment.
• https://ieondemand.com/divisions/analytics/events/275/presentations/multi-channel-analytics-at-nordstrom-data-labs (5)
• Policy interventions – development economics (non-profit)
– Data-Pop Alliance: Using cell phone data to predict socioeconomic levels in developing areas
• https://ieondemand.com/divisions/analytics/events/275/presentations/big-data-for-development-technocratic-democratic-
considerations (6)
15
Improve risk calculations
• Progressive Auto Insurance: Snapshot measures risk of driving
habits to allow the offer of lower rates
– Automated device tracks actual driver patterns for 30 days (IoT)
– Discounts offered based on time of day, pattern of hard braking,
amount of driving
– Response: Much clearer picture of actual driving risks than self-
reporting
– http://www.dailyfinance.com/2012/08/27/the-hidden-costs-of-
cheap-car-insurance
Optimize Energy Usage
• Optimize comfort and energy usage in commercial buildings
(IoT)
– BuildingIQ - Predictive Energy Optimization™ uses advanced
algorithms to automatically fine tune and control HVAC systems
resulting in savings while maintaining or improving comfort.
– Interfaces with existing building energy management system and
uses predictive analytics to reduce energy costs while maintaining
comfort.
• http://www.buildingiq.com/
Optimize Energy Usage - BuildingIQ
Improve Customer Recommendations
• Many examples with “long tail” product offerings
– Nordstrom - Scoring customer to product segment interaction.
• Output: Affinity scores of each customer to the product segment
• For each segment, count of customers who have above average affinity
scores compared to the general population in that segment.
• Probable tools: Clustering, recommendation engines
• https://ieondemand.com/divisions/analytics/events/275/presentations/mult
i-channel-analytics-at-nordstrom-data-labs
Optimize policy interventions
• Development economics (non-profit)
– Data-Pop Alliance: ”Our mission statement and raison d'être is to promote a humanistic, people-
centered ‘Big Data revolution’: one that fosters human development and social progress through
the ethical use of personal data and empowerment of the poor and most vulnerable and avoids
the pitfalls of a new digital divide, de-humanization and de-democratization.”
– Using cell phone data to predict socioeconomic levels in developing areas
– Traditionally difficult to measure level of economic well-being; involving expensive surveys and
in-person travel
– Since many developing societies are bypassing landlines and going directly to cell phone
communications, quantities of cell phone units and traffic can be used to estimate changes in
societal well being at a vastly reduced cost.
• https://ieondemand.com/divisions/analytics/events/275/presentations/big-data-for-development-
technocratic-democratic-considerations
Business Use Case: Increase Revenue
• Digital Marketing
– Kraft: How Kraft uses Digital Marketing
• http://www.clickz.com/clickz/news/2392789/how-kraft-harnessed-data-to-transform-marketing (7)
– Walgreens: Using data from loyalty program to deliver value and drive customer loyalty
• https://ieondemand.com/divisions/analytics/events/275/presentations/delivering-value-driving-
best-customer-loyalty-through-pricing-promotions (8)
– Legendary Entertainment: Inform Creative decisions (primarily concept evaluation and cast
evaluation (social media) transform marketing
• https://ieondemand.com/divisions/analytics/events/275/presentations/changing-hollywood-
paradigms-with-analytics (9)
– Mashable: Data Driven Digital Publishing - Mashable's proprietary Velocity platform
predicts what's going viral next
• https://ieondemand.com/divisions/analytics/events/275/presentations/data-driving-digital-
publishing (10)
Kraft: Digital Marketing
• How Kraft uses Digital Marketing
• Leveraging 18 years worth of customer connections, first from print, then from
kraftrecipes.com
• Using the combined content from kraftrecipes.com with their data
management systems to develop insights about their customer base.
• Slogan: “data is people”
• Now recording more than 34,000 attributes from 100 million online visitors
each year, forming 800 segments of customers to buy ads against
• http://www.clickz.com/clickz/news/2392789/how-kraft-harnessed-data-to-
transform-marketing
Walgreens: Using data from loyalty program
• Primarily “brick and mortar” stores; so how to get individual customer data for
analytics?
• Walgreens launched “Balance Rewards” in September 2012, to combine insights from a
loyalty program with consumer behavior.
– Loyalty programs are an illustration of how non-website data can be used for predictive analytics
• Driving direct mail advertising, brick and mortar display positioning, inventory, etc.
– Assortment: Carry products that meet the needs of targeted customer segments
– Mass promotions: Drive loyalty with offers that attract and drive sales with targeted segments
– Direct Marketing: Target “best” customers with “best” products for them.
– Pricing: Understand customer pricing sensitivity.
– Store Layout and Space Allocation: Align layout with the way customers shop.
• https://ieondemand.com/divisions/analytics/events/275/presentations/delivering-
value-driving-best-customer-loyalty-through-pricing-promotions
Legendary Pictures: Applied Analytics Division
• Legendary Pictures
– Movies such as Interstellar, Godzilla, Dracula Untold, Man of
Steel, Inception, Unbroken
– Global scale and reach
– Innovation factory, not a warehouse
• Use of predictive analytics now a part of key strategies
to inform creative decisions
– Concept and cast evaluation
• Green lighting concepts
• Cast strengths and weaknesses
• Build trailers on analytics
• Proprietary, commercial-grade software constantly
analyzes unique, extensive data across people, social
media, and content.
– 500 million emails, 250 million households
– Entertainment data
• Metadata on films since 1989
• Theatre data for 10 years
• Social media data: sentiment, zeitgeist, topics
• Transform marketing
– Old fashioned: 4 quadrants:
male/female/over 25/under 25; 4
groups of 80 million each
– Targeting: 80 million groups of 4
– Aiming at “swing groups” not
fans nor never interested “Mom”
• https://ieondemand.com/divisio
ns/analytics/events/275/present
ations/changing-hollywood-
paradigms-with-analytics
24
Mashable: Data Driven Digital Publishing
• For article placement, mashable.com needs to predict which items will go viral in real
time!
• Mashable's proprietary Velocity platform predicts what's going viral
• Mashable Velocity: proprietary software: 130,000,000 urls, with 1,000,000 urls crawled
per day
• Saw an increase of 55% in web traffic over the first 2 years using the Velocity
technology.
• https://ieondemand.com/divisions/analytics/events/275/presentations/data-driving-
digital-publishing
25
Business Use Case: Reducing Cost
• Human Resources: Reducing Attrition
– Talent Analytics: Case Study: “Raw Talent Traits” Correlated to Attrition - Attrition Dropped by 30%
Yielding Multimillion Dollar Cost Savings
• http://www.talentanalytics.com/resources/case-studies/ (11)
• Manufacturing Process Improvement
– Intel - Using IoT and predictive analytics in chip manufacturing saved $3 million in core processor
testing
• http://newsroom.intel.com/community/apac_en/blog/2014/10/09/big-data-and-iot-in-manufacturing-in-everyday-
life (12)
• Improve Customer Retention
– Time-Warner Cable: Using analytics to optimize price changes
• https://ieondemand.com/divisions/analytics/events/275/presentations/using-analytics-to-optimize-subscription-
price-changes (13)
• Predictive Maintenance
– How Predictive Analytics Improves Wind Turbine Maintenance
• http://algoritmica.nl/how-predictive-analytics-improves-wind-turbine-maintenance/ (14)
Human Resources: Reducing Attrition
• Talent Analytics: Case Study: “Raw Talent Traits” Correlated to
Attrition - Attrition Dropped by 30% Yielding Multimillion Dollar Cost
Savings
– CSR roles required a 12-week training plus passing a series 7 test before
starting work
– Voluntary attrition was greater than 60%; reduced to < 40% as a result of
analytics intervention
– Success linked to combining data sets; traditional HR metrics with
proprietary survey to identify raw talent traits
– http://www.talentanalytics.com/resources/case-studies/
Manufacturing Process Improvement
• Intel - Using IoT and predictive analytics in chip manufacturing
– Process and product measurements taken upstream during the manufacturing process are
able to predict whether the chip will pass final test leading to a reduction in final testing
requirements.
– "Instead of running every single chip through 19,000 tests, we can focus tests on specific
chips to cut down test time.“
– This predictive analytics process, implemented on a single line of Intel Core processors in
2012, allowed Intel to save $3 million in manufacturing costs. In 2013-14, Intel expects to
extend the process to more chip lines and save an additional $30 million, the company
said.
• http://www.informationweek.com/software/information-management/intel-cuts-manufacturing-
costs-with-big-data/d/d-id/1109111?
Improve Customer Retention
• Time-Warner Cable: Using analytics to optimize price changes
– Using analytics, predict time-oriented price sensitivity
– Use promotion expiration as “natural experiments” due to regulatory
prohibitions against actual experimentation with customers pricing
– Measure churn response of “test” group against pre-selected similar
“control” group
• https://ieondemand.com/divisions/analytics/events/275/presentations/usin
g-analytics-to-optimize-subscription-price-changes
Predictive Maintenance
• How Predictive Analytics Improves Wind Turbine Maintenance
– http://algoritmica.nl/how-predictive-analytics-improves-wind-turbine-maintenance/
Predictive Analytics Trends
• Cost of analytics will be driven down by:
– Standardization of IoT
– Spread of expertise
– Commercialization and widespread use of
successful techniques
– Development of “best practices” and user-
friendly software to support them in specific
key applications
• Web and web-content data
– Profusion of analytical tools will consolidate to
a couple of winners
– Most used and useful tools will be identified
and commercialized
– New applications will continue to target
identifiable individuals in the “persuadable
middle”
• Internet of Things
– Development of interface standards will lead
to more commercially available tools as well
as open source solutions
• Healthcare analytics
– Will increasingly use Big Data as privacy and
regulatory issues resolved
• Leading edge
– Move from manual to dynamic model
generation as understanding of relevant
factors improves (especially in IoT, since things
tend to have more consistent behaviors than
people)
– Analysis of streaming data becomes more
mainstream
31
Questions?
Prediction is very difficult, especially if
it’s about the future.
-- Niels Bohr
Kimberley Mitchell
kim@who-knows.com
http://mitchki.com
@mitchki
Appendix: References
(1) Kuhn, M. and Johnson, K., Applied Predictive Modeling, Springer, 2013.
(2) Leskovec, Rajaraman, and Ullman, Mining of Massive Datasets, Cambridge University Press, 2nd edition, 2014.
(3) Hayes, Joy M. (2012, August 27) The Hidden Costs of Cheap Car Insurance. Retrieved from
http://www.dailyfinance.com/2012/08/27/the-hidden-costs-of-cheap-car-insurance/
(4) BuildingIQ (2014). Predictive Energy Optimization. Retrieved from http://www.buildingiq.com/products/buildingiq-platform/
(5) Arun, Veettil (2014, November) Multi-Channel Analytics at Nordstrom Data Labs. Retrieved from
https://ieondemand.com/divisions/analytics/events/275/presentations/multi-channel-analytics-at-nordstrom-data-labs
(6) Letouzé, Emmanuel (2014, November) Big Data for Development: Technocratic & Democratic Considerations. Retrieved from
https://ieondemand.com/divisions/analytics/events/275/presentations/big-data-for-development-technocratic-democratic-
considerations
3/5/2015 33
Appendix: References (page 2)
(7) White, Melanie (2015, January 29) How Kraft Harnessed Data to Transform Marketing, Retrieved from
http://www.clickz.com/clickz/news/2392789/how-kraft-harnessed-data-to-transform-marketing
(8) Darer, Michael (2014, November) Delivering Value & Driving Best Customer Loyalty Through Pricing &
Promotions, Retrieved from https://ieondemand.com/divisions/analytics/events/275/presentations/delivering-
value-driving-best-customer-loyalty-through-pricing-promotions
(9) Marolda, Matt (2014, November) Changing Hollywood Paradigms with Analytics. Retrieved from
https://ieondemand.com/divisions/analytics/events/275/presentations/changing-hollywood-paradigms-with-
analytics
(10) Haile Owusu, Haile (2014, November) Data Driving Digital Publishing. Retrieved from
ttps://ieondemand.com/divisions/analytics/events/275/presentations/data-driving-digital-publishing
3/5/2015 34
Appendix: References (page 3)
(11) Talent Analytics (2014) Case Study: “Raw Talent Traits” Correlated to Attrition. Retrieved from
http://www.talentanalytics.com/resources/case-studies/
(12) Mandeville, John (2014, October 9) Big Data and IoT in Manufacturing “In Everyday Life”.
Retrieved from http://newsroom.intel.com/community/apac_en/blog/2014/10/09/big-data-and-
iot-in-manufacturing-in-everyday-life
(13) Zyabkina, Tanya (2014, November) Using Analytics to Optimize Subscription Price Changes.
Retrieved from https://ieondemand.com/divisions/analytics/events/275/presentations/using-
analytics-to-optimize-subscription-price-changes
(14) Mark-Jan Harte ( 2014, March5) How Predictive Analytics Improves Wind Turbine Maintenance.
Retrieved from http://algoritmica.nl/how-predictive-analytics-improves-wind-turbine-
maintenance/
3/5/2015 35

Weitere ähnliche Inhalte

Was ist angesagt?

Introduction To Predictive Analytics Part I
Introduction To Predictive Analytics   Part IIntroduction To Predictive Analytics   Part I
Introduction To Predictive Analytics Part Ijayroy
 
Data Analytics and Business Intelligence
Data Analytics and Business IntelligenceData Analytics and Business Intelligence
Data Analytics and Business IntelligenceChris Ortega, MBA
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling FundamentalsDATAVERSITY
 
Predictive Analytics - An Overview
Predictive Analytics - An OverviewPredictive Analytics - An Overview
Predictive Analytics - An OverviewMachinePulse
 
Introduction to data analytics
Introduction to data analyticsIntroduction to data analytics
Introduction to data analyticsSSaudia
 
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Simplilearn
 
Data Mining and Business Intelligence Tools
Data Mining and Business Intelligence ToolsData Mining and Business Intelligence Tools
Data Mining and Business Intelligence ToolsMotaz Saad
 
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...Edureka!
 
Feature Engineering.pdf
Feature Engineering.pdfFeature Engineering.pdf
Feature Engineering.pdfRajoo Jha
 
Big Data Architecture
Big Data ArchitectureBig Data Architecture
Big Data ArchitectureGuido Schmutz
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data scienceMahir Haque
 
Analytics Overview #Predictive Analytics
Analytics Overview #Predictive AnalyticsAnalytics Overview #Predictive Analytics
Analytics Overview #Predictive AnalyticsDurga Palakurthy
 
Customer Churn Analysis and Prediction
Customer Churn Analysis and PredictionCustomer Churn Analysis and Prediction
Customer Churn Analysis and PredictionSOUMIT KAR
 
Business Intelligence Trends 2020
Business Intelligence Trends 2020Business Intelligence Trends 2020
Business Intelligence Trends 2020Wiiisdom
 
The Data Science Process
The Data Science ProcessThe Data Science Process
The Data Science ProcessVishal Patel
 

Was ist angesagt? (20)

Introduction To Predictive Analytics Part I
Introduction To Predictive Analytics   Part IIntroduction To Predictive Analytics   Part I
Introduction To Predictive Analytics Part I
 
Data Analytics and Business Intelligence
Data Analytics and Business IntelligenceData Analytics and Business Intelligence
Data Analytics and Business Intelligence
 
Business Analytics
 Business Analytics  Business Analytics
Business Analytics
 
Data analytics
Data analyticsData analytics
Data analytics
 
Data Modeling Fundamentals
Data Modeling FundamentalsData Modeling Fundamentals
Data Modeling Fundamentals
 
Data analytics
Data analyticsData analytics
Data analytics
 
Three Big Data Case Studies
Three Big Data Case StudiesThree Big Data Case Studies
Three Big Data Case Studies
 
Predictive Analytics - An Overview
Predictive Analytics - An OverviewPredictive Analytics - An Overview
Predictive Analytics - An Overview
 
Introduction to data analytics
Introduction to data analyticsIntroduction to data analytics
Introduction to data analytics
 
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
Big Data Analytics | What Is Big Data Analytics? | Big Data Analytics For Beg...
 
Data Mining and Business Intelligence Tools
Data Mining and Business Intelligence ToolsData Mining and Business Intelligence Tools
Data Mining and Business Intelligence Tools
 
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
Data Analytics For Beginners | Introduction To Data Analytics | Data Analytic...
 
Feature Engineering.pdf
Feature Engineering.pdfFeature Engineering.pdf
Feature Engineering.pdf
 
Big Data Architecture
Big Data ArchitectureBig Data Architecture
Big Data Architecture
 
Introduction to data science
Introduction to data scienceIntroduction to data science
Introduction to data science
 
Analytics Overview #Predictive Analytics
Analytics Overview #Predictive AnalyticsAnalytics Overview #Predictive Analytics
Analytics Overview #Predictive Analytics
 
Customer Churn Analysis and Prediction
Customer Churn Analysis and PredictionCustomer Churn Analysis and Prediction
Customer Churn Analysis and Prediction
 
Business Intelligence Trends 2020
Business Intelligence Trends 2020Business Intelligence Trends 2020
Business Intelligence Trends 2020
 
The Data Science Process
The Data Science ProcessThe Data Science Process
The Data Science Process
 
BIG DATA and USE CASES
BIG DATA and USE CASESBIG DATA and USE CASES
BIG DATA and USE CASES
 

Andere mochten auch

Internet of Things (IoT) protocols COAP MQTT OSCON2014
Internet of Things (IoT) protocols  COAP MQTT OSCON2014Internet of Things (IoT) protocols  COAP MQTT OSCON2014
Internet of Things (IoT) protocols COAP MQTT OSCON2014Vidhya Gholkar
 
Sap sap so h 2013
Sap sap so h 2013Sap sap so h 2013
Sap sap so h 2013deepersnet
 
CoAP Course for m2m and Internet of Things scenarios
CoAP Course for m2m and Internet of Things scenariosCoAP Course for m2m and Internet of Things scenarios
CoAP Course for m2m and Internet of Things scenarioscarlosralli
 
Blockchain in IoT and Other Considerations by Dinis Guarda
Blockchain in IoT and Other Considerations by Dinis GuardaBlockchain in IoT and Other Considerations by Dinis Guarda
Blockchain in IoT and Other Considerations by Dinis GuardaDinis Guarda
 
CBGTBT - Part 1 - Workshop introduction & primer
CBGTBT - Part 1 - Workshop introduction & primerCBGTBT - Part 1 - Workshop introduction & primer
CBGTBT - Part 1 - Workshop introduction & primerBlockstrap.com
 

Andere mochten auch (6)

Internet of Things (IoT) protocols COAP MQTT OSCON2014
Internet of Things (IoT) protocols  COAP MQTT OSCON2014Internet of Things (IoT) protocols  COAP MQTT OSCON2014
Internet of Things (IoT) protocols COAP MQTT OSCON2014
 
Sap sap so h 2013
Sap sap so h 2013Sap sap so h 2013
Sap sap so h 2013
 
CoAP Course for m2m and Internet of Things scenarios
CoAP Course for m2m and Internet of Things scenariosCoAP Course for m2m and Internet of Things scenarios
CoAP Course for m2m and Internet of Things scenarios
 
Blockchain in IoT and Other Considerations by Dinis Guarda
Blockchain in IoT and Other Considerations by Dinis GuardaBlockchain in IoT and Other Considerations by Dinis Guarda
Blockchain in IoT and Other Considerations by Dinis Guarda
 
Supply Chain Strategy
Supply Chain StrategySupply Chain Strategy
Supply Chain Strategy
 
CBGTBT - Part 1 - Workshop introduction & primer
CBGTBT - Part 1 - Workshop introduction & primerCBGTBT - Part 1 - Workshop introduction & primer
CBGTBT - Part 1 - Workshop introduction & primer
 

Ähnlich wie Predictive Analytics Use Cases

predictive analysis and usage in procurement ppt 2017
predictive analysis and usage in procurement  ppt 2017predictive analysis and usage in procurement  ppt 2017
predictive analysis and usage in procurement ppt 2017Prashant Bhatmule
 
Machine learning and big data
Machine learning and big dataMachine learning and big data
Machine learning and big dataPoo Kuan Hoong
 
Machine Learning and Industrie 4.0
Machine Learning and Industrie 4.0Machine Learning and Industrie 4.0
Machine Learning and Industrie 4.0Peter Schleinitz
 
351315535-Module-1-Intro-to-Data-Science-pptx.pptx
351315535-Module-1-Intro-to-Data-Science-pptx.pptx351315535-Module-1-Intro-to-Data-Science-pptx.pptx
351315535-Module-1-Intro-to-Data-Science-pptx.pptxXanGwaps
 
Big data analytics presented at meetup big data for decision makers
Big data analytics presented at meetup big data for decision makersBig data analytics presented at meetup big data for decision makers
Big data analytics presented at meetup big data for decision makersRuhollah Farchtchi
 
Data Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATAData Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATAjaved75
 
Advanced Analytics and Data Science Expertise
Advanced Analytics and Data Science ExpertiseAdvanced Analytics and Data Science Expertise
Advanced Analytics and Data Science ExpertiseSoftServe
 
Bigdata and Hadoop with applications
Bigdata and Hadoop with applicationsBigdata and Hadoop with applications
Bigdata and Hadoop with applicationsPadma Metta
 
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Geoffrey Fox
 
Data science and business analytics
Data  science and business analyticsData  science and business analytics
Data science and business analyticsInbavalli Valli
 
An Introduction to Advanced analytics and data mining
An Introduction to Advanced analytics and data miningAn Introduction to Advanced analytics and data mining
An Introduction to Advanced analytics and data miningBarry Leventhal
 
Mis jaiswal-chapter-08
Mis jaiswal-chapter-08Mis jaiswal-chapter-08
Mis jaiswal-chapter-08Amit Fogla
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeGeoffrey Fox
 
Modern Analytics And The Future Of Quality And Performance Excellence
Modern Analytics And The Future Of Quality And Performance ExcellenceModern Analytics And The Future Of Quality And Performance Excellence
Modern Analytics And The Future Of Quality And Performance ExcellenceICFAI Business School
 
Introductions to Business Analytics
Introductions to Business Analytics Introductions to Business Analytics
Introductions to Business Analytics Venkat .P
 

Ähnlich wie Predictive Analytics Use Cases (20)

predictive analysis and usage in procurement ppt 2017
predictive analysis and usage in procurement  ppt 2017predictive analysis and usage in procurement  ppt 2017
predictive analysis and usage in procurement ppt 2017
 
Machine learning and big data
Machine learning and big dataMachine learning and big data
Machine learning and big data
 
Big data Analytics
Big data AnalyticsBig data Analytics
Big data Analytics
 
Machine Learning and Industrie 4.0
Machine Learning and Industrie 4.0Machine Learning and Industrie 4.0
Machine Learning and Industrie 4.0
 
351315535-Module-1-Intro-to-Data-Science-pptx.pptx
351315535-Module-1-Intro-to-Data-Science-pptx.pptx351315535-Module-1-Intro-to-Data-Science-pptx.pptx
351315535-Module-1-Intro-to-Data-Science-pptx.pptx
 
Applications of Big Data
Applications of Big DataApplications of Big Data
Applications of Big Data
 
Big data analytics presented at meetup big data for decision makers
Big data analytics presented at meetup big data for decision makersBig data analytics presented at meetup big data for decision makers
Big data analytics presented at meetup big data for decision makers
 
Trends in data analytics
Trends in data analyticsTrends in data analytics
Trends in data analytics
 
Data Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATAData Science.pptx NEW COURICUUMN IN DATA
Data Science.pptx NEW COURICUUMN IN DATA
 
Advanced Analytics and Data Science Expertise
Advanced Analytics and Data Science ExpertiseAdvanced Analytics and Data Science Expertise
Advanced Analytics and Data Science Expertise
 
Bigdata and Hadoop with applications
Bigdata and Hadoop with applicationsBigdata and Hadoop with applications
Bigdata and Hadoop with applications
 
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
Multi-faceted Classification of Big Data Use Cases and Proposed Architecture ...
 
Data science and business analytics
Data  science and business analyticsData  science and business analytics
Data science and business analytics
 
00-01 DSnDA.pdf
00-01 DSnDA.pdf00-01 DSnDA.pdf
00-01 DSnDA.pdf
 
An Introduction to Advanced analytics and data mining
An Introduction to Advanced analytics and data miningAn Introduction to Advanced analytics and data mining
An Introduction to Advanced analytics and data mining
 
Mis jaiswal-chapter-08
Mis jaiswal-chapter-08Mis jaiswal-chapter-08
Mis jaiswal-chapter-08
 
High Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run TimeHigh Performance Data Analytics and a Java Grande Run Time
High Performance Data Analytics and a Java Grande Run Time
 
Lesson1.2.pptx.pdf
Lesson1.2.pptx.pdfLesson1.2.pptx.pdf
Lesson1.2.pptx.pdf
 
Modern Analytics And The Future Of Quality And Performance Excellence
Modern Analytics And The Future Of Quality And Performance ExcellenceModern Analytics And The Future Of Quality And Performance Excellence
Modern Analytics And The Future Of Quality And Performance Excellence
 
Introductions to Business Analytics
Introductions to Business Analytics Introductions to Business Analytics
Introductions to Business Analytics
 

Kürzlich hochgeladen

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptxAnupama Kate
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxolyaivanovalion
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 

Kürzlich hochgeladen (20)

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx100-Concepts-of-AI by Anupama Kate .pptx
100-Concepts-of-AI by Anupama Kate .pptx
 
BabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptxBabyOno dropshipping via API with DroFx.pptx
BabyOno dropshipping via API with DroFx.pptx
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 

Predictive Analytics Use Cases

  • 1. Predictive Analytics Context and Use Cases Kimberley Mitchell kim@who-knows.com http://mitchki.com @mitchki 3/5/2015
  • 2. Contents • Evolution of Analytics • Predictive Analytics: Definition – Predictive Analytics: Process – Traditional Predictive Analytics • 21st Century Transition: Big Data Content – Who is using big data? – Analytic process changes – Big Data Tools – Big Data Techniques • Predictive Analytics Use Cases • Use Cases: Optimizing Outcomes – Improve risk calculations – Optimize Energy Usage – Improve Customer Recommendations – Optimize policy interventions • Business Use Case: Increase Revenue – Kraft: Digital Marketing – Walgreens: Using data from loyalty program – Legendary Pictures: Applied Analytics Division – Mashable: Data Driven Digital Publishing • Business Use Case: Reduce Cost – Human Resources: Reducing Attrition – Manufacturing Process Improvement – Improve Customer Retention – Predictive Maintenance • Predictive Analytics Trends Forecast • Questions? 2
  • 3. Evolution of Analytics 3/5/2015 3 Seat of the pants: Intuition, domain knowledge BI (data driven): Visualizations, KPI’s, OLAP, Dashboards Predictive Analytics: Modeling, Experimental verification Real-time streaming analytics: Deep learning, A.I. Complexity Insights Future looking Past looking
  • 4. Predictive Analytics - Definition Predictive analytics is an area of data mining that deals with extracting information from data and using it to predict trends and behavior patterns. Often the unknown event of interest is in the future, but predictive analytics can be applied to any type of unknown whether it be in the past, present or future. -- Wikipedia
  • 5. Predictive Analytics Process Model Verification Theoretical Model Treatments + Controls + Error = Response Domain knowledge, 1st principles, experience Pre-existing data
  • 6. Traditional Predictive Analytics • Applications – Research and development in agriculture and industry • Designed experiments for product and process improvement – Verifying the effectiveness of healthcare treatments • Clinical trials (randomized double blind are best) – Predicting the weather – Polls: opinions, election politics, Nielson ratings. – Standardized tests: e.g. SAT’s to predict college success – Actuarial Science: life expectancy, etc. • Limitations – Each data point must be planned and collected intentionally – Expensive to design, collect the data
  • 7. 21st Century Transformation: Big Data Content 1. Internet behavior and logs – Every click and keystroke stored, server logs – Automatic byproduct of the Internet 2. Internet content – Text, emails, statuses, images, audio, video, etc. – Voluntary product of Internet use, automatically harvested 3. Internet of things – Sensors of all types: temperature, pressure, kinematics, images, audio, etc. – Must be consciously set up and designed 4. Access to thousands of sources of traditionally collected data – Open data movement (gov’t, academia)
  • 8. Who is using big data? “Big data is like teenage sex: everyone talks about it, nobody really knows how to do it, everyone thinks everyone else is doing it, so everyone claims they are doing it…”—Dan Ariely (2012) Even if this was true in 2012, it is no longer true; many organizations are successfully implementing modern predictive analytics with big data.
  • 9. Analytic process changes • Huge quantity and variety of data available with little effort/cost • Substantial extra effort needed to extract and process data that have been collected without a specific end use in mind (ETL) – Unstructured data images, videos, audio, etc. – Combination of a variety of data sources – Some creativity required to compose relevant metrics • Data availability leads to increased data mining, exploratory data analysis and data visualization requirements – Hundreds or even thousands of variables are “thrown in” to analyses – More “statistically significant” relationships found • Some new insights that are real • Some mistakes due to over-fitting, random clusters (some algorithms exist to help filter out false “hits”) – Domain knowledge / insights still crucial to distinguish real effects from random chance or allied effects
  • 10. Traditional Tools and Techniques • Traditional Tools – Mostly commercial – SAS, SPSS, STATA, Minitab, Excel – Open Source – R, Python • Traditional Techniques – Various flavors of regression analyses • Linear, multinomial, logistic, general linear models, regression discontinuity, instrumental variables, panel models, time series, etc. • Designed experiments, ANOVA – Controls built in for known covariates – Randomization to accommodate unknown covariates – Sometimes designed to extract maximum possible info from fewest observations 10
  • 11. Big Data Tools • Big Data Predictive Analytics Tools – Open Source • Apache Spark / MLlib • Python scripts interfaced to Hadoop or other distributed data source • Sometimes R or other traditional tools if data can be extracted and reduced to a size to fit on a single processor. • Mondrian (OLAP – more BI than predictive analytics) – Commercial / Proprietary • RapidMiner (Radoop for Hadoop analytics) • Pentaho – compare effectiveness of various models: operates against many data sources (being acquired by Hitachi) • HP Vertica Distributed R – ML algorithms pre-loaded • BeyondCore - 1st through 4th order interactions automatically calculated • SAS Enterprise Miner • SPSS adding big data analytics modules • IBM Predictive Maintenance and Quality • Homegrown solutions internal to companies (hand-coded Python or Java algorithms)
  • 12. Big Data Techniques • Big Data Predictive Analytics Techniques – Numerical responses / Categorical responses – Traditional Analytical Techniques – Linear, logistic regressions, GLM regressions, simulations – Distributed Analytics – i.e. MapReduce – Machine Learning Algorithms - Bayes network, decision trees, rule engines – A/B Testing – Designed randomized experiments to confirm model’s predictive power – More use of natural experiments (due to increased sharing of data sets) • Reference Books – Kuhn, M. and Johnson, K., Applied Predictive Modeling, Springer, 2013. (1) • Many worked examples in R with data available • Guide to understanding various models and how to apply them. • Johns Hopkins/Coursera MOOC on Practical Machine Learning uses this book – Leskovec, Rajaraman, and Ullman, Mining of Massive Datasets, Cambridge University Press, 2nd edition, 2014. (2) • Available commercially and also as a free pdf download from http://www.mmds.org/index2.html with associated materials. • Teaches the mathematical models and logical constructs (algorithms) underlying data mining and machine learning algorithms, including many exercises . • Stanford/Coursera MOOC on Mining Massive Datasets uses this book
  • 13. Predictive Analytics: Current Use Cases Directly related to the sponsoring organization’s goals 13
  • 14. Use Cases • What determines actual usage of analytics? – As always: organizational goals • Business / organizational goals drive initiatives 1. Improve outcomes (not obviously dollar related) • For non-profits: improve client outcomes – Fulfill the non-market based goal • Improve products, customer satisfaction – Provide the best product/experience and the customer will come 2. Increase profits by increasing revenue 3. Increase profits by reducing costs 14
  • 15. Use Cases: Optimize Products or Outcomes • More precise risk calculations in Auto Insurance (IoT) – Progressive - Snapshot measures risk of driving habits to offer lower rates • http://www.dailyfinance.com/2012/08/27/the-hidden-costs-of-cheap-car-insurance/ (3) • Optimize comfort and energy usage in commercial buildings – BuildingIQ - Predictive Energy Optimization™ uses advanced algorithms to automatically fine tune and control HVAC systems resulting in savings while maintaining or improving comfort. • http://www.buildingiq.com/ (4) • Improve Customer Recommendations (many examples) – Nordstrom - Scoring customer to product segment interaction. Output: Affinity scores of each customer to the product segment, For each segment, count of customers who have above average affinity scores compared to the general population in that segment. • https://ieondemand.com/divisions/analytics/events/275/presentations/multi-channel-analytics-at-nordstrom-data-labs (5) • Policy interventions – development economics (non-profit) – Data-Pop Alliance: Using cell phone data to predict socioeconomic levels in developing areas • https://ieondemand.com/divisions/analytics/events/275/presentations/big-data-for-development-technocratic-democratic- considerations (6) 15
  • 16. Improve risk calculations • Progressive Auto Insurance: Snapshot measures risk of driving habits to allow the offer of lower rates – Automated device tracks actual driver patterns for 30 days (IoT) – Discounts offered based on time of day, pattern of hard braking, amount of driving – Response: Much clearer picture of actual driving risks than self- reporting – http://www.dailyfinance.com/2012/08/27/the-hidden-costs-of- cheap-car-insurance
  • 17. Optimize Energy Usage • Optimize comfort and energy usage in commercial buildings (IoT) – BuildingIQ - Predictive Energy Optimization™ uses advanced algorithms to automatically fine tune and control HVAC systems resulting in savings while maintaining or improving comfort. – Interfaces with existing building energy management system and uses predictive analytics to reduce energy costs while maintaining comfort. • http://www.buildingiq.com/
  • 18. Optimize Energy Usage - BuildingIQ
  • 19. Improve Customer Recommendations • Many examples with “long tail” product offerings – Nordstrom - Scoring customer to product segment interaction. • Output: Affinity scores of each customer to the product segment • For each segment, count of customers who have above average affinity scores compared to the general population in that segment. • Probable tools: Clustering, recommendation engines • https://ieondemand.com/divisions/analytics/events/275/presentations/mult i-channel-analytics-at-nordstrom-data-labs
  • 20. Optimize policy interventions • Development economics (non-profit) – Data-Pop Alliance: ”Our mission statement and raison d'être is to promote a humanistic, people- centered ‘Big Data revolution’: one that fosters human development and social progress through the ethical use of personal data and empowerment of the poor and most vulnerable and avoids the pitfalls of a new digital divide, de-humanization and de-democratization.” – Using cell phone data to predict socioeconomic levels in developing areas – Traditionally difficult to measure level of economic well-being; involving expensive surveys and in-person travel – Since many developing societies are bypassing landlines and going directly to cell phone communications, quantities of cell phone units and traffic can be used to estimate changes in societal well being at a vastly reduced cost. • https://ieondemand.com/divisions/analytics/events/275/presentations/big-data-for-development- technocratic-democratic-considerations
  • 21. Business Use Case: Increase Revenue • Digital Marketing – Kraft: How Kraft uses Digital Marketing • http://www.clickz.com/clickz/news/2392789/how-kraft-harnessed-data-to-transform-marketing (7) – Walgreens: Using data from loyalty program to deliver value and drive customer loyalty • https://ieondemand.com/divisions/analytics/events/275/presentations/delivering-value-driving- best-customer-loyalty-through-pricing-promotions (8) – Legendary Entertainment: Inform Creative decisions (primarily concept evaluation and cast evaluation (social media) transform marketing • https://ieondemand.com/divisions/analytics/events/275/presentations/changing-hollywood- paradigms-with-analytics (9) – Mashable: Data Driven Digital Publishing - Mashable's proprietary Velocity platform predicts what's going viral next • https://ieondemand.com/divisions/analytics/events/275/presentations/data-driving-digital- publishing (10)
  • 22. Kraft: Digital Marketing • How Kraft uses Digital Marketing • Leveraging 18 years worth of customer connections, first from print, then from kraftrecipes.com • Using the combined content from kraftrecipes.com with their data management systems to develop insights about their customer base. • Slogan: “data is people” • Now recording more than 34,000 attributes from 100 million online visitors each year, forming 800 segments of customers to buy ads against • http://www.clickz.com/clickz/news/2392789/how-kraft-harnessed-data-to- transform-marketing
  • 23. Walgreens: Using data from loyalty program • Primarily “brick and mortar” stores; so how to get individual customer data for analytics? • Walgreens launched “Balance Rewards” in September 2012, to combine insights from a loyalty program with consumer behavior. – Loyalty programs are an illustration of how non-website data can be used for predictive analytics • Driving direct mail advertising, brick and mortar display positioning, inventory, etc. – Assortment: Carry products that meet the needs of targeted customer segments – Mass promotions: Drive loyalty with offers that attract and drive sales with targeted segments – Direct Marketing: Target “best” customers with “best” products for them. – Pricing: Understand customer pricing sensitivity. – Store Layout and Space Allocation: Align layout with the way customers shop. • https://ieondemand.com/divisions/analytics/events/275/presentations/delivering- value-driving-best-customer-loyalty-through-pricing-promotions
  • 24. Legendary Pictures: Applied Analytics Division • Legendary Pictures – Movies such as Interstellar, Godzilla, Dracula Untold, Man of Steel, Inception, Unbroken – Global scale and reach – Innovation factory, not a warehouse • Use of predictive analytics now a part of key strategies to inform creative decisions – Concept and cast evaluation • Green lighting concepts • Cast strengths and weaknesses • Build trailers on analytics • Proprietary, commercial-grade software constantly analyzes unique, extensive data across people, social media, and content. – 500 million emails, 250 million households – Entertainment data • Metadata on films since 1989 • Theatre data for 10 years • Social media data: sentiment, zeitgeist, topics • Transform marketing – Old fashioned: 4 quadrants: male/female/over 25/under 25; 4 groups of 80 million each – Targeting: 80 million groups of 4 – Aiming at “swing groups” not fans nor never interested “Mom” • https://ieondemand.com/divisio ns/analytics/events/275/present ations/changing-hollywood- paradigms-with-analytics 24
  • 25. Mashable: Data Driven Digital Publishing • For article placement, mashable.com needs to predict which items will go viral in real time! • Mashable's proprietary Velocity platform predicts what's going viral • Mashable Velocity: proprietary software: 130,000,000 urls, with 1,000,000 urls crawled per day • Saw an increase of 55% in web traffic over the first 2 years using the Velocity technology. • https://ieondemand.com/divisions/analytics/events/275/presentations/data-driving- digital-publishing 25
  • 26. Business Use Case: Reducing Cost • Human Resources: Reducing Attrition – Talent Analytics: Case Study: “Raw Talent Traits” Correlated to Attrition - Attrition Dropped by 30% Yielding Multimillion Dollar Cost Savings • http://www.talentanalytics.com/resources/case-studies/ (11) • Manufacturing Process Improvement – Intel - Using IoT and predictive analytics in chip manufacturing saved $3 million in core processor testing • http://newsroom.intel.com/community/apac_en/blog/2014/10/09/big-data-and-iot-in-manufacturing-in-everyday- life (12) • Improve Customer Retention – Time-Warner Cable: Using analytics to optimize price changes • https://ieondemand.com/divisions/analytics/events/275/presentations/using-analytics-to-optimize-subscription- price-changes (13) • Predictive Maintenance – How Predictive Analytics Improves Wind Turbine Maintenance • http://algoritmica.nl/how-predictive-analytics-improves-wind-turbine-maintenance/ (14)
  • 27. Human Resources: Reducing Attrition • Talent Analytics: Case Study: “Raw Talent Traits” Correlated to Attrition - Attrition Dropped by 30% Yielding Multimillion Dollar Cost Savings – CSR roles required a 12-week training plus passing a series 7 test before starting work – Voluntary attrition was greater than 60%; reduced to < 40% as a result of analytics intervention – Success linked to combining data sets; traditional HR metrics with proprietary survey to identify raw talent traits – http://www.talentanalytics.com/resources/case-studies/
  • 28. Manufacturing Process Improvement • Intel - Using IoT and predictive analytics in chip manufacturing – Process and product measurements taken upstream during the manufacturing process are able to predict whether the chip will pass final test leading to a reduction in final testing requirements. – "Instead of running every single chip through 19,000 tests, we can focus tests on specific chips to cut down test time.“ – This predictive analytics process, implemented on a single line of Intel Core processors in 2012, allowed Intel to save $3 million in manufacturing costs. In 2013-14, Intel expects to extend the process to more chip lines and save an additional $30 million, the company said. • http://www.informationweek.com/software/information-management/intel-cuts-manufacturing- costs-with-big-data/d/d-id/1109111?
  • 29. Improve Customer Retention • Time-Warner Cable: Using analytics to optimize price changes – Using analytics, predict time-oriented price sensitivity – Use promotion expiration as “natural experiments” due to regulatory prohibitions against actual experimentation with customers pricing – Measure churn response of “test” group against pre-selected similar “control” group • https://ieondemand.com/divisions/analytics/events/275/presentations/usin g-analytics-to-optimize-subscription-price-changes
  • 30. Predictive Maintenance • How Predictive Analytics Improves Wind Turbine Maintenance – http://algoritmica.nl/how-predictive-analytics-improves-wind-turbine-maintenance/
  • 31. Predictive Analytics Trends • Cost of analytics will be driven down by: – Standardization of IoT – Spread of expertise – Commercialization and widespread use of successful techniques – Development of “best practices” and user- friendly software to support them in specific key applications • Web and web-content data – Profusion of analytical tools will consolidate to a couple of winners – Most used and useful tools will be identified and commercialized – New applications will continue to target identifiable individuals in the “persuadable middle” • Internet of Things – Development of interface standards will lead to more commercially available tools as well as open source solutions • Healthcare analytics – Will increasingly use Big Data as privacy and regulatory issues resolved • Leading edge – Move from manual to dynamic model generation as understanding of relevant factors improves (especially in IoT, since things tend to have more consistent behaviors than people) – Analysis of streaming data becomes more mainstream 31
  • 32. Questions? Prediction is very difficult, especially if it’s about the future. -- Niels Bohr Kimberley Mitchell kim@who-knows.com http://mitchki.com @mitchki
  • 33. Appendix: References (1) Kuhn, M. and Johnson, K., Applied Predictive Modeling, Springer, 2013. (2) Leskovec, Rajaraman, and Ullman, Mining of Massive Datasets, Cambridge University Press, 2nd edition, 2014. (3) Hayes, Joy M. (2012, August 27) The Hidden Costs of Cheap Car Insurance. Retrieved from http://www.dailyfinance.com/2012/08/27/the-hidden-costs-of-cheap-car-insurance/ (4) BuildingIQ (2014). Predictive Energy Optimization. Retrieved from http://www.buildingiq.com/products/buildingiq-platform/ (5) Arun, Veettil (2014, November) Multi-Channel Analytics at Nordstrom Data Labs. Retrieved from https://ieondemand.com/divisions/analytics/events/275/presentations/multi-channel-analytics-at-nordstrom-data-labs (6) Letouzé, Emmanuel (2014, November) Big Data for Development: Technocratic & Democratic Considerations. Retrieved from https://ieondemand.com/divisions/analytics/events/275/presentations/big-data-for-development-technocratic-democratic- considerations 3/5/2015 33
  • 34. Appendix: References (page 2) (7) White, Melanie (2015, January 29) How Kraft Harnessed Data to Transform Marketing, Retrieved from http://www.clickz.com/clickz/news/2392789/how-kraft-harnessed-data-to-transform-marketing (8) Darer, Michael (2014, November) Delivering Value & Driving Best Customer Loyalty Through Pricing & Promotions, Retrieved from https://ieondemand.com/divisions/analytics/events/275/presentations/delivering- value-driving-best-customer-loyalty-through-pricing-promotions (9) Marolda, Matt (2014, November) Changing Hollywood Paradigms with Analytics. Retrieved from https://ieondemand.com/divisions/analytics/events/275/presentations/changing-hollywood-paradigms-with- analytics (10) Haile Owusu, Haile (2014, November) Data Driving Digital Publishing. Retrieved from ttps://ieondemand.com/divisions/analytics/events/275/presentations/data-driving-digital-publishing 3/5/2015 34
  • 35. Appendix: References (page 3) (11) Talent Analytics (2014) Case Study: “Raw Talent Traits” Correlated to Attrition. Retrieved from http://www.talentanalytics.com/resources/case-studies/ (12) Mandeville, John (2014, October 9) Big Data and IoT in Manufacturing “In Everyday Life”. Retrieved from http://newsroom.intel.com/community/apac_en/blog/2014/10/09/big-data-and- iot-in-manufacturing-in-everyday-life (13) Zyabkina, Tanya (2014, November) Using Analytics to Optimize Subscription Price Changes. Retrieved from https://ieondemand.com/divisions/analytics/events/275/presentations/using- analytics-to-optimize-subscription-price-changes (14) Mark-Jan Harte ( 2014, March5) How Predictive Analytics Improves Wind Turbine Maintenance. Retrieved from http://algoritmica.nl/how-predictive-analytics-improves-wind-turbine- maintenance/ 3/5/2015 35