SlideShare ist ein Scribd-Unternehmen logo
1 von 29
Downloaden Sie, um offline zu lesen
info@@scientificrevenue.com Pricing Without Compromise
Causal Inference, Reinforcement Learning,
and Continuous Optimization
Nucl.ai, 2016
info@@scientificrevenue.com Pricing Without Compromise
The Motivational Quote
This book does not take a
decision theoretic perspective ...
because the problem faced by
most economists or intending
economists does not seem
sensibly described as one of
decision. It seems more like
that of sensibly and concisely
reporting their findings .... this
leaves it up to others to use
your report as a basis for
decision making.
info@@scientificrevenue.com Pricing Without Compromise
Stated Another Way
….
info@@scientificrevenue.com Pricing Without Compromise
The First Escape: “A/B” Testing
It’s a compromise
You get engineering to insert some
bifurcated code (the test) into the system
Usually define all the variations in advance,
and then wait for a new version of the game
to be released
After that, someone looks at the test every
hour until statistical significance is achieved

You have a “winner” and go with it
info@@scientificrevenue.com Pricing Without Compromise
Multivariate Testing
A/ B quickly becomes multivariate – 4 or 5 arms is common
This is the most common evaluation methodology today
Problems:
• For most things worth testing, more arms elongates the
testing cycle
• Therefore, success requires a long-term test
• Therefore, ability to iterate is limited
During the Test:
• You’ve got potentially bad variations live!
info@@scientificrevenue.com Pricing Without Compromise
Runge et Al on Churn
Churn detection
algorith, worked
very well
No churn
prevention policy
worked well
against the
general
population
A/B Test
http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=6932875
info@@scientificrevenue.com Pricing Without Compromise
Levitt Et Al on Pricing
Had 4 arms and a control.
Ran for 3 months.
Results were inconclusive.
http://www.pnas.org/content/113/27/7323.full
info@@scientificrevenue.com Pricing Without Compromise
Next Step: Multi-Arm Bandits
Core idea: vary traffic to
arms of test based on
performance criteria
• At any given moment in
time, either “explore”
(focus on learning about
performance) or “exploit”
(use currently optimal
arm)
Very popular in advertising
realm. Huge and interesting
literature
info@@scientificrevenue.com Pricing Without Compromise
Multi-Arm Bandit Pros
Traffic to bad variations quickly
decreases (assuming a robust
performance metric)
Generally, helps you get to a “winner”
faster
It’s got an O’Reilly book, so you don’t
have to explain it to engineering
One major use case: use MAB to
eliminate “bad” arms, then multi-
variate test the rest
info@@scientificrevenue.com Pricing Without Compromise
Multi-Arm Bandit “Cons”
You’re not sending traffic to all the
arms at the same rate. Statistical
significance is very hard to achieve
Changing traffic volumes
introduces bias in experimental
populations (during analysis, you
could conceivably reweight using
propensity scores)
Markov assumptions underlying
standard reinforcement learning
theory are not fully valid
Defining the objective function
can be difficult
info@@scientificrevenue.com Pricing Without Compromise
Bias ?
Changing traffic volumes introduces bias in experimental populations
(during analysis, you could conceivably reweight using propensity
scores)?
Suppose you send 20% of the users to each of 5 arms. Then suppose
you send 40% of new users to the first arm, and 15% to each of the
remaining arms
The population going to the first arm has a lower percentage of
experienced users, and a higher percentage of people from certain
locales (depending on the time you alter the percentages)
Is it a big deal? Not as long as you’re aware of it. And use propensity
scores appropriately.
info@@scientificrevenue.com Pricing Without Compromise
But … Why Assume There's a Single Best Outcome?
You've parametrized multiple
behaviors
You're recording lots of user
features
You're already changing system
behavior at runtime
You’re running randomized trials
already
If you're really a bandit maven,
you've got a reserved population
already in place for ongoing
exploration
info@@scientificrevenue.com Pricing Without Compromise
Articulating the New Goal
Instead of thinking about “winners”
and “losers”
Instead of thinking about “better”
and “worse”
Think of a test arm as a population-
selecting function
Given an arm of a test, the
population it selects is the population
it is optimal for, under some
objective function
info@@scientificrevenue.com Pricing Without Compromise
Key Idea: Continuous Optimization Using a Control
Framework
If you have an objective function
And you have a control state
And you have multiple treatments
Then you should map the user to the treatment that maximizes
the objective function
• In realtime
• On a per-user basis
info@@scientificrevenue.com Pricing Without Compromise
The Analytical Two-Step
Run randomized trials
• Users randomly assigned to treatments
• Banditing has much more explore (and much less
exploit) than is usual
• Exploration is guided by models
After the trial, run a causal model builder
• Put your eyeballs at the end of the experiment and see if
you can figure out how you should have assigned the
users (to optimize the objective function)
• This is inherently a counterfactual exercise, and requires
causal inference
info@@scientificrevenue.com Pricing Without Compromise
Causal Inference Notation
Long history of “counterfactual” or “causal” reasoning – goes
back almost 100 years.
(Binary) Notation:
• 𝐷𝑖 -- whether user i received a treatment.
• 𝑌∗𝑖 -- the outcome for user i under treatment *
𝑌𝑖 = ቊ
𝑌1𝑖 𝑖𝑓 𝐷𝑖 = 1
𝑌0𝑖 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒
𝜏𝑖 = 𝑌1𝑖 − 𝑌0𝑖
info@@scientificrevenue.com Pricing Without Compromise
The Hard Part To Wrap Your Head Around
Note that 𝑌∗𝑖 and 𝜏𝑖 are unmeasurable in general (they’re not
observed) – users either get the treatment or they don’t. This is
the hard part to wrap your head around.
Hence the term “counterfactual”
info@@scientificrevenue.com Pricing Without Compromise
Causal Inference Notation II
E[𝑌𝑖 𝐷𝑖 = 1 − 𝐸 𝑌𝑖 𝐷𝑖 = 0] -- observed difference in outcome
E[𝑌1𝑖 𝐷𝑖 = 1 − 𝐸 𝑌0𝑖 𝐷𝑖 = 0] -- same thing
E[𝑌1𝑖 𝐷𝑖 = 1 − 𝐸 𝑌0𝑖 𝐷𝑖 = 1] + E[𝑌0𝑖 𝐷𝑖 = 1 − 𝐸 𝑌0𝑖 𝐷𝑖 = 0]
Treatment effect on treated Selection bias
(red is counterfactual and inserted for algebraic convenience)
info@@scientificrevenue.com Pricing Without Compromise
Causal Decision Trees
Idea:
Split leafs based on a MSE across all treatments
Standard penalization for complex trees (𝜆 ∗ # 𝑙𝑒𝑎𝑣𝑒𝑠)
Estimator:
Ƹ𝜏𝑖
𝐶𝑇
sample average treatment effect in leaf (with
propensity scores)
𝑌𝑖
∗
= ቊ
2 ∗ 𝑌𝑖 (𝐷𝑖 = 1)
−2 ∗ 𝑌𝑖 (𝐷𝑖 = 0)
−
1
𝑛
σ𝑖=1
𝑁
( Ƹ𝜏𝑖
𝐶𝑇
− 𝑌𝑖
∗
)^2 in-sample goodness of fit
info@@scientificrevenue.com Pricing Without Compromise
Why Do This
Compare to “two tree model”
Build two regression trees (control and treatment)
Predict outcome for given user on both trees
Choose the treatment with maximal value for a given user.
Causal DT separates “model construction” from treatment effect
estimation. Works well when there is a lot of hererogeneity
unrelated to treatment effects
Two trees work well when control outcomes are close to
constant – rare in real life
info@@scientificrevenue.com Pricing Without Compromise
Drilldown: Causal Random Forests
A Casual Decision Tree …. Overfits (just like a decision tree!)
A Causal Random Forest is just a bag of Causal Decision Trees
This example is adding two treatments to decision trees.
But the algebra is similar for m treatments, and for different
ML algorithms (which is what we use)
info@@scientificrevenue.com Pricing Without Compromise
The Analytical Two-Step (Revisited)
Run randomized trials
• Users randomly assigned to treatments
• Banditing has much more explore (and much less
exploit) than is usual
• Exploration is guided by models
After the trial, run a causal model builder
• Put your eyeballs at the end of the experiment and see if
you can figure out how you should have assigned the
users (to optimize the objective function)
• This is inherently a counterfactual exercise, and requires
causal inference
info@@scientificrevenue.com Pricing Without Compromise
What Does a Model Builder Produce
Fast segmenters (the primary goal of a model builder is to
provide a real-time segmentation algorithm whose segments
can be matched to treatments)
Proportional estimates. What percentage of traffic is going to
each treatment (thought of as a segment)
Estimates of improvement for each segment (the model should
predict the gain)
info@@scientificrevenue.com Pricing Without Compromise
Evaluation Via “Two-Armed Partition-Based Test”
Two arms: Control and Variation
• Control has “Before” (no treatment)
• Variation has entire model (all m treatments)
Partitions partition user space
• m treatments -> up to m disjoint segments in the
partition.
• Disjoint segments are each mapped to different
treatments
info@@scientificrevenue.com Pricing Without Compromise
info@@scientificrevenue.com Pricing Without Compromise
Revisiting Runge et Al
Runge et al built a model of churn prediction
This defines both the test population and the objective function
• Test population: Likely to churn
• (Very simpleminded) Objective function: (Number of
Days until Actually Churned) – (Predicted Number of
Days)
Covariates: they’ve collected a bunch
What should they do next?
Causal inference to see which strategies worked for whom
(using the covariates as features)
info@@scientificrevenue.com Pricing Without Compromise
This Works in Production ….
(SR Customer Dashboard, with identifying info removed)
info@@scientificrevenue.com Pricing Without Compromise
The Starting Points for Really Understanding This
info@@scientificrevenue.com Pricing Without Compromise
Thank You
William Grossobill@scientificrevenue.com
bill@scientificrevenue.com

Weitere ähnliche Inhalte

Was ist angesagt?

Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Roger Barga
 
1030 track 2 barrett_using our laptop
1030 track 2 barrett_using our laptop1030 track 2 barrett_using our laptop
1030 track 2 barrett_using our laptopRising Media, Inc.
 
1225 lunchlearn shekhar_using his mac
1225 lunchlearn shekhar_using his mac1225 lunchlearn shekhar_using his mac
1225 lunchlearn shekhar_using his macRising Media, Inc.
 
1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptopRising Media, Inc.
 
Introduction to MaxDiff Scaling of Importance - Parametric Marketing Slides
Introduction to MaxDiff Scaling of Importance - Parametric Marketing SlidesIntroduction to MaxDiff Scaling of Importance - Parametric Marketing Slides
Introduction to MaxDiff Scaling of Importance - Parametric Marketing SlidesQuestionPro
 
1120 track 3 prendki_using our laptop
1120 track 3 prendki_using our laptop1120 track 3 prendki_using our laptop
1120 track 3 prendki_using our laptopRising Media, Inc.
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2Roger Barga
 
Data Driven Engineering 2014
Data Driven Engineering 2014Data Driven Engineering 2014
Data Driven Engineering 2014Roger Barga
 
Barga ACM DEBS 2013 Keynote
Barga ACM DEBS 2013 KeynoteBarga ACM DEBS 2013 Keynote
Barga ACM DEBS 2013 KeynoteRoger Barga
 
Barga Data Science lecture 1
Barga Data Science lecture 1Barga Data Science lecture 1
Barga Data Science lecture 1Roger Barga
 
1555 track 2 ning_using our laptop
1555 track 2 ning_using our laptop1555 track 2 ning_using our laptop
1555 track 2 ning_using our laptopRising Media, Inc.
 
ML Drift - How to find issues before they become problems
ML Drift - How to find issues before they become problemsML Drift - How to find issues before they become problems
ML Drift - How to find issues before they become problemsAmy Hodler
 
Foundational Methodology for Data Science
Foundational Methodology for Data ScienceFoundational Methodology for Data Science
Foundational Methodology for Data ScienceJohn B. Rollins, Ph.D.
 
Barga DIDC'14 Invited Talk
Barga DIDC'14 Invited TalkBarga DIDC'14 Invited Talk
Barga DIDC'14 Invited TalkRoger Barga
 
Predictive analytics in action real-world examples and advice
Predictive analytics in action real-world examples and advicePredictive analytics in action real-world examples and advice
Predictive analytics in action real-world examples and adviceThe Marketing Distillery
 
Webinar Tutorial - A Beginners Guide To MaxDiff Scaling
Webinar Tutorial - A Beginners Guide To MaxDiff ScalingWebinar Tutorial - A Beginners Guide To MaxDiff Scaling
Webinar Tutorial - A Beginners Guide To MaxDiff ScalingQuestionPro
 
The Three Body Problem of Data Science
The Three Body Problem of Data ScienceThe Three Body Problem of Data Science
The Three Body Problem of Data ScienceDima Karamshuk
 
1645 track 2 ard_using our laptop
1645 track 2 ard_using our laptop1645 track 2 ard_using our laptop
1645 track 2 ard_using our laptopRising Media, Inc.
 

Was ist angesagt? (20)

Barga Galvanize Sept 2015
Barga Galvanize Sept 2015Barga Galvanize Sept 2015
Barga Galvanize Sept 2015
 
1030 track 2 barrett_using our laptop
1030 track 2 barrett_using our laptop1030 track 2 barrett_using our laptop
1030 track 2 barrett_using our laptop
 
1225 lunchlearn shekhar_using his mac
1225 lunchlearn shekhar_using his mac1225 lunchlearn shekhar_using his mac
1225 lunchlearn shekhar_using his mac
 
1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop1440 track 2 boire_using our laptop
1440 track 2 boire_using our laptop
 
7 steps to Predictive Analytics
7 steps to Predictive Analytics 7 steps to Predictive Analytics
7 steps to Predictive Analytics
 
Introduction to MaxDiff Scaling of Importance - Parametric Marketing Slides
Introduction to MaxDiff Scaling of Importance - Parametric Marketing SlidesIntroduction to MaxDiff Scaling of Importance - Parametric Marketing Slides
Introduction to MaxDiff Scaling of Importance - Parametric Marketing Slides
 
1120 track 3 prendki_using our laptop
1120 track 3 prendki_using our laptop1120 track 3 prendki_using our laptop
1120 track 3 prendki_using our laptop
 
Barga Data Science lecture 2
Barga Data Science lecture 2Barga Data Science lecture 2
Barga Data Science lecture 2
 
Data Driven Engineering 2014
Data Driven Engineering 2014Data Driven Engineering 2014
Data Driven Engineering 2014
 
Barga ACM DEBS 2013 Keynote
Barga ACM DEBS 2013 KeynoteBarga ACM DEBS 2013 Keynote
Barga ACM DEBS 2013 Keynote
 
Barga Data Science lecture 1
Barga Data Science lecture 1Barga Data Science lecture 1
Barga Data Science lecture 1
 
1555 track 2 ning_using our laptop
1555 track 2 ning_using our laptop1555 track 2 ning_using our laptop
1555 track 2 ning_using our laptop
 
ML Drift - How to find issues before they become problems
ML Drift - How to find issues before they become problemsML Drift - How to find issues before they become problems
ML Drift - How to find issues before they become problems
 
Foundational Methodology for Data Science
Foundational Methodology for Data ScienceFoundational Methodology for Data Science
Foundational Methodology for Data Science
 
Barga DIDC'14 Invited Talk
Barga DIDC'14 Invited TalkBarga DIDC'14 Invited Talk
Barga DIDC'14 Invited Talk
 
Predictive analytics in action real-world examples and advice
Predictive analytics in action real-world examples and advicePredictive analytics in action real-world examples and advice
Predictive analytics in action real-world examples and advice
 
Webinar Tutorial - A Beginners Guide To MaxDiff Scaling
Webinar Tutorial - A Beginners Guide To MaxDiff ScalingWebinar Tutorial - A Beginners Guide To MaxDiff Scaling
Webinar Tutorial - A Beginners Guide To MaxDiff Scaling
 
The Three Body Problem of Data Science
The Three Body Problem of Data ScienceThe Three Body Problem of Data Science
The Three Body Problem of Data Science
 
1645 track 2 ard_using our laptop
1645 track 2 ard_using our laptop1645 track 2 ard_using our laptop
1645 track 2 ard_using our laptop
 
10 best practices in operational analytics
10 best practices in operational analytics 10 best practices in operational analytics
10 best practices in operational analytics
 

Andere mochten auch

Emerging Best Practices in Dynamic Pricing
Emerging Best Practices in Dynamic Pricing Emerging Best Practices in Dynamic Pricing
Emerging Best Practices in Dynamic Pricing ScientificRevenue
 
On Annuity Design - Pocket Gamer London 2017
On Annuity Design  - Pocket Gamer London 2017On Annuity Design  - Pocket Gamer London 2017
On Annuity Design - Pocket Gamer London 2017ScientificRevenue
 
Scientific Revenue for SVIEF 2015
Scientific Revenue for SVIEF 2015Scientific Revenue for SVIEF 2015
Scientific Revenue for SVIEF 2015ScientificRevenue
 
10 Tips for Coin Store Design
10 Tips for Coin Store Design10 Tips for Coin Store Design
10 Tips for Coin Store DesignScientificRevenue
 
10 Tips for IAP Monetization Design.
10 Tips for IAP Monetization Design.10 Tips for IAP Monetization Design.
10 Tips for IAP Monetization Design.ScientificRevenue
 
From prediction to causation: Causal inference in online systems
From prediction to causation: Causal inference in online systemsFrom prediction to causation: Causal inference in online systems
From prediction to causation: Causal inference in online systemsAmit Sharma
 

Andere mochten auch (6)

Emerging Best Practices in Dynamic Pricing
Emerging Best Practices in Dynamic Pricing Emerging Best Practices in Dynamic Pricing
Emerging Best Practices in Dynamic Pricing
 
On Annuity Design - Pocket Gamer London 2017
On Annuity Design  - Pocket Gamer London 2017On Annuity Design  - Pocket Gamer London 2017
On Annuity Design - Pocket Gamer London 2017
 
Scientific Revenue for SVIEF 2015
Scientific Revenue for SVIEF 2015Scientific Revenue for SVIEF 2015
Scientific Revenue for SVIEF 2015
 
10 Tips for Coin Store Design
10 Tips for Coin Store Design10 Tips for Coin Store Design
10 Tips for Coin Store Design
 
10 Tips for IAP Monetization Design.
10 Tips for IAP Monetization Design.10 Tips for IAP Monetization Design.
10 Tips for IAP Monetization Design.
 
From prediction to causation: Causal inference in online systems
From prediction to causation: Causal inference in online systemsFrom prediction to causation: Causal inference in online systems
From prediction to causation: Causal inference in online systems
 

Ähnlich wie Causal Inference, Reinforcement Learning, and Continuous Optimization

How to understand and implement regression analysis
How to understand and implement regression analysisHow to understand and implement regression analysis
How to understand and implement regression analysisClaireWhittaker5
 
Dowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceDowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceAmit Sharma
 
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...Francesca Lazzeri, PhD
 
H2O World - Top 10 Data Science Pitfalls - Mark Landry
H2O World - Top 10 Data Science Pitfalls - Mark LandryH2O World - Top 10 Data Science Pitfalls - Mark Landry
H2O World - Top 10 Data Science Pitfalls - Mark LandrySri Ambati
 
Supercharge your AB testing with automated causal inference - Community Works...
Supercharge your AB testing with automated causal inference - Community Works...Supercharge your AB testing with automated causal inference - Community Works...
Supercharge your AB testing with automated causal inference - Community Works...Egor Kraev
 
The importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systemsThe importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systemsFrancesca Lazzeri, PhD
 
Data Analytics, Machine Learning, and HPC in Today’s Changing Application Env...
Data Analytics, Machine Learning, and HPC in Today’s Changing Application Env...Data Analytics, Machine Learning, and HPC in Today’s Changing Application Env...
Data Analytics, Machine Learning, and HPC in Today’s Changing Application Env...Intel® Software
 
Top 10 Data Science Practitioner Pitfalls
Top 10 Data Science Practitioner PitfallsTop 10 Data Science Practitioner Pitfalls
Top 10 Data Science Practitioner PitfallsSri Ambati
 
The Data Science Product Management Toolkit
The Data Science Product Management ToolkitThe Data Science Product Management Toolkit
The Data Science Product Management ToolkitJack Moore
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemMilind Gokhale
 
Testing 2 - Thinking Like A Tester
Testing 2 - Thinking Like A TesterTesting 2 - Thinking Like A Tester
Testing 2 - Thinking Like A TesterArleneAndrews2
 
Btm8107 8 week2 activity understanding and exploring assumptions a+ work
Btm8107 8 week2 activity understanding and exploring assumptions a+ workBtm8107 8 week2 activity understanding and exploring assumptions a+ work
Btm8107 8 week2 activity understanding and exploring assumptions a+ workcoursesexams1
 
Big Data Science - hype?
Big Data Science - hype?Big Data Science - hype?
Big Data Science - hype?BalaBit
 
Workbook Project
Workbook ProjectWorkbook Project
Workbook ProjectBrian Ryan
 
Virtual segment brief
Virtual segment briefVirtual segment brief
Virtual segment briefAhmed Shaaban
 
Machine Learning - Algorithms and simple business cases
Machine Learning - Algorithms and simple business casesMachine Learning - Algorithms and simple business cases
Machine Learning - Algorithms and simple business casesClaudio Mirti
 
End-to-End Machine Learning Project
End-to-End Machine Learning ProjectEnd-to-End Machine Learning Project
End-to-End Machine Learning ProjectEng Teong Cheah
 

Ähnlich wie Causal Inference, Reinforcement Learning, and Continuous Optimization (20)

How to understand and implement regression analysis
How to understand and implement regression analysisHow to understand and implement regression analysis
How to understand and implement regression analysis
 
Dowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceDowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inference
 
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
Spark + AI Summit - The Importance of Model Fairness and Interpretability in ...
 
H2O World - Top 10 Data Science Pitfalls - Mark Landry
H2O World - Top 10 Data Science Pitfalls - Mark LandryH2O World - Top 10 Data Science Pitfalls - Mark Landry
H2O World - Top 10 Data Science Pitfalls - Mark Landry
 
Supercharge your AB testing with automated causal inference - Community Works...
Supercharge your AB testing with automated causal inference - Community Works...Supercharge your AB testing with automated causal inference - Community Works...
Supercharge your AB testing with automated causal inference - Community Works...
 
The importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systemsThe importance of model fairness and interpretability in AI systems
The importance of model fairness and interpretability in AI systems
 
Data Analytics, Machine Learning, and HPC in Today’s Changing Application Env...
Data Analytics, Machine Learning, and HPC in Today’s Changing Application Env...Data Analytics, Machine Learning, and HPC in Today’s Changing Application Env...
Data Analytics, Machine Learning, and HPC in Today’s Changing Application Env...
 
Top 10 Data Science Practitioner Pitfalls
Top 10 Data Science Practitioner PitfallsTop 10 Data Science Practitioner Pitfalls
Top 10 Data Science Practitioner Pitfalls
 
The Data Science Product Management Toolkit
The Data Science Product Management ToolkitThe Data Science Product Management Toolkit
The Data Science Product Management Toolkit
 
Intro to ml_2021
Intro to ml_2021Intro to ml_2021
Intro to ml_2021
 
Collaborative Filtering Recommendation System
Collaborative Filtering Recommendation SystemCollaborative Filtering Recommendation System
Collaborative Filtering Recommendation System
 
Testing 2 - Thinking Like A Tester
Testing 2 - Thinking Like A TesterTesting 2 - Thinking Like A Tester
Testing 2 - Thinking Like A Tester
 
Top Machine Learning Algorithms Used By AI Professionals ARTiBA.pdf
Top Machine Learning Algorithms Used By AI Professionals ARTiBA.pdfTop Machine Learning Algorithms Used By AI Professionals ARTiBA.pdf
Top Machine Learning Algorithms Used By AI Professionals ARTiBA.pdf
 
Btm8107 8 week2 activity understanding and exploring assumptions a+ work
Btm8107 8 week2 activity understanding and exploring assumptions a+ workBtm8107 8 week2 activity understanding and exploring assumptions a+ work
Btm8107 8 week2 activity understanding and exploring assumptions a+ work
 
Big Data Science - hype?
Big Data Science - hype?Big Data Science - hype?
Big Data Science - hype?
 
Workbook Project
Workbook ProjectWorkbook Project
Workbook Project
 
Virtual segment brief
Virtual segment briefVirtual segment brief
Virtual segment brief
 
Comparison and evaluation of alternative designs
Comparison and evaluation of alternative designsComparison and evaluation of alternative designs
Comparison and evaluation of alternative designs
 
Machine Learning - Algorithms and simple business cases
Machine Learning - Algorithms and simple business casesMachine Learning - Algorithms and simple business cases
Machine Learning - Algorithms and simple business cases
 
End-to-End Machine Learning Project
End-to-End Machine Learning ProjectEnd-to-End Machine Learning Project
End-to-End Machine Learning Project
 

Kürzlich hochgeladen

LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGILLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGIThomas Poetter
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.natarajan8993
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesTimothy Spann
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一F La
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...ttt fff
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanMYRABACSAFRA2
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsVICTOR MAESTRE RAMIREZ
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxUnduhUnggah1
 
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一F sss
 

Kürzlich hochgeladen (20)

LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGILLMs, LMMs, their Improvement Suggestions and the Path towards AGI
LLMs, LMMs, their Improvement Suggestions and the Path towards AGI
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.RABBIT: A CLI tool for identifying bots based on their GitHub events.
RABBIT: A CLI tool for identifying bots based on their GitHub events.
 
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming PipelinesConf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
Conf42-LLM_Adding Generative AI to Real-Time Streaming Pipelines
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
办理(Vancouver毕业证书)加拿大温哥华岛大学毕业证成绩单原版一比一
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
毕业文凭制作#回国入职#diploma#degree美国加州州立大学北岭分校毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#de...
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
Identifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population MeanIdentifying Appropriate Test Statistics Involving Population Mean
Identifying Appropriate Test Statistics Involving Population Mean
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
Advanced Machine Learning for Business Professionals
Advanced Machine Learning for Business ProfessionalsAdvanced Machine Learning for Business Professionals
Advanced Machine Learning for Business Professionals
 
MK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docxMK KOMUNIKASI DATA (TI)komdat komdat.docx
MK KOMUNIKASI DATA (TI)komdat komdat.docx
 
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
办理学位证加利福尼亚大学洛杉矶分校毕业证,UCLA成绩单原版一比一
 

Causal Inference, Reinforcement Learning, and Continuous Optimization

  • 1. info@@scientificrevenue.com Pricing Without Compromise Causal Inference, Reinforcement Learning, and Continuous Optimization Nucl.ai, 2016
  • 2. info@@scientificrevenue.com Pricing Without Compromise The Motivational Quote This book does not take a decision theoretic perspective ... because the problem faced by most economists or intending economists does not seem sensibly described as one of decision. It seems more like that of sensibly and concisely reporting their findings .... this leaves it up to others to use your report as a basis for decision making.
  • 3. info@@scientificrevenue.com Pricing Without Compromise Stated Another Way ….
  • 4. info@@scientificrevenue.com Pricing Without Compromise The First Escape: “A/B” Testing It’s a compromise You get engineering to insert some bifurcated code (the test) into the system Usually define all the variations in advance, and then wait for a new version of the game to be released After that, someone looks at the test every hour until statistical significance is achieved  You have a “winner” and go with it
  • 5. info@@scientificrevenue.com Pricing Without Compromise Multivariate Testing A/ B quickly becomes multivariate – 4 or 5 arms is common This is the most common evaluation methodology today Problems: • For most things worth testing, more arms elongates the testing cycle • Therefore, success requires a long-term test • Therefore, ability to iterate is limited During the Test: • You’ve got potentially bad variations live!
  • 6. info@@scientificrevenue.com Pricing Without Compromise Runge et Al on Churn Churn detection algorith, worked very well No churn prevention policy worked well against the general population A/B Test http://ieeexplore.ieee.org/xpl/login.jsp?tp=&arnumber=6932875
  • 7. info@@scientificrevenue.com Pricing Without Compromise Levitt Et Al on Pricing Had 4 arms and a control. Ran for 3 months. Results were inconclusive. http://www.pnas.org/content/113/27/7323.full
  • 8. info@@scientificrevenue.com Pricing Without Compromise Next Step: Multi-Arm Bandits Core idea: vary traffic to arms of test based on performance criteria • At any given moment in time, either “explore” (focus on learning about performance) or “exploit” (use currently optimal arm) Very popular in advertising realm. Huge and interesting literature
  • 9. info@@scientificrevenue.com Pricing Without Compromise Multi-Arm Bandit Pros Traffic to bad variations quickly decreases (assuming a robust performance metric) Generally, helps you get to a “winner” faster It’s got an O’Reilly book, so you don’t have to explain it to engineering One major use case: use MAB to eliminate “bad” arms, then multi- variate test the rest
  • 10. info@@scientificrevenue.com Pricing Without Compromise Multi-Arm Bandit “Cons” You’re not sending traffic to all the arms at the same rate. Statistical significance is very hard to achieve Changing traffic volumes introduces bias in experimental populations (during analysis, you could conceivably reweight using propensity scores) Markov assumptions underlying standard reinforcement learning theory are not fully valid Defining the objective function can be difficult
  • 11. info@@scientificrevenue.com Pricing Without Compromise Bias ? Changing traffic volumes introduces bias in experimental populations (during analysis, you could conceivably reweight using propensity scores)? Suppose you send 20% of the users to each of 5 arms. Then suppose you send 40% of new users to the first arm, and 15% to each of the remaining arms The population going to the first arm has a lower percentage of experienced users, and a higher percentage of people from certain locales (depending on the time you alter the percentages) Is it a big deal? Not as long as you’re aware of it. And use propensity scores appropriately.
  • 12. info@@scientificrevenue.com Pricing Without Compromise But … Why Assume There's a Single Best Outcome? You've parametrized multiple behaviors You're recording lots of user features You're already changing system behavior at runtime You’re running randomized trials already If you're really a bandit maven, you've got a reserved population already in place for ongoing exploration
  • 13. info@@scientificrevenue.com Pricing Without Compromise Articulating the New Goal Instead of thinking about “winners” and “losers” Instead of thinking about “better” and “worse” Think of a test arm as a population- selecting function Given an arm of a test, the population it selects is the population it is optimal for, under some objective function
  • 14. info@@scientificrevenue.com Pricing Without Compromise Key Idea: Continuous Optimization Using a Control Framework If you have an objective function And you have a control state And you have multiple treatments Then you should map the user to the treatment that maximizes the objective function • In realtime • On a per-user basis
  • 15. info@@scientificrevenue.com Pricing Without Compromise The Analytical Two-Step Run randomized trials • Users randomly assigned to treatments • Banditing has much more explore (and much less exploit) than is usual • Exploration is guided by models After the trial, run a causal model builder • Put your eyeballs at the end of the experiment and see if you can figure out how you should have assigned the users (to optimize the objective function) • This is inherently a counterfactual exercise, and requires causal inference
  • 16. info@@scientificrevenue.com Pricing Without Compromise Causal Inference Notation Long history of “counterfactual” or “causal” reasoning – goes back almost 100 years. (Binary) Notation: • 𝐷𝑖 -- whether user i received a treatment. • 𝑌∗𝑖 -- the outcome for user i under treatment * 𝑌𝑖 = ቊ 𝑌1𝑖 𝑖𝑓 𝐷𝑖 = 1 𝑌0𝑖 𝑜𝑡ℎ𝑒𝑟𝑤𝑖𝑠𝑒 𝜏𝑖 = 𝑌1𝑖 − 𝑌0𝑖
  • 17. info@@scientificrevenue.com Pricing Without Compromise The Hard Part To Wrap Your Head Around Note that 𝑌∗𝑖 and 𝜏𝑖 are unmeasurable in general (they’re not observed) – users either get the treatment or they don’t. This is the hard part to wrap your head around. Hence the term “counterfactual”
  • 18. info@@scientificrevenue.com Pricing Without Compromise Causal Inference Notation II E[𝑌𝑖 𝐷𝑖 = 1 − 𝐸 𝑌𝑖 𝐷𝑖 = 0] -- observed difference in outcome E[𝑌1𝑖 𝐷𝑖 = 1 − 𝐸 𝑌0𝑖 𝐷𝑖 = 0] -- same thing E[𝑌1𝑖 𝐷𝑖 = 1 − 𝐸 𝑌0𝑖 𝐷𝑖 = 1] + E[𝑌0𝑖 𝐷𝑖 = 1 − 𝐸 𝑌0𝑖 𝐷𝑖 = 0] Treatment effect on treated Selection bias (red is counterfactual and inserted for algebraic convenience)
  • 19. info@@scientificrevenue.com Pricing Without Compromise Causal Decision Trees Idea: Split leafs based on a MSE across all treatments Standard penalization for complex trees (𝜆 ∗ # 𝑙𝑒𝑎𝑣𝑒𝑠) Estimator: Ƹ𝜏𝑖 𝐶𝑇 sample average treatment effect in leaf (with propensity scores) 𝑌𝑖 ∗ = ቊ 2 ∗ 𝑌𝑖 (𝐷𝑖 = 1) −2 ∗ 𝑌𝑖 (𝐷𝑖 = 0) − 1 𝑛 σ𝑖=1 𝑁 ( Ƹ𝜏𝑖 𝐶𝑇 − 𝑌𝑖 ∗ )^2 in-sample goodness of fit
  • 20. info@@scientificrevenue.com Pricing Without Compromise Why Do This Compare to “two tree model” Build two regression trees (control and treatment) Predict outcome for given user on both trees Choose the treatment with maximal value for a given user. Causal DT separates “model construction” from treatment effect estimation. Works well when there is a lot of hererogeneity unrelated to treatment effects Two trees work well when control outcomes are close to constant – rare in real life
  • 21. info@@scientificrevenue.com Pricing Without Compromise Drilldown: Causal Random Forests A Casual Decision Tree …. Overfits (just like a decision tree!) A Causal Random Forest is just a bag of Causal Decision Trees This example is adding two treatments to decision trees. But the algebra is similar for m treatments, and for different ML algorithms (which is what we use)
  • 22. info@@scientificrevenue.com Pricing Without Compromise The Analytical Two-Step (Revisited) Run randomized trials • Users randomly assigned to treatments • Banditing has much more explore (and much less exploit) than is usual • Exploration is guided by models After the trial, run a causal model builder • Put your eyeballs at the end of the experiment and see if you can figure out how you should have assigned the users (to optimize the objective function) • This is inherently a counterfactual exercise, and requires causal inference
  • 23. info@@scientificrevenue.com Pricing Without Compromise What Does a Model Builder Produce Fast segmenters (the primary goal of a model builder is to provide a real-time segmentation algorithm whose segments can be matched to treatments) Proportional estimates. What percentage of traffic is going to each treatment (thought of as a segment) Estimates of improvement for each segment (the model should predict the gain)
  • 24. info@@scientificrevenue.com Pricing Without Compromise Evaluation Via “Two-Armed Partition-Based Test” Two arms: Control and Variation • Control has “Before” (no treatment) • Variation has entire model (all m treatments) Partitions partition user space • m treatments -> up to m disjoint segments in the partition. • Disjoint segments are each mapped to different treatments
  • 26. info@@scientificrevenue.com Pricing Without Compromise Revisiting Runge et Al Runge et al built a model of churn prediction This defines both the test population and the objective function • Test population: Likely to churn • (Very simpleminded) Objective function: (Number of Days until Actually Churned) – (Predicted Number of Days) Covariates: they’ve collected a bunch What should they do next? Causal inference to see which strategies worked for whom (using the covariates as features)
  • 27. info@@scientificrevenue.com Pricing Without Compromise This Works in Production …. (SR Customer Dashboard, with identifying info removed)
  • 28. info@@scientificrevenue.com Pricing Without Compromise The Starting Points for Really Understanding This
  • 29. info@@scientificrevenue.com Pricing Without Compromise Thank You William Grossobill@scientificrevenue.com bill@scientificrevenue.com