SlideShare a Scribd company logo
1 of 48
Estimating the causal
impact of
recommendation systems
AMIT SHARMA, JAKE HOFMAN, DUNCAN WATTS
MICROSOFT RESEARCH, NEW YORK
1
2nd International Conference on
Computational Social Science
How much do they
change user behavior?
2
Naively, up to 30% of traffic
comes from recommendations
3
Naively, up to 30% of traffic
comes from recommendations
“Burton Snowboard, a sports retailer, reported
that personalized product recommendations
have driven nearly 25% of total sales since it
began offering them in 2008. Prior to this,
Burton’s customer recommendations consisted
of items from its list of top-selling products.”
4
Almost surely an over-estimate of the
actual effect, because of correlated
demand between a product and its
recommendations.
Example: product browsing on
Amazon.com
Example: product browsing on
Amazon.com
Example: product browsing on
Amazon.com
Counterfactual browsing: no
recommendations
Counterfactual browsing: no
recommendations
Problem: Correlated demand may
drive page visits, even without
recommendations
The problem of correlated
demand
Demand
for winter
accessories
Visits to
winter hat
Rec. visits
to winter
gloves
12
Goal: Estimate the extra activity caused
by a recommender system that would
not have happened otherwise
Causal
Convenience
OBSERVED CLICK-THROUGHS WITHOUT RECOMMENDER
Convenience
?
13
Ideal experiment: A/B Test
Treatment (A) Control (B)
But, experiments:
may be costly
hamper user experience
require full access to the system
14
Using natural variations to
simulate an experiment
16
Studying sudden spikes,
“shocks” to demand for a book
[Carmi et al. 2012]
17
The same author’s recommended
book may also have a shock
18
Past work: Controlling for
correlated demand
Uses statistical models to control for confounds
Carmi et al. [2012], Oestreicher and Sundararajan [2012] and Lin [2013]
construct “complementary sets” of similar, non-recommended
products.
Garfinkel et. al. [2006] and Broder et al. [2015] compare to model-
predicted clicks without recommendations.
But,
1. These assumptions are hard to verify.
2. Finding examples of valid shocks requires ingenuity
and restricts researchers to very specific categories
19
Shock-IV: A simpler, more robust
method for estimating causal impact.
Distinguishing between
recommendation and direct traffic
All visits to a
product
Recommender
visits
Direct visits
Search visits
Direct
browsing
Proxy for unobserved demand
21
The Shock-IV strategy:
Searching for valid shocks
? ?
22
The Shock-IV strategy: Filtering
out invalid shocks
23
Search for products that receive a
sudden shock in their traffic but direct
traffic for their recommendations
remains constant.
Why does it work? Shock as an
instrumental variable
Demand
Focal
visits (X)
Rec.
visits (Y)
Sudden
Shock
Direct
visits (Y)
Computing the causal
estimate
Increase in
recommendation
clicks ( )
Causal CTR (
*Same as Wald estimator
for instrumental variables
Increase in
visits to focal
product ( )
The shock-IV strategy: In
equations
Application to Amazon.com,
using Bing toolbar logs
•
•
•
Sept 2013-May 2014
Recreating sequence of page
visits by a user
Recreating sequence of page
visits by a user
Timestamp URL
2014-01-20
09:04:10
http://www.amazon.com/s/ref=nb_sb_noss
_1?field-keywords=George%20saunders
2014-01-20
09:04:15
http://www.amazon.com/dp/0812984250/r
ef=sr_1_1
2014-01-20
09:05:01
http://www.amazon.com/dp/1573225797/r
ef=pd_sim_b_2
Recreating sequence of page
visits by a user
Timestamp URL
2014-01-20
09:04:10
http://www.amazon.com/s/ref=nb_sb_no
ss_1?field-keywords=George%20saunders
2014-01-20
09:04:15
http://www.amazon.com/dp/0812984250/
ref=sr_1_1
2014-01-20
09:05:01
http://www.amazon.com/dp/1573225797/
ref=pd_sim_b_2
User searches for
George Saunders
User clicks on the first
search result
User clicks on the
second recommendation
I. Weekly and seasonal patterns in
traffic, nearly tripling in holidays
II. 30% of all pageviews come
through recommendations
III. Books and eBooks are the
most popular categories by far
IV. Apparel and shoes see a
substantially higher fraction of
visits through recommendations
Shock-IV: Finding shocks in
user visit data
We look for focal products with large and sudden
increases in views relative to typical traffic.
Size of shock exceeds:
◦ 5 times median traffic
◦ Shock exceeds 5 times the previous day's traffic and 5 times the
mean of the last 7 days.
Shocked product has:
◦ Visits from at least 10 unique users during the shock
◦ Non-zero visits for at least five out of seven days before and after
the shock
36
Shock-IV: Ensuring exclusion
restriction
Recommended product (Y) should have constant
direct visits during the time of the shock.
(1-β): Ratio of maximum 14-day variation in visits to a
recommended product to the size of the shock for the focal
product.
Direct traffic to Y is
stable relative to
the shock to the
focal product.
β = 1 Direct traffic to Y is
no less varying
than the shock to
focal product.
β = 0
37
How to choose 𝛽?
Accept
RejectSelect 𝛽 = 0.7
Using the method, obtain
>4000 natural experiments!
Estimating the causal
clickthrough rate (𝜌)
Causal click-through rate by
product category
Estimating fraction of observed
click-throughs that are causal
Compare the number of estimated causal clicks to
all observed recommendation clicks (non-shock
period).
43
Only a quarter of the observed
click-throughs are causal
At β = 0.7, only 25% of
recommendation traffic is
caused by the recommender.
Generalization?
Shocks may be due to
discounts or sales
Lower CTR may be due to
the holiday season
45
Local average treatment effect
(LATE), not fully generalizable
Shocked products are not a representative sample
of all products, nor are the users who participate in
them.
• Shock-IV method covers roughly one-fifth of all
products with at least 10 visits on any single day.
• Our results are robust to sale or holiday effects.
• Causal estimates are consistent with
experimental findings (e.g., Belluf et. al. [2012])
46
More generally…
A robust, scalable method for causal inference.
◦ Causal CTR for Amazon’s recommender system much less
than the naïve observational CTR.
◦ Can be applied to other domains, such as online ads.
Data mining for instruments
I. Allows us to study a much larger sample of
natural experiments, while being able to test for
exclusion restriction directly.
II. Can be used for finding potential instruments.
47
Thank you!
AMIT SHARMA
MICROSOFT RESEARCH
http://www.amitsharma.in
Sharma, A., Hofman, J. M., & Watts, D. J. (2015). Estimating the causal impact of
recommendation systems from observational data. In Proceedings of the Sixteenth ACM
Conference on Economics and Computation.
48
Shock-IV: A robust, scalable method for estimating
causal impact from observational data, with testable
assumptions.
Naïve observational estimates of CTR for
recommendation systems may be big overestimates.

More Related Content

What's hot

IRJET- Predicting Review Ratings for Product Marketing
IRJET- Predicting Review Ratings for Product MarketingIRJET- Predicting Review Ratings for Product Marketing
IRJET- Predicting Review Ratings for Product MarketingIRJET Journal
 
Predictability of popularity on online social media: Gaps between prediction ...
Predictability of popularity on online social media: Gaps between prediction ...Predictability of popularity on online social media: Gaps between prediction ...
Predictability of popularity on online social media: Gaps between prediction ...Amit Sharma
 
IRJET- Enhancing NLP Techniques for Fake Review Detection
IRJET- Enhancing NLP Techniques for Fake Review DetectionIRJET- Enhancing NLP Techniques for Fake Review Detection
IRJET- Enhancing NLP Techniques for Fake Review DetectionIRJET Journal
 
IRJET- Hybrid Book Recommendation System
IRJET- Hybrid Book Recommendation SystemIRJET- Hybrid Book Recommendation System
IRJET- Hybrid Book Recommendation SystemIRJET Journal
 
Digital analytics: Analytics problems (Lecture 9)
Digital analytics: Analytics problems (Lecture 9)Digital analytics: Analytics problems (Lecture 9)
Digital analytics: Analytics problems (Lecture 9)Joni Salminen
 
ICIS Rating Scales for Collective IntelligenceIcis idea rating-v1.0-final
ICIS Rating Scales for Collective IntelligenceIcis idea rating-v1.0-finalICIS Rating Scales for Collective IntelligenceIcis idea rating-v1.0-final
ICIS Rating Scales for Collective IntelligenceIcis idea rating-v1.0-finalriedlc
 
Learn how to do a conjoint analysis project in 1 hr
Learn how to do a conjoint analysis project in 1 hrLearn how to do a conjoint analysis project in 1 hr
Learn how to do a conjoint analysis project in 1 hrQuestionPro
 
Outbrain click prediction by vishalchangrani
Outbrain click prediction by vishalchangrani Outbrain click prediction by vishalchangrani
Outbrain click prediction by vishalchangrani Vishal Changrani
 
Digital analytics lecture4
Digital analytics lecture4Digital analytics lecture4
Digital analytics lecture4Joni Salminen
 
Hypothesis Testing: Overview
Hypothesis Testing: OverviewHypothesis Testing: Overview
Hypothesis Testing: OverviewMatt Hansen
 
Results Usersurvey 2005
Results Usersurvey 2005Results Usersurvey 2005
Results Usersurvey 2005annegrete
 
Planning Innovations: Conjoint Analysis Slides
Planning Innovations: Conjoint Analysis SlidesPlanning Innovations: Conjoint Analysis Slides
Planning Innovations: Conjoint Analysis SlidesQuestionPro
 

What's hot (13)

IRJET- Predicting Review Ratings for Product Marketing
IRJET- Predicting Review Ratings for Product MarketingIRJET- Predicting Review Ratings for Product Marketing
IRJET- Predicting Review Ratings for Product Marketing
 
Predictability of popularity on online social media: Gaps between prediction ...
Predictability of popularity on online social media: Gaps between prediction ...Predictability of popularity on online social media: Gaps between prediction ...
Predictability of popularity on online social media: Gaps between prediction ...
 
IRJET- Enhancing NLP Techniques for Fake Review Detection
IRJET- Enhancing NLP Techniques for Fake Review DetectionIRJET- Enhancing NLP Techniques for Fake Review Detection
IRJET- Enhancing NLP Techniques for Fake Review Detection
 
IRJET- Hybrid Book Recommendation System
IRJET- Hybrid Book Recommendation SystemIRJET- Hybrid Book Recommendation System
IRJET- Hybrid Book Recommendation System
 
Digital analytics: Analytics problems (Lecture 9)
Digital analytics: Analytics problems (Lecture 9)Digital analytics: Analytics problems (Lecture 9)
Digital analytics: Analytics problems (Lecture 9)
 
ICIS Rating Scales for Collective IntelligenceIcis idea rating-v1.0-final
ICIS Rating Scales for Collective IntelligenceIcis idea rating-v1.0-finalICIS Rating Scales for Collective IntelligenceIcis idea rating-v1.0-final
ICIS Rating Scales for Collective IntelligenceIcis idea rating-v1.0-final
 
Learn how to do a conjoint analysis project in 1 hr
Learn how to do a conjoint analysis project in 1 hrLearn how to do a conjoint analysis project in 1 hr
Learn how to do a conjoint analysis project in 1 hr
 
Outbrain click prediction by vishalchangrani
Outbrain click prediction by vishalchangrani Outbrain click prediction by vishalchangrani
Outbrain click prediction by vishalchangrani
 
Digital analytics lecture4
Digital analytics lecture4Digital analytics lecture4
Digital analytics lecture4
 
Hypothesis Testing: Overview
Hypothesis Testing: OverviewHypothesis Testing: Overview
Hypothesis Testing: Overview
 
Results Usersurvey 2005
Results Usersurvey 2005Results Usersurvey 2005
Results Usersurvey 2005
 
OutbrainFinalReport
OutbrainFinalReportOutbrainFinalReport
OutbrainFinalReport
 
Planning Innovations: Conjoint Analysis Slides
Planning Innovations: Conjoint Analysis SlidesPlanning Innovations: Conjoint Analysis Slides
Planning Innovations: Conjoint Analysis Slides
 

Viewers also liked

Handling Missing Attributes using Matrix Factorization 
Handling Missing Attributes using Matrix Factorization Handling Missing Attributes using Matrix Factorization 
Handling Missing Attributes using Matrix Factorization CS, NcState
 
Matrix Factorization Technique for Recommender Systems
Matrix Factorization Technique for Recommender SystemsMatrix Factorization Technique for Recommender Systems
Matrix Factorization Technique for Recommender SystemsAladejubelo Oluwashina
 
Recommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringRecommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringChangsung Moon
 
Summary of a Recommender Systems Survey paper
Summary of a Recommender Systems Survey paperSummary of a Recommender Systems Survey paper
Summary of a Recommender Systems Survey paperChangsung Moon
 
ESSIR 2013 Recommender Systems tutorial
ESSIR 2013 Recommender Systems tutorial ESSIR 2013 Recommender Systems tutorial
ESSIR 2013 Recommender Systems tutorial Alexandros Karatzoglou
 
User Personality and the New User Problem in a Context-Aware Point of Interes...
User Personality and the New User Problem in a Context-Aware Point of Interes...User Personality and the New User Problem in a Context-Aware Point of Interes...
User Personality and the New User Problem in a Context-Aware Point of Interes...University of Bergen
 
Matrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsMatrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsLei Guo
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Xavier Amatriain
 

Viewers also liked (8)

Handling Missing Attributes using Matrix Factorization 
Handling Missing Attributes using Matrix Factorization Handling Missing Attributes using Matrix Factorization 
Handling Missing Attributes using Matrix Factorization 
 
Matrix Factorization Technique for Recommender Systems
Matrix Factorization Technique for Recommender SystemsMatrix Factorization Technique for Recommender Systems
Matrix Factorization Technique for Recommender Systems
 
Recommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative FilteringRecommender Systems: Advances in Collaborative Filtering
Recommender Systems: Advances in Collaborative Filtering
 
Summary of a Recommender Systems Survey paper
Summary of a Recommender Systems Survey paperSummary of a Recommender Systems Survey paper
Summary of a Recommender Systems Survey paper
 
ESSIR 2013 Recommender Systems tutorial
ESSIR 2013 Recommender Systems tutorial ESSIR 2013 Recommender Systems tutorial
ESSIR 2013 Recommender Systems tutorial
 
User Personality and the New User Problem in a Context-Aware Point of Interes...
User Personality and the New User Problem in a Context-Aware Point of Interes...User Personality and the New User Problem in a Context-Aware Point of Interes...
User Personality and the New User Problem in a Context-Aware Point of Interes...
 
Matrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender SystemsMatrix Factorization Techniques For Recommender Systems
Matrix Factorization Techniques For Recommender Systems
 
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)Recommender Systems (Machine Learning Summer School 2014 @ CMU)
Recommender Systems (Machine Learning Summer School 2014 @ CMU)
 

Similar to Estimating Causal Impact of Recommendation Systems Using Natural Experiments

Hotspot ALPHA Camp_Setting Course with Metrics
Hotspot ALPHA Camp_Setting Course with MetricsHotspot ALPHA Camp_Setting Course with Metrics
Hotspot ALPHA Camp_Setting Course with MetricsALPHA Camp
 
Lean Startup Metrics & Analytics
Lean Startup Metrics & AnalyticsLean Startup Metrics & Analytics
Lean Startup Metrics & AnalyticsNicola Junior Vitto
 
230286802015PPT.pptx
230286802015PPT.pptx230286802015PPT.pptx
230286802015PPT.pptxannalakshmi35
 
Competitive Intelligence Analysis
Competitive Intelligence AnalysisCompetitive Intelligence Analysis
Competitive Intelligence AnalysisGeorge Giannakeas
 
eyeDemand "Demystifying RTB: Keys to a Successful Campaign"
eyeDemand "Demystifying RTB: Keys to a Successful Campaign"eyeDemand "Demystifying RTB: Keys to a Successful Campaign"
eyeDemand "Demystifying RTB: Keys to a Successful Campaign"IAB Canada
 
Serious SEM's Reputation Amplifier
Serious SEM's Reputation AmplifierSerious SEM's Reputation Amplifier
Serious SEM's Reputation AmplifierSerious SEM
 
Conversion Whitepaper
Conversion WhitepaperConversion Whitepaper
Conversion WhitepaperWSI Ensenada
 
A cutting edge behavioural approach to achieving your contact centre’s object...
A cutting edge behavioural approach to achieving your contact centre’s object...A cutting edge behavioural approach to achieving your contact centre’s object...
A cutting edge behavioural approach to achieving your contact centre’s object...Contact Centre Management Group
 
BrightEdge Share15 - CM205: Content Convergence: Search, Social & Content - M...
BrightEdge Share15 - CM205: Content Convergence: Search, Social & Content - M...BrightEdge Share15 - CM205: Content Convergence: Search, Social & Content - M...
BrightEdge Share15 - CM205: Content Convergence: Search, Social & Content - M...BrightEdge Technologies
 
Digital analytics: Optimization (Lecture 10)
Digital analytics: Optimization (Lecture 10)Digital analytics: Optimization (Lecture 10)
Digital analytics: Optimization (Lecture 10)Joni Salminen
 
[0324] impulse buying (study design)
[0324] impulse buying (study design)[0324] impulse buying (study design)
[0324] impulse buying (study design)ivaderivader
 
How to convert more patients online 7 16-14
How to convert more patients online 7 16-14How to convert more patients online 7 16-14
How to convert more patients online 7 16-14OptiCall
 
What Your Customers Really Do Online: 5 Ways to Remove the Guesswork
What Your Customers Really Do Online: 5 Ways to Remove the GuessworkWhat Your Customers Really Do Online: 5 Ways to Remove the Guesswork
What Your Customers Really Do Online: 5 Ways to Remove the GuessworkOptimizely
 
SMX Munich 2018 — Advanced Google Shopping Reporting
SMX Munich 2018 — Advanced Google Shopping ReportingSMX Munich 2018 — Advanced Google Shopping Reporting
SMX Munich 2018 — Advanced Google Shopping ReportingSmarter Ecommerce GmbH
 
Funnels Workshop Web Summit 2014 @geckoboard @GA
Funnels Workshop Web Summit 2014 @geckoboard @GAFunnels Workshop Web Summit 2014 @geckoboard @GA
Funnels Workshop Web Summit 2014 @geckoboard @GASofia Quintero
 
The American Attribution Index: Using Attribution Management to Improve Media...
The American Attribution Index: Using Attribution Management to Improve Media...The American Attribution Index: Using Attribution Management to Improve Media...
The American Attribution Index: Using Attribution Management to Improve Media...ClearSaleing
 
Action plan creator review
Action plan creator reviewAction plan creator review
Action plan creator reviewClint Cora
 

Similar to Estimating Causal Impact of Recommendation Systems Using Natural Experiments (20)

Hotspot ALPHA Camp_Setting Course with Metrics
Hotspot ALPHA Camp_Setting Course with MetricsHotspot ALPHA Camp_Setting Course with Metrics
Hotspot ALPHA Camp_Setting Course with Metrics
 
Lean Startup Metrics & Analytics
Lean Startup Metrics & AnalyticsLean Startup Metrics & Analytics
Lean Startup Metrics & Analytics
 
230286802015PPT.pptx
230286802015PPT.pptx230286802015PPT.pptx
230286802015PPT.pptx
 
Competitive Intelligence Analysis
Competitive Intelligence AnalysisCompetitive Intelligence Analysis
Competitive Intelligence Analysis
 
eyeDemand "Demystifying RTB: Keys to a Successful Campaign"
eyeDemand "Demystifying RTB: Keys to a Successful Campaign"eyeDemand "Demystifying RTB: Keys to a Successful Campaign"
eyeDemand "Demystifying RTB: Keys to a Successful Campaign"
 
Serious SEM's Reputation Amplifier
Serious SEM's Reputation AmplifierSerious SEM's Reputation Amplifier
Serious SEM's Reputation Amplifier
 
Conversion Whitepaper
Conversion WhitepaperConversion Whitepaper
Conversion Whitepaper
 
A cutting edge behavioural approach to achieving your contact centre’s object...
A cutting edge behavioural approach to achieving your contact centre’s object...A cutting edge behavioural approach to achieving your contact centre’s object...
A cutting edge behavioural approach to achieving your contact centre’s object...
 
Digital measurement & analytics
Digital measurement & analyticsDigital measurement & analytics
Digital measurement & analytics
 
Online Reviews: Trends, Innovations, Strategy
Online Reviews: Trends, Innovations, StrategyOnline Reviews: Trends, Innovations, Strategy
Online Reviews: Trends, Innovations, Strategy
 
BrightEdge Share15 - CM205: Content Convergence: Search, Social & Content - M...
BrightEdge Share15 - CM205: Content Convergence: Search, Social & Content - M...BrightEdge Share15 - CM205: Content Convergence: Search, Social & Content - M...
BrightEdge Share15 - CM205: Content Convergence: Search, Social & Content - M...
 
Digital analytics: Optimization (Lecture 10)
Digital analytics: Optimization (Lecture 10)Digital analytics: Optimization (Lecture 10)
Digital analytics: Optimization (Lecture 10)
 
[0324] impulse buying (study design)
[0324] impulse buying (study design)[0324] impulse buying (study design)
[0324] impulse buying (study design)
 
How to convert more patients online 7 16-14
How to convert more patients online 7 16-14How to convert more patients online 7 16-14
How to convert more patients online 7 16-14
 
What Your Customers Really Do Online: 5 Ways to Remove the Guesswork
What Your Customers Really Do Online: 5 Ways to Remove the GuessworkWhat Your Customers Really Do Online: 5 Ways to Remove the Guesswork
What Your Customers Really Do Online: 5 Ways to Remove the Guesswork
 
SMX Munich 2018 — Advanced Google Shopping Reporting
SMX Munich 2018 — Advanced Google Shopping ReportingSMX Munich 2018 — Advanced Google Shopping Reporting
SMX Munich 2018 — Advanced Google Shopping Reporting
 
Funnels Workshop Web Summit 2014 @geckoboard @GA
Funnels Workshop Web Summit 2014 @geckoboard @GAFunnels Workshop Web Summit 2014 @geckoboard @GA
Funnels Workshop Web Summit 2014 @geckoboard @GA
 
The American Attribution Index: Using Attribution Management to Improve Media...
The American Attribution Index: Using Attribution Management to Improve Media...The American Attribution Index: Using Attribution Management to Improve Media...
The American Attribution Index: Using Attribution Management to Improve Media...
 
Action plan creator review
Action plan creator reviewAction plan creator review
Action plan creator review
 
1305 track 3 siegel
1305 track 3 siegel1305 track 3 siegel
1305 track 3 siegel
 

More from Amit Sharma

Dowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceDowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceAmit Sharma
 
Alleviating Privacy Attacks Using Causal Models
Alleviating Privacy Attacks Using Causal ModelsAlleviating Privacy Attacks Using Causal Models
Alleviating Privacy Attacks Using Causal ModelsAmit Sharma
 
Artificial Intelligence for Societal Impact
Artificial Intelligence for Societal ImpactArtificial Intelligence for Societal Impact
Artificial Intelligence for Societal ImpactAmit Sharma
 
Causal inference in data science
Causal inference in data scienceCausal inference in data science
Causal inference in data scienceAmit Sharma
 
Causal inference in online systems: Methods, pitfalls and best practices
Causal inference in online systems: Methods, pitfalls and best practicesCausal inference in online systems: Methods, pitfalls and best practices
Causal inference in online systems: Methods, pitfalls and best practicesAmit Sharma
 
Equivalence causal frameworks: SEMs, Graphical models and Potential Outcomes
Equivalence causal frameworks: SEMs, Graphical models and Potential OutcomesEquivalence causal frameworks: SEMs, Graphical models and Potential Outcomes
Equivalence causal frameworks: SEMs, Graphical models and Potential OutcomesAmit Sharma
 
Estimating influence of online activity feeds on people's actions
Estimating influence of online activity feeds on people's actionsEstimating influence of online activity feeds on people's actions
Estimating influence of online activity feeds on people's actionsAmit Sharma
 
From prediction to causation: Causal inference in online systems
From prediction to causation: Causal inference in online systemsFrom prediction to causation: Causal inference in online systems
From prediction to causation: Causal inference in online systemsAmit Sharma
 
Causal inference in practice: Here, there, causality is everywhere
Causal inference in practice: Here, there, causality is everywhereCausal inference in practice: Here, there, causality is everywhere
Causal inference in practice: Here, there, causality is everywhereAmit Sharma
 
The interplay of personal preference and social influence in sharing networks...
The interplay of personal preference and social influence in sharing networks...The interplay of personal preference and social influence in sharing networks...
The interplay of personal preference and social influence in sharing networks...Amit Sharma
 
The role of social connections in shaping our preferences
The role of social connections in shaping our preferencesThe role of social connections in shaping our preferences
The role of social connections in shaping our preferencesAmit Sharma
 
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...Amit Sharma
 
RSWEB 2013: A research platform for social recommendation
RSWEB 2013: A research platform for social recommendationRSWEB 2013: A research platform for social recommendation
RSWEB 2013: A research platform for social recommendationAmit Sharma
 

More from Amit Sharma (13)

Dowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceDowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inference
 
Alleviating Privacy Attacks Using Causal Models
Alleviating Privacy Attacks Using Causal ModelsAlleviating Privacy Attacks Using Causal Models
Alleviating Privacy Attacks Using Causal Models
 
Artificial Intelligence for Societal Impact
Artificial Intelligence for Societal ImpactArtificial Intelligence for Societal Impact
Artificial Intelligence for Societal Impact
 
Causal inference in data science
Causal inference in data scienceCausal inference in data science
Causal inference in data science
 
Causal inference in online systems: Methods, pitfalls and best practices
Causal inference in online systems: Methods, pitfalls and best practicesCausal inference in online systems: Methods, pitfalls and best practices
Causal inference in online systems: Methods, pitfalls and best practices
 
Equivalence causal frameworks: SEMs, Graphical models and Potential Outcomes
Equivalence causal frameworks: SEMs, Graphical models and Potential OutcomesEquivalence causal frameworks: SEMs, Graphical models and Potential Outcomes
Equivalence causal frameworks: SEMs, Graphical models and Potential Outcomes
 
Estimating influence of online activity feeds on people's actions
Estimating influence of online activity feeds on people's actionsEstimating influence of online activity feeds on people's actions
Estimating influence of online activity feeds on people's actions
 
From prediction to causation: Causal inference in online systems
From prediction to causation: Causal inference in online systemsFrom prediction to causation: Causal inference in online systems
From prediction to causation: Causal inference in online systems
 
Causal inference in practice: Here, there, causality is everywhere
Causal inference in practice: Here, there, causality is everywhereCausal inference in practice: Here, there, causality is everywhere
Causal inference in practice: Here, there, causality is everywhere
 
The interplay of personal preference and social influence in sharing networks...
The interplay of personal preference and social influence in sharing networks...The interplay of personal preference and social influence in sharing networks...
The interplay of personal preference and social influence in sharing networks...
 
The role of social connections in shaping our preferences
The role of social connections in shaping our preferencesThe role of social connections in shaping our preferences
The role of social connections in shaping our preferences
 
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...
 
RSWEB 2013: A research platform for social recommendation
RSWEB 2013: A research platform for social recommendationRSWEB 2013: A research platform for social recommendation
RSWEB 2013: A research platform for social recommendation
 

Recently uploaded

Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queensdataanalyticsqueen03
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Cantervoginip
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort servicejennyeacort
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Boston Institute of Analytics
 
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024Timothy Spann
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Seán Kennedy
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 
Vision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxVision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxellehsormae
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024thyngster
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxaleedritatuxx
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPTBoston Institute of Analytics
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理e4aez8ss
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 

Recently uploaded (20)

Top 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In QueensTop 5 Best Data Analytics Courses In Queens
Top 5 Best Data Analytics Courses In Queens
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
ASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel CanterASML's Taxonomy Adventure by Daniel Canter
ASML's Taxonomy Adventure by Daniel Canter
 
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
9711147426✨Call In girls Gurgaon Sector 31. SCO 25 escort service
 
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
Data Analysis Project : Targeting the Right Customers, Presentation on Bank M...
 
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...Student profile product demonstration on grades, ability, well-being and mind...
Student profile product demonstration on grades, ability, well-being and mind...
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
Deep Generative Learning for All - The Gen AI Hype (Spring 2024)
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 
Vision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptxVision, Mission, Goals and Objectives ppt..pptx
Vision, Mission, Goals and Objectives ppt..pptx
 
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
Consent & Privacy Signals on Google *Pixels* - MeasureCamp Amsterdam 2024
 
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptxmodul pembelajaran robotic Workshop _ by Slidesgo.pptx
modul pembelajaran robotic Workshop _ by Slidesgo.pptx
 
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default  Presentation : Data Analysis Project PPTPredictive Analysis for Loan Default  Presentation : Data Analysis Project PPT
Predictive Analysis for Loan Default Presentation : Data Analysis Project PPT
 
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
科罗拉多大学波尔得分校毕业证学位证成绩单-可办理
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 

Estimating Causal Impact of Recommendation Systems Using Natural Experiments

  • 1. Estimating the causal impact of recommendation systems AMIT SHARMA, JAKE HOFMAN, DUNCAN WATTS MICROSOFT RESEARCH, NEW YORK 1 2nd International Conference on Computational Social Science
  • 2. How much do they change user behavior? 2
  • 3. Naively, up to 30% of traffic comes from recommendations 3
  • 4. Naively, up to 30% of traffic comes from recommendations “Burton Snowboard, a sports retailer, reported that personalized product recommendations have driven nearly 25% of total sales since it began offering them in 2008. Prior to this, Burton’s customer recommendations consisted of items from its list of top-selling products.” 4
  • 5. Almost surely an over-estimate of the actual effect, because of correlated demand between a product and its recommendations.
  • 11. Problem: Correlated demand may drive page visits, even without recommendations
  • 12. The problem of correlated demand Demand for winter accessories Visits to winter hat Rec. visits to winter gloves 12
  • 13. Goal: Estimate the extra activity caused by a recommender system that would not have happened otherwise Causal Convenience OBSERVED CLICK-THROUGHS WITHOUT RECOMMENDER Convenience ? 13
  • 14. Ideal experiment: A/B Test Treatment (A) Control (B) But, experiments: may be costly hamper user experience require full access to the system 14
  • 15.
  • 16. Using natural variations to simulate an experiment 16
  • 17. Studying sudden spikes, “shocks” to demand for a book [Carmi et al. 2012] 17
  • 18. The same author’s recommended book may also have a shock 18
  • 19. Past work: Controlling for correlated demand Uses statistical models to control for confounds Carmi et al. [2012], Oestreicher and Sundararajan [2012] and Lin [2013] construct “complementary sets” of similar, non-recommended products. Garfinkel et. al. [2006] and Broder et al. [2015] compare to model- predicted clicks without recommendations. But, 1. These assumptions are hard to verify. 2. Finding examples of valid shocks requires ingenuity and restricts researchers to very specific categories 19
  • 20. Shock-IV: A simpler, more robust method for estimating causal impact.
  • 21. Distinguishing between recommendation and direct traffic All visits to a product Recommender visits Direct visits Search visits Direct browsing Proxy for unobserved demand 21
  • 22. The Shock-IV strategy: Searching for valid shocks ? ? 22
  • 23. The Shock-IV strategy: Filtering out invalid shocks 23
  • 24. Search for products that receive a sudden shock in their traffic but direct traffic for their recommendations remains constant.
  • 25. Why does it work? Shock as an instrumental variable Demand Focal visits (X) Rec. visits (Y) Sudden Shock Direct visits (Y)
  • 26. Computing the causal estimate Increase in recommendation clicks ( ) Causal CTR ( *Same as Wald estimator for instrumental variables Increase in visits to focal product ( )
  • 27. The shock-IV strategy: In equations
  • 28. Application to Amazon.com, using Bing toolbar logs • • • Sept 2013-May 2014
  • 29. Recreating sequence of page visits by a user
  • 30. Recreating sequence of page visits by a user Timestamp URL 2014-01-20 09:04:10 http://www.amazon.com/s/ref=nb_sb_noss _1?field-keywords=George%20saunders 2014-01-20 09:04:15 http://www.amazon.com/dp/0812984250/r ef=sr_1_1 2014-01-20 09:05:01 http://www.amazon.com/dp/1573225797/r ef=pd_sim_b_2
  • 31. Recreating sequence of page visits by a user Timestamp URL 2014-01-20 09:04:10 http://www.amazon.com/s/ref=nb_sb_no ss_1?field-keywords=George%20saunders 2014-01-20 09:04:15 http://www.amazon.com/dp/0812984250/ ref=sr_1_1 2014-01-20 09:05:01 http://www.amazon.com/dp/1573225797/ ref=pd_sim_b_2 User searches for George Saunders User clicks on the first search result User clicks on the second recommendation
  • 32. I. Weekly and seasonal patterns in traffic, nearly tripling in holidays
  • 33. II. 30% of all pageviews come through recommendations
  • 34. III. Books and eBooks are the most popular categories by far
  • 35. IV. Apparel and shoes see a substantially higher fraction of visits through recommendations
  • 36. Shock-IV: Finding shocks in user visit data We look for focal products with large and sudden increases in views relative to typical traffic. Size of shock exceeds: ◦ 5 times median traffic ◦ Shock exceeds 5 times the previous day's traffic and 5 times the mean of the last 7 days. Shocked product has: ◦ Visits from at least 10 unique users during the shock ◦ Non-zero visits for at least five out of seven days before and after the shock 36
  • 37. Shock-IV: Ensuring exclusion restriction Recommended product (Y) should have constant direct visits during the time of the shock. (1-β): Ratio of maximum 14-day variation in visits to a recommended product to the size of the shock for the focal product. Direct traffic to Y is stable relative to the shock to the focal product. β = 1 Direct traffic to Y is no less varying than the shock to focal product. β = 0 37
  • 38. How to choose 𝛽? Accept RejectSelect 𝛽 = 0.7
  • 39. Using the method, obtain >4000 natural experiments!
  • 41. Causal click-through rate by product category
  • 42.
  • 43. Estimating fraction of observed click-throughs that are causal Compare the number of estimated causal clicks to all observed recommendation clicks (non-shock period). 43
  • 44. Only a quarter of the observed click-throughs are causal At β = 0.7, only 25% of recommendation traffic is caused by the recommender.
  • 45. Generalization? Shocks may be due to discounts or sales Lower CTR may be due to the holiday season 45
  • 46. Local average treatment effect (LATE), not fully generalizable Shocked products are not a representative sample of all products, nor are the users who participate in them. • Shock-IV method covers roughly one-fifth of all products with at least 10 visits on any single day. • Our results are robust to sale or holiday effects. • Causal estimates are consistent with experimental findings (e.g., Belluf et. al. [2012]) 46
  • 47. More generally… A robust, scalable method for causal inference. ◦ Causal CTR for Amazon’s recommender system much less than the naïve observational CTR. ◦ Can be applied to other domains, such as online ads. Data mining for instruments I. Allows us to study a much larger sample of natural experiments, while being able to test for exclusion restriction directly. II. Can be used for finding potential instruments. 47
  • 48. Thank you! AMIT SHARMA MICROSOFT RESEARCH http://www.amitsharma.in Sharma, A., Hofman, J. M., & Watts, D. J. (2015). Estimating the causal impact of recommendation systems from observational data. In Proceedings of the Sixteenth ACM Conference on Economics and Computation. 48 Shock-IV: A robust, scalable method for estimating causal impact from observational data, with testable assumptions. Naïve observational estimates of CTR for recommendation systems may be big overestimates.