Causal inference in online systems: Methods, pitfalls and best practices


From recommending what to buy and which movies to watch, to selecting the news to read, the people to follow and the jobs to apply for, online systems have become an important part of our daily lives. A natural question is how these socio-technical systems affect our behavior. However, because of the intricate interplay between the outputs of these systems and people's actions, identifying that impact is non-trivial. Fortunately, there is a rich body of work on causal inference that we can build on.

In the first part of the tutorial, I will show the value of counterfactual reasoning for studying socio-technical systems by demonstrating how predictive modeling based on correlations can be counterproductive. Then we will discuss different approaches to causal inference, including randomized experiments, natural experiments such as instrumental variables and regression discontinuities, and observational methods such as stratification and matching. Throughout, we will try to make connections with graphical models, machine learning and past work in the social sciences.

The second half will be more hands-on. We will work through a practical example of estimating the causal impact of a recommender system, moving from simple to more complex methods. The goal of the practical exercise is to appreciate the pitfalls in different approaches to causal reasoning and to take away best practices for doing causal inference with messy, real-world data. Code used is available at: https://github.com/amit-sharma/causal-inference-tutorial/


amshar@microsoft.com
http://www.github.com/amit-sharma/causal-inference-tutorial

Use these correlations to make a predictive model.
Future Activity ->
f(number of friends, logins in past month)


Old Algorithm (A) New Algorithm (B)
50/1000 (5%) 54/1000 (5.4%)

Low-activity users:
Old Algorithm (A)    New Algorithm (B)
10/400 (2.5%)        4/200 (2%)

High-activity users:
Old Algorithm (A)    New Algorithm (B)
40/600 (6.7%)        50/800 (6.25%)

[Figure: bar chart of CTR (%) by algorithm for low-activity and high-activity users]

Is Algorithm A better?

                              Old Algorithm (A)    New Algorithm (B)
CTR for low-activity users    10/400 (2.5%)        4/200 (2%)
CTR for high-activity users   40/600 (6.7%)        50/800 (6.25%)
Total CTR                     50/1000 (5%)         54/1000 (5.4%)
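The reversal in the table can be checked directly from the counts. The following is a minimal R sketch that uses only the click and impression counts shown above; the data frame and its column names are illustrative and are not taken from the tutorial's code.

```r
# Click and impression counts copied from the table above.
# Column names (algorithm, activity, clicks, shown) are illustrative.
ctr_counts <- data.frame(
  algorithm = c("A", "A", "B", "B"),
  activity  = c("low", "high", "low", "high"),
  clicks    = c(10, 40, 4, 50),
  shown     = c(400, 600, 200, 800)
)

# CTR within each activity stratum: A is higher in both strata.
ctr_counts$ctr <- ctr_counts$clicks / ctr_counts$shown

# Overall CTR per algorithm: pool clicks and impressions first, then divide.
totals <- aggregate(cbind(clicks, shown) ~ algorithm, data = ctr_counts, FUN = sum)
totals$ctr <- totals$clicks / totals$shown

# Share of each algorithm's traffic that comes from high-activity users.
high <- ctr_counts[ctr_counts$activity == "high", ]
high_share <- setNames(
  high$shown / totals$shown[match(high$algorithm, totals$algorithm)],
  high$algorithm
)

print(ctr_counts)
print(totals)
print(high_share)
```

Algorithm B is shown to high-activity users 80% of the time, against 60% for Algorithm A. Because high-activity users click more under either algorithm, this difference in traffic mix is enough to give B the higher pooled CTR even though A is higher within each stratum.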

Average comment length decreases over time.
But for each yearly cohort of users, comment length increases over time.

http://plato.stanford.edu/entries/causation-mani/

http://plato.stanford.edu/entries/causation-counterfactual/

Dunning (2002), Rosenzweig-Wolpin (2000)

Does new Algorithm B increase CTR for recommendations on Windows Store, compared to old Algorithm A?

$\mathrm{Propensity}(\mathrm{NewAlgo} \mid \mathrm{User}_i) = \mathrm{Logistic}(a_{cat_1}, a_{cat_2}, \ldots, a_{cat_n})$
Compare CTR between users with the same propensity score.
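The propensity-score idea above can be sketched in a few lines of R. This is an illustration on synthetic data, not the tutorial's implementation: the covariates `a_cat1` and `a_cat2` are assumed here to stand for a user's activity in two app categories, the coefficients are made up, and grouping by propensity quintiles is just one simple way to compare users with similar scores.

```r
set.seed(1)
n <- 5000

# Synthetic stand-in for per-user activity in two app categories (hypothetical covariates).
a_cat1 <- rnorm(n)
a_cat2 <- rnorm(n)

# More active users are more likely to be served the new algorithm (confounding by design).
new_algo <- rbinom(n, 1, plogis(0.8 * a_cat1 + 0.4 * a_cat2))

# Clicks depend on activity only; the simulated effect of the new algorithm is zero.
click <- rbinom(n, 1, plogis(-2 + 0.7 * a_cat1 + 0.3 * a_cat2))

users <- data.frame(a_cat1, a_cat2, new_algo, click)

# Step 1: propensity score from a logistic regression of treatment on covariates.
ps_model <- glm(new_algo ~ a_cat1 + a_cat2, family = binomial, data = users)
users$propensity <- fitted(ps_model)

# Step 2: group users with similar propensity scores (quintiles here).
breaks <- quantile(users$propensity, probs = seq(0, 1, by = 0.2))
users$ps_bin <- cut(users$propensity, breaks = breaks, include.lowest = TRUE)

# Step 3: compare CTR between algorithms within each bin, then average over bins.
ctr_by_bin <- tapply(users$click, list(users$ps_bin, users$new_algo), mean)
stratified_diff <- weighted.mean(ctr_by_bin[, "1"] - ctr_by_bin[, "0"],
                                 w = as.numeric(table(users$ps_bin)))

# Naive comparison that ignores the covariates.
naive_diff <- mean(users$click[users$new_algo == 1]) -
  mean(users$click[users$new_algo == 0])

print(c(naive = naive_diff, stratified = stratified_diff))
```

Because the simulated effect of the new algorithm is zero by construction, any gap in the naive comparison reflects confounding by activity, and the stratified comparison should sit closer to zero. With real data the true effect is unknown, and the estimate is only as good as the assumption that the covariates in the propensity model capture all confounders.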

[Figure: two panels, "Ego Network" (user u with friends f1 to f5) and "Non-Friends" (user u with non-friends n1 to n5)]

http://tylervigen.com/spurious-correlations


> nrow(user_app_visits_A)
[1] 1,000,000
> length(unique(user_app_visits_A$user_id))
[1] 10,000
> length(unique(user_app_visits_A$product_id))
[1] 990
> length(unique(user_app_visits_A$category))
[1] 10

> library(dplyr)
> user_app_visits_B = read.csv("user_app_visits_B.csv")
> naive_observational_estimate <- function(user_visits){
    # Naive observational estimate:
    # simply the fraction of visits that resulted in a recommendation click-through.
    est = summarise(user_visits, naive_estimate=sum(is_rec_visit)/length(is_rec_visit))
    return(est)
  }
> naive_observational_estimate(user_app_visits_A)
naive_estimate
[1] 0.200768
> naive_observational_estimate(user_app_visits_B)
naive_estimate
[1] 0.226467

> stratified_by_activity_estimate(user_app_visits_A)
Source: local data frame [4 x 2]
  activity_level stratified_estimate
1              1           0.1248852
2              2           0.1750483
3              3           0.2266394
4              4           0.2763522
> stratified_by_activity_estimate(user_app_visits_B)
Source: local data frame [4 x 2]
  activity_level stratified_estimate
1              1           0.1253469
2              2           0.1753933
3              3           0.2257211
4              4           0.2749867
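The body of `stratified_by_activity_estimate` is not shown on the slide. A minimal sketch of what such a function might look like is given below, assuming one row per app visit with an `activity_level` column and the 0/1 `is_rec_visit` column used by the naive estimate. The function name `stratified_by_activity_sketch` and the toy input are hypothetical, and the version in the tutorial's repository may differ.

```r
# Hypothetical re-implementation; the tutorial's own function may differ.
# Assumes one row per app visit with columns `activity_level` and `is_rec_visit` (0/1).
stratified_by_activity_sketch <- function(user_visits) {
  # Fraction of visits that are recommendation click-throughs, per activity level.
  est <- aggregate(is_rec_visit ~ activity_level, data = user_visits, FUN = mean)
  names(est)[names(est) == "is_rec_visit"] <- "stratified_estimate"
  est
}

# Tiny made-up input, only to show the shape of the result.
toy_visits <- data.frame(
  activity_level = c(1, 1, 1, 1, 2, 2, 2, 2),
  is_rec_visit   = c(0, 0, 0, 1, 0, 1, 1, 0)
)
toy_estimate <- stratified_by_activity_sketch(toy_visits)
print(toy_estimate)
```

The sketch uses base R `aggregate` so that it runs without extra packages; an equivalent `group_by` and `summarise` pipeline in dplyr would match the style of the naive estimate more closely.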

> stratified_by_category_estimate(user_app_visits_A)
Source: local data frame [10 x 2]
  category stratified_estimate
1        1           0.1758294
2        2           0.2276829
3        3           0.2763157
4        4           0.1239860
5        5           0.1767163
…        …                   …
> stratified_by_category_estimate(user_app_visits_B)
Source: local data frame [10 x 2]
  category stratified_estimate
1        1           0.2002127
2        2           0.2517528
3        3           0.3021371
4        4           0.1503150
5        5           0.1999519
…        …                   …
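To see how much the choice of stratification variable matters, the stratum-wise differences between B and A can be computed from the printed outputs. The sketch below only copies numbers already shown above; for categories it uses the five rows that are printed, and the elided rows are left out. Variable names are illustrative.

```r
# Stratified estimates copied from the printed outputs above.
# Only the first five categories are listed in the source; the rest are not guessed here.
activity_A <- c(0.1248852, 0.1750483, 0.2266394, 0.2763522)
activity_B <- c(0.1253469, 0.1753933, 0.2257211, 0.2749867)
category_A <- c(0.1758294, 0.2276829, 0.2763157, 0.1239860, 0.1767163)
category_B <- c(0.2002127, 0.2517528, 0.3021371, 0.1503150, 0.1999519)

# Difference B minus A within each stratum.
activity_diff <- activity_B - activity_A
category_diff <- category_B - category_A

# Difference between the two naive estimates, for comparison.
naive_diff <- 0.226467 - 0.200768

print(round(activity_diff, 4))
print(round(category_diff, 4))
print(round(naive_diff, 4))
```

Within activity levels the two algorithms differ by less than 0.002 in every stratum, while within the five listed categories B is higher by roughly 0.023 to 0.026, close to the naive gap of about 0.026. The two stratifications therefore suggest different conclusions, and the numbers alone do not say which variable is the relevant confounder.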

> naive_observational_estimate(user_app_visits_A)
naive_estimate
[1] 0.200768
> ranking_discontinuity_estimate(user_app_visits_A)
discontinuity_estimate
[1] 0.121362
About 40% of app visits that come from recommendation click-throughs are not causal:
they could have happened even without the recommendation system.
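The 40% figure follows from the two estimates printed above, read as the share of click-through visits that the discontinuity estimate does not attribute to the recommender. A short check in R, with illustrative variable names:

```r
# Estimates copied from the console output above.
naive_estimate         <- 0.200768  # fraction of visits that are recommendation click-throughs
discontinuity_estimate <- 0.121362  # fraction attributed to the recommender by the discontinuity estimate

# Share of recommendation click-throughs not attributed to the recommender.
non_causal_share <- 1 - discontinuity_estimate / naive_estimate
print(round(non_causal_share, 3))
```

The result is about 0.396, which the slide rounds to 40%.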

