SlideShare a Scribd company logo
1 of 21
EQUIVALENCE OF SEM,
POTENTIAL OUTCOMES AND
CAUSAL GRAPHICAL MODELS
AMIT SHARMA
POSTDOCTORAL RESEARCHER
MICROSOFT
http://www.amitsharma.in
@amt_shrma
WHAT IS CAUSALITY?
• Debatable, from the times of Aristotle and Hume.
• Practical definition:
• Interventionist causality: X causes Y if changing X leads to a
change in Y, keeping everything else constant.
CAUSALITY IS MEANINGLESS
WITHOUT A MODEL
• “Keeping everything else constant” requires knowing what
everything else is.
• Demand increases price is valid in most economies. So seems
a universal causal law.
• … except in a fully regulated economy.
• Model: Explicit specification of “everything else” that can
affect causal estimate.
WITHOUT A MODEL, EVEN EXPERIMENTS
DO NOT TELL YOU ANYTHING ABOUT
THE FUTURE
• A/B experiments study the past.
• Provide a counterfactual answer.
• But we want to use the results for the future.
• Model: The world stays the same between:
• When the experiment was run, and
• When its results will be applied.
HOW MIGHT WE SPECIFY A MODEL?
• By qualitative knowledge about how the world works.
Encouragement Effort Outcome
HOW MIGHT WE SPECIFY A MODEL?
• By writing equations about how the world works.
• E.g. F = ma
• Encouragement (Z)
• Effort (X)
• Outcome (Y)
𝑦 ∶= 𝛽𝑥 + 𝜖 𝑦
𝑥 ∶= 𝛾𝑧 + 𝜖 𝑥
HOW MIGHT WE SPECIFY A MODEL?
• By thinking about the different worlds that changing the
causal variable creates (inspired by a randomized
experiment).
• Effort (X)
• Outcome (Y)
𝑌𝑥1 𝑜𝑟 𝑌𝑥2 … .
• Encouragement (Z)
𝑌𝑥1𝑧1 𝑜𝑟 𝑌𝑥2𝑧1 𝑜𝑟 𝑌𝑥1𝑧2 𝑜𝑟 𝑌𝑥2𝑧2 … .
THREE MAJOR FRAMEWORKS FOR
SPECIFYING A CAUSAL MODEL
• Causal Graphical model
• Structural Equation Model
𝑦 ∶= 𝛽𝑥 + 𝜖 𝑦
𝑥 ∶= 𝛾𝑧 + 𝜖 𝑥
• Potential Outcomes Framework
𝑌𝑥1𝑧1 𝑜𝑟 𝑌𝑥2𝑧1 𝑜𝑟 𝑌𝑥1𝑧2 𝑜𝑟 𝑌𝑥2𝑧2 … .
Encouragement Effort Outcome
ALL THREE ARE EQUIVALENT
• A theorem in one is a theorem in another (See Pearl [2009]).
• So what’s the problem?
• Different disciplines prefer one over another.
• Misconceptions abound about the frameworks.
• In general, no unified causal inference course in major
universities.
A HISTORICAL TOUR OF CAUSALITY
• 1850s: John Snow uses a natural experiment to detect causal
connection between water and cholera.
• 1910s: Buoyed by triumphs in physics, Bertrand Russell
argues that causality is irrevelant.
• 1920s: Sewall and Philip Wright develop path diagrams and
simultaneous equation modelling (SEM) for determining
supply or demand from price and quantity.
• 1920s: Neyman uses potential outcomes to analyze
experiments.
• 1930s: Ronald Fisher popularizes the randomized
experiment.
A HISTORICAL TOUR OF CAUSALITY
• 1960s: Blalock and Duncan solve path diagrams using
regression equations.
• 1960-now?: Age of regression.
• Path diagrams lose their original causal interpretation.
• SEMs, Path diagrams and regression become entangled.
• 1970s: Rubin builds on potential outcomes framework.
• Becomes popular with social scientists.
• 1980s: Pearl builds on SEM framework.
• Starting to become popular with computer scientists.
EQUIVALENCE OF GRAPHICAL
MODELS AND SEM
Encouragement
(Z)
Effort
(X)
Outcome
(Y)
P(X, Y, Z ) = P(Y|X) P(X|Z) P(Z)
𝑦 ∶= 𝑓 𝑥, 𝜖 𝑦
𝑥 ∶= 𝑓(𝑧, 𝜖 𝑥)
EQUIVALENCE OF GRAPHICAL
MODELS AND SEM
Encouragement
(Z)
Effort
(X)
Outcome
(Y)
P(Y|do(X)) = P(Y|X)
𝑦 ∶= 𝑓 𝑥, 𝜖 𝑦
Effect =
𝑑𝑦
𝑑𝑥
EQUIVALENCE OF POTENTIAL
OUTCOMES AND SEM
Encouragement
(Z)
Effort
(X)
Outcome
(Y)
𝑦 ∶= 𝑓 𝑥, 𝜖 𝑦
𝑥 ∶= 𝑓(𝑧, 𝜖 𝑥)
𝑌𝑥1, 𝑌𝑥2, …
𝑋𝑧1, 𝑋𝑧2, …
𝑌𝑥1𝑧1, 𝑌𝑥1𝑧2, …
EQUIVALENCE OF POTENTIAL
OUTCOMES AND SEM
Encouragement
(Z)
Effort
(X)
Outcome
(Y)
Effect = 𝑌𝑥2 − 𝑌𝑥1
𝑦 ∶= 𝑓 𝑥, 𝜖 𝑦
Effect =
𝑑𝑦
𝑑𝑥
INSTRUMENTAL
VARIABLES IN
ALL THREE
FRAMEWORKS
IV BY GRAPHICAL MODEL
Encouragement
(Z)
Effort
(X)
Outcome
(Y)
Unobserved
Confounders
(U)
𝑃 𝑦 𝑑𝑜 𝑥 =
𝑢
𝑃 𝑦 𝑥, 𝑢 𝑃(𝑢)
Average Causal Effect = 𝑃 𝑦2 𝑑𝑜 𝑥2 − 𝑃 𝑦2 𝑑𝑜 𝑥1
=
𝑢
𝑃 𝑦2 𝑥2, 𝑢 − 𝑃(𝑦2|𝑥1, 𝑢) 𝑃(𝑢)
IV BY STRUCTURAL EQUATIONS
𝑦 ∶= 𝛽𝑥 + 𝜖 𝑦
𝑥 ∶= 𝛾𝑧 + 𝜖 𝑥
Local average Causal effect =
𝐶𝑜𝑣(𝑌, 𝑋)
𝐶𝑜𝑣(𝑋,𝑍)
IV IN POTENTIAL OUTCOMES
• Assumptions:
• 𝑌𝑥1𝑧1 = 𝑌𝑥1𝑧2
Local average Causal Effect =
𝐸 𝑌 𝑋2 – 𝐸 𝑌 𝑋1
𝐸 𝑋 𝑍2 −𝐸 𝑋 𝑍1
BEST PRACTICES
• A randomized experiment, or a problem with few variables:
• Use Potential outcomes framework: Simple and practical.
• An observational study with many confounders, or any
problem with many variables:
• Use graphical models to encode causal assumptions.
• If functional forms unknown, use graphical criteria or do-calculus
to estimate causal effect.
• Else, a domain where functional forms are known or can be
approximated
• Use structural equation models to solve for causal effects, based on the
causal graph.
THANK YOU!
Amit Sharma
http://www.amitsharma.in
@amt_shrma

More Related Content

What's hot

A review of net lift models
A review of net lift modelsA review of net lift models
A review of net lift modelsZixia Wang
 
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olap
Data Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olapData Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olap
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olapSalah Amean
 
Clustering for Stream and Parallelism (DATA ANALYTICS)
Clustering for Stream and Parallelism (DATA ANALYTICS)Clustering for Stream and Parallelism (DATA ANALYTICS)
Clustering for Stream and Parallelism (DATA ANALYTICS)DheerajPachauri
 
Big Data Analytics Tutorial | Big Data Analytics for Beginners | Hadoop Tutor...
Big Data Analytics Tutorial | Big Data Analytics for Beginners | Hadoop Tutor...Big Data Analytics Tutorial | Big Data Analytics for Beginners | Hadoop Tutor...
Big Data Analytics Tutorial | Big Data Analytics for Beginners | Hadoop Tutor...Edureka!
 
Data warehousing and online analytical processing
Data warehousing and online analytical processingData warehousing and online analytical processing
Data warehousing and online analytical processingVijayasankariS
 
Uplift Modeling: Optimize for Influence and Persuade by the Numbers
Uplift Modeling: Optimize for Influence and Persuade by the NumbersUplift Modeling: Optimize for Influence and Persuade by the Numbers
Uplift Modeling: Optimize for Influence and Persuade by the NumbersRising Media Ltd.
 
Web-Scale Graph Analytics with Apache Spark with Tim Hunter
Web-Scale Graph Analytics with Apache Spark with Tim HunterWeb-Scale Graph Analytics with Apache Spark with Tim Hunter
Web-Scale Graph Analytics with Apache Spark with Tim HunterDatabricks
 
Standards Metadata Management (system)
Standards Metadata Management (system)Standards Metadata Management (system)
Standards Metadata Management (system)Kevin Lee
 
Masket Basket Analysis
Masket Basket AnalysisMasket Basket Analysis
Masket Basket AnalysisMarc Berman
 
Workshop - Neo4j Graph Data Science
Workshop - Neo4j Graph Data ScienceWorkshop - Neo4j Graph Data Science
Workshop - Neo4j Graph Data ScienceNeo4j
 
Machine Learning by Analogy
Machine Learning by AnalogyMachine Learning by Analogy
Machine Learning by AnalogyColleen Farrelly
 
1. Data Analytics-introduction
1. Data Analytics-introduction1. Data Analytics-introduction
1. Data Analytics-introductionkrishna singh
 
Lecture6 introduction to data streams
Lecture6 introduction to data streamsLecture6 introduction to data streams
Lecture6 introduction to data streamshktripathy
 

What's hot (20)

DISE - Database Concepts
DISE - Database ConceptsDISE - Database Concepts
DISE - Database Concepts
 
01 intro
01 intro01 intro
01 intro
 
A review of net lift models
A review of net lift modelsA review of net lift models
A review of net lift models
 
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olap
Data Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olapData Mining:  Concepts and Techniques (3rd ed.)— Chapter _04 olap
Data Mining: Concepts and Techniques (3rd ed.) — Chapter _04 olap
 
Clustering for Stream and Parallelism (DATA ANALYTICS)
Clustering for Stream and Parallelism (DATA ANALYTICS)Clustering for Stream and Parallelism (DATA ANALYTICS)
Clustering for Stream and Parallelism (DATA ANALYTICS)
 
Big Data Analytics Tutorial | Big Data Analytics for Beginners | Hadoop Tutor...
Big Data Analytics Tutorial | Big Data Analytics for Beginners | Hadoop Tutor...Big Data Analytics Tutorial | Big Data Analytics for Beginners | Hadoop Tutor...
Big Data Analytics Tutorial | Big Data Analytics for Beginners | Hadoop Tutor...
 
Data warehousing and online analytical processing
Data warehousing and online analytical processingData warehousing and online analytical processing
Data warehousing and online analytical processing
 
04 olap
04 olap04 olap
04 olap
 
Uplift Modeling: Optimize for Influence and Persuade by the Numbers
Uplift Modeling: Optimize for Influence and Persuade by the NumbersUplift Modeling: Optimize for Influence and Persuade by the Numbers
Uplift Modeling: Optimize for Influence and Persuade by the Numbers
 
Web-Scale Graph Analytics with Apache Spark with Tim Hunter
Web-Scale Graph Analytics with Apache Spark with Tim HunterWeb-Scale Graph Analytics with Apache Spark with Tim Hunter
Web-Scale Graph Analytics with Apache Spark with Tim Hunter
 
R time series analysis
R   time series analysisR   time series analysis
R time series analysis
 
Standards Metadata Management (system)
Standards Metadata Management (system)Standards Metadata Management (system)
Standards Metadata Management (system)
 
Masket Basket Analysis
Masket Basket AnalysisMasket Basket Analysis
Masket Basket Analysis
 
Workshop - Neo4j Graph Data Science
Workshop - Neo4j Graph Data ScienceWorkshop - Neo4j Graph Data Science
Workshop - Neo4j Graph Data Science
 
Birch
BirchBirch
Birch
 
Machine Learning by Analogy
Machine Learning by AnalogyMachine Learning by Analogy
Machine Learning by Analogy
 
Curse of Dimensionality and Big Data
Curse of Dimensionality and Big DataCurse of Dimensionality and Big Data
Curse of Dimensionality and Big Data
 
NOSQL vs SQL
NOSQL vs SQLNOSQL vs SQL
NOSQL vs SQL
 
1. Data Analytics-introduction
1. Data Analytics-introduction1. Data Analytics-introduction
1. Data Analytics-introduction
 
Lecture6 introduction to data streams
Lecture6 introduction to data streamsLecture6 introduction to data streams
Lecture6 introduction to data streams
 

Viewers also liked

Lista de cotejo_de_reporte
Lista de cotejo_de_reporteLista de cotejo_de_reporte
Lista de cotejo_de_reportejavier_ot99
 
Questionare results 2
Questionare results 2Questionare results 2
Questionare results 2krystal11
 
ONCEWAS Winter 16 Lookbook to email
ONCEWAS Winter 16 Lookbook to emailONCEWAS Winter 16 Lookbook to email
ONCEWAS Winter 16 Lookbook to emailBelinda Abbott
 
Case Alert Adecco UK Ltd
Case Alert   Adecco UK LtdCase Alert   Adecco UK Ltd
Case Alert Adecco UK LtdGraham Brearley
 
MEN’S HAND KNIT SWEATER & TEXTURED HAT
MEN’S HAND KNIT SWEATER & TEXTURED HATMEN’S HAND KNIT SWEATER & TEXTURED HAT
MEN’S HAND KNIT SWEATER & TEXTURED HATGabriel Katinas
 
Lista de cotejo_para_mapa_mental_1
Lista de cotejo_para_mapa_mental_1Lista de cotejo_para_mapa_mental_1
Lista de cotejo_para_mapa_mental_1javier_ot99
 
How to mold children into leaders
How to mold children into leadersHow to mold children into leaders
How to mold children into leadersSachin Mittal
 
Живописный бухта Халонге осенью
Живописный бухта Халонге осеньюЖивописный бухта Халонге осенью
Живописный бухта Халонге осеньюTur Vietnam
 
Хой Ан - Белопесчаные пляжи и древнее культурное наследие!
Хой Ан - Белопесчаные пляжи и древнее культурное наследие!Хой Ан - Белопесчаные пляжи и древнее культурное наследие!
Хой Ан - Белопесчаные пляжи и древнее культурное наследие!Tur Vietnam
 
Avalinvest extras respostas
Avalinvest extras respostasAvalinvest extras respostas
Avalinvest extras respostasAdriano Bruni
 
Case study pack its
Case study pack itsCase study pack its
Case study pack itsAshuvyas2128
 
Suporte - monitores
Suporte - monitoresSuporte - monitores
Suporte - monitoresRoney Sousa
 
Apresentação: Onofre Alves Batista Júnior | XIX Congresso da Abradt
Apresentação: Onofre Alves Batista Júnior | XIX Congresso da AbradtApresentação: Onofre Alves Batista Júnior | XIX Congresso da Abradt
Apresentação: Onofre Alves Batista Júnior | XIX Congresso da AbradtABRADT
 
Tema 4, 4º ESO: La Edad Media, 1ª Parte
Tema 4, 4º ESO: La Edad Media, 1ª ParteTema 4, 4º ESO: La Edad Media, 1ª Parte
Tema 4, 4º ESO: La Edad Media, 1ª ParteVanessa Silvano Prieto
 
Fórmulas Condicionais em Excel
Fórmulas Condicionais em ExcelFórmulas Condicionais em Excel
Fórmulas Condicionais em ExcelDaniel Brandão
 
Gambling standard
Gambling standardGambling standard
Gambling standardTarun bhatt
 

Viewers also liked (18)

Lista de cotejo_de_reporte
Lista de cotejo_de_reporteLista de cotejo_de_reporte
Lista de cotejo_de_reporte
 
Questionare results 2
Questionare results 2Questionare results 2
Questionare results 2
 
ONCEWAS Winter 16 Lookbook to email
ONCEWAS Winter 16 Lookbook to emailONCEWAS Winter 16 Lookbook to email
ONCEWAS Winter 16 Lookbook to email
 
Registro de tutoria
Registro de tutoriaRegistro de tutoria
Registro de tutoria
 
Case Alert Adecco UK Ltd
Case Alert   Adecco UK LtdCase Alert   Adecco UK Ltd
Case Alert Adecco UK Ltd
 
MEN’S HAND KNIT SWEATER & TEXTURED HAT
MEN’S HAND KNIT SWEATER & TEXTURED HATMEN’S HAND KNIT SWEATER & TEXTURED HAT
MEN’S HAND KNIT SWEATER & TEXTURED HAT
 
Lista de cotejo_para_mapa_mental_1
Lista de cotejo_para_mapa_mental_1Lista de cotejo_para_mapa_mental_1
Lista de cotejo_para_mapa_mental_1
 
How to mold children into leaders
How to mold children into leadersHow to mold children into leaders
How to mold children into leaders
 
Живописный бухта Халонге осенью
Живописный бухта Халонге осеньюЖивописный бухта Халонге осенью
Живописный бухта Халонге осенью
 
Хой Ан - Белопесчаные пляжи и древнее культурное наследие!
Хой Ан - Белопесчаные пляжи и древнее культурное наследие!Хой Ан - Белопесчаные пляжи и древнее культурное наследие!
Хой Ан - Белопесчаные пляжи и древнее культурное наследие!
 
Avalinvest extras respostas
Avalinvest extras respostasAvalinvest extras respostas
Avalinvest extras respostas
 
Case study pack its
Case study pack itsCase study pack its
Case study pack its
 
Suporte - monitores
Suporte - monitoresSuporte - monitores
Suporte - monitores
 
PROJECT - PART 1
PROJECT - PART 1PROJECT - PART 1
PROJECT - PART 1
 
Apresentação: Onofre Alves Batista Júnior | XIX Congresso da Abradt
Apresentação: Onofre Alves Batista Júnior | XIX Congresso da AbradtApresentação: Onofre Alves Batista Júnior | XIX Congresso da Abradt
Apresentação: Onofre Alves Batista Júnior | XIX Congresso da Abradt
 
Tema 4, 4º ESO: La Edad Media, 1ª Parte
Tema 4, 4º ESO: La Edad Media, 1ª ParteTema 4, 4º ESO: La Edad Media, 1ª Parte
Tema 4, 4º ESO: La Edad Media, 1ª Parte
 
Fórmulas Condicionais em Excel
Fórmulas Condicionais em ExcelFórmulas Condicionais em Excel
Fórmulas Condicionais em Excel
 
Gambling standard
Gambling standardGambling standard
Gambling standard
 

Similar to Equivalence causal frameworks: SEMs, Graphical models and Potential Outcomes

The Intersection Axiom of Conditional Probability
The Intersection Axiom of Conditional ProbabilityThe Intersection Axiom of Conditional Probability
The Intersection Axiom of Conditional ProbabilityRichard Gill
 
Hunermund causal inference in ml and ai
Hunermund   causal inference in ml and aiHunermund   causal inference in ml and ai
Hunermund causal inference in ml and aiBoston Global Forum
 
The binomial distributions
The binomial distributionsThe binomial distributions
The binomial distributionsmaamir farooq
 
Chapter 1.1.pptx
Chapter 1.1.pptxChapter 1.1.pptx
Chapter 1.1.pptxlucasshen3
 
Alex Dimakis, Associate Professor, Dept. of Electrical and Computer Engineeri...
Alex Dimakis, Associate Professor, Dept. of Electrical and Computer Engineeri...Alex Dimakis, Associate Professor, Dept. of Electrical and Computer Engineeri...
Alex Dimakis, Associate Professor, Dept. of Electrical and Computer Engineeri...MLconf
 

Similar to Equivalence causal frameworks: SEMs, Graphical models and Potential Outcomes (9)

The Intersection Axiom of Conditional Probability
The Intersection Axiom of Conditional ProbabilityThe Intersection Axiom of Conditional Probability
The Intersection Axiom of Conditional Probability
 
Hunermund causal inference in ml and ai
Hunermund   causal inference in ml and aiHunermund   causal inference in ml and ai
Hunermund causal inference in ml and ai
 
The binomial distributions
The binomial distributionsThe binomial distributions
The binomial distributions
 
Day 3.pptx
Day 3.pptxDay 3.pptx
Day 3.pptx
 
Propositional Logic and Pridicate logic
Propositional Logic and Pridicate logicPropositional Logic and Pridicate logic
Propositional Logic and Pridicate logic
 
Chapter 1.1.pptx
Chapter 1.1.pptxChapter 1.1.pptx
Chapter 1.1.pptx
 
Causality detection
Causality detectionCausality detection
Causality detection
 
Probability theory
Probability theoryProbability theory
Probability theory
 
Alex Dimakis, Associate Professor, Dept. of Electrical and Computer Engineeri...
Alex Dimakis, Associate Professor, Dept. of Electrical and Computer Engineeri...Alex Dimakis, Associate Professor, Dept. of Electrical and Computer Engineeri...
Alex Dimakis, Associate Professor, Dept. of Electrical and Computer Engineeri...
 

More from Amit Sharma

Dowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceDowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceAmit Sharma
 
Alleviating Privacy Attacks Using Causal Models
Alleviating Privacy Attacks Using Causal ModelsAlleviating Privacy Attacks Using Causal Models
Alleviating Privacy Attacks Using Causal ModelsAmit Sharma
 
DoWhy Python library for causal inference: An End-to-End tool
DoWhy Python library for causal inference: An End-to-End toolDoWhy Python library for causal inference: An End-to-End tool
DoWhy Python library for causal inference: An End-to-End toolAmit Sharma
 
The Impact of Computing Systems | Causal inference in practice
The Impact of Computing Systems | Causal inference in practiceThe Impact of Computing Systems | Causal inference in practice
The Impact of Computing Systems | Causal inference in practiceAmit Sharma
 
Artificial Intelligence for Societal Impact
Artificial Intelligence for Societal ImpactArtificial Intelligence for Societal Impact
Artificial Intelligence for Societal ImpactAmit Sharma
 
Measuring effectiveness of machine learning systems
Measuring effectiveness of machine learning systemsMeasuring effectiveness of machine learning systems
Measuring effectiveness of machine learning systemsAmit Sharma
 
Causal data mining: Identifying causal effects at scale
Causal data mining: Identifying causal effects at scaleCausal data mining: Identifying causal effects at scale
Causal data mining: Identifying causal effects at scaleAmit Sharma
 
Auditing search engines for differential satisfaction across demographics
Auditing search engines for differential satisfaction across demographicsAuditing search engines for differential satisfaction across demographics
Auditing search engines for differential satisfaction across demographicsAmit Sharma
 
Causal inference in data science
Causal inference in data scienceCausal inference in data science
Causal inference in data scienceAmit Sharma
 
Causal inference in online systems: Methods, pitfalls and best practices
Causal inference in online systems: Methods, pitfalls and best practicesCausal inference in online systems: Methods, pitfalls and best practices
Causal inference in online systems: Methods, pitfalls and best practicesAmit Sharma
 
Estimating the causal impact of recommender systems
Estimating the causal impact of recommender systemsEstimating the causal impact of recommender systems
Estimating the causal impact of recommender systemsAmit Sharma
 
Predictability of popularity on online social media: Gaps between prediction ...
Predictability of popularity on online social media: Gaps between prediction ...Predictability of popularity on online social media: Gaps between prediction ...
Predictability of popularity on online social media: Gaps between prediction ...Amit Sharma
 
Data mining for causal inference: Effect of recommendations on Amazon.com
Data mining for causal inference: Effect of recommendations on Amazon.comData mining for causal inference: Effect of recommendations on Amazon.com
Data mining for causal inference: Effect of recommendations on Amazon.comAmit Sharma
 
Estimating influence of online activity feeds on people's actions
Estimating influence of online activity feeds on people's actionsEstimating influence of online activity feeds on people's actions
Estimating influence of online activity feeds on people's actionsAmit Sharma
 
From prediction to causation: Causal inference in online systems
From prediction to causation: Causal inference in online systemsFrom prediction to causation: Causal inference in online systems
From prediction to causation: Causal inference in online systemsAmit Sharma
 
Causal inference in practice
Causal inference in practiceCausal inference in practice
Causal inference in practiceAmit Sharma
 
Causal inference in practice: Here, there, causality is everywhere
Causal inference in practice: Here, there, causality is everywhereCausal inference in practice: Here, there, causality is everywhere
Causal inference in practice: Here, there, causality is everywhereAmit Sharma
 
The interplay of personal preference and social influence in sharing networks...
The interplay of personal preference and social influence in sharing networks...The interplay of personal preference and social influence in sharing networks...
The interplay of personal preference and social influence in sharing networks...Amit Sharma
 
The role of social connections in shaping our preferences
The role of social connections in shaping our preferencesThe role of social connections in shaping our preferences
The role of social connections in shaping our preferencesAmit Sharma
 
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...Amit Sharma
 

More from Amit Sharma (20)

Dowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inferenceDowhy: An end-to-end library for causal inference
Dowhy: An end-to-end library for causal inference
 
Alleviating Privacy Attacks Using Causal Models
Alleviating Privacy Attacks Using Causal ModelsAlleviating Privacy Attacks Using Causal Models
Alleviating Privacy Attacks Using Causal Models
 
DoWhy Python library for causal inference: An End-to-End tool
DoWhy Python library for causal inference: An End-to-End toolDoWhy Python library for causal inference: An End-to-End tool
DoWhy Python library for causal inference: An End-to-End tool
 
The Impact of Computing Systems | Causal inference in practice
The Impact of Computing Systems | Causal inference in practiceThe Impact of Computing Systems | Causal inference in practice
The Impact of Computing Systems | Causal inference in practice
 
Artificial Intelligence for Societal Impact
Artificial Intelligence for Societal ImpactArtificial Intelligence for Societal Impact
Artificial Intelligence for Societal Impact
 
Measuring effectiveness of machine learning systems
Measuring effectiveness of machine learning systemsMeasuring effectiveness of machine learning systems
Measuring effectiveness of machine learning systems
 
Causal data mining: Identifying causal effects at scale
Causal data mining: Identifying causal effects at scaleCausal data mining: Identifying causal effects at scale
Causal data mining: Identifying causal effects at scale
 
Auditing search engines for differential satisfaction across demographics
Auditing search engines for differential satisfaction across demographicsAuditing search engines for differential satisfaction across demographics
Auditing search engines for differential satisfaction across demographics
 
Causal inference in data science
Causal inference in data scienceCausal inference in data science
Causal inference in data science
 
Causal inference in online systems: Methods, pitfalls and best practices
Causal inference in online systems: Methods, pitfalls and best practicesCausal inference in online systems: Methods, pitfalls and best practices
Causal inference in online systems: Methods, pitfalls and best practices
 
Estimating the causal impact of recommender systems
Estimating the causal impact of recommender systemsEstimating the causal impact of recommender systems
Estimating the causal impact of recommender systems
 
Predictability of popularity on online social media: Gaps between prediction ...
Predictability of popularity on online social media: Gaps between prediction ...Predictability of popularity on online social media: Gaps between prediction ...
Predictability of popularity on online social media: Gaps between prediction ...
 
Data mining for causal inference: Effect of recommendations on Amazon.com
Data mining for causal inference: Effect of recommendations on Amazon.comData mining for causal inference: Effect of recommendations on Amazon.com
Data mining for causal inference: Effect of recommendations on Amazon.com
 
Estimating influence of online activity feeds on people's actions
Estimating influence of online activity feeds on people's actionsEstimating influence of online activity feeds on people's actions
Estimating influence of online activity feeds on people's actions
 
From prediction to causation: Causal inference in online systems
From prediction to causation: Causal inference in online systemsFrom prediction to causation: Causal inference in online systems
From prediction to causation: Causal inference in online systems
 
Causal inference in practice
Causal inference in practiceCausal inference in practice
Causal inference in practice
 
Causal inference in practice: Here, there, causality is everywhere
Causal inference in practice: Here, there, causality is everywhereCausal inference in practice: Here, there, causality is everywhere
Causal inference in practice: Here, there, causality is everywhere
 
The interplay of personal preference and social influence in sharing networks...
The interplay of personal preference and social influence in sharing networks...The interplay of personal preference and social influence in sharing networks...
The interplay of personal preference and social influence in sharing networks...
 
The role of social connections in shaping our preferences
The role of social connections in shaping our preferencesThe role of social connections in shaping our preferences
The role of social connections in shaping our preferences
 
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...
[RecSys '13]Pairwise Learning: Experiments with Community Recommendation on L...
 

Recently uploaded

Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our WorldEduminds Learning
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...Dr Arash Najmaei ( Phd., MBA, BSc)
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxHaritikaChhatwal1
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...Jack Cole
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxHimangsuNath
 
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfWorld Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfsimulationsindia
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelBoston Institute of Analytics
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Thomas Poetter
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxSimranPal17
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaManalVerma4
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max PrincetonTimothy Spann
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksdeepakthakur548787
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024Susanna-Assunta Sansone
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxTasha Penwell
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Boston Institute of Analytics
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfblazblazml
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Seán Kennedy
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxMike Bennett
 

Recently uploaded (20)

Learn How Data Science Changes Our World
Learn How Data Science Changes Our WorldLearn How Data Science Changes Our World
Learn How Data Science Changes Our World
 
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
6 Tips for Interpretable Topic Models _ by Nicha Ruchirawat _ Towards Data Sc...
 
SMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptxSMOTE and K-Fold Cross Validation-Presentation.pptx
SMOTE and K-Fold Cross Validation-Presentation.pptx
 
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
why-transparency-and-traceability-are-essential-for-sustainable-supply-chains...
 
Networking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptxNetworking Case Study prepared by teacher.pptx
Networking Case Study prepared by teacher.pptx
 
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdfWorld Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
World Economic Forum Metaverse Ecosystem By Utpal Chakraborty.pdf
 
Insurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis ProjectInsurance Churn Prediction Data Analysis Project
Insurance Churn Prediction Data Analysis Project
 
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis modelDecoding Movie Sentiments: Analyzing Reviews with Data Analysis model
Decoding Movie Sentiments: Analyzing Reviews with Data Analysis model
 
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
Minimizing AI Hallucinations/Confabulations and the Path towards AGI with Exa...
 
What To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptxWhat To Do For World Nature Conservation Day by Slidesgo.pptx
What To Do For World Nature Conservation Day by Slidesgo.pptx
 
IBEF report on the Insurance market in India
IBEF report on the Insurance market in IndiaIBEF report on the Insurance market in India
IBEF report on the Insurance market in India
 
Real-Time AI Streaming - AI Max Princeton
Real-Time AI  Streaming - AI Max PrincetonReal-Time AI  Streaming - AI Max Princeton
Real-Time AI Streaming - AI Max Princeton
 
Digital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing worksDigital Marketing Plan, how digital marketing works
Digital Marketing Plan, how digital marketing works
 
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
FAIR, FAIRsharing, FAIR Cookbook and ELIXIR - Sansone SA - Boston 2024
 
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptxThe Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
The Power of Data-Driven Storytelling_ Unveiling the Layers of Insight.pptx
 
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
Decoding the Heart: Student Presentation on Heart Attack Prediction with Data...
 
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdfEnglish-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
English-8-Q4-W3-Synthesizing-Essential-Information-From-Various-Sources-1.pdf
 
Data Analysis Project: Stroke Prediction
Data Analysis Project: Stroke PredictionData Analysis Project: Stroke Prediction
Data Analysis Project: Stroke Prediction
 
Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...Student Profile Sample report on improving academic performance by uniting gr...
Student Profile Sample report on improving academic performance by uniting gr...
 
Semantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptxSemantic Shed - Squashing and Squeezing.pptx
Semantic Shed - Squashing and Squeezing.pptx
 

Equivalence causal frameworks: SEMs, Graphical models and Potential Outcomes

  • 1. EQUIVALENCE OF SEM, POTENTIAL OUTCOMES AND CAUSAL GRAPHICAL MODELS AMIT SHARMA POSTDOCTORAL RESEARCHER MICROSOFT http://www.amitsharma.in @amt_shrma
  • 2. WHAT IS CAUSALITY? • Debatable, from the times of Aristotle and Hume. • Practical definition: • Interventionist causality: X causes Y if changing X leads to a change in Y, keeping everything else constant.
  • 3. CAUSALITY IS MEANINGLESS WITHOUT A MODEL • “Keeping everything else constant” requires knowing what everything else is. • Demand increases price is valid in most economies. So seems a universal causal law. • … except in a fully regulated economy. • Model: Explicit specification of “everything else” that can affect causal estimate.
  • 4. WITHOUT A MODEL, EVEN EXPERIMENTS DO NOT TELL YOU ANYTHING ABOUT THE FUTURE • A/B experiments study the past. • Provide a counterfactual answer. • But we want to use the results for the future. • Model: The world stays the same between: • When the experiment was run, and • When its results will be applied.
  • 5. HOW MIGHT WE SPECIFY A MODEL? • By qualitative knowledge about how the world works. Encouragement Effort Outcome
  • 6. HOW MIGHT WE SPECIFY A MODEL? • By writing equations about how the world works. • E.g. F = ma • Encouragement (Z) • Effort (X) • Outcome (Y) 𝑦 ∶= 𝛽𝑥 + 𝜖 𝑦 𝑥 ∶= 𝛾𝑧 + 𝜖 𝑥
  • 7. HOW MIGHT WE SPECIFY A MODEL? • By thinking about the different worlds that changing the causal variable creates (inspired by a randomized experiment). • Effort (X) • Outcome (Y) 𝑌𝑥1 𝑜𝑟 𝑌𝑥2 … . • Encouragement (Z) 𝑌𝑥1𝑧1 𝑜𝑟 𝑌𝑥2𝑧1 𝑜𝑟 𝑌𝑥1𝑧2 𝑜𝑟 𝑌𝑥2𝑧2 … .
  • 8. THREE MAJOR FRAMEWORKS FOR SPECIFYING A CAUSAL MODEL • Causal Graphical model • Structural Equation Model 𝑦 ∶= 𝛽𝑥 + 𝜖 𝑦 𝑥 ∶= 𝛾𝑧 + 𝜖 𝑥 • Potential Outcomes Framework 𝑌𝑥1𝑧1 𝑜𝑟 𝑌𝑥2𝑧1 𝑜𝑟 𝑌𝑥1𝑧2 𝑜𝑟 𝑌𝑥2𝑧2 … . Encouragement Effort Outcome
  • 9. ALL THREE ARE EQUIVALENT • A theorem in one is a theorem in another (See Pearl [2009]). • So what’s the problem? • Different disciplines prefer one over another. • Misconceptions abound about the frameworks. • In general, no unified causal inference course in major universities.
  • 10. A HISTORICAL TOUR OF CAUSALITY • 1850s: John Snow uses a natural experiment to detect causal connection between water and cholera. • 1910s: Buoyed by triumphs in physics, Bertrand Russell argues that causality is irrevelant. • 1920s: Sewall and Philip Wright develop path diagrams and simultaneous equation modelling (SEM) for determining supply or demand from price and quantity. • 1920s: Neyman uses potential outcomes to analyze experiments. • 1930s: Ronald Fisher popularizes the randomized experiment.
  • 11. A HISTORICAL TOUR OF CAUSALITY • 1960s: Blalock and Duncan solve path diagrams using regression equations. • 1960-now?: Age of regression. • Path diagrams lose their original causal interpretation. • SEMs, Path diagrams and regression become entangled. • 1970s: Rubin builds on potential outcomes framework. • Becomes popular with social scientists. • 1980s: Pearl builds on SEM framework. • Starting to become popular with computer scientists.
  • 12. EQUIVALENCE OF GRAPHICAL MODELS AND SEM Encouragement (Z) Effort (X) Outcome (Y) P(X, Y, Z ) = P(Y|X) P(X|Z) P(Z) 𝑦 ∶= 𝑓 𝑥, 𝜖 𝑦 𝑥 ∶= 𝑓(𝑧, 𝜖 𝑥)
  • 13. EQUIVALENCE OF GRAPHICAL MODELS AND SEM Encouragement (Z) Effort (X) Outcome (Y) P(Y|do(X)) = P(Y|X) 𝑦 ∶= 𝑓 𝑥, 𝜖 𝑦 Effect = 𝑑𝑦 𝑑𝑥
  • 14. EQUIVALENCE OF POTENTIAL OUTCOMES AND SEM Encouragement (Z) Effort (X) Outcome (Y) 𝑦 ∶= 𝑓 𝑥, 𝜖 𝑦 𝑥 ∶= 𝑓(𝑧, 𝜖 𝑥) 𝑌𝑥1, 𝑌𝑥2, … 𝑋𝑧1, 𝑋𝑧2, … 𝑌𝑥1𝑧1, 𝑌𝑥1𝑧2, …
  • 15. EQUIVALENCE OF POTENTIAL OUTCOMES AND SEM Encouragement (Z) Effort (X) Outcome (Y) Effect = 𝑌𝑥2 − 𝑌𝑥1 𝑦 ∶= 𝑓 𝑥, 𝜖 𝑦 Effect = 𝑑𝑦 𝑑𝑥
  • 17. IV BY GRAPHICAL MODEL Encouragement (Z) Effort (X) Outcome (Y) Unobserved Confounders (U) 𝑃 𝑦 𝑑𝑜 𝑥 = 𝑢 𝑃 𝑦 𝑥, 𝑢 𝑃(𝑢) Average Causal Effect = 𝑃 𝑦2 𝑑𝑜 𝑥2 − 𝑃 𝑦2 𝑑𝑜 𝑥1 = 𝑢 𝑃 𝑦2 𝑥2, 𝑢 − 𝑃(𝑦2|𝑥1, 𝑢) 𝑃(𝑢)
  • 18. IV BY STRUCTURAL EQUATIONS 𝑦 ∶= 𝛽𝑥 + 𝜖 𝑦 𝑥 ∶= 𝛾𝑧 + 𝜖 𝑥 Local average Causal effect = 𝐶𝑜𝑣(𝑌, 𝑋) 𝐶𝑜𝑣(𝑋,𝑍)
  • 19. IV IN POTENTIAL OUTCOMES • Assumptions: • 𝑌𝑥1𝑧1 = 𝑌𝑥1𝑧2 Local average Causal Effect = 𝐸 𝑌 𝑋2 – 𝐸 𝑌 𝑋1 𝐸 𝑋 𝑍2 −𝐸 𝑋 𝑍1
  • 20. BEST PRACTICES • A randomized experiment, or a problem with few variables: • Use Potential outcomes framework: Simple and practical. • An observational study with many confounders, or any problem with many variables: • Use graphical models to encode causal assumptions. • If functional forms unknown, use graphical criteria or do-calculus to estimate causal effect. • Else, a domain where functional forms are known or can be approximated • Use structural equation models to solve for causal effects, based on the causal graph.