SlideShare ist ein Scribd-Unternehmen logo
1 von 52
Downloaden Sie, um offline zu lesen
Joint causal inference
on observational and experimental datasets
Sara Magliacane, Tom Claassen, Joris M. Mooij
s.magliacane@uva.nl
What If?, 10th December, 2016
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 1 / 21
Part I
Introduction
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 2 / 21
Causal inference: learning causal relations from data
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 3 / 21
Causal inference: learning causal relations from data
Definition
X causes Y (X Y ) = intervening upon (changing) X changes Y
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 3 / 21
Causal inference: learning causal relations from data
Definition
X causes Y (X Y ) = intervening upon (changing) X changes Y
We can represent causal relations with a causal DAG (hidden vars):
X Y
Z
E.g. X = Smoking, Y = Cancer, Z = Genetics
Causal inference = structure learning of the causal DAG
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 3 / 21
Causal inference: learning causal relations from data
Definition
X causes Y (X Y ) = intervening upon (changing) X changes Y
We can represent causal relations with a causal DAG (hidden vars):
X Y
Z
E.g. X = Smoking, Y = Cancer, Z = Genetics
Causal inference = structure learning of the causal DAG
Traditionally, causal relations are inferred from interventions.
Sometimes, interventions are unethical, unfeasible or too expensive
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 3 / 21
Causal inference from observational and experimental data
Holy Grail of Causal Inference
Learn as much causal structure as possible from observations,
integrating background knowledge and experimental data.
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 4 / 21
Causal inference from observational and experimental data
Holy Grail of Causal Inference
Learn as much causal structure as possible from observations,
integrating background knowledge and experimental data.
Current causal inference methods:
Score-based: evaluate models using a penalized likelihood score
Constraint-based: use statistical independences to express
constraints over possible causal models
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 4 / 21
Causal inference from observational and experimental data
Holy Grail of Causal Inference
Learn as much causal structure as possible from observations,
integrating background knowledge and experimental data.
Current causal inference methods:
Score-based: evaluate models using a penalized likelihood score
Constraint-based: use statistical independences to express
constraints over possible causal models
Advantage of constraint-based methods:
can handle latent confounders naturally
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 4 / 21
Joint inference on observational and experimental data
Advantage of score-based methods:
can formulate joint inference on observational and experimental data
and learn the targets of interventions, e.g. [Eaton and Murphy, 2007].
raf
mek12
pip2
erk
akt
p38jnk
pka
(a) (b)
raf
erk
p38
pkc
akt
pka
mek12
B2cAMP
f
erk
akt
jnk
Psitect AKT inh U0126
PMA
p38
G06967
mek12
raf
pkc
pip3
plcy
pip2
pka
Present
Missing
Int. edge
(c) (d)
logical data. (a) A partial model of the T-cell pathway, as currently accepted by biologists. The small round
ent various interventions (green = activators, red = inhibitors). From [SPP+
05]. Reprinted with permission
h marginal probability above 0.5 as estimated by [SPP+
05]. (c) Edges with marginal probability above 0.5
g known perfect interventions. Dashed edges are ones that are missing from the union of (a) and (b). These
edges that Sachs et al missed. (d) Edges with marginal probability above 0.5 as estimated by us, assuming
entions, and a fan-in bound of k = 2. The intervention nodes are in red, and edges from the intervention
d edges are ones that are missing from the union of (a) and (b). This figure is best viewed in colour.
Example from [Eaton and Murphy, 2007]
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 5 / 21
Joint inference on observational and experimental data
Advantage of score-based methods:
can formulate joint inference on observational and experimental data
and learn the targets of interventions, e.g. [Eaton and Murphy, 2007].
raf
mek12
pip2
erk
akt
p38jnk
pka
(a) (b)
raf
erk
p38
pkc
akt
pka
mek12
B2cAMP
f
erk
akt
jnk
Psitect AKT inh U0126
PMA
p38
G06967
mek12
raf
pkc
pip3
plcy
pip2
pka
Present
Missing
Int. edge
(c) (d)
logical data. (a) A partial model of the T-cell pathway, as currently accepted by biologists. The small round
ent various interventions (green = activators, red = inhibitors). From [SPP+
05]. Reprinted with permission
h marginal probability above 0.5 as estimated by [SPP+
05]. (c) Edges with marginal probability above 0.5
g known perfect interventions. Dashed edges are ones that are missing from the union of (a) and (b). These
edges that Sachs et al missed. (d) Edges with marginal probability above 0.5 as estimated by us, assuming
entions, and a fan-in bound of k = 2. The intervention nodes are in red, and edges from the intervention
d edges are ones that are missing from the union of (a) and (b). This figure is best viewed in colour.
Example from [Eaton and Murphy, 2007]
Goal: Can we perform joint inference using constraint-based methods?
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 5 / 21
Part II
Joint Causal Inference
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 6 / 21
Joint Causal Inference: Assumptions
Idea: Model jointly several observational or experimental datasets
{Dr }r∈{1...n} with zero or more possibly unknown intervention targets.
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 7 / 21
Joint Causal Inference: Assumptions
Idea: Model jointly several observational or experimental datasets
{Dr }r∈{1...n} with zero or more possibly unknown intervention targets.
We assume a unique underlying causal DAG across datasets defined
over system variables {Xj }j∈X (some of which possibly hidden).
Example
X1 X2
X3
Dataset D1
unknown ints
X1 X2
X3
Dataset D2
unknown ints
X1 X2
X3
Dataset D3
do X1
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 7 / 21
Joint Causal Inference: Assumptions
Idea: Model jointly several observational or experimental datasets
{Dr }r∈{1...n} with zero or more possibly unknown intervention targets.
We assume a unique underlying causal DAG across datasets defined
over system variables {Xj }j∈X (some of which possibly hidden).
Example
X1 X2
X3
Dataset D1
unknown ints
X1 X2
X3
Dataset D2
unknown ints
X1 X2
X3
Dataset D3
do X1
Note: cannot handle certain intervention types, e.g. perfect interventions.
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 7 / 21
Joint Causal Inference: SCM
We introduce two types of dummy variables in the data:
a regime variable R, indicating which dataset Dr a data point is
from
intervention variables {Ii }i∈I, functions of R
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 8 / 21
Joint Causal Inference: SCM
We introduce two types of dummy variables in the data:
a regime variable R, indicating which dataset Dr a data point is
from
intervention variables {Ii }i∈I, functions of R
We assume that we can represent the whole system as an acyclic SCM:



R = ER,
Ii = gi (R), i ∈ I,
Xj = fj (Xpa(Xj )∩X , Ipa(Xj )∩I, Ej ), j ∈ X,
P (Ek)k∈X∪{R} =
k∈X∪{R}
P(Ek).
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 8 / 21
Joint Causal Inference: single joint causal DAG
We represent the SCM with a causal DAG C representing all datasets
jointly:
R I1 I2 X1 X2 X4
1 20 0 0.1 0.2 0.5
1 20 0 0.13 0.21 0.49
1 20 0 . . . . . . . . .
2 20 1 . . . . . . . . .
3 30 0 . . . . . . . . .
4 30 1 . . . . . . . . .
4 datasets with 2 interventions
R
I1 I2
X1
X2
X3
X4
Causal DAG C
We assume Causal Markov and Minimality hold in C.
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 9 / 21
Joint Causal Inference: Faithfulness violations
Causal Faithfulness assumption:
X ⊥⊥ Y | W =⇒ X ⊥d Y | W [C]
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 10 / 21
Joint Causal Inference: Faithfulness violations
Causal Faithfulness assumption:
X ⊥⊥ Y | W =⇒ X ⊥d Y | W [C]
R I1 X1 I1 = g1(R) =⇒ X1 ⊥⊥ I1 | R ... but X1 ⊥d I1 | R
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 10 / 21
Joint Causal Inference: Faithfulness violations
Causal Faithfulness assumption:
X ⊥⊥ Y | W =⇒ X ⊥d Y | W [C]
R I1 X1 I1 = g1(R) =⇒ X1 ⊥⊥ I1 | R ... but X1 ⊥d I1 | R
Solution: D-separation [Spirtes et al., 2000]: X1 ⊥D I1 | R
Problem: ⊥D provably complete (for now) for functionally determined
relations
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 10 / 21
Joint Causal Inference: Faithfulness violations
Causal Faithfulness assumption:
X ⊥⊥ Y | W =⇒ X ⊥d Y | W [C]
R I1 X1 I1 = g1(R) =⇒ X1 ⊥⊥ I1 | R ... but X1 ⊥d I1 | R
Solution: D-separation [Spirtes et al., 2000]: X1 ⊥D I1 | R
Problem: ⊥D provably complete (for now) for functionally determined
relations
Solution: restrict deterministic relations to only ∀i ∈ I : Ii = gi (R)
D-Faithfulness: X ⊥⊥ Y | W =⇒ X ⊥D Y | W [C].
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 10 / 21
Joint Causal Inference
Joint Causal Inference (JCI) = Given all the assumptions, reconstruct
the causal DAG C from independence test results.
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 11 / 21
Joint Causal Inference
Joint Causal Inference (JCI) = Given all the assumptions, reconstruct
the causal DAG C from independence test results.
X1 ⊥⊥ I1
X1 ⊥⊥ I1 | R
X4 ⊥⊥ X2 | I1
. . .
=⇒
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 11 / 21
Joint Causal Inference
Joint Causal Inference (JCI) = Given all the assumptions, reconstruct
the causal DAG C from independence test results.
X1 ⊥⊥ I1
X1 ⊥⊥ I1 | R
X4 ⊥⊥ X2 | I1
. . .
=⇒
R
I1 I2
X1
X2
X3
X4
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 11 / 21
Joint Causal Inference
Joint Causal Inference (JCI) = Given all the assumptions, reconstruct
the causal DAG C from independence test results.
X1 ⊥⊥ I1
X1 ⊥⊥ I1 | R
X4 ⊥⊥ X2 | I1
. . .
=⇒
R
I1 I2
X1
X2
X3
X4
Problem: Current constraint-based methods cannot work with JCI,
because of faithfulness violations.
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 11 / 21
Part III
Extending constraint-based methods for JCI
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 12 / 21
A simple strategy for dealing with faithfulness violations
Idea: Faithfulness violations → Partial inputs
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 13 / 21
A simple strategy for dealing with faithfulness violations
Idea: Faithfulness violations → Partial inputs
A simple strategy for dealing with functionally determined relations:
1 Rephrase constraint-based method in terms of d-separations
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 13 / 21
A simple strategy for dealing with faithfulness violations
Idea: Faithfulness violations → Partial inputs
A simple strategy for dealing with functionally determined relations:
1 Rephrase constraint-based method in terms of d-separations
2 For each independence test result derive sound d-separations:
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 13 / 21
A simple strategy for dealing with faithfulness violations
Idea: Faithfulness violations → Partial inputs
A simple strategy for dealing with functionally determined relations:
1 Rephrase constraint-based method in terms of d-separations
2 For each independence test result derive sound d-separations:
X ⊥⊥ Y |W =⇒ X ⊥d Y |W
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 13 / 21
A simple strategy for dealing with faithfulness violations
Idea: Faithfulness violations → Partial inputs
A simple strategy for dealing with functionally determined relations:
1 Rephrase constraint-based method in terms of d-separations
2 For each independence test result derive sound d-separations:
X ⊥⊥ Y |W =⇒ X ⊥d Y |W
X ∈ Det(W ) and Y ∈ Det(W ) and X ⊥⊥ Y | W =⇒
X ⊥d Y | Det(W )
where Det(W ) = variables determined by (a subset of) W .
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 13 / 21
A simple strategy for dealing with faithfulness violations
Idea: Faithfulness violations → Partial inputs
A simple strategy for dealing with functionally determined relations:
1 Rephrase constraint-based method in terms of d-separations
2 For each independence test result derive sound d-separations:
X ⊥⊥ Y |W =⇒ X ⊥d Y |W
X ∈ Det(W ) and Y ∈ Det(W ) and X ⊥⊥ Y | W =⇒
X ⊥d Y | Det(W )
where Det(W ) = variables determined by (a subset of) W .
Conjecture: sound also for a larger class of deterministic relations.
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 13 / 21
Ancestral Causal Inference with Determinism (ACID)
Idea: Faithfulness violations → Partial inputs
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 14 / 21
Ancestral Causal Inference with Determinism (ACID)
Idea: Faithfulness violations → Partial inputs
Traditional constraint-based methods (e.g., PC, FCI, ...):
cannot handle partial inputs
cannot exploit the rich background knowledge in JCI
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 14 / 21
Ancestral Causal Inference with Determinism (ACID)
Idea: Faithfulness violations → Partial inputs
Traditional constraint-based methods (e.g., PC, FCI, ...):
cannot handle partial inputs
cannot exploit the rich background knowledge in JCI
Solution: Logic-based methods, e.g., ACI [Magliacane et al., 2016]
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 14 / 21
Ancestral Causal Inference with Determinism (ACID)
Idea: Faithfulness violations → Partial inputs
Traditional constraint-based methods (e.g., PC, FCI, ...):
cannot handle partial inputs
cannot exploit the rich background knowledge in JCI
Solution: Logic-based methods, e.g., ACI [Magliacane et al., 2016]
=⇒ we implement the strategy in an extension of ACI called:
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 14 / 21
Rephrasing ACI rules in terms of d-separations
ACI rules:
Example
For X, Y , W disjoint (sets of) variables:
(X ⊥⊥ Y | W ) ∧ (X W ) =⇒ X Y
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 15 / 21
Rephrasing ACI rules in terms of d-separations
ACI rules:
Example
For X, Y , W disjoint (sets of) variables:
(X ⊥⊥ Y | W ) ∧ (X W ) =⇒ X Y
ACID rules:
Example
For X, Y , W disjoint (sets of) variables:
(X ⊥d Y | W ) ∧ (X W ) =⇒ X Y
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 15 / 21
ACID-JCI
ACID-JCI = ACID rules + sound d-separations from strategy
+ JCI background knowledge
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 16 / 21
ACID-JCI
ACID-JCI = ACID rules + sound d-separations from strategy
+ JCI background knowledge
For example:
∀i ∈ I, ∀j ∈ X : (Xj R) ∧ (Xj Ii )
“Standard variables cannot cause dummy variables”
Example
I1
X1
R
I1
X1
R
I1 X1
R
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 16 / 21
ACID-JCI
ACID-JCI = ACID rules + sound d-separations from strategy
+ JCI background knowledge
For example:
∀i ∈ I, ∀j ∈ X : (Xj R) ∧ (Xj Ii )
“Standard variables cannot cause dummy variables”
Example
I1
X1
R
I1
X1
R
I1 X1
R
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 16 / 21
Part IV
Evaluation
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 17 / 21
Simulated data accuracy: example Precision Recall curve
Ancestral (“causes”) relations Non-ancestral (“not causes”)
Precision Recall curves of 2000 randomly generated causal graphs for
4 system variables and 3 interventions
ACID-JCI substantially improves accuracy w.r.t. merging learnt
structures (merged ACI)
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 18 / 21
Conclusion
Joint Causal Inference, a powerful formulation of causal discovery
over multiple datasets for constraint-based methods
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 19 / 21
Conclusion
Joint Causal Inference, a powerful formulation of causal discovery
over multiple datasets for constraint-based methods
A simple strategy for dealing with faithfulness violations due to
functionally determined relations
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 19 / 21
Conclusion
Joint Causal Inference, a powerful formulation of causal discovery
over multiple datasets for constraint-based methods
A simple strategy for dealing with faithfulness violations due to
functionally determined relations
An implementation, ACID-JCI, that substantially improves the
accuracy w.r.t. state-of-the-art
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 19 / 21
Conclusion
Joint Causal Inference, a powerful formulation of causal discovery
over multiple datasets for constraint-based methods
A simple strategy for dealing with faithfulness violations due to
functionally determined relations
An implementation, ACID-JCI, that substantially improves the
accuracy w.r.t. state-of-the-art
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 19 / 21
Conclusion
Joint Causal Inference, a powerful formulation of causal discovery
over multiple datasets for constraint-based methods
A simple strategy for dealing with faithfulness violations due to
functionally determined relations
An implementation, ACID-JCI, that substantially improves the
accuracy w.r.t. state-of-the-art
Future work:
Improve scalability (now max 7 variables in C).
Working paper: https://arxiv.org/abs/1611.10351
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 19 / 21
Conclusion
Joint Causal Inference, a powerful formulation of causal discovery
over multiple datasets for constraint-based methods
A simple strategy for dealing with faithfulness violations due to
functionally determined relations
An implementation, ACID-JCI, that substantially improves the
accuracy w.r.t. state-of-the-art
Future work:
Improve scalability (now max 7 variables in C).
Working paper: https://arxiv.org/abs/1611.10351
Collaborators:
Tom Claassen Joris M. Mooij
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 19 / 21
References I
Claassen, T. and Heskes, T. (2011).
A logical characterization of constraint-based causal discovery.
In UAI 2011, pages 135–144.
Eaton, D. and Murphy, K. (2007).
Exact Bayesian structure learning from uncertain interventions.
In Proceedings of the 10th International Conference on Artificial Intelligence and Statistics (AISTATS), pages 107–114.
Magliacane, S., Claassen, T., and Mooij, J. M. (2016).
Ancestral causal inference.
In NIPS.
Mooij, J. M. and Heskes, T. (2013).
Cyclic causal discovery from continuous equilibrium data.
In Nicholson, A. and Smyth, P., editors, Proceedings of the 29th Annual Conference on Uncertainty in Artificial
Intelligence (UAI-13), pages 431–439. AUAI Press.
Spirtes, P., Glymour, C., and Scheines, R. (2000).
Causation, Prediction, and Search.
MIT press, 2nd edition.
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 20 / 21
Ancestral Causal Inference (ACI)
Weighted list of inputs: I = {(ij , wj )}:
E.g. I = { (Y ⊥⊥ Z | X, 0.2), (Y ⊥⊥ X, 0.1)} }
Any consistent weighting scheme, e.g. frequentist, Bayesian
For any possible ancestral structure C, we define the loss function:
Loss(C, I) :=
(ij ,wj )∈I: ij is not satisfied in C
wj
Here: “ij is not satisfied in C” = defined by ancestral reasoning rules
For each possible causal relation X Y provide score:
Conf (X Y ) = min
C∈C
Loss(C, I + (X Y , ∞))
− min
C∈C
Loss(C, I + (X Y , ∞))
Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 21 / 21

Weitere ähnliche Inhalte

Was ist angesagt?

Vahid Taslimitehrani PhD Dissertation Defense: Contrast Pattern Aided Regress...
Vahid Taslimitehrani PhD Dissertation Defense: Contrast Pattern Aided Regress...Vahid Taslimitehrani PhD Dissertation Defense: Contrast Pattern Aided Regress...
Vahid Taslimitehrani PhD Dissertation Defense: Contrast Pattern Aided Regress...
Artificial Intelligence Institute at UofSC
 

Was ist angesagt? (12)

Vahid Taslimitehrani PhD Dissertation Defense: Contrast Pattern Aided Regress...
Vahid Taslimitehrani PhD Dissertation Defense: Contrast Pattern Aided Regress...Vahid Taslimitehrani PhD Dissertation Defense: Contrast Pattern Aided Regress...
Vahid Taslimitehrani PhD Dissertation Defense: Contrast Pattern Aided Regress...
 
Non-Gaussian structural equation models for causal discovery
Non-Gaussian structural equation models for causal discoveryNon-Gaussian structural equation models for causal discovery
Non-Gaussian structural equation models for causal discovery
 
Big Data Analysis
Big Data AnalysisBig Data Analysis
Big Data Analysis
 
効率的反実仮想学習
効率的反実仮想学習効率的反実仮想学習
効率的反実仮想学習
 
On Extension of Weibull Distribution with Bayesian Analysis using S-Plus Soft...
On Extension of Weibull Distribution with Bayesian Analysis using S-Plus Soft...On Extension of Weibull Distribution with Bayesian Analysis using S-Plus Soft...
On Extension of Weibull Distribution with Bayesian Analysis using S-Plus Soft...
 
PMED Transition Workshop - Non-parametric Techniques for Estimating Tumor Het...
PMED Transition Workshop - Non-parametric Techniques for Estimating Tumor Het...PMED Transition Workshop - Non-parametric Techniques for Estimating Tumor Het...
PMED Transition Workshop - Non-parametric Techniques for Estimating Tumor Het...
 
A non-Gaussian model for causal discovery in the presence of hidden common ca...
A non-Gaussian model for causal discovery in the presence of hidden common ca...A non-Gaussian model for causal discovery in the presence of hidden common ca...
A non-Gaussian model for causal discovery in the presence of hidden common ca...
 
APHA 2012: Hierarchical Multiple Informants Model
APHA 2012: Hierarchical Multiple Informants ModelAPHA 2012: Hierarchical Multiple Informants Model
APHA 2012: Hierarchical Multiple Informants Model
 
Reading revue of "Inferring Multiple Graphical Structures"
Reading revue of "Inferring Multiple Graphical Structures"Reading revue of "Inferring Multiple Graphical Structures"
Reading revue of "Inferring Multiple Graphical Structures"
 
Les5e ppt 08
Les5e ppt 08Les5e ppt 08
Les5e ppt 08
 
Probabilistic Reasoning
Probabilistic ReasoningProbabilistic Reasoning
Probabilistic Reasoning
 
PMED Opening Workshop - Inference on Individualized Treatment Rules from Obse...
PMED Opening Workshop - Inference on Individualized Treatment Rules from Obse...PMED Opening Workshop - Inference on Individualized Treatment Rules from Obse...
PMED Opening Workshop - Inference on Individualized Treatment Rules from Obse...
 

Andere mochten auch

Uncertainty business strategy by bhawani nandan prasad iim calcutta
Uncertainty business strategy by bhawani nandan prasad iim calcuttaUncertainty business strategy by bhawani nandan prasad iim calcutta
Uncertainty business strategy by bhawani nandan prasad iim calcutta
Bhawani N Prasad
 
Bayesian Inference: An Introduction to Principles and ...
Bayesian Inference: An Introduction to Principles and ...Bayesian Inference: An Introduction to Principles and ...
Bayesian Inference: An Introduction to Principles and ...
butest
 
NIPS2010: optimization algorithms in machine learning
NIPS2010: optimization algorithms in machine learningNIPS2010: optimization algorithms in machine learning
NIPS2010: optimization algorithms in machine learning
zukun
 

Andere mochten auch (20)

ISWC 2012 "Efficient execution of top-k SPARQL queries"
ISWC 2012 "Efficient execution of top-k SPARQL queries"ISWC 2012 "Efficient execution of top-k SPARQL queries"
ISWC 2012 "Efficient execution of top-k SPARQL queries"
 
ISWC DC poster "Reconstructing Provenance"
ISWC DC poster "Reconstructing Provenance"ISWC DC poster "Reconstructing Provenance"
ISWC DC poster "Reconstructing Provenance"
 
A Constructivist Approach to Rule Bases
A Constructivist Approach to Rule BasesA Constructivist Approach to Rule Bases
A Constructivist Approach to Rule Bases
 
Bridging Representation of Laws, of Implementations and of Behaviours
Bridging Representation of Laws, of Implementations and of BehavioursBridging Representation of Laws, of Implementations and of Behaviours
Bridging Representation of Laws, of Implementations and of Behaviours
 
Commitments, Expectations, Affordances and Susceptibilities: Towards Position...
Commitments, Expectations, Affordances and Susceptibilities: Towards Position...Commitments, Expectations, Affordances and Susceptibilities: Towards Position...
Commitments, Expectations, Affordances and Susceptibilities: Towards Position...
 
Reading Agendas Between the Lines, an exercise
Reading Agendas Between the Lines, an exerciseReading Agendas Between the Lines, an exercise
Reading Agendas Between the Lines, an exercise
 
Revisiting Constitutive Rules
Revisiting Constitutive RulesRevisiting Constitutive Rules
Revisiting Constitutive Rules
 
Trust Models for RDF Data: Semantics and Complexity - AAAI2015
Trust Models for RDF Data: Semantics and Complexity - AAAI2015Trust Models for RDF Data: Semantics and Complexity - AAAI2015
Trust Models for RDF Data: Semantics and Complexity - AAAI2015
 
Formal Maps and their Algebra
Formal Maps and their AlgebraFormal Maps and their Algebra
Formal Maps and their Algebra
 
Latinos and Social Media
Latinos and Social MediaLatinos and Social Media
Latinos and Social Media
 
On the Interactional Meaning of Fundamental Legal Concepts
On the Interactional Meaning of Fundamental Legal ConceptsOn the Interactional Meaning of Fundamental Legal Concepts
On the Interactional Meaning of Fundamental Legal Concepts
 
Uncertainty business strategy by bhawani nandan prasad iim calcutta
Uncertainty business strategy by bhawani nandan prasad iim calcuttaUncertainty business strategy by bhawani nandan prasad iim calcutta
Uncertainty business strategy by bhawani nandan prasad iim calcutta
 
Quantiative uncertainty in QSAR predictions - Bayesian predictive inference a...
Quantiative uncertainty in QSAR predictions - Bayesian predictive inference a...Quantiative uncertainty in QSAR predictions - Bayesian predictive inference a...
Quantiative uncertainty in QSAR predictions - Bayesian predictive inference a...
 
Bayesian Inference: An Introduction to Principles and ...
Bayesian Inference: An Introduction to Principles and ...Bayesian Inference: An Introduction to Principles and ...
Bayesian Inference: An Introduction to Principles and ...
 
Knowledge management on the desktop
Knowledge management on the desktopKnowledge management on the desktop
Knowledge management on the desktop
 
The Uncertainty Model: Understanding What Business You Are In
The Uncertainty Model: Understanding What Business You Are InThe Uncertainty Model: Understanding What Business You Are In
The Uncertainty Model: Understanding What Business You Are In
 
Night Knights: exploiting games to engage people in a citizen science campaign
Night Knights: exploiting games to engage people in a citizen science campaignNight Knights: exploiting games to engage people in a citizen science campaign
Night Knights: exploiting games to engage people in a citizen science campaign
 
NIPS2010: optimization algorithms in machine learning
NIPS2010: optimization algorithms in machine learningNIPS2010: optimization algorithms in machine learning
NIPS2010: optimization algorithms in machine learning
 
A Model-Based Method for Seamless Web and Mobile Experience. Splash 2016 conf.
A Model-Based Method for  Seamless Web and Mobile Experience. Splash 2016 conf.A Model-Based Method for  Seamless Web and Mobile Experience. Splash 2016 conf.
A Model-Based Method for Seamless Web and Mobile Experience. Splash 2016 conf.
 
BPMN and Design Patterns for Engineering Social BPM Solutions
BPMN and Design Patterns for Engineering Social BPM SolutionsBPMN and Design Patterns for Engineering Social BPM Solutions
BPMN and Design Patterns for Engineering Social BPM Solutions
 

Ähnlich wie Talk: Joint causal inference on observational and experimental data - NIPS 2016 "What If?" workshop poster

A Causal Framework for Meta-Analysis, drafty
A Causal Framework for Meta-Analysis, drafty A Causal Framework for Meta-Analysis, drafty
A Causal Framework for Meta-Analysis, drafty
Wei Wang
 

Ähnlich wie Talk: Joint causal inference on observational and experimental data - NIPS 2016 "What If?" workshop poster (20)

Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
 
Artificial Intelligence 06.2 More on Causality Bayesian Networks
Artificial Intelligence 06.2 More on  Causality Bayesian NetworksArtificial Intelligence 06.2 More on  Causality Bayesian Networks
Artificial Intelligence 06.2 More on Causality Bayesian Networks
 
Identifiability in Dynamic Casual Networks
Identifiability in Dynamic Casual NetworksIdentifiability in Dynamic Casual Networks
Identifiability in Dynamic Casual Networks
 
Basen Network
Basen NetworkBasen Network
Basen Network
 
PMED Transition Workshop - Virtual Tumor Populations from a Randomized Reacti...
PMED Transition Workshop - Virtual Tumor Populations from a Randomized Reacti...PMED Transition Workshop - Virtual Tumor Populations from a Randomized Reacti...
PMED Transition Workshop - Virtual Tumor Populations from a Randomized Reacti...
 
Essential econometrics for data scientists
Essential econometrics for data scientistsEssential econometrics for data scientists
Essential econometrics for data scientists
 
Consensual gene co-expression network inference with multiple samples
Consensual gene co-expression network inference with multiple samplesConsensual gene co-expression network inference with multiple samples
Consensual gene co-expression network inference with multiple samples
 
Datascience Introduction WebSci Summer School 2014
Datascience Introduction WebSci Summer School 2014Datascience Introduction WebSci Summer School 2014
Datascience Introduction WebSci Summer School 2014
 
A short and naive introduction to epistasis in association studies
A short and naive introduction to epistasis in association studiesA short and naive introduction to epistasis in association studies
A short and naive introduction to epistasis in association studies
 
Defeasible systems in legal reasoning: a comparative assessment
Defeasible systems in legal reasoning: a comparative assessment Defeasible systems in legal reasoning: a comparative assessment
Defeasible systems in legal reasoning: a comparative assessment
 
Ievobio2013
Ievobio2013Ievobio2013
Ievobio2013
 
Ec33772776
Ec33772776Ec33772776
Ec33772776
 
Ec33772776
Ec33772776Ec33772776
Ec33772776
 
A Causal Framework for Meta-Analysis, drafty
A Causal Framework for Meta-Analysis, drafty A Causal Framework for Meta-Analysis, drafty
A Causal Framework for Meta-Analysis, drafty
 
Sandhya Prabhakaran - A Bayesian Approach To Model Overlapping Objects Availa...
Sandhya Prabhakaran - A Bayesian Approach To Model Overlapping Objects Availa...Sandhya Prabhakaran - A Bayesian Approach To Model Overlapping Objects Availa...
Sandhya Prabhakaran - A Bayesian Approach To Model Overlapping Objects Availa...
 
main
mainmain
main
 
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
Program on Mathematical and Statistical Methods for Climate and the Earth Sys...
 
Estimating Space-Time Covariance from Finite Sample Sets
Estimating Space-Time Covariance from Finite Sample SetsEstimating Space-Time Covariance from Finite Sample Sets
Estimating Space-Time Covariance from Finite Sample Sets
 
High-Dimensional Machine Learning for Medicine
High-Dimensional Machine Learning for MedicineHigh-Dimensional Machine Learning for Medicine
High-Dimensional Machine Learning for Medicine
 
An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...
An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...
An Importance Sampling Approach to Integrate Expert Knowledge When Learning B...
 

Kürzlich hochgeladen

Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
RohitNehra6
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Sérgio Sacani
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
gindu3009
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Sérgio Sacani
 

Kürzlich hochgeladen (20)

Green chemistry and Sustainable development.pptx
Green chemistry  and Sustainable development.pptxGreen chemistry  and Sustainable development.pptx
Green chemistry and Sustainable development.pptx
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
Creating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening DesignsCreating and Analyzing Definitive Screening Designs
Creating and Analyzing Definitive Screening Designs
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
Vip profile Call Girls In Lonavala 9748763073 For Genuine Sex Service At Just...
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Zoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdfZoology 4th semester series (krishna).pdf
Zoology 4th semester series (krishna).pdf
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 60009654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
9654467111 Call Girls In Raj Nagar Delhi Short 1500 Night 6000
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 

Talk: Joint causal inference on observational and experimental data - NIPS 2016 "What If?" workshop poster

  • 1. Joint causal inference on observational and experimental datasets Sara Magliacane, Tom Claassen, Joris M. Mooij s.magliacane@uva.nl What If?, 10th December, 2016 Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 1 / 21
  • 2. Part I Introduction Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 2 / 21
  • 3. Causal inference: learning causal relations from data Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 3 / 21
  • 4. Causal inference: learning causal relations from data Definition X causes Y (X Y ) = intervening upon (changing) X changes Y Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 3 / 21
  • 5. Causal inference: learning causal relations from data Definition X causes Y (X Y ) = intervening upon (changing) X changes Y We can represent causal relations with a causal DAG (hidden vars): X Y Z E.g. X = Smoking, Y = Cancer, Z = Genetics Causal inference = structure learning of the causal DAG Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 3 / 21
  • 6. Causal inference: learning causal relations from data Definition X causes Y (X Y ) = intervening upon (changing) X changes Y We can represent causal relations with a causal DAG (hidden vars): X Y Z E.g. X = Smoking, Y = Cancer, Z = Genetics Causal inference = structure learning of the causal DAG Traditionally, causal relations are inferred from interventions. Sometimes, interventions are unethical, unfeasible or too expensive Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 3 / 21
  • 7. Causal inference from observational and experimental data Holy Grail of Causal Inference Learn as much causal structure as possible from observations, integrating background knowledge and experimental data. Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 4 / 21
  • 8. Causal inference from observational and experimental data Holy Grail of Causal Inference Learn as much causal structure as possible from observations, integrating background knowledge and experimental data. Current causal inference methods: Score-based: evaluate models using a penalized likelihood score Constraint-based: use statistical independences to express constraints over possible causal models Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 4 / 21
  • 9. Causal inference from observational and experimental data Holy Grail of Causal Inference Learn as much causal structure as possible from observations, integrating background knowledge and experimental data. Current causal inference methods: Score-based: evaluate models using a penalized likelihood score Constraint-based: use statistical independences to express constraints over possible causal models Advantage of constraint-based methods: can handle latent confounders naturally Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 4 / 21
  • 10. Joint inference on observational and experimental data Advantage of score-based methods: can formulate joint inference on observational and experimental data and learn the targets of interventions, e.g. [Eaton and Murphy, 2007]. raf mek12 pip2 erk akt p38jnk pka (a) (b) raf erk p38 pkc akt pka mek12 B2cAMP f erk akt jnk Psitect AKT inh U0126 PMA p38 G06967 mek12 raf pkc pip3 plcy pip2 pka Present Missing Int. edge (c) (d) logical data. (a) A partial model of the T-cell pathway, as currently accepted by biologists. The small round ent various interventions (green = activators, red = inhibitors). From [SPP+ 05]. Reprinted with permission h marginal probability above 0.5 as estimated by [SPP+ 05]. (c) Edges with marginal probability above 0.5 g known perfect interventions. Dashed edges are ones that are missing from the union of (a) and (b). These edges that Sachs et al missed. (d) Edges with marginal probability above 0.5 as estimated by us, assuming entions, and a fan-in bound of k = 2. The intervention nodes are in red, and edges from the intervention d edges are ones that are missing from the union of (a) and (b). This figure is best viewed in colour. Example from [Eaton and Murphy, 2007] Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 5 / 21
  • 11. Joint inference on observational and experimental data Advantage of score-based methods: can formulate joint inference on observational and experimental data and learn the targets of interventions, e.g. [Eaton and Murphy, 2007]. raf mek12 pip2 erk akt p38jnk pka (a) (b) raf erk p38 pkc akt pka mek12 B2cAMP f erk akt jnk Psitect AKT inh U0126 PMA p38 G06967 mek12 raf pkc pip3 plcy pip2 pka Present Missing Int. edge (c) (d) logical data. (a) A partial model of the T-cell pathway, as currently accepted by biologists. The small round ent various interventions (green = activators, red = inhibitors). From [SPP+ 05]. Reprinted with permission h marginal probability above 0.5 as estimated by [SPP+ 05]. (c) Edges with marginal probability above 0.5 g known perfect interventions. Dashed edges are ones that are missing from the union of (a) and (b). These edges that Sachs et al missed. (d) Edges with marginal probability above 0.5 as estimated by us, assuming entions, and a fan-in bound of k = 2. The intervention nodes are in red, and edges from the intervention d edges are ones that are missing from the union of (a) and (b). This figure is best viewed in colour. Example from [Eaton and Murphy, 2007] Goal: Can we perform joint inference using constraint-based methods? Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 5 / 21
  • 12. Part II Joint Causal Inference Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 6 / 21
  • 13. Joint Causal Inference: Assumptions Idea: Model jointly several observational or experimental datasets {Dr }r∈{1...n} with zero or more possibly unknown intervention targets. Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 7 / 21
  • 14. Joint Causal Inference: Assumptions Idea: Model jointly several observational or experimental datasets {Dr }r∈{1...n} with zero or more possibly unknown intervention targets. We assume a unique underlying causal DAG across datasets defined over system variables {Xj }j∈X (some of which possibly hidden). Example X1 X2 X3 Dataset D1 unknown ints X1 X2 X3 Dataset D2 unknown ints X1 X2 X3 Dataset D3 do X1 Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 7 / 21
  • 15. Joint Causal Inference: Assumptions Idea: Model jointly several observational or experimental datasets {Dr }r∈{1...n} with zero or more possibly unknown intervention targets. We assume a unique underlying causal DAG across datasets defined over system variables {Xj }j∈X (some of which possibly hidden). Example X1 X2 X3 Dataset D1 unknown ints X1 X2 X3 Dataset D2 unknown ints X1 X2 X3 Dataset D3 do X1 Note: cannot handle certain intervention types, e.g. perfect interventions. Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 7 / 21
  • 16. Joint Causal Inference: SCM We introduce two types of dummy variables in the data: a regime variable R, indicating which dataset Dr a data point is from intervention variables {Ii }i∈I, functions of R Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 8 / 21
  • 17. Joint Causal Inference: SCM We introduce two types of dummy variables in the data: a regime variable R, indicating which dataset Dr a data point is from intervention variables {Ii }i∈I, functions of R We assume that we can represent the whole system as an acyclic SCM:    R = ER, Ii = gi (R), i ∈ I, Xj = fj (Xpa(Xj )∩X , Ipa(Xj )∩I, Ej ), j ∈ X, P (Ek)k∈X∪{R} = k∈X∪{R} P(Ek). Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 8 / 21
  • 18. Joint Causal Inference: single joint causal DAG We represent the SCM with a causal DAG C representing all datasets jointly: R I1 I2 X1 X2 X4 1 20 0 0.1 0.2 0.5 1 20 0 0.13 0.21 0.49 1 20 0 . . . . . . . . . 2 20 1 . . . . . . . . . 3 30 0 . . . . . . . . . 4 30 1 . . . . . . . . . 4 datasets with 2 interventions R I1 I2 X1 X2 X3 X4 Causal DAG C We assume Causal Markov and Minimality hold in C. Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 9 / 21
  • 19. Joint Causal Inference: Faithfulness violations Causal Faithfulness assumption: X ⊥⊥ Y | W =⇒ X ⊥d Y | W [C] Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 10 / 21
  • 20. Joint Causal Inference: Faithfulness violations Causal Faithfulness assumption: X ⊥⊥ Y | W =⇒ X ⊥d Y | W [C] R I1 X1 I1 = g1(R) =⇒ X1 ⊥⊥ I1 | R ... but X1 ⊥d I1 | R Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 10 / 21
  • 21. Joint Causal Inference: Faithfulness violations Causal Faithfulness assumption: X ⊥⊥ Y | W =⇒ X ⊥d Y | W [C] R I1 X1 I1 = g1(R) =⇒ X1 ⊥⊥ I1 | R ... but X1 ⊥d I1 | R Solution: D-separation [Spirtes et al., 2000]: X1 ⊥D I1 | R Problem: ⊥D provably complete (for now) for functionally determined relations Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 10 / 21
  • 22. Joint Causal Inference: Faithfulness violations Causal Faithfulness assumption: X ⊥⊥ Y | W =⇒ X ⊥d Y | W [C] R I1 X1 I1 = g1(R) =⇒ X1 ⊥⊥ I1 | R ... but X1 ⊥d I1 | R Solution: D-separation [Spirtes et al., 2000]: X1 ⊥D I1 | R Problem: ⊥D provably complete (for now) for functionally determined relations Solution: restrict deterministic relations to only ∀i ∈ I : Ii = gi (R) D-Faithfulness: X ⊥⊥ Y | W =⇒ X ⊥D Y | W [C]. Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 10 / 21
  • 23. Joint Causal Inference Joint Causal Inference (JCI) = Given all the assumptions, reconstruct the causal DAG C from independence test results. Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 11 / 21
  • 24. Joint Causal Inference Joint Causal Inference (JCI) = Given all the assumptions, reconstruct the causal DAG C from independence test results. X1 ⊥⊥ I1 X1 ⊥⊥ I1 | R X4 ⊥⊥ X2 | I1 . . . =⇒ Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 11 / 21
  • 25. Joint Causal Inference Joint Causal Inference (JCI) = Given all the assumptions, reconstruct the causal DAG C from independence test results. X1 ⊥⊥ I1 X1 ⊥⊥ I1 | R X4 ⊥⊥ X2 | I1 . . . =⇒ R I1 I2 X1 X2 X3 X4 Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 11 / 21
  • 26. Joint Causal Inference Joint Causal Inference (JCI) = Given all the assumptions, reconstruct the causal DAG C from independence test results. X1 ⊥⊥ I1 X1 ⊥⊥ I1 | R X4 ⊥⊥ X2 | I1 . . . =⇒ R I1 I2 X1 X2 X3 X4 Problem: Current constraint-based methods cannot work with JCI, because of faithfulness violations. Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 11 / 21
  • 27. Part III Extending constraint-based methods for JCI Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 12 / 21
  • 28. A simple strategy for dealing with faithfulness violations Idea: Faithfulness violations → Partial inputs Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 13 / 21
  • 29. A simple strategy for dealing with faithfulness violations Idea: Faithfulness violations → Partial inputs A simple strategy for dealing with functionally determined relations: 1 Rephrase constraint-based method in terms of d-separations Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 13 / 21
  • 30. A simple strategy for dealing with faithfulness violations Idea: Faithfulness violations → Partial inputs A simple strategy for dealing with functionally determined relations: 1 Rephrase constraint-based method in terms of d-separations 2 For each independence test result derive sound d-separations: Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 13 / 21
  • 31. A simple strategy for dealing with faithfulness violations Idea: Faithfulness violations → Partial inputs A simple strategy for dealing with functionally determined relations: 1 Rephrase constraint-based method in terms of d-separations 2 For each independence test result derive sound d-separations: X ⊥⊥ Y |W =⇒ X ⊥d Y |W Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 13 / 21
  • 32. A simple strategy for dealing with faithfulness violations Idea: Faithfulness violations → Partial inputs A simple strategy for dealing with functionally determined relations: 1 Rephrase constraint-based method in terms of d-separations 2 For each independence test result derive sound d-separations: X ⊥⊥ Y |W =⇒ X ⊥d Y |W X ∈ Det(W ) and Y ∈ Det(W ) and X ⊥⊥ Y | W =⇒ X ⊥d Y | Det(W ) where Det(W ) = variables determined by (a subset of) W . Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 13 / 21
  • 33. A simple strategy for dealing with faithfulness violations Idea: Faithfulness violations → Partial inputs A simple strategy for dealing with functionally determined relations: 1 Rephrase constraint-based method in terms of d-separations 2 For each independence test result derive sound d-separations: X ⊥⊥ Y |W =⇒ X ⊥d Y |W X ∈ Det(W ) and Y ∈ Det(W ) and X ⊥⊥ Y | W =⇒ X ⊥d Y | Det(W ) where Det(W ) = variables determined by (a subset of) W . Conjecture: sound also for a larger class of deterministic relations. Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 13 / 21
  • 34. Ancestral Causal Inference with Determinism (ACID) Idea: Faithfulness violations → Partial inputs Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 14 / 21
  • 35. Ancestral Causal Inference with Determinism (ACID) Idea: Faithfulness violations → Partial inputs Traditional constraint-based methods (e.g., PC, FCI, ...): cannot handle partial inputs cannot exploit the rich background knowledge in JCI Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 14 / 21
  • 36. Ancestral Causal Inference with Determinism (ACID) Idea: Faithfulness violations → Partial inputs Traditional constraint-based methods (e.g., PC, FCI, ...): cannot handle partial inputs cannot exploit the rich background knowledge in JCI Solution: Logic-based methods, e.g., ACI [Magliacane et al., 2016] Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 14 / 21
  • 37. Ancestral Causal Inference with Determinism (ACID) Idea: Faithfulness violations → Partial inputs Traditional constraint-based methods (e.g., PC, FCI, ...): cannot handle partial inputs cannot exploit the rich background knowledge in JCI Solution: Logic-based methods, e.g., ACI [Magliacane et al., 2016] =⇒ we implement the strategy in an extension of ACI called: Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 14 / 21
  • 38. Rephrasing ACI rules in terms of d-separations ACI rules: Example For X, Y , W disjoint (sets of) variables: (X ⊥⊥ Y | W ) ∧ (X W ) =⇒ X Y Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 15 / 21
  • 39. Rephrasing ACI rules in terms of d-separations ACI rules: Example For X, Y , W disjoint (sets of) variables: (X ⊥⊥ Y | W ) ∧ (X W ) =⇒ X Y ACID rules: Example For X, Y , W disjoint (sets of) variables: (X ⊥d Y | W ) ∧ (X W ) =⇒ X Y Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 15 / 21
  • 40. ACID-JCI ACID-JCI = ACID rules + sound d-separations from strategy + JCI background knowledge Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 16 / 21
  • 41. ACID-JCI ACID-JCI = ACID rules + sound d-separations from strategy + JCI background knowledge For example: ∀i ∈ I, ∀j ∈ X : (Xj R) ∧ (Xj Ii ) “Standard variables cannot cause dummy variables” Example I1 X1 R I1 X1 R I1 X1 R Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 16 / 21
  • 42. ACID-JCI ACID-JCI = ACID rules + sound d-separations from strategy + JCI background knowledge For example: ∀i ∈ I, ∀j ∈ X : (Xj R) ∧ (Xj Ii ) “Standard variables cannot cause dummy variables” Example I1 X1 R I1 X1 R I1 X1 R Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 16 / 21
  • 43. Part IV Evaluation Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 17 / 21
  • 44. Simulated data accuracy: example Precision Recall curve Ancestral (“causes”) relations Non-ancestral (“not causes”) Precision Recall curves of 2000 randomly generated causal graphs for 4 system variables and 3 interventions ACID-JCI substantially improves accuracy w.r.t. merging learnt structures (merged ACI) Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 18 / 21
  • 45. Conclusion Joint Causal Inference, a powerful formulation of causal discovery over multiple datasets for constraint-based methods Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 19 / 21
  • 46. Conclusion Joint Causal Inference, a powerful formulation of causal discovery over multiple datasets for constraint-based methods A simple strategy for dealing with faithfulness violations due to functionally determined relations Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 19 / 21
  • 47. Conclusion Joint Causal Inference, a powerful formulation of causal discovery over multiple datasets for constraint-based methods A simple strategy for dealing with faithfulness violations due to functionally determined relations An implementation, ACID-JCI, that substantially improves the accuracy w.r.t. state-of-the-art Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 19 / 21
  • 48. Conclusion Joint Causal Inference, a powerful formulation of causal discovery over multiple datasets for constraint-based methods A simple strategy for dealing with faithfulness violations due to functionally determined relations An implementation, ACID-JCI, that substantially improves the accuracy w.r.t. state-of-the-art Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 19 / 21
  • 49. Conclusion Joint Causal Inference, a powerful formulation of causal discovery over multiple datasets for constraint-based methods A simple strategy for dealing with faithfulness violations due to functionally determined relations An implementation, ACID-JCI, that substantially improves the accuracy w.r.t. state-of-the-art Future work: Improve scalability (now max 7 variables in C). Working paper: https://arxiv.org/abs/1611.10351 Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 19 / 21
  • 50. Conclusion Joint Causal Inference, a powerful formulation of causal discovery over multiple datasets for constraint-based methods A simple strategy for dealing with faithfulness violations due to functionally determined relations An implementation, ACID-JCI, that substantially improves the accuracy w.r.t. state-of-the-art Future work: Improve scalability (now max 7 variables in C). Working paper: https://arxiv.org/abs/1611.10351 Collaborators: Tom Claassen Joris M. Mooij Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 19 / 21
  • 51. References I Claassen, T. and Heskes, T. (2011). A logical characterization of constraint-based causal discovery. In UAI 2011, pages 135–144. Eaton, D. and Murphy, K. (2007). Exact Bayesian structure learning from uncertain interventions. In Proceedings of the 10th International Conference on Artificial Intelligence and Statistics (AISTATS), pages 107–114. Magliacane, S., Claassen, T., and Mooij, J. M. (2016). Ancestral causal inference. In NIPS. Mooij, J. M. and Heskes, T. (2013). Cyclic causal discovery from continuous equilibrium data. In Nicholson, A. and Smyth, P., editors, Proceedings of the 29th Annual Conference on Uncertainty in Artificial Intelligence (UAI-13), pages 431–439. AUAI Press. Spirtes, P., Glymour, C., and Scheines, R. (2000). Causation, Prediction, and Search. MIT press, 2nd edition. Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 20 / 21
  • 52. Ancestral Causal Inference (ACI) Weighted list of inputs: I = {(ij , wj )}: E.g. I = { (Y ⊥⊥ Z | X, 0.2), (Y ⊥⊥ X, 0.1)} } Any consistent weighting scheme, e.g. frequentist, Bayesian For any possible ancestral structure C, we define the loss function: Loss(C, I) := (ij ,wj )∈I: ij is not satisfied in C wj Here: “ij is not satisfied in C” = defined by ancestral reasoning rules For each possible causal relation X Y provide score: Conf (X Y ) = min C∈C Loss(C, I + (X Y , ∞)) − min C∈C Loss(C, I + (X Y , ∞)) Sara Magliacane (VU, UvA) Joint causal inference 10-12-2016 21 / 21