SlideShare a Scribd company logo
1 of 28
1
http://chalearn.org/
Causality and
Graph Reconstruction
MLconf 2015
Isabelle Guyon, ChaLearn
2
Motivation
BIG data makes lots of BIG promises, but…
… will the promises be held?
DIFFICULTY
VALUE
Classical statistics Machine learning
What
happened?
How
happened?
Explicative
power
Forecasting
power
http://chalearn.org/
Decisional
power
What will
happen?
What is
Causal graph reconstruction? http://chalearn.org/
4
Problem setting http://chalearn.org/
A
F
I
H
E
B
D G
C
J
INPUT
OUTPUT
5
Causal questions http://chalearn.org/
actions
…your health?
…climate
changes?… the economy?
What affects…
6
Scientific method http://chalearn.org/
7
Thanks to Jonas Peters for this example
Observe correlations http://chalearn.org/
8
Hypothesize causal relationships http://chalearn.org/
Thanks to Jonas Peters for this example
9
Hypothesize causal relationships http://chalearn.org/
Thanks to Jonas Peters for this example
10
Hypothesize causal relationships http://chalearn.org/
Chocolate Nobel
Chocolate Nobel
Chocolate Nobel
Chocolate Nobel
?
11
“Please test your
researchers for ten
years: Randomly pick
half of them and give
them chocolate for
desert and give apples
to the other half. Then
compare the number
of Nobel prizes in the
two populations.”
Perform randomized
controlled experiments http://chalearn.org/
12







How far can we get
to improve causal hypotheses …
… to minimize the need for experiments?
13
• Pioneer work: Glymour, Scheines, Spirtes, Pearl (Turing
Award, 2011), Rubin, in the US, since the 80’s.
• New wave: Hyvärinen, Schölkopf, Bühlmann in the EU.
• Nobel prizes in econometrics: Haavelmo (1989),
Granger (2003), Sargent and Sims (2011).
• DARPA programs: Big mechanisms (2014), upcoming
program (Schwartz, program manager).
Landmark work http://chalearn.org/
14
Game changing work:
Causality challenges http://chalearn.org/
Cause-Effect Pairs (2013)
Neural Connectomics (2014)
Causation and Prediction (2007)
Pot-luck challenge (2008)
15
To make a long story short… http://chalearn.org/
1. Discovering dependencies: easiest = classical
feature selection. Hard to beat!
2. Removing spurious dependencies: harder and
“dangerous” because removing good features
is more harmful than keeping bad ones.
3. Orienting dependencies: hardest.
16
Cause-effect pair challenge
(2013) http://chalearn.org/
Initial impulse: Joris Mooij, Dominik Janzing, and Bernhard Schölkopf.
Examples of algorithms and data: Povilas Daniušis, Arthur Gretton,
Patrik O. Hoyer, Dominik Janzing, Antti Kerminen, Joris Mooij, Jonas
Peters, Bernhard Schölkopf, Shohei Shimizu, Oliver Stegle, and Kun
Zhang, Jakob Zscheischler.
Datasets and result analysis: Isabelle Guyon + Mehreen Saeed +
{Mikael Henaff, Sisi Ma, and Alexander Statnikov}, from NYU.
Website and sample code: Isabelle Guyon +
Phase 1: Ben Hamner (Kaggle) https://www.kaggle.com/c/cause-
effect-pairs
Phase 2: Ivan Judson, Christophe Poulain, Evelyne Viegas,
Michael Zyskowski https://www.codalab.org/competitions/1381
Review, testing: Marc Boullé,
Hugo Jair Escalante, Frederick Eberhardt,
Seth Flaxman, Patrik Hoyer,
Dominik Janzing, Richard Kennaway,
Vincent Lemaire, Joris Mooij,
Jonas Peters, Florin Popescu, Peter Spirtes,
Ioannis Tsamardinos, Jianxin Yin, Kun Zhang.
Mehreen
Evelyne
Joris Dominik
Bernhard
Kun
Ben
Alexander
Marc
17
Problem setting http://chalearn.org/
A
F
I
H
E
B
D G
C
J
INPUT
OUTPUT
18
Problem setting http://chalearn.org/
A
F
I
H
E
B
D G
C
J
INPUT
OUTPUT
A -> B ?
0 / 1
19
B =Temperature
A = log(Altitude)
A  B ? http://chalearn.org/
20
A  B A  B
Best fit: A  B http://chalearn.org/
21
The data:
A
B
Z
A  B
B
A
Z
A <- B
A B
Z
ZBZA
A  Z  B
A B A | B
Demographics:
Sex  Height
Age  Wages
Country  Education
Latitude  Infant mortality
Ecology:
City elevation  Temperature
Water level  Algal frequency
Elevation  Vegetation
Dist. to hydrology  Fire
Econometrics:
Mileage  Car resell price
Num.rooms  House price
Trade price last day  Trade price
Medicine:
Cancer vol.  Recurrence
Metastasis  Prognosis
Age  Blood pressure
Genomics (mRNA level):
transcription factor  protein
induced
Engineering:
Car model year  Horsepower
Number of cylinders  MPG
Cache memory  Compute power
Roof area  Heating load
Cement used  Compressive strength
20% 80%
http://chalearn.org/
22
0.5
0.55
0.6
0.65
0.7
0.75
0.8
0.85
267
test
The results:
http://chalearn.org/
23
Amazing: an operational
causation coefficient!
http://chalearn.org/
24
Neural connectomics
Challenge (2014) http://chalearn.org/
Coordinator:
Isabelle Guyon
Data Providers:
Demian Battaglia
Javier Orlandi
Jordi Soriano Fradera
Olav Stetter
Advisors:
Gavin Cawley
Gideon Dror
Hugo-Jair Escalante
Alice Guyon
Vincent Lemaire
Sisi Ma
Eric Peskin
Florin Popescu
Bisakha Ray,
Mehreen Saeed
Alexander Statnikov
Demian
Olav
Jordi
Javier
Bisakha
Mehreen
25
Problem setting http://chalearn.org/
A
F
I
H
E
B
D G
C
J
INPUT
OUTPUT
26
Network deconvolution http://chalearn.org/
27
Conclusion
• Causal models:
– Better explain data.
– Make decisions.
• Challenges:
– Fair evaluations.
– Innovation.
• Machine Learning:
– Novel approaches to causal discovery.
– Operational “causation coefficient”:
• First detect oriented pairs, then prune indirect effects and confounders.
• First build undirected graph, then orient edges.
28
http://chalearn.org/
Fully automatic machine learning
without ANY human intervention
automl.chalearn.org
December 2014 – January 2016
$30,000 in prizes
Thank you!
AutoML Challenge

More Related Content

More from MLconf

Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceMLconf
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...MLconf
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...MLconf
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMLconf
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionMLconf
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLMLconf
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksMLconf
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...MLconf
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldMLconf
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...MLconf
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...MLconf
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...MLconf
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeMLconf
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...MLconf
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareMLconf
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesMLconf
 
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...MLconf
 
Niels Bantilan - Augmenting Mental Health Care in the Digital Age: Machine Le...
Niels Bantilan - Augmenting Mental Health Care in the Digital Age: Machine Le...Niels Bantilan - Augmenting Mental Health Care in the Digital Age: Machine Le...
Niels Bantilan - Augmenting Mental Health Care in the Digital Age: Machine Le...MLconf
 
LeAnna Kent - Using Network Analysis to Detect Kickback Schemes Among Medical...
LeAnna Kent - Using Network Analysis to Detect Kickback Schemes Among Medical...LeAnna Kent - Using Network Analysis to Detect Kickback Schemes Among Medical...
LeAnna Kent - Using Network Analysis to Detect Kickback Schemes Among Medical...MLconf
 
Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for...
Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for...Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for...
Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for...MLconf
 

More from MLconf (20)

Josh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious ExperienceJosh Wills - Data Labeling as Religious Experience
Josh Wills - Data Labeling as Religious Experience
 
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
Vinay Prabhu - Project GaitNet: Ushering in the ImageNet moment for human Gai...
 
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
Jekaterina Novikova - Machine Learning Methods in Detecting Alzheimer’s Disea...
 
Meghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the CheapMeghana Ravikumar - Optimized Image Classification on the Cheap
Meghana Ravikumar - Optimized Image Classification on the Cheap
 
Noam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data CollectionNoam Finkelstein - The Importance of Modeling Data Collection
Noam Finkelstein - The Importance of Modeling Data Collection
 
June Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of MLJune Andrews - The Uncanny Valley of ML
June Andrews - The Uncanny Valley of ML
 
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection TasksSneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
Sneha Rajana - Deep Learning Architectures for Semantic Relation Detection Tasks
 
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
Anoop Deoras - Building an Incrementally Trained, Local Taste Aware, Global D...
 
Vito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI WorldVito Ostuni - The Voice: New Challenges in a Zero UI World
Vito Ostuni - The Voice: New Challenges in a Zero UI World
 
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
Anna choromanska - Data-driven Challenges in AI: Scale, Information Selection...
 
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
Janani Kalyanam - Machine Learning to Detect Illegal Online Sales of Prescrip...
 
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
Esperanza Lopez Aguilera - Using a Bayesian Neural Network in the Detection o...
 
Neel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to codeNeel Sundaresan - Teaching a machine to code
Neel Sundaresan - Teaching a machine to code
 
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
Rishabh Mehrotra - Recommendations in a Marketplace: Personalizing Explainabl...
 
Soumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better SoftwareSoumith Chintala - Increasing the Impact of AI Through Better Software
Soumith Chintala - Increasing the Impact of AI Through Better Software
 
Roy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime ChangesRoy Lowrance - Predicting Bond Prices: Regime Changes
Roy Lowrance - Predicting Bond Prices: Regime Changes
 
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...
Madalina Fiterau - Hybrid Machine Learning Methods for the Interpretation and...
 
Niels Bantilan - Augmenting Mental Health Care in the Digital Age: Machine Le...
Niels Bantilan - Augmenting Mental Health Care in the Digital Age: Machine Le...Niels Bantilan - Augmenting Mental Health Care in the Digital Age: Machine Le...
Niels Bantilan - Augmenting Mental Health Care in the Digital Age: Machine Le...
 
LeAnna Kent - Using Network Analysis to Detect Kickback Schemes Among Medical...
LeAnna Kent - Using Network Analysis to Detect Kickback Schemes Among Medical...LeAnna Kent - Using Network Analysis to Detect Kickback Schemes Among Medical...
LeAnna Kent - Using Network Analysis to Detect Kickback Schemes Among Medical...
 
Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for...
Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for...Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for...
Liliana Cruz Lopez - Deep Reinforcement Learning based Insulin Controller for...
 

Recently uploaded

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 

Recently uploaded (20)

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 

Isabelle Guyon, President, ChaLearn at MLconf SF - 11/13/15

  • 2. 2 Motivation BIG data makes lots of BIG promises, but… … will the promises be held? DIFFICULTY VALUE Classical statistics Machine learning What happened? How happened? Explicative power Forecasting power http://chalearn.org/ Decisional power What will happen?
  • 3. What is Causal graph reconstruction? http://chalearn.org/
  • 5. 5 Causal questions http://chalearn.org/ actions …your health? …climate changes?… the economy? What affects…
  • 7. 7 Thanks to Jonas Peters for this example Observe correlations http://chalearn.org/
  • 8. 8 Hypothesize causal relationships http://chalearn.org/ Thanks to Jonas Peters for this example
  • 9. 9 Hypothesize causal relationships http://chalearn.org/ Thanks to Jonas Peters for this example
  • 10. 10 Hypothesize causal relationships http://chalearn.org/ Chocolate Nobel Chocolate Nobel Chocolate Nobel Chocolate Nobel ?
  • 11. 11 “Please test your researchers for ten years: Randomly pick half of them and give them chocolate for desert and give apples to the other half. Then compare the number of Nobel prizes in the two populations.” Perform randomized controlled experiments http://chalearn.org/
  • 12. 12        How far can we get to improve causal hypotheses … … to minimize the need for experiments?
  • 13. 13 • Pioneer work: Glymour, Scheines, Spirtes, Pearl (Turing Award, 2011), Rubin, in the US, since the 80’s. • New wave: Hyvärinen, Schölkopf, Bühlmann in the EU. • Nobel prizes in econometrics: Haavelmo (1989), Granger (2003), Sargent and Sims (2011). • DARPA programs: Big mechanisms (2014), upcoming program (Schwartz, program manager). Landmark work http://chalearn.org/
  • 14. 14 Game changing work: Causality challenges http://chalearn.org/ Cause-Effect Pairs (2013) Neural Connectomics (2014) Causation and Prediction (2007) Pot-luck challenge (2008)
  • 15. 15 To make a long story short… http://chalearn.org/ 1. Discovering dependencies: easiest = classical feature selection. Hard to beat! 2. Removing spurious dependencies: harder and “dangerous” because removing good features is more harmful than keeping bad ones. 3. Orienting dependencies: hardest.
  • 16. 16 Cause-effect pair challenge (2013) http://chalearn.org/ Initial impulse: Joris Mooij, Dominik Janzing, and Bernhard Schölkopf. Examples of algorithms and data: Povilas Daniušis, Arthur Gretton, Patrik O. Hoyer, Dominik Janzing, Antti Kerminen, Joris Mooij, Jonas Peters, Bernhard Schölkopf, Shohei Shimizu, Oliver Stegle, and Kun Zhang, Jakob Zscheischler. Datasets and result analysis: Isabelle Guyon + Mehreen Saeed + {Mikael Henaff, Sisi Ma, and Alexander Statnikov}, from NYU. Website and sample code: Isabelle Guyon + Phase 1: Ben Hamner (Kaggle) https://www.kaggle.com/c/cause- effect-pairs Phase 2: Ivan Judson, Christophe Poulain, Evelyne Viegas, Michael Zyskowski https://www.codalab.org/competitions/1381 Review, testing: Marc Boullé, Hugo Jair Escalante, Frederick Eberhardt, Seth Flaxman, Patrik Hoyer, Dominik Janzing, Richard Kennaway, Vincent Lemaire, Joris Mooij, Jonas Peters, Florin Popescu, Peter Spirtes, Ioannis Tsamardinos, Jianxin Yin, Kun Zhang. Mehreen Evelyne Joris Dominik Bernhard Kun Ben Alexander Marc
  • 18. 18 Problem setting http://chalearn.org/ A F I H E B D G C J INPUT OUTPUT A -> B ? 0 / 1
  • 19. 19 B =Temperature A = log(Altitude) A  B ? http://chalearn.org/
  • 20. 20 A  B A  B Best fit: A  B http://chalearn.org/
  • 21. 21 The data: A B Z A  B B A Z A <- B A B Z ZBZA A  Z  B A B A | B Demographics: Sex  Height Age  Wages Country  Education Latitude  Infant mortality Ecology: City elevation  Temperature Water level  Algal frequency Elevation  Vegetation Dist. to hydrology  Fire Econometrics: Mileage  Car resell price Num.rooms  House price Trade price last day  Trade price Medicine: Cancer vol.  Recurrence Metastasis  Prognosis Age  Blood pressure Genomics (mRNA level): transcription factor  protein induced Engineering: Car model year  Horsepower Number of cylinders  MPG Cache memory  Compute power Roof area  Heating load Cement used  Compressive strength 20% 80% http://chalearn.org/
  • 23. 23 Amazing: an operational causation coefficient! http://chalearn.org/
  • 24. 24 Neural connectomics Challenge (2014) http://chalearn.org/ Coordinator: Isabelle Guyon Data Providers: Demian Battaglia Javier Orlandi Jordi Soriano Fradera Olav Stetter Advisors: Gavin Cawley Gideon Dror Hugo-Jair Escalante Alice Guyon Vincent Lemaire Sisi Ma Eric Peskin Florin Popescu Bisakha Ray, Mehreen Saeed Alexander Statnikov Demian Olav Jordi Javier Bisakha Mehreen
  • 27. 27 Conclusion • Causal models: – Better explain data. – Make decisions. • Challenges: – Fair evaluations. – Innovation. • Machine Learning: – Novel approaches to causal discovery. – Operational “causation coefficient”: • First detect oriented pairs, then prune indirect effects and confounders. • First build undirected graph, then orient edges.
  • 28. 28 http://chalearn.org/ Fully automatic machine learning without ANY human intervention automl.chalearn.org December 2014 – January 2016 $30,000 in prizes Thank you! AutoML Challenge