Workshop talk from:
https://mltrain.cc/events/enabling-reproducibility-in-machine-learning-mltrainrml-icml-2018/
Thoughts on the challenges of reproducibility in ML and computational sciences, and some engineering solutions based on my experience writing scikit-learn for the last 8 years.
ICML 2018 Reproducible Machine Learning - A. Gramfort
1. Reproducible ML:
software challenges, anecdotes and
some engineering solutions
Alexandre Gramfort
http://alexandre.gramfort.net
GitHub : @agramfort Twitter : @agramfort
2. FreeSurfer: popular software for extracting features from MRI
(e.g. cortical thickness used to predict Alzheimer’s disease, etc.)
https://surfer.nmr.mgh.harvard.edu/
3. FreeSurfer: popular software for extracting features from MRI
(e.g. cortical thickness used to predict Alzheimer’s disease, etc.)
https://surfer.nmr.mgh.harvard.edu/
4. FreeSurfer: popular software for extracting features from MRI
(e.g. cortical thickness used to predict Alzheimer’s disease, etc.)
Hardware and software
differences can lead to different
features / statistical results and
scientific conclusions
https://surfer.nmr.mgh.harvard.edu/
13. Alex Gramfort Reproducible ML: challenges and some engineering solutions
Do not reinvent the wheel…
7
#JSM2016Jake VanderPlas
We provide one
component in the
Python ecosystem
14. Alex Gramfort Reproducible ML: challenges and some engineering solutions
Do not reinvent the wheel…
7
#JSM2016Jake VanderPlas
We provide one
component in the
Python ecosystem
Code reuse and
tight community
Bigger user base
35. Alex Gramfort Reproducible ML: challenges and some engineering solutions
Wrapping up
24
• Even hardware/software replication is hard and costly
36. Alex Gramfort Reproducible ML: challenges and some engineering solutions
Wrapping up
24
• Even hardware/software replication is hard and costly
• Disclaimer: Not every problem has an engineering solution
37. Alex Gramfort Reproducible ML: challenges and some engineering solutions
Wrapping up
24
• Even hardware/software replication is hard and costly
Sphinx-Gallery
• Yet, technology and engineering can make ML more replicable
• Modern science is Open Science
• Disclaimer: Not every problem has an engineering solution