SlideShare ist ein Scribd-Unternehmen logo
1 von 42
Downloaden Sie, um offline zu lesen
brian m. bot | sage bionetworks | @BrianMBot
mayo clinic - 2015 jan 28
tools for reproducible research
in an increasingly digital world
sage bionetworks
~40 FTEs
1/2 research - 1/3 platform - 1/6 leadership/support
sage bionetworks
focused on a world where biomedical research
will fundamentally change to be
more open and collaborative
production
distribution
aggregation
6%
21%
8%
11%
54%
cannot
reproduce
can
reproduce in
principle
can reproduce
w/discrepancies
can reproduce from
processed data w/
discrepancies
can
reproduce
partially
the status quo tolerates
poor communication of
findings
Ioannidis A. et al. Nature Genetics 2009
208,294,724
datapoints
124 pages
supplemental material
?? lines
unobtainable source code
?? version or architecture of
statistical analysis program (R)
enumerable R packages
and package dependencies
key R package “ClaNC”
no longer available
1231 citations
often what is in principle
reproducible, is not
practically reproducible
unidentified publication
‣ from journal with 5 year impact factor of 27
‣ article freely available for download
‣ data freely available for download
“Scientists often study the past as
obsessively as historians because few
other professions depend so acutely on it.
Every experiment is a conversation with
a prior experiment,
every new theory a refutation of the old”
-Siddhartha Mukherjee, The Emperor of All Maladies
scientific method
1. define a question
2. gather information and resources (background research)
3. form a hypothesis
8. retest (frequently done by other scientists)
4. test hypothesis experimentally
5. analyze experimental data
7. publish results
6. draw conclusions based on data
7. publish results
finitein
∞
...
conducting research for others to consume
(even if the ‘other’ is future you)
reproducible research
tools for reproducible research
code
data
analysis
tools for reproducible research
code
version control
tools for reproducible research
code
version control
client-server
distributed
tools for reproducible research
code
version control
client-server
(e.g svn, cvs)
tools for reproducible research
code
version control
client-server
distributed
tools for reproducible research
code
version control
(e.g git, mercurial)
distributed
tools for reproducible research
code
version control
distributed
tools for reproducible research
code
data
analysis
tools for reproducible research
data
generic
domain
repositories
results
tools for reproducible research
data
digital object identifier (doi)
a unique identifier which remains fixed over the
lifetime of a web-accessible object
metadata, including the object’s location, is stored in
association with the doi and may change over time
referring to an online document by its doi provides
more stable linking than simply referring to a url
tools for reproducible research
code
data
analysis
tools for reproducible research
analysis
R Sweave knitr
great if you know LaTeX
tools for reproducible research
analysis
R Sweave knitr
great if you are lazy
(like me)
tools for reproducible research
analysis
knitr
# Hello World Title
### Author: Brian M. Bot
This is a narrative with inline
code execution to tell me that pi
is equal to `r pi`. And a plot to
show a simple function.
```{r}
x <- 1:100
y <- log(x)/x
plot(x,y)
```
tools for reproducible research
analysis
knitr
# Hello World Title
### Author: Brian M. Bot
This is a narrative with inline
code execution to tell me that pi
is equal to `r pi`. And a plot to
show a simple function.
```{r}
x <- 1:100
y <- log(x)/x
plot(x,y)
```
tools for reproducible research
analysis
ipython
notebook
tools for reproducible research
other tools
galaxy
docker
packrat
shiny
tools for reproducible research
other tools
enables sharing of all resources (data, code, results)
and their relationships to one another
tools for reproducible research
tools for reproducible research
tools for reproducible research
Go Hawks!
mayo clinic - 2015 jan 28
in an increasingly digital world
brian m. bot
——————
brian.bot@sagebase.org
@BrianMBot
sage bionetworks
tools for reproducible research

Weitere ähnliche Inhalte

Was ist angesagt?

Model Organism Linked Data
Model Organism Linked DataModel Organism Linked Data
Model Organism Linked Data
Michel Dumontier
 
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Michel Dumontier
 
Greene Bosc2008
Greene Bosc2008Greene Bosc2008
Greene Bosc2008
bosc_2008
 
Towards Knowledge Graphs of Reusable Research Software Metadata
Towards Knowledge Graphs of Reusable Research Software MetadataTowards Knowledge Graphs of Reusable Research Software Metadata
Towards Knowledge Graphs of Reusable Research Software Metadata
dgarijo
 
Software in the scientific literature: Problems with seeing, finding, and usi...
Software in the scientific literature: Problems with seeing, finding, and usi...Software in the scientific literature: Problems with seeing, finding, and usi...
Software in the scientific literature: Problems with seeing, finding, and usi...
James Howison
 
Tools and approaches for data deposition into nanomaterial databases
Tools and approaches for data deposition into nanomaterial databasesTools and approaches for data deposition into nanomaterial databases
Tools and approaches for data deposition into nanomaterial databases
Valery Tkachenko
 

Was ist angesagt? (20)

Model Organism Linked Data
Model Organism Linked DataModel Organism Linked Data
Model Organism Linked Data
 
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
 
MERmaid - a React WebGL-Based Tool for Exploring Spatially Resolved Single-Ce...
MERmaid - a React WebGL-Based Tool for Exploring Spatially Resolved Single-Ce...MERmaid - a React WebGL-Based Tool for Exploring Spatially Resolved Single-Ce...
MERmaid - a React WebGL-Based Tool for Exploring Spatially Resolved Single-Ce...
 
4A2B2C-2013
4A2B2C-20134A2B2C-2013
4A2B2C-2013
 
Link Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataLink Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked Data
 
Cheminformatics Workflows Using Mobile Apps for Drug Discovery
Cheminformatics Workflows Using Mobile Apps for Drug DiscoveryCheminformatics Workflows Using Mobile Apps for Drug Discovery
Cheminformatics Workflows Using Mobile Apps for Drug Discovery
 
Advancing the International Plant Names Index (IPNI)
Advancing the International Plant Names Index (IPNI) Advancing the International Plant Names Index (IPNI)
Advancing the International Plant Names Index (IPNI)
 
Reproducible research: practice
Reproducible research: practiceReproducible research: practice
Reproducible research: practice
 
UKON 2014
UKON 2014UKON 2014
UKON 2014
 
MLA CE Course: Third-Party PubMed Tools
MLA CE Course: Third-Party PubMed ToolsMLA CE Course: Third-Party PubMed Tools
MLA CE Course: Third-Party PubMed Tools
 
Third-Party PubMed Tools
Third-Party PubMed ToolsThird-Party PubMed Tools
Third-Party PubMed Tools
 
2016 bmdid-mappings
2016 bmdid-mappings2016 bmdid-mappings
2016 bmdid-mappings
 
A guided tour of Araport
A guided tour of AraportA guided tour of Araport
A guided tour of Araport
 
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
Metagenomic Data Provenance and Management using the ISA infrastructure --- o...
 
Greene Bosc2008
Greene Bosc2008Greene Bosc2008
Greene Bosc2008
 
Towards Knowledge Graphs of Reusable Research Software Metadata
Towards Knowledge Graphs of Reusable Research Software MetadataTowards Knowledge Graphs of Reusable Research Software Metadata
Towards Knowledge Graphs of Reusable Research Software Metadata
 
Software in the scientific literature: Problems with seeing, finding, and usi...
Software in the scientific literature: Problems with seeing, finding, and usi...Software in the scientific literature: Problems with seeing, finding, and usi...
Software in the scientific literature: Problems with seeing, finding, and usi...
 
Tools and approaches for data deposition into nanomaterial databases
Tools and approaches for data deposition into nanomaterial databasesTools and approaches for data deposition into nanomaterial databases
Tools and approaches for data deposition into nanomaterial databases
 
Beyond the PDF 2, 2013
Beyond the PDF 2, 2013Beyond the PDF 2, 2013
Beyond the PDF 2, 2013
 
ISMB Workshop 2014
ISMB Workshop 2014ISMB Workshop 2014
ISMB Workshop 2014
 

Ähnlich wie tools for reproducible research in an increasingly digital world

Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynote
Carole Goble
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
Sanjay Padhi, Ph.D
 
Reusable Software and Open Data To Optimize Agriculture
Reusable Software and Open Data To Optimize AgricultureReusable Software and Open Data To Optimize Agriculture
Reusable Software and Open Data To Optimize Agriculture
David LeBauer
 
Is that a scientific report or just some cool pictures from the lab? Reproduc...
Is that a scientific report or just some cool pictures from the lab? Reproduc...Is that a scientific report or just some cool pictures from the lab? Reproduc...
Is that a scientific report or just some cool pictures from the lab? Reproduc...
Greg Landrum
 

Ähnlich wie tools for reproducible research in an increasingly digital world (20)

Reproducible research: theory
Reproducible research: theoryReproducible research: theory
Reproducible research: theory
 
Sharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reportsSharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reports
 
The Role of Metadata in Reproducible Computational Research
The Role of Metadata in Reproducible Computational ResearchThe Role of Metadata in Reproducible Computational Research
The Role of Metadata in Reproducible Computational Research
 
The beauty of workflows and models
The beauty of workflows and modelsThe beauty of workflows and models
The beauty of workflows and models
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
Knowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems ScienceKnowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems Science
 
Mtsr2015 goble-keynote
Mtsr2015 goble-keynoteMtsr2015 goble-keynote
Mtsr2015 goble-keynote
 
Tag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh PlatformTag.bio: Self Service Data Mesh Platform
Tag.bio: Self Service Data Mesh Platform
 
Reusable Software and Open Data To Optimize Agriculture
Reusable Software and Open Data To Optimize AgricultureReusable Software and Open Data To Optimize Agriculture
Reusable Software and Open Data To Optimize Agriculture
 
Is that a scientific report or just some cool pictures from the lab? Reproduc...
Is that a scientific report or just some cool pictures from the lab? Reproduc...Is that a scientific report or just some cool pictures from the lab? Reproduc...
Is that a scientific report or just some cool pictures from the lab? Reproduc...
 
Pine education-platform
Pine education-platformPine education-platform
Pine education-platform
 
A Step Towards Reproducibility in R
A Step Towards Reproducibility in RA Step Towards Reproducibility in R
A Step Towards Reproducibility in R
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
BROWN BAG TALK WITH MICAH ALTMAN INTEGRATING OPEN DATA INTO OPEN ACCESS JOURNALS
BROWN BAG TALK WITH MICAH ALTMAN INTEGRATING OPEN DATA INTO OPEN ACCESS JOURNALSBROWN BAG TALK WITH MICAH ALTMAN INTEGRATING OPEN DATA INTO OPEN ACCESS JOURNALS
BROWN BAG TALK WITH MICAH ALTMAN INTEGRATING OPEN DATA INTO OPEN ACCESS JOURNALS
 
Standards and tools for model management in biomedical research
Standards and tools for model management in biomedical researchStandards and tools for model management in biomedical research
Standards and tools for model management in biomedical research
 
Abcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosasAbcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosas
 
Scott Edmunds at #GAMe2017: GigaGalaxy & publishing workflows for publishing ...
Scott Edmunds at #GAMe2017: GigaGalaxy & publishing workflows for publishing ...Scott Edmunds at #GAMe2017: GigaGalaxy & publishing workflows for publishing ...
Scott Edmunds at #GAMe2017: GigaGalaxy & publishing workflows for publishing ...
 
2019 03-11 bio it-world west genepattern notebook slides
2019 03-11 bio it-world west genepattern notebook slides2019 03-11 bio it-world west genepattern notebook slides
2019 03-11 bio it-world west genepattern notebook slides
 
RDVW Hands-on session: Python
RDVW Hands-on session: PythonRDVW Hands-on session: Python
RDVW Hands-on session: Python
 
Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.Data Communities - reusable data in and outside your organization.
Data Communities - reusable data in and outside your organization.
 

Mehr von Brian Bot

leveraging the web to make science more collaborative
leveraging the web to make science more collaborativeleveraging the web to make science more collaborative
leveraging the web to make science more collaborative
Brian Bot
 

Mehr von Brian Bot (20)

supporting communities in an increasingly decentralized biomedical research e...
supporting communities in an increasingly decentralized biomedical research e...supporting communities in an increasingly decentralized biomedical research e...
supporting communities in an increasingly decentralized biomedical research e...
 
decentralization: a trend in biomedical research
decentralization: a trend in biomedical researchdecentralization: a trend in biomedical research
decentralization: a trend in biomedical research
 
research participation as a social contract
research participation as a social contractresearch participation as a social contract
research participation as a social contract
 
20160811 Big Data for Health and Medicine
20160811 Big Data for Health and Medicine20160811 Big Data for Health and Medicine
20160811 Big Data for Health and Medicine
 
mobile technologies: riding the hype cycle together
mobile technologies: riding the hype cycle togethermobile technologies: riding the hype cycle together
mobile technologies: riding the hype cycle together
 
biomedical research in an increasingly digital world
biomedical research in an increasingly digital worldbiomedical research in an increasingly digital world
biomedical research in an increasingly digital world
 
Seattle-Denver VA Center for Innovation
Seattle-Denver VA Center for InnovationSeattle-Denver VA Center for Innovation
Seattle-Denver VA Center for Innovation
 
Mozilla Science Labs Berlin Meetup
Mozilla Science Labs Berlin MeetupMozilla Science Labs Berlin Meetup
Mozilla Science Labs Berlin Meetup
 
smartphones and surveys and sensors, oh my!
smartphones and surveys and sensors, oh my!smartphones and surveys and sensors, oh my!
smartphones and surveys and sensors, oh my!
 
Heart BD2K - mHealth
Heart BD2K - mHealthHeart BD2K - mHealth
Heart BD2K - mHealth
 
the beginnings of an open ecosystem in mHealth
the beginnings of an open ecosystem in mHealththe beginnings of an open ecosystem in mHealth
the beginnings of an open ecosystem in mHealth
 
biomedical research in an increasingly digital world
biomedical research in an increasingly digital worldbiomedical research in an increasingly digital world
biomedical research in an increasingly digital world
 
enabling communities of researchers working together across institutional bou...
enabling communities of researchers working together across institutional bou...enabling communities of researchers working together across institutional bou...
enabling communities of researchers working together across institutional bou...
 
biomedical research in an increasingly digital world
biomedical research in an increasingly digital worldbiomedical research in an increasingly digital world
biomedical research in an increasingly digital world
 
leveraging the web to make science more collaborative
leveraging the web to make science more collaborativeleveraging the web to make science more collaborative
leveraging the web to make science more collaborative
 
evaluating the quality of open access content
evaluating the quality of open access contentevaluating the quality of open access content
evaluating the quality of open access content
 
caveat emptor: what you need to know about online journals, open access, and ...
caveat emptor: what you need to know about online journals, open access, and ...caveat emptor: what you need to know about online journals, open access, and ...
caveat emptor: what you need to know about online journals, open access, and ...
 
enabling transparent, reproducible research
enabling transparent, reproducible researchenabling transparent, reproducible research
enabling transparent, reproducible research
 
infrastructure for communicating data-intensive science
infrastructure for communicating data-intensive scienceinfrastructure for communicating data-intensive science
infrastructure for communicating data-intensive science
 
tools for communicating in the computational sciences
tools for communicating in the computational sciencestools for communicating in the computational sciences
tools for communicating in the computational sciences
 

Kürzlich hochgeladen

Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Sérgio Sacani
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
Areesha Ahmad
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
Scintica Instrumentation
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
Silpa
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
Areesha Ahmad
 

Kürzlich hochgeladen (20)

Exploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfExploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdf
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
An introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingAn introduction on sequence tagged site mapping
An introduction on sequence tagged site mapping
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
Stages in the normal growth curve
Stages in the normal growth curveStages in the normal growth curve
Stages in the normal growth curve
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
Bacterial Identification and Classifications
Bacterial Identification and ClassificationsBacterial Identification and Classifications
Bacterial Identification and Classifications
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Velocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.pptVelocity and Acceleration PowerPoint.ppt
Velocity and Acceleration PowerPoint.ppt
 
Zoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdfZoology 5th semester notes( Sumit_yadav).pdf
Zoology 5th semester notes( Sumit_yadav).pdf
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 

tools for reproducible research in an increasingly digital world