Peter Li at GCC2014: A journal’s experiences of reproducing published data analyses
1. A journal’s experiences of reproducing published data analyses
Peter Li
peter@gigasciencejournal.com
2. Journal and database for large-scale data studies
Editor-in-Chief: Laurie Goodman
Executive Editor: Scott Edmunds
Commissioning Editor: Nicole Nogoy
GigaDB: Chris Hunter, Jesse Xiao
GigaGalaxy: Peter Li
in conjunction with
5. Reproducibility spectrum
• Publication only (not reproducible)
• Publication + data
• Publication + code and data
• Publication + linked and executable code and data
• Full replication (gold standard)
Adapted from Roger Peng (2011) Reproducible research in computational science. Science 334: 1226-1227.
9. Can the results in a GigaScience paper be replicated using Galaxy?
16. Downloaded pipeline is missing two tools for reproducibility
Downloaded pipeline: Short reads → KmerFreq_AR → Corrector_AR → SOAPdenovo2 → GapCloser → Scaffold seqs
Required pipeline: Short reads → KmerFreq_AR → Corrector_AR → SOAPdenovo2 → GapCloser → ExtractACGT → GAGE eval → Table 2 N50 & corrected N50 scores
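The gap between the two pipelines on this slide can be checked mechanically. The following is a minimal sketch, assuming each pipeline is written down as an ordered list of tool names; the tool names come from the slide, while the list representation and the `missing_tools` helper are hypothetical and not part of Galaxy or GigaGalaxy.

```python
# Tool names are copied from the slide; representing each pipeline as a
# plain list is an assumption made for illustration only.
downloaded = ["KmerFreq_AR", "Corrector_AR", "SOAPdenovo2", "GapCloser"]
required = [
    "KmerFreq_AR", "Corrector_AR", "SOAPdenovo2", "GapCloser",
    "ExtractACGT", "GAGE eval",
]


def missing_tools(required, available):
    """Return the required tools absent from `available`, in pipeline order."""
    have = set(available)
    return [tool for tool in required if tool not in have]


missing = missing_tools(required, downloaded)
print(missing)  # ['ExtractACGT', 'GAGE eval']
```

A check like this only compares names. It says nothing about tool versions or parameters, which also matter when reproducing a result.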
17. Need to add two extra tools into GigaGalaxy
Required pipeline: Short reads → KmerFreq_AR → Corrector_AR → SOAPdenovo2 → GapCloser → ExtractACGT → GAGE eval → Table 2 N50 & corrected N50 scores
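For readers unfamiliar with the statistic that the pipeline is meant to reproduce, here is a minimal sketch of the standard N50 calculation over a list of sequence lengths. The function name `n50` and the example lengths are illustrative assumptions; this is not the GAGE evaluation code, and the corrected N50 additionally depends on GAGE's error detection, which is not modelled here.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L together
    cover at least half of the total assembled bases.
    """
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length
    raise ValueError("need at least one sequence length")


# Illustrative lengths only, not data from the paper.
print(n50([80, 70, 50, 40, 30, 20]))  # 70
```

Here the total is 290, so the running sum first reaches half of it (145) at the second-longest sequence, giving an N50 of 70.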
22. Observations
• Complete scientific reproduction is difficult
– Time and effort required
• Requires help from authors
• Do we need education and training in scientific reproducibility?
25. Thanks to:
Our collaborators: Ruibang Luo (BGI/HKU), Shaoguang Liang (BGI-SZ), Tin-Lap Lee (CUHK), Qiong Luo (HKUST), Senghong Wang (HKUST), Yan Zhou (HKUST)
Team: Peter Li, Huayan Gao, Chris Hunter, Jesse Si Zhe, Nicole Nogoy, Laurie Goodman, Amye Kenall (BMC)
Case study: Marco Roos (LUMC), Mark Thompson (LUMC), Jun Zhao (Lancaster), Susanna Sansone (Oxford), Philippe Rocca-Serra (Oxford), Alejandra Gonzalez-Beltran (Oxford)
@gigascience
facebook.com/GigaScience
blogs.biomedcentral.com/gigablog/
www.gigadb.org
galaxy.cbiit.cuhk.edu.hk
www.gigasciencejournal.com
Funding from:
Editor's notes
Publication = Selective reporting?
For research involving computation
DOIs
Provide example of a GigaScience paper
Mention DOI for the paper itself
Highlight data set generated and its DOI