The document discusses tools and methods for improving reproducibility in research, including open data and open source tools. It summarizes that less than 30% of published studies are reproducible due to lack of sharing of data, code, and workflows. It promotes hosting research objects like data, images, and workflows to make them accessible and citable. Specific tools mentioned include GigaDB for data, OMERO for images, Galaxy and executable documents for workflows, and virtual machines for replicable computational environments.
12. Re-producing Images
· Image LIMS
· Keeps metadata with image
· Means the image can be
found later!
· Image can be understood
· Also some processing
options
http://www.openmicroscopy.org/site/products/omero
13. Accessible, transparent Images
· Embed in web
· Full res
· View without special
software
· Adjust contrast etc
· Link all images to pub!
· No cherry picking!
http://www.openmicroscopy.org/site/products/omero
15. Accessible Cyber-Centipede images
OMERO: providing
access to imaging data
View, filter, measure raw
images with direct links
from journal article.
See all image data, not
just cherry picked
examples.
Download and reprocess.
20. Implement workflows in a community-accepted format
http://galaxyproject.org
Open source
Over 45,000 main
Galaxy server users
Over 1,000 papers
citing Galaxy use
Over 55 Galaxy
servers deployed
21. Implement workflows in an intuitive format
Tool list TooCol ppyraighrta NmBAFe-Bt 2e0r1i3sation Results panel
24. Birmingham Metabo-Galaxy
Tools wrapped in Python and XML
User sees web form (easy!)
Data stored centrally (secure!)
Work done centrally (easy update)
31. Open lab books, dynamic documents
• Facilitate reuse and sharing with tools like: Knitr, Sweave,
iPython Notebook
Sweave
• Working towards executable papers…
34. Some testimonials for Knitr
Authors (Wolfgang Huber)
“I do all my projects in Knitr. Having the textual explanation,
the associated code and the results all in one place really
increases productivity, and helps explaining my analyses to
colleagues, or even just to my future self.”
Reviewers (Christophe Pouzat)
“It took me a couple of hours to get the data, the few custom
developed routines, the “vignette” and to REPRODUCE
EXACTLY the analysis presented in the manuscript. With few
more hours, I was able to modify the authors’ code to change
their Fig. 4. In addition to making the presented research
trustworthy, the reproducible research paradigm definitely
makes the reviewer’s job much more fun!
41. Share data in GigaDB
Share all images in GigaDB
-View images via OMERO
Share code in GigaDB!
Share pipeline using:
Executable docs!
Galaxy!
VMs!
42. Improve
reproducibility!
Give us data, papers &
pipelines*
Contact us:
scott@gigasciencejournal.com
editorial@gigasciencejournal.com
database@gigasciencejournal.com
* APC’s currently generously covered
by BGI until 2015
www.gigasciencejournal.com
43. team: Our collaborators: Case study:
Ruibang Luo (BGI/HKU)
Shaoguang Liang (BGI-SZ)
Tin-Lap Lee (CUHK)
Qiong Luo (HKUST)
Senghong Wang (HKUST)
Yan Zhou (HKUST)
Thanks to:
Funding from: CBIIT
@gigascience
facebook.com/GigaScience
blogs.biomedcentral.com/gigablog/
Peter Li
Huayan Gao
Chris Hunter
Jesse Si Zhe
Nicole Nogoy
Laurie Goodman
Amye Kenall (BMC)
Marco Roos (LUMC)
Mark Thompson (LUMC)
Jun Zhao (Lancaster)
Susanna Sansone (Oxford)
Philippe Rocca-Serra (Oxford)
Alejandra Gonzalez-Beltran (Oxford)
www.gigadb.org
galaxy.cbiit.cuhk.edu.hk
www.gigasciencejournal.com
Hinweis der Redaktion
Over 20,000 users on the main server
Over 500 papers citing the use of Galaxy
Over 55 servers deployed on the Web
That just leaves me to thank the GigaScience team: Laurie, Scott, Alexandra, Peter and Jesse, BGI for their support - specifically Shaoguang for IT and bioinformatics support – our collaborators on the database, website and tools: Tin-Lap, Qiong, Senhong, Yan, the Cogini web design team, Datacite for providing the DOI service and the isacommons team for their support and advocacy for best practice use of metadata reporting and sharing.
Thank you for listening.