E research overview gahegan bioinformatics workshop 2010
1. eResearch: the evolution of science Mark Gahegan Center for eResearch The University of Auckland
2.
3. The data explosion (from Wired ‘Big Data, July 2008) Terabytes What it stores 1 2,600 songs Large hard disk ($200) 20 Photos uploaded to FaceBook every month 120 All the data collected by the Hubble telescope 330 Weekly data produced by the Large Hadron Collider (est.) 440 All the international climate / weather data compiled by the National Climatic Data Center in the USA 530 All the videos in YouTube 1000 (1 petabyte) Data processed by Google servers every 72 minutes
4. Sarah E. Fratesi, 2008 Journal of Research Practice Volume 4, Issue 1, Article M1, Scientific Journals as Fossil Traces of Sweeping Change in the Structure and Practice of Modern Geology
9. Reproducible, or rather “fully supported”, Transparent science, Composite research components Methods Lab Books Preprints Data Video Blogs Podcasts Codes Algorithms Models Presentations Ontologies Intermediate Results Related Articles Comments & Reviews Plans Models Carole Goble, UK eScience
10. Connections run both ways… Methods Lab Books Preprints Data Video Blogs Podcasts Codes Algorithms Models Presentations Ontologies Intermediate Results Related Articles Comments & Reviews Carole Goble, UK eScience
11. Virtual Research Environments Support for knowledge communities Social networks of collaboration, use cases, Emergent trends and patterns
18. Example 1 Fossils and climate: Paleo-Integration (Community and data integration) Graphic Correlation Database PGAP PaleoIntegration Project Allister Rees, University of Arizona
19. Architecture —simplified 3-tier architecture: Front - user interface (computer terminal, user-friendly search terms and tools) Back - databases (schema, ontology coding - age, geography, content) Middleware - translates user-selected parameters for database searches - keeps track of user selections (workflow), so a modified search doesn’t mean “starting over” - routes user requests to different software components (e.g. data query, spatial data conversion), bringing results from multiple databases and tools together on one screen How?
20. Early Jurassic Climates, Vegetation, and Dinosaur Distributions Integration of various data, datasets and databases Download search results, analyze and interpret data Fossil collection and publication Publish new results and interpretations?
25. Example 2: Earthquake simulation (data integration & HPC) GEON SYNSEIS Integration Platform Dogan Seber, SDSC Seismic GEON portal and HPC Environment Gravity Magnetic Simulation, Analyses and Integration Scientific Discoveries Internal and External Datasets Subsurface Model
Hinweis der Redaktion
Reproducible Science
Some might be the teams and some might be publicly available. Journal of Visualised Experiments Workflows are methods Blue are outputs Yellow is publishing Pink are third party commentary and relationship to third parties Pointing to all of these. And each can also point to a range of articles.
Some might be the teams and some might be publicly available. Journal of Visualised Experiments Think the pebble business Points the other way too. It’s an open linked data web of science.
Research is becoming more interdisciplinary and that the teams are becoming more distributed - There is therefore an substantial investment being made in e-research and VREs as a means of supporting this; There continues to be an awesome amount of investment in e-Science, cyberinfrastructure and web-based infrastructure to support scientific collaboration. This was the whole point of the UK’s eScience programme (£240million) and the US Cyberinfrastructure program Research Information Centre (Microsoft and British Library) is a small step. How does the British Library plan to support internationally distributed, sometimes collaborating yet sometimes competing, multi and cross disciplinary researchers?