SlideShare a Scribd company logo
1 of 24
Download to read offline
1
Digital Science
Reproducibility and Visibility in Astronomy
José Enrique Ruiz on behalf of the Wf4Ever Team
SCIOPS 2013
ESAC, FRIDAY 13th SEPTEMBER 2013
2
Wf4Ever
Digital Science - Reproducibility and Visibility in Astronomy
1.  Intelligent Software Components (ISOCO, Spain)
2.  University of Manchester (UNIMAN, UK)
3.  Universidad Politécnica de Madrid (UPM, Spain)
4.  Poznan Supercomputing and Networking Centre (Poland)
5.  University of Oxford and OeRC (OXF, UK)
6.  Instituto Astrofísica Andalucía (IAA-CSIC, Spain)
7.  Leiden University Medical Centre (LUMC, NL)
4
Wf4Ever
Advanced Workflow Preservation Technologies for Enhanced Science
3
1
6
7
5
2
2011 - 2013
3
Astronomy research lifecycle is entirely digital
»  Observation proposals
»  Data reduction pipelines
»  Analysis of science ready data
»  Catalogs of objects and data archives
»  Publish process
›  Final data results
›  Experiment in DL
ADS/arXiv
Reproducible research is still not
possible in a digital world
A rich infrastructure of data is not
efficiently used
A normalized preservation of
methodology is needed
Tools
Astronomy Research Lifecycle
Digital Science - Reproducibility and Visibility in Astronomy
4
Reproducibility and The Scientific Method
Digital Science - Reproducibility and Visibility in Astronomy
http://xkcd.com/242/
Benefits
»  Publishing knowledge, not advertising
»  The author, the referee, the re-user
»  Reputation, prestige and respect
»  Higher quality of publications
›  Authors will be more careful
›  Many eyes to check results
Challenges
»  Hard and time consuming
»  Need incentives – not rewarded now
5
Reproducibility and The Scientific Method
Digital Science - Reproducibility and Visibility in Astronomy
I don’t know how!
6
Visibility, Efficiency and Reuse
Digital Science - Reproducibility and Visibility in Astronomy
Optimize return on investments made on big facilities
»  Avoid duplication of efforts and reinvention
»  How to discover and not duplicate ?
»  How to re-use and not duplicate ?
»  How to make use of best practices ?
»  How to use the rich infrastructure of data ?
»  Intellectual contribs are encoded in software
More data in archives does not imply more knowledge
»  Expose complete scientific record, not the story
»  Allow easy discovery of methods and tools
7
Visibility and Social Discovery
Digital Science - Reproducibility and Visibility in Astronomy
Paper discovery: the social dimension
8
The Executable Paper
Digital Science - Reproducibility and Visibility in Astronomy
Time has come to go beyond the PDF
9
Digital Astronomy in the Local Desktop
Digital Science - Reproducibility and Visibility in Astronomy
Going beyond automation
Organization!
10
Digital Astronomy in the Local Desktop
Workflows to Access and Massage VO Data
# CIG Vhel e_Vhel r_Vhel Dist MType e_MType OptAssym r_MType Bmag e_Bmag
1 7299.0 3.0 1 96.9 5.0 1.5 1 1 14.167 0.271 0.173 0.571 0.040 13.383
2 6983.0 6.0 2 94.7 6.0 1.5 0 1 15.722 0.324 0.255 0.278 0.031 15.157
3 4.0 1.5 0 1 16.057 0.507 0.246 0.354 15.457
4 2310.0 1.0 3 31.9 3.0 1.5 0 1 12.818 0.424 0.252 0.863 0.017 11.685
5 7865.0 10.0 3 105.9 0.0 1.5 0 1 15.602 0.364 0.225 0.131 0.118 15.128
72 5164.0 9.0 2 68.5 5.0 1.5 1 1 14.445 0.325 0.315 0.367 0.028 13.735
Capture !
Actions, Tasks, Dependencies, Provenance!
!
Improve !
Clarity and Reproducibility!
11
Scientific Workflows
Digital Science - Reproducibility and Visibility in Astronomy
Living Tutorials!
Templates for Re-use!
Expedite Training!
Reduce time to insight!
Avoid reinvention!
Digital Libraries of workflows may boost the use
of the existing infrastructure of data (VO) !
12
!
!
Software
›  Taverna
›  Kepler
›  Pegasus
›  Triana
›  ESO Reflex
Scientific Workflows
Digital Science - Reproducibility and Visibility in Astronomy
Related Initiatives
›  ER-Flow
›  VAMDC
›  HELIO
›  Cyber-SKA
›  IceCore
›  Montage
›  Astro-WISE
›  AstroGrid
IVOA
›  AstroGrid
›  Grid&WS WG
›  VO France Wf WG
Self descriptive WS
›  PDL
›  SimDAL, S3
13
!
!
AstroTaverna: Create, annotate and run a workflow
http://amiga.iaa.es/p/290-astrotaverna.htm
Astronomical Research Objects in Action
Digital Science - Reproducibility and Visibility in Astronomy
14
!
!
AstroTaverna: Create, annotate and run a workflow
http://amiga.iaa.es/p/290-astrotaverna.htm
Astronomical Research Objects in Action
Digital Science - Reproducibility and Visibility in Astronomy
15
The next generation of archives
Digital Science - Reproducibility and Visibility in Astronomy
Prof. Kevin Vinsen
ASKAP Datacubes
16
The next generation of archives
Digital Science - Reproducibility and Visibility in Astronomy
Prof. Kevin Vinsen
SKA Datacubes
17
The next generation of archives
Digital Science - Reproducibility and Visibility in Astronomy
Much wider FoV and spectral coverage
»  Large volumes for a single observed dataset
Automated surveys
»  Huge amounts of tabular data
We are moving into a world where
»  computing and storage are cheap
»  data movement is death
Extraction of scientifically relevant
info from a multiD param. space
»  Exploration services
»  Anomaly detection
»  Cross-matching data
»  Dimensionality reduction
Detailed inspection and
subset
»  Filtering
»  Extraction
»  Re-Projection
»  Analysis services
18
»  A cloud of Web Services
»  Archives speaking Web Services
Process should benefit of the same privileges acquired by data
Preserving the method ensures replication of final results at any moment
Archives should evolve from data providers into
»  Virtual Data providers
»  Software Tasks providers
Astronomy of multi archives/facilities/wavelength
Interconnected and interoperable archives
»  Data -> Virtual Observatory
»  Software Tasks
The next generation of archives
Digital Science - Reproducibility and Visibility in Astronomy
Preservation
The move computing to data paradigm
19
Research Objects
Digital Science - Reproducibility and Visibility in Astronomy
Distributed
Technical Objects Social Objects
Expose experimental context in a structured way in order to be understood
20
Research Objects
Digital Science - Reproducibility and Visibility in Astronomy
IPython Notebook solutions
»  Web-browser as the working desktop
»  Python code, plots and data, living with rich-text documentation
»  Cloud-based adaptive scalable computing environment
»  Fully shareable, re-usable and executable wikis
»  Social platform and Git versioning
21
Research Objects
Digital Science - Reproducibility and Visibility in Astronomy
ADSLabs
ADO Linked Components
»  Authors
»  Publications
»  Journals
»  Objects SIMBAD
»  Tabular data behind the plots CDS
»  ASCL reference of used software
»  Observing time Proposals
»  Used facilities, surveys or missions
http://labs.adsabs.harvard.edu/
Incentives
Similar Initiative to ESO Telbib!
22
!
!
The Incentive
Papers with data links are cited more than those without
Research Objects
Digital Science - Reproducibility and Visibility in Astronomy
Effect of E-printing on Citation Rates in Astronomy and Physics
2006. Edwin A. Henneken et al.
23
!
!
The Incentive
Papers with data links are cited more than those without
Research Objects
Digital Science - Reproducibility and Visibility in Astronomy
Effect of E-printing on Citation Rates in Astronomy and Physics
2006. Edwin A. Henneken et al.
24
Conclusions
Digital Science - Reproducibility and Visibility in Astronomy
»  Reproducibility is at the very heart of the scientific method
»  Improving visibility is key in order to avoid reinvention
»  Social dimension of science stressed in the discovery process
»  Highly specialized science needs re-use to achieve efficiency
»  In a digital world, publish decomposable executable papers
»  Capture provenance and structure in the local desktop
»  Scientific workflows go beyond automation: provide clarity and structure
»  Transfer rate is more than an issue for next generation of archives
»  The move computing to data paradigm -> back to old terminals
»  Process should benefit of the same privileges acquired by data
»  Digital libraries of web-services-based workflows
»  The distributed digital workflow-centric Research Object
»  Preserving knowledge - not only data or advertising
jer@iaa.es

More Related Content

What's hot

Love for science or 'Academic Prostitution' - DFD2014 version
Love for science or 'Academic Prostitution' - DFD2014 versionLove for science or 'Academic Prostitution' - DFD2014 version
Love for science or 'Academic Prostitution' - DFD2014 versionLourdes Verdes-Montenegro
 
A Biological Internet?: Eywa
A Biological Internet?: EywaA Biological Internet?: Eywa
A Biological Internet?: EywaEugene Siow
 
Big data at experimental facilities
Big data at experimental facilitiesBig data at experimental facilities
Big data at experimental facilitiesIan Foster
 
Introduction NL-HUG (April)
Introduction NL-HUG (April)Introduction NL-HUG (April)
Introduction NL-HUG (April)Evert Lammerts
 
The Discovery Cloud: Accelerating Science via Outsourcing and Automation
The Discovery Cloud: Accelerating Science via Outsourcing and AutomationThe Discovery Cloud: Accelerating Science via Outsourcing and Automation
The Discovery Cloud: Accelerating Science via Outsourcing and AutomationIan Foster
 
What to Expect of the LSST Archive: The LSST Science Platform
What to Expect of the LSST Archive: The LSST Science PlatformWhat to Expect of the LSST Archive: The LSST Science Platform
What to Expect of the LSST Archive: The LSST Science PlatformMario Juric
 
Big Data Modeling Challenges and Machine Learning with No Code
Big Data Modeling Challenges and Machine Learning with No CodeBig Data Modeling Challenges and Machine Learning with No Code
Big Data Modeling Challenges and Machine Learning with No CodeLiana Ye
 
Time to Science/Time to Results: Transforming Research in the Cloud
Time to Science/Time to Results: Transforming Research in the CloudTime to Science/Time to Results: Transforming Research in the Cloud
Time to Science/Time to Results: Transforming Research in the CloudAmazon Web Services
 
A New Partnership for Cross-Scale, Cross-Domain eScience
A New Partnership for Cross-Scale, Cross-Domain eScienceA New Partnership for Cross-Scale, Cross-Domain eScience
A New Partnership for Cross-Scale, Cross-Domain eScienceUniversity of Washington
 
Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Ola Spjuth
 
Scaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and JupyterScaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and JupyterIan Foster
 
Taming Big Data!
Taming Big Data!Taming Big Data!
Taming Big Data!Ian Foster
 
Big data visualization frameworks and applications at Kitware
Big data visualization frameworks and applications at KitwareBig data visualization frameworks and applications at Kitware
Big data visualization frameworks and applications at Kitwarebigdataviz_bay
 
Big Data Visualization
Big Data VisualizationBig Data Visualization
Big Data Visualizationbigdataviz_bay
 
Big Process for Big Data @ PNNL, May 2013
Big Process for Big Data @ PNNL, May 2013Big Process for Big Data @ PNNL, May 2013
Big Process for Big Data @ PNNL, May 2013Ian Foster
 
NERSC, AI and the Superfacility, Debbie Bard
NERSC, AI and the Superfacility, Debbie BardNERSC, AI and the Superfacility, Debbie Bard
NERSC, AI and the Superfacility, Debbie BardPacificResearchPlatform
 
Accelerating Discovery via Science Services
Accelerating Discovery via Science ServicesAccelerating Discovery via Science Services
Accelerating Discovery via Science ServicesIan Foster
 
XLDB South America Keynote: eScience Institute and Myria
XLDB South America Keynote: eScience Institute and MyriaXLDB South America Keynote: eScience Institute and Myria
XLDB South America Keynote: eScience Institute and MyriaUniversity of Washington
 
Empowering Transformational Science
Empowering Transformational ScienceEmpowering Transformational Science
Empowering Transformational ScienceChelle Gentemann
 

What's hot (20)

Love for science or 'Academic Prostitution' - DFD2014 version
Love for science or 'Academic Prostitution' - DFD2014 versionLove for science or 'Academic Prostitution' - DFD2014 version
Love for science or 'Academic Prostitution' - DFD2014 version
 
Velocity cubes of galaxies
Velocity cubes of galaxiesVelocity cubes of galaxies
Velocity cubes of galaxies
 
A Biological Internet?: Eywa
A Biological Internet?: EywaA Biological Internet?: Eywa
A Biological Internet?: Eywa
 
Big data at experimental facilities
Big data at experimental facilitiesBig data at experimental facilities
Big data at experimental facilities
 
Introduction NL-HUG (April)
Introduction NL-HUG (April)Introduction NL-HUG (April)
Introduction NL-HUG (April)
 
The Discovery Cloud: Accelerating Science via Outsourcing and Automation
The Discovery Cloud: Accelerating Science via Outsourcing and AutomationThe Discovery Cloud: Accelerating Science via Outsourcing and Automation
The Discovery Cloud: Accelerating Science via Outsourcing and Automation
 
What to Expect of the LSST Archive: The LSST Science Platform
What to Expect of the LSST Archive: The LSST Science PlatformWhat to Expect of the LSST Archive: The LSST Science Platform
What to Expect of the LSST Archive: The LSST Science Platform
 
Big Data Modeling Challenges and Machine Learning with No Code
Big Data Modeling Challenges and Machine Learning with No CodeBig Data Modeling Challenges and Machine Learning with No Code
Big Data Modeling Challenges and Machine Learning with No Code
 
Time to Science/Time to Results: Transforming Research in the Cloud
Time to Science/Time to Results: Transforming Research in the CloudTime to Science/Time to Results: Transforming Research in the Cloud
Time to Science/Time to Results: Transforming Research in the Cloud
 
A New Partnership for Cross-Scale, Cross-Domain eScience
A New Partnership for Cross-Scale, Cross-Domain eScienceA New Partnership for Cross-Scale, Cross-Domain eScience
A New Partnership for Cross-Scale, Cross-Domain eScience
 
Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...Data-intensive applications on cloud computing resources: Applications in lif...
Data-intensive applications on cloud computing resources: Applications in lif...
 
Scaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and JupyterScaling collaborative data science with Globus and Jupyter
Scaling collaborative data science with Globus and Jupyter
 
Taming Big Data!
Taming Big Data!Taming Big Data!
Taming Big Data!
 
Big data visualization frameworks and applications at Kitware
Big data visualization frameworks and applications at KitwareBig data visualization frameworks and applications at Kitware
Big data visualization frameworks and applications at Kitware
 
Big Data Visualization
Big Data VisualizationBig Data Visualization
Big Data Visualization
 
Big Process for Big Data @ PNNL, May 2013
Big Process for Big Data @ PNNL, May 2013Big Process for Big Data @ PNNL, May 2013
Big Process for Big Data @ PNNL, May 2013
 
NERSC, AI and the Superfacility, Debbie Bard
NERSC, AI and the Superfacility, Debbie BardNERSC, AI and the Superfacility, Debbie Bard
NERSC, AI and the Superfacility, Debbie Bard
 
Accelerating Discovery via Science Services
Accelerating Discovery via Science ServicesAccelerating Discovery via Science Services
Accelerating Discovery via Science Services
 
XLDB South America Keynote: eScience Institute and Myria
XLDB South America Keynote: eScience Institute and MyriaXLDB South America Keynote: eScience Institute and Myria
XLDB South America Keynote: eScience Institute and Myria
 
Empowering Transformational Science
Empowering Transformational ScienceEmpowering Transformational Science
Empowering Transformational Science
 

Similar to Digital Science: Reproducibility and Visibility in Astronomy

Knowledge – dynamics – landscape - navigation – what have interfaces to digit...
Knowledge – dynamics – landscape - navigation – what have interfaces to digit...Knowledge – dynamics – landscape - navigation – what have interfaces to digit...
Knowledge – dynamics – landscape - navigation – what have interfaces to digit...Andrea Scharnhorst
 
Open Research Knowledge Graph (ORKG) - an overview
Open Research Knowledge Graph (ORKG) - an overview   Open Research Knowledge Graph (ORKG) - an overview
Open Research Knowledge Graph (ORKG) - an overview Jennifer D'Souza
 
Digital Scholarship Intersection Scale Social Machines
Digital Scholarship Intersection Scale Social MachinesDigital Scholarship Intersection Scale Social Machines
Digital Scholarship Intersection Scale Social MachinesDavid De Roure
 
Digital Scholarship: Intersection, Automation, and Scholarly Social Machines
Digital Scholarship: Intersection, Automation, and Scholarly Social MachinesDigital Scholarship: Intersection, Automation, and Scholarly Social Machines
Digital Scholarship: Intersection, Automation, and Scholarly Social MachinesDavid De Roure
 
Drowning in information – the need of macroscopes for research funding
Drowning in information – the need of macroscopes for research fundingDrowning in information – the need of macroscopes for research funding
Drowning in information – the need of macroscopes for research fundingAndrea Scharnhorst
 
Rare (and emergent) disciplines in the light of science studies
Rare (and emergent) disciplines in the light of science studiesRare (and emergent) disciplines in the light of science studies
Rare (and emergent) disciplines in the light of science studiesAndrea Scharnhorst
 
Astronomical Data Processing on the LSST Scale with Apache Spark
Astronomical Data Processing on the LSST Scale with Apache SparkAstronomical Data Processing on the LSST Scale with Apache Spark
Astronomical Data Processing on the LSST Scale with Apache SparkDatabricks
 
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Jisc
 
RARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsRARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsCarole Goble
 
Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402vrij
 
Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)Kerstin Lehnert
 
Building Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFBuilding Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFOlga Scrivner
 
Rebecca Grant DAH Research Presentation
Rebecca Grant DAH Research PresentationRebecca Grant DAH Research Presentation
Rebecca Grant DAH Research Presentationdri_ireland
 
Use r 2013 tutorial - r and cloud computing for higher education and research
Use r 2013   tutorial - r and cloud computing for higher education and researchUse r 2013   tutorial - r and cloud computing for higher education and research
Use r 2013 tutorial - r and cloud computing for higher education and researchkchine3
 
Software Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceSoftware Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceCarole Goble
 
06 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.201406 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.2014VinothkumaR Ramu
 

Similar to Digital Science: Reproducibility and Visibility in Astronomy (20)

Digital Science
Digital ScienceDigital Science
Digital Science
 
Knowledge – dynamics – landscape - navigation – what have interfaces to digit...
Knowledge – dynamics – landscape - navigation – what have interfaces to digit...Knowledge – dynamics – landscape - navigation – what have interfaces to digit...
Knowledge – dynamics – landscape - navigation – what have interfaces to digit...
 
Open Research Knowledge Graph (ORKG) - an overview
Open Research Knowledge Graph (ORKG) - an overview   Open Research Knowledge Graph (ORKG) - an overview
Open Research Knowledge Graph (ORKG) - an overview
 
Digital Scholarship Intersection Scale Social Machines
Digital Scholarship Intersection Scale Social MachinesDigital Scholarship Intersection Scale Social Machines
Digital Scholarship Intersection Scale Social Machines
 
Digital Scholarship: Intersection, Automation, and Scholarly Social Machines
Digital Scholarship: Intersection, Automation, and Scholarly Social MachinesDigital Scholarship: Intersection, Automation, and Scholarly Social Machines
Digital Scholarship: Intersection, Automation, and Scholarly Social Machines
 
Drowning in information – the need of macroscopes for research funding
Drowning in information – the need of macroscopes for research fundingDrowning in information – the need of macroscopes for research funding
Drowning in information – the need of macroscopes for research funding
 
Rare (and emergent) disciplines in the light of science studies
Rare (and emergent) disciplines in the light of science studiesRare (and emergent) disciplines in the light of science studies
Rare (and emergent) disciplines in the light of science studies
 
Astronomical Data Processing on the LSST Scale with Apache Spark
Astronomical Data Processing on the LSST Scale with Apache SparkAstronomical Data Processing on the LSST Scale with Apache Spark
Astronomical Data Processing on the LSST Scale with Apache Spark
 
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015Keynote speech - Carole Goble - Jisc Digital Festival 2015
Keynote speech - Carole Goble - Jisc Digital Festival 2015
 
RARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsRARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research Objects
 
Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402Gridforum David De Roure Newe Science 20080402
Gridforum David De Roure Newe Science 20080402
 
Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)Research Data Infrastructure for Geochemistry (DFG Roundtable)
Research Data Infrastructure for Geochemistry (DFG Roundtable)
 
If only I had a map!
If only I had a map!If only I had a map!
If only I had a map!
 
Building Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVFBuilding Effective Visualization Shiny WVF
Building Effective Visualization Shiny WVF
 
Knowledge Graphs for Scholarly Communication
Knowledge Graphs for Scholarly CommunicationKnowledge Graphs for Scholarly Communication
Knowledge Graphs for Scholarly Communication
 
Christine borgman keynote
Christine borgman keynoteChristine borgman keynote
Christine borgman keynote
 
Rebecca Grant DAH Research Presentation
Rebecca Grant DAH Research PresentationRebecca Grant DAH Research Presentation
Rebecca Grant DAH Research Presentation
 
Use r 2013 tutorial - r and cloud computing for higher education and research
Use r 2013   tutorial - r and cloud computing for higher education and researchUse r 2013   tutorial - r and cloud computing for higher education and research
Use r 2013 tutorial - r and cloud computing for higher education and research
 
Software Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceSoftware Sustainability: Better Software Better Science
Software Sustainability: Better Software Better Science
 
06 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.201406 e science-bio diversity@ pacc 18.07.2014
06 e science-bio diversity@ pacc 18.07.2014
 

More from Jose Enrique Ruiz

Jupyter notebooks on steroids
Jupyter notebooks on steroidsJupyter notebooks on steroids
Jupyter notebooks on steroidsJose Enrique Ruiz
 
Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web ServicesJose Enrique Ruiz
 
Wf4Ever: Workflow Preservation
Wf4Ever: Workflow PreservationWf4Ever: Workflow Preservation
Wf4Ever: Workflow PreservationJose Enrique Ruiz
 
Use of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubesUse of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubesJose Enrique Ruiz
 
VO web-services-based astronomy workflows
VO web-services-based astronomy workflowsVO web-services-based astronomy workflows
VO web-services-based astronomy workflowsJose Enrique Ruiz
 
Web services based workflows to deal with 3D data
Web services based workflows to deal with 3D dataWeb services based workflows to deal with 3D data
Web services based workflows to deal with 3D dataJose Enrique Ruiz
 
Curating and Preserving Collaborative Digital Experiments
Curating and Preserving Collaborative Digital ExperimentsCurating and Preserving Collaborative Digital Experiments
Curating and Preserving Collaborative Digital ExperimentsJose Enrique Ruiz
 
Collaborative Digital Experiments
Collaborative Digital ExperimentsCollaborative Digital Experiments
Collaborative Digital ExperimentsJose Enrique Ruiz
 
El Observatorio Virtual - eCA
El Observatorio Virtual - eCAEl Observatorio Virtual - eCA
El Observatorio Virtual - eCAJose Enrique Ruiz
 
Multidimensional Data in the VO
Multidimensional Data in the VOMultidimensional Data in the VO
Multidimensional Data in the VOJose Enrique Ruiz
 
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall InteropB0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall InteropJose Enrique Ruiz
 
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iWf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iJose Enrique Ruiz
 

More from Jose Enrique Ruiz (14)

Jupyter notebooks on steroids
Jupyter notebooks on steroidsJupyter notebooks on steroids
Jupyter notebooks on steroids
 
Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web Services
 
Wf4Ever: Workflow Preservation
Wf4Ever: Workflow PreservationWf4Ever: Workflow Preservation
Wf4Ever: Workflow Preservation
 
Use of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubesUse of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubes
 
Workflow Preservation
Workflow PreservationWorkflow Preservation
Workflow Preservation
 
VO web-services-based astronomy workflows
VO web-services-based astronomy workflowsVO web-services-based astronomy workflows
VO web-services-based astronomy workflows
 
Web services based workflows to deal with 3D data
Web services based workflows to deal with 3D dataWeb services based workflows to deal with 3D data
Web services based workflows to deal with 3D data
 
Curating and Preserving Collaborative Digital Experiments
Curating and Preserving Collaborative Digital ExperimentsCurating and Preserving Collaborative Digital Experiments
Curating and Preserving Collaborative Digital Experiments
 
Collaborative Digital Experiments
Collaborative Digital ExperimentsCollaborative Digital Experiments
Collaborative Digital Experiments
 
SVO Activities - SEA 2008
SVO Activities - SEA 2008SVO Activities - SEA 2008
SVO Activities - SEA 2008
 
El Observatorio Virtual - eCA
El Observatorio Virtual - eCAEl Observatorio Virtual - eCA
El Observatorio Virtual - eCA
 
Multidimensional Data in the VO
Multidimensional Data in the VOMultidimensional Data in the VO
Multidimensional Data in the VO
 
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall InteropB0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
 
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iWf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
 

Recently uploaded

How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...AliaaTarek5
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 

Recently uploaded (20)

How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
(How to Program) Paul Deitel, Harvey Deitel-Java How to Program, Early Object...
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 

Digital Science: Reproducibility and Visibility in Astronomy

  • 1. 1 Digital Science Reproducibility and Visibility in Astronomy José Enrique Ruiz on behalf of the Wf4Ever Team SCIOPS 2013 ESAC, FRIDAY 13th SEPTEMBER 2013
  • 2. 2 Wf4Ever Digital Science - Reproducibility and Visibility in Astronomy 1.  Intelligent Software Components (ISOCO, Spain) 2.  University of Manchester (UNIMAN, UK) 3.  Universidad Politécnica de Madrid (UPM, Spain) 4.  Poznan Supercomputing and Networking Centre (Poland) 5.  University of Oxford and OeRC (OXF, UK) 6.  Instituto Astrofísica Andalucía (IAA-CSIC, Spain) 7.  Leiden University Medical Centre (LUMC, NL) 4 Wf4Ever Advanced Workflow Preservation Technologies for Enhanced Science 3 1 6 7 5 2 2011 - 2013
  • 3. 3 Astronomy research lifecycle is entirely digital »  Observation proposals »  Data reduction pipelines »  Analysis of science ready data »  Catalogs of objects and data archives »  Publish process ›  Final data results ›  Experiment in DL ADS/arXiv Reproducible research is still not possible in a digital world A rich infrastructure of data is not efficiently used A normalized preservation of methodology is needed Tools Astronomy Research Lifecycle Digital Science - Reproducibility and Visibility in Astronomy
  • 4. 4 Reproducibility and The Scientific Method Digital Science - Reproducibility and Visibility in Astronomy http://xkcd.com/242/ Benefits »  Publishing knowledge, not advertising »  The author, the referee, the re-user »  Reputation, prestige and respect »  Higher quality of publications ›  Authors will be more careful ›  Many eyes to check results Challenges »  Hard and time consuming »  Need incentives – not rewarded now
  • 5. 5 Reproducibility and The Scientific Method Digital Science - Reproducibility and Visibility in Astronomy I don’t know how!
  • 6. 6 Visibility, Efficiency and Reuse Digital Science - Reproducibility and Visibility in Astronomy Optimize return on investments made on big facilities »  Avoid duplication of efforts and reinvention »  How to discover and not duplicate ? »  How to re-use and not duplicate ? »  How to make use of best practices ? »  How to use the rich infrastructure of data ? »  Intellectual contribs are encoded in software More data in archives does not imply more knowledge »  Expose complete scientific record, not the story »  Allow easy discovery of methods and tools
  • 7. 7 Visibility and Social Discovery Digital Science - Reproducibility and Visibility in Astronomy Paper discovery: the social dimension
  • 8. 8 The Executable Paper Digital Science - Reproducibility and Visibility in Astronomy Time has come to go beyond the PDF
  • 9. 9 Digital Astronomy in the Local Desktop Digital Science - Reproducibility and Visibility in Astronomy Going beyond automation Organization!
  • 10. 10 Digital Astronomy in the Local Desktop Workflows to Access and Massage VO Data # CIG Vhel e_Vhel r_Vhel Dist MType e_MType OptAssym r_MType Bmag e_Bmag 1 7299.0 3.0 1 96.9 5.0 1.5 1 1 14.167 0.271 0.173 0.571 0.040 13.383 2 6983.0 6.0 2 94.7 6.0 1.5 0 1 15.722 0.324 0.255 0.278 0.031 15.157 3 4.0 1.5 0 1 16.057 0.507 0.246 0.354 15.457 4 2310.0 1.0 3 31.9 3.0 1.5 0 1 12.818 0.424 0.252 0.863 0.017 11.685 5 7865.0 10.0 3 105.9 0.0 1.5 0 1 15.602 0.364 0.225 0.131 0.118 15.128 72 5164.0 9.0 2 68.5 5.0 1.5 1 1 14.445 0.325 0.315 0.367 0.028 13.735 Capture ! Actions, Tasks, Dependencies, Provenance! ! Improve ! Clarity and Reproducibility!
  • 11. 11 Scientific Workflows Digital Science - Reproducibility and Visibility in Astronomy Living Tutorials! Templates for Re-use! Expedite Training! Reduce time to insight! Avoid reinvention! Digital Libraries of workflows may boost the use of the existing infrastructure of data (VO) !
  • 12. 12 ! ! Software ›  Taverna ›  Kepler ›  Pegasus ›  Triana ›  ESO Reflex Scientific Workflows Digital Science - Reproducibility and Visibility in Astronomy Related Initiatives ›  ER-Flow ›  VAMDC ›  HELIO ›  Cyber-SKA ›  IceCore ›  Montage ›  Astro-WISE ›  AstroGrid IVOA ›  AstroGrid ›  Grid&WS WG ›  VO France Wf WG Self descriptive WS ›  PDL ›  SimDAL, S3
  • 13. 13 ! ! AstroTaverna: Create, annotate and run a workflow http://amiga.iaa.es/p/290-astrotaverna.htm Astronomical Research Objects in Action Digital Science - Reproducibility and Visibility in Astronomy
  • 14. 14 ! ! AstroTaverna: Create, annotate and run a workflow http://amiga.iaa.es/p/290-astrotaverna.htm Astronomical Research Objects in Action Digital Science - Reproducibility and Visibility in Astronomy
  • 15. 15 The next generation of archives Digital Science - Reproducibility and Visibility in Astronomy Prof. Kevin Vinsen ASKAP Datacubes
  • 16. 16 The next generation of archives Digital Science - Reproducibility and Visibility in Astronomy Prof. Kevin Vinsen SKA Datacubes
  • 17. 17 The next generation of archives Digital Science - Reproducibility and Visibility in Astronomy Much wider FoV and spectral coverage »  Large volumes for a single observed dataset Automated surveys »  Huge amounts of tabular data We are moving into a world where »  computing and storage are cheap »  data movement is death Extraction of scientifically relevant info from a multiD param. space »  Exploration services »  Anomaly detection »  Cross-matching data »  Dimensionality reduction Detailed inspection and subset »  Filtering »  Extraction »  Re-Projection »  Analysis services
  • 18. 18 »  A cloud of Web Services »  Archives speaking Web Services Process should benefit of the same privileges acquired by data Preserving the method ensures replication of final results at any moment Archives should evolve from data providers into »  Virtual Data providers »  Software Tasks providers Astronomy of multi archives/facilities/wavelength Interconnected and interoperable archives »  Data -> Virtual Observatory »  Software Tasks The next generation of archives Digital Science - Reproducibility and Visibility in Astronomy Preservation The move computing to data paradigm
  • 19. 19 Research Objects Digital Science - Reproducibility and Visibility in Astronomy Distributed Technical Objects Social Objects Expose experimental context in a structured way in order to be understood
  • 20. 20 Research Objects Digital Science - Reproducibility and Visibility in Astronomy IPython Notebook solutions »  Web-browser as the working desktop »  Python code, plots and data, living with rich-text documentation »  Cloud-based adaptive scalable computing environment »  Fully shareable, re-usable and executable wikis »  Social platform and Git versioning
  • 21. 21 Research Objects Digital Science - Reproducibility and Visibility in Astronomy ADSLabs ADO Linked Components »  Authors »  Publications »  Journals »  Objects SIMBAD »  Tabular data behind the plots CDS »  ASCL reference of used software »  Observing time Proposals »  Used facilities, surveys or missions http://labs.adsabs.harvard.edu/ Incentives Similar Initiative to ESO Telbib!
  • 22. 22 ! ! The Incentive Papers with data links are cited more than those without Research Objects Digital Science - Reproducibility and Visibility in Astronomy Effect of E-printing on Citation Rates in Astronomy and Physics 2006. Edwin A. Henneken et al.
  • 23. 23 ! ! The Incentive Papers with data links are cited more than those without Research Objects Digital Science - Reproducibility and Visibility in Astronomy Effect of E-printing on Citation Rates in Astronomy and Physics 2006. Edwin A. Henneken et al.
  • 24. 24 Conclusions Digital Science - Reproducibility and Visibility in Astronomy »  Reproducibility is at the very heart of the scientific method »  Improving visibility is key in order to avoid reinvention »  Social dimension of science stressed in the discovery process »  Highly specialized science needs re-use to achieve efficiency »  In a digital world, publish decomposable executable papers »  Capture provenance and structure in the local desktop »  Scientific workflows go beyond automation: provide clarity and structure »  Transfer rate is more than an issue for next generation of archives »  The move computing to data paradigm -> back to old terminals »  Process should benefit of the same privileges acquired by data »  Digital libraries of web-services-based workflows »  The distributed digital workflow-centric Research Object »  Preserving knowledge - not only data or advertising jer@iaa.es