SlideShare ist ein Scribd-Unternehmen logo
1 von 16
Downloaden Sie, um offline zu lesen
Grant agreement no.: 27092




    Curating and Preserving !
Collaborative Digital Experiments!


               Jose Enrique Ruiz!
                    IAA-CSIC!
                        !
                 May 19th 2011!
    2011 IVOA Spring Interop Meeting - Naples!
Wf4Ever: preserving experiments!


Wf4Ever team!
                 1.  Intelligent Software Components (ISOCO, Spain)!
                 2.  University of Manchester (UNIMAN, UK)!
  2     7        3.  Universidad Politécnica de Madrid (UPM, Spain)!
   5!       4!   4.  Poznan Supercomputing and Networking Centre
                     (PSNC, Poland)!
                 5.  Universisty of Oxford (OXF, UK)!
                 6.  Instituto de Astrofísica de Andalucía (IAA, Spain)!
1! 3!
                 7.  Leiden University Medical Centre (LUMC, NL)!
 6!




                                                                           2
Wf4Ever: preserving experiments!


Astronomy research is entirely digital !
Time has come to go “Beyond the PDF”!
•    Preserved experiments!
•    Methodology “in action”!
•    All data exposed!
•    Reproducible!
•    Repeatable!
•    Reusable!
•    Repurposeable!
•    Participatory!
•    Collaborative!
•    Formative!
                                                               3
Wf4Ever: preserving experiments!


Wf4Ever goals!
!
All components related to the!
research lifecycle should be available. !
!
Preserved and easily retrievables !
!
•    Proposals!
•    Data!
•    Processes!
•    Workflows!
•    Publications!

!
                                                              4
Research Objects: the ingredients!


Research Objects in Astronomy!
•    Metadata (Author, Instrument, Research group, etc.) !
•    Description of the experiment (Strategy, Expected results, etc.)!
•    Observing proposal!
•    Auxiliary and raw data!
•    Reduced science-ready data!
•    Digital environment needed !
•    Scripting and software used!
•    Web services!
•    Scientific workflow!
•    Final data products!
•    Standard publication !
                                                                         5
Scientific workflows: the cooking recipes!


Scientific Workflows!
      !
•    Automation!
•    Repeatable !
•    Reproducible!
•    Encourage best practices!
•    Modular nature allows !
        •  Reuse!
        •  Repurpose!
•    Exposes the scientific method!
•    Formative!
•    Scientist friendly!


                                                                6
Scientific workflows: the cooking recipes!


Scientific Workflows!
!
•    Automation vs. The intrinsic exploratory nature of Science!
•    Documented vs. Hidden knowledge!
•    Web services vs. Local software!
•    On-line data vs. Local data!
•    Modular vs. Unstructured!
•    Open Science vs. Proprietary!

•  Preserved!
•  Classified and indexed!
•  Referenced and retrievable!
!
                                                                   7
Scientific workflows: the cooking recipes!


Workflow preservation is complex!
!
!
•    Interpreted through their execution!
•    Complex models are required to describe them!
•    Provenance is a complex issue in a cloud of services!
•    Need of Web Semantics, Ontologies, Linked Data, etc..!
•    Resources are often beyond control of scientists!




                                                              8
Scientific workflows: the cooking recipes!


The oven!
A workflow enactment and management system!
University of Manchester !
!
•  AstroTaverna (AstroGrid)!
    •  SOAP!
    •  AstroRuntime!


•  Reflex (ESO)!
•  Aladin JLOW Plugin (CDS) !



                                                            9
Collaborative tools: “Le marché”!


The recipes store!
Oxford University!
!
•    Find workflows!
•    Share workflows and files!
•    Find people!
•    Build communities!
•    Publish packages!
•    Tag workflows!
•    Score and rate workflows!
•    Comment on workflows!
•    Write reviews!


                                                                 10
Wf4Ever Platform Requirements!


Living Working Research Objects!
!
!
•    Ubiquitous storing and computing!
•    Data archives and local data!
•    Web services and scripts!
•    Python based community!
•    VO standards!


•  Modular to reuse individual parts!
•  Access rights at different levels of granularity!
•  VOSpaces!
                                                          11
Wf4Ever Platform Requirements!


Published Research Objects!
 !
•    Archival!
•    Classification!
•    Indexing!
•    Retrieval!
•    Versioning!

•  Community reuse!
•  Rating, scoring and annotations!
•  Scalable in semantic repositories!

•  Permanent URIs, Linked Data, Semantics, etc.!
•  Interlink with catalogs/digital libraries!
                                                         12
Wf4Ever Platform Requirements!


Users roles!
!
Collaborator!
Dealing with Living Working Research Objects in a research group. !
Reader!
Skims titles and abstracts of Published Research Objects. !
Comparator!
Looking for similar Research Objects to those she/he is working with.!
Re-user!
Extract modules from workflows and use them for his own purpose.!
Publisher!
Wants her/his work to be known.!
Evaluator!
He evaluates, rates, comments and recommend a specific Research Object. !
!
Most of them are active roles run the workflows with (different) data !
                                                                            13
First Developments!


ROBox: the basket!
Seamless contribution to a collaborative platform!
A shared folder in Dropbox becomes a Working Research Object!
Automatic generation of metadata  !




                                                                14
First Developments!

           !
Migration to VOSpace
 needed for Big Data
     Astronomy!
               !!
Services should run where
       the data live!




                         15
Open questions!

  We are moving into a world where !
  computing and storage are cheap and data movement is death.!
  !
  In a Cloud of services and data, Web Services should benefit of the
  same privileges acquired by Data.!
  !
  •    Curation and preservation (identifiers)!
  •    Discovery (semantics) of web services (linked “services”?)!
  •    Characterization: input, outputs, functionality, etc.!
  •    Copies (authenticity) or similar web services used as alternates !
  •    Permissions, licenses, platform, costs, etc.!
  •    Metrics for quality: popularity, use stats, logs uptime, etc.!
  •    Versioning and authoring (referenced and acknowledged)!


http://www.wf4ever-project.org!                                             16

Weitere ähnliche Inhalte

Ähnlich wie Curating and Preserving Collaborative Digital Experiments

Workflows in the Virtual Observatory
Workflows in the Virtual ObservatoryWorkflows in the Virtual Observatory
Workflows in the Virtual Observatory
Jose Enrique Ruiz
 
VO web-services-based astronomy workflows
VO web-services-based astronomy workflowsVO web-services-based astronomy workflows
VO web-services-based astronomy workflows
Jose Enrique Ruiz
 
New Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data CitationsNew Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data Citations
John Kunze
 
Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web Services
Jose Enrique Ruiz
 
Wf4Ever: Work!ows for Methodology and Science Preservation
Wf4Ever: Work!ows for Methodology and Science PreservationWf4Ever: Work!ows for Methodology and Science Preservation
Wf4Ever: Work!ows for Methodology and Science Preservation
Joint ALMA Observatory
 

Ähnlich wie Curating and Preserving Collaborative Digital Experiments (20)

Workflows in the Virtual Observatory
Workflows in the Virtual ObservatoryWorkflows in the Virtual Observatory
Workflows in the Virtual Observatory
 
OAI7 Research Objects
OAI7 Research ObjectsOAI7 Research Objects
OAI7 Research Objects
 
VO web-services-based astronomy workflows
VO web-services-based astronomy workflowsVO web-services-based astronomy workflows
VO web-services-based astronomy workflows
 
OeRC Seminar
OeRC SeminarOeRC Seminar
OeRC Seminar
 
Knowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems ScienceKnowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems Science
 
Oak meeting 18/09/2014
Oak meeting 18/09/2014Oak meeting 18/09/2014
Oak meeting 18/09/2014
 
New Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data CitationsNew Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data Citations
 
Research Objects in Wf4Ever
Research Objects in Wf4EverResearch Objects in Wf4Ever
Research Objects in Wf4Ever
 
myExperiment and the Rise of Social Machines
myExperiment and the Rise of Social MachinesmyExperiment and the Rise of Social Machines
myExperiment and the Rise of Social Machines
 
Where are we going and how are we going to get there?
Where are we going and how are we going to get there?Where are we going and how are we going to get there?
Where are we going and how are we going to get there?
 
Acs denver dirks potenzone 30 aug2011
Acs denver dirks potenzone 30 aug2011Acs denver dirks potenzone 30 aug2011
Acs denver dirks potenzone 30 aug2011
 
Snakes on the Web; Developing web applications in python
Snakes on the Web; Developing web applications in pythonSnakes on the Web; Developing web applications in python
Snakes on the Web; Developing web applications in python
 
Research Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibilityResearch Objects for improved sharing and reproducibility
Research Objects for improved sharing and reproducibility
 
Curation and Characterization of Web Services
Curation and Characterization of Web ServicesCuration and Characterization of Web Services
Curation and Characterization of Web Services
 
ORCID Identifiers in Reactome
ORCID Identifiers in ReactomeORCID Identifiers in Reactome
ORCID Identifiers in Reactome
 
My projects at University of Oxford e-Research Centre - Nov 2014
My projects at University of Oxford e-Research Centre - Nov 2014My projects at University of Oxford e-Research Centre - Nov 2014
My projects at University of Oxford e-Research Centre - Nov 2014
 
2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objects2012 03-28 Wf4ever, preserving workflows as digital research objects
2012 03-28 Wf4ever, preserving workflows as digital research objects
 
1st meeting of PG PUSHPIN
1st meeting of PG PUSHPIN1st meeting of PG PUSHPIN
1st meeting of PG PUSHPIN
 
2012 09 aos-workshop-johanneskeizer
2012 09 aos-workshop-johanneskeizer2012 09 aos-workshop-johanneskeizer
2012 09 aos-workshop-johanneskeizer
 
Wf4Ever: Work!ows for Methodology and Science Preservation
Wf4Ever: Work!ows for Methodology and Science PreservationWf4Ever: Work!ows for Methodology and Science Preservation
Wf4Ever: Work!ows for Methodology and Science Preservation
 

Mehr von Jose Enrique Ruiz

Implementing a VO archive for datacubes of galaxies
Implementing a VO archive for datacubes of galaxiesImplementing a VO archive for datacubes of galaxies
Implementing a VO archive for datacubes of galaxies
Jose Enrique Ruiz
 
Digital Science: Reproducibility and Visibility in Astronomy
Digital Science: Reproducibility and Visibility in AstronomyDigital Science: Reproducibility and Visibility in Astronomy
Digital Science: Reproducibility and Visibility in Astronomy
Jose Enrique Ruiz
 
Workflows to access and massage VOData
Workflows to access and massage VODataWorkflows to access and massage VOData
Workflows to access and massage VOData
Jose Enrique Ruiz
 
Use of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubesUse of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubes
Jose Enrique Ruiz
 
El Observatorio Virtual - eCA
El Observatorio Virtual - eCAEl Observatorio Virtual - eCA
El Observatorio Virtual - eCA
Jose Enrique Ruiz
 
Multidimensional Data in the VO
Multidimensional Data in the VOMultidimensional Data in the VO
Multidimensional Data in the VO
Jose Enrique Ruiz
 
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall InteropB0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
Jose Enrique Ruiz
 
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iWf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Jose Enrique Ruiz
 

Mehr von Jose Enrique Ruiz (14)

Jupyter notebooks on steroids
Jupyter notebooks on steroidsJupyter notebooks on steroids
Jupyter notebooks on steroids
 
Velocity cubes of galaxies
Velocity cubes of galaxiesVelocity cubes of galaxies
Velocity cubes of galaxies
 
Implementing a VO archive for datacubes of galaxies
Implementing a VO archive for datacubes of galaxiesImplementing a VO archive for datacubes of galaxies
Implementing a VO archive for datacubes of galaxies
 
Open Science and Executable Papers
Open Science and Executable PapersOpen Science and Executable Papers
Open Science and Executable Papers
 
Digital Science: Towards the executable paper
Digital Science: Towards the executable paperDigital Science: Towards the executable paper
Digital Science: Towards the executable paper
 
Digital Science: Reproducibility and Visibility in Astronomy
Digital Science: Reproducibility and Visibility in AstronomyDigital Science: Reproducibility and Visibility in Astronomy
Digital Science: Reproducibility and Visibility in Astronomy
 
Workflows to access and massage VOData
Workflows to access and massage VODataWorkflows to access and massage VOData
Workflows to access and massage VOData
 
Digital Science
Digital ScienceDigital Science
Digital Science
 
Use of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubesUse of CharDM in an archive of velocity cubes
Use of CharDM in an archive of velocity cubes
 
SVO Activities - SEA 2008
SVO Activities - SEA 2008SVO Activities - SEA 2008
SVO Activities - SEA 2008
 
El Observatorio Virtual - eCA
El Observatorio Virtual - eCAEl Observatorio Virtual - eCA
El Observatorio Virtual - eCA
 
Multidimensional Data in the VO
Multidimensional Data in the VOMultidimensional Data in the VO
Multidimensional Data in the VO
 
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall InteropB0DEGA 3D VO Archive - IVOA 2010 Fall Interop
B0DEGA 3D VO Archive - IVOA 2010 Fall Interop
 
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science iWf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
Wf4Ever: Advanced Workflow Preservation Technologies for Enhanced Science i
 

Curating and Preserving Collaborative Digital Experiments

  • 1. Grant agreement no.: 27092 Curating and Preserving ! Collaborative Digital Experiments! Jose Enrique Ruiz! IAA-CSIC! ! May 19th 2011! 2011 IVOA Spring Interop Meeting - Naples!
  • 2. Wf4Ever: preserving experiments! Wf4Ever team! 1.  Intelligent Software Components (ISOCO, Spain)! 2.  University of Manchester (UNIMAN, UK)! 2 7 3.  Universidad Politécnica de Madrid (UPM, Spain)! 5! 4! 4.  Poznan Supercomputing and Networking Centre (PSNC, Poland)! 5.  Universisty of Oxford (OXF, UK)! 6.  Instituto de Astrofísica de Andalucía (IAA, Spain)! 1! 3! 7.  Leiden University Medical Centre (LUMC, NL)! 6! 2
  • 3. Wf4Ever: preserving experiments! Astronomy research is entirely digital ! Time has come to go “Beyond the PDF”! •  Preserved experiments! •  Methodology “in action”! •  All data exposed! •  Reproducible! •  Repeatable! •  Reusable! •  Repurposeable! •  Participatory! •  Collaborative! •  Formative! 3
  • 4. Wf4Ever: preserving experiments! Wf4Ever goals! ! All components related to the! research lifecycle should be available. ! ! Preserved and easily retrievables ! ! •  Proposals! •  Data! •  Processes! •  Workflows! •  Publications! ! 4
  • 5. Research Objects: the ingredients! Research Objects in Astronomy! •  Metadata (Author, Instrument, Research group, etc.) ! •  Description of the experiment (Strategy, Expected results, etc.)! •  Observing proposal! •  Auxiliary and raw data! •  Reduced science-ready data! •  Digital environment needed ! •  Scripting and software used! •  Web services! •  Scientific workflow! •  Final data products! •  Standard publication ! 5
  • 6. Scientific workflows: the cooking recipes! Scientific Workflows! ! •  Automation! •  Repeatable ! •  Reproducible! •  Encourage best practices! •  Modular nature allows ! •  Reuse! •  Repurpose! •  Exposes the scientific method! •  Formative! •  Scientist friendly! 6
  • 7. Scientific workflows: the cooking recipes! Scientific Workflows! ! •  Automation vs. The intrinsic exploratory nature of Science! •  Documented vs. Hidden knowledge! •  Web services vs. Local software! •  On-line data vs. Local data! •  Modular vs. Unstructured! •  Open Science vs. Proprietary! •  Preserved! •  Classified and indexed! •  Referenced and retrievable! ! 7
  • 8. Scientific workflows: the cooking recipes! Workflow preservation is complex! ! ! •  Interpreted through their execution! •  Complex models are required to describe them! •  Provenance is a complex issue in a cloud of services! •  Need of Web Semantics, Ontologies, Linked Data, etc..! •  Resources are often beyond control of scientists! 8
  • 9. Scientific workflows: the cooking recipes! The oven! A workflow enactment and management system! University of Manchester ! ! •  AstroTaverna (AstroGrid)! •  SOAP! •  AstroRuntime! •  Reflex (ESO)! •  Aladin JLOW Plugin (CDS) ! 9
  • 10. Collaborative tools: “Le marché”! The recipes store! Oxford University! ! •  Find workflows! •  Share workflows and files! •  Find people! •  Build communities! •  Publish packages! •  Tag workflows! •  Score and rate workflows! •  Comment on workflows! •  Write reviews! 10
  • 11. Wf4Ever Platform Requirements! Living Working Research Objects! ! ! •  Ubiquitous storing and computing! •  Data archives and local data! •  Web services and scripts! •  Python based community! •  VO standards! •  Modular to reuse individual parts! •  Access rights at different levels of granularity! •  VOSpaces! 11
  • 12. Wf4Ever Platform Requirements! Published Research Objects! ! •  Archival! •  Classification! •  Indexing! •  Retrieval! •  Versioning! •  Community reuse! •  Rating, scoring and annotations! •  Scalable in semantic repositories! •  Permanent URIs, Linked Data, Semantics, etc.! •  Interlink with catalogs/digital libraries! 12
  • 13. Wf4Ever Platform Requirements! Users roles! ! Collaborator! Dealing with Living Working Research Objects in a research group. ! Reader! Skims titles and abstracts of Published Research Objects. ! Comparator! Looking for similar Research Objects to those she/he is working with.! Re-user! Extract modules from workflows and use them for his own purpose.! Publisher! Wants her/his work to be known.! Evaluator! He evaluates, rates, comments and recommend a specific Research Object. ! ! Most of them are active roles run the workflows with (different) data ! 13
  • 14. First Developments! ROBox: the basket! Seamless contribution to a collaborative platform! A shared folder in Dropbox becomes a Working Research Object! Automatic generation of metadata ! 14
  • 15. First Developments! ! Migration to VOSpace needed for Big Data Astronomy! !! Services should run where the data live! 15
  • 16. Open questions! We are moving into a world where ! computing and storage are cheap and data movement is death.! ! In a Cloud of services and data, Web Services should benefit of the same privileges acquired by Data.! ! •  Curation and preservation (identifiers)! •  Discovery (semantics) of web services (linked “services”?)! •  Characterization: input, outputs, functionality, etc.! •  Copies (authenticity) or similar web services used as alternates ! •  Permissions, licenses, platform, costs, etc.! •  Metrics for quality: popularity, use stats, logs uptime, etc.! •  Versioning and authoring (referenced and acknowledged)! http://www.wf4ever-project.org! 16