SlideShare ist ein Scribd-Unternehmen logo
1 von 63
Workflow Classification and Open-Sourcing Methods: Towards a New Publication Model Richard Littauer, Karthik Ram, Bertram Ludäscher, William Michener, Rebecca Koskela DataONE 1
Scientific Workflows Tools that help scientists: Automate repetitive or difficult work DataONE 2
Scientific Workflows Tools that help scientists: Automate repetitive or difficult work Provide reproducibility to their experiments DataONE 3
Scientific Workflows Tools that help scientists: Automate repetitive or difficult work Provide reproducibility to their experiments Track provenance DataONE 4
Scientific Workflows Tools that help scientists: Automate repetitive or difficult work Provide reproducibility to their experiments Track provenance Share their data with other scientists DataONE 5
Workflow Workbenches DataONE 6
Workflow Workbenches DataONE 7
Workflow Workbenches DataONE 8
Workflow Workbenches These facilitate: DataONE 9 Creation http://www.flickr.com/photos/ideacreamanuelapps/3542203718/
Workflow Workbenches These facilitate: DataONE 10 Mapping http://www.flickr.com/photos/fatguyinalittlecoat/5716492273
Workflow Workbenches These facilitate: DataONE 11 Scheduling http://www.flickr.com/photos/silent-penguin/232394/
Workflow Workbenches These facilitate: DataONE 12 Execution http://www.flickr.com/photos/pagedooley/4039784738/
Workflow Workbenches These facilitate: DataONE 13 Visualisation http://www.flickr.com/photos/cnon/5698746966/
Workflow Workbenches These facilitate: DataONE 14 Re-use http://www.flickr.com/photos/nihonbunka/32774212/
Workflow Workbenches Not all scientists are coders.  DataONE 15
Workflow Workbenches Not all scientists are coders.  By using front-end visualizations and eliminating the need for lower-level coding (ie, shell scripts)… DataONE 16
Workflow Workbenches Not all scientists are coders.  By usingfront-end visualizations and eliminating the need for lower-level coding (ie, shell scripts)… …it is easier for scientists to do and share their work. DataONE 17 http://www.flickr.com/photos/wouterverhelst/362538835/
Workflow Workbenches This is a common way how workflows are ‘sold’. DataONE 18 http://www.flickr.com/photos/amagill/3366720659/
Workflow Workbenches This is a common way how workflows are ‘sold’. However, the reality isn't quite there yet. DataONE 19 http://www.flickr.com/photos/amagill/3366720659/
Workflow Workbenches This is a common way how workflows are ‘sold’. However, the reality isn't quite there yet. Often it is just replacing one style of coding (conventional) with another (workflows). DataONE 20 http://www.flickr.com/photos/amagill/3366720659/
Workflow Workbenches This is a common way how workflows are ‘sold’. However, the reality isn't quite there yet. Often it is just replacing one style of coding (conventional) with another (workflows). We’re trying to see if we can get to the bottom of how the promises cash out.  DataONE 21 http://www.flickr.com/photos/amagill/3366720659/
Our Study However, there have been few studies done looking at how these workflows work. DataONE 22 http://www.flickr.com/photos/eleaf/2536358399
Our Study How do we classify workflows? DataONE 23 http://www.flickr.com/photos/eleaf/2536358399
Our Study How do we classify workflows? Where do existing workflow systems fall short?  DataONE 24 http://www.flickr.com/photos/eleaf/2536358399
Our Study How do we classify workflows? Where do existing workflow systems fall short?  How can the process of creating workflows be improved? DataONE 25 http://www.flickr.com/photos/eleaf/2536358399
Our Study How do we classify workflows? Where do existing workflow systems fall short?  How can the process of creating workflows be improved? How about executing them? DataONE 26 http://www.flickr.com/photos/eleaf/2536358399
Our Study How do we classify workflows? Where do existing workflow systems fall short?  How can the process of creating workflows be improved? How about executing them? And sharing them? DataONE 27 http://www.flickr.com/photos/eleaf/2536358399
Our Study Some studies have been done. DataONE 28
Our Study Some studies have been done. For example,  as much as 30% of workflow components have been assessed to be so-called data conversion shims [4].  DataONE 29
Our Study Some studies have been done. For example,  as much as 30% of workflow components have been assessed to be so-called data conversion shims [4]. This large percentage and the difficulty of developing custom shims suggest that workflow design technology can still be improved.  DataONE 30
Our Study But most importantly, these studies have not significantly changed the way we use workflows.  DataONE 31
Our Study But most importantly, these studies have not significantly changed the way we use workflows.  In some cases, studies run on the same data came up with different results, which suggests that open data alone does not lead to reproducible science [5]. DataONE 32
Our Study But most importantly, these studies have not significantly changed the way we use workflows.  In some cases, studies run on the same data came up with different results, which suggests that open data alone does not lead to reproducible science [5]. Therefore, a greater understanding of workflows and how we can most adequately implement them into open science is called for. DataONE 33
Our Study We are analyzing a wide variety of workflow systems and publicly available workflows.   DataONE 34
Our Study We are analyzing a wide variety of workflow systems and publicly available workflows.   Our main repository: http://www.myexperiment.org DataONE 35
Our Study We are analyzing a wide variety of workflow systems and publicly available workflows.   Our main repository: http://www.myexperiment.org Est. 2007 DataONE 36
Our Study We are analyzing a wide variety of workflow systems and publicly available workflows.   Our main repository: http://www.myexperiment.org Est. 2007 4500+ users DataONE 37
Our Study We are analyzing a wide variety of workflow systems and publicly available workflows.   Our main repository: http://www.myexperiment.org Est. 2007 4500+ users 1850+ workflows (mostly Taverna 1, 2, and RapidMiner) DataONE 38
Our Study We are analyzing a wide variety of workflow systems and publicly available workflows.   Our main repository: http://www.myexperiment.org Est. 2007 4500+ users 1850+ workflows (mostly Taverna 1, 2, and RapidMiner) Minable by SPARQL DataONE 39
Our Study Methods:  For each workflow, we’re gathering three tiers of information.  DataONE 40 http://www.flickr.com/photos/jpvargas/83258973/
Our Study Methods:  For each workflow, we’re gathering three tiers of information.  DataONE 41 Meta-Data 		Description `Worth’ http://www.flickr.com/photos/jpvargas/83258973/
Tier 1 Metadata: Workflow source Workflow system Works on run Area of research Type Description User User total uploads	 Published citations Downloads Date uploaded DataONE 42
Tier 2 Description: Foreign components	 QA/QC steps Visual Output Number of inputs Intermediate input Linear Embedded Embedded details Number of databases Type conversion Tag conversion Multiple outputs Processing Stats Scalable Smart reruns provenance retained Multipurpose research mining Query Loop Grid Accounts necessary External results DataONE 43
Tier 3 `Worth’: Sufficiency of metadata Sufficiency of Natural Language Description Reuse in published articles Relevant issues based on the system it was created in. DataONE 44
Research Hypotheses Most workflows perform simple, but repetitive data acquisition tasks as opposed to complex operations. DataONE 45 http://www.flickr.com/photos/nauright/5391995939/
Research Hypotheses Most workflows perform simple, but repetitive data acquisition tasks as opposed to complex operations. Workflows are becoming more complex over time. DataONE 46 http://www.flickr.com/photos/nauright/5391995939/
Research Hypotheses Most workflows perform simple, but repetitive data acquisition tasks as opposed to complex operations. Workflows are becoming more complex over time. Workflows become more powerful over time.  DataONE 47 http://www.flickr.com/photos/nauright/5391995939/
Research Hypotheses Most workflows perform simple, but repetitive data acquisition tasks as opposed to complex operations. Workflows are becoming more complex over time. Workflows become more powerful over time.  Workflows become more complex as one gains more experience.  DataONE 48 http://www.flickr.com/photos/nauright/5391995939/
Research Hypotheses Workflow re-use is proportional to the complexity of tasks performed by the workflow. DataONE 49 http://www.flickr.com/photos/nauright/5391995939/
Research Hypotheses Workflow re-use is proportional to the complexity of tasks performed by the workflow. Workflow re-use is proportional to the sufficiency of the documentation.  DataONE 50 http://www.flickr.com/photos/nauright/5391995939/
Research Hypotheses Workflow re-use is proportional to the complexity of tasks performed by the workflow. Workflow re-use is proportional to the sufficiency of the documentation.  Reuse is proportional to the age of the workflow.  DataONE 51 http://www.flickr.com/photos/nauright/5391995939/
Research Hypotheses Workflow re-use is proportional to the complexity of tasks performed by the workflow. Workflow re-use is proportional to the sufficiency of the documentation.  Reuse is proportional to the age of the workflow.  Workflow reuse is proportional to the proficiency of the creator.  DataONE 52 http://www.flickr.com/photos/nauright/5391995939/
Data Still being gathered and analysed. DataONE 53
Data Still being gathered and analysed. We’re using myExperiment download rate as a proxy for workflow reuse. DataONE 54
Data Still being gathered and analysed. We’re using myExperiment download rate as a proxy for workflow reuse. DataONE 55
Data Still being gathered and analysed. We’re using myExperiment download rate as a proxy for workflow reuse. DataONE 56
Data One of the issues with this is the amount of workflows being created by each user.  However, this still should allow for a diachronic analysis.  DataONE 57
Conclusion Old publishing model: Write paper. 		Submit paper. 		Drink wine.  DataONE 58 http://www.flickr.com/photos/joelmontes/4762384399/
Conclusion Old publishing model: Write paper. 		Submit paper. 		Drink wine.  New publishing model: Write paper.		Submit paper.		Get feedback. 		Submit data.		Replication (?) DataONE 59 http://www.flickr.com/photos/joelmontes/4762384399/
Conclusion Better publishing model: Write paper using 	Submit paper.		Get feedback. Workflows.		Submit data.		Replication DataONE 60 http://www.flickr.com/photos/mactitioner/5595830505
Conclusion Better publishing model: Write paper using 	Submit paper.		Get feedback. Workflows.		Submit data.		Replication 		Submit workflows.	That works. DataONE 61 http://www.flickr.com/photos/mactitioner/5595830505
Conclusion Better publishing model: Write paper using 	Submit paper.		Get feedback. Workflows.		Submit data.		Replication 		Submit workflows.	That works. As this is done, questions of how effective workflows are, and how they can be utilized in the new research and publishing paradigm, might be answered. DataONE 62 http://www.flickr.com/photos/mactitioner/5595830505
References [1] Kepler Project. http://www.kepler-project.org [2] Taverna. http://www.taverna.org.uk/ [3] Vistrailshttp://www.vistrails.org/ [4] Cui Lin, Shiyong Lu, XuboFei, DarshanPai, and Jing Hua. 2009. A Task Abstraction and Mapping Approach to the Shimming Problem in Scientific Workflows. In Proceedings of the 2009 IEEE International Conference on Services Computing (SCC '09). IEEE Computer Society, Washington, DC, USA, http://dx.doi.org/10.1109/SCC.2009.77 [5]Coombes, K. R., Wang, J. & Baggerly, K. A. Microarrays: retracing steps.Nature Med.13, 1276–1277 (2007). DataONEWorkflows Project: http://notebooks.dataone.org/workflows Mendeley Research Group: http://www.mendeley.com/groups/1189721/scientific-workflows-and-workflow-systems/ DataONE 63 http://www.flickr.com/photos/wwworks/4759535950/

Weitere ähnliche Inhalte

Andere mochten auch

CP IT Monitor June 2011
CP IT Monitor June 2011CP IT Monitor June 2011
CP IT Monitor June 2011michellekegg
 
Gaia TV hospitality solution
Gaia TV hospitality solutionGaia TV hospitality solution
Gaia TV hospitality solutionsbukkapa
 
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...ASIS&T
 
Improving Integrity, Transparency, and Reproducibility Through Connection of ...
Improving Integrity, Transparency, and Reproducibility Through Connection of ...Improving Integrity, Transparency, and Reproducibility Through Connection of ...
Improving Integrity, Transparency, and Reproducibility Through Connection of ...Andrew Sallans
 
Open Science Framework (OSF): Presentation and Training
Open Science Framework (OSF): Presentation and TrainingOpen Science Framework (OSF): Presentation and Training
Open Science Framework (OSF): Presentation and TrainingAndrew Sallans
 
Building Reproducible Network Data Analysis / Visualization Workflows
Building Reproducible Network Data Analysis / Visualization WorkflowsBuilding Reproducible Network Data Analysis / Visualization Workflows
Building Reproducible Network Data Analysis / Visualization WorkflowsKeiichiro Ono
 

Andere mochten auch (11)

CP IT Monitor June 2011
CP IT Monitor June 2011CP IT Monitor June 2011
CP IT Monitor June 2011
 
Gaia TV hospitality solution
Gaia TV hospitality solutionGaia TV hospitality solution
Gaia TV hospitality solution
 
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
RDAP 16 Lightning: An Open Science Framework for Solving Institutional Challe...
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Eco-Leadership 2011
Eco-Leadership 2011Eco-Leadership 2011
Eco-Leadership 2011
 
Improving Integrity, Transparency, and Reproducibility Through Connection of ...
Improving Integrity, Transparency, and Reproducibility Through Connection of ...Improving Integrity, Transparency, and Reproducibility Through Connection of ...
Improving Integrity, Transparency, and Reproducibility Through Connection of ...
 
Open Science Framework (OSF): Presentation and Training
Open Science Framework (OSF): Presentation and TrainingOpen Science Framework (OSF): Presentation and Training
Open Science Framework (OSF): Presentation and Training
 
Enterprise Europe Network - Partnership Tool
Enterprise Europe Network - Partnership ToolEnterprise Europe Network - Partnership Tool
Enterprise Europe Network - Partnership Tool
 
Building Reproducible Network Data Analysis / Visualization Workflows
Building Reproducible Network Data Analysis / Visualization WorkflowsBuilding Reproducible Network Data Analysis / Visualization Workflows
Building Reproducible Network Data Analysis / Visualization Workflows
 

Ähnlich wie Workflow Classification and Open-Sourcing Methods

Knowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems ScienceKnowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems ScienceDavid De Roure
 
PhD Thesis: Mining abstractions in scientific workflows
PhD Thesis: Mining abstractions in scientific workflowsPhD Thesis: Mining abstractions in scientific workflows
PhD Thesis: Mining abstractions in scientific workflowsdgarijo
 
Towards an Infrastructure for Enabling Systematic Development and Research of...
Towards an Infrastructure for Enabling Systematic Development and Research of...Towards an Infrastructure for Enabling Systematic Development and Research of...
Towards an Infrastructure for Enabling Systematic Development and Research of...Rafael Ferreira da Silva
 
Sharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reportsSharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reportsGaignard Alban
 
Towards Computational Research Objects
Towards Computational Research ObjectsTowards Computational Research Objects
Towards Computational Research ObjectsDavid De Roure
 
Scientific workflow-overview-2012-01-rev-2
Scientific workflow-overview-2012-01-rev-2Scientific workflow-overview-2012-01-rev-2
Scientific workflow-overview-2012-01-rev-2Terence Critchlow
 
Towards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication SystemTowards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication SystemHerbert Van de Sompel
 
Abcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosasAbcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosasMerce Crosas
 
Distributed systems in practice, in theory (ScaleConf Colombia)
Distributed systems in practice, in theory (ScaleConf Colombia)Distributed systems in practice, in theory (ScaleConf Colombia)
Distributed systems in practice, in theory (ScaleConf Colombia)Aysylu Greenberg
 
An Overview of VIEW
An Overview of VIEWAn Overview of VIEW
An Overview of VIEWShiyong Lu
 
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...Paragon_Science_Inc
 
Accelerating Data-driven Discovery in Energy Science
Accelerating Data-driven Discovery in Energy ScienceAccelerating Data-driven Discovery in Energy Science
Accelerating Data-driven Discovery in Energy ScienceIan Foster
 
2013 06-24 Wf4Ever: Annotating research objects (PDF)
2013 06-24 Wf4Ever: Annotating research objects (PDF)2013 06-24 Wf4Ever: Annotating research objects (PDF)
2013 06-24 Wf4Ever: Annotating research objects (PDF)Stian Soiland-Reyes
 
Simulagora (Euroscipy2014 - Logilab)
Simulagora (Euroscipy2014 - Logilab)Simulagora (Euroscipy2014 - Logilab)
Simulagora (Euroscipy2014 - Logilab)Logilab
 
Networking Materials Data
Networking Materials DataNetworking Materials Data
Networking Materials DataIan Foster
 
Bhagat Myexperiment Bosc2008
Bhagat Myexperiment Bosc2008Bhagat Myexperiment Bosc2008
Bhagat Myexperiment Bosc2008bosc_2008
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 

Ähnlich wie Workflow Classification and Open-Sourcing Methods (20)

Knowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems ScienceKnowledge Infrastructure for Global Systems Science
Knowledge Infrastructure for Global Systems Science
 
PhD Thesis: Mining abstractions in scientific workflows
PhD Thesis: Mining abstractions in scientific workflowsPhD Thesis: Mining abstractions in scientific workflows
PhD Thesis: Mining abstractions in scientific workflows
 
Towards an Infrastructure for Enabling Systematic Development and Research of...
Towards an Infrastructure for Enabling Systematic Development and Research of...Towards an Infrastructure for Enabling Systematic Development and Research of...
Towards an Infrastructure for Enabling Systematic Development and Research of...
 
Sharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reportsSharing massive data analysis: from provenance to linked experiment reports
Sharing massive data analysis: from provenance to linked experiment reports
 
Towards Computational Research Objects
Towards Computational Research ObjectsTowards Computational Research Objects
Towards Computational Research Objects
 
Scientific workflow-overview-2012-01-rev-2
Scientific workflow-overview-2012-01-rev-2Scientific workflow-overview-2012-01-rev-2
Scientific workflow-overview-2012-01-rev-2
 
Towards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication SystemTowards a Machine-Actionable Scholarly Communication System
Towards a Machine-Actionable Scholarly Communication System
 
Abcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosasAbcd iqs ssoftware-projects-mercecrosas
Abcd iqs ssoftware-projects-mercecrosas
 
Distributed systems in practice, in theory (ScaleConf Colombia)
Distributed systems in practice, in theory (ScaleConf Colombia)Distributed systems in practice, in theory (ScaleConf Colombia)
Distributed systems in practice, in theory (ScaleConf Colombia)
 
An Overview of VIEW
An Overview of VIEWAn Overview of VIEW
An Overview of VIEW
 
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
Finding Emerging Topics Using Chaos and Community Detection in Social Media G...
 
Data Citation Made Easy
Data Citation Made EasyData Citation Made Easy
Data Citation Made Easy
 
Accelerating Data-driven Discovery in Energy Science
Accelerating Data-driven Discovery in Energy ScienceAccelerating Data-driven Discovery in Energy Science
Accelerating Data-driven Discovery in Energy Science
 
2013 06-24 Wf4Ever: Annotating research objects (PDF)
2013 06-24 Wf4Ever: Annotating research objects (PDF)2013 06-24 Wf4Ever: Annotating research objects (PDF)
2013 06-24 Wf4Ever: Annotating research objects (PDF)
 
The Chemtools LaBLog
The Chemtools LaBLogThe Chemtools LaBLog
The Chemtools LaBLog
 
UCIAD overview
UCIAD overviewUCIAD overview
UCIAD overview
 
Simulagora (Euroscipy2014 - Logilab)
Simulagora (Euroscipy2014 - Logilab)Simulagora (Euroscipy2014 - Logilab)
Simulagora (Euroscipy2014 - Logilab)
 
Networking Materials Data
Networking Materials DataNetworking Materials Data
Networking Materials Data
 
Bhagat Myexperiment Bosc2008
Bhagat Myexperiment Bosc2008Bhagat Myexperiment Bosc2008
Bhagat Myexperiment Bosc2008
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 

Mehr von Richard Littauer

Academic Research in the Blogosphere: Adapting to New Risks and Opportunities...
Academic Research in the Blogosphere: Adapting to New Risks and Opportunities...Academic Research in the Blogosphere: Adapting to New Risks and Opportunities...
Academic Research in the Blogosphere: Adapting to New Risks and Opportunities...Richard Littauer
 
Named Entity Recognition - ACL 2011 Presentation
Named Entity Recognition - ACL 2011 PresentationNamed Entity Recognition - ACL 2011 Presentation
Named Entity Recognition - ACL 2011 PresentationRichard Littauer
 
Barzilay & Lapata 2008 presentation
Barzilay & Lapata 2008 presentationBarzilay & Lapata 2008 presentation
Barzilay & Lapata 2008 presentationRichard Littauer
 
Building Corpora from Social Media
Building Corpora from Social MediaBuilding Corpora from Social Media
Building Corpora from Social MediaRichard Littauer
 
Visualising Typological Relationships: Plotting WALS with Heat Maps
Visualising Typological Relationships: Plotting WALS with Heat MapsVisualising Typological Relationships: Plotting WALS with Heat Maps
Visualising Typological Relationships: Plotting WALS with Heat MapsRichard Littauer
 
On Tocharian Exceptionality to the centum/satem Isogloss
On Tocharian Exceptionality to the centum/satem IsoglossOn Tocharian Exceptionality to the centum/satem Isogloss
On Tocharian Exceptionality to the centum/satem IsoglossRichard Littauer
 
The Evolution of Morphological Agreement
The Evolution of Morphological AgreementThe Evolution of Morphological Agreement
The Evolution of Morphological AgreementRichard Littauer
 
Evolution of Morphological Agreement - Peche Kucha
Evolution of Morphological Agreement - Peche KuchaEvolution of Morphological Agreement - Peche Kucha
Evolution of Morphological Agreement - Peche KuchaRichard Littauer
 
The Evolution of Speech Segmentation: A Computer Simulation
The Evolution of Speech Segmentation: A Computer SimulationThe Evolution of Speech Segmentation: A Computer Simulation
The Evolution of Speech Segmentation: A Computer SimulationRichard Littauer
 
Towards Open Methods: Using Scientific Workflows in Linguistics
Towards Open Methods: Using Scientific Workflows in LinguisticsTowards Open Methods: Using Scientific Workflows in Linguistics
Towards Open Methods: Using Scientific Workflows in LinguisticsRichard Littauer
 
A Reanalysis of Anatomical Changes for Language
A Reanalysis of Anatomical Changes for LanguageA Reanalysis of Anatomical Changes for Language
A Reanalysis of Anatomical Changes for LanguageRichard Littauer
 

Mehr von Richard Littauer (13)

Academic Research in the Blogosphere: Adapting to New Risks and Opportunities...
Academic Research in the Blogosphere: Adapting to New Risks and Opportunities...Academic Research in the Blogosphere: Adapting to New Risks and Opportunities...
Academic Research in the Blogosphere: Adapting to New Risks and Opportunities...
 
Named Entity Recognition - ACL 2011 Presentation
Named Entity Recognition - ACL 2011 PresentationNamed Entity Recognition - ACL 2011 Presentation
Named Entity Recognition - ACL 2011 Presentation
 
Marcu 2000 presentation
Marcu 2000 presentationMarcu 2000 presentation
Marcu 2000 presentation
 
Barzilay & Lapata 2008 presentation
Barzilay & Lapata 2008 presentationBarzilay & Lapata 2008 presentation
Barzilay & Lapata 2008 presentation
 
Saarland and UdS
Saarland and UdSSaarland and UdS
Saarland and UdS
 
Building Corpora from Social Media
Building Corpora from Social MediaBuilding Corpora from Social Media
Building Corpora from Social Media
 
Visualising Typological Relationships: Plotting WALS with Heat Maps
Visualising Typological Relationships: Plotting WALS with Heat MapsVisualising Typological Relationships: Plotting WALS with Heat Maps
Visualising Typological Relationships: Plotting WALS with Heat Maps
 
On Tocharian Exceptionality to the centum/satem Isogloss
On Tocharian Exceptionality to the centum/satem IsoglossOn Tocharian Exceptionality to the centum/satem Isogloss
On Tocharian Exceptionality to the centum/satem Isogloss
 
The Evolution of Morphological Agreement
The Evolution of Morphological AgreementThe Evolution of Morphological Agreement
The Evolution of Morphological Agreement
 
Evolution of Morphological Agreement - Peche Kucha
Evolution of Morphological Agreement - Peche KuchaEvolution of Morphological Agreement - Peche Kucha
Evolution of Morphological Agreement - Peche Kucha
 
The Evolution of Speech Segmentation: A Computer Simulation
The Evolution of Speech Segmentation: A Computer SimulationThe Evolution of Speech Segmentation: A Computer Simulation
The Evolution of Speech Segmentation: A Computer Simulation
 
Towards Open Methods: Using Scientific Workflows in Linguistics
Towards Open Methods: Using Scientific Workflows in LinguisticsTowards Open Methods: Using Scientific Workflows in Linguistics
Towards Open Methods: Using Scientific Workflows in Linguistics
 
A Reanalysis of Anatomical Changes for Language
A Reanalysis of Anatomical Changes for LanguageA Reanalysis of Anatomical Changes for Language
A Reanalysis of Anatomical Changes for Language
 

Kürzlich hochgeladen

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 

Kürzlich hochgeladen (20)

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 

Workflow Classification and Open-Sourcing Methods

  • 1. Workflow Classification and Open-Sourcing Methods: Towards a New Publication Model Richard Littauer, Karthik Ram, Bertram Ludäscher, William Michener, Rebecca Koskela DataONE 1
  • 2. Scientific Workflows Tools that help scientists: Automate repetitive or difficult work DataONE 2
  • 3. Scientific Workflows Tools that help scientists: Automate repetitive or difficult work Provide reproducibility to their experiments DataONE 3
  • 4. Scientific Workflows Tools that help scientists: Automate repetitive or difficult work Provide reproducibility to their experiments Track provenance DataONE 4
  • 5. Scientific Workflows Tools that help scientists: Automate repetitive or difficult work Provide reproducibility to their experiments Track provenance Share their data with other scientists DataONE 5
  • 9. Workflow Workbenches These facilitate: DataONE 9 Creation http://www.flickr.com/photos/ideacreamanuelapps/3542203718/
  • 10. Workflow Workbenches These facilitate: DataONE 10 Mapping http://www.flickr.com/photos/fatguyinalittlecoat/5716492273
  • 11. Workflow Workbenches These facilitate: DataONE 11 Scheduling http://www.flickr.com/photos/silent-penguin/232394/
  • 12. Workflow Workbenches These facilitate: DataONE 12 Execution http://www.flickr.com/photos/pagedooley/4039784738/
  • 13. Workflow Workbenches These facilitate: DataONE 13 Visualisation http://www.flickr.com/photos/cnon/5698746966/
  • 14. Workflow Workbenches These facilitate: DataONE 14 Re-use http://www.flickr.com/photos/nihonbunka/32774212/
  • 15. Workflow Workbenches Not all scientists are coders. DataONE 15
  • 16. Workflow Workbenches Not all scientists are coders. By using front-end visualizations and eliminating the need for lower-level coding (ie, shell scripts)… DataONE 16
  • 17. Workflow Workbenches Not all scientists are coders. By usingfront-end visualizations and eliminating the need for lower-level coding (ie, shell scripts)… …it is easier for scientists to do and share their work. DataONE 17 http://www.flickr.com/photos/wouterverhelst/362538835/
  • 18. Workflow Workbenches This is a common way how workflows are ‘sold’. DataONE 18 http://www.flickr.com/photos/amagill/3366720659/
  • 19. Workflow Workbenches This is a common way how workflows are ‘sold’. However, the reality isn't quite there yet. DataONE 19 http://www.flickr.com/photos/amagill/3366720659/
  • 20. Workflow Workbenches This is a common way how workflows are ‘sold’. However, the reality isn't quite there yet. Often it is just replacing one style of coding (conventional) with another (workflows). DataONE 20 http://www.flickr.com/photos/amagill/3366720659/
  • 21. Workflow Workbenches This is a common way how workflows are ‘sold’. However, the reality isn't quite there yet. Often it is just replacing one style of coding (conventional) with another (workflows). We’re trying to see if we can get to the bottom of how the promises cash out. DataONE 21 http://www.flickr.com/photos/amagill/3366720659/
  • 22. Our Study However, there have been few studies done looking at how these workflows work. DataONE 22 http://www.flickr.com/photos/eleaf/2536358399
  • 23. Our Study How do we classify workflows? DataONE 23 http://www.flickr.com/photos/eleaf/2536358399
  • 24. Our Study How do we classify workflows? Where do existing workflow systems fall short? DataONE 24 http://www.flickr.com/photos/eleaf/2536358399
  • 25. Our Study How do we classify workflows? Where do existing workflow systems fall short? How can the process of creating workflows be improved? DataONE 25 http://www.flickr.com/photos/eleaf/2536358399
  • 26. Our Study How do we classify workflows? Where do existing workflow systems fall short? How can the process of creating workflows be improved? How about executing them? DataONE 26 http://www.flickr.com/photos/eleaf/2536358399
  • 27. Our Study How do we classify workflows? Where do existing workflow systems fall short? How can the process of creating workflows be improved? How about executing them? And sharing them? DataONE 27 http://www.flickr.com/photos/eleaf/2536358399
  • 28. Our Study Some studies have been done. DataONE 28
  • 29. Our Study Some studies have been done. For example,  as much as 30% of workflow components have been assessed to be so-called data conversion shims [4]. DataONE 29
  • 30. Our Study Some studies have been done. For example,  as much as 30% of workflow components have been assessed to be so-called data conversion shims [4]. This large percentage and the difficulty of developing custom shims suggest that workflow design technology can still be improved. DataONE 30
  • 31. Our Study But most importantly, these studies have not significantly changed the way we use workflows. DataONE 31
  • 32. Our Study But most importantly, these studies have not significantly changed the way we use workflows. In some cases, studies run on the same data came up with different results, which suggests that open data alone does not lead to reproducible science [5]. DataONE 32
  • 33. Our Study But most importantly, these studies have not significantly changed the way we use workflows. In some cases, studies run on the same data came up with different results, which suggests that open data alone does not lead to reproducible science [5]. Therefore, a greater understanding of workflows and how we can most adequately implement them into open science is called for. DataONE 33
  • 34. Our Study We are analyzing a wide variety of workflow systems and publicly available workflows.  DataONE 34
  • 35. Our Study We are analyzing a wide variety of workflow systems and publicly available workflows.  Our main repository: http://www.myexperiment.org DataONE 35
  • 36. Our Study We are analyzing a wide variety of workflow systems and publicly available workflows.  Our main repository: http://www.myexperiment.org Est. 2007 DataONE 36
  • 37. Our Study We are analyzing a wide variety of workflow systems and publicly available workflows.  Our main repository: http://www.myexperiment.org Est. 2007 4500+ users DataONE 37
  • 38. Our Study We are analyzing a wide variety of workflow systems and publicly available workflows.  Our main repository: http://www.myexperiment.org Est. 2007 4500+ users 1850+ workflows (mostly Taverna 1, 2, and RapidMiner) DataONE 38
  • 39. Our Study We are analyzing a wide variety of workflow systems and publicly available workflows.  Our main repository: http://www.myexperiment.org Est. 2007 4500+ users 1850+ workflows (mostly Taverna 1, 2, and RapidMiner) Minable by SPARQL DataONE 39
  • 40. Our Study Methods: For each workflow, we’re gathering three tiers of information. DataONE 40 http://www.flickr.com/photos/jpvargas/83258973/
  • 41. Our Study Methods: For each workflow, we’re gathering three tiers of information. DataONE 41 Meta-Data Description `Worth’ http://www.flickr.com/photos/jpvargas/83258973/
  • 42. Tier 1 Metadata: Workflow source Workflow system Works on run Area of research Type Description User User total uploads Published citations Downloads Date uploaded DataONE 42
  • 43. Tier 2 Description: Foreign components QA/QC steps Visual Output Number of inputs Intermediate input Linear Embedded Embedded details Number of databases Type conversion Tag conversion Multiple outputs Processing Stats Scalable Smart reruns provenance retained Multipurpose research mining Query Loop Grid Accounts necessary External results DataONE 43
  • 44. Tier 3 `Worth’: Sufficiency of metadata Sufficiency of Natural Language Description Reuse in published articles Relevant issues based on the system it was created in. DataONE 44
  • 45. Research Hypotheses Most workflows perform simple, but repetitive data acquisition tasks as opposed to complex operations. DataONE 45 http://www.flickr.com/photos/nauright/5391995939/
  • 46. Research Hypotheses Most workflows perform simple, but repetitive data acquisition tasks as opposed to complex operations. Workflows are becoming more complex over time. DataONE 46 http://www.flickr.com/photos/nauright/5391995939/
  • 47. Research Hypotheses Most workflows perform simple, but repetitive data acquisition tasks as opposed to complex operations. Workflows are becoming more complex over time. Workflows become more powerful over time. DataONE 47 http://www.flickr.com/photos/nauright/5391995939/
  • 48. Research Hypotheses Most workflows perform simple, but repetitive data acquisition tasks as opposed to complex operations. Workflows are becoming more complex over time. Workflows become more powerful over time. Workflows become more complex as one gains more experience. DataONE 48 http://www.flickr.com/photos/nauright/5391995939/
  • 49. Research Hypotheses Workflow re-use is proportional to the complexity of tasks performed by the workflow. DataONE 49 http://www.flickr.com/photos/nauright/5391995939/
  • 50. Research Hypotheses Workflow re-use is proportional to the complexity of tasks performed by the workflow. Workflow re-use is proportional to the sufficiency of the documentation. DataONE 50 http://www.flickr.com/photos/nauright/5391995939/
  • 51. Research Hypotheses Workflow re-use is proportional to the complexity of tasks performed by the workflow. Workflow re-use is proportional to the sufficiency of the documentation. Reuse is proportional to the age of the workflow. DataONE 51 http://www.flickr.com/photos/nauright/5391995939/
  • 52. Research Hypotheses Workflow re-use is proportional to the complexity of tasks performed by the workflow. Workflow re-use is proportional to the sufficiency of the documentation. Reuse is proportional to the age of the workflow. Workflow reuse is proportional to the proficiency of the creator. DataONE 52 http://www.flickr.com/photos/nauright/5391995939/
  • 53. Data Still being gathered and analysed. DataONE 53
  • 54. Data Still being gathered and analysed. We’re using myExperiment download rate as a proxy for workflow reuse. DataONE 54
  • 55. Data Still being gathered and analysed. We’re using myExperiment download rate as a proxy for workflow reuse. DataONE 55
  • 56. Data Still being gathered and analysed. We’re using myExperiment download rate as a proxy for workflow reuse. DataONE 56
  • 57. Data One of the issues with this is the amount of workflows being created by each user. However, this still should allow for a diachronic analysis. DataONE 57
  • 58. Conclusion Old publishing model: Write paper. Submit paper. Drink wine. DataONE 58 http://www.flickr.com/photos/joelmontes/4762384399/
  • 59. Conclusion Old publishing model: Write paper. Submit paper. Drink wine. New publishing model: Write paper. Submit paper. Get feedback. Submit data. Replication (?) DataONE 59 http://www.flickr.com/photos/joelmontes/4762384399/
  • 60. Conclusion Better publishing model: Write paper using Submit paper. Get feedback. Workflows. Submit data. Replication DataONE 60 http://www.flickr.com/photos/mactitioner/5595830505
  • 61. Conclusion Better publishing model: Write paper using Submit paper. Get feedback. Workflows. Submit data. Replication Submit workflows. That works. DataONE 61 http://www.flickr.com/photos/mactitioner/5595830505
  • 62. Conclusion Better publishing model: Write paper using Submit paper. Get feedback. Workflows. Submit data. Replication Submit workflows. That works. As this is done, questions of how effective workflows are, and how they can be utilized in the new research and publishing paradigm, might be answered. DataONE 62 http://www.flickr.com/photos/mactitioner/5595830505
  • 63. References [1] Kepler Project. http://www.kepler-project.org [2] Taverna. http://www.taverna.org.uk/ [3] Vistrailshttp://www.vistrails.org/ [4] Cui Lin, Shiyong Lu, XuboFei, DarshanPai, and Jing Hua. 2009. A Task Abstraction and Mapping Approach to the Shimming Problem in Scientific Workflows. In Proceedings of the 2009 IEEE International Conference on Services Computing (SCC '09). IEEE Computer Society, Washington, DC, USA, http://dx.doi.org/10.1109/SCC.2009.77 [5]Coombes, K. R., Wang, J. & Baggerly, K. A. Microarrays: retracing steps.Nature Med.13, 1276–1277 (2007). DataONEWorkflows Project: http://notebooks.dataone.org/workflows Mendeley Research Group: http://www.mendeley.com/groups/1189721/scientific-workflows-and-workflow-systems/ DataONE 63 http://www.flickr.com/photos/wwworks/4759535950/