SlideShare ist ein Scribd-Unternehmen logo
1 von 55
Downloaden Sie, um offline zu lesen
The Future of the Journal


Anita de Waard , a.dewaard@elsevier.com
Disruptive Technologies Director, Elsevier Labs



June 21, 2010
Science is made of information...
Science is made of information...




   ...that gets created...
Science is made of information...




   ...that gets created...   ... and destroyed.
What is the problem?
What is the problem?



1. Researchers can’t keep track of their data.
What is the problem?



1. Researchers can’t keep track of their data.


2. Data is not stored in a way that is easy for authors.
What is the problem?



1. Researchers can’t keep track of their data.


2. Data is not stored in a way that is easy for authors.


3. For readers, article text is not linked to the underlying data.
The Vision   Work done with Ed Hovy, Phil Bourne,
             Gully Burns and Cartic Ramakrishnan
The Vision                                                        Work done with Ed Hovy, Phil Bourne,
                                                                  Gully Burns and Cartic Ramakrishnan

                                                 1. Research: Each item in the system has metadata
                        metadata                 (including provenance) and relations to other data items
                                   metadata      added to it.

       metadata




             metadata

                                      metadata
The Vision                                                        Work done with Ed Hovy, Phil Bourne,
                                                                  Gully Burns and Cartic Ramakrishnan

                                                 1. Research: Each item in the system has metadata
                        metadata                 (including provenance) and relations to other data items
                                   metadata      added to it.
                                                 2. Workflow: All data items created in the lab are added
       metadata
                                                 to a (lab-owned) workflow system.




             metadata

                                      metadata
The Vision                                                                         Work done with Ed Hovy, Phil Bourne,
                                                                                   Gully Burns and Cartic Ramakrishnan

                                                                  1. Research: Each item in the system has metadata
                                         metadata                 (including provenance) and relations to other data items
                                                    metadata      added to it.
                                                                  2. Workflow: All data items created in the lab are added
             metadata
                                                                  to a (lab-owned) workflow system.
                                                                  3. Authoring: A paper is written in an authoring tool which
                                                                  can pull data with provenance from the workflow tool in the
                                                                  appropriate representation into the document.

                    metadata

                                                       metadata




     Rats were subjected to two
     grueling tests
     (click on fig 2 to see underlying
     data). These results suggest that
     the neurological pain pro-
The Vision                                                                           Work done with Ed Hovy, Phil Bourne,
                                                                                     Gully Burns and Cartic Ramakrishnan

                                                                    1. Research: Each item in the system has metadata
                                           metadata                 (including provenance) and relations to other data items
                                                      metadata      added to it.
                                                                    2. Workflow: All data items created in the lab are added
               metadata
                                                                    to a (lab-owned) workflow system.
                                                                    3. Authoring: A paper is written in an authoring tool which
                                                                    can pull data with provenance from the workflow tool in the
                                                                    appropriate representation into the document.

                      metadata                                      4. Editing and review: Once the co-authors agree, the
                                                                    paper is ‘exposed’ to the editors, who in turn expose it to
                                                         metadata   reviewers. Reports are stored in the authoring/editing
                                                                    system, the paper gets updated, until it is validated.




       Rats were subjected to two
       grueling tests
       (click on fig 2 to see underlying
       data). These results suggest that
       the neurological pain pro-



    Review
                                   Revise
                    Edit
The Vision                                                                           Work done with Ed Hovy, Phil Bourne,
                                                                                     Gully Burns and Cartic Ramakrishnan

                                                                    1. Research: Each item in the system has metadata
                                           metadata                 (including provenance) and relations to other data items
                                                      metadata      added to it.
                                                                    2. Workflow: All data items created in the lab are added
               metadata
                                                                    to a (lab-owned) workflow system.
                                                                    3. Authoring: A paper is written in an authoring tool which
                                                                    can pull data with provenance from the workflow tool in the
                                                                    appropriate representation into the document.

                      metadata                                      4. Editing and review: Once the co-authors agree, the
                                                                    paper is ‘exposed’ to the editors, who in turn expose it to
                                                         metadata   reviewers. Reports are stored in the authoring/editing
                                                                    system, the paper gets updated, until it is validated.
                                                                    5. Publishing and distribution: When a paper is
                                                                    published, a collection of validated information is
                                                                    exposed to the world. It remains connected to its related
       Rats were subjected to two
                                                                    data item, and its heritage can be traced.
       grueling tests
       (click on fig 2 to see underlying
       data). These results suggest that
       the neurological pain pro-



    Review
                                   Revise
                    Edit
The Vision                                                                           Work done with Ed Hovy, Phil Bourne,
                                                                                     Gully Burns and Cartic Ramakrishnan

                                                                    1. Research: Each item in the system has metadata
                                           metadata                 (including provenance) and relations to other data items
                                                      metadata      added to it.
                                                                    2. Workflow: All data items created in the lab are added
               metadata
                                                                    to a (lab-owned) workflow system.
                                                                    3. Authoring: A paper is written in an authoring tool which
                                                                    can pull data with provenance from the workflow tool in the
                                                                    appropriate representation into the document.

                      metadata                                      4. Editing and review: Once the co-authors agree, the
                                                                    paper is ‘exposed’ to the editors, who in turn expose it to
                                                         metadata   reviewers. Reports are stored in the authoring/editing
                                                                    system, the paper gets updated, until it is validated.
                                                                    5. Publishing and distribution: When a paper is
                                                                    published, a collection of validated information is
                                                                    exposed to the world. It remains connected to its related
       Rats were subjected to two
                                                                    data item, and its heritage can be traced.
       grueling tests
       (click on fig 2 to see underlying
                                                                    6. User applications: distributed applications run on this
       data). These results suggest that                            ‘exposed data’ universe.
       the neurological pain pro-


                                                                                   Some other publisher
    Review
                                   Revise
                    Edit
What is needed to get there?
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
 sciences: scalable, safe, and user-friendly
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
 sciences: scalable, safe, and user-friendly
B. Authoring and reviewing tools: that enable use of rich
 and provenance-tracked elements
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
 sciences: scalable, safe, and user-friendly
B. Authoring and reviewing tools: that enable use of rich
 and provenance-tracked elements
C. Metadata standards: Standards that allow exchange of
 information on any knowledge item created in a lab,
 including provenance/privacy/IPR rights
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
 sciences: scalable, safe, and user-friendly
B. Authoring and reviewing tools: that enable use of rich
 and provenance-tracked elements
C. Metadata standards: Standards that allow exchange of
 information on any knowledge item created in a lab,
 including provenance/privacy/IPR rights
D. Social change: Scientists who store, track and annotate
 their work
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
 sciences: scalable, safe, and user-friendly
B. Authoring and reviewing tools: that enable use of rich
 and provenance-tracked elements
C. Metadata standards: Standards that allow exchange of
 information on any knowledge item created in a lab,
 including provenance/privacy/IPR rights
D. Social change: Scientists who store, track and annotate
 their work
E. Semantic/Linked Data XML repositories.
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
  sciences: scalable, safe, and user-friendly
B. Authoring and reviewing tools: that enable use of rich
  and provenance-tracked elements
C. Metadata standards: Standards that allow exchange of
  information on any knowledge item created in a lab,
  including provenance/privacy/IPR rights
D. Social change: Scientists who store, track and annotate
  their work
E. Semantic/Linked Data XML repositories.
F. Publishing systems that run application servers.
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
  sciences: scalable, safe, and user-friendly   tool builders
B. Authoring and reviewing tools: that enable use of rich
  and provenance-tracked elements
C. Metadata standards: Standards that allow exchange of
  information on any knowledge item created in a lab,
  including provenance/privacy/IPR rights
D. Social change: Scientists who store, track and annotate
  their work
E. Semantic/Linked Data XML repositories.
F. Publishing systems that run application servers.
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
  sciences: scalable, safe, and user-friendly   tool builders
B. Authoring and reviewing tools: that enable use of rich
  and provenance-tracked elements               tool builders
C. Metadata standards: Standards that allow exchange of
  information on any knowledge item created in a lab,
  including provenance/privacy/IPR rights
D. Social change: Scientists who store, track and annotate
  their work
E. Semantic/Linked Data XML repositories.
F. Publishing systems that run application servers.
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
  sciences: scalable, safe, and user-friendly   tool builders
B. Authoring and reviewing tools: that enable use of rich
  and provenance-tracked elements               tool builders
C. Metadata standards: Standards that allow exchange of
  information on any knowledge item created in a lab,
  including provenance/privacy/IPR rights standards bodies
D. Social change: Scientists who store, track and annotate
  their work
E. Semantic/Linked Data XML repositories.
F. Publishing systems that run application servers.
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
  sciences: scalable, safe, and user-friendly      tool builders
B. Authoring and reviewing tools: that enable use of rich
  and provenance-tracked elements                  tool builders
C. Metadata standards: Standards that allow exchange of
  information on any knowledge item created in a lab,
  including provenance/privacy/IPR rights standards bodies
D. Social change: Scientists who store, track and annotate
  their work              institutes, funding bodies, individuals
E. Semantic/Linked Data XML repositories.
F. Publishing systems that run application servers.
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
  sciences: scalable, safe, and user-friendly      tool builders
B. Authoring and reviewing tools: that enable use of rich
  and provenance-tracked elements                  tool builders
C. Metadata standards: Standards that allow exchange of
  information on any knowledge item created in a lab,
  including provenance/privacy/IPR rights standards bodies
D. Social change: Scientists who store, track and annotate
  their work              institutes, funding bodies, individuals
E. Semantic/Linked Data XML repositories.             publishers
F. Publishing systems that run application servers.
What is needed to get there?
A. Workflow tools: Linked-data-based workflow tools for all
  sciences: scalable, safe, and user-friendly      tool builders
B. Authoring and reviewing tools: that enable use of rich
  and provenance-tracked elements                  tool builders
C. Metadata standards: Standards that allow exchange of
  information on any knowledge item created in a lab,
  including provenance/privacy/IPR rights standards bodies
D. Social change: Scientists who store, track and annotate
  their work              institutes, funding bodies, individuals
E. Semantic/Linked Data XML repositories.             publishers
F. Publishing systems that run application servers.
                                                      publishers
A. Workflow tools are emerging
A. Workflow tools are emerging



                                http://MyExperiment.org
A. Workflow tools are emerging
         http://VisTrails.org




                                http://MyExperiment.org
A. Workflow tools are emerging
                http://VisTrails.org




                                       http://MyExperiment.org




    http://wings.isi.edu/
B. Authoring ‘ecosystems: SWAN
       person
                      SWAN Semantic Relationships
                                                                                comment


                                                                                         concept
                                      Claim           publication
               hypothesis

                                                                                          gene
                                      Claim           publication

       group

                                                      publication

     Public            Excel file
                                                                                  PDFs

     Private
                                                                                   comment

                                                    publication     person
                                    Claim



                                                    publication




                                              MSWORD file                    Slide by Tim Clark
B. Authoring ‘ecosystems: SWAN
           person
                          SWAN Semantic Relationships
                                                                             annotates
                                                                                                       comment
   authoredBy
                                makes             hasEvidence
                                                                                                               concept
                                                                                annotates
                                          Claim                  publication
   shareWith       hypothesis
                                makes             hasEvidence
                                                                                                                 gene
                                          Claim                  publication
                                                  hasEvidence                    discussedIn
           group

                                                                 publication

         Public            Excel file              describes                 describes
                                                                                                        PDFs

         Private                makes         hasEvidence                      annotates
                                                                                                         comment

                                                               publication          person
                                        Claim
                                             hasEvidence                authoredBy          authorOf


                                                               publication
                                                                         shareWith
                                                   describes

                                                    MSWORD file                                  Slide by Tim Clark
C. Metadata: HCLS SiG Scientific Discourse
http://esw.w3.org/HCLSIG/SWANSIOC:
Project Description
Provide a Semantic Web platform for biomedical discourse which can be
evolved over time into a more general facility for many types of scientific
discourse, and which is linked to key biological categories specified by
ontologies.
Discourse categories should include research questions, scientific
assertions or claims, hypotheses, comments and discussion, experiments,
data, publications, citations, and evidence.
Our primary scientific use cases will be derived from problems in digital
scientific communications and web-based research collaboratories
supporting research in neurological disorders and therapies.
The scientific use cases will motivate a series of informatics use cases
which can later be generalized across wider areas of biology and
medicine.
C. Metadata: SWAN

     The Knowledge Ecosystem:
         Interlocking Cycles of Research
               Draw conclusions                  Draw conclusions


                                  Communicate
                                                                      Collect data
Collect data



           Perform                                                    Perform
           experiment              Gather info                        experiment




                                  Synthesize
                Create/modify                         Create/modify
                hypothesis                            hypothesis
                                   SWAN
                                                                        Slide by Tim Clark
C. Metadata: Annotation Ontology
    foaf:person             rdf:Type

                                           http://www.ht.org/
                                               foaf.rdf#me

       June 1, 2010
                                                pav:createdBy

                   pav:createdOn                                      ann:annotates                   http://anyurl.com/sf_pat01.html


                     hasTag

                                                rdf:Type
                                   hasTopic
       Tag
                                                       Atomic

           tag
                                   FMA:skull                ann:context
                                                                               onDocument

Linear skull fracture



                                                                 rdf:Type
Other annotations on the same document:
1. Atomic annotation on image (tag: “hematoma”)
2. General annotation (tag: “injury”)                                     InitEndCornerSelector
                                                                                                             init
Other annotations on similar documents:                                                                                      (304, 507)
1. General annotation (tag: “skull fracture”)                                  rdfs:SubClassOf
                                                                                                             end
                                                                                                                             (380, 618)
                                                                                      ImageSelector
                                                                                                                       Slide by Tim Clark
D. Linked Data: E.g. for Elsevier
D. Linked Data: E.g. for Elsevier




 <ce:section id=#123>
D. Linked Data: E.g. for Elsevier




                         this says
 <ce:section id=#123>                mice like cheese
D. Linked Data: E.g. for Elsevier




                                       said @anita
                                     on May 31 2010




                         this says
 <ce:section id=#123>                mice like cheese
D. Linked Data: E.g. for Elsevier

                                          but we all know
                                       she was jetlagged then


                                       said @anita
                                     on May 31 2010




                         this says
 <ce:section id=#123>                mice like cheese
D. Linked Data: E.g. for Elsevier
             immutable, $$, proprietary
                                                       but we all know
                                                    she was jetlagged then


                                                    said @anita
                                                  on May 31 2010




                                      this says
 <ce:section id=#123>                             mice like cheese
D. Linked Data: E.g. for Elsevier
             immutable, $$, proprietary     dynamic, personal, task-driven, - open?
                                                           but we all know
                                                        she was jetlagged then


                                                        said @anita
                                                      on May 31 2010




                                      this says
 <ce:section id=#123>                                mice like cheese
D. What to link? Semantic annotation grid
D. What to link? Semantic annotation grid
D. What to link? Semantic annotation grid
Granularity
  collection

  document
      claim

      triple

     entity
D. What to link? Semantic annotation grid
Granularity
  collection

  document
      claim

      triple

     entity                                                  Moment
               measure author/editor typesetter/production reader/data minin
D. What to link? Semantic annotation grid
Granularity
  collection

  document
      claim

      triple

     entity                                                  Moment
               measure author/editor typesetter/production reader/data minin
Meansmanual

   semi-automated

automated
D. What to link? Semantic annotation grid
Granularity
  collection

  document
      claim
                     Automated Copy Editing
      triple

     entity                                                  Moment
               measure author/editor typesetter/production reader/data minin
Meansmanual

   semi-automated

automated
D. What to link? Semantic annotation grid
Granularity
  collection

  document
      claim
                     Automated Copy Editing
      triple

     entity                                                  Moment
               measure author/editor typesetter/production reader/data minin
                                                             Reflect
Meansmanual

   semi-automated

automated
D. A start: .XMP RDF in all our PDFs: DC + PRISM
E. Publishing on an Application server
E. SD as application server: an example
Next Steps:
• Fall 2010:
  ‘Beyond the PDF’: Workshop organized by Phil Bourne
  @UCSD:
   –Take one paper from his group
   –And all data that went into making that paper
   –Including all correspondence, raw data, etc.
   –Challenge: how better to represent that?
• 2010 - 2011: Try to gather resources, current efforts,
  etc. on virtual platform
• August 2011:
  FoRC: Future of Research Communications
   –Dagstuhl Workshop
   –Involve key people (include funding bodies, libraries,
    institutions) to see where bottlenecks are
• Start using these tools and writing this way!

Weitere ähnliche Inhalte

Andere mochten auch

Knowledge Media Panel U Toronto, Sept 30 2010
Knowledge Media Panel U Toronto, Sept 30 2010Knowledge Media Panel U Toronto, Sept 30 2010
Knowledge Media Panel U Toronto, Sept 30 2010
Anita de Waard
 
Creating organisation of the future
Creating organisation of the futureCreating organisation of the future
Creating organisation of the future
OPUS Management
 
'Stories that persuade with data' - talk at CENDI meeting January 9 2014
'Stories that persuade with data' - talk at CENDI meeting January 9 2014'Stories that persuade with data' - talk at CENDI meeting January 9 2014
'Stories that persuade with data' - talk at CENDI meeting January 9 2014
Anita de Waard
 

Andere mochten auch (19)

Social barriers at http://projects.iq.harvard.edu/attribution_workshop/
Social barriers at http://projects.iq.harvard.edu/attribution_workshop/Social barriers at http://projects.iq.harvard.edu/attribution_workshop/
Social barriers at http://projects.iq.harvard.edu/attribution_workshop/
 
Is Assessment Really So Horrible?
Is Assessment Really So Horrible?Is Assessment Really So Horrible?
Is Assessment Really So Horrible?
 
Epistemics
EpistemicsEpistemics
Epistemics
 
Ten Habits of Highly Effective Data
Ten Habits of Highly Effective DataTen Habits of Highly Effective Data
Ten Habits of Highly Effective Data
 
Whither Small Data?
Whither Small Data?Whither Small Data?
Whither Small Data?
 
Towards Incidental Collaboratories; Research Data Services
Towards Incidental Collaboratories; Research Data ServicesTowards Incidental Collaboratories; Research Data Services
Towards Incidental Collaboratories; Research Data Services
 
Executing the Research Paper
Executing the Research PaperExecuting the Research Paper
Executing the Research Paper
 
Knowledge Media Panel U Toronto, Sept 30 2010
Knowledge Media Panel U Toronto, Sept 30 2010Knowledge Media Panel U Toronto, Sept 30 2010
Knowledge Media Panel U Toronto, Sept 30 2010
 
De Waard Carusi
De Waard CarusiDe Waard Carusi
De Waard Carusi
 
Creating organisation of the future
Creating organisation of the futureCreating organisation of the future
Creating organisation of the future
 
Assessment
AssessmentAssessment
Assessment
 
Designing Sideways : integrating emergence with authorship
Designing Sideways : integrating emergence with authorshipDesigning Sideways : integrating emergence with authorship
Designing Sideways : integrating emergence with authorship
 
Overview of scientific discourse annotatoin
Overview of scientific discourse annotatoinOverview of scientific discourse annotatoin
Overview of scientific discourse annotatoin
 
'Stories that persuade with data' - talk at CENDI meeting January 9 2014
'Stories that persuade with data' - talk at CENDI meeting January 9 2014'Stories that persuade with data' - talk at CENDI meeting January 9 2014
'Stories that persuade with data' - talk at CENDI meeting January 9 2014
 
How to persuade with data
How to persuade with dataHow to persuade with data
How to persuade with data
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
 
How to Execute A Research Paper
How to Execute A Research PaperHow to Execute A Research Paper
How to Execute A Research Paper
 
Argumentation in biology papers
Argumentation in biology papersArgumentation in biology papers
Argumentation in biology papers
 

Ähnlich wie Vu210610futurejournal

Ähnlich wie Vu210610futurejournal (8)

The Future of the Journal And Applications in an Open Scientific Ecosystem
The Future of the Journal And Applications in an Open Scientific Ecosystem The Future of the Journal And Applications in an Open Scientific Ecosystem
The Future of the Journal And Applications in an Open Scientific Ecosystem
 
Research 3.0 & the Future of Scholarly Communications
Research 3.0 & the Future of Scholarly CommunicationsResearch 3.0 & the Future of Scholarly Communications
Research 3.0 & the Future of Scholarly Communications
 
Datum in action jisc final event 23032012 v1 1 linked
Datum in action jisc final event 23032012 v1 1 linkedDatum in action jisc final event 23032012 v1 1 linked
Datum in action jisc final event 23032012 v1 1 linked
 
Description &amp; annotation of biomedical experimental data sets: work in p...
Description &amp; annotation of biomedical experimental data sets:  work in p...Description &amp; annotation of biomedical experimental data sets:  work in p...
Description &amp; annotation of biomedical experimental data sets: work in p...
 
Linked data and the future of scientific publishing
Linked data and the future of scientific publishingLinked data and the future of scientific publishing
Linked data and the future of scientific publishing
 
Exploring Process Barriers to Release Public Sector Information in Local Gove...
Exploring Process Barriers to Release Public Sector Information in Local Gove...Exploring Process Barriers to Release Public Sector Information in Local Gove...
Exploring Process Barriers to Release Public Sector Information in Local Gove...
 
Towards FAIR Open Science with PID Kernel Information: RPID Testbed
Towards FAIR Open Science with PID Kernel Information: RPID TestbedTowards FAIR Open Science with PID Kernel Information: RPID Testbed
Towards FAIR Open Science with PID Kernel Information: RPID Testbed
 
Invited talk @ DCC09 workshop
Invited talk @ DCC09 workshopInvited talk @ DCC09 workshop
Invited talk @ DCC09 workshop
 

Mehr von Anita de Waard

Mehr von Anita de Waard (20)

Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data Commons
 
Enabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesEnabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring Guidelines
 
Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.
 
Data, Data Everywhere: What's A Publisher to Do?
Data, Data Everywhere: What's  A Publisher to Do?Data, Data Everywhere: What's  A Publisher to Do?
Data, Data Everywhere: What's A Publisher to Do?
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data Management
 
History of the future
History of the futureHistory of the future
History of the future
 
Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with Dataverse
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of Publishing
 
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
 
Data Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryData Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost Recovery
 
The Economics of Data Sharing
The Economics of Data SharingThe Economics of Data Sharing
The Economics of Data Sharing
 
Public Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingPublic Identifiers in Scholarly Publishing
Public Identifiers in Scholarly Publishing
 
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumElsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
 
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataElsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
 
Charleston Conference 2016
Charleston Conference 2016Charleston Conference 2016
Charleston Conference 2016
 
RDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupRDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest Group
 

Kürzlich hochgeladen

1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
QucHHunhnh
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
ciinovamais
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Krashi Coaching
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
fonyou31
 

Kürzlich hochgeladen (20)

Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 
Arihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdfArihant handbook biology for class 11 .pdf
Arihant handbook biology for class 11 .pdf
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17Advanced Views - Calendar View in Odoo 17
Advanced Views - Calendar View in Odoo 17
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Class 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdfClass 11th Physics NEET formula sheet pdf
Class 11th Physics NEET formula sheet pdf
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 

Vu210610futurejournal

  • 1. The Future of the Journal Anita de Waard , a.dewaard@elsevier.com Disruptive Technologies Director, Elsevier Labs June 21, 2010
  • 2. Science is made of information...
  • 3. Science is made of information... ...that gets created...
  • 4. Science is made of information... ...that gets created... ... and destroyed.
  • 5. What is the problem?
  • 6. What is the problem? 1. Researchers can’t keep track of their data.
  • 7. What is the problem? 1. Researchers can’t keep track of their data. 2. Data is not stored in a way that is easy for authors.
  • 8. What is the problem? 1. Researchers can’t keep track of their data. 2. Data is not stored in a way that is easy for authors. 3. For readers, article text is not linked to the underlying data.
  • 9. The Vision Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan
  • 10. The Vision Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan 1. Research: Each item in the system has metadata metadata (including provenance) and relations to other data items metadata added to it. metadata metadata metadata
  • 11. The Vision Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan 1. Research: Each item in the system has metadata metadata (including provenance) and relations to other data items metadata added to it. 2. Workflow: All data items created in the lab are added metadata to a (lab-owned) workflow system. metadata metadata
  • 12. The Vision Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan 1. Research: Each item in the system has metadata metadata (including provenance) and relations to other data items metadata added to it. 2. Workflow: All data items created in the lab are added metadata to a (lab-owned) workflow system. 3. Authoring: A paper is written in an authoring tool which can pull data with provenance from the workflow tool in the appropriate representation into the document. metadata metadata Rats were subjected to two grueling tests (click on fig 2 to see underlying data). These results suggest that the neurological pain pro-
  • 13. The Vision Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan 1. Research: Each item in the system has metadata metadata (including provenance) and relations to other data items metadata added to it. 2. Workflow: All data items created in the lab are added metadata to a (lab-owned) workflow system. 3. Authoring: A paper is written in an authoring tool which can pull data with provenance from the workflow tool in the appropriate representation into the document. metadata 4. Editing and review: Once the co-authors agree, the paper is ‘exposed’ to the editors, who in turn expose it to metadata reviewers. Reports are stored in the authoring/editing system, the paper gets updated, until it is validated. Rats were subjected to two grueling tests (click on fig 2 to see underlying data). These results suggest that the neurological pain pro- Review Revise Edit
  • 14. The Vision Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan 1. Research: Each item in the system has metadata metadata (including provenance) and relations to other data items metadata added to it. 2. Workflow: All data items created in the lab are added metadata to a (lab-owned) workflow system. 3. Authoring: A paper is written in an authoring tool which can pull data with provenance from the workflow tool in the appropriate representation into the document. metadata 4. Editing and review: Once the co-authors agree, the paper is ‘exposed’ to the editors, who in turn expose it to metadata reviewers. Reports are stored in the authoring/editing system, the paper gets updated, until it is validated. 5. Publishing and distribution: When a paper is published, a collection of validated information is exposed to the world. It remains connected to its related Rats were subjected to two data item, and its heritage can be traced. grueling tests (click on fig 2 to see underlying data). These results suggest that the neurological pain pro- Review Revise Edit
  • 15. The Vision Work done with Ed Hovy, Phil Bourne, Gully Burns and Cartic Ramakrishnan 1. Research: Each item in the system has metadata metadata (including provenance) and relations to other data items metadata added to it. 2. Workflow: All data items created in the lab are added metadata to a (lab-owned) workflow system. 3. Authoring: A paper is written in an authoring tool which can pull data with provenance from the workflow tool in the appropriate representation into the document. metadata 4. Editing and review: Once the co-authors agree, the paper is ‘exposed’ to the editors, who in turn expose it to metadata reviewers. Reports are stored in the authoring/editing system, the paper gets updated, until it is validated. 5. Publishing and distribution: When a paper is published, a collection of validated information is exposed to the world. It remains connected to its related Rats were subjected to two data item, and its heritage can be traced. grueling tests (click on fig 2 to see underlying 6. User applications: distributed applications run on this data). These results suggest that ‘exposed data’ universe. the neurological pain pro- Some other publisher Review Revise Edit
  • 16. What is needed to get there?
  • 17. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly
  • 18. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements
  • 19. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights
  • 20. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights D. Social change: Scientists who store, track and annotate their work
  • 21. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights D. Social change: Scientists who store, track and annotate their work E. Semantic/Linked Data XML repositories.
  • 22. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights D. Social change: Scientists who store, track and annotate their work E. Semantic/Linked Data XML repositories. F. Publishing systems that run application servers.
  • 23. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly tool builders B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights D. Social change: Scientists who store, track and annotate their work E. Semantic/Linked Data XML repositories. F. Publishing systems that run application servers.
  • 24. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly tool builders B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements tool builders C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights D. Social change: Scientists who store, track and annotate their work E. Semantic/Linked Data XML repositories. F. Publishing systems that run application servers.
  • 25. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly tool builders B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements tool builders C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights standards bodies D. Social change: Scientists who store, track and annotate their work E. Semantic/Linked Data XML repositories. F. Publishing systems that run application servers.
  • 26. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly tool builders B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements tool builders C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights standards bodies D. Social change: Scientists who store, track and annotate their work institutes, funding bodies, individuals E. Semantic/Linked Data XML repositories. F. Publishing systems that run application servers.
  • 27. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly tool builders B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements tool builders C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights standards bodies D. Social change: Scientists who store, track and annotate their work institutes, funding bodies, individuals E. Semantic/Linked Data XML repositories. publishers F. Publishing systems that run application servers.
  • 28. What is needed to get there? A. Workflow tools: Linked-data-based workflow tools for all sciences: scalable, safe, and user-friendly tool builders B. Authoring and reviewing tools: that enable use of rich and provenance-tracked elements tool builders C. Metadata standards: Standards that allow exchange of information on any knowledge item created in a lab, including provenance/privacy/IPR rights standards bodies D. Social change: Scientists who store, track and annotate their work institutes, funding bodies, individuals E. Semantic/Linked Data XML repositories. publishers F. Publishing systems that run application servers. publishers
  • 29. A. Workflow tools are emerging
  • 30. A. Workflow tools are emerging http://MyExperiment.org
  • 31. A. Workflow tools are emerging http://VisTrails.org http://MyExperiment.org
  • 32. A. Workflow tools are emerging http://VisTrails.org http://MyExperiment.org http://wings.isi.edu/
  • 33. B. Authoring ‘ecosystems: SWAN person SWAN Semantic Relationships comment concept Claim publication hypothesis gene Claim publication group publication Public Excel file PDFs Private comment publication person Claim publication MSWORD file Slide by Tim Clark
  • 34. B. Authoring ‘ecosystems: SWAN person SWAN Semantic Relationships annotates comment authoredBy makes hasEvidence concept annotates Claim publication shareWith hypothesis makes hasEvidence gene Claim publication hasEvidence discussedIn group publication Public Excel file describes describes PDFs Private makes hasEvidence annotates comment publication person Claim hasEvidence authoredBy authorOf publication shareWith describes MSWORD file Slide by Tim Clark
  • 35. C. Metadata: HCLS SiG Scientific Discourse http://esw.w3.org/HCLSIG/SWANSIOC: Project Description Provide a Semantic Web platform for biomedical discourse which can be evolved over time into a more general facility for many types of scientific discourse, and which is linked to key biological categories specified by ontologies. Discourse categories should include research questions, scientific assertions or claims, hypotheses, comments and discussion, experiments, data, publications, citations, and evidence. Our primary scientific use cases will be derived from problems in digital scientific communications and web-based research collaboratories supporting research in neurological disorders and therapies. The scientific use cases will motivate a series of informatics use cases which can later be generalized across wider areas of biology and medicine.
  • 36. C. Metadata: SWAN The Knowledge Ecosystem: Interlocking Cycles of Research Draw conclusions Draw conclusions Communicate Collect data Collect data Perform Perform experiment Gather info experiment Synthesize Create/modify Create/modify hypothesis hypothesis SWAN Slide by Tim Clark
  • 37. C. Metadata: Annotation Ontology foaf:person rdf:Type http://www.ht.org/ foaf.rdf#me June 1, 2010 pav:createdBy pav:createdOn ann:annotates http://anyurl.com/sf_pat01.html hasTag rdf:Type hasTopic Tag Atomic tag FMA:skull ann:context onDocument Linear skull fracture rdf:Type Other annotations on the same document: 1. Atomic annotation on image (tag: “hematoma”) 2. General annotation (tag: “injury”) InitEndCornerSelector init Other annotations on similar documents: (304, 507) 1. General annotation (tag: “skull fracture”) rdfs:SubClassOf end (380, 618) ImageSelector Slide by Tim Clark
  • 38. D. Linked Data: E.g. for Elsevier
  • 39. D. Linked Data: E.g. for Elsevier <ce:section id=#123>
  • 40. D. Linked Data: E.g. for Elsevier this says <ce:section id=#123> mice like cheese
  • 41. D. Linked Data: E.g. for Elsevier said @anita on May 31 2010 this says <ce:section id=#123> mice like cheese
  • 42. D. Linked Data: E.g. for Elsevier but we all know she was jetlagged then said @anita on May 31 2010 this says <ce:section id=#123> mice like cheese
  • 43. D. Linked Data: E.g. for Elsevier immutable, $$, proprietary but we all know she was jetlagged then said @anita on May 31 2010 this says <ce:section id=#123> mice like cheese
  • 44. D. Linked Data: E.g. for Elsevier immutable, $$, proprietary dynamic, personal, task-driven, - open? but we all know she was jetlagged then said @anita on May 31 2010 this says <ce:section id=#123> mice like cheese
  • 45. D. What to link? Semantic annotation grid
  • 46. D. What to link? Semantic annotation grid
  • 47. D. What to link? Semantic annotation grid Granularity collection document claim triple entity
  • 48. D. What to link? Semantic annotation grid Granularity collection document claim triple entity Moment measure author/editor typesetter/production reader/data minin
  • 49. D. What to link? Semantic annotation grid Granularity collection document claim triple entity Moment measure author/editor typesetter/production reader/data minin Meansmanual semi-automated automated
  • 50. D. What to link? Semantic annotation grid Granularity collection document claim Automated Copy Editing triple entity Moment measure author/editor typesetter/production reader/data minin Meansmanual semi-automated automated
  • 51. D. What to link? Semantic annotation grid Granularity collection document claim Automated Copy Editing triple entity Moment measure author/editor typesetter/production reader/data minin Reflect Meansmanual semi-automated automated
  • 52. D. A start: .XMP RDF in all our PDFs: DC + PRISM
  • 53. E. Publishing on an Application server
  • 54. E. SD as application server: an example
  • 55. Next Steps: • Fall 2010: ‘Beyond the PDF’: Workshop organized by Phil Bourne @UCSD: –Take one paper from his group –And all data that went into making that paper –Including all correspondence, raw data, etc. –Challenge: how better to represent that? • 2010 - 2011: Try to gather resources, current efforts, etc. on virtual platform • August 2011: FoRC: Future of Research Communications –Dagstuhl Workshop –Involve key people (include funding bodies, libraries, institutions) to see where bottlenecks are • Start using these tools and writing this way!