SlideShare ist ein Scribd-Unternehmen logo
1 von 18
| 1
Anita de Waard 0000-0002-9034-4119
VP Research Data Collaborations
Elsevier RDM Services
a.dewaard@elsevier.com
CMMI Workshop
February 6, 2016
The Economics of Data
Sharing
| 2
How do we get scientists to share their data?
How do we make data repositories sustainable?
• The economics of science
• Cost recovery models of data repositories
• Some examples that work
• Some thoughts on the future.
How do we create effective and sustainable
ecosystems for storing, sharing and reusable data—
and get people to use them?
| 3
Debit Economy (like a pie)
• Single pile of ‘stuff’ gets divided:
- Thing can only be for one person
at one time
- “If you get more, I get less”
• Examples:
- Money
- Jobs
- Samples, equipment, space, etc.
• Behaviors:
- Hoarding, secrecy
- (Cut-throat) competition
- Winning by owning
(and not sharing)
Credit Economy (like a song)
• Credit comes from visibility:
- The more you give away,
the more you benefit
- “Only if I share do I really own”
(“You need me to do you!” JW)
• Examples:
- Papers, citations
- Good ideas (if credited)
- Skills
• Behaviors:
- Open access, citation game
- Collaboration with top-X
- Winning by sharing
(to enable priority & visibility)
Two Economies of Science [1]:
[1] Paula Stephan: “How Economics Shapes Science”, Harvard University Press, 2012: http://www.jstor.org/stable/j.ctt2jbqd1
<<<DATA???
| 4
RDA IG Repository Cost Recovery
• Interviewed 22 repositories, globally
• Different income streams:
1. Structurally funded
2. Mostly data access charges
3. Mostly data deposit fees
4. Membership fees (for deposits and/or access)
5. Serial project funding
6. Supported by host institution
• Different new models under considerations:
• Sponsorships/services for the commercial sector
• Contracts for specific services offered (hosting, archiving, curation)
• Expanding the number of affiliated institutions
• Deposit fees
• More services for “national memory institutes”
• Some comments:
• Some countries structurally fund repositories (not US!)
• Some repositories embedded in scholarly practice
• Hard to come up with new models: no time, no skill sets!
| 5
Object of
Study
Raw
Data
Processed
Data
Data
With
Paper
Curated
Record
Method Analysis
Tables/
Figures
Curate
Methods Software
Four Types of Repositories:
Research
Question
NOAA: 20 TB/
NASA streaming > 24 PB/day
NASA Reverb: 12 PB Data
NSSD: > 230 TB of digital data
NSIDC: 1 PB data, : 1 PB total
ALMA Telescope: 40 TB/day
Local Storage/
Instrument Repositories
Size: PB
Nr of files: Trillions
Deep Blue (Umich): 80k
MIT Dspace: 75 k
HAL (France): 60 k
D-Space Cambr: 1.5 k
Of which data: hundreds
Institutional/Local
Repositories
Size: GB
Nr of files: Billions
Figshare: 1.2 M
DataDryad: 3 k
Dataverse: 58 k
Non-Domain
Repositories
Size: MB
Nr of files: Milliions
Domain
Repositories
PetDB: 6 k
PDB: 100 k
NIST ASD: 170 k
Size: kB
Nr of files: 100ks
Publication
| 6
YES:
• Astronomy: telescopes
• High-energy physics: accelerators
• Earth science: satellites
• Social science: censuses
• Medicine (sometimes): patient data in
large studies
• Life science: sequence data
NO:
• Low-temperature physics: cryostats
• Earth science: samples
• Materials science: catalysts,
microscopes, etc.
• Social science: interviews
• Medicine: individual patient data
• Neuroscience: microscope
Where is data sharing happening?
• Big equipment, not a single lab/person
can run
• Can’t do science without it
• Tools in place to be effective
• Small equipment, single lab/person can
run
• Can do science without sharing
• No effective tools in place
Communicate
Prepare
Observe
Analyze
Ponder
| 7
Prepare
Analyze Communicate
Prepare
Analyze Communicate
Observations
Observations
Observations
Identify entities from the start
Connecting small science
| 8
Prepare
Analyze Communicate
Prepare
Analyze Communicate
Observations
Observations
Observations
Compare outcome of interactions
with these entities
Connecting small science
| 9
Prepare
Analyze Communicate
Prepare
AnalyzeCommunicate
Observations
Observations
Observations
Build a ‘virtual reagent
spectrogram’ by comparing
how different entities
interacted in different
experiments
Think
Reason collectively!
Connecting small science
| 10
A small change for small science: Urban Legend [2]
• Encourage data sharing of raw data files + experimental metadata
• Add metadata to your experiment while you’re performing it
• Improved data practices made lab more productive and more creative, and
enabled effective and novel collaborations
• Lesson: split the data storage and curation from data sharing!
- Provide direct reward to storage: now we can find our own data!
- Enable simple upload to embargo’d data set when owner is ready.
[2] Tripathy et al, 2014: http://www.frontiersin.org/10.3389/conf.fninf.2014.18.00077/event_abstract
| 11
Researche
r
Funding
AgencyInstitution
Data
Repository
Dataset
JournalPaper
Addressing the fear of scooping with embargo’s:
1. Researcher creates datasets
2. Researcher writes paper & publishes in journal
3. (Sometimes,) dataset gets posted to repository
4. Researcher reports (post-hoc) to Institution and Funder
2
2
1
3
4
4
| 12
Researche
r
Funding
AgencyInstitution
Data
Repository
Dataset
JournalPaper
2
2
1
3
4
4
iii. No links between
data and paper
iv. Funders/Institutions informed as an afterthought
i. Too much work for researchers
ii. Data posting not mandatory
Addressing the fear of scooping with embargo’s:
| 13
Researche
r
Funding
Agency
Institution
Data
Repository
Dataset
Journal
Paper
1. Researcher creates datasets and posts to repository
(under embargo – not publicly viewable)
2. Funder is automatically notified of dataset posting
3. Researcher writes paper & publishes in journal; embargo is lifted and data linked
- NB this also allows release of non-used data for negative result and reproducibility
4. Funder and institution get report on publication and embargo lifting
2
1
1
3
3
3
4
4
Addressing the fear of scooping with embargo’s:
| 14
A System for Linking Data Links: Scholix
• ICSU-WDS/RDA Publishing Data Service Working group,
merged with National Data Service pilot
• Cross-stakeholder – with input from CrossRef, DataCite, OpenAIRE, Europe
PubMed Central, ANDS, PANGAEA, Thomson Reuters, Elsevier, and others
• Proposed long-term architecture and interoperability framework: www.scholix.org
• Operational prototype at http://dliservice.research-infrastructures.eu/#/api
(including 1.4 Million links from various sources)
• Making links between datasets and articles available could/should encourage
data citation and deposition
• Together with Force11 Data Citation Principles, encourage Research Object
citation/credit metrics.
| 15
The Commons
Cloud Provider
A
NIH
Option:
Direct Funding
NIH
BD2K
A System for A New Data Economics: NIH Data Commons
Phil Bourne, Dec15
Enables Search
Discovery Index
Indexes
Search
Engines
Cloud Provider
B
Investigator
Provides credits
Uses credits in
the Commons
User
| 16
Drivers for Data Sharing: A Study in Behavioral Economics
• Study scholarly reward systems from point of view of economics
• Develop economic model for entire scholarly rewards ecosystem:
career, prestige, tenure, finances, etc
• Two intended outcomes:
- Understanding current behavior with respect to data sharing: can we
explain what we see, and the differences between different domains?
- Theoretical foundation for recommendations for policies and practices to
stakeholders such as funders, publishers and standards bodies
• Small group working on it, planning first meeting:
- Mike Huerta (NLM), Micah Altman (MIT), Fran Berman (RPI), Carol
Tenopir (TN), Carole Palmer (UW), Greg Gordon (SSRN).
• Thoughts, join?
| 17
• The Economy of Science: pies vs. songs
- RDA Data Repositories Cost Recovery IG:
- Different types of repositories, different types of science
- Need to move from ‘small’ to ‘big’ science thinking
• Some examples of successful data sharing:
- Online electronic lab notebooks: making it too easy not to use
- RDA Scholix: linking systems of links using existing technology
- The NIH Data Commons: enabling a data economy in practice
• Some things we can do:
- Embargo pilots: circumvent the fear of scooping
- Drivers for data sharing report: science is a human endeavor
In summary:
cyberinfrastucture
| 18
Thank you!
Links:
• https://www.hivebench.com
• https://www.elsevier.com/physical-sciences/earth-and-planetary-sciences/the-
2015-international-data-rescue-award-in-the-geosciences
• http://www.journals.elsevier.com/softwarex/
• https://www.elsevier.com/books-and-journals/content-innovation/data-base-
linking
• https://rd-alliance.org/groups/rdawds-publishing-data-services-wg.html
• https://rd-alliance.org/bof-data-search.html
• https://data.mendeley.com/
• https://www.elsevier.com/connect/10-aspects-of-highly-effective-research-data
• https://www.force11.org/
• http://www.nationaldataservice.org/
• https://rd-alliance.org/
• https://www.elsevier.com/about/open-science/research-data
Anita de Waard, a.dewaard@elsevier.com

Weitere ähnliche Inhalte

Was ist angesagt?

THOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing ElsevierTHOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing ElsevierMaaike Duine
 
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsAnita de Waard
 
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
2017 05 03 Implementing Pure at UWA - ANDS Webinar SeriesKatina Toufexis
 
THOR Workshop - Data Publishing
THOR Workshop - Data PublishingTHOR Workshop - Data Publishing
THOR Workshop - Data PublishingMaaike Duine
 
Data Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryData Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryAnita de Waard
 
Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseAnita de Waard
 
University of Northumbria Research
University of Northumbria ResearchUniversity of Northumbria Research
University of Northumbria ResearchKevin Ashley
 
December 16, 2015 NISO Webinar: Two-Part Webinar: Emerging Resource Types Pa...
December 16, 2015 NISO Webinar: Two-Part Webinar: Emerging Resource Types  Pa...December 16, 2015 NISO Webinar: Two-Part Webinar: Emerging Resource Types  Pa...
December 16, 2015 NISO Webinar: Two-Part Webinar: Emerging Resource Types Pa...DeVonne Parks, CEM
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?Anita de Waard
 
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...datascienceiqss
 
NIH BD2K DataMed metadata model - Force11, 2016
NIH BD2K DataMed metadata model - Force11, 2016NIH BD2K DataMed metadata model - Force11, 2016
NIH BD2K DataMed metadata model - Force11, 2016Susanna-Assunta Sansone
 
Dataverse in the Universe of Data by Christine L. Borgman
Dataverse in the Universe of Data by Christine L. BorgmanDataverse in the Universe of Data by Christine L. Borgman
Dataverse in the Universe of Data by Christine L. Borgmandatascienceiqss
 
Comeaux RDAP11 Data Archives in Federal Agencies
Comeaux RDAP11 Data Archives in Federal AgenciesComeaux RDAP11 Data Archives in Federal Agencies
Comeaux RDAP11 Data Archives in Federal AgenciesASIS&T
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identificationguest453b14
 

Was ist angesagt? (20)

THOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing ElsevierTHOR Workshop - Data Publishing Elsevier
THOR Workshop - Data Publishing Elsevier
 
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
 
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
 
THOR Workshop - Data Publishing
THOR Workshop - Data PublishingTHOR Workshop - Data Publishing
THOR Workshop - Data Publishing
 
Data Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryData Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost Recovery
 
Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with Dataverse
 
University of Northumbria Research
University of Northumbria ResearchUniversity of Northumbria Research
University of Northumbria Research
 
December 16, 2015 NISO Webinar: Two-Part Webinar: Emerging Resource Types Pa...
December 16, 2015 NISO Webinar: Two-Part Webinar: Emerging Resource Types  Pa...December 16, 2015 NISO Webinar: Two-Part Webinar: Emerging Resource Types  Pa...
December 16, 2015 NISO Webinar: Two-Part Webinar: Emerging Resource Types Pa...
 
Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?
 
Journal Data Requirements
Journal Data Requirements Journal Data Requirements
Journal Data Requirements
 
BioSharing - Update - Feb2016
BioSharing - Update - Feb2016BioSharing - Update - Feb2016
BioSharing - Update - Feb2016
 
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
Preservation of Research Data: Dataverse / Archivematica Integration by Allan...
 
Jan Brase: Data and Libraries - the DataCite consortium
Jan Brase: Data and Libraries - the DataCite consortiumJan Brase: Data and Libraries - the DataCite consortium
Jan Brase: Data and Libraries - the DataCite consortium
 
NIH BD2K DataMed metadata model - Force11, 2016
NIH BD2K DataMed metadata model - Force11, 2016NIH BD2K DataMed metadata model - Force11, 2016
NIH BD2K DataMed metadata model - Force11, 2016
 
Baker - Evolution of Data Products and Designated Audiences
Baker - Evolution of Data Products and Designated AudiencesBaker - Evolution of Data Products and Designated Audiences
Baker - Evolution of Data Products and Designated Audiences
 
Valen Metadata and the [Data] Repository
Valen Metadata and the [Data] RepositoryValen Metadata and the [Data] Repository
Valen Metadata and the [Data] Repository
 
Publishers and RDM
Publishers and RDMPublishers and RDM
Publishers and RDM
 
Dataverse in the Universe of Data by Christine L. Borgman
Dataverse in the Universe of Data by Christine L. BorgmanDataverse in the Universe of Data by Christine L. Borgman
Dataverse in the Universe of Data by Christine L. Borgman
 
Comeaux RDAP11 Data Archives in Federal Agencies
Comeaux RDAP11 Data Archives in Federal AgenciesComeaux RDAP11 Data Archives in Federal Agencies
Comeaux RDAP11 Data Archives in Federal Agencies
 
Dataset Citation and Identification
Dataset Citation and IdentificationDataset Citation and Identification
Dataset Citation and Identification
 

Andere mochten auch

Media future 2013 keynote - The Economist: The New Way
Media future 2013 keynote - The Economist: The New WayMedia future 2013 keynote - The Economist: The New Way
Media future 2013 keynote - The Economist: The New WayNick Blunden
 
Excellence, Innovation, Evaluation: Collaboration x Innovation
Excellence, Innovation, Evaluation: Collaboration x InnovationExcellence, Innovation, Evaluation: Collaboration x Innovation
Excellence, Innovation, Evaluation: Collaboration x InnovationCameron Neylon
 
ATEED & The Economist Group - Innovation summit
ATEED & The Economist Group - Innovation summitATEED & The Economist Group - Innovation summit
ATEED & The Economist Group - Innovation summitLuigi Wewege
 
Brand Analysis The Economist
Brand Analysis  The EconomistBrand Analysis  The Economist
Brand Analysis The EconomistAravind Suresh
 
The Economist Integrated Marketing Strategy
The Economist Integrated Marketing Strategy The Economist Integrated Marketing Strategy
The Economist Integrated Marketing Strategy Sona Martirosian
 
The Economist Brand Strategy Presentation
The Economist Brand Strategy PresentationThe Economist Brand Strategy Presentation
The Economist Brand Strategy PresentationGaurav Venkateswar
 
What Is the Future of Data Sharing? - Consumer Mindsets and the Power of Brands
What Is the Future of Data Sharing? - Consumer Mindsets and the Power of BrandsWhat Is the Future of Data Sharing? - Consumer Mindsets and the Power of Brands
What Is the Future of Data Sharing? - Consumer Mindsets and the Power of BrandsDavid Rogers
 

Andere mochten auch (11)

Safeguarding Media Pluralism in the EU
Safeguarding Media Pluralism in the EUSafeguarding Media Pluralism in the EU
Safeguarding Media Pluralism in the EU
 
Media future 2013 keynote - The Economist: The New Way
Media future 2013 keynote - The Economist: The New WayMedia future 2013 keynote - The Economist: The New Way
Media future 2013 keynote - The Economist: The New Way
 
Excellence, Innovation, Evaluation: Collaboration x Innovation
Excellence, Innovation, Evaluation: Collaboration x InnovationExcellence, Innovation, Evaluation: Collaboration x Innovation
Excellence, Innovation, Evaluation: Collaboration x Innovation
 
Case study 3
Case study 3Case study 3
Case study 3
 
ATEED & The Economist Group - Innovation summit
ATEED & The Economist Group - Innovation summitATEED & The Economist Group - Innovation summit
ATEED & The Economist Group - Innovation summit
 
Brand Analysis The Economist
Brand Analysis  The EconomistBrand Analysis  The Economist
Brand Analysis The Economist
 
The Economist and social media
The Economist and social mediaThe Economist and social media
The Economist and social media
 
The Economist Integrated Marketing Strategy
The Economist Integrated Marketing Strategy The Economist Integrated Marketing Strategy
The Economist Integrated Marketing Strategy
 
Economist - Final
Economist - Final Economist - Final
Economist - Final
 
The Economist Brand Strategy Presentation
The Economist Brand Strategy PresentationThe Economist Brand Strategy Presentation
The Economist Brand Strategy Presentation
 
What Is the Future of Data Sharing? - Consumer Mindsets and the Power of Brands
What Is the Future of Data Sharing? - Consumer Mindsets and the Power of BrandsWhat Is the Future of Data Sharing? - Consumer Mindsets and the Power of Brands
What Is the Future of Data Sharing? - Consumer Mindsets and the Power of Brands
 

Ähnlich wie The Economics of Data Sharing

Some Ideas on Making Research Data: "It's the Metadata, stupid!"
Some Ideas on Making Research Data: "It's the Metadata, stupid!"Some Ideas on Making Research Data: "It's the Metadata, stupid!"
Some Ideas on Making Research Data: "It's the Metadata, stupid!"Anita de Waard
 
How to overcome obstacles to data publication: Issues, requirements, and good...
How to overcome obstacles to data publication: Issues, requirements, and good...How to overcome obstacles to data publication: Issues, requirements, and good...
How to overcome obstacles to data publication: Issues, requirements, and good...ariadnenetwork
 
Open Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and SolutionsOpen Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and SolutionsMartin Donnelly
 
Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Jeroen Rombouts
 
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...hsuleslie
 
Data Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach DataData Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach Datacunera
 
Managing Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsManaging Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsRebecca Grant
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Anita de Waard
 
The Rocky Road to Reuse
The Rocky Road to ReuseThe Rocky Road to Reuse
The Rocky Road to ReuseAnita de Waard
 
Data Science BD2K Update for NIH
Data Science BD2K Update for NIH Data Science BD2K Update for NIH
Data Science BD2K Update for NIH Philip Bourne
 
RDM LIASA webinar
RDM LIASA webinarRDM LIASA webinar
RDM LIASA webinarSarah Jones
 
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...datacite
 
Research Data Management in the Humanities and Social Sciences
Research Data Management in the Humanities and Social SciencesResearch Data Management in the Humanities and Social Sciences
Research Data Management in the Humanities and Social SciencesCelia Emmelhainz
 
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH     Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH Philip Bourne
 
Data Management and Horizon 2020
Data Management and Horizon 2020Data Management and Horizon 2020
Data Management and Horizon 2020Sarah Jones
 
Research Data Management at Imperial College London
Research Data Management at Imperial College LondonResearch Data Management at Imperial College London
Research Data Management at Imperial College LondonSarah Anna Stewart
 
Data Stewardship for SPATIAL/IsoCamp 2014
Data Stewardship for SPATIAL/IsoCamp 2014Data Stewardship for SPATIAL/IsoCamp 2014
Data Stewardship for SPATIAL/IsoCamp 2014Carly Strasser
 

Ähnlich wie The Economics of Data Sharing (20)

Some Ideas on Making Research Data: "It's the Metadata, stupid!"
Some Ideas on Making Research Data: "It's the Metadata, stupid!"Some Ideas on Making Research Data: "It's the Metadata, stupid!"
Some Ideas on Making Research Data: "It's the Metadata, stupid!"
 
How to overcome obstacles to data publication: Issues, requirements, and good...
How to overcome obstacles to data publication: Issues, requirements, and good...How to overcome obstacles to data publication: Issues, requirements, and good...
How to overcome obstacles to data publication: Issues, requirements, and good...
 
Open Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and SolutionsOpen Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and Solutions
 
Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10Elag workshop sessie 1 en 2 v10
Elag workshop sessie 1 en 2 v10
 
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
 
Data Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach DataData Literacy: Creating and Managing Reserach Data
Data Literacy: Creating and Managing Reserach Data
 
Managing Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsManaging Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research Methods
 
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
Optimising Scientific Knowledge Transfer: How Collective Sensemaking Can Ena...
 
The Rocky Road to Reuse
The Rocky Road to ReuseThe Rocky Road to Reuse
The Rocky Road to Reuse
 
Data Science BD2K Update for NIH
Data Science BD2K Update for NIH Data Science BD2K Update for NIH
Data Science BD2K Update for NIH
 
Data 101: A Gentle Introduction
Data 101: A Gentle IntroductionData 101: A Gentle Introduction
Data 101: A Gentle Introduction
 
RDM LIASA webinar
RDM LIASA webinarRDM LIASA webinar
RDM LIASA webinar
 
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
2013 DataCite Summer Meeting - DOIs and Supercomputing (Terry Jones - Oak Rid...
 
DBMS
DBMSDBMS
DBMS
 
Research Data Management in the Humanities and Social Sciences
Research Data Management in the Humanities and Social SciencesResearch Data Management in the Humanities and Social Sciences
Research Data Management in the Humanities and Social Sciences
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH     Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
Ask Not What the NIH Can Do For You; Ask What You Can Do For the NIH
 
Data Management and Horizon 2020
Data Management and Horizon 2020Data Management and Horizon 2020
Data Management and Horizon 2020
 
Research Data Management at Imperial College London
Research Data Management at Imperial College LondonResearch Data Management at Imperial College London
Research Data Management at Imperial College London
 
Data Stewardship for SPATIAL/IsoCamp 2014
Data Stewardship for SPATIAL/IsoCamp 2014Data Stewardship for SPATIAL/IsoCamp 2014
Data Stewardship for SPATIAL/IsoCamp 2014
 

Mehr von Anita de Waard

Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseAnita de Waard
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Anita de Waard
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataAnita de Waard
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsAnita de Waard
 
Enabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesEnabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesAnita de Waard
 
Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Anita de Waard
 
Data, Data Everywhere: What's A Publisher to Do?
Data, Data Everywhere: What's  A Publisher to Do?Data, Data Everywhere: What's  A Publisher to Do?
Data, Data Everywhere: What's A Publisher to Do?Anita de Waard
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data ManagementAnita de Waard
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of PublishingAnita de Waard
 
Public Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingPublic Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingAnita de Waard
 
Charleston Conference 2016
Charleston Conference 2016Charleston Conference 2016
Charleston Conference 2016Anita de Waard
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...Anita de Waard
 
RDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupRDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupAnita de Waard
 
Argumentation in biology papers
Argumentation in biology papersArgumentation in biology papers
Argumentation in biology papersAnita de Waard
 
Ten Habits of Highly Effective Data
Ten Habits of Highly Effective DataTen Habits of Highly Effective Data
Ten Habits of Highly Effective DataAnita de Waard
 
Ten Habits of Highly Successful Data
Ten Habits of Highly Successful DataTen Habits of Highly Successful Data
Ten Habits of Highly Successful DataAnita de Waard
 
How to persuade with data
How to persuade with dataHow to persuade with data
How to persuade with dataAnita de Waard
 
Ten habits of highly effective data
Ten habits of highly effective dataTen habits of highly effective data
Ten habits of highly effective dataAnita de Waard
 
The habits of highly successful data:
The habits of highly successful data: The habits of highly successful data:
The habits of highly successful data: Anita de Waard
 

Mehr von Anita de Waard (20)

Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data Commons
 
Enabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesEnabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring Guidelines
 
Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.
 
Data, Data Everywhere: What's A Publisher to Do?
Data, Data Everywhere: What's  A Publisher to Do?Data, Data Everywhere: What's  A Publisher to Do?
Data, Data Everywhere: What's A Publisher to Do?
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data Management
 
History of the future
History of the futureHistory of the future
History of the future
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of Publishing
 
Public Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingPublic Identifiers in Scholarly Publishing
Public Identifiers in Scholarly Publishing
 
Charleston Conference 2016
Charleston Conference 2016Charleston Conference 2016
Charleston Conference 2016
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
 
RDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupRDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest Group
 
Argumentation in biology papers
Argumentation in biology papersArgumentation in biology papers
Argumentation in biology papers
 
Ten Habits of Highly Effective Data
Ten Habits of Highly Effective DataTen Habits of Highly Effective Data
Ten Habits of Highly Effective Data
 
Ten Habits of Highly Successful Data
Ten Habits of Highly Successful DataTen Habits of Highly Successful Data
Ten Habits of Highly Successful Data
 
How to persuade with data
How to persuade with dataHow to persuade with data
How to persuade with data
 
Ten habits of highly effective data
Ten habits of highly effective dataTen habits of highly effective data
Ten habits of highly effective data
 
The habits of highly successful data:
The habits of highly successful data: The habits of highly successful data:
The habits of highly successful data:
 

Kürzlich hochgeladen

Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...chandars293
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusNazaninKarimi6
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Silpa
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsOrtegaSyrineMay
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Silpa
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Serviceshivanisharma5244
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICEayushi9330
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxFarihaAbdulRasheed
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Servicenishacall1
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptxryanrooker
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxSuji236384
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxseri bangash
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑Damini Dixit
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 

Kürzlich hochgeladen (20)

Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.Proteomics: types, protein profiling steps etc.
Proteomics: types, protein profiling steps etc.
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICESAMASTIPUR CALL GIRL 7857803690  LOW PRICE  ESCORT SERVICE
SAMASTIPUR CALL GIRL 7857803690 LOW PRICE ESCORT SERVICE
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
9999266834 Call Girls In Noida Sector 22 (Delhi) Call Girl Service
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
High Profile 🔝 8250077686 📞 Call Girls Service in GTB Nagar🍑
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 

The Economics of Data Sharing

  • 1. | 1 Anita de Waard 0000-0002-9034-4119 VP Research Data Collaborations Elsevier RDM Services a.dewaard@elsevier.com CMMI Workshop February 6, 2016 The Economics of Data Sharing
  • 2. | 2 How do we get scientists to share their data? How do we make data repositories sustainable? • The economics of science • Cost recovery models of data repositories • Some examples that work • Some thoughts on the future. How do we create effective and sustainable ecosystems for storing, sharing and reusable data— and get people to use them?
  • 3. | 3 Debit Economy (like a pie) • Single pile of ‘stuff’ gets divided: - Thing can only be for one person at one time - “If you get more, I get less” • Examples: - Money - Jobs - Samples, equipment, space, etc. • Behaviors: - Hoarding, secrecy - (Cut-throat) competition - Winning by owning (and not sharing) Credit Economy (like a song) • Credit comes from visibility: - The more you give away, the more you benefit - “Only if I share do I really own” (“You need me to do you!” JW) • Examples: - Papers, citations - Good ideas (if credited) - Skills • Behaviors: - Open access, citation game - Collaboration with top-X - Winning by sharing (to enable priority & visibility) Two Economies of Science [1]: [1] Paula Stephan: “How Economics Shapes Science”, Harvard University Press, 2012: http://www.jstor.org/stable/j.ctt2jbqd1 <<<DATA???
  • 4. | 4 RDA IG Repository Cost Recovery • Interviewed 22 repositories, globally • Different income streams: 1. Structurally funded 2. Mostly data access charges 3. Mostly data deposit fees 4. Membership fees (for deposits and/or access) 5. Serial project funding 6. Supported by host institution • Different new models under considerations: • Sponsorships/services for the commercial sector • Contracts for specific services offered (hosting, archiving, curation) • Expanding the number of affiliated institutions • Deposit fees • More services for “national memory institutes” • Some comments: • Some countries structurally fund repositories (not US!) • Some repositories embedded in scholarly practice • Hard to come up with new models: no time, no skill sets!
  • 5. | 5 Object of Study Raw Data Processed Data Data With Paper Curated Record Method Analysis Tables/ Figures Curate Methods Software Four Types of Repositories: Research Question NOAA: 20 TB/ NASA streaming > 24 PB/day NASA Reverb: 12 PB Data NSSD: > 230 TB of digital data NSIDC: 1 PB data, : 1 PB total ALMA Telescope: 40 TB/day Local Storage/ Instrument Repositories Size: PB Nr of files: Trillions Deep Blue (Umich): 80k MIT Dspace: 75 k HAL (France): 60 k D-Space Cambr: 1.5 k Of which data: hundreds Institutional/Local Repositories Size: GB Nr of files: Billions Figshare: 1.2 M DataDryad: 3 k Dataverse: 58 k Non-Domain Repositories Size: MB Nr of files: Milliions Domain Repositories PetDB: 6 k PDB: 100 k NIST ASD: 170 k Size: kB Nr of files: 100ks Publication
  • 6. | 6 YES: • Astronomy: telescopes • High-energy physics: accelerators • Earth science: satellites • Social science: censuses • Medicine (sometimes): patient data in large studies • Life science: sequence data NO: • Low-temperature physics: cryostats • Earth science: samples • Materials science: catalysts, microscopes, etc. • Social science: interviews • Medicine: individual patient data • Neuroscience: microscope Where is data sharing happening? • Big equipment, not a single lab/person can run • Can’t do science without it • Tools in place to be effective • Small equipment, single lab/person can run • Can do science without sharing • No effective tools in place Communicate Prepare Observe Analyze Ponder
  • 7. | 7 Prepare Analyze Communicate Prepare Analyze Communicate Observations Observations Observations Identify entities from the start Connecting small science
  • 8. | 8 Prepare Analyze Communicate Prepare Analyze Communicate Observations Observations Observations Compare outcome of interactions with these entities Connecting small science
  • 9. | 9 Prepare Analyze Communicate Prepare AnalyzeCommunicate Observations Observations Observations Build a ‘virtual reagent spectrogram’ by comparing how different entities interacted in different experiments Think Reason collectively! Connecting small science
  • 10. | 10 A small change for small science: Urban Legend [2] • Encourage data sharing of raw data files + experimental metadata • Add metadata to your experiment while you’re performing it • Improved data practices made lab more productive and more creative, and enabled effective and novel collaborations • Lesson: split the data storage and curation from data sharing! - Provide direct reward to storage: now we can find our own data! - Enable simple upload to embargo’d data set when owner is ready. [2] Tripathy et al, 2014: http://www.frontiersin.org/10.3389/conf.fninf.2014.18.00077/event_abstract
  • 11. | 11 Researche r Funding AgencyInstitution Data Repository Dataset JournalPaper Addressing the fear of scooping with embargo’s: 1. Researcher creates datasets 2. Researcher writes paper & publishes in journal 3. (Sometimes,) dataset gets posted to repository 4. Researcher reports (post-hoc) to Institution and Funder 2 2 1 3 4 4
  • 12. | 12 Researche r Funding AgencyInstitution Data Repository Dataset JournalPaper 2 2 1 3 4 4 iii. No links between data and paper iv. Funders/Institutions informed as an afterthought i. Too much work for researchers ii. Data posting not mandatory Addressing the fear of scooping with embargo’s:
  • 13. | 13 Researche r Funding Agency Institution Data Repository Dataset Journal Paper 1. Researcher creates datasets and posts to repository (under embargo – not publicly viewable) 2. Funder is automatically notified of dataset posting 3. Researcher writes paper & publishes in journal; embargo is lifted and data linked - NB this also allows release of non-used data for negative result and reproducibility 4. Funder and institution get report on publication and embargo lifting 2 1 1 3 3 3 4 4 Addressing the fear of scooping with embargo’s:
  • 14. | 14 A System for Linking Data Links: Scholix • ICSU-WDS/RDA Publishing Data Service Working group, merged with National Data Service pilot • Cross-stakeholder – with input from CrossRef, DataCite, OpenAIRE, Europe PubMed Central, ANDS, PANGAEA, Thomson Reuters, Elsevier, and others • Proposed long-term architecture and interoperability framework: www.scholix.org • Operational prototype at http://dliservice.research-infrastructures.eu/#/api (including 1.4 Million links from various sources) • Making links between datasets and articles available could/should encourage data citation and deposition • Together with Force11 Data Citation Principles, encourage Research Object citation/credit metrics.
  • 15. | 15 The Commons Cloud Provider A NIH Option: Direct Funding NIH BD2K A System for A New Data Economics: NIH Data Commons Phil Bourne, Dec15 Enables Search Discovery Index Indexes Search Engines Cloud Provider B Investigator Provides credits Uses credits in the Commons User
  • 16. | 16 Drivers for Data Sharing: A Study in Behavioral Economics • Study scholarly reward systems from point of view of economics • Develop economic model for entire scholarly rewards ecosystem: career, prestige, tenure, finances, etc • Two intended outcomes: - Understanding current behavior with respect to data sharing: can we explain what we see, and the differences between different domains? - Theoretical foundation for recommendations for policies and practices to stakeholders such as funders, publishers and standards bodies • Small group working on it, planning first meeting: - Mike Huerta (NLM), Micah Altman (MIT), Fran Berman (RPI), Carol Tenopir (TN), Carole Palmer (UW), Greg Gordon (SSRN). • Thoughts, join?
  • 17. | 17 • The Economy of Science: pies vs. songs - RDA Data Repositories Cost Recovery IG: - Different types of repositories, different types of science - Need to move from ‘small’ to ‘big’ science thinking • Some examples of successful data sharing: - Online electronic lab notebooks: making it too easy not to use - RDA Scholix: linking systems of links using existing technology - The NIH Data Commons: enabling a data economy in practice • Some things we can do: - Embargo pilots: circumvent the fear of scooping - Drivers for data sharing report: science is a human endeavor In summary: cyberinfrastucture
  • 18. | 18 Thank you! Links: • https://www.hivebench.com • https://www.elsevier.com/physical-sciences/earth-and-planetary-sciences/the- 2015-international-data-rescue-award-in-the-geosciences • http://www.journals.elsevier.com/softwarex/ • https://www.elsevier.com/books-and-journals/content-innovation/data-base- linking • https://rd-alliance.org/groups/rdawds-publishing-data-services-wg.html • https://rd-alliance.org/bof-data-search.html • https://data.mendeley.com/ • https://www.elsevier.com/connect/10-aspects-of-highly-effective-research-data • https://www.force11.org/ • http://www.nationaldataservice.org/ • https://rd-alliance.org/ • https://www.elsevier.com/about/open-science/research-data Anita de Waard, a.dewaard@elsevier.com

Hinweis der Redaktion

  1. IUPAC has recommendations for what word you should use to describe a given property, but the vocabulary itself isn’t very accessible or usable itself, thus is not universally implemented. Each site decides how it wants to label a given property, which hinders indexing and reuse of the data across silos. Structured capture of information using an ELN such as Hivebench enables the researcher to report data using a consistent vocabulary without extra effort.