Presentation slides from a talk by Gareth Knight which discussed the need to consider data sharing activities in academic citizenship, different approaches that may be taken to publish data associated with publications, and the opportunities presented by data journals
Scaling API-first – The story of a global engineering organization
Data Journals and repositories: Getting academic credit for data sharing
1. DATA JOURNALS AND REPOSITORIES:
Getting academic credit for data sharing
Gareth Knight
LSHTM Research Data Management Service
gareth.knight@lshtm.ac.uk
Open Access Week
27 October 2016
2. Agenda
1. Publish data as supplemental material
2. Submit data to a digital repository
3. Publish a Data Paper
3. Motivations
Enable new research
Higher rate of citation More outputs that can be cited
Ensure validity & replicability
Piwowar HA, Vision TJ. (2013)
Data reuse and the open data citation
advantage https://doi.org/10.7717/peerj.175
4. 1. Publish as supplemental material
PLOS One + FigshareBioMed Central
6. Data Deposit Process
1
Prepare your data
o Select data that underpins
research findings
o Remove personal,
sensitive, and
superfluous details
o Prepare documentation
and other support files
2
Create metadata record
o Create a metadata record
and enter title, description,
names, and other details.
o Specify access status and
licence for each file
o Identify corresponding
author
3
Update article
o Obtain Digital Object
Identifier
o Add to appropriate location
in article
7. OPEN DATA
Anonymised data that can be made available
for open access
• Journals encourage use of licences that
allow widest access and use
SAFEGUARDED DATA
Data with personal & sensitive information that
can’t be removed requires extra protection
• Provide method of contact
• Access request must be reviewed by multiple
people, not just creator
• Use Data Transfer Agreement which
establishes permitted & non-permitted uses
Data access and use
Data should be as open as possible, but as closed as necessary
8. Data Citation Practices
Reference List Data Access Statement
Some journals require datasets to be
included in the reference list:
Mandatory:
• Author/creator
• Title,
• Publication date
• Publisher: a repository, institution
• DOI/URL or other access information
Optional:
• Resource Type (dataset, survey)
• Version
A section on 1st page of a journal article,
typically next to funding information
If data is openly available:
• the repository name
• the DOI/URL of the data record
• Any licence conditions (e.g. CC-BY)
If data is restricted:
• a DOI/URL that describes data
• reason for access constraints
• how it may be requested
• Brief details of access conditions
9. Data Access Statements
Open data:
“All data supporting this paper are openly available from the LSHTM data repository at
http://dx.doi.org/10.17037/DATA.1234”
Safeguarded data:
“Due to the (ethically/politically/commercially) sensitive nature of the research, interview
transcripts will be made available to bona fide researchers, on condition that they provide evidence
of ethics approval and sign a Data Transfer Agreement. Further information on the data and access
conditions can be found at http://dx.doi.org/10.17037/DATA.120”
Third party data:
"This study used secondary data obtained from third parties. Quarterly labour force data can be
accessed by visiting http://dx.doi.org/10.5255/UKDA-SN-7037-5 and registering with the UK Data
Service. Traffic pollution information was provided by Brighton County Council and may be requested
by contacting their traffic department at [email address]. Additional details on how these datasets
were processed are openly available at LSHTM data repository at
http://dx.doi.org/10.17037/DATA.120”
10. Publish a Data Paper
Purpose:
1. Provides a citable journal publication that provides scholarly credit to data
creator and others involved in process
2. Ensures data is documented to a minimum standard that considers data
reuse
3. It improve awareness of the data within the scholarly community
a peer reviewed publication which contains facts about one or more datasets
Content:
• Describes the data itself, not hypotheses or arguments built on the data
• Published in XML & PDF format (with additional metadata in ISA-Tab)
• Authors retain copyright and grant first publication right
to journal – encourage
12. Pure & hybrid data journals
https://www.wiki.ed.ac.uk/display/datashare/Sources+of+dataset+peer+review
13. Pure & hybrid data journals
https://www.wiki.ed.ac.uk/display/datashare/Sources+of+dataset+peer+review
Peer review Criteria
• Rigour of procedures described in paper
• Level of detail provided in data description
Data must be:
• hosted in a DOI-capable repository
• publicly available without restriction,
excepting reasonable controls related to
human privacy issues or public safety
• Ideally made available in an open format
and open licence (e.g.CC-BY)
14. Open Health Data: Data Papers
1. Overview:
• Title, paper author details, abstract, keywords, introduction/Study
Description
2. Context:
• Spatial coverage, temporal coverage, species
3. Methods:
• Steps, sampling strategy, quality control, constraints, privacy, ethics
4. Dataset description:
• Dataset creators, object name, data type, ontologies, file format,
creation/publication dates, dataset creators, language, licence,
repository location
5. Reuse potential
• Funding statement, references
APC:
£100http://openhealthdata.metajnl.com
/about/submissions/
15. Scientific Data: Data Descriptor
• Title
• Authors
• Affiliations
• Abstract
• Background & Summary
• Methods (capture, processing, etc.)
• Code availability
• Data Records - description
• Technical Validation
• Usage Notes
• Author contributions
• References
• Data Citationshttp://www.nature.com/sdata/publish/
submission-guidelines#templates
APC:
£890 + optional
data storage
costs