SlideShare a Scribd company logo
1 of 28
Publishing perspectives on data
management & future directions
Research Integrity Advisors Data management workshop
Friday 31 March
Virginia Barbour
Director, AOASG
ORCID: 0000-0002-2358-2440
ginny.barbour@qut.edu.au
My roles
Director, Australasian Open Access Strategy Group
Chair, Committee on Publication Ethics (COPE)
Editor PLOS Medicine, then Editorial Director, PLOS 2004 - 2015
Involved publishing initiatives, including AllTrials, reporting guidelines
Joint appointment between Office of Research Ethics and Integrity, and
Division of Technology, Information and Library Services, QUT
Journals’ interest in data
Background
https://commons.wikimedia.org/wiki/File:Network-mapping.gif
https://commons.wikimedia.org/wiki/File:Question_mark_1.svg
Motives
Practicalities
Background
https://commons.wikimedia.org/wiki/File:Network-mapping.gif
https://commons.wikimedia.org/wiki/File:Question_mark_1.svg
“Today, the CMS Collaboration at
CERN has released more than 300
terabytes (TB) of high-quality open
data. These include over 100 TB, or 2.5
inverse femtobarns (fb−1), of data
from proton collisions”
This is the age of data
https://www.flickr.com/photos/jerry-raia/13522426525/in/photostream/
Journals may see the
problem first, but they
are not the source of
the problem
https://www.flickr.com/photos/studiomiguel/3946174063
Data are often an
issue in ethics
cases
Cases can
be complex
Classification of COPE cases, 1997-2012.
Categories with >7 instances in a 4-year period
Increasing number of cases relevant to data
Data
• Top: over 16yr - fabrication 17%, selective/misleading
reporting/interpretation 13%;
• High: 2009-12 – unauthorized use & image manipulation
Correction of the literature
• retractions 47%, corrections 27%, expressions of concern 11%,
disputes 9%, corrigenda & errata 6%
Poor data management scuppers research:
a case study
“Dear Editor
In xxx, yyy published my colleagues’ and my article .
Since the manuscript’s publication, we have been working on other, unrelated studies
using the same database. When results in these new, unrelated studies were
implausible, I undertook an intensive, several weeks-long investigation … I found we
had failed to load 8 files of data into the dataset. This mistake resulted in the under-
reporting of xxx … this mistake occurred despite the intensive quality checks we
have in place to ensure data quality and accuracy.
We sincerely apologize for these data issues and are committed to correcting the
article…”
Poor data management leads to accusation of research misconduct:
a case study
A student submitted a paper to a journal as part of his PhD work. The research was data
heavy – it was based on digital scans of cell images.
The paper was published.
Six months later a reader noted an anomaly, asked the journal for the underlying data,
who in turn asked the author.
The PhD student had moved on. None of his data had been stored securely at his
previous institution and it could not be found. The journal felt that the lack of availability
of data meant that the paper was unreliable and asked the institution to investigate
whether misconduct had occurred.
In the investigation it turned out that the student had asked repeatedly for a place to
store his data but the university had not been able to provide one.
The university accepted responsibility and the investigation led to the development of a
policy on data management there. The student was exonerated.
Motives
https://commons.wikimedia.org/wiki/File:Network-mapping.gif
https://commons.wikimedia.org/wiki/File:Question_mark_1.svg
Institutions want data managed
Journals want data published
From: How Does the Availability of Research Data Change With Time Since Publication? Timothy H. Vines and colleagues, Abstract (podium),
Peer Review Congress, 2013
15
Do some
research
Write a narrative
description that is
inextricably linked to
the data and methods
Integrated collection
of methods, results,
data, metadata
Store all data in
accessible,
usable format,
link to publication
Facilitate re-use & replication
by people or
machines
The ideal situation
What we often have at journals
• Unextractable data
• Everything “extra” in one (unreadable) file
• Third party licenses
• Proprietary data
• No metadata
17
Data availability in research papers allows
Replication
Validation
New analysis
Better interpretation
Inclusion in meta-analyses
Facilitation of reproducibility of research
Closer scrutiny of published work
Better ‘bang for the buck’ out of research investment
Practicalities
https://commons.wikimedia.org/wiki/File:Network-mapping.gif
https://commons.wikimedia.org/wiki/File:Question_mark_1.svg
“The evidence shows that the current research data policy ecosystem is
in critical need of standardization and harmonization”
How many journals have a research data policy?
52.4
23.2
23.2
All Journals
64.8
14.4
18.4
Science Journals
40
32
28
Social Science
Journals
Full Policy Partial Policy No Policy
Data source: Linda Naughton, JISC Journal Research Data Policy Bank project presentation (n = 250)
Iain Hrynaszkiewicz
Different levels of openness in research data publishing:
1. Accessible only to an individual researcher/group
2. Accessible to others on (reasonable) request
3. Published as electronic supplementary material
4. Deposited in a general or institutional data repository (e.g.
figshare)
5. Deposited in a subject/community specific data repository
Not all research data are Open Data
More open
Wiley data sharing survey
2886 responses (3.2% response rate) – 52% had shared/published data
Data publishing
• 67% via supplementary material in journals
• 28% via an institutional repository
• 19% use a discipline-specific data repository
• 6% use a general-purpose repository, such as Dryad or figshare
Data sharing (informal)
• 57% sharing at a conference
• 42% sharing on request via email, direct contact, etc.
• 37% via personal, institutional, or project website
Are researchers sharing research data?
Slide from Iain Hrynaszkiewicz
Data management is largely
regarded by academics as:
• Boring
• Waste of time
• Expensive
• Hard
• Confusing
They need to be persuaded that it is:
• Boring
• Waste of time
• Expensive
• Hard
• Confusing
• Part of the job
• Time saving
• Cost effective
• Easy
• Rewarded
• Content types e.g. data articles and journals
• Credit and incentives e.g. data citation and data articles
• Encouraging reuse e.g. open licenses
• Improving data quality e.g. data peer review, community standards and
repositories
• Data discoverability e.g. repository partnerships, linking, integration with
submission systems and research data metadata
• Raising awareness e.g. editorials, outreach
• Guidance e.g. information for authors
• Policy – and its implementation
What are publishers doing about it?
Iain Hrynaszkiewicz
Journal data policy landscape
• Nothing stated
• Data sharing encouraged
• Data sharing implied as a condition of submission/publication with mandates for specific data
types (eg Nature pre -2016)
• Mandated data availability statements in every paper and mandates for specific data types
(Royal Society, BioMed Central, Palgrave Communications, Nature 2016 – )
• Mandated data sharing for all, with exceptions, with statement in paper (PLOS, BMJ)
• Mandated data sharing for all with statement & link to data (e.g. American Economics Rev)
• Mandated open data and data citation as a condition of submission (e.g. F1000Research) STRONGER
Adapted from Iain Hrynaszkiewicz
“PLOS journals require
authors to make
all data underlying the
findings described in
their manuscript fully
available without
restriction, with rare
exception”
References
• Naughton, L. & Kernohan, D., (2016). Making sense of journal research data policies. Insights. 29(1),
pp.84–89. DOI: http://doi.org/10.1629/uksg.284
• Lin J, Strasser C (2014) Recommendations for the Role of Publishers in Access to Data. PLoS Biol 12(10):
e1001975. doi:10.1371/journal.pbio.1001975
• Hrynaszkiewicz I, Li P, Edmunds SC. Open science and the role of publishers in reproducible research. In:
Stodden V, Leisch F, Peng, RD, editors. Implementing Reproducible Research. CRC Press; 2014. Public
(https://osf.io/35s9d/)
• https://scholarlykitchen.files.wordpress.com/2014/11/researcher-data-insights-infographic-final.pdf

More Related Content

What's hot

Jeff Haywood - Research Integrity: Institutional Responsibility
Jeff Haywood - Research Integrity: Institutional ResponsibilityJeff Haywood - Research Integrity: Institutional Responsibility
Jeff Haywood - Research Integrity: Institutional Responsibility
Jisc
 
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-researchUc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
University of California Curation Center
 
Re tooling for data management-support
Re tooling for data management-supportRe tooling for data management-support
Re tooling for data management-support
Sherry Lake
 
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...
OAbooks
 

What's hot (20)

Tijerina-RDA-NISO-Task Groups-sept11
Tijerina-RDA-NISO-Task Groups-sept11Tijerina-RDA-NISO-Task Groups-sept11
Tijerina-RDA-NISO-Task Groups-sept11
 
RDAP14: Collaboration and tension between institutions and units providing da...
RDAP14: Collaboration and tension between institutions and units providing da...RDAP14: Collaboration and tension between institutions and units providing da...
RDAP14: Collaboration and tension between institutions and units providing da...
 
RDAP14: It’s a Real World: Developing Preservation Policy for Dryad
RDAP14: It’s a Real World: Developing Preservation Policy for DryadRDAP14: It’s a Real World: Developing Preservation Policy for Dryad
RDAP14: It’s a Real World: Developing Preservation Policy for Dryad
 
UWA Research Week 2016
UWA Research Week 2016UWA Research Week 2016
UWA Research Week 2016
 
Jeff Haywood - Research Integrity: Institutional Responsibility
Jeff Haywood - Research Integrity: Institutional ResponsibilityJeff Haywood - Research Integrity: Institutional Responsibility
Jeff Haywood - Research Integrity: Institutional Responsibility
 
Borgman - Privacy, Policy and Data Governance in the University
Borgman - Privacy, Policy and Data Governance in the UniversityBorgman - Privacy, Policy and Data Governance in the University
Borgman - Privacy, Policy and Data Governance in the University
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Stephenson - Data Curation for Quantitative Social Science Research
Stephenson - Data Curation for Quantitative Social Science ResearchStephenson - Data Curation for Quantitative Social Science Research
Stephenson - Data Curation for Quantitative Social Science Research
 
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-researchUc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
Uc3 pasig-asis&t-2013-08-20-support-of-data-intensive-research
 
RDAP14: OSTP Panel NIH’s Update Public Access
RDAP14: OSTP Panel NIH’s Update Public Access RDAP14: OSTP Panel NIH’s Update Public Access
RDAP14: OSTP Panel NIH’s Update Public Access
 
Research Data Management Services at UWA (November 2015)
Research Data Management Services at UWA (November 2015)Research Data Management Services at UWA (November 2015)
Research Data Management Services at UWA (November 2015)
 
Introduction to Research Data Management at UWA
Introduction to Research Data Management at UWAIntroduction to Research Data Management at UWA
Introduction to Research Data Management at UWA
 
Re tooling for data management-support
Re tooling for data management-supportRe tooling for data management-support
Re tooling for data management-support
 
Va sla nov 15 final
Va sla nov 15 finalVa sla nov 15 final
Va sla nov 15 final
 
No more waiting! Tools that work Today to reveal dataset use
No more waiting!  Tools that work Today to reveal dataset useNo more waiting!  Tools that work Today to reveal dataset use
No more waiting! Tools that work Today to reveal dataset use
 
RDAP13 Elizabeth Moss: The impact of data reuse
RDAP13 Elizabeth Moss: The impact of data reuseRDAP13 Elizabeth Moss: The impact of data reuse
RDAP13 Elizabeth Moss: The impact of data reuse
 
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...
Strand 1: Connecting research and researchers: An introduction to ORCID by Ed...
 
RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot ProjectRDAP 15 Local ICPSR Data Curation Workshop Pilot Project
RDAP 15 Local ICPSR Data Curation Workshop Pilot Project
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Research Data Management in practice
Research Data Management in practiceResearch Data Management in practice
Research Data Management in practice
 

Similar to Publishing perspectives on data management & future directions

Journal Data Sharing Policies rscd2018
Journal Data Sharing Policies rscd2018Journal Data Sharing Policies rscd2018
Journal Data Sharing Policies rscd2018
SusanMRob
 
Library resources and services for grant development
Library resources and services for grant developmentLibrary resources and services for grant development
Library resources and services for grant development
rds-wayne-edu
 

Similar to Publishing perspectives on data management & future directions (20)

Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014
 
Fsci 2018 monday30_july_am6
Fsci 2018 monday30_july_am6Fsci 2018 monday30_july_am6
Fsci 2018 monday30_july_am6
 
Rscd 2018 Journal policies - natasha simons
Rscd 2018 Journal policies - natasha simonsRscd 2018 Journal policies - natasha simons
Rscd 2018 Journal policies - natasha simons
 
Journal Data Sharing Policies rscd2018
Journal Data Sharing Policies rscd2018Journal Data Sharing Policies rscd2018
Journal Data Sharing Policies rscd2018
 
Open Science Incentives/Veerle van den Eynden
Open Science Incentives/Veerle van den EyndenOpen Science Incentives/Veerle van den Eynden
Open Science Incentives/Veerle van den Eynden
 
Research data: publishers, policies and patient privacy
Research data: publishers, policies and patient privacyResearch data: publishers, policies and patient privacy
Research data: publishers, policies and patient privacy
 
ACRL STS Liaisons Forum - AIBS
ACRL STS Liaisons Forum - AIBSACRL STS Liaisons Forum - AIBS
ACRL STS Liaisons Forum - AIBS
 
Library resources and services for grant development
Library resources and services for grant developmentLibrary resources and services for grant development
Library resources and services for grant development
 
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
Research Data Sharing and Re-Use: Practical Implications for Data Citation Pr...
 
Research Integrity Advisor and Data Management
Research Integrity Advisor and Data ManagementResearch Integrity Advisor and Data Management
Research Integrity Advisor and Data Management
 
2013 DataCite Summer Meeting - Closing Keynote: Building Community Engagement...
2013 DataCite Summer Meeting - Closing Keynote: Building Community Engagement...2013 DataCite Summer Meeting - Closing Keynote: Building Community Engagement...
2013 DataCite Summer Meeting - Closing Keynote: Building Community Engagement...
 
Gaining credit for sharing research data
Gaining credit for sharing research dataGaining credit for sharing research data
Gaining credit for sharing research data
 
Linking Data to Publications through Citation and Virtual Archives
Linking Data to Publications through Citation and Virtual ArchivesLinking Data to Publications through Citation and Virtual Archives
Linking Data to Publications through Citation and Virtual Archives
 
Next generation data services at the Marriott Library
Next generation data services at the Marriott LibraryNext generation data services at the Marriott Library
Next generation data services at the Marriott Library
 
Data publication and Citation for CLIR postdoc seminar
Data publication and Citation for CLIR postdoc seminarData publication and Citation for CLIR postdoc seminar
Data publication and Citation for CLIR postdoc seminar
 
NESCent visit: Measuring progress toward a cultural norm of shared (and reus...
NESCent visit:  Measuring progress toward a cultural norm of shared (and reus...NESCent visit:  Measuring progress toward a cultural norm of shared (and reus...
NESCent visit: Measuring progress toward a cultural norm of shared (and reus...
 
Managing, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital EnvironmentManaging, Sharing and Curating Your Research Data in a Digital Environment
Managing, Sharing and Curating Your Research Data in a Digital Environment
 
NIH Data Sharing Plan Workshop - Handout
NIH Data Sharing Plan Workshop - HandoutNIH Data Sharing Plan Workshop - Handout
NIH Data Sharing Plan Workshop - Handout
 
So, what's it all about then? Why we share research data
So, what's it all about then? Why we share research dataSo, what's it all about then? Why we share research data
So, what's it all about then? Why we share research data
 
DataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data Sharing
 

More from ARDC

More from ARDC (20)

Introduction to ADA
Introduction to ADAIntroduction to ADA
Introduction to ADA
 
Architecture and Standards
Architecture and StandardsArchitecture and Standards
Architecture and Standards
 
Data Sharing and Release Legislation
Data Sharing and Release Legislation   Data Sharing and Release Legislation
Data Sharing and Release Legislation
 
Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)Australian Dementia Network (ADNet)
Australian Dementia Network (ADNet)
 
Investigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspectiveInvestigator-initiated clinical trials: a community perspective
Investigator-initiated clinical trials: a community perspective
 
NCRIS and the health domain
NCRIS and the health domainNCRIS and the health domain
NCRIS and the health domain
 
International perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataInternational perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research data
 
Clinical trials data sharing
Clinical trials data sharingClinical trials data sharing
Clinical trials data sharing
 
Clinical trials and cohort studies
Clinical trials and cohort studiesClinical trials and cohort studies
Clinical trials and cohort studies
 
Introduction to vision and scope
Introduction to vision and scopeIntroduction to vision and scope
Introduction to vision and scope
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things data
 
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian DuncanARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
ARDC 2018 state engagements - Nov-Dec 2018 - Slides - Ian Duncan
 
Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128Skilling-up-in-research-data-management-20181128
Skilling-up-in-research-data-management-20181128
 
Research data management and sharing of medical data
Research data management and sharing of medical dataResearch data management and sharing of medical data
Research data management and sharing of medical data
 
Findable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataFindable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) data
 
Applying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and ChallengesApplying FAIR principles to linked datasets: Opportunities and Challenges
Applying FAIR principles to linked datasets: Opportunities and Challenges
 
How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018How to make your data count webinar, 26 Nov 2018
How to make your data count webinar, 26 Nov 2018
 
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global SprintReady, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
Ready, Set, Go! Join the Top 10 FAIR Data Things Global Sprint
 
How FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of dataHow FAIR is your data? Copyright, licensing and reuse of data
How FAIR is your data? Copyright, licensing and reuse of data
 
Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018Peter neish DMPs BoF eResearch 2018
Peter neish DMPs BoF eResearch 2018
 

Recently uploaded

Recently uploaded (20)

FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
How to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptxHow to setup Pycharm environment for Odoo 17.pptx
How to setup Pycharm environment for Odoo 17.pptx
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptxOn_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
On_Translating_a_Tamil_Poem_by_A_K_Ramanujan.pptx
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 

Publishing perspectives on data management & future directions

  • 1. Publishing perspectives on data management & future directions Research Integrity Advisors Data management workshop Friday 31 March Virginia Barbour Director, AOASG ORCID: 0000-0002-2358-2440 ginny.barbour@qut.edu.au
  • 2. My roles Director, Australasian Open Access Strategy Group Chair, Committee on Publication Ethics (COPE) Editor PLOS Medicine, then Editorial Director, PLOS 2004 - 2015 Involved publishing initiatives, including AllTrials, reporting guidelines Joint appointment between Office of Research Ethics and Integrity, and Division of Technology, Information and Library Services, QUT
  • 3. Journals’ interest in data Background https://commons.wikimedia.org/wiki/File:Network-mapping.gif https://commons.wikimedia.org/wiki/File:Question_mark_1.svg Motives Practicalities
  • 5. “Today, the CMS Collaboration at CERN has released more than 300 terabytes (TB) of high-quality open data. These include over 100 TB, or 2.5 inverse femtobarns (fb−1), of data from proton collisions” This is the age of data https://www.flickr.com/photos/jerry-raia/13522426525/in/photostream/
  • 6. Journals may see the problem first, but they are not the source of the problem https://www.flickr.com/photos/studiomiguel/3946174063
  • 7. Data are often an issue in ethics cases
  • 9. Classification of COPE cases, 1997-2012. Categories with >7 instances in a 4-year period
  • 10. Increasing number of cases relevant to data Data • Top: over 16yr - fabrication 17%, selective/misleading reporting/interpretation 13%; • High: 2009-12 – unauthorized use & image manipulation Correction of the literature • retractions 47%, corrections 27%, expressions of concern 11%, disputes 9%, corrigenda & errata 6%
  • 11. Poor data management scuppers research: a case study “Dear Editor In xxx, yyy published my colleagues’ and my article . Since the manuscript’s publication, we have been working on other, unrelated studies using the same database. When results in these new, unrelated studies were implausible, I undertook an intensive, several weeks-long investigation … I found we had failed to load 8 files of data into the dataset. This mistake resulted in the under- reporting of xxx … this mistake occurred despite the intensive quality checks we have in place to ensure data quality and accuracy. We sincerely apologize for these data issues and are committed to correcting the article…”
  • 12. Poor data management leads to accusation of research misconduct: a case study A student submitted a paper to a journal as part of his PhD work. The research was data heavy – it was based on digital scans of cell images. The paper was published. Six months later a reader noted an anomaly, asked the journal for the underlying data, who in turn asked the author. The PhD student had moved on. None of his data had been stored securely at his previous institution and it could not be found. The journal felt that the lack of availability of data meant that the paper was unreliable and asked the institution to investigate whether misconduct had occurred. In the investigation it turned out that the student had asked repeatedly for a place to store his data but the university had not been able to provide one. The university accepted responsibility and the investigation led to the development of a policy on data management there. The student was exonerated.
  • 14. Institutions want data managed Journals want data published
  • 15. From: How Does the Availability of Research Data Change With Time Since Publication? Timothy H. Vines and colleagues, Abstract (podium), Peer Review Congress, 2013 15
  • 16. Do some research Write a narrative description that is inextricably linked to the data and methods Integrated collection of methods, results, data, metadata Store all data in accessible, usable format, link to publication Facilitate re-use & replication by people or machines The ideal situation
  • 17. What we often have at journals • Unextractable data • Everything “extra” in one (unreadable) file • Third party licenses • Proprietary data • No metadata 17
  • 18. Data availability in research papers allows Replication Validation New analysis Better interpretation Inclusion in meta-analyses Facilitation of reproducibility of research Closer scrutiny of published work Better ‘bang for the buck’ out of research investment
  • 20. “The evidence shows that the current research data policy ecosystem is in critical need of standardization and harmonization” How many journals have a research data policy? 52.4 23.2 23.2 All Journals 64.8 14.4 18.4 Science Journals 40 32 28 Social Science Journals Full Policy Partial Policy No Policy Data source: Linda Naughton, JISC Journal Research Data Policy Bank project presentation (n = 250) Iain Hrynaszkiewicz
  • 21. Different levels of openness in research data publishing: 1. Accessible only to an individual researcher/group 2. Accessible to others on (reasonable) request 3. Published as electronic supplementary material 4. Deposited in a general or institutional data repository (e.g. figshare) 5. Deposited in a subject/community specific data repository Not all research data are Open Data More open
  • 22. Wiley data sharing survey 2886 responses (3.2% response rate) – 52% had shared/published data Data publishing • 67% via supplementary material in journals • 28% via an institutional repository • 19% use a discipline-specific data repository • 6% use a general-purpose repository, such as Dryad or figshare Data sharing (informal) • 57% sharing at a conference • 42% sharing on request via email, direct contact, etc. • 37% via personal, institutional, or project website Are researchers sharing research data? Slide from Iain Hrynaszkiewicz
  • 23. Data management is largely regarded by academics as: • Boring • Waste of time • Expensive • Hard • Confusing
  • 24. They need to be persuaded that it is: • Boring • Waste of time • Expensive • Hard • Confusing • Part of the job • Time saving • Cost effective • Easy • Rewarded
  • 25. • Content types e.g. data articles and journals • Credit and incentives e.g. data citation and data articles • Encouraging reuse e.g. open licenses • Improving data quality e.g. data peer review, community standards and repositories • Data discoverability e.g. repository partnerships, linking, integration with submission systems and research data metadata • Raising awareness e.g. editorials, outreach • Guidance e.g. information for authors • Policy – and its implementation What are publishers doing about it? Iain Hrynaszkiewicz
  • 26. Journal data policy landscape • Nothing stated • Data sharing encouraged • Data sharing implied as a condition of submission/publication with mandates for specific data types (eg Nature pre -2016) • Mandated data availability statements in every paper and mandates for specific data types (Royal Society, BioMed Central, Palgrave Communications, Nature 2016 – ) • Mandated data sharing for all, with exceptions, with statement in paper (PLOS, BMJ) • Mandated data sharing for all with statement & link to data (e.g. American Economics Rev) • Mandated open data and data citation as a condition of submission (e.g. F1000Research) STRONGER Adapted from Iain Hrynaszkiewicz
  • 27. “PLOS journals require authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception”
  • 28. References • Naughton, L. & Kernohan, D., (2016). Making sense of journal research data policies. Insights. 29(1), pp.84–89. DOI: http://doi.org/10.1629/uksg.284 • Lin J, Strasser C (2014) Recommendations for the Role of Publishers in Access to Data. PLoS Biol 12(10): e1001975. doi:10.1371/journal.pbio.1001975 • Hrynaszkiewicz I, Li P, Edmunds SC. Open science and the role of publishers in reproducible research. In: Stodden V, Leisch F, Peng, RD, editors. Implementing Reproducible Research. CRC Press; 2014. Public (https://osf.io/35s9d/) • https://scholarlykitchen.files.wordpress.com/2014/11/researcher-data-insights-infographic-final.pdf