SlideShare ist ein Scribd-Unternehmen logo
1 von 38
Getting to grips with
Research Data Management
10th
November 2015
Isabel Chadwick,
Research Data Librarian
library-research-support@open.ac.uk
Overview of the workshop
• What is Research Data Management?
• Sharing data
• Working with data
• Planning for data
• Useful resources
• Questions?
What is Research Data Management?
“Research data management concerns the
organisation of data, from its entry to the research
cycle through to the dissemination and archiving of
valuable results. It aims to ensure reliable
verification of results, and permits new and
innovative research built on existing information."
Digital Curation Centre (2011)
Making the Case for Research Data Management
http://www.dcc.ac.uk/sites/default/files/documents/publications/Making%20the%20case.pdf
What is Research Data Management?
Discussion
• Describe your research
• What type of data do you create/use?
• What data management challenges do you face?
What is Research Data Management?
UK Data Archive Data Lifecycle model
http://www.data-archive.ac.uk/create-manage/life-cycle
Design research
Plan data
management
Plan consent for
sharing
Locate existing data
Collect data
Capture and create
metadata
Creating data
What is Research Data Management?
UK Data Archive Data Lifecycle model
http://www.data-archive.ac.uk/create-manage/life-cycle
Enter data, digitise,
transcribe, translate
Check, validate,
clean data
Anonymise data
Describe data
Manage and store
data
Processing data
What is Research Data Management?
UK Data Archive Data Lifecycle model
http://www.data-archive.ac.uk/create-manage/life-cycle
Interpret data
Derive data
Produce research
outputs
Author publications
Prepare data for
publications
Analysing data
What is Research Data Management?
UK Data Archive Data Lifecycle model
http://www.data-archive.ac.uk/create-manage/life-cycle
Migrate data to best
format
Migrate data to
suitable medium
Back-up and store
data
Create metadata
and documentation
Archive data
Preserving data
What is Research Data Management?
UK Data Archive Data Lifecycle model
http://www.data-archive.ac.uk/create-manage/life-cycle
Distribute data
Share data
Control access
Establish copyright
Assign licences
Promote data
Giving access to data
What is Research Data Management?
UK Data Archive Data Lifecycle model
http://www.data-archive.ac.uk/create-manage/life-cycle
Follow-up research
New research
Undertake research
reviews
Scrutinise findings
Teach and learn
Re-using data
What is Research Data Management?
Why spend time and effort on this?
• So you can work efficiently and
effectively
–Save time and reduce frustration
–Highlight patterns or connections
that might otherwise be missed
• Because your data is precious
• To enable data re-use and sharing
• To meet funders’ and institutional
requirements
What is Research Data Management?
What does the OU expect?
“Research data must be managed to the highest
standards throughout their life-cycle in order to
support excellence in research practice.
In keeping with OU principles of open-ness, it is
expected that research data will be open and
accessible to other researchers, as soon as
appropriate and verifiable, subject to the
application of appropriate safeguards relating to
the sensitivity of the data and legal
requirements.”
OU Principles of Research Data Management, April 2013
http://intranet.open.ac.uk/research-school/strategy-info-governance/docs/CoPamendedJuly
What is Research Data Management?
What do funders expect?
“Publicly funded research data are a public good,
produced in the public interest, which should be
made openly available with as few restrictions as
possible in a timely and responsible manner that
does not harm intellectual property.”
RCUK Common Principles on Research Data Policy, 2011
http://www.rcuk.ac.uk/research/datapolicy/
What is Research Data Management?
What do funders expect?
http://www.dcc.ac.uk/resources/policy-and-legal/overview-funders-data-policies
Sharing data
Benefits of sharing data
Sharing data
Benefits of sharing data (2)
Sharing data
Benefits of sharing data (3)
Sharing data
What do you need to share?
• Raw data
• Derived data
• Data underpinning
publications
• Code
• Methods
What are research data in your context?
What would others need to understand your research?
Sharing data
Barriers to sharing data: discussion
Discuss barriers to sharing
your research data.
These could be:
•Ethical
•Legal
•Professional
Can these barriers be
overcome?
Sharing data
How can I share my data?
OU Data Catalogue in ORO
Data access statements
Online data sharing services
•Figshare
•Zenodo
•CKAN DataHub
•Mendeley Data
Directories
•re3data
Funders’ repository services
•UK Data Service ReShare
•NERC data centres
Working with data
“Start as you mean to go on”
The end point of all projects should
involve making the data publicly
available. Many data will be
deposited in national archives which
have regulations for files and
metadata.
Thinking about the requirements at
the beginning of the project will limit
the transformations needed at the
end of the project.
Data Sharing
• Shared areas or SharePoint
• Zendto
• Be wary of Dropbox & similar
• OU collaboration tool in pipeline
IT support for researchers:
http://intranet6.open.ac.uk/library/main/supporting-ou-research/re
Working with data
External collaborators: IT Options
Working with data
Filing systems
Filing is more than saving files, it’s making
sure you can find them later in your project
•Naming
•Directory Structure
•File Types
•Versioning
All these help to keep your data safe and
accessible.
Decide on a file naming convention at the start of your project. Useful file
names are:
•consistent.
•meaningful to you and your colleagues.
•allow you to find the file easily.
Agree on the following elements of a file name:
•Vocabulary
•Punctuation
•Dates (YYYY-MM-DD)
•Order
•Numbers
•Version information
Ideally you should be able to tell what’s in a file before opening it.
Tip: create a readme file detailing the naming scheme.
Working with data
Naming conventions
Working with data
File formats
• Unencrypted
• Uncompressed
• Non-proprietary/patent-encumbered
• Open, documented standard
• Standard representation (ASCII, Unicode)
Type Recommended Avoid for data sharing
Tabular data CSV, TSV, SPSS portable Excel
Text Plain text, HTML, RTF
PDF/A only if layout matters
Word
Media Container: MP4, Ogg
Codec: Theora, Dirac, FLAC
Quicktime
H264
Images TIFF, JPEG2000, PNG GIF, JPG
Structured data XML, RDF RDBMS
Further examples: http://www.data-archive.ac.uk/create-manage/format/formats-table
Working with data
Metadata & documentation
• Metadata is additional information that is required to
make sense of your files – it’s data about data.
Guidance on disciplinary metadata standards:
http://www.dcc.ac.uk/resources/metadata-standards
Working with data
Metadata & documentation (2)
Think FAIR!
Findable
Accessible
Interoperable
Re-usable
Data FAIRport initiative: http://datafairport.org/
Working with data
Sensitive data
When working with research participants....
•Ensure you have obtained valid consent
•Consider who needs access to the data
•Inform your participants what will happen with the data after
the project has finished
•Pre-planning and agreeing with participants during the
consent process, on what may and may not be recorded or
transcribed, can be more effective than anonymisation
•Consider controlling access if anonymisation or consent for
sharing are impossible
Working with data
Sensitive data (2)
Managing sensitive data
•If possible, collect the necessary data without using
personally identifying information
•De-identify your data upon collection or as soon as
possible thereafter
•Avoid transmitting unencrypted personal data
electronically
•Consider whether you need to keep original collection
instruments (recordings, surveys etc.) once they have
been transcribed and quality assured
Planning for data
• Make informed decisions to anticipate
and avoid problems
• Avoid duplication, data loss and
security breaches
• Develop procedures early on for
consistency
• Ensure data are accurate, complete,
reliable and secure
• Save time and effort – make your life
easier!
Data Management Plans are useful
whenever you are creating data to:
Planning for data
Which funders require a DMP?
www.dcc.ac.uk/resources/policy-and-legal/ overview-funders-data-policies
Note: Data Management Plans are a requirement of
Horizon 2020 projects included in the Research Data pilot
Planning for data
Activity
Think about your own
research.
What actions would you
need to perform on your
data at each stage of the
UKDA’s Lifecycle model?
How would you do this?
Would you need any
additional funding/staff?
Planning for data
DMPOnline
https://dmponline.dcc.ac.uk
A web-based tool to help you
write DMPs according to
different requirements. DCC,
funder and OU guidance.
Planning for data
Tips
• Keep it simple, short and specific
• Seek advice - consult and
collaborate
• Base plans on available skills and
support
• Make sure implementation is
feasible
• Justify any resources or
restrictions needed
Library Services
How we can help
• Data Management Plan checking
• Support with setting up new projects
• Advice on preparation of data for sharing
• Data catalogue on ORO
• Online guidance
• Enquiries
• Development of new tools to enable data management
and sharing
Email: library-research-
support@open.ac.uk
Useful links
• The OU Research Data Management intranet site:
http://intranet6.open.ac.uk/library/main/supporting-ou-research/research-
data-management
• Digital Curation Centre: http://www.dcc.ac.uk/
• DMPOnline: https://dmponline.dcc.ac.uk/
• UK Data Archive: http://www.data-archive.ac.uk/
• MANTRA: http://datalib.edina.ac.uk/mantra/
• The Orb: http://open.ac.uk/blogs/the_orb
Questions?
Image credits
Unless otherwise stated, all images are by
Jørgen Stamp at http://www.digitalbevaring.dk

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Practical Strategies for Research Data Management
Practical Strategies for Research Data ManagementPractical Strategies for Research Data Management
Practical Strategies for Research Data Management
 
Working with Research Data
Working with Research DataWorking with Research Data
Working with Research Data
 
Working with Research Data, 21/05/20
Working with Research Data, 21/05/20Working with Research Data, 21/05/20
Working with Research Data, 21/05/20
 
Planning for Research Data Managment
Planning for Research Data ManagmentPlanning for Research Data Managment
Planning for Research Data Managment
 
Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
 
OU Library Research Support webinar: Working with research data
OU Library Research Support webinar: Working with research dataOU Library Research Support webinar: Working with research data
OU Library Research Support webinar: Working with research data
 
Data sharing: Legal and ethical issues
Data sharing: Legal and ethical issuesData sharing: Legal and ethical issues
Data sharing: Legal and ethical issues
 
Data sharing: How, what and why?
Data sharing: How, what and why?Data sharing: How, what and why?
Data sharing: How, what and why?
 
OU Library Research Support webinar: Data sharing
OU Library Research Support webinar: Data sharingOU Library Research Support webinar: Data sharing
OU Library Research Support webinar: Data sharing
 
Working with Research Data 17th October 2019
Working with Research Data 17th October 2019Working with Research Data 17th October 2019
Working with Research Data 17th October 2019
 
Writing a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPToolWriting a successful data management plan with the DMPTool
Writing a successful data management plan with the DMPTool
 
Managing your research data
Managing your research dataManaging your research data
Managing your research data
 
RDM and DMP intro
RDM and DMP introRDM and DMP intro
RDM and DMP intro
 
Ands ttt2 perth_accelerate your data skills training_ top tips for topics and...
Ands ttt2 perth_accelerate your data skills training_ top tips for topics and...Ands ttt2 perth_accelerate your data skills training_ top tips for topics and...
Ands ttt2 perth_accelerate your data skills training_ top tips for topics and...
 
Writing a Research Data Management Plan - 2016-11-09 - University of Oxford
Writing a Research Data Management Plan - 2016-11-09 - University of OxfordWriting a Research Data Management Plan - 2016-11-09 - University of Oxford
Writing a Research Data Management Plan - 2016-11-09 - University of Oxford
 
Introduction to Data Management Planning
Introduction to Data Management PlanningIntroduction to Data Management Planning
Introduction to Data Management Planning
 
Practical Strategies for Research Data Management
Practical Strategies for Research Data ManagementPractical Strategies for Research Data Management
Practical Strategies for Research Data Management
 
How and Why to Share Your Data
How and Why to Share Your DataHow and Why to Share Your Data
How and Why to Share Your Data
 
Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing data
 
Open Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and SolutionsOpen Access to Research Data: Challenges and Solutions
Open Access to Research Data: Challenges and Solutions
 

Ähnlich wie Getting to grips with Research Data Management

Managing Your Research Data for Maximum Impact -Rob Daley 300616_Shared
Managing Your Research Data for Maximum Impact -Rob Daley 300616_SharedManaging Your Research Data for Maximum Impact -Rob Daley 300616_Shared
Managing Your Research Data for Maximum Impact -Rob Daley 300616_Shared
Rob Daley
 

Ähnlich wie Getting to grips with Research Data Management (20)

Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Management
 
Practical strategies for RDM
Practical strategies for RDMPractical strategies for RDM
Practical strategies for RDM
 
Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016Planning for Research Data Management: 26th January 2016
Planning for Research Data Management: 26th January 2016
 
Planning for Research Data Management
Planning for Research Data ManagementPlanning for Research Data Management
Planning for Research Data Management
 
Managing Your Research Data for Maximum Impact -Rob Daley 300616_Shared
Managing Your Research Data for Maximum Impact -Rob Daley 300616_SharedManaging Your Research Data for Maximum Impact -Rob Daley 300616_Shared
Managing Your Research Data for Maximum Impact -Rob Daley 300616_Shared
 
DC101 UWE
DC101 UWEDC101 UWE
DC101 UWE
 
Introduction to Research Data Management
Introduction to Research Data ManagementIntroduction to Research Data Management
Introduction to Research Data Management
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant Application
 
Creating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant ApplicationCreating a Data Management Plan for your Grant Application
Creating a Data Management Plan for your Grant Application
 
Data Management Planning for Researchers - An Introduction - 2015-11-04 - Un...
 Data Management Planning for Researchers - An Introduction - 2015-11-04 - Un... Data Management Planning for Researchers - An Introduction - 2015-11-04 - Un...
Data Management Planning for Researchers - An Introduction - 2015-11-04 - Un...
 
Data Management for Postgraduate students by Lynn Woolfrey
Data Management for Postgraduate students by Lynn WoolfreyData Management for Postgraduate students by Lynn Woolfrey
Data Management for Postgraduate students by Lynn Woolfrey
 
RDM: a briefing for Health Sciences
RDM: a briefing for Health SciencesRDM: a briefing for Health Sciences
RDM: a briefing for Health Sciences
 
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
PIDs, Data and Software: How Libraries Can Support Researchers in an Evolving...
 
Research Data Management: Why is it important?
Research Data Management: Why is it  important?Research Data Management: Why is it  important?
Research Data Management: Why is it important?
 
RDM for Librarians
RDM for LibrariansRDM for Librarians
RDM for Librarians
 
Data Management Planning for Researchers - 2016-02-08 - University of Oxford
Data Management Planning for Researchers - 2016-02-08 - University of OxfordData Management Planning for Researchers - 2016-02-08 - University of Oxford
Data Management Planning for Researchers - 2016-02-08 - University of Oxford
 
Data Management Planning for Researchers - An Introduction - 2015-02-18 - Un...
Data Management Planning for Researchers -  An Introduction - 2015-02-18 - Un...Data Management Planning for Researchers -  An Introduction - 2015-02-18 - Un...
Data Management Planning for Researchers - An Introduction - 2015-02-18 - Un...
 
Support Your Data, Kyoto University
Support Your Data, Kyoto UniversitySupport Your Data, Kyoto University
Support Your Data, Kyoto University
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Winter school in research data science research data management - final
Winter school in research data science research data management - finalWinter school in research data science research data management - final
Winter school in research data science research data management - final
 

Kürzlich hochgeladen

1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
negromaestrong
 

Kürzlich hochgeladen (20)

How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
Asian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptxAsian American Pacific Islander Month DDSD 2024.pptx
Asian American Pacific Islander Month DDSD 2024.pptx
 
Magic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptxMagic bus Group work1and 2 (Team 3).pptx
Magic bus Group work1and 2 (Team 3).pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
Seal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptxSeal of Good Local Governance (SGLG) 2024Final.pptx
Seal of Good Local Governance (SGLG) 2024Final.pptx
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 

Getting to grips with Research Data Management

  • 1. Getting to grips with Research Data Management 10th November 2015 Isabel Chadwick, Research Data Librarian library-research-support@open.ac.uk
  • 2. Overview of the workshop • What is Research Data Management? • Sharing data • Working with data • Planning for data • Useful resources • Questions?
  • 3. What is Research Data Management? “Research data management concerns the organisation of data, from its entry to the research cycle through to the dissemination and archiving of valuable results. It aims to ensure reliable verification of results, and permits new and innovative research built on existing information." Digital Curation Centre (2011) Making the Case for Research Data Management http://www.dcc.ac.uk/sites/default/files/documents/publications/Making%20the%20case.pdf
  • 4. What is Research Data Management? Discussion • Describe your research • What type of data do you create/use? • What data management challenges do you face?
  • 5. What is Research Data Management? UK Data Archive Data Lifecycle model http://www.data-archive.ac.uk/create-manage/life-cycle Design research Plan data management Plan consent for sharing Locate existing data Collect data Capture and create metadata Creating data
  • 6. What is Research Data Management? UK Data Archive Data Lifecycle model http://www.data-archive.ac.uk/create-manage/life-cycle Enter data, digitise, transcribe, translate Check, validate, clean data Anonymise data Describe data Manage and store data Processing data
  • 7. What is Research Data Management? UK Data Archive Data Lifecycle model http://www.data-archive.ac.uk/create-manage/life-cycle Interpret data Derive data Produce research outputs Author publications Prepare data for publications Analysing data
  • 8. What is Research Data Management? UK Data Archive Data Lifecycle model http://www.data-archive.ac.uk/create-manage/life-cycle Migrate data to best format Migrate data to suitable medium Back-up and store data Create metadata and documentation Archive data Preserving data
  • 9. What is Research Data Management? UK Data Archive Data Lifecycle model http://www.data-archive.ac.uk/create-manage/life-cycle Distribute data Share data Control access Establish copyright Assign licences Promote data Giving access to data
  • 10. What is Research Data Management? UK Data Archive Data Lifecycle model http://www.data-archive.ac.uk/create-manage/life-cycle Follow-up research New research Undertake research reviews Scrutinise findings Teach and learn Re-using data
  • 11. What is Research Data Management? Why spend time and effort on this? • So you can work efficiently and effectively –Save time and reduce frustration –Highlight patterns or connections that might otherwise be missed • Because your data is precious • To enable data re-use and sharing • To meet funders’ and institutional requirements
  • 12. What is Research Data Management? What does the OU expect? “Research data must be managed to the highest standards throughout their life-cycle in order to support excellence in research practice. In keeping with OU principles of open-ness, it is expected that research data will be open and accessible to other researchers, as soon as appropriate and verifiable, subject to the application of appropriate safeguards relating to the sensitivity of the data and legal requirements.” OU Principles of Research Data Management, April 2013 http://intranet.open.ac.uk/research-school/strategy-info-governance/docs/CoPamendedJuly
  • 13. What is Research Data Management? What do funders expect? “Publicly funded research data are a public good, produced in the public interest, which should be made openly available with as few restrictions as possible in a timely and responsible manner that does not harm intellectual property.” RCUK Common Principles on Research Data Policy, 2011 http://www.rcuk.ac.uk/research/datapolicy/
  • 14. What is Research Data Management? What do funders expect? http://www.dcc.ac.uk/resources/policy-and-legal/overview-funders-data-policies
  • 15. Sharing data Benefits of sharing data
  • 16. Sharing data Benefits of sharing data (2)
  • 17. Sharing data Benefits of sharing data (3)
  • 18. Sharing data What do you need to share? • Raw data • Derived data • Data underpinning publications • Code • Methods What are research data in your context? What would others need to understand your research?
  • 19. Sharing data Barriers to sharing data: discussion Discuss barriers to sharing your research data. These could be: •Ethical •Legal •Professional Can these barriers be overcome?
  • 20. Sharing data How can I share my data? OU Data Catalogue in ORO Data access statements Online data sharing services •Figshare •Zenodo •CKAN DataHub •Mendeley Data Directories •re3data Funders’ repository services •UK Data Service ReShare •NERC data centres
  • 21. Working with data “Start as you mean to go on” The end point of all projects should involve making the data publicly available. Many data will be deposited in national archives which have regulations for files and metadata. Thinking about the requirements at the beginning of the project will limit the transformations needed at the end of the project. Data Sharing
  • 22. • Shared areas or SharePoint • Zendto • Be wary of Dropbox & similar • OU collaboration tool in pipeline IT support for researchers: http://intranet6.open.ac.uk/library/main/supporting-ou-research/re Working with data External collaborators: IT Options
  • 23. Working with data Filing systems Filing is more than saving files, it’s making sure you can find them later in your project •Naming •Directory Structure •File Types •Versioning All these help to keep your data safe and accessible.
  • 24. Decide on a file naming convention at the start of your project. Useful file names are: •consistent. •meaningful to you and your colleagues. •allow you to find the file easily. Agree on the following elements of a file name: •Vocabulary •Punctuation •Dates (YYYY-MM-DD) •Order •Numbers •Version information Ideally you should be able to tell what’s in a file before opening it. Tip: create a readme file detailing the naming scheme. Working with data Naming conventions
  • 25. Working with data File formats • Unencrypted • Uncompressed • Non-proprietary/patent-encumbered • Open, documented standard • Standard representation (ASCII, Unicode) Type Recommended Avoid for data sharing Tabular data CSV, TSV, SPSS portable Excel Text Plain text, HTML, RTF PDF/A only if layout matters Word Media Container: MP4, Ogg Codec: Theora, Dirac, FLAC Quicktime H264 Images TIFF, JPEG2000, PNG GIF, JPG Structured data XML, RDF RDBMS Further examples: http://www.data-archive.ac.uk/create-manage/format/formats-table
  • 26. Working with data Metadata & documentation • Metadata is additional information that is required to make sense of your files – it’s data about data. Guidance on disciplinary metadata standards: http://www.dcc.ac.uk/resources/metadata-standards
  • 27. Working with data Metadata & documentation (2) Think FAIR! Findable Accessible Interoperable Re-usable Data FAIRport initiative: http://datafairport.org/
  • 28. Working with data Sensitive data When working with research participants.... •Ensure you have obtained valid consent •Consider who needs access to the data •Inform your participants what will happen with the data after the project has finished •Pre-planning and agreeing with participants during the consent process, on what may and may not be recorded or transcribed, can be more effective than anonymisation •Consider controlling access if anonymisation or consent for sharing are impossible
  • 29. Working with data Sensitive data (2) Managing sensitive data •If possible, collect the necessary data without using personally identifying information •De-identify your data upon collection or as soon as possible thereafter •Avoid transmitting unencrypted personal data electronically •Consider whether you need to keep original collection instruments (recordings, surveys etc.) once they have been transcribed and quality assured
  • 30. Planning for data • Make informed decisions to anticipate and avoid problems • Avoid duplication, data loss and security breaches • Develop procedures early on for consistency • Ensure data are accurate, complete, reliable and secure • Save time and effort – make your life easier! Data Management Plans are useful whenever you are creating data to:
  • 31. Planning for data Which funders require a DMP? www.dcc.ac.uk/resources/policy-and-legal/ overview-funders-data-policies Note: Data Management Plans are a requirement of Horizon 2020 projects included in the Research Data pilot
  • 32. Planning for data Activity Think about your own research. What actions would you need to perform on your data at each stage of the UKDA’s Lifecycle model? How would you do this? Would you need any additional funding/staff?
  • 33. Planning for data DMPOnline https://dmponline.dcc.ac.uk A web-based tool to help you write DMPs according to different requirements. DCC, funder and OU guidance.
  • 34. Planning for data Tips • Keep it simple, short and specific • Seek advice - consult and collaborate • Base plans on available skills and support • Make sure implementation is feasible • Justify any resources or restrictions needed
  • 35. Library Services How we can help • Data Management Plan checking • Support with setting up new projects • Advice on preparation of data for sharing • Data catalogue on ORO • Online guidance • Enquiries • Development of new tools to enable data management and sharing Email: library-research- support@open.ac.uk
  • 36. Useful links • The OU Research Data Management intranet site: http://intranet6.open.ac.uk/library/main/supporting-ou-research/research- data-management • Digital Curation Centre: http://www.dcc.ac.uk/ • DMPOnline: https://dmponline.dcc.ac.uk/ • UK Data Archive: http://www.data-archive.ac.uk/ • MANTRA: http://datalib.edina.ac.uk/mantra/ • The Orb: http://open.ac.uk/blogs/the_orb
  • 38. Image credits Unless otherwise stated, all images are by Jørgen Stamp at http://www.digitalbevaring.dk

Hinweis der Redaktion

  1. (2 minutes) •Welcome •Introduce myself •Housekeeping
  2. (2 minutes) Overview of the workshop When I first planned this workshop, I intended to start with planning and end with sharing as that is the order that you would do things in your project. However the end aim of RDM is to make research data openly available, and I think that discussing why and how to do this first will give further context to why the rdm processes we’re going to cover today should be undertaken.
  3. 1 min (5) Read the quotation. This quotation from the Digital Curation Centre sums up what Research Data Management is all about. It covers the management of data throughout your research lifecycle (more on that later) and beyond, when you will be sharing your data with other researchers. This is relevant to all research which produces data, although you may find that the methods you use differ depending on your type of research or academic discipline. A quick word on the Digital Curation Centre (DCC). They are the leading experts in the UK on Research Data Management, and gave us a lot of help when we set up the RDM project. Their website is a great source of information and guidance.
  4. 5 minutes (10) Slide 4 Discussion Introduce yourself to the person sitting next to you & talk about the type of data which you produce, and any data management challenges you’ve come across.
  5. 7 minutes (17) Data often have a longer lifespan than the research project that creates them. Researchers may continue to work on data after funding has ceased, follow-up projects may analyse or add to the data, and data may be re-used by other researchers. Well organised, well documented, preserved and shared data are invaluable to advance scientific inquiry and to increase opportunities for learning and innovation.
  6. 7 minutes (17) Data often have a longer lifespan than the research project that creates them. Researchers may continue to work on data after funding has ceased, follow-up projects may analyse or add to the data, and data may be re-used by other researchers. Well organised, well documented, preserved and shared data are invaluable to advance scientific inquiry and to increase opportunities for learning and innovation.
  7. 7 minutes (17) Data often have a longer lifespan than the research project that creates them. Researchers may continue to work on data after funding has ceased, follow-up projects may analyse or add to the data, and data may be re-used by other researchers. Well organised, well documented, preserved and shared data are invaluable to advance scientific inquiry and to increase opportunities for learning and innovation.
  8. 7 minutes (17) Data often have a longer lifespan than the research project that creates them. Researchers may continue to work on data after funding has ceased, follow-up projects may analyse or add to the data, and data may be re-used by other researchers. Well organised, well documented, preserved and shared data are invaluable to advance scientific inquiry and to increase opportunities for learning and innovation.
  9. 7 minutes (17) Data often have a longer lifespan than the research project that creates them. Researchers may continue to work on data after funding has ceased, follow-up projects may analyse or add to the data, and data may be re-used by other researchers. Well organised, well documented, preserved and shared data are invaluable to advance scientific inquiry and to increase opportunities for learning and innovation.
  10. 7 minutes (17) Data often have a longer lifespan than the research project that creates them. Researchers may continue to work on data after funding has ceased, follow-up projects may analyse or add to the data, and data may be re-used by other researchers. Well organised, well documented, preserved and shared data are invaluable to advance scientific inquiry and to increase opportunities for learning and innovation.
  11. 3 mins (20) Good data management does require an investment of effort – but ultimately it’s something that can actually save you time, by helping you work more efficiently. Many of us are all too well acquainted with the frustration of trying to track down a fact or a document we know we have somewhere. Good research data management – setting up an organizational system that works for you, and ensuring everything is properly filed or labelled to enable re-identification and retrieval – can make life a lot easier. And it’s not just a matter of saving time and reducing unnecessary effort (though clearly that’s a major benefit): having everything well ordered can also help you get a better feel of the shape and scope of your research material, which in turn can enable you to spot patterns or connections that might otherwise get missed. It’s also well worth doing, because the data you’re producing or working with is valuable As well as this being true for your own research, the data might ultimately be of use to other researchers. Having everything well organized and properly labelled also has the potential to save you a lot of time at the end of a research project, when it comes to deciding what to do with your data – but more of that later. Finally, there may be requirements imposed by your funding body and/or the university which you need to meet
  12. 2 mins (22) In 2013, the OU wrote a set of principles for research data management. These have since been added as an appendix to the research code of practice. The principles are high-level, but they confirm the OU’s commitment to ensuring that research data is properly managed and shared as much as possible. Note: All those engaged in research at the OU, including those involved in collaborating with other institutions, must take personal responsibility for managing their research data in accordance with University and funder requirements
  13. 1 min (23) The RCUK policy was released in 2011, and this has been followed up by all of the UK research councils releasing their own policies. The basic premise (as stated in this slide) is the same for all councils, but there are variations in the ways in which they expect this to be achieved.
  14. Here’s an overview of what the research councils expect. If you haven’t done so already, find your funder’s research data policy and check that you are compliant. It’s not only RCUK funders which have requirements, e.g. Horizon 2020 and government funding. Make sure you check out your funder policy as early as possible even if last time you checked they didn’t have one, as more and more policies are being released.
  15. 1 mins (24) Sharing data can have huge impacts on collaboration between researchers world wide as this example shows.
  16. 1 min (25) You might remember this news story about George Osborne basing the austerity plan on research data which had been incorrectly analysed. By making data public these kinds of anomalies are more likely to be spotted and incidents like this less likely to happen!
  17. 1 min (26) And of course there is a personal benefit to you as a researcher. Studies have found that there is between a 9% and a 30% increase in citations for papers which make the underlying data available.
  18. 1 min (27) Think about what research data are in your context. Depending on your academic discipline and the data type, what you share may vary. You might want to share raw data, but in some disciplines this might be totally innappropriate, as they will be too vast and meaningless to other people. You might just want to share your derived, analysed data Or you might only want to share the data which underpins your publications, but you need to think about whether this will be understandable to others, would they be able to replicate your results? So you might also want to share your code or your methods to enable better understanding.
  19. 5 mins discussion 3 mins feedback (35) In some cases, there may be concerns about sharing data, or reasons why all or part of a dataset needs to be kept private. These may be ethical (the data is confidential), legal (the dataset includes third party material with restrictions on usage), or professional (you intend to publish the results, and don’t want someone to get there first). It’s worth noting that many difficulties or concerns about sharing data can be alleviated by advance planning. For example, ensuring you get proper permissions when data is collected can reduce problems with sharing personal data. If your dataset is a combination of third party data and new material, you may need to have a version of the data where these are kept separate. Proper documentation is also important here: this will help keep track of what you’re allowed to do with data, and what’s happened to it in the course of the project.
  20. 2 mins (37) There are a number of ways that you can share your data. The OU does not currently have the capacity to archive research data and make it publicly available, but there is a project happening which is looking into ways that we can achieve this. The first step will be to include metadata records of research data in ORO, which will directly link to your publications in ORO and also to the underpinning data wherever that may be stored. This should be ready in the autumn, and it will be a requirement that all research data created at the OU is recorded. Externally, there are a number of repositories. Your funder may well have a repository in which you are required to deposit your data, like the ESRC which has recently re-branded its ESRC datastore. Those who had experienced the datastore will be please to hear that this now seems to be a faster, more user-friendly service than the previous incarnation. Also, the NERC data centres. In addition to this there are several free, online services like Figshare, which was devised by someone from UCL and is used now by various journals to publish data underpinning research publications. It can also be used as a datastore throughout your project, as it allows online analysis of data, and collaboration with other partners. You may upload unlimited public data and you also get a 1GB allowance for private data. Zenodo is a similar tool, but can only be used for publication, this was developed by CERN as part of the EU OpenAIRE project and is aimed at the long-tail of science. There is a maximum threshold for upload of 2GB per file, but you are able to include multiple files in one dataset or collection. CKAN datahub is another similar, free-to-use tool. There are now a number of journals which specialise in research data, here are 2 examples. Other journals may allow you to link to your data stored in Figshare or Dryad. And finally here are 2 directories of data repositories, which list a range of repositories according to academic discipline.
  21. 4 mins (41) Start as you mean to go on Consider all the preparation necessary for making your data shareable and how you can reduce the workload at the end of the project by doing the work during the project Metadata and documentation (logs, instructions, records) File formats File naming Data security and storage
  22. 1 min (42) Think about names and formats before clicking save Where do you need this file; is it used by another program? Do the name and location make sense? Consideration at the beginning makes it easier to find files and related documents later.
  23. 1 min (43) Vocabulary – choose a standard vocabulary for file names, so that everyone uses a common language. Punctuation – decide on conventions for if and when to use punctuation symbols, capitals, hyphens and spaces. Dates – agree on a logical use of dates so that they display chronologically i.e. YYYY-MM-DD. Order - confirm which element should go first, so that files on the same theme are listed together and can therefore be found easily. Numbers – specify the amount of digits that will be used in numbering so that files are listed numerically e.g. 01, 002, etc.
  24. 1 min (44) When thinking about file formats, certain formats are more appropriate for long-term preservation and sharing. Avoid using proprietary formats, these are formats which can only be opened by a specific type of software, like Work and Quicktime, as the software may become obsolete in the future and the files will more difficult to open. You can of course migrate your files into different formats at the end of your project prior to deposit in a repository or archive, but by thinking about this from the beginning and ensuring the right formats have been used throughout will save you a lot of time when you come to thinking about sharing your data later.
  25. 1 min (45) Slide 19- metadata (1) (2 mins) It’s not a new idea Most people do it to a certain extent without thinking You might organize your collection by artist, title, even colour! This is made much easier in a digital environment
  26. 1 min (46) 1. To be Findable any Data Object should be uniquely and persistently identifiable [4]1.1. The same Data Object should be re-findable at any point in time, thus Data Objects should be persistent, with emphasis on their metadata, [4 and JDDCP 4 and JDDCP 6]1.2. A Data Object should minimally contain basic machine readable metadata that allows it to be distinguished from other Data Objects [seeJDDCP 5]1.3. Identifiers for any concept used in Data Objects should therefore be Unique and Persistent [5 and JDDCP 4 and JDDCP 6]. 2. Data is Accessible in that it can be always obtained by machines and humans2.1 Upon appropriate authorization [6]2.2 Through a well-defined protocol [7 and JDDCP 5]2.3 Thus, machines and humans alike will be able to judge the actual accessibilty of each Data Object. 3. Data Objects can be Interoperable only if:3.1. (Meta) data is machine-readable [8]3.2. (Meta) data formats utilize shared vocabularies and/or ontologies [9]3.3  (Meta) data within the Data Object should thus be both syntactically parseable and semantically machine-accessible [10] 4. For Data Objects to be Re-usable additional criteria are:4.1 Data Objects should be compliant with principles 1-34.2 (Meta) data should be sufficiently well-described and rich that it can be automatically (or with minimal human effort) linked or integrated, like-with-like, with other data sources [11 and JDDCP 7 and JDDCP 8]4.3 Published Data Objects should refer to their sources with rich enough metadata and provenance to enable proper citation (ref to JDDCP 1-3).
  27. 2 mins (50) In the past researchers gained consent from participants primarily so that they could collect data.  However, many funders are now increasingly requesting researchers to share and preserve their data as part of their requirements. It is therefore important that participants fully understand: how you will store, publish and share their data how you will ensure that their data remains confidential and anonymous (where applicable) throughout the duration of the project and after Failure to obtain consent could result in non-compliance with your funder's requirements and limit the opportunities you have to share, publish and preserve your data. If things change, you may be able to go back to your participants and change the details of the agreement. Anonymisation can be time-consuming, so agreeing what can and can’t be recorded or transcribed may well save you time and effort. For example, if they don’t want you to use names, then conduct the interview without using names.
  28. 2 mins (52) As mentioned before, if possible in the collection process, not using personally identifying information can save time and effort as you will have less to anonymise. Make sure you are storing your sensitive data sensibly. If possible, de-identify your data upon collection, this will reduce the damage is a security breach happens. Make sure you are encrypting your data if you have to send it electronically (eg by email) Do you need to keep the original recording? If it’s been transcribed, what value does it hold? By destroying it as early as possible you are reducing the risk.
  29. Slide 9 – Planning for data 2 minutes (62)
  30. Slide 11 – Which funders require a DMP? (2 mins) •Quick overview – point out EPSRC does not require one, and Horizon 2020 only for projects included in the pilot •However, the OU recommends that all researchers write a DMP regardless of whether their funder requires them to do so or not, as it is a useful exercise for ensuring that data will be managed responsibly throughout the lifecycle.
  31. Slide 10 – Data Management Planning Activity (5 minutes) Think about the research you are working on at the moment, or a recent project. Consider the actions you will need to take and the barriers you might face at all the different stages of the DCC data curation lifecycle. How could they be overcome? This is a useful exercise to start thinking about the information you would need to put in your plan.
  32. 3 mins (65) DMPOnline is a tool developed by the DCC which helps you to write your data management plan. There are templates for dmps for all the research councils, Horizon 2020, Wellcome Trust and CRUK. It takes you through the sections of the templates and gives guidance as you work. We’ve now incorporated some OU guidance into this as well. There is also an OU template for researchers who are not funded by any of the bodies for which there is a template, but feel it would be helpful to write a data management plan anyway. If you do try out this tool, please give me any feedback you might have.
  33. 1 min (66) Keep it simple – not all the reviewers are going to be data management experts Be specific – instead of saying “we will follow standards” explain WHICH standards, instead of “we will create a large amount of data” HOW MUCH data? Short – some funders have requirements for how long the plan should be (eg. ESRC 3 pages) Seek advice – from other researchers at the university who have written plans, or done similar projects. Example of the reading experience database taking advice from colleagues who had worked on the listening experience database. Be realistic! RDM is an allowable cost for all RCUK funders, but any costs have to be fully accounted for. All expenditure on direct costs must take place before the actual end date of the project and must be fully auditable. No expenditure can be ‘double funded’ (a service that is centrally supported by the indirect costs paid on all research grants cannot then also be included as a direct cost on a grant)
  34. Send DMPs in advance of bid submission! Preferably a week ahead, if possible. But later is better than never! I am happy to meet with Pis and project teams at the beginning of projects to discuss strategies for managing data and clarify funder requirements. Also able to set up bespoke training sessions for departments/research groups At the end of your project, hopefully your data will have been managed in a way that facilitates sharing, but if in doubt get in touch for help Guidance is on the intranet site, URL on next slide. Send enquiries to email at bottom of screen, this way anyone from the team can pick it up if I’m away. The RDM project is developing some infrastructure, with 2 aims: collaborating on data during projects, and sharing and preserving data post-project. Just starting procurement process now and hope to have something in place by mid-2016.
  35. 2 mins (68) Links to additional resources are available on the RDM intranet site. I’ll put this presentation on the site after the workshop.
  36. Max. 12 mins (80)