Research Data Management from a Software Engineering Perspective
1. Research Data Management – From a
Software Engineering Perspective
Research Software Engineering Community Event
Imperial College London, 17th February 2016
Sarah A. Stewart
Research Data Support Assistant
Imperial College London
sarah.stewart@imperial.ac.uk / @Biostew
http://orcid.org/0000-0002-9465-4042
4. Software = Data?
• ‘Software is used to create, interpret, present,
manipulate and manage data.’
(Software Sustainability Institute)
• Data: ‘recorded factual material commonly retained by
and accepted...as necessary to validate research
findings’ (EPSRC)
• Is the software necessary to validate research findings?
• Good practice to make software available with
adequate documentation to enable others to more
easily validate your research findings and to access
and re-use your research data.
• Software = Data!
5. Missing Research Data
“In their parents' attic, in boxes in the garage, or stored on now-defunct floppy disks —
these are just some of the inaccessible places in which scientists have admitted to
keeping their old research data.”
http://www.nature.com/news/scientists-losing-data-at-a-rapid-rate-1.14416
6. Why is RDM in Software Engineering Important ?
• Software Preservation! Digital Preservation a pressing concern!
• Storage, Retrieval, Reconstruction, Replay - all are complexities relating to
code libraries, dependencies and software engineering overall.
• Preserving data is talked about more often, but software that created it is also
important, particularly if data needs to be re-created, or requires the software
in order to interrogate or visualise the data.
• Planning is essential for subsequent retrieval, reconstruction and replay.
• Software is a digital object which is frequently the result of research and is
often a vital prerequisite for the preservation of other digital objects.
• Software preservation should be part of a broader preservation strategy -
RDM.
7. Imperial College RDM Policy
The Provost’s Board has approved a research data management policy in order to
assist principle investigators in meeting requirements from research funding
bodies:
“Imperial College London is committed to promoting the highest standards of
academic research, including excellence in research data management. This
includes a robust digital curation infrastructure that supports open data access
and protects confidential data. The College acknowledges legal, ethical and
commercial constraints on data sharing and the need to preserve the academic
entitlement to publication.”
“Principal Investigators have overall responsibility for the effective management
of research data generated within or obtained for their research, including by
their research groups. The Library and ICT will provide training, guidance and
services to support PIs.”
https://workspace.imperial.ac.uk/researchservices/Public/Imperial%20College%20RD
M%20Policy.pdf
11. The RDM Team : Here to Help!
We provide:
• guidance to assist you to comply with
Imperial College and funders’ policies
regarding the management of research data
• advice about tools and resources to help
you manage your data throughout the
lifecycle of your research
• workshops and information sessions to
keep you updated with the latest RDM
practices
You can contact us with any queries at: rdm-enquiries@imperial.ac.uk
12. Any Questions ?
Thank you!
For more Information:
Webpage: www.imperial.ac.uk/research-data-management
E-mail : rdm-enquiries@imperial.ac.uk
RDM Team – Ash Barnes, Sarah Stewart