Service integration to Enhance RDM: RSpace electronic lab notebook at the University of Edinburgh
1. Service Integration to Enhance RDM:
RSpace electronic laboratory notebook (ELN) case
study at the University of Edinburgh
Stuart Macdonald
RDM Service Coordinator
University of Edinburgh
stuart.macdonald@ed.ac.uk
Rory Macneil
CEO
Research Space
rmacneil@researchspace.com
10th International Digital Curation Conference, London, 10 February 2014
2. University of Edinburgh RDM Policy
University of Edinburgh is one of the
first Universities in UK to adopt a
policy for managing research data:
http://www.ed.ac.uk/is/research-data-policy
The policy was approved by the
University Court on 16 May 2011.
It’s acknowledged that this is an
aspirational policy and that
implementation will take some
years.
3. RDM Policy Implementation: 2-tiered Governance
Implementation Committee charged with delivering
services that will meet RDM policy objectives:
• Membership from across Information Services
• Iterate with researchers to ensure services meet the needs of researchers
Steering Committee with members of the Research
Committee & Heads of IT from the 3 colleges, IS, and
the Research Office (ERI)
• Provide oversight to the activity of the Implementation Committee
• Ensure services meet researcher requirements without harming research
competitiveness
4. Policy implementation - Research Data Management
Roadmap (2012-2015)
Cross-divisional collaboration
Services already in place:
o Data management planning
o Active working file space = DataStore
o Data publication repository =
DataShare
Services in development:
o Long term data archive = DataVault
o Data Asset Register (DAR)
RDM support: Awareness raising,
training & consultancy
http://edin.ac/1u3sKqy
[Roadmap diagram phases: Before research, During research, After research]
5. Research Data Management Planning
Customised instance of DCC’s DMPonline toolkit for Univ. of Edinburgh use:
Funders and local (non-funder) DMP templates
Institutional guidance (storage, services, support)
Tailored DMP assistance for researchers submitting research proposals (F-2-F)
DataStore
NAS facility to store data that are actively used in current research activities
0.5 TB (500 GB) allocated per researcher, PGR upwards
Up to 0.25 TB can be used for “shared” group storage
Extra storage costs: £200 per TB per year incl. back-up and DR copies
Infrastructure in place. Allocation of space devolved to School IT departments, overseen by
Heads of IT from each College.
De-allocation policy detailing responsibilities and storage costs for ‘orphaned data’ is pending
approval by the Steering Committee
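The DataStore allocation and pricing figures can be turned into a quick estimate. The sketch below is illustrative only: the function name is hypothetical, and it assumes extra storage is charged pro rata per TB above the free 0.5 TB allocation, which the slide does not state.

```python
# Figures taken from the slide; the pro-rata billing rule is an assumption.
FREE_ALLOCATION_TB = 0.5        # allocated per researcher, PGR upwards
EXTRA_COST_GBP_PER_TB_YEAR = 200.0


def extra_storage_cost(total_tb: float, years: float = 1.0) -> float:
    """Estimate the yearly charge (GBP) for storage beyond the free allocation."""
    if total_tb < 0 or years < 0:
        raise ValueError("total_tb and years must be non-negative")
    chargeable_tb = max(0.0, total_tb - FREE_ALLOCATION_TB)
    return chargeable_tb * EXTRA_COST_GBP_PER_TB_YEAR * years


if __name__ == "__main__":
    # A researcher needing 2.5 TB pays for the 2 TB above the free allocation.
    print(extra_storage_cost(2.5))
```

Under these assumptions a researcher who stays within 0.5 TB pays nothing, and each additional TB adds £200 per year.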
6. DataShare
Edinburgh DataShare is the University’s open access multi-disciplinary data repository:
http://datashare.is.ed.ac.uk
Helps researchers disseminate their research, get credit for data publication, and
preserve their data for the long term (DOI, licence, citation)
Data Vault
Safe, private and secure long-term data archive
Current focus on front-end application requirements (authorisation, retention & deletion,
file structure, file transfer, integration)
Data Asset Register (DAR)
A catalogue of data assets produced by Edinburgh researchers for discovery, access, and
re-use as appropriate.
Interoperation
Systems do not live in isolation, and are more likely to be used if some or all of the components
are integrated and developed to minimise ‘duplication’ of effort
7. RDM Support
• RDM team works with Research Administrators, Academic Support Librarians and IT
staff in each of the 22 Schools.
• Queries can be sent to the IS Helpline, which will direct them as appropriate.
• Introductory sessions on RDM services and support for research active and research
admin staff in Schools / Institutes
• RDM website: http://www.ed.ac.uk/is/data-management
• RDM blog: http://datablog.is.ed.ac.uk
• RDM wiki:
https://www.wiki.ed.ac.uk/display/RDM/Research+Data+Management+Wiki
8. Training: Tailored Courses
Formal and informal training in the form of workshops, power sessions, seminars and drop-in
sessions to help researchers with RDM issues.
Creating a data management plan for your grant application
Research Data Management Programme at the University of Edinburgh
Good practice in Research Data Management
Handling data using SPSS
Handling data with ArcGIS
Managing your research data: why it is important and what should you do?
Publishing and sharing sensitive data (pilot)
MANTRA
An internationally recognized free online RDM training course for researchers - developed by the
Data Library
Software-specific data handling exercises
RDM DIY Training Kit for Librarians
CC licence & embed units in VLEs, e.g. Moodle
9. Service Integration examples
• DataShare is a customised DSpace instance with OAI-PMH
compliant DCMI metadata fields for data discovery through Google and
other search engines
• Records are harvested by Thomson-Reuters Data Citation Index
• SWORD API utilised for batch deposit of large and/or many files from
computers (‘Push using http’)
• Internal batch ingest of many/large files to circumvent 2.1GB limit via
interface (‘Pull via command line interface’)
• checksums determine that delivered object mirrors deposited object
• DSpace GitHub plugin* - allows software to be archived from a GitHub (or
similar) source code repository into DataShare, which can then be given a
DOI to facilitate citation - using the SWORD deposit protocol
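The checksum step in the batch ingest can be sketched as follows. This is a minimal illustration, not DataShare's actual implementation: the slide does not say which hash algorithm is used, so SHA-256 is an assumption, and the function names are hypothetical.

```python
import hashlib
import io


def checksum(stream, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a binary stream, read in chunks
    so that large files never have to fit in memory at once."""
    digest = hashlib.sha256()  # assumption: algorithm not named in the source
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def delivered_matches_deposited(deposited, delivered) -> bool:
    """True when the delivered object is byte-for-byte identical to the deposit."""
    return checksum(deposited) == checksum(delivered)


if __name__ == "__main__":
    original = io.BytesIO(b"gel photo bytes")
    copy = io.BytesIO(b"gel photo bytes")
    print(delivered_matches_deposited(original, copy))
```

In practice the two streams would be opened with `open(path, "rb")` on the depositor's copy and on the copy held by the repository.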
10. DataSync – a secure dropbox-like facility for synchronising data on DataStore with
desktop and mobile machines:
• uses open source ‘ownCloud’ technology
ECDF Computing Cluster (‘Eddie’) revamp complete with ‘Data Centric
Computing’ business model – integrate Eddie storage & HPC, parallel and cloud
computing services with DataStore for data sharing
Linking of SDA toolkit with numeric ASCII data held in DataShare for the purposes of
analysis (re-use)
Facility to embargo variables within numeric files (in statistical analysis package
formats) for subsequent open deposit into DataShare and/or use in statistical analysis
packages
Research data deposit from RSpace Electronic Lab Notebook (ELN) interface into
DataShare (and DataStore & Data Vault) using SWORD
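To make the SWORD deposit concrete, the sketch below assembles the HTTP headers a client might send with a zipped package. It assumes SWORD v2 style headers and the METS DSpace SIP packaging identifier; the slides do not say which SWORD version or packaging RSpace uses, the function name is hypothetical, and no request is actually sent.

```python
import hashlib

# Assumption: packaging identifier commonly used for DSpace SWORD deposits.
METS_DSPACE_SIP = "http://purl.org/net/sword/package/METSDSpaceSIP"


def build_sword_headers(payload: bytes, filename: str,
                        in_progress: bool = False) -> dict:
    """Build headers for a binary SWORD deposit of a zip package.

    The MD5 digest lets the server confirm the package arrived intact.
    """
    return {
        "Content-Type": "application/zip",
        "Content-Disposition": f"attachment; filename={filename}",
        "Packaging": METS_DSPACE_SIP,
        "Content-MD5": hashlib.md5(payload).hexdigest(),
        # "true" would keep the deposit open so more files can be added later.
        "In-Progress": "true" if in_progress else "false",
    }


if __name__ == "__main__":
    for name, value in build_sword_headers(b"zip bytes", "notebook.zip").items():
        print(f"{name}: {value}")
```

The headers would accompany an authenticated POST to a collection URL advertised in the repository's SWORD service document; that URL is deployment-specific and is not given in the slides.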
11. Who and what is driving demand for ELNs?
● Researchers
– Utility and convenience of paper lab book + online capabilities
– On multiple devices
– File management/integration
● Groups/PIs
– Controlled sharing
– Collaboration
– Group management
– File management/integration
● Institutions: data librarians, research admins, IT, commercialisation offices
– Enterprise features: Scalable deployment, Single Sign On
– IP protection: audit trail, signing
– Publishing
– Archiving
– Repository integration
– File management/integration
13. Business Model
● Free public cloud for labs and individuals
● Institutional deployments @$100/user/year
● Seamless movement of groups and data between different RSpaces
[Diagram: value for researchers, institutions and funders (convenience, productivity, portability, control, compliance, data mining); labs move between the public cloud and institutional RSpace deployments such as Edinburgh and Stanford]
14. RSpace at Edinburgh
– Linking to files in Edinburgh DataStore
– Depositing content in Edinburgh DataShare
– Archiving in Edinburgh DataVault
15. Linking to DataStore
“My plan for workflow would be generally to deposit
my data in DataStore either from the wet lab
instruments (gel photos, elisa data, etc, and also
possibly directly from an iPad) or from in silico data
analysis I’ve been doing, and then link to it from within
RSpace.”
23. Archiving in Edinburgh DataVault
● DataVault functionality/API not yet specified
● Anticipate use of XML zip archive
● Many requirements to be determined
– e.g., searching, restoration
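Since the DataVault API was not yet specified, the following is only a guess at what an "XML zip archive" could look like: a zip file containing the notebook files plus an XML manifest listing them. The manifest layout, element names and function name are all hypothetical.

```python
import io
import zipfile
import xml.etree.ElementTree as ET


def build_xml_zip_archive(files: dict, title: str) -> bytes:
    """Pack files (name -> bytes) into a zip with a manifest.xml listing them."""
    # Hypothetical manifest schema: <archive title="..."><file name= size=/></archive>
    manifest = ET.Element("archive", {"title": title})
    for name, data in sorted(files.items()):
        ET.SubElement(manifest, "file", {"name": name, "size": str(len(data))})

    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w", zipfile.ZIP_DEFLATED) as archive:
        archive.writestr("manifest.xml", ET.tostring(manifest, encoding="unicode"))
        for name, data in sorted(files.items()):
            archive.writestr(name, data)
    return buffer.getvalue()


if __name__ == "__main__":
    blob = build_xml_zip_archive({"entry1.xml": b"<doc/>"}, "Lab book")
    print(zipfile.ZipFile(io.BytesIO(blob)).namelist())
```

A manifest of this kind is one way to support the open requirements listed above, since searching and restoration could both work from the file list without unpacking the whole archive.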
24. RSpace and Edinburgh RDM
[Architecture diagram: User / Browser, RSpace server, DataStore, DataShare, DataVault]
Details responsibilities of PIs and researchers, responsibilities and roles of the institution, and any joint responsibilities
All projects must have a DMP
Responsibility for RDM lies with PI
Univ. will provide DMP support, advice
Univ. will provide storage, backup, security, deposit and retention services
Any data held elsewhere should be recorded in asset register
Funders have policies, responsibilities fall to the
university as well as the researcher
Researchers are mobile
Institution and researcher must work together,
define the responsibilities
Awareness raising within the university of practicalities
What data will be collected or created?
How will the data be documented and described?
Where will the data be stored?
Who will be responsible for data security and backup?
Which data will be shared and/or preserved?
How will the data be shared and with whom?
Edinburgh Data Science Institute
Centre for Doctoral Training in Data Science (School of Informatics)
Data Lab Innovation centre - focused on helping Scottish industry to capitalise on a growing market opportunity in data science
Designed and developed over three years with teams at three leading global research institutions
The first and only ELN developed to meet the needs of research institutions – third generation supplanting second generation lab-focused ELNs.
Comparison with major competitor, Lab Archives, shows dominance of RSpace for institutional requirements.
Individual features not remarkable; advantage comes from the combination.
Sustainable advantage from relationships with three partners, long head start and difficulty/impossibility of discovering and implementing detailed requirements of this customer set.
Result: no real contest – like Man U vs. Cambridge United!