DATUM in Action
Supporting researchers to plan and manage their research data
www.northumbria.ac.uk/datum

Dr Jeremy Ellman
Prof Julie McLeod
School of Computing, Engineering & Information Sciences

JISCMRD Workshop: Meeting challenges in RDM Planning, 23 Mar 2012
DATUM in Action – Healthy research needs healthy data
Outline
• DATUM in Action & its research project target: MATSIQEL
• MATSIQEL data requirements
• Infrastructure element: a software solution
• Conclusions
DATUM in Action – Healthy research needs healthy data   1
DATUM in Action
Action Research Approach

Steps
1. Requirements analysis
2. Data management plan
3. Supporting infrastructure

Infrastructure
1. Training & mentoring
   DMP (based on DCC & tailored)
2. Guidance
   researcher focused; what they wanted; 'how to' & links to existing NU guidance; usable by other HEIs
   • roles & responsibilities
   • folders, files & version control
   • metadata
   • information security
3. Technology
   i. shared drive, MS Office
   ii. SharePoint (collaborative prototype)
   iii. Bazaar – version control
MATSIQEL: EU FP7 Research Project

• Data management requirements are complex because the research data is:
   • Confidential
      - Partially anonymised
      - Partly proprietary
   • Multiple Versions
      - Both Raw Data and Derived Data
      - Multiple levels of access required/allowed
• Required Internationally
   - EU and non-EU
Data Requirements
• Version Control
   - Multiple copies of Raw Data
      - Versions of Processing Software
   - Multiple Versions of Processed Data
      - Different Research Centres
• Distribution Control
• Central shared data space
   - Authoritative record
• Permissions Management
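
One way to make the "multiple copies of raw data" and "authoritative record" requirements checkable is to keep a checksum manifest next to the central copy. The sketch below is an illustration only and is not taken from the DATUM project: the directory and file names are hypothetical, and it assumes the `sha256sum` command is available.

```shell
#!/bin/sh
# Illustrative only: directory and file names are hypothetical.
set -eu

work=$(mktemp -d)                       # scratch area, nothing real is touched
mkdir -p "$work/central/raw" "$work/centre_b/raw"

# The central shared space holds the authoritative raw data file.
printf 'id,value\n1,0.42\n' > "$work/central/raw/session01.csv"

# Record a checksum manifest next to the authoritative copy.
(cd "$work/central" && sha256sum raw/session01.csv > MANIFEST.sha256)

# A research centre receives a copy of the data and of the manifest.
cp "$work/central/raw/session01.csv" "$work/centre_b/raw/"
cp "$work/central/MANIFEST.sha256" "$work/centre_b/"

# Verify the copy; sha256sum -c exits non-zero if any file does not match.
if (cd "$work/centre_b" && sha256sum -c MANIFEST.sha256 >/dev/null 2>&1); then
    copy_status="matches authoritative record"
else
    copy_status="differs from authoritative record"
fi
echo "$copy_status"
```

A manifest like this only detects differences between copies. Tracking how and why the data changed is the job of the version control system.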
BZR: Bazaar
• Distributed Version Control System
• Multiple Repositories
• Multiple Branches
• Local Check-in and Check-out
• Multiple Platforms
   - Windows, Mac, Linux
• Huge Range of Tools
   - bazaar-explorer: cross-platform GUI
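
The "multiple repositories", "multiple branches" and "local check-in" points can be sketched with a few Bazaar commands. This is a minimal illustration rather than the project's actual setup: the branch names, file names and identity are hypothetical, and the script skips the demonstration when `bzr` is not installed.

```shell
#!/bin/sh
# Illustrative only: branch names, paths and the identity are hypothetical.
set -eu

dvcs_status="skipped"                   # reported when bzr is not installed
if command -v bzr >/dev/null 2>&1; then
    work=$(mktemp -d)
    export BZR_EMAIL="Example Researcher <researcher@example.org>"
    (
        cd "$work"
        # A "central" branch standing in for the shared data space.
        mkdir central
        (cd central && bzr init)
        printf 'id,value\n1,0.42\n' > central/results.csv
        (cd central && bzr add results.csv && bzr commit -m "Initial results")

        # A research centre takes its own full copy, history included.
        bzr branch central centre_a

        # Work is committed locally; no connection to central is needed.
        printf '2,0.57\n' >> centre_a/results.csv
        (cd centre_a && bzr commit -m "Add second record at centre A")

        # Central later takes the new revision from centre A's branch.
        (cd central && bzr pull ../centre_a)
    )
    dvcs_status="central at revision $(cd "$work/central" && bzr revno)"
fi
echo "$dvcs_status"
```

Here both branches live on one machine for simplicity. In practice the central branch would sit on a shared server and each centre would branch from it over the network.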
VCS vs DVCS

[Figure not reproduced]

Source: Auvray 2008 “Distributed Version Control Systems: A Not-So-Quick Guide”

Core Commands
• "bzr init" initializes Bazaar management for the current directory.
• "bzr add" makes all unknown files in the current directory known to Bazaar.
• "bzr status" generates a report of the current state of the local branch.
• "bzr commit -m "[commit-message]"" creates a commit with the given message.
• "bzr mv [versioned-file] [new-location]" moves or renames the [versioned-file] to [new-location].
• "bzr remove [file]" removes the specified file or files from version control.
• "bzr log" generates a log of every commit in sequence.
• "bzr help [command]" shows Bazaar's built-in documentation for [command].
• "bzr merge [location]" instructs Bazaar to merge changes from the branch at [location] into the local working tree.
• "bzr pull" performs a fast-forward update of the local branch and working tree from another branch; it fails if the two branches have diverged.
• "bzr update" brings the working tree up to date with its branch (for a checkout, with the remote branch it is bound to).
• "bzr push" updates a mirror of the local branch at another location with the local commits.
• "bzr uncommit" "rewinds" the branch by removing the last commit.
• "bzr revert [file]" takes the specified file and reverts its contents to the last committed revision.
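
The commands above combine into a simple day-to-day cycle. The following sketch is an assumption about typical use, not a transcript of the project's workflow: the file name, commit messages and identity are hypothetical, and the demonstration is skipped when `bzr` is not installed.

```shell
#!/bin/sh
# Illustrative only: the file name, commit messages and identity are hypothetical.
set -eu

cycle_status="skipped"                  # reported when bzr is not installed
if command -v bzr >/dev/null 2>&1; then
    work=$(mktemp -d)                   # scratch directory
    export BZR_EMAIL="Example Researcher <researcher@example.org>"
    (
        cd "$work"
        bzr init                        # put this directory under Bazaar control
        printf 'id,value\n1,0.42\n' > raw_data.csv
        bzr add raw_data.csv            # make the new file known to Bazaar
        bzr status                      # lists raw_data.csv as added
        bzr commit -m "Add raw data as received"

        printf '2,0.57\n' >> raw_data.csv
        bzr commit -m "Append second record"

        echo "accidental edit" >> raw_data.csv
        bzr revert raw_data.csv         # back to the last committed contents
        bzr log --line                  # one line per revision
    )
    revisions=$(cd "$work" && bzr revno)
    lines=$(wc -l < "$work/raw_data.csv" | tr -d ' ')
    cycle_status="$revisions revisions, $lines lines"
fi
echo "$cycle_status"
```

Bazaar needs a user identity before it can commit. The sketch sets it through the `BZR_EMAIL` environment variable so that no personal configuration is changed; on a real workstation `bzr whoami "Name <email>"` would normally be run once instead.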

Conclusions
• Research data needs to be controlled
   - Version control, protection and distribution are common requirements
• Distributed version control software is freely available
   - BZR repositories can be archived in SharePoint
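
The slides do not say how the repositories were archived in SharePoint. One plausible approach, sketched here under that assumption, is to pack a branch directory, including its hidden `.bzr` history folder, into a single compressed file that can be uploaded to a document library like any other document. The paths and archive name are hypothetical, and a placeholder directory stands in for a real Bazaar branch.

```shell
#!/bin/sh
# Illustrative only: paths and the archive name are hypothetical.
set -eu

work=$(mktemp -d)
# Stand-in for a Bazaar working tree; a real one keeps its history in the
# hidden .bzr directory created by "bzr init".
mkdir -p "$work/study/.bzr"
printf 'id,value\n1,0.42\n' > "$work/study/raw_data.csv"

# Bundle the whole tree, including .bzr, so the history travels with the data.
archive="$work/study-$(date +%Y%m%d).tar.gz"
tar -czf "$archive" -C "$work" study

# List the archive to confirm that the .bzr directory was captured.
archived_entries=$(tar -tzf "$archive")
echo "$archived_entries"
```

Unpacking the archive elsewhere restores a usable branch, because Bazaar keeps all of a standalone branch's history inside `.bzr`.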




DATUM in Action
www.northumbria.ac.uk/datum
Project funded by JISC
Copyright holder: Northumbria University, School of Computing, Engineering & Information Sciences, 2011
Materials made freely available under a Creative Commons Attribution-NonCommercial-ShareAlike 2.0 UK: England & Wales license

For the accompanying video see http://youtu.be/yqV1VOqwy1c

 


Editor's notes

  1. There is a video to go with this presentation at: http://youtu.be/yqV1VOqwy1c
  2. Aim: to help a collaborative group of researchers on an EU FP7 staff exchange project define & implement good research data management (RDM) practice.
  3. MATSIQEL is developing mathematical & computer models for technological solutions (e.g. monitors, telecare, recreational games) for improving and enhancing quality of life. Focus on WP4/5.