SlideShare ist ein Scribd-Unternehmen logo
1 von 36
Downloaden Sie, um offline zu lesen
Manage your data:
why and how?

OULS WISER Trinity 2009
22 May 2009


Luis Martinez Uribe
Luis.Martinez-Uribe@oerc.ox.ac.uk
Summary

   •   Background
   •   What are research data?
   •   What is research data management and curation?
   •   Oxford activities
   •   How to manage data and services available to researchers
Background


 New tools and infrastructures available
 to researchers




 A key characteristic is the generation of
 digital research data
How much data? A data deluge!

“More digital data will be
produce in the next 5 years
than in whole human history”
(Australian DEST )

2007 is the “crossover year”
where the amount of digital
information is greater than the
amount of available storage




          Source: “The Expanding Digital Universe: A forecast of Worldwide Information Growth through 2010” IDC Whitepaper, March 2007
What are research data?
   http://www.flickr.com/photos/iscjorgegarcia/2359144636/*
What are research data?


  “Research data is the evidence base on which academic researchers build
  their analytic or other work.
  It includes the widest possible range of data volumes from relatively small
  data sets up to vast data volumes generated by research in fields such as
  particle physics. It also includes great variety and heterogeneity of data and
  its accompanying metadata and documentation to make it usable and
  understood, or the digital representations and records for physical research
  data.” (UKRDS final report)
http://www.flickr.com/photos/piet_musterd/2231850447/   Examples
From Dr David Shotton presentation
http://www.flickr.com/photos/oliastro/2987657532/




http://www.flickr.com/photos/djmccrady/1883226927/
http://www.rcsb.org/pdb/explore.do?structureId=1BA4
http://ecrystals.chem.soton.ac.uk/604/
http://www.beazley.ox.ac.uk/XDB/ASP/recordDetails.asp?recordCount=37&start=0




From Building a VRE for the Humanities poster
                                                                                                             http://www.flickr.com/photos/wrowlands/2270729405/
presented at All Hands Meeting 2007
http://www.flickr.com/photos/piper/22584430/   http://www.flickr.com/photos/althouse/273160052/
http://www.flickr.com/photos/thivierr/540241947/




                                                   EUROBAROMETER 69 PUBLIC OPINION IN THE EUROPEAN UNION FIRST RESULTS
http://www.flickr.com/photos/hanan_cohen/455238557/




             Data Management and Curation
Research data management and curation


   •   Takes from knowledge/information management
   •   “…is understanding the current data needs and future ones” (US
       Department of Defence)
   •   A means to an end
   •   Not just technical infrastructure but also procedures and policies
   •   Preservation http://www.youtube.com/watch?v=pbBa6Oam7-w
   •   Digital Curation
          “maintaining and adding value to a trusted body of digital information for
          current and future use; it encompasses the active management of data
          throughout the information lifecycle” DCC Charter and Statement Principles
Why?


  •    Ensuring data quality and authenticity of research results
  •    Not re-inventing the wheel - data collection can be expensive!
  •    Better access to information (which in many cases is publicly funded) will
       produce high quality research
  •    Future access (preservation) http://www.youtube.com/watch?v=pbBa6Oam7-
       w
  •    Added value from data mining or combining datasets
  •    and …
Comply with requirements of funding agencies
                                       “the outputs from current and future research must be
                                        preserved and remain accessible for future generations”


 “expects research data generated as part of BBSRC support
 to be made available…data should be retain for a period of 10
 years after completion of the project”


                                     “require that the applicants provide a data management and
                                      sharing plan as part of their application”



 “requires all grant holders to offer for deposit copies of data to
  the UK Data Archive”




   SHERPA JULIET SERVICE http://tinyurl.com/datapolicies
Data management and curation
activities in Oxford
Scoping Digital Repository Services
for Research Data Management
Interviews with researchers
Researcher’s data - the challenges
         I COULDN’T MAKE SENSE OF THE              WHEN RESEARCHERS LEAVE THE
         DATA I COLLECTED FOR MY PhD 5          DEPARTMENT WE LOOSE ALL THE DATA
                  YEARS AGO                              THEY CREATED




       HELP! I AM REQUIRED TO                 WE HAD TO MIGRATE DATA TO NEW
          PRODUCE A DATA                      FORMATS AS NOT TO LOSE THEM. IT
         MANAGEMENT PLAN                            TOOK US MONTHS!!




      I WANT TO PUBLISH THE DATA AS AN
      ADDITIONAL RESOURCE FOR READERS
           OF MY        PUBLISHED
               BOOK/ARTICLE
                                                  TO SHARE OUR DATA WE HAD
                                                 TO PHYSICALLY TRANSPORT THE
                                                            SERVER



CLINICAL TRIALS DATA COLLECTED 30 YEARS AGO
 CAN BE USED TO IDENTIFY THE DAUGHTERS OF
  THOSE WOMAN WHO WERE ADMINISTERED A
DRUG THAT CAUSES CANCER IN THEIR DAUGHTERS    WE COLLECTED DATA AS PART OF AN
                                                      INTERNATIONAL
                                               COLLABORATION BUT WE DON’T
                                                KNOW WHO OWNS THE DATA?
Top requirements for services
Consultation with service units



                              •   Aiming to
                                  – Validate the researchers’
                                    requirements for services
                                  – Determine the data management
                                     services available to researchers in
                                     Oxford
                                  – Identify gaps in service provision
Findings



  •   Widespread expertise in data management and curation amongst service units in
      Oxford
  •   Support provided in ad-hoc basis but services not made explicit
  •   Overall, the majority of the services in the data management and curation framework
      are not offered fully or at all.
  •   There is a need for a university wide policy on data management and curation
Research data management and curation services
Embedding Institutional Data
Curation Services in Research (EIDCSR)
Where to start? A data management plan

     *With details about:
     •   the need for access to existing data sources
     •   the data to be produced by the research project
     •   the planned quality assurance and back-up procedures for data
     •   the plans for management and archiving of collected data
     •   any expected difficulties in making data available for secondary research
         (through data archiving) and measures to overcome such difficulties
     •   who holds copyright and Intellectual Property Rights of the data
     •   data management responsibility roles within the research team
         [Support from Departments’ IT or research facilitators or Research Services]


     * RELU Data Management Plans
File handling


     •   Use open file formats if possible (ODF, PNG, TIFF, JPEG)
          [Training providers (OUCS, OULS, departmental…)]
     •   Use a clear directory structure
     •   Name files consistently (http://mst.nerc.ac.uk/file_naming_conventions.html)
     •   Use version control tools
          [OUCS Subversion Repositories]
Collect metadata : “data about data”


     •   Different types
          – descriptive metadata : describing the intellectual content of the object
          Simple DC: Title/ Creator/Subject/Description/Publisher/Contributor/Date/
          Type/Format/ Identifier/Source/Language/Relation/Coverage/Rights
          – administrative metadata: information used to manage the object or
             control access to it.
          – structural metadata: information that ties each object to others.
          [OUCS Research Technology Service may be able to help]
          or the Digital Curation Centre (DCC)
Storage



     •   Check with your departmental IT
     •   Need a back-up strategy
          – How often/stored for how long/ who will be responsible?
          [Hierarchical File Server for back-up your files and long term storage]
          [OUCS Research Technology Service may be able to help]
     •   Ethics and confidentiality
          [Research Ethics Committee http://www.admin.ox.ac.uk/curec/]
          – http://www.data-archive.ac.uk/sharing/confidential.asp
Data sharing and long-term preservation

     •   Sharing through:
          – Papers, local repositories, national repositories or web tools
          – Informally at conferences, blogs or email
     •   Be aware of IP and copyright issues
          – [http://www.data-archive.ac.uk/sharing/copyright.asp]
          [ISI Innovation]
          [Legal Services can help]
     •   Long-term preservation and sharing at national data centres
          – UK Data Archive
          – NERC data centres
          – Archeological Data Service
          – European Bioinformatics Centre (EBI)
          – Many more like this at: http://tinyurl.com/globaldatarepo
Services available in Oxford


     •   ORA for research articles and other grey literature
     •   Hierarchical File Server for back-up your files
     •   OUCS Research Technology Service
     •   Departmental support through IT or research facilitators
     •   Departmental storage
     •   Legal Services
     •   Research Services
     •   Central University Research Ethics Committee
     •   Different training providers
Basic Data Management Principles


     1. Plan before producing data
     2. When possible choose right standards for open formats
     3. Document your data
     4. Store your data securely and always backup
     5. Use trusted repositories to deposit your data for sharing and long-term
        preservation
Other useful resources



     •   UK Data Archive Manage and Share guidelines
         – http://tinyurl.com/datamanage
     •   Research Data Management Services: Findings of the Consultation
         with Service Providers
         – http://tinyurl.com/Oxdataservices
     •   MIT Data Management and Publishing guide
         – http://tinyurl.com/qjz6ay

     • Australian National University data management planning
         – http://ilp.anu.edu.au/dm/
Thanks

Weitere ähnliche Inhalte

Was ist angesagt?

Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATTony Ross-Hellauer
 
RDM and DMP intro
RDM and DMP introRDM and DMP intro
RDM and DMP introSarah Jones
 
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
2017 05 03 Implementing Pure at UWA - ANDS Webinar SeriesKatina Toufexis
 
Research Data Curation _ Grad Humanities Class
Research Data Curation _ Grad Humanities ClassResearch Data Curation _ Grad Humanities Class
Research Data Curation _ Grad Humanities ClassAaron Collie
 
20160414 23 Research Data Things
20160414 23 Research Data Things20160414 23 Research Data Things
20160414 23 Research Data ThingsKatina Toufexis
 
DMP health sciences
DMP health sciencesDMP health sciences
DMP health sciencesSarah Jones
 
DataONE Education Module 10: Legal and Policy Issues
DataONE Education Module 10: Legal and Policy IssuesDataONE Education Module 10: Legal and Policy Issues
DataONE Education Module 10: Legal and Policy IssuesDataONE
 
Infrastructure, Standards, and Policies for Research Data Management
Infrastructure, Standards, and Policies for Research Data Management Infrastructure, Standards, and Policies for Research Data Management
Infrastructure, Standards, and Policies for Research Data Management Jian Qin
 
Supporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of StirlingSupporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of StirlingLisa Haddow
 
Some Ideas on Making Research Data: "It's the Metadata, stupid!"
Some Ideas on Making Research Data: "It's the Metadata, stupid!"Some Ideas on Making Research Data: "It's the Metadata, stupid!"
Some Ideas on Making Research Data: "It's the Metadata, stupid!"Anita de Waard
 
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourBeyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourKNOWeSCAPE2014
 
Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)robin fay
 

Was ist angesagt? (20)

BLC & Digital Science: Mark Hahnel, Figshare
BLC & Digital Science: Mark Hahnel, FigshareBLC & Digital Science: Mark Hahnel, Figshare
BLC & Digital Science: Mark Hahnel, Figshare
 
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
Introduction to Research Data Management - 2015-05-27 - Social Sciences Divis...
 
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDATResearch Data Management: An Introductory Webinar from OpenAIRE and EUDAT
Research Data Management: An Introductory Webinar from OpenAIRE and EUDAT
 
RDM and DMP intro
RDM and DMP introRDM and DMP intro
RDM and DMP intro
 
Smith - Developing Campus Stakeholders' Collaborations - Sept 8
Smith - Developing Campus Stakeholders' Collaborations - Sept 8Smith - Developing Campus Stakeholders' Collaborations - Sept 8
Smith - Developing Campus Stakeholders' Collaborations - Sept 8
 
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
2017 05 03 Implementing Pure at UWA - ANDS Webinar Series
 
Research Data Curation _ Grad Humanities Class
Research Data Curation _ Grad Humanities ClassResearch Data Curation _ Grad Humanities Class
Research Data Curation _ Grad Humanities Class
 
20160414 23 Research Data Things
20160414 23 Research Data Things20160414 23 Research Data Things
20160414 23 Research Data Things
 
DMP health sciences
DMP health sciencesDMP health sciences
DMP health sciences
 
Introduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD StudentsIntroduction to RDM for Geoscience PhD Students
Introduction to RDM for Geoscience PhD Students
 
DataONE Education Module 10: Legal and Policy Issues
DataONE Education Module 10: Legal and Policy IssuesDataONE Education Module 10: Legal and Policy Issues
DataONE Education Module 10: Legal and Policy Issues
 
What is-rdm
What is-rdmWhat is-rdm
What is-rdm
 
Infrastructure, Standards, and Policies for Research Data Management
Infrastructure, Standards, and Policies for Research Data Management Infrastructure, Standards, and Policies for Research Data Management
Infrastructure, Standards, and Policies for Research Data Management
 
Supporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of StirlingSupporting Research Data Management at the University of Stirling
Supporting Research Data Management at the University of Stirling
 
Writing a Research Data Management Plan - 2016-11-09 - University of Oxford
Writing a Research Data Management Plan - 2016-11-09 - University of OxfordWriting a Research Data Management Plan - 2016-11-09 - University of Oxford
Writing a Research Data Management Plan - 2016-11-09 - University of Oxford
 
Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
Preparing Your Research Material for the Future - 2016-11-16 - Humanities Div...
 
Some Ideas on Making Research Data: "It's the Metadata, stupid!"
Some Ideas on Making Research Data: "It's the Metadata, stupid!"Some Ideas on Making Research Data: "It's the Metadata, stupid!"
Some Ideas on Making Research Data: "It's the Metadata, stupid!"
 
Data Management Planning for Researchers - 2016-02-08 - University of Oxford
Data Management Planning for Researchers - 2016-02-08 - University of OxfordData Management Planning for Researchers - 2016-02-08 - University of Oxford
Data Management Planning for Researchers - 2016-02-08 - University of Oxford
 
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific EndeavourBeyond Meta-Data: Nano-Publications Recording Scientific Endeavour
Beyond Meta-Data: Nano-Publications Recording Scientific Endeavour
 
Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)Linked data presentation for libraries (COMO)
Linked data presentation for libraries (COMO)
 

Ähnlich wie Wiser2009 Luis Martinez

Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing dataSarah Jones
 
Managing and Sharing Research Data
Managing and Sharing Research DataManaging and Sharing Research Data
Managing and Sharing Research DataMartin Donnelly
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...ICPSR
 
Introduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshopIntroduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshopAaike De Wever
 
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECAProject
 
Ucla july 2018 natasha simons
Ucla july 2018 natasha simonsUcla july 2018 natasha simons
Ucla july 2018 natasha simonsARDC
 
Data Management for Postgraduate students by Lynn Woolfrey
Data Management for Postgraduate students by Lynn WoolfreyData Management for Postgraduate students by Lynn Woolfrey
Data Management for Postgraduate students by Lynn Woolfreypvhead123
 
Data Management and Horizon 2020
Data Management and Horizon 2020Data Management and Horizon 2020
Data Management and Horizon 2020Sarah Jones
 
What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...heila1
 
EMBL Australian Bioinformatics Resource AHM - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data CommonsEMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM - Data CommonsVivien Bonazzi
 

Ähnlich wie Wiser2009 Luis Martinez (20)

Managing and sharing data
Managing and sharing dataManaging and sharing data
Managing and sharing data
 
Managing and Sharing Research Data
Managing and Sharing Research DataManaging and Sharing Research Data
Managing and Sharing Research Data
 
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...Meeting Federal Research Requirements for Data Management Plans, Public Acces...
Meeting Federal Research Requirements for Data Management Plans, Public Acces...
 
Introduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshopIntroduction to Data Management Planning at Alien Challenge COST workshop
Introduction to Data Management Planning at Alien Challenge COST workshop
 
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
 
RDM & ELNs @ Edinburgh
RDM & ELNs @ EdinburghRDM & ELNs @ Edinburgh
RDM & ELNs @ Edinburgh
 
Intro to RDM
Intro to RDMIntro to RDM
Intro to RDM
 
Scholze liber 2015-06-25_final
Scholze liber 2015-06-25_finalScholze liber 2015-06-25_final
Scholze liber 2015-06-25_final
 
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
Introduction to Research Data Management - 2017-02-15 - MPLS Division, Univer...
 
Introduction to Research Data Management - 2016-02-03 - MPLS Division, Univer...
Introduction to Research Data Management - 2016-02-03 - MPLS Division, Univer...Introduction to Research Data Management - 2016-02-03 - MPLS Division, Univer...
Introduction to Research Data Management - 2016-02-03 - MPLS Division, Univer...
 
Open Science - Global Perspectives/Simon Hodson
Open Science - Global Perspectives/Simon HodsonOpen Science - Global Perspectives/Simon Hodson
Open Science - Global Perspectives/Simon Hodson
 
User engagement in research data curation
User engagement in research data curationUser engagement in research data curation
User engagement in research data curation
 
Ucla july 2018 natasha simons
Ucla july 2018 natasha simonsUcla july 2018 natasha simons
Ucla july 2018 natasha simons
 
Looking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ EdinburghLooking After Your Data: RDM @ Edinburgh
Looking After Your Data: RDM @ Edinburgh
 
Data Management for Postgraduate students by Lynn Woolfrey
Data Management for Postgraduate students by Lynn WoolfreyData Management for Postgraduate students by Lynn Woolfrey
Data Management for Postgraduate students by Lynn Woolfrey
 
Data Management and Horizon 2020
Data Management and Horizon 2020Data Management and Horizon 2020
Data Management and Horizon 2020
 
What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...What infrastructure is necessary for successful research data management (RDM...
What infrastructure is necessary for successful research data management (RDM...
 
EMBL Australian Bioinformatics Resource AHM - Data Commons
EMBL Australian Bioinformatics Resource AHM   - Data CommonsEMBL Australian Bioinformatics Resource AHM   - Data Commons
EMBL Australian Bioinformatics Resource AHM - Data Commons
 
Introduction to Research Data Management - 2014-01-27 - Social Sciences Divis...
Introduction to Research Data Management - 2014-01-27 - Social Sciences Divis...Introduction to Research Data Management - 2014-01-27 - Social Sciences Divis...
Introduction to Research Data Management - 2014-01-27 - Social Sciences Divis...
 
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLANINCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
INCLUSION OF DATA ARCHIVES IN DATA MANAGEMENT PLAN
 

Kürzlich hochgeladen

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 

Kürzlich hochgeladen (20)

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 

Wiser2009 Luis Martinez

  • 1. Manage your data: why and how? OULS WISER Trinity 2009 22 May 2009 Luis Martinez Uribe Luis.Martinez-Uribe@oerc.ox.ac.uk
  • 2. Summary • Background • What are research data? • What is research data management and curation? • Oxford activities • How to manage data and services available to researchers
  • 3. Background New tools and infrastructures available to researchers A key characteristic is the generation of digital research data
  • 4. How much data? A data deluge! “More digital data will be produce in the next 5 years than in whole human history” (Australian DEST ) 2007 is the “crossover year” where the amount of digital information is greater than the amount of available storage Source: “The Expanding Digital Universe: A forecast of Worldwide Information Growth through 2010” IDC Whitepaper, March 2007
  • 5. What are research data? http://www.flickr.com/photos/iscjorgegarcia/2359144636/*
  • 6. What are research data? “Research data is the evidence base on which academic researchers build their analytic or other work. It includes the widest possible range of data volumes from relatively small data sets up to vast data volumes generated by research in fields such as particle physics. It also includes great variety and heterogeneity of data and its accompanying metadata and documentation to make it usable and understood, or the digital representations and records for physical research data.” (UKRDS final report)
  • 8. From Dr David Shotton presentation
  • 11. http://www.beazley.ox.ac.uk/XDB/ASP/recordDetails.asp?recordCount=37&start=0 From Building a VRE for the Humanities poster http://www.flickr.com/photos/wrowlands/2270729405/ presented at All Hands Meeting 2007
  • 12. http://www.flickr.com/photos/piper/22584430/ http://www.flickr.com/photos/althouse/273160052/
  • 13. http://www.flickr.com/photos/thivierr/540241947/ EUROBAROMETER 69 PUBLIC OPINION IN THE EUROPEAN UNION FIRST RESULTS
  • 15. Research data management and curation • Takes from knowledge/information management • “…is understanding the current data needs and future ones” (US Department of Defence) • A means to an end • Not just technical infrastructure but also procedures and policies • Preservation http://www.youtube.com/watch?v=pbBa6Oam7-w • Digital Curation “maintaining and adding value to a trusted body of digital information for current and future use; it encompasses the active management of data throughout the information lifecycle” DCC Charter and Statement Principles
  • 16.
  • 17. Why? • Ensuring data quality and authenticity of research results • Not re-inventing the wheel - data collection can be expensive! • Better access to information (which in many cases is publicly funded) will produce high quality research • Future access (preservation) http://www.youtube.com/watch?v=pbBa6Oam7- w • Added value from data mining or combining datasets • and …
  • 18. Comply with requirements of funding agencies “the outputs from current and future research must be preserved and remain accessible for future generations” “expects research data generated as part of BBSRC support to be made available…data should be retain for a period of 10 years after completion of the project” “require that the applicants provide a data management and sharing plan as part of their application” “requires all grant holders to offer for deposit copies of data to the UK Data Archive” SHERPA JULIET SERVICE http://tinyurl.com/datapolicies
  • 19. Data management and curation activities in Oxford
  • 20. Scoping Digital Repository Services for Research Data Management
  • 22. Researcher’s data - the challenges I COULDN’T MAKE SENSE OF THE WHEN RESEARCHERS LEAVE THE DATA I COLLECTED FOR MY PhD 5 DEPARTMENT WE LOOSE ALL THE DATA YEARS AGO THEY CREATED HELP! I AM REQUIRED TO WE HAD TO MIGRATE DATA TO NEW PRODUCE A DATA FORMATS AS NOT TO LOSE THEM. IT MANAGEMENT PLAN TOOK US MONTHS!! I WANT TO PUBLISH THE DATA AS AN ADDITIONAL RESOURCE FOR READERS OF MY PUBLISHED BOOK/ARTICLE TO SHARE OUR DATA WE HAD TO PHYSICALLY TRANSPORT THE SERVER CLINICAL TRIALS DATA COLLECTED 30 YEARS AGO CAN BE USED TO IDENTIFY THE DAUGHTERS OF THOSE WOMAN WHO WERE ADMINISTERED A DRUG THAT CAUSES CANCER IN THEIR DAUGHTERS WE COLLECTED DATA AS PART OF AN INTERNATIONAL COLLABORATION BUT WE DON’T KNOW WHO OWNS THE DATA?
  • 24. Consultation with service units • Aiming to – Validate the researchers’ requirements for services – Determine the data management services available to researchers in Oxford – Identify gaps in service provision
  • 25. Findings • Widespread expertise in data management and curation amongst service units in Oxford • Support provided in ad-hoc basis but services not made explicit • Overall, the majority of the services in the data management and curation framework are not offered fully or at all. • There is a need for a university wide policy on data management and curation
  • 26. Research data management and curation services
  • 27. Embedding Institutional Data Curation Services in Research (EIDCSR)
  • 28. Where to start? A data management plan *With details about: • the need for access to existing data sources • the data to be produced by the research project • the planned quality assurance and back-up procedures for data • the plans for management and archiving of collected data • any expected difficulties in making data available for secondary research (through data archiving) and measures to overcome such difficulties • who holds copyright and Intellectual Property Rights of the data • data management responsibility roles within the research team [Support from Departments’ IT or research facilitators or Research Services] * RELU Data Management Plans
  • 29. File handling • Use open file formats if possible (ODF, PNG, TIFF, JPEG) [Training providers (OUCS, OULS, departmental…)] • Use a clear directory structure • Name files consistently (http://mst.nerc.ac.uk/file_naming_conventions.html) • Use version control tools [OUCS Subversion Repositories]
  • 30. Collect metadata : “data about data” • Different types – descriptive metadata : describing the intellectual content of the object Simple DC: Title/ Creator/Subject/Description/Publisher/Contributor/Date/ Type/Format/ Identifier/Source/Language/Relation/Coverage/Rights – administrative metadata: information used to manage the object or control access to it. – structural metadata: information that ties each object to others. [OUCS Research Technology Service may be able to help] or the Digital Curation Centre (DCC)
  • 31. Storage • Check with your departmental IT • Need a back-up strategy – How often/stored for how long/ who will be responsible? [Hierarchical File Server for back-up your files and long term storage] [OUCS Research Technology Service may be able to help] • Ethics and confidentiality [Research Ethics Committee http://www.admin.ox.ac.uk/curec/] – http://www.data-archive.ac.uk/sharing/confidential.asp
  • 32. Data sharing and long-term preservation • Sharing through: – Papers, local repositories, national repositories or web tools – Informally at conferences, blogs or email • Be aware of IP and copyright issues – [http://www.data-archive.ac.uk/sharing/copyright.asp] [ISI Innovation] [Legal Services can help] • Long-term preservation and sharing at national data centres – UK Data Archive – NERC data centres – Archeological Data Service – European Bioinformatics Centre (EBI) – Many more like this at: http://tinyurl.com/globaldatarepo
  • 33. Services available in Oxford • ORA for research articles and other grey literature • Hierarchical File Server for back-up your files • OUCS Research Technology Service • Departmental support through IT or research facilitators • Departmental storage • Legal Services • Research Services • Central University Research Ethics Committee • Different training providers
  • 34. Basic Data Management Principles 1. Plan before producing data 2. When possible choose right standards for open formats 3. Document your data 4. Store your data securely and always backup 5. Use trusted repositories to deposit your data for sharing and long-term preservation
  • 35. Other useful resources • UK Data Archive Manage and Share guidelines – http://tinyurl.com/datamanage • Research Data Management Services: Findings of the Consultation with Service Providers – http://tinyurl.com/Oxdataservices • MIT Data Management and Publishing guide – http://tinyurl.com/qjz6ay • Australian National University data management planning – http://ilp.anu.edu.au/dm/