SlideShare ist ein Scribd-Unternehmen logo
1 von 12
Organising and Documenting Data

Stuart Macdonald
EDINA & Data Library
stuart.macdonald@ed.ac.uk




RDM Academic Liaison Librarian Training, 15 November 2012
Organising your data
•RDM is one of the essential areas of responsible conduct of
research.

•Research data files and folders need to be organised in a
systematic way to be:
   • identifiable and accessible for yourself,
   • identifiable and accessible for colleagues, and for
      future users.

•Thus it is important to plan the organisation of your data
before a research project begins.

•Doing so will prevent any confusion while research is
underway or when multiple individuals will be editing and / or
analysing the data.
This can be achieved through:
•Directory structure & file naming conventions
•(File naming conventions for specific disciplines)
•File renaming
•File version control

For this to be successful a consistent and disciplined
approach is required.

Easier to accomplish as and when data files are generated
rather than retrospectively attempting to implement.

When organization methods become too time consuming,
consider automated methods.
File Naming conventions
•Naming datasets according to agreed conventions should
make file naming easier for colleagues because they will not
have to ‘re-think’ the process each time.
•File names should provide context for the contents of the file,
making it distinguishable from files with similar subjects or
different versions of the same file.
•Many files are used independently of their file or directory
structure, so provide sufficient description in the file name.
•Suggested strategies: identify the project; avoid special
characters; use underscores rather than spaces; include date
of creation or modification in a standard format (e.g.
YYYY_MM_DD or YYYYMMDD): use project number
•Be consistent! Avoid being cryptic!
Batch (or bulk) renaming
•   Software tools exist that can organise data files and
    folders in a consistent and automated way through
    batch renaming.

•   There are many situations where batch renaming may
    be useful, such as:

     – where images from digital cameras are automatically
       assigned filenames consisting of sequential numbers
     – where proprietary software or instrumentation generate
       crude, default or multiple filenames
     – where files are transferred from a system that supports
       spaces and/or non-English characters in filenames to one
       that doesn't (or vice versa). Batch renaming software can
       be used to substitute such characters with acceptable
       ones.
Benefits of consistent data file
labelling are:
•Data files are not accidentally overwritten or deleted

•Data files are distinguishable from each other within their
containing folder

•Data file naming prevents confusion when multiple people are
working on shared files

•Data files are easier to locate and browse

•Data files can be retrieved both by creator and by other users

•Data files can be sorted in logical sequence

•Different versions of data files can be identified

•If data files are moved to other storage platform their names will
retain useful context
Version Control
It is important to consistently identify and distinguish versions
of data files.

This ensures that a clear audit trail exists for tracking the
development of a data file and identifying earlier versions
especially if data is frequently updated by multiple users.

Suggested strategies:

•   Use a sequential numbered system: v1, v2, v3, etc.
•   Don't use confusing labels: revision, final, final2, etc.
•   Record all changes -- no matter how small
•   Discard obsolete versions (but never the raw copy)
•   Use auto-backup instead of self-archiving, if possible

The alternative is to use version control software. (Bazaar,
TortoiseSVN, SubVersion)
Documenting Data
There are many reasons why you need to document
your data:
•To help you remember the details later
•To help others understand your research
•Verify your findings
•Replicate your results
•Archive your data for access and re-use

Some examples of data documentation are:
•Laboratory notebooks
•Field notes
•Questionnaires
•SOPs
•Methodologies
Documenting Data
Laboratory or field notebooks, for example play an
important role in supporting claims relating to
intellectual property developed by University
researchers, and even defending claims against
scientific fraud.

Research data need to be documented at various
levels:

•Project level
   •   study background, methodologies, instruments,
       research hypothesis
•File or database level
   •   formats, relationships between files
•Variable or item level
   •   How variable was generated & label descriptions
Metadata – ‘data about data’
The difference between documentation and metadata is
that the first is meant to be read by humans and the
second implies computer-processing (though may also be
human-readable) to assist location and access to data
through search interfaces.
Three broad categories of metadata are:

•Descriptive - common fields such as title, author, abstract,
keywords which help users to discover online sources through
searching and browsing e.g. DC, MARC

•Administrative - preservation, rights management, and technical
metadata about formats.

•Structural - how different components of a set of associated
data relate to one another, such as a schema describing relations
between tables in a database.
Need for metadata
                 Metadata may not be
Public           required if you are
                 working alone on your
                 own computer, but
                 become crucial when data
Research         are shared online.
Community
                 Metadata help to place
                 your dataset in a broader
                 context, allowing those
Project          outside your institution,
                 discipline, or research
                 environment to
Researcher       understand how to
                 interpret your data.
THANK YOU!

Weitere ähnliche Inhalte

Was ist angesagt?

Repository Fringe 2016 - Survey Documentation and Analysis
Repository Fringe 2016 - Survey Documentation and AnalysisRepository Fringe 2016 - Survey Documentation and Analysis
Repository Fringe 2016 - Survey Documentation and AnalysisEDINA, University of Edinburgh
 
Research Data Management: Approaches to Institutional Policy
Research Data Management: Approaches to Institutional PolicyResearch Data Management: Approaches to Institutional Policy
Research Data Management: Approaches to Institutional PolicyRobin Rice
 
Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...EDINA, University of Edinburgh
 
Open Repositories and Interoperability Challenges in UK
Open Repositories and Interoperability Challenges in UKOpen Repositories and Interoperability Challenges in UK
Open Repositories and Interoperability Challenges in UKEDINA, University of Edinburgh
 
Research Data Management (RDM) Initiatives at the University of Edinburgh
Research Data Management (RDM) Initiatives at the University of EdinburghResearch Data Management (RDM) Initiatives at the University of Edinburgh
Research Data Management (RDM) Initiatives at the University of EdinburghEDINA, University of Edinburgh
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...EDINA, University of Edinburgh
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Historic Environment Scotland
 
JISC Managing Research Data: Liaison Librarian Training
JISC Managing Research Data: Liaison Librarian Training JISC Managing Research Data: Liaison Librarian Training
JISC Managing Research Data: Liaison Librarian Training EDINA, University of Edinburgh
 
Recommendation to the EU Hearing on Access to and Preservation of Scientific ...
Recommendation to the EU Hearing on Access to and Preservation of Scientific ...Recommendation to the EU Hearing on Access to and Preservation of Scientific ...
Recommendation to the EU Hearing on Access to and Preservation of Scientific ...EDINA, University of Edinburgh
 
Open Data and Institutional Repositories
Open Data and Institutional RepositoriesOpen Data and Institutional Repositories
Open Data and Institutional RepositoriesRobin Rice
 
Practical and Conceptual Considerations of Research Object Preservation
Practical and Conceptual Considerations of Research Object PreservationPractical and Conceptual Considerations of Research Object Preservation
Practical and Conceptual Considerations of Research Object PreservationSEAD
 
ESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsSEAD
 
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...ASIS&T
 
RDAP14: DataNet Federal Consortium Update
RDAP14: DataNet Federal Consortium Update RDAP14: DataNet Federal Consortium Update
RDAP14: DataNet Federal Consortium Update ASIS&T
 
Data curation and preservation: the Digital Curation Centre
Data curation and preservation: the Digital Curation CentreData curation and preservation: the Digital Curation Centre
Data curation and preservation: the Digital Curation CentreMichael Day
 
EPSRC research data expectations and research software management
EPSRC research data expectations and research software managementEPSRC research data expectations and research software management
EPSRC research data expectations and research software managementHistoric Environment Scotland
 

Was ist angesagt? (20)

Repository Fringe 2016 - Survey Documentation and Analysis
Repository Fringe 2016 - Survey Documentation and AnalysisRepository Fringe 2016 - Survey Documentation and Analysis
Repository Fringe 2016 - Survey Documentation and Analysis
 
Research Data Management: Approaches to Institutional Policy
Research Data Management: Approaches to Institutional PolicyResearch Data Management: Approaches to Institutional Policy
Research Data Management: Approaches to Institutional Policy
 
Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...Addressing Institutional Research Data Management - University of Edinburgh R...
Addressing Institutional Research Data Management - University of Edinburgh R...
 
RDM for trainee physicians
RDM for trainee physiciansRDM for trainee physicians
RDM for trainee physicians
 
Open Repositories and Interoperability Challenges in UK
Open Repositories and Interoperability Challenges in UKOpen Repositories and Interoperability Challenges in UK
Open Repositories and Interoperability Challenges in UK
 
Research Data Management (RDM) Initiatives at the University of Edinburgh
Research Data Management (RDM) Initiatives at the University of EdinburghResearch Data Management (RDM) Initiatives at the University of Edinburgh
Research Data Management (RDM) Initiatives at the University of Edinburgh
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...
 
JISC Managing Research Data: Liaison Librarian Training
JISC Managing Research Data: Liaison Librarian Training JISC Managing Research Data: Liaison Librarian Training
JISC Managing Research Data: Liaison Librarian Training
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Recommendation to the EU Hearing on Access to and Preservation of Scientific ...
Recommendation to the EU Hearing on Access to and Preservation of Scientific ...Recommendation to the EU Hearing on Access to and Preservation of Scientific ...
Recommendation to the EU Hearing on Access to and Preservation of Scientific ...
 
Open Data and Institutional Repositories
Open Data and Institutional RepositoriesOpen Data and Institutional Repositories
Open Data and Institutional Repositories
 
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Practical and Conceptual Considerations of Research Object Preservation
Practical and Conceptual Considerations of Research Object PreservationPractical and Conceptual Considerations of Research Object Preservation
Practical and Conceptual Considerations of Research Object Preservation
 
DSpace for Data Revisited
DSpace for Data RevisitedDSpace for Data Revisited
DSpace for Data Revisited
 
ESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and ToolsESA14 Workshop on SEAD's Data Services and Tools
ESA14 Workshop on SEAD's Data Services and Tools
 
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
RDAP14 Poster: openICPSR: a public access repository for storing and sharing ...
 
RDAP14: DataNet Federal Consortium Update
RDAP14: DataNet Federal Consortium Update RDAP14: DataNet Federal Consortium Update
RDAP14: DataNet Federal Consortium Update
 
Data curation and preservation: the Digital Curation Centre
Data curation and preservation: the Digital Curation CentreData curation and preservation: the Digital Curation Centre
Data curation and preservation: the Digital Curation Centre
 
EPSRC research data expectations and research software management
EPSRC research data expectations and research software managementEPSRC research data expectations and research software management
EPSRC research data expectations and research software management
 

Andere mochten auch

Map Styling Tools and Interactive maps on the web with OpenLayers - Addy Pope...
Map Styling Tools and Interactive maps on the web with OpenLayers - Addy Pope...Map Styling Tools and Interactive maps on the web with OpenLayers - Addy Pope...
Map Styling Tools and Interactive maps on the web with OpenLayers - Addy Pope...JISC GECO
 
Web Services Tiered Internet Authorization (WSTIERIA)
Web Services Tiered Internet Authorization (WSTIERIA)Web Services Tiered Internet Authorization (WSTIERIA)
Web Services Tiered Internet Authorization (WSTIERIA)EDINA, University of Edinburgh
 
Oxford University Geospatial Metadata Workshop 20110415
Oxford University Geospatial Metadata Workshop 20110415Oxford University Geospatial Metadata Workshop 20110415
Oxford University Geospatial Metadata Workshop 20110415EDINA, University of Edinburgh
 
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...EDINA, University of Edinburgh
 
Ensuring the Integrity (& Continuity) of Our Record of Scholarship
Ensuring the Integrity (& Continuity) of Our Record of ScholarshipEnsuring the Integrity (& Continuity) of Our Record of Scholarship
Ensuring the Integrity (& Continuity) of Our Record of ScholarshipEDINA, University of Edinburgh
 
Digital maps: past, present; on your desktop and in the palm of your hand
Digital maps: past, present; on your desktop and in the palm of your handDigital maps: past, present; on your desktop and in the palm of your hand
Digital maps: past, present; on your desktop and in the palm of your handEDINA, University of Edinburgh
 
Leeds University Geospatial Metadata Workshop 20110617
Leeds University Geospatial Metadata Workshop 20110617Leeds University Geospatial Metadata Workshop 20110617
Leeds University Geospatial Metadata Workshop 20110617EDINA, University of Edinburgh
 
PEPRS and the Keepers Registry
PEPRS and the Keepers RegistryPEPRS and the Keepers Registry
PEPRS and the Keepers RegistrySUNCAT
 
Using Social Media to Develop Your Academic Profile and Engage Others in Your...
Using Social Media to Develop Your Academic Profile and Engage Others in Your...Using Social Media to Develop Your Academic Profile and Engage Others in Your...
Using Social Media to Develop Your Academic Profile and Engage Others in Your...EDINA, University of Edinburgh
 
Report on EDINA Authentication Related Academic Sector Activities
Report on EDINA Authentication Related Academic Sector ActivitiesReport on EDINA Authentication Related Academic Sector Activities
Report on EDINA Authentication Related Academic Sector ActivitiesEDINA, University of Edinburgh
 

Andere mochten auch (20)

Introduction to RDM for trainee physicians
Introduction to RDM for trainee physiciansIntroduction to RDM for trainee physicians
Introduction to RDM for trainee physicians
 
Map Styling Tools and Interactive maps on the web with OpenLayers - Addy Pope...
Map Styling Tools and Interactive maps on the web with OpenLayers - Addy Pope...Map Styling Tools and Interactive maps on the web with OpenLayers - Addy Pope...
Map Styling Tools and Interactive maps on the web with OpenLayers - Addy Pope...
 
Digimap for Schools for Primary Schools
Digimap for Schools for Primary SchoolsDigimap for Schools for Primary Schools
Digimap for Schools for Primary Schools
 
Web Services Tiered Internet Authorization (WSTIERIA)
Web Services Tiered Internet Authorization (WSTIERIA)Web Services Tiered Internet Authorization (WSTIERIA)
Web Services Tiered Internet Authorization (WSTIERIA)
 
MANTRA for Change
MANTRA for ChangeMANTRA for Change
MANTRA for Change
 
Oxford University Geospatial Metadata Workshop 20110415
Oxford University Geospatial Metadata Workshop 20110415Oxford University Geospatial Metadata Workshop 20110415
Oxford University Geospatial Metadata Workshop 20110415
 
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
Hiberlink: Prototypes of pro-active approaches to support the archiving of we...
 
Ensuring the Integrity (& Continuity) of Our Record of Scholarship
Ensuring the Integrity (& Continuity) of Our Record of ScholarshipEnsuring the Integrity (& Continuity) of Our Record of Scholarship
Ensuring the Integrity (& Continuity) of Our Record of Scholarship
 
Tales from the Keepers Registry
Tales from the Keepers RegistryTales from the Keepers Registry
Tales from the Keepers Registry
 
Digital maps: past, present; on your desktop and in the palm of your hand
Digital maps: past, present; on your desktop and in the palm of your handDigital maps: past, present; on your desktop and in the palm of your hand
Digital maps: past, present; on your desktop and in the palm of your hand
 
Agile Data Access Initiative
Agile Data Access InitiativeAgile Data Access Initiative
Agile Data Access Initiative
 
UK RepositoryNet+ Mimas Workshop
UK RepositoryNet+ Mimas WorkshopUK RepositoryNet+ Mimas Workshop
UK RepositoryNet+ Mimas Workshop
 
Leeds University Geospatial Metadata Workshop 20110617
Leeds University Geospatial Metadata Workshop 20110617Leeds University Geospatial Metadata Workshop 20110617
Leeds University Geospatial Metadata Workshop 20110617
 
IASSIST Latin Engagement Strategic Action Group
IASSIST Latin Engagement Strategic Action GroupIASSIST Latin Engagement Strategic Action Group
IASSIST Latin Engagement Strategic Action Group
 
A Importância da IDE-a no Reino-Unido
A Importância da IDE-a no Reino-UnidoA Importância da IDE-a no Reino-Unido
A Importância da IDE-a no Reino-Unido
 
PEPRS and the Keepers Registry
PEPRS and the Keepers RegistryPEPRS and the Keepers Registry
PEPRS and the Keepers Registry
 
Using Social Media to Develop Your Academic Profile and Engage Others in Your...
Using Social Media to Develop Your Academic Profile and Engage Others in Your...Using Social Media to Develop Your Academic Profile and Engage Others in Your...
Using Social Media to Develop Your Academic Profile and Engage Others in Your...
 
AddressingHistory: crowdsourcing the past
AddressingHistory: crowdsourcing the pastAddressingHistory: crowdsourcing the past
AddressingHistory: crowdsourcing the past
 
Report on EDINA Authentication Related Academic Sector Activities
Report on EDINA Authentication Related Academic Sector ActivitiesReport on EDINA Authentication Related Academic Sector Activities
Report on EDINA Authentication Related Academic Sector Activities
 
UKLA Content Development
UKLA Content DevelopmentUKLA Content Development
UKLA Content Development
 

Ähnlich wie Organising and Documenting Data

Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersRebekah Cummings
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...Projeto RCAAP
 
Data Management for Undergraduate Researchers (updated - 02/2016)
Data Management for Undergraduate Researchers (updated - 02/2016)Data Management for Undergraduate Researchers (updated - 02/2016)
Data Management for Undergraduate Researchers (updated - 02/2016)Rebekah Cummings
 
Data Management for Graduate Students
Data Management for Graduate StudentsData Management for Graduate Students
Data Management for Graduate StudentsRebekah Cummings
 
Lec20.pptx introduction to data bases and information systems
Lec20.pptx introduction to data bases and information systemsLec20.pptx introduction to data bases and information systems
Lec20.pptx introduction to data bases and information systemssamiullahamjad06
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016 Rebecca Raworth, MLIS
 
Research data management workshop April 2016
Research data management workshop April 2016Research data management workshop April 2016
Research data management workshop April 2016Rebecca Raworth, MLIS
 
DATA RESOURCE MANAGEMENT
DATA RESOURCE MANAGEMENT DATA RESOURCE MANAGEMENT
DATA RESOURCE MANAGEMENT huma sh
 
Data Analytics: HDFS with Big Data : Issues and Application
Data Analytics:  HDFS  with  Big Data :  Issues and ApplicationData Analytics:  HDFS  with  Big Data :  Issues and Application
Data Analytics: HDFS with Big Data : Issues and ApplicationDr. Chitra Dhawale
 
DBMS introduction
DBMS introductionDBMS introduction
DBMS introductionBHARATH KUMAR
 
Database management system lecture notes
Database management system lecture notesDatabase management system lecture notes
Database management system lecture notesUTSAHSINGH2
 
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...Jenn Riley
 
Lecture-1.ppt
Lecture-1.pptLecture-1.ppt
Lecture-1.pptChSheraz3
 
Database Systems Lec 1.pptx
Database Systems Lec 1.pptxDatabase Systems Lec 1.pptx
Database Systems Lec 1.pptxNishaTariq1
 
Database part1-
Database part1-Database part1-
Database part1-Taymoor Nazmy
 
What is-rdm
What is-rdmWhat is-rdm
What is-rdmSarah Jones
 
Data Archiving and Sharing
Data Archiving and SharingData Archiving and Sharing
Data Archiving and SharingC. Tobin Magle
 

Ähnlich wie Organising and Documenting Data (20)

Data Management for Undergraduate Researchers
Data Management for Undergraduate ResearchersData Management for Undergraduate Researchers
Data Management for Undergraduate Researchers
 
The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...The state of global research data initiatives: observations from a life on th...
The state of global research data initiatives: observations from a life on th...
 
Data Management for Undergraduate Researchers (updated - 02/2016)
Data Management for Undergraduate Researchers (updated - 02/2016)Data Management for Undergraduate Researchers (updated - 02/2016)
Data Management for Undergraduate Researchers (updated - 02/2016)
 
Data Management for Graduate Students
Data Management for Graduate StudentsData Management for Graduate Students
Data Management for Graduate Students
 
Lec20.pptx introduction to data bases and information systems
Lec20.pptx introduction to data bases and information systemsLec20.pptx introduction to data bases and information systems
Lec20.pptx introduction to data bases and information systems
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016
 
Research data management workshop April 2016
Research data management workshop April 2016Research data management workshop April 2016
Research data management workshop April 2016
 
DATA RESOURCE MANAGEMENT
DATA RESOURCE MANAGEMENT DATA RESOURCE MANAGEMENT
DATA RESOURCE MANAGEMENT
 
Data Analytics: HDFS with Big Data : Issues and Application
Data Analytics:  HDFS  with  Big Data :  Issues and ApplicationData Analytics:  HDFS  with  Big Data :  Issues and Application
Data Analytics: HDFS with Big Data : Issues and Application
 
Data management
Data management Data management
Data management
 
DBMS introduction
DBMS introductionDBMS introduction
DBMS introduction
 
Database management system lecture notes
Database management system lecture notesDatabase management system lecture notes
Database management system lecture notes
 
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
 
Good Practice in Research Data Management
Good Practice in Research Data ManagementGood Practice in Research Data Management
Good Practice in Research Data Management
 
Lecture-1.ppt
Lecture-1.pptLecture-1.ppt
Lecture-1.ppt
 
Database Systems Lec 1.pptx
Database Systems Lec 1.pptxDatabase Systems Lec 1.pptx
Database Systems Lec 1.pptx
 
Database part1-
Database part1-Database part1-
Database part1-
 
What is-rdm
What is-rdmWhat is-rdm
What is-rdm
 
Data Archiving and Sharing
Data Archiving and SharingData Archiving and Sharing
Data Archiving and Sharing
 
Dbms mca-section a
Dbms mca-section aDbms mca-section a
Dbms mca-section a
 

Mehr von EDINA, University of Edinburgh

We have the technology... We have the data... What next?
We have the technology... We have the data... What next?We have the technology... We have the data... What next?
We have the technology... We have the data... What next?EDINA, University of Edinburgh
 
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...EDINA, University of Edinburgh
 
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...EDINA, University of Edinburgh
 
Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...EDINA, University of Edinburgh
 
Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...EDINA, University of Edinburgh
 
Enhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola OsborneEnhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola OsborneEDINA, University of Edinburgh
 
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola OsborneSocial Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola OsborneEDINA, University of Edinburgh
 
Best Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola OsborneBest Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola OsborneEDINA, University of Edinburgh
 
SCURL and SUNCAT serials holdings comparison service
SCURL and SUNCAT serials holdings comparison serviceSCURL and SUNCAT serials holdings comparison service
SCURL and SUNCAT serials holdings comparison serviceEDINA, University of Edinburgh
 
Introduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data servicesIntroduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data servicesEDINA, University of Edinburgh
 
Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...EDINA, University of Edinburgh
 

Mehr von EDINA, University of Edinburgh (20)

The Making of the English Landscape:
The Making of the English Landscape: The Making of the English Landscape:
The Making of the English Landscape:
 
Spatial Data, Spatial Humanities
Spatial Data, Spatial HumanitiesSpatial Data, Spatial Humanities
Spatial Data, Spatial Humanities
 
Land Cover Map 2015
Land Cover Map 2015Land Cover Map 2015
Land Cover Map 2015
 
We have the technology... We have the data... What next?
We have the technology... We have the data... What next?We have the technology... We have the data... What next?
We have the technology... We have the data... What next?
 
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
Reference Rot in Theses: A HiberActive Pilot - 10x10 session for Repository F...
 
GeoForum EDINA report 2017
GeoForum EDINA report 2017GeoForum EDINA report 2017
GeoForum EDINA report 2017
 
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
If I Googled You, What Would I Find? Managing your digital footprint - Nicola...
 
Moray housemarch2017
Moray housemarch2017Moray housemarch2017
Moray housemarch2017
 
Uniof stirlingmarch2017secondary
Uniof stirlingmarch2017secondaryUniof stirlingmarch2017secondary
Uniof stirlingmarch2017secondary
 
Uniof glasgow jan2017_secondary
Uniof glasgow jan2017_secondaryUniof glasgow jan2017_secondary
Uniof glasgow jan2017_secondary
 
Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...Managing your Digital Footprint : Taking control of the metadata and tracks a...
Managing your Digital Footprint : Taking control of the metadata and tracks a...
 
Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...Social media and blogging to develop and communicate research in the arts and...
Social media and blogging to develop and communicate research in the arts and...
 
Enhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola OsborneEnhancing your research impact through social media - Nicola Osborne
Enhancing your research impact through social media - Nicola Osborne
 
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola OsborneSocial Media in Marketing in Support of Your Personal Brand - Nicola Osborne
Social Media in Marketing in Support of Your Personal Brand - Nicola Osborne
 
Best Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola OsborneBest Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
Best Practice for Social Media in Teaching & Learning Contexts - Nicola Osborne
 
SCURL and SUNCAT serials holdings comparison service
SCURL and SUNCAT serials holdings comparison serviceSCURL and SUNCAT serials holdings comparison service
SCURL and SUNCAT serials holdings comparison service
 
Big data in Digimap
Big data in DigimapBig data in Digimap
Big data in Digimap
 
Introduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data servicesIntroduction to Edinburgh University Data Library and national data services
Introduction to Edinburgh University Data Library and national data services
 
Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...Digimap for Schools: Introduction to an ICT based cross curricular resource f...
Digimap for Schools: Introduction to an ICT based cross curricular resource f...
 
Digimap Update - Geoforum 2016 - Guy McGarva
Digimap Update - Geoforum 2016 - Guy McGarvaDigimap Update - Geoforum 2016 - Guy McGarva
Digimap Update - Geoforum 2016 - Guy McGarva
 

KĂźrzlich hochgeladen

Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxPoojaSen20
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfMr Bounab Samir
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONHumphrey A BeĂąa
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxMaryGraceBautista27
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17Celine George
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptxiammrhaywood
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfSpandanaRallapalli
 

KĂźrzlich hochgeladen (20)

Culture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptxCulture Uniformity or Diversity IN SOCIOLOGY.pptx
Culture Uniformity or Diversity IN SOCIOLOGY.pptx
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdfLike-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
Like-prefer-love -hate+verb+ing & silent letters & citizenship text.pdf
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATIONTHEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
THEORIES OF ORGANIZATION-PUBLIC ADMINISTRATION
 
Science 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptxScience 7 Quarter 4 Module 2: Natural Resources.pptx
Science 7 Quarter 4 Module 2: Natural Resources.pptx
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17How to Add Barcode on PDF Report in Odoo 17
How to Add Barcode on PDF Report in Odoo 17
 
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptxFINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
FINALS_OF_LEFT_ON_C'N_EL_DORADO_2024.pptx
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
Raw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptxRaw materials used in Herbal Cosmetics.pptx
Raw materials used in Herbal Cosmetics.pptx
 
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptxAUDIENCE THEORY -CULTIVATION THEORY -  GERBNER.pptx
AUDIENCE THEORY -CULTIVATION THEORY - GERBNER.pptx
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
ACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdfACC 2024 Chronicles. Cardiology. Exam.pdf
ACC 2024 Chronicles. Cardiology. Exam.pdf
 

Organising and Documenting Data

  • 1. Organising and Documenting Data Stuart Macdonald EDINA & Data Library stuart.macdonald@ed.ac.uk RDM Academic Liaison Librarian Training, 15 November 2012
  • 2. Organising your data •RDM is one of the essential areas of responsible conduct of research. •Research data files and folders need to be organised in a systematic way to be: • identifiable and accessible for yourself, • identifiable and accessible for colleagues, and for future users. •Thus it is important to plan the organisation of your data before a research project begins. •Doing so will prevent any confusion while research is underway or when multiple individuals will be editing and / or analysing the data.
  • 3. This can be achieved through: •Directory structure & file naming conventions •(File naming conventions for specific disciplines) •File renaming •File version control For this to be successful a consistent and disciplined approach is required. Easier to accomplish as and when data files are generated rather than retrospectively attempting to implement. When organization methods become too time consuming, consider automated methods.
  • 4. File Naming conventions •Naming datasets according to agreed conventions should make file naming easier for colleagues because they will not have to ‘re-think’ the process each time. •File names should provide context for the contents of the file, making it distinguishable from files with similar subjects or different versions of the same file. •Many files are used independently of their file or directory structure, so provide sufficient description in the file name. •Suggested strategies: identify the project; avoid special characters; use underscores rather than spaces; include date of creation or modification in a standard format (e.g. YYYY_MM_DD or YYYYMMDD): use project number •Be consistent! Avoid being cryptic!
  • 5. Batch (or bulk) renaming • Software tools exist that can organise data files and folders in a consistent and automated way through batch renaming. • There are many situations where batch renaming may be useful, such as: – where images from digital cameras are automatically assigned filenames consisting of sequential numbers – where proprietary software or instrumentation generate crude, default or multiple filenames – where files are transferred from a system that supports spaces and/or non-English characters in filenames to one that doesn't (or vice versa). Batch renaming software can be used to substitute such characters with acceptable ones.
  • 6. Benefits of consistent data file labelling are: •Data files are not accidentally overwritten or deleted •Data files are distinguishable from each other within their containing folder •Data file naming prevents confusion when multiple people are working on shared files •Data files are easier to locate and browse •Data files can be retrieved both by creator and by other users •Data files can be sorted in logical sequence •Different versions of data files can be identified •If data files are moved to other storage platform their names will retain useful context
  • 7. Version Control It is important to consistently identify and distinguish versions of data files. This ensures that a clear audit trail exists for tracking the development of a data file and identifying earlier versions especially if data is frequently updated by multiple users. Suggested strategies: • Use a sequential numbered system: v1, v2, v3, etc. • Don't use confusing labels: revision, final, final2, etc. • Record all changes -- no matter how small • Discard obsolete versions (but never the raw copy) • Use auto-backup instead of self-archiving, if possible The alternative is to use version control software. (Bazaar, TortoiseSVN, SubVersion)
  • 8. Documenting Data There are many reasons why you need to document your data: •To help you remember the details later •To help others understand your research •Verify your findings •Replicate your results •Archive your data for access and re-use Some examples of data documentation are: •Laboratory notebooks •Field notes •Questionnaires •SOPs •Methodologies
  • 9. Documenting Data Laboratory or field notebooks, for example play an important role in supporting claims relating to intellectual property developed by University researchers, and even defending claims against scientific fraud. Research data need to be documented at various levels: •Project level • study background, methodologies, instruments, research hypothesis •File or database level • formats, relationships between files •Variable or item level • How variable was generated & label descriptions
  • 10. Metadata – ‘data about data’ The difference between documentation and metadata is that the first is meant to be read by humans and the second implies computer-processing (though may also be human-readable) to assist location and access to data through search interfaces. Three broad categories of metadata are: •Descriptive - common fields such as title, author, abstract, keywords which help users to discover online sources through searching and browsing e.g. DC, MARC •Administrative - preservation, rights management, and technical metadata about formats. •Structural - how different components of a set of associated data relate to one another, such as a schema describing relations between tables in a database.
  • 11. Need for metadata Metadata may not be Public required if you are working alone on your own computer, but become crucial when data Research are shared online. Community Metadata help to place your dataset in a broader context, allowing those Project outside your institution, discipline, or research environment to Researcher understand how to interpret your data.

Hinweis der Redaktion

  1. Is there a filenaming convention for your specific discipline (e.g. The Open Biological and Biomedical Ontologies, DOE’s Atmospheric Radiation measurement (ARM) program )
  2. For qualitative data or small-scale surveys, the documentation might exist only in your head. Take the time to write it down while it is fresh in your mind. This may include writing methodology reports, creating codebooks with full variable and value labels, documenting decisions about software, tracking changes to different versions of the dataset, recording assumptions made during analysis.
  3. METS, a Digital Library Federation initiative, attempts to build upon the work of MOA2 and provide an XML document format for encoding metadata necessary for both management of digital library objects within a repository and exchange of such objects between repositories (or between repositories and their users) Administrative metadata provides information necessary to allow a repository to manage objects, such as when, how and by whom a resource was created and how it can be accessed. Provenance and licensing