SlideShare ist ein Scribd-Unternehmen logo
1 von 26
CSC – Suomalainen tutkimuksen, koulutuksen ja julkishallinnon ICT-osaamiskeskus
Get the most out of your data! Data
publication, tracking and citation
Jessica PvE 28.2.2018
Working with data: amount of work
28.2.20182
Data Science Report 2016 http://visit.crowdflower.com/rs/416-ZBE-
142/images/CrowdFlower_DataScienceReport_2016.pdf
Monya Baker: 1,500 scientists lift the lid on reproducibility.
Survey sheds light on the ‘crisis’ rocking research. Nature 533,
2016. doi:10.1038/533452a
Data citation
• Project by Finnish Committee for Research
• Research data should be FAIR
• Data should have: Creator, title, publisher, publication
time, identifier
• Recommended additional information: Version,
resource type, copyright status
28.2.20184
28.2.20185
FCRD
https://www.fcrd.fi/data-citation/
• Include principles of data as evidence
and data transparency in next version
of Finnish RCR guideline byTENK.
• Recognise data authorship as a
distinct issue and discussion in the
TENK authorship guideline
http://www.tenk.fi/sites/tenk.fi/files/TENK_suositus_tek
ijyys.pdf
• Create a multi-institutional,
multidisciplinary working group to
define principles for defining data
authorship, coordinated f.e. byTENK,
OR assign national representation to
a relevant international activity with
the same goal.
• Organize multidisciplinary
discussion on data management
and citation, with the aim of
creating interoperable practices.
• Promote the use of data reference
model also when referring to
authors own primary source data.
• Define field specific level of
granularity for data citation.
Recommendations
6
• data licensing
• metadata
standards
• open data
formats
• shared
vocabularies
• informed decisions
to share data
• access rights
• descriptive
metadata
• persistent
identifiers
Findable Accessible
ReusableInter-operable
force11.org/group/fairgroup/fairprinciples
FAIR
Findable
Accessible
Interoperable
Reusable
By Martin Grandjean - Own work : http://www.martingrandjean.ch/wp-content/uploads/2013/10/Graphe3.png, CC BY-SA 3.0,
https://commons.wikimedia.org/w/index.php?curid=29364647
1.(Meta)data are assigned a globally
unique and persistent identifier
2. Data are described with rich metadata
3. Metadata clearly and explicitly include
the identifier of the data it describes
4. (Meta)data are registered or indexed in
a searchable resource
URN
1.(Meta)data are retrievable by their
identifier using a standardized
communications protocol
2.Metadata are accessible, even when
the data are no longer available
1.(Meta)data use a formal, accessible,
shared, and broadly applicable language
for knowledge representation.
2.(Meta)data use vocabularies that
follow FAIR principles
3. (Meta)data include qualified
references to other (meta)data
Meta(data) are richly described with a
plurality of accurate and relevant
attributes
Persistent Identifiers (DOI, URN)
Data Catalog
Resolver
PID
Data file
License
Configuration
files
Read me
Landing page
Nano publications, linking data and compact identifiers
• https://www.go-fair.org/
• http://identifiers.org/
28.2.201814
Schofield et al (2015) http://digitalhumanities.org:8081/dhq/vol/9/3/000227/000227.html
Rutgers lib guide
Alan O’Rourke CC-BY
audiencestack.com
Organising your data
• Sort and classify your information
• For instance: don’t mix different types of information in excel
columns: it is usually easier to combine datasets than sort out
ill structured data later
• Think about granularity (file size) and metadata
• Decide on formats, units, codes etc and be consequent
28.2.201817
Organising your data
• Write a code book, document
• Think about intelligibility
• Be careful when rearranging, reformatting, sorting or copy-
pasting data
• Use common file formats, preferably open
• Have processes in place for checking the data quality and
completeness
• Be clear about master copies and other copies
28.2.201818
Organising your data
• Be careful and plan well for sensitive data and anonymization
• Think about security and access rights
• Plan and agree on which versions of a dataset will be archived
and/or published
• Think about reproducibility and citing data
28.2.201819
Files and folders
• Create and agree on a system for naming files and folders and
be consequent
• Avoid very deep folder structures, since they can be difficult to
handle
• Use meaningful, unique file and folder names
• Keep names as short as possible but relevant. 25 characters is
usually considered maximum.
28.2.201820
Files and folders
• Dates inYYYY-MM-DD format allows you to sort and search
your files
• Avoid using special characters such as % & /  : ; * . ? < > ^! “ ()
and Scandinavians
• Use three digits (or 4 if you have a large number of files) i.e.
001, 002…….201, 202 (not 1, 2, 21).
• Use underscores (_) instead of spaces
28.2.201821
Files and folders
• If using a personal name in the name give the surname first
followed by first name.
• Though, be very careful with personal data when naming files
and folders
• Indicate version number by using ‘V’ or “version” and number
(and subversions with more digits if minor changes)
28.2.201822
Types of data in research
Open Data Generic Research Data Fixed Research
Datasets
Description • Public data for any
use, commercial
• May be dynamic
• PSI
• Mature data
products
• Not produced
/monitored by
researchers
• Ex. Weather data,
public transport
• Produced
by/with/for
researchers
• Validated, good
quality
• Well documented
• Raw or processed
• Datasets
stable/version
controlled
• Ex. Corpus, SMEAR
• Produced for
specific research
question
• Underpins
research, for
replication
• Might be very
processed
• Reuse value
might be low
unless mature
field
Format Stable, standardised May vary over time Must be preserved
according to plan
28.2.201823
License and publish your data
• FSDTietoarkisto
• Language Bank of Finland
• Figshare
• Zenodo
• EUDAT
• IDA and Etsin
28.2.201824
pexels
https://www.facebook.com/CSCfi
https://twitter.com/CSCfi
https://www.youtube.com/c/CSCfi
https://www.linkedin.com/company/csc---it-center-for-science
Thank you!
Jessica Parland-von Essen
Parland (at) csc.fi
https://orcid.org/0000-0003-4460-3906
28.2.201826

Weitere ähnliche Inhalte

Was ist angesagt?

A basic course on Reseach data management, part 2: protecting and organizing ...
A basic course on Reseach data management, part 2: protecting and organizing ...A basic course on Reseach data management, part 2: protecting and organizing ...
A basic course on Reseach data management, part 2: protecting and organizing ...Leon Osinski
 
A basic course on Research data management, part 3: sharing your data
A basic course on Research data management, part 3: sharing your dataA basic course on Research data management, part 3: sharing your data
A basic course on Research data management, part 3: sharing your dataLeon Osinski
 
EDI Training Module 4: Organizing Data Into Publishable Units
EDI Training Module 4: Organizing Data Into Publishable UnitsEDI Training Module 4: Organizing Data Into Publishable Units
EDI Training Module 4: Organizing Data Into Publishable UnitsEnvironmental Data Initiative
 
DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?DataONE
 
DataONE Education Module 10: Legal and Policy Issues
DataONE Education Module 10: Legal and Policy IssuesDataONE Education Module 10: Legal and Policy Issues
DataONE Education Module 10: Legal and Policy IssuesDataONE
 
Research data management at TU Eindhoven
Research data management at TU EindhovenResearch data management at TU Eindhoven
Research data management at TU EindhovenLeon Osinski
 
EDI Training Module 12: Learn to Cite and Link Your Data
EDI Training Module 12:  Learn to Cite and Link Your DataEDI Training Module 12:  Learn to Cite and Link Your Data
EDI Training Module 12: Learn to Cite and Link Your DataEnvironmental Data Initiative
 
DataONE Education Module 03: Data Management Planning
DataONE Education Module 03: Data Management PlanningDataONE Education Module 03: Data Management Planning
DataONE Education Module 03: Data Management PlanningDataONE
 
What funders want you to do with your data
What funders want you to do with your dataWhat funders want you to do with your data
What funders want you to do with your dataLeon Osinski
 
Metadata lecture(9 17-14)
Metadata lecture(9 17-14)Metadata lecture(9 17-14)
Metadata lecture(9 17-14)mhb120
 
Advantages of metadata
Advantages of metadataAdvantages of metadata
Advantages of metadataAzeem Sultan
 
Ag Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and dataAg Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and dataCyndy Parr
 
Dataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataDataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataTom Plasterer
 
FAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsFAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsTom Plasterer
 
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...dkNET
 
Essentials 4 Data Support: a fine course in FAIR Data Support
Essentials 4 Data Support: a fine course in FAIR Data SupportEssentials 4 Data Support: a fine course in FAIR Data Support
Essentials 4 Data Support: a fine course in FAIR Data SupportEllen Verbakel
 
You down with dmp yeah you know me!
You down with dmp  yeah you know me!You down with dmp  yeah you know me!
You down with dmp yeah you know me!Renaine Julian
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Historic Environment Scotland
 

Was ist angesagt? (20)

A basic course on Reseach data management, part 2: protecting and organizing ...
A basic course on Reseach data management, part 2: protecting and organizing ...A basic course on Reseach data management, part 2: protecting and organizing ...
A basic course on Reseach data management, part 2: protecting and organizing ...
 
A basic course on Research data management, part 3: sharing your data
A basic course on Research data management, part 3: sharing your dataA basic course on Research data management, part 3: sharing your data
A basic course on Research data management, part 3: sharing your data
 
EDI Training Module 4: Organizing Data Into Publishable Units
EDI Training Module 4: Organizing Data Into Publishable UnitsEDI Training Module 4: Organizing Data Into Publishable Units
EDI Training Module 4: Organizing Data Into Publishable Units
 
DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?DataONE Education Module 01: Why Data Management?
DataONE Education Module 01: Why Data Management?
 
DataONE Education Module 10: Legal and Policy Issues
DataONE Education Module 10: Legal and Policy IssuesDataONE Education Module 10: Legal and Policy Issues
DataONE Education Module 10: Legal and Policy Issues
 
Research data management at TU Eindhoven
Research data management at TU EindhovenResearch data management at TU Eindhoven
Research data management at TU Eindhoven
 
data citation
data citationdata citation
data citation
 
EDI Training Module 12: Learn to Cite and Link Your Data
EDI Training Module 12:  Learn to Cite and Link Your DataEDI Training Module 12:  Learn to Cite and Link Your Data
EDI Training Module 12: Learn to Cite and Link Your Data
 
DataONE Education Module 03: Data Management Planning
DataONE Education Module 03: Data Management PlanningDataONE Education Module 03: Data Management Planning
DataONE Education Module 03: Data Management Planning
 
What funders want you to do with your data
What funders want you to do with your dataWhat funders want you to do with your data
What funders want you to do with your data
 
Metadata lecture(9 17-14)
Metadata lecture(9 17-14)Metadata lecture(9 17-14)
Metadata lecture(9 17-14)
 
Advantages of metadata
Advantages of metadataAdvantages of metadata
Advantages of metadata
 
Ag Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and dataAg Data Commons: Agricultural research metadata and data
Ag Data Commons: Agricultural research metadata and data
 
Dataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataDataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* Data
 
Va sla nov 15 final
Va sla nov 15 finalVa sla nov 15 final
Va sla nov 15 final
 
FAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsFAIR Data Knowledge Graphs
FAIR Data Knowledge Graphs
 
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
bioCADDIE Webinar: The NIDDK Information Network (dkNET) - A Community Resear...
 
Essentials 4 Data Support: a fine course in FAIR Data Support
Essentials 4 Data Support: a fine course in FAIR Data SupportEssentials 4 Data Support: a fine course in FAIR Data Support
Essentials 4 Data Support: a fine course in FAIR Data Support
 
You down with dmp yeah you know me!
You down with dmp  yeah you know me!You down with dmp  yeah you know me!
You down with dmp yeah you know me!
 
Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...Supporting the development of a national Research Data Discovery Service - A ...
Supporting the development of a national Research Data Discovery Service - A ...
 

Ähnlich wie Research data management for historians

The challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpThe challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpVarsha Khodiyar
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016 Rebecca Raworth, MLIS
 
Research data management workshop April 2016
Research data management workshop April 2016Research data management workshop April 2016
Research data management workshop April 2016Rebecca Raworth, MLIS
 
Research data management: course 0HV90, Behavioral Research Methods
Research data management: course 0HV90, Behavioral Research MethodsResearch data management: course 0HV90, Behavioral Research Methods
Research data management: course 0HV90, Behavioral Research MethodsLeon Osinski
 
Data management (newest version)
Data management (newest version)Data management (newest version)
Data management (newest version)Graça Gabriel
 
Documentation and Metdata - VA DM Bootcamp
Documentation and Metdata - VA DM BootcampDocumentation and Metdata - VA DM Bootcamp
Documentation and Metdata - VA DM BootcampSherry Lake
 
Introduction to data management
Introduction to data managementIntroduction to data management
Introduction to data managementCunera Buys
 
Research data management : [part of] PROOF course Finding and controlling sci...
Research data management : [part of] PROOF course Finding and controlling sci...Research data management : [part of] PROOF course Finding and controlling sci...
Research data management : [part of] PROOF course Finding and controlling sci...Leon Osinski
 
Data Archiving and Sharing
Data Archiving and SharingData Archiving and Sharing
Data Archiving and SharingC. Tobin Magle
 
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...BigData_Europe
 
Make your data great now
Make your data great nowMake your data great now
Make your data great nowDaniel JACOB
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycleMarieke Guy
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataAnita de Waard
 
Data sharing as part of the research workflow
Data sharing as part of the research workflowData sharing as part of the research workflow
Data sharing as part of the research workflowVarsha Khodiyar
 
Data peer review workshop
Data peer review workshopData peer review workshop
Data peer review workshopVarsha Khodiyar
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data managementdri_ireland
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE
 
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataStuart Chalk
 

Ähnlich wie Research data management for historians (20)

The challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpThe challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can help
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016
 
Research data management workshop April 2016
Research data management workshop April 2016Research data management workshop April 2016
Research data management workshop April 2016
 
Research data management: course 0HV90, Behavioral Research Methods
Research data management: course 0HV90, Behavioral Research MethodsResearch data management: course 0HV90, Behavioral Research Methods
Research data management: course 0HV90, Behavioral Research Methods
 
Data management (newest version)
Data management (newest version)Data management (newest version)
Data management (newest version)
 
Documentation and Metdata - VA DM Bootcamp
Documentation and Metdata - VA DM BootcampDocumentation and Metdata - VA DM Bootcamp
Documentation and Metdata - VA DM Bootcamp
 
Introduction to data management
Introduction to data managementIntroduction to data management
Introduction to data management
 
Research data management : [part of] PROOF course Finding and controlling sci...
Research data management : [part of] PROOF course Finding and controlling sci...Research data management : [part of] PROOF course Finding and controlling sci...
Research data management : [part of] PROOF course Finding and controlling sci...
 
Data Archiving and Sharing
Data Archiving and SharingData Archiving and Sharing
Data Archiving and Sharing
 
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
 
Make your data great now
Make your data great nowMake your data great now
Make your data great now
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycle
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
 
Data sharing as part of the research workflow
Data sharing as part of the research workflowData sharing as part of the research workflow
Data sharing as part of the research workflow
 
Data peer review workshop
Data peer review workshopData peer review workshop
Data peer review workshop
 
Introduction to research data management
Introduction to research data managementIntroduction to research data management
Introduction to research data management
 
Good Practice in Research Data Management
Good Practice in Research Data ManagementGood Practice in Research Data Management
Good Practice in Research Data Management
 
NISO Training Thursday Crafting a Scientific Data Management Plan
NISO Training Thursday Crafting a Scientific Data Management PlanNISO Training Thursday Crafting a Scientific Data Management Plan
NISO Training Thursday Crafting a Scientific Data Management Plan
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
 
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical DataA Generic Scientific Data Model and Ontology for Representation of Chemical Data
A Generic Scientific Data Model and Ontology for Representation of Chemical Data
 

Mehr von Jessica Parland-von Essen

Tutkimusaineistojen kuvailu, metadata ja yhteentoimivuus
Tutkimusaineistojen kuvailu, metadata ja yhteentoimivuusTutkimusaineistojen kuvailu, metadata ja yhteentoimivuus
Tutkimusaineistojen kuvailu, metadata ja yhteentoimivuusJessica Parland-von Essen
 
Fairdata-palvelut ja tutkimusaineistojen pitkäaikaissäilytys
Fairdata-palvelut ja tutkimusaineistojen pitkäaikaissäilytysFairdata-palvelut ja tutkimusaineistojen pitkäaikaissäilytys
Fairdata-palvelut ja tutkimusaineistojen pitkäaikaissäilytysJessica Parland-von Essen
 
Collections meet the researcher. Digitalization, disintegration and disillusi...
Collections meet the researcher. Digitalization, disintegration and disillusi...Collections meet the researcher. Digitalization, disintegration and disillusi...
Collections meet the researcher. Digitalization, disintegration and disillusi...Jessica Parland-von Essen
 
Supporting FAIR data principles with data categorization
Supporting FAIR data principles with data categorizationSupporting FAIR data principles with data categorization
Supporting FAIR data principles with data categorizationJessica Parland-von Essen
 
Tutkimusaineistoihiin viittaaminen, pysyvät tunnisteet ja linkittäminen
Tutkimusaineistoihiin viittaaminen, pysyvät tunnisteet ja linkittäminenTutkimusaineistoihiin viittaaminen, pysyvät tunnisteet ja linkittäminen
Tutkimusaineistoihiin viittaaminen, pysyvät tunnisteet ja linkittäminenJessica Parland-von Essen
 

Mehr von Jessica Parland-von Essen (20)

Planning a Finnish PID Roadmap
Planning a Finnish PID Roadmap Planning a Finnish PID Roadmap
Planning a Finnish PID Roadmap
 
Tutkimusaineistojen kuvailu, metadata ja yhteentoimivuus
Tutkimusaineistojen kuvailu, metadata ja yhteentoimivuusTutkimusaineistojen kuvailu, metadata ja yhteentoimivuus
Tutkimusaineistojen kuvailu, metadata ja yhteentoimivuus
 
Pid landscape in finland
Pid landscape in finlandPid landscape in finland
Pid landscape in finland
 
Fairdata-palvelut ja tutkimusaineistojen pitkäaikaissäilytys
Fairdata-palvelut ja tutkimusaineistojen pitkäaikaissäilytysFairdata-palvelut ja tutkimusaineistojen pitkäaikaissäilytys
Fairdata-palvelut ja tutkimusaineistojen pitkäaikaissäilytys
 
Open Science goes FAIR
Open Science goes FAIROpen Science goes FAIR
Open Science goes FAIR
 
Metatiedot tunnisteet tutkimisdata
Metatiedot tunnisteet tutkimisdataMetatiedot tunnisteet tutkimisdata
Metatiedot tunnisteet tutkimisdata
 
Towards a FAIR lifecycle
Towards a FAIR lifecycleTowards a FAIR lifecycle
Towards a FAIR lifecycle
 
A Finnish perspective on FAIRsFAIR outputs
A Finnish perspective on FAIRsFAIR outputsA Finnish perspective on FAIRsFAIR outputs
A Finnish perspective on FAIRsFAIR outputs
 
Persistence and Interoperability
Persistence and InteroperabilityPersistence and Interoperability
Persistence and Interoperability
 
Collections meet the researcher. Digitalization, disintegration and disillusi...
Collections meet the researcher. Digitalization, disintegration and disillusi...Collections meet the researcher. Digitalization, disintegration and disillusi...
Collections meet the researcher. Digitalization, disintegration and disillusi...
 
Supporting FAIR data principles with data categorization
Supporting FAIR data principles with data categorizationSupporting FAIR data principles with data categorization
Supporting FAIR data principles with data categorization
 
FAIR data and the Etsin service
FAIR data and the Etsin serviceFAIR data and the Etsin service
FAIR data and the Etsin service
 
Yhteiskuntatieteen aineistot
Yhteiskuntatieteen aineistotYhteiskuntatieteen aineistot
Yhteiskuntatieteen aineistot
 
Avoimen suomen historia
Avoimen suomen historiaAvoimen suomen historia
Avoimen suomen historia
 
Open Science Process
Open Science ProcessOpen Science Process
Open Science Process
 
Tutkimusaineistoihiin viittaaminen, pysyvät tunnisteet ja linkittäminen
Tutkimusaineistoihiin viittaaminen, pysyvät tunnisteet ja linkittäminenTutkimusaineistoihiin viittaaminen, pysyvät tunnisteet ja linkittäminen
Tutkimusaineistoihiin viittaaminen, pysyvät tunnisteet ja linkittäminen
 
AffarerAllianserAnseende
AffarerAllianserAnseendeAffarerAllianserAnseende
AffarerAllianserAnseende
 
Avoin tiede Suomessa
Avoin tiede SuomessaAvoin tiede Suomessa
Avoin tiede Suomessa
 
Forskningsdataforhumanister
ForskningsdataforhumanisterForskningsdataforhumanister
Forskningsdataforhumanister
 
Data Management in Research
Data Management in ResearchData Management in Research
Data Management in Research
 

Kürzlich hochgeladen

Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Colleen Farrelly
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhijennyeacort
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...Boston Institute of Analytics
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 217djon017
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptSonatrach
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...limedy534
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxEmmanuel Dauda
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFAAndrei Kaleshka
 
IMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxIMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxdolaknnilon
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfchwongval
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一F sss
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]📊 Markus Baersch
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfgstagge
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degreeyuu sss
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home ServiceSapana Sha
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectBoston Institute of Analytics
 

Kürzlich hochgeladen (20)

Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024Generative AI for Social Good at Open Data Science East 2024
Generative AI for Social Good at Open Data Science East 2024
 
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝DelhiRS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
RS 9000 Call In girls Dwarka Mor (DELHI)⇛9711147426🔝Delhi
 
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
NLP Data Science Project Presentation:Predicting Heart Disease with NLP Data ...
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2Easter Eggs From Star Wars and in cars 1 and 2
Easter Eggs From Star Wars and in cars 1 and 2
 
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.pptdokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
dokumen.tips_chapter-4-transient-heat-conduction-mehmet-kanoglu.ppt
 
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
Effects of Smartphone Addiction on the Academic Performances of Grades 9 to 1...
 
Customer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptxCustomer Service Analytics - Make Sense of All Your Data.pptx
Customer Service Analytics - Make Sense of All Your Data.pptx
 
Call Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort ServiceCall Girls in Saket 99530🔝 56974 Escort Service
Call Girls in Saket 99530🔝 56974 Escort Service
 
How we prevented account sharing with MFA
How we prevented account sharing with MFAHow we prevented account sharing with MFA
How we prevented account sharing with MFA
 
IMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptxIMA MSN - Medical Students Network (2).pptx
IMA MSN - Medical Students Network (2).pptx
 
Multiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdfMultiple time frame trading analysis -brianshannon.pdf
Multiple time frame trading analysis -brianshannon.pdf
 
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
办理学位证中佛罗里达大学毕业证,UCF成绩单原版一比一
 
GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]GA4 Without Cookies [Measure Camp AMS]
GA4 Without Cookies [Measure Camp AMS]
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
RadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdfRadioAdProWritingCinderellabyButleri.pdf
RadioAdProWritingCinderellabyButleri.pdf
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
毕业文凭制作#回国入职#diploma#degree澳洲中央昆士兰大学毕业证成绩单pdf电子版制作修改#毕业文凭制作#回国入职#diploma#degree
 
9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service9654467111 Call Girls In Munirka Hotel And Home Service
9654467111 Call Girls In Munirka Hotel And Home Service
 
Heart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis ProjectHeart Disease Classification Report: A Data Analysis Project
Heart Disease Classification Report: A Data Analysis Project
 

Research data management for historians

  • 1. CSC – Suomalainen tutkimuksen, koulutuksen ja julkishallinnon ICT-osaamiskeskus Get the most out of your data! Data publication, tracking and citation Jessica PvE 28.2.2018
  • 2. Working with data: amount of work 28.2.20182 Data Science Report 2016 http://visit.crowdflower.com/rs/416-ZBE- 142/images/CrowdFlower_DataScienceReport_2016.pdf
  • 3. Monya Baker: 1,500 scientists lift the lid on reproducibility. Survey sheds light on the ‘crisis’ rocking research. Nature 533, 2016. doi:10.1038/533452a
  • 4. Data citation • Project by Finnish Committee for Research • Research data should be FAIR • Data should have: Creator, title, publisher, publication time, identifier • Recommended additional information: Version, resource type, copyright status 28.2.20184
  • 6. • Include principles of data as evidence and data transparency in next version of Finnish RCR guideline byTENK. • Recognise data authorship as a distinct issue and discussion in the TENK authorship guideline http://www.tenk.fi/sites/tenk.fi/files/TENK_suositus_tek ijyys.pdf • Create a multi-institutional, multidisciplinary working group to define principles for defining data authorship, coordinated f.e. byTENK, OR assign national representation to a relevant international activity with the same goal. • Organize multidisciplinary discussion on data management and citation, with the aim of creating interoperable practices. • Promote the use of data reference model also when referring to authors own primary source data. • Define field specific level of granularity for data citation. Recommendations 6
  • 7. • data licensing • metadata standards • open data formats • shared vocabularies • informed decisions to share data • access rights • descriptive metadata • persistent identifiers Findable Accessible ReusableInter-operable force11.org/group/fairgroup/fairprinciples
  • 8. FAIR Findable Accessible Interoperable Reusable By Martin Grandjean - Own work : http://www.martingrandjean.ch/wp-content/uploads/2013/10/Graphe3.png, CC BY-SA 3.0, https://commons.wikimedia.org/w/index.php?curid=29364647
  • 9. 1.(Meta)data are assigned a globally unique and persistent identifier 2. Data are described with rich metadata 3. Metadata clearly and explicitly include the identifier of the data it describes 4. (Meta)data are registered or indexed in a searchable resource URN
  • 10. 1.(Meta)data are retrievable by their identifier using a standardized communications protocol 2.Metadata are accessible, even when the data are no longer available
  • 11. 1.(Meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation. 2.(Meta)data use vocabularies that follow FAIR principles 3. (Meta)data include qualified references to other (meta)data
  • 12. Meta(data) are richly described with a plurality of accurate and relevant attributes
  • 13. Persistent Identifiers (DOI, URN) Data Catalog Resolver PID Data file License Configuration files Read me Landing page
  • 14. Nano publications, linking data and compact identifiers • https://www.go-fair.org/ • http://identifiers.org/ 28.2.201814
  • 15. Schofield et al (2015) http://digitalhumanities.org:8081/dhq/vol/9/3/000227/000227.html Rutgers lib guide
  • 17. Organising your data • Sort and classify your information • For instance: don’t mix different types of information in excel columns: it is usually easier to combine datasets than sort out ill structured data later • Think about granularity (file size) and metadata • Decide on formats, units, codes etc and be consequent 28.2.201817
  • 18. Organising your data • Write a code book, document • Think about intelligibility • Be careful when rearranging, reformatting, sorting or copy- pasting data • Use common file formats, preferably open • Have processes in place for checking the data quality and completeness • Be clear about master copies and other copies 28.2.201818
  • 19. Organising your data • Be careful and plan well for sensitive data and anonymization • Think about security and access rights • Plan and agree on which versions of a dataset will be archived and/or published • Think about reproducibility and citing data 28.2.201819
  • 20. Files and folders • Create and agree on a system for naming files and folders and be consequent • Avoid very deep folder structures, since they can be difficult to handle • Use meaningful, unique file and folder names • Keep names as short as possible but relevant. 25 characters is usually considered maximum. 28.2.201820
  • 21. Files and folders • Dates inYYYY-MM-DD format allows you to sort and search your files • Avoid using special characters such as % & / : ; * . ? < > ^! “ () and Scandinavians • Use three digits (or 4 if you have a large number of files) i.e. 001, 002…….201, 202 (not 1, 2, 21). • Use underscores (_) instead of spaces 28.2.201821
  • 22. Files and folders • If using a personal name in the name give the surname first followed by first name. • Though, be very careful with personal data when naming files and folders • Indicate version number by using ‘V’ or “version” and number (and subversions with more digits if minor changes) 28.2.201822
  • 23. Types of data in research Open Data Generic Research Data Fixed Research Datasets Description • Public data for any use, commercial • May be dynamic • PSI • Mature data products • Not produced /monitored by researchers • Ex. Weather data, public transport • Produced by/with/for researchers • Validated, good quality • Well documented • Raw or processed • Datasets stable/version controlled • Ex. Corpus, SMEAR • Produced for specific research question • Underpins research, for replication • Might be very processed • Reuse value might be low unless mature field Format Stable, standardised May vary over time Must be preserved according to plan 28.2.201823
  • 24. License and publish your data • FSDTietoarkisto • Language Bank of Finland • Figshare • Zenodo • EUDAT • IDA and Etsin 28.2.201824

Hinweis der Redaktion

  1. It gets funded It’s reproducible It’s well documented
  2. Fair data is good It’s linked to other information be It’s sustainable, the links are unique and persistent It’s standardized and interoperable