As a contribution to the currently ongoing larger effort to establish Open Science as best practices in academia, this article focuses on the Open Source and Open Access tiers of the Open Science triad and community software projects. The current situation of research software development and the need to recognize it as a significant contribution to science is introduced in relation to Open Science. The adoption of the Open Science paradigms occurs at different speeds and on different levels within the various fields of science and crosscutting software communities. This is paralleled by the emerging of an underlying futuresafe technical infrastructure based on open standards to enable proper recognition for published articles, data, and software. Currently the number of journal publications about research software remains low in comparison to the amount of research code published on various software repositories in the WWW. Because common standards for the citation of software projects (containers) and versions of software are lacking, the FORCE11 group and the CodeMeta project recommending to establish Persistent Identifiers (PIDs), together with suitable metadata setss to reliably cite research software. This approach is compared to the best practices implemented by the OSGeo Foundation for geospatial community software projects. For GRASS GIS, a OSGeo project and one of the oldest geospatial open source community projects, the external requirements for DOI-based software citation are compared with the projects software documentation standards. Based on this status assessment, application scenarios are derived, how OSGeo projects can approach DOI-based software citation, both as a standalone option and also as a means to foster open access journal publications as part of reproducible Open Science.
Ähnlich wie INTEGRATION OPTIONS FOR PERSISTENT IDENTIFIERS IN OSGEO PROJECT REPOSITORIES: TOWARDS OSGEO BEST PRACTICES FOR SCIENTIFIC SOFTWARE CITATION
Ähnlich wie INTEGRATION OPTIONS FOR PERSISTENT IDENTIFIERS IN OSGEO PROJECT REPOSITORIES: TOWARDS OSGEO BEST PRACTICES FOR SCIENTIFIC SOFTWARE CITATION (20)
Role of AI in seed science Predictive modelling and Beyond.pptx
INTEGRATION OPTIONS FOR PERSISTENT IDENTIFIERS IN OSGEO PROJECT REPOSITORIES: TOWARDS OSGEO BEST PRACTICES FOR SCIENTIFIC SOFTWARE CITATION
1. TOWARDS OSGEO BEST PRACTICES FOR SCIENTIFIC SOFTWARE CITATION
INTEGRATION OPTIONS FOR PERSISTENT
IDENTIFIERS IN OSGEO PROJECT
REPOSITORIES
Peter Löwe, Markus Neteler, Jan Goebel, Marco Tullney
Boston, August 17 2017
2. 4
Original sin ?
Science + Culture of Sharing = Open Science
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation2
https://xkcd.com/1228/
Does Prometheus
receive due credit for
his creativity ?
3. 4 Open Science
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation3
https://en.wikipedia.org/wiki/Open_science#/media/File:Open_Science_-_Prinzipien.png
Open Science is the
movement to make
scientific research
and data accessible
to all
4. Open Science Triangle: Science-related benefits
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation4
Open Access
Open Data Open Source
Society:
• Greater availability and
accessibility of publicly funded
scientific research outputs
• Greater reproducibility and
transparency of scientific works
Community:
• Possibility for rigorous peer-
review
Individual:
• Greater impact of scientific
research
Code citation: Requires standards and infrastructure
Code citation
required
5. Motivation for Code Citation
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation5
Understanding research fields: code as important part of the
record of research and progress in science (no „throwaway code“)
Credit: Researchers on all levels(including students!) deserve credit in
their coin of the realm (aka citation), especially when this work enables
further research by others.
Discoverability: Citation enables finding and reuse
Reproducability: Citation of a specific software is required, but also
information about underlying software stack and configurations are
needed
7. Software Citation Best Practices
according to FORCE 11
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation7
• Importance Software matters in Science
• Credit and Attribution Get due credit for your work
• Unique Identification Unique, presistent, interoperable
• Persistence Identifier & metadata never expire
• Accessibility Code & documentation, interop.
• Specifity Reference to specific code versions
https://www.force11.org/
8. PUBLISHING RESEARCH SOFTWARE
Open Access Journals for Geospatial Research Software ?
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation8
In comparison to the actual magnitude of research code
being produced, only a fraction is being communicated by
journals.
As a result, advances in scientific software are not being
properly communicated and therefore remain inaccessible
to other scientists.
9. Reality Check: OSGeo Journal
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation9
• Founded in 2007
• Online Journal
• ISSN
• Publishes FOSS4G proceedings
• No defined standards for software citation (yet)
http://www.osgeo.org/journal
10. Journal of Open Source Software
-a Role Model ?
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation10
„DOI-Link“ points to code
within github repository.
Metadata stored in Zenodo.
11. Motivation for DOI links
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation11
Long term perspective:
• Data and code will move within the WWW,
• URL links to webpages will expire over time .
Digital Object Identifers (DOIs) as a way to ensure stable links,
preventing:
Very bad
12. 5 Introducing Digital Object Identifiers (DOI)
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation12
• DOI System ISO Standard 26324 (2012)
• International DOI Foundation (1998).
• Based on the Handle system.
• Long-term persistence and
accessibility of information.
• Global infrastructure provider for research data and code
DataCite (non-profit, software infrastructure is FOSS):
https://www.datacite.org/
13. 5 What is a DOI ?
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation13
DOI: Acronym for "digital object identifier“.
A DOI identifies the object itself and not the place where it is
located.
What you see: alphanumeric string (never changes)
Associated with: location (such as URL)
Accompanied with: who, what, when… (metadata)
14. WHAT TO USE DOIs FOR ?
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation14
DOIs can be used to reference
• Publications
• Code
• Data
Open Access
Open Data Open Source
15. DOI magic explained:
Man in the middle – can be friendly…
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation15
https://image.slidesharecdn.com/doi-100203060339-phpapp01/95/doi-in-he-11-728.jpg?cb=1265177093
https://www.deepdotweb.com/wp-content/uploads/2016/10/word-image-19.png
• DOIs are resolved by a resolving entity („man in the middle“).
• The resolving entity does not host the data itself.
• It receives updates from the hosting data repository whenever the
data changes location (new URL).
• A DOI will then always resolve to a valid landing page.
16. DOI magic explained:
Man in the middle – can be friendly…
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation16
https://image.slidesharecdn.com/doi-100203060339-phpapp01/95/doi-in-he-11-728.jpg?cb=1265177093
https://www.deepdotweb.com/wp-content/uploads/2016/10/word-image-19.png
• DOIs are resolved by a resolving entity („man in the middle“).
• The resolving entity does not host the data itself.
• It receives updates from the hosting data repository whenever the
data changes location (new URL).
• A DOI will then always resolve to a valid landing page.
Code,
Data
17. Requirement: DOIs resolve to landing pages
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation17
• Every DOI resolves to a landing page.
• Landing pages provide metadata and further
content
• DOIs are designed to outlive their content,
OSGeo content like GRASS module manual pages
already qualify as landing pages for DOIs
18. 5 DOI is a quality label
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation18
A digital object with a DOI has to be:
Stable° (i.e. not going to be modified)
Complete (i.e. not going to be updated)
Permanent – by assigning a DOI we’re committing to make
the dataset available for posterity
Good quality – by assigning a DOI its receiving the data
centre’s stamp of approval, saying that it’s complete and all
the metadata is available
(°DOI can handle software-versioning )
Seal of
Approval
19. DOIs are on the rise
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation19
www.datacite.org
20. Example: DOI for Journal Articles
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation20
21. Example: DOIs for Data
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation21
22. Example: DOIs for Code
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation22
23. DOIs being currently used by OSGeo: Video
Scientific-technical video is part of the research cycle
• FOSS4G presentations deserve scientific credit by citation and
long term preservation in a repository
Open Access
Open Data Open Source
24. OSGeo Videos with DOIs
• OSGeo conference recordings are hosted by FOSS4G
media partner German National Library for Science and
Technology (TIB) .
• The annual growth exceeds 100 hours of new content
• OSGeo videos are part of the record of science
https://wiki.osgeo.org/wiki/Global_conferences_overview
http://dx.doi.org/10.5446/14749#t=39:10,39:33
DOI Timestamp
Scientifc
citation
25. GRASS GIS
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation25
• GRASS GIS,
• Over 3 decades experience (since 1982)
• OSGeo project
• Over 350 modules
• Additional add-on modules
• Main repository: SVN
https://grass.osgeo.org/
26. GRASS Code Citation
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation26
The GRASS GIS project wiki provides advice, how to cite
versions of GRASS GIS in scientific. No coverage of DOIs (yet).
27. GRASS Code Levels
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation27
1. „External code“, based on GRASS repo, not shared with community,
not hosted in OSGeo GRASS repository. Potentially volatile.
2. Add-on modules: Hosted and preserved in OSGeo GRASS
repository, minimal quality standards, including standardized
landing page (GRASS module manual page), limited peer review,
discoverable by GRASS search functions
3. Core modules: Hosted and preserved in GRASS repository, manual
page with links to previous code versions, demo data, reference to
devolpers, rigorous peer review by GRASS community, discoverable
by GRASS search functions
28. Individual Level: Zenodo Option
external code / add-ons
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation28
International
DOI Foundation
Registration
Agencies
Member
Datacenter
DatacenterOther
Datacentes
Managing Agent
Member
Member
Other Members
Datacenter
Personal
github repo
Dawn of a
code
diaspora ?
29. Community Level: Zenodo Option for
GRASS Repository
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation29
International
DOI Foundation
Registration
Agencies
Member
Datacenter
DatacenterOther
Datacentes
Managing Agent
Member
Member
Other Members
Datacenter
SVN Repo
migration
(RISK)
Other OSGeo
projects ? ?
GRASS
code
Repo
30. Reality check: Zenodo (and figshare) are
all-purpose Repositories: One size fits all ?
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation30
Rueda, Laura. (2017, May). Introduction to DataCite. Zenodo. http://doi.org/10.5281/zenodo.571808
All
purpose.
Good ?
31. Project Community Level:
GRASS Project DataCenter
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation31
International
DOI Foundation
Registration
Agencies (9)
Member
Datacenter
DatacenterOther
Datacentes
Managing Agent
Member
Member
Other Members
Datacenter
SVN Repo
!
GRASS
SVN
Repo
32. Umbrella Option: OSGeo becomes a DOI member,
unlimited DOI minting for all OSGeo projects.
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation32
International
DOI Foundation
Registration
Agencies
Member
Datacenter
DatacenterOther
Datacentes
Managing Agent
Member
Member
Other Members
Datacenter
Repo
All OSGeo
Projects !
Meta data
guidelines
Meta data
guidelines
GRASS
SVN
Repo
33. 5
Opportunity:
OSGeo to benefit from Datacite Services
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation33
Search.datacite.org
34. Proposal for Follow-up Action
Löwe, Neteler, Goebel, Tullney: FOSS4G 2017
Towards OSGeo Best Practices for Scientific Software Citation34
• Make code citation a OSGeo topic
• Journal
• Projects
• Incubation
• Discuss DOI-/citation-related best-practices within OSGeo
• Explore:
Conduct tests on project level
Geo For All
35. Vielen Dank für Ihre Aufmerksamkeit.
DIW Berlin — Deutsches Institut
für Wirtschaftsforschung e.V.
Mohrenstraße 58, 10117 Berlin
www.diw.de
Redaktion
Peter Löwe (ploewe@diw.de)
³German Institute for Economic Research
Mohrenstraße 58, 10117 Berlin, Germany
E-mail: jgoebel@diw.de
http://orcid.org/0000-0002-3243-1935
⁴Technische Informationsbibliothek
Welfengarten 1B, 30167 Hannover, Germany
E-mail: marco.tullney@tib.eu
https://orcid.org/0000-0002-5111-2788
¹German Institute for Economic Research
Mohrenstraße 58, 10117 Berlin, Germany
E-mail: ploewe@diw.de
http://orcid.org/0000-0003-2257-0517
²Mundialis GmbH & Co. KG
Kölnstraße 99, 53111 Bonn, Germany
E-mail: neteler@mundialis.de
https://orcid.org/0000-0003-1916-1966
Peter Löwe¹, Markus Neteler², Jan Goebel³ and Marco Tullney⁴