Digital Object Identifiers for
EOSDIS data
HDF Workshop
April 17, 2012
John Moses, ESDIS
John.f.moses@nasa.gov
Assessment of identification schemes

Study by ESIP Cluster on Preservation and Stewardship in 2009
[Table: each ID scheme is assessed as a Unique Identifier, Unique Locator, Citable Locator, and Scientifically Unique ID, with separate columns for Data Set and Item. The ratings are not reproduced here.]
ID schemes assessed: URL/N/I, PURL, XRI, Handle, DOI, ARK, LSID, OID, UUID
Adapted from Duerr, R. E., et al. 2011 (submitted). On the utility of identification schemes for digital Earth
science data: An assessment and recommendations. Earth Science Informatics.
2
Digital Object Identifier for EOS products
• The DOI® system and the Handle System provide an Internet
resolution service for unique and persistent identifiers of
digital objects
– Internet infrastructure components owned by the International DOI Foundation
(IDF): www.doi.org

• A DOI is a two-part alphanumeric string
– doi:[prefix]/[suffix]; for example, doi:10.5067/123
– Prefix: 10 identifies the DOI registry; 5067 identifies the registrant
– Suffix: the alphanumeric string 123 uniquely identifies the data item

• The purpose in assigning DOIs to EOSDIS products is to
provide a permanent data identifier for citation in
publications
– ESIP citation guideline using doi:
– Doe, J. and R. Roe. 2001. The FOO Data Set. Version 2.3. The FOO Data Center.
http://dx.doi.org/10.xxxx/notfoo.547983. Accessed 1 May 2011.
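
The prefix/suffix structure and the citation link described on this slide can be illustrated with a short Python sketch. The names `Doi`, `parse_doi`, and `resolver_url` are hypothetical helpers introduced here for illustration; they are not part of any DOI or EOSDIS tool. The resolver base is the `http://dx.doi.org/` form shown in the citation example.

```python
from typing import NamedTuple


class Doi(NamedTuple):
    directory: str   # "10" marks the string as belonging to the DOI registry
    registrant: str  # e.g. "5067"
    suffix: str      # item name chosen by the registrant


def parse_doi(text: str) -> Doi:
    """Split strings such as 'doi: 10.5067/123' into prefix parts and suffix."""
    body = text.strip()
    if body.lower().startswith("doi:"):
        body = body[4:].strip()
    # Only the first "/" separates prefix from suffix; the suffix may contain more.
    prefix, slash, suffix = body.partition("/")
    directory, dot, registrant = prefix.partition(".")
    if not (slash and dot and suffix and registrant) or directory != "10":
        raise ValueError(f"not a DOI: {text!r}")
    return Doi(directory, registrant, suffix)


def resolver_url(doi: Doi, base: str = "http://dx.doi.org/") -> str:
    """Form the resolvable link that would appear in a citation."""
    return f"{base}{doi.directory}.{doi.registrant}/{doi.suffix}"


example = parse_doi("doi: 10.5067/123")
print(example)
print(resolver_url(example))
```

Splitting on the first slash only is a deliberate choice: the suffix models proposed later in the deck contain slashes of their own.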
3
Implementing DOIs for EOSDIS
– Develop ops concept through pilot processes
• Guidelines for DOI suffix, location & citation information.
• Request, assign, monitor DOIs, location & citation metadata
• Add DOIs to DAAC product citation web pages
• Embed DOIs into product metadata at next reprocessing
– HIRDLS, GLAS, AMSR-E data providers are in final
reprocessing
• Add DOIs to GCMD and ECHO through metadata updates
• Add DOI metadata to NTRS for searchable documentation
• Set up metrics collection from journal citation reports

4
Implementation in Interoperable Architectures
[Figure: Metadata flows in NASA Earth Science Data Systems. Labeled elements: Provenance collection, Provenance Services, NASA Technical Reports Server, tools, DOI.]
5
Attributes for embedding DOIs
• Framework structures in HDF and netCDF
– HDF global attribute name and value versus naming an identifier
group (which would allow discovery of identifier types)
– ECS CoreMetadata Product Specific Attributes in the
AdditionalAttributes group section
– netCDF file-level attribute names: “id” and “naming_authority”

• Consider attribute names for DOI value:
– Advantage to having two parts: a key code to indicate that this is an
identifier, and a namespace that indicates the type/application of the DOI;
e.g., that it applies at the data product level (i.e., has the same value for
all granules/files of the series, making it a series identifier).

• Hypothetical DOI example
– Attribute name: identifier_product_DOI
– Attribute value: 10.5067/Aura/HIRDLS/data1
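
A minimal Python sketch of the two-part naming idea follows. Plain dictionaries stand in for each file's global attributes; with a real HDF or netCDF library the same assignment would go through that library's attribute interface. The function names and the `granule` attribute are hypothetical and exist only for this illustration.

```python
from typing import Dict, Iterable, MutableMapping


def doi_attribute_name(namespace: str, key: str = "identifier", kind: str = "DOI") -> str:
    """Compose '<key>_<namespace>_<kind>', e.g. identifier_product_DOI."""
    return f"{key}_{namespace}_{kind}"


def tag_series(granules: Iterable[MutableMapping[str, str]], doi: str,
               namespace: str = "product") -> str:
    """Write the same product-level DOI into every granule of a series."""
    name = doi_attribute_name(namespace)
    for attrs in granules:
        attrs[name] = doi
    return name


def find_identifiers(attrs: MutableMapping[str, str], key: str = "identifier") -> Dict[str, str]:
    """Discover identifier attributes by their key code alone."""
    prefix = key + "_"
    return {name: value for name, value in attrs.items() if name.startswith(prefix)}


# Two stand-in granules of one product series (attribute sets are illustrative).
granules = [{"granule": "file_001"}, {"granule": "file_002"}]
tag_series(granules, "10.5067/Aura/HIRDLS/data1")
print(find_identifiers(granules[0]))
```

The key code lets a reader discover identifiers without knowing their type in advance, while the namespace records that the value is shared by all files in the series.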

6
MORE BACKGROUND

7
DOI Examples for Pilot Projects
Suffix Model String                           Example

[mission]/[instrument]/data[1-n]              doi: 10.5067/Aura/HIRDLS/data1234
                                              doi: 10.5067/ICESat/GLAS/data1234
                                              doi: 10.5067/Aqua/AMSR-E/data1234
[campaign]/[measurement group]/data[1-n]      doi: 10.5067/BOREAS/Airborne/data1234
[campaign]/[platform group]/data[1-n]
[program]/[measurement group]/data[1-n]       doi: 10.5067/MEaSUREs/OceanFluxes/data1234
                                              doi: 10.5067/MEaSUREs/SnowExtent/data1234
[measurement group]/data[1-n]
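
The suffix models above can be expressed as a small builder. This is only a sketch of the pattern on the slide; `make_suffix` and `make_doi` are hypothetical names, and the checks they apply (non-empty groups, no spaces or slashes inside a group, sequence number of at least 1) are assumptions.

```python
def make_suffix(*groups: str, number: int) -> str:
    """Join groups such as mission and instrument, then append data[1-n]."""
    if not groups:
        raise ValueError("at least one group is required")
    if number < 1:
        raise ValueError("the data sequence number starts at 1")
    for group in groups:
        if not group or "/" in group or " " in group:
            raise ValueError(f"invalid group name: {group!r}")
    return "/".join(groups) + f"/data{number}"


def make_doi(prefix: str, *groups: str, number: int) -> str:
    """Combine a registrant prefix with a model-based suffix."""
    return f"{prefix}/{make_suffix(*groups, number=number)}"


print(make_doi("10.5067", "Aura", "HIRDLS", number=1))
print(make_doi("10.5067", "MEaSUREs", "OceanFluxes", number=1234))
```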

8
DOI Registration and Guidelines
• A DOI will be assigned for each EOSDIS standard data
product
• The DOI subscription holder (ESDIS) will provide location &
citation metadata to the DOI subscription provider (CDL EZID) and
will be notified when the DOI has been registered
– Ideally we want one DOI per data item but the registry
does not preclude multiple registrations of similar data
• New DOI metadata can be uploaded as frequently as desired
– Typically when location or citation information changes
• A major new version of the data product would be assigned a
new DOI. DOIs of old versions that are no longer available
would have updated locators that point to the new version
(with explanation)
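
The versioning rule on this slide can be sketched as an in-memory model in Python. `DoiRegistry` is a hypothetical stand-in for the registration service, and the example.org URLs are placeholders; none of this reflects an actual EZID or ESDIS interface.

```python
from dataclasses import dataclass
from typing import Dict


@dataclass
class DoiRecord:
    doi: str
    location: str   # landing page the DOI resolves to
    note: str = ""  # explanation shown when the locator has been redirected


class DoiRegistry:
    """Toy registry: one record per DOI, with updatable locators."""

    def __init__(self) -> None:
        self._records: Dict[str, DoiRecord] = {}

    def register(self, doi: str, location: str) -> None:
        if doi in self._records:
            raise ValueError(f"{doi} is already registered")
        self._records[doi] = DoiRecord(doi, location)

    def update_location(self, doi: str, location: str, note: str = "") -> None:
        record = self._records[doi]
        record.location = location
        record.note = note

    def supersede(self, old_doi: str, new_doi: str) -> None:
        """Point a withdrawn version at its replacement, with an explanation."""
        new_record = self._records[new_doi]
        self.update_location(old_doi, new_record.location,
                             note=f"Superseded by {new_doi}")

    def resolve(self, doi: str) -> DoiRecord:
        return self._records[doi]


registry = DoiRegistry()
registry.register("10.5067/Aura/HIRDLS/data1", "https://example.org/hirdls/v1")
registry.register("10.5067/Aura/HIRDLS/data2", "https://example.org/hirdls/v2")
registry.supersede("10.5067/Aura/HIRDLS/data1", "10.5067/Aura/HIRDLS/data2")
print(registry.resolve("10.5067/Aura/HIRDLS/data1"))
```

The old DOI stays registered and resolvable, so citations in published papers keep working even after the cited version is withdrawn.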
9
Guidelines for DOI suffix
• The DOI itself should be a relatively short string so that users
can read from printed material or display and key into a
browser with minimum error.
• The DOI suffix (ASCII characters with no spaces):
– Would be a descriptive name of domain-specific structure that reflects
the science data product contents
– Should have some recognition by the research community, such as a
semantic name or acronym, e.g.,
instrument/platform/campaign/investigation name or measurement
parameter
– Should help readers distinguish between published paper and dataset
– Should not have organizational reference subject to change (i.e.,
publisher, archive, owner)
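
The mechanical parts of these guidelines can be checked automatically. The sketch below covers three of them in Python: ASCII only, no spaces, and a short overall length. The 40-character limit and the list of organizational terms are assumptions made for this example; the slide gives no specific values.

```python
from typing import List, Sequence

MAX_SUFFIX_LENGTH = 40               # assumed limit; the guideline only says "relatively short"
ORG_TERMS = ("archive", "publisher")  # illustrative list of organizational references


def check_suffix(suffix: str,
                 max_length: int = MAX_SUFFIX_LENGTH,
                 org_terms: Sequence[str] = ORG_TERMS) -> List[str]:
    """Return a list of guideline problems; an empty list means none were found."""
    problems = []
    if not suffix:
        problems.append("suffix is empty")
    if not suffix.isascii():
        problems.append("suffix contains non-ASCII characters")
    if any(ch.isspace() for ch in suffix):
        problems.append("suffix contains spaces")
    if len(suffix) > max_length:
        problems.append(f"suffix is longer than {max_length} characters")
    lowered = suffix.lower()
    for term in org_terms:
        if term in lowered:
            problems.append(f"suffix refers to an organization: {term}")
    return problems


print(check_suffix("Aura/HIRDLS/data1"))
print(check_suffix("Aura HIRDLS archive"))
```

Whether a suffix is recognizable to the research community cannot be tested this way and still needs human review.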
10
Member Institute using DataCite (RA):
California Digital Library and EZID
• EZID is a service providing researchers a way to manage identifiers
persistently for datasets, files, and resources of all types.
• The service is available via a machine to machine programming
interface (an API) and as a web user interface.
• Core functions:
– Create a persistent identifier: DOI
– Add object location (URL landing page, separate from citation)
– Add citation metadata (DataCite repository, mandatory shown below)
   • Creator (person or organization)
   • Title (long name of dataset)
   • Publisher (holder of the data – organization making it available)
   • Publication Year (year when data was, or will be first available)
– Update object location
– Update object metadata
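
Before a registration request is sent, the four mandatory citation fields and the object location can be checked locally. The Python sketch below models a request as plain data classes; `CitationMetadata` and `DoiRequest` are hypothetical names and do not represent the EZID API. The sample values come from the citation example earlier in the deck, and the landing page URL is a placeholder.

```python
from dataclasses import asdict, dataclass
from typing import List


@dataclass
class CitationMetadata:
    creator: str           # person or organization
    title: str             # long name of the dataset
    publisher: str         # organization making the data available
    publication_year: str  # year the data was, or will be, first available

    def missing(self) -> List[str]:
        """Names of mandatory fields that are still blank."""
        return [name for name, value in asdict(self).items() if not value.strip()]


@dataclass
class DoiRequest:
    doi: str
    location: str          # landing page URL, kept separate from the citation
    citation: CitationMetadata

    def is_ready(self) -> bool:
        return bool(self.doi and self.location) and not self.citation.missing()


request = DoiRequest(
    doi="10.5067/Aura/HIRDLS/data1",
    location="https://example.org/hirdls/landing",
    citation=CitationMetadata("Doe, J. and R. Roe", "The FOO Data Set",
                              "The FOO Data Center", "2001"),
)
print(request.is_ready())
```

Keeping the location outside the citation record mirrors the slide: the landing page can change without touching the citation metadata.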

11
DOI Persistence

12
Registration Agency (RA): DataCite
• DataCite established a scientific data application with IDF.
• Service is run by an open membership organization of gov and edu libraries.
• Focused on improving the scholarly infrastructure around datasets.
• Most appropriate RA because of their focus on working with data centers to assign
persistent identifiers to datasets leveraging the Digital Object Identifier (DOI)
infrastructure.
• United States Member Institutes
– California Digital Library (Founding Member)
   • Recommended subscription provider because of bulk pricing and EZID Web/API services
– Office of Scientific and Technical Information, US
Department of Energy (new Member Dec 2010)
– Purdue University Libraries (Member)
– Interuniversity Consortium for Political and
Social Research - ICPSR (Associate Member)
– Microsoft Research (Associate Member)

TIB: German National Library of Science and Technology
13

Weitere ähnliche Inhalte

Was ist angesagt?

Was ist angesagt? (20)

Tools to improve the usability of NASA HDF Data
Tools to improve the usability of NASA HDF DataTools to improve the usability of NASA HDF Data
Tools to improve the usability of NASA HDF Data
 
HDF-EOS 2/5 to netCDF Converter
HDF-EOS 2/5 to netCDF ConverterHDF-EOS 2/5 to netCDF Converter
HDF-EOS 2/5 to netCDF Converter
 
Introduction to HDF5
Introduction to HDF5Introduction to HDF5
Introduction to HDF5
 
Data Interoperability
Data InteroperabilityData Interoperability
Data Interoperability
 
HDF Group Support for NPP/NPOESS/JPSS
HDF Group Support for NPP/NPOESS/JPSSHDF Group Support for NPP/NPOESS/JPSS
HDF Group Support for NPP/NPOESS/JPSS
 
Product Designer Hub - Taking HPD to the Web
Product Designer Hub - Taking HPD to the WebProduct Designer Hub - Taking HPD to the Web
Product Designer Hub - Taking HPD to the Web
 
HDF and netCDF Data Support in ArcGIS
HDF and netCDF Data Support in ArcGISHDF and netCDF Data Support in ArcGIS
HDF and netCDF Data Support in ArcGIS
 
Images of HDF5
Images of HDF5Images of HDF5
Images of HDF5
 
HDF & HDF-EOS Data & Support at NSIDC
HDF & HDF-EOS Data & Support at NSIDCHDF & HDF-EOS Data & Support at NSIDC
HDF & HDF-EOS Data & Support at NSIDC
 
HDF Update for DAAC Managers (2017-02-27)
HDF Update for DAAC Managers (2017-02-27)HDF Update for DAAC Managers (2017-02-27)
HDF Update for DAAC Managers (2017-02-27)
 
Advanced HDF5 Features
Advanced HDF5 FeaturesAdvanced HDF5 Features
Advanced HDF5 Features
 
Efficiently serving HDF5 via OPeNDAP
Efficiently serving HDF5 via OPeNDAPEfficiently serving HDF5 via OPeNDAP
Efficiently serving HDF5 via OPeNDAP
 
Introduction to NetCDF-4
Introduction to NetCDF-4Introduction to NetCDF-4
Introduction to NetCDF-4
 
Introduction to HDF5 Data Model, Programming Model and Library APIs
Introduction to HDF5 Data Model, Programming Model and Library APIsIntroduction to HDF5 Data Model, Programming Model and Library APIs
Introduction to HDF5 Data Model, Programming Model and Library APIs
 
Hierarchical Data Formats (HDF) Update
Hierarchical Data Formats (HDF) UpdateHierarchical Data Formats (HDF) Update
Hierarchical Data Formats (HDF) Update
 
Moving form HDF4 to HDF5/netCDF-4
Moving form HDF4 to HDF5/netCDF-4Moving form HDF4 to HDF5/netCDF-4
Moving form HDF4 to HDF5/netCDF-4
 
HDF5 FastQuery
HDF5 FastQueryHDF5 FastQuery
HDF5 FastQuery
 
HDF Tools Tutorial
HDF Tools TutorialHDF Tools Tutorial
HDF Tools Tutorial
 
NEON HDF5
NEON HDF5NEON HDF5
NEON HDF5
 
Open-source Scientific Computing and Data Analytics using HDF
Open-source Scientific Computing and Data Analytics using HDFOpen-source Scientific Computing and Data Analytics using HDF
Open-source Scientific Computing and Data Analytics using HDF
 

Andere mochten auch

Rabbi hirschberg
Rabbi hirschbergRabbi hirschberg
Rabbi hirschbergparole1
 
112233
112233112233
112233iliawa
 

Andere mochten auch (18)

Bw backup
Bw backupBw backup
Bw backup
 
Rabbi hirschberg
Rabbi hirschbergRabbi hirschberg
Rabbi hirschberg
 
Tester123
Tester123Tester123
Tester123
 
112233
112233112233
112233
 
1234
12341234
1234
 
Bridging ICESat and ICESat-2 Standard Data Products
Bridging ICESat and ICESat-2 Standard Data ProductsBridging ICESat and ICESat-2 Standard Data Products
Bridging ICESat and ICESat-2 Standard Data Products
 
Using IDL with Suomi NPP VIIRS Data
Using IDL with Suomi NPP VIIRS DataUsing IDL with Suomi NPP VIIRS Data
Using IDL with Suomi NPP VIIRS Data
 
HDF Tools Updates and Discussions
HDF Tools Updates and DiscussionsHDF Tools Updates and Discussions
HDF Tools Updates and Discussions
 
GES DISC Eexperiences with HDF Formats for MEaSUREs Projects
GES DISC Eexperiences with HDF Formats for MEaSUREs ProjectsGES DISC Eexperiences with HDF Formats for MEaSUREs Projects
GES DISC Eexperiences with HDF Formats for MEaSUREs Projects
 
HDF OPeNDAP Project Update and Demo
HDF OPeNDAP Project Update and DemoHDF OPeNDAP Project Update and Demo
HDF OPeNDAP Project Update and Demo
 
HDF-EOS to GeoTIFF Conversion Tool & HDF-EOS Plug-in for HDFView
HDF-EOS to GeoTIFF Conversion Tool & HDF-EOS Plug-in for HDFViewHDF-EOS to GeoTIFF Conversion Tool & HDF-EOS Plug-in for HDFView
HDF-EOS to GeoTIFF Conversion Tool & HDF-EOS Plug-in for HDFView
 
Earth Science Data and Information System (ESDIS) Project Update
Earth Science Data and Information System (ESDIS) Project UpdateEarth Science Data and Information System (ESDIS) Project Update
Earth Science Data and Information System (ESDIS) Project Update
 
Status of HDF-EOS, Related Software and Tools
 Status of HDF-EOS, Related Software and Tools Status of HDF-EOS, Related Software and Tools
Status of HDF-EOS, Related Software and Tools
 
Granules Are Forever
Granules Are ForeverGranules Are Forever
Granules Are Forever
 
HDF Project Status and Plans
HDF Project Status and PlansHDF Project Status and Plans
HDF Project Status and Plans
 
2011 ACSI Survey Summary
2011 ACSI Survey Summary2011 ACSI Survey Summary
2011 ACSI Survey Summary
 
Web-based On-demand Global NDVI Data Services
Web-based On-demand Global NDVI Data ServicesWeb-based On-demand Global NDVI Data Services
Web-based On-demand Global NDVI Data Services
 
Data Storage for Remote Monitoring of CAT Machines Using HDF
Data Storage for Remote Monitoring of CAT Machines Using HDFData Storage for Remote Monitoring of CAT Machines Using HDF
Data Storage for Remote Monitoring of CAT Machines Using HDF
 

Ähnlich wie Digital Object Identifiers for EOSDIS data

RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsCarole Goble
 
DSpace-CRIS Workshop OR2015: Slides
DSpace-CRIS Workshop OR2015: SlidesDSpace-CRIS Workshop OR2015: Slides
DSpace-CRIS Workshop OR2015: SlidesAndrea Bollini
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsAnita de Waard
 
Metadata lecture(9 17-14)
Metadata lecture(9 17-14)Metadata lecture(9 17-14)
Metadata lecture(9 17-14)mhb120
 
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...OpenAIRE
 
X.500 More Than a Global Directory
X.500 More Than a Global DirectoryX.500 More Than a Global Directory
X.500 More Than a Global Directorylurdhu agnes
 
DSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRISDSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRISAndrea Bollini
 
DSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRISDSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRIS4Science
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout Carole Goble
 
DataFinder concepts and example: General (20100503)
DataFinder concepts and example: General (20100503)DataFinder concepts and example: General (20100503)
DataFinder concepts and example: General (20100503)Data Finder
 
VRE Cancer Imaging BL RIC Workshop 22032011
VRE Cancer Imaging BL RIC Workshop 22032011VRE Cancer Imaging BL RIC Workshop 22032011
VRE Cancer Imaging BL RIC Workshop 22032011djmichael156
 
Resource discovery and information sharing: reaching the 2.0 turn
Resource discovery and information sharing: reaching the 2.0 turnResource discovery and information sharing: reaching the 2.0 turn
Resource discovery and information sharing: reaching the 2.0 turnBonaria Biancu
 
EUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan BroederEUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan BroederOpenAIRE
 
Metadata lecture 3, metadata schemes
Metadata lecture 3, metadata schemesMetadata lecture 3, metadata schemes
Metadata lecture 3, metadata schemesRichard.Sapon-White
 
Metadata and Tagging
Metadata and TaggingMetadata and Tagging
Metadata and Taggingpauloshea
 
Hypatia for dlf 2011
Hypatia for dlf 2011Hypatia for dlf 2011
Hypatia for dlf 2011DLFCLIR
 
The JISC Information Environment and collection description
The JISC Information Environment and collection descriptionThe JISC Information Environment and collection description
The JISC Information Environment and collection descriptionAndy Powell
 
DSpace-CRIS: a CRIS enhanced repository platform
DSpace-CRIS: a CRIS enhanced repository platformDSpace-CRIS: a CRIS enhanced repository platform
DSpace-CRIS: a CRIS enhanced repository platformAndrea Bollini
 

Ähnlich wie Digital Object Identifiers for EOSDIS data (20)

RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research Objects
 
DSpace-CRIS Workshop OR2015: Slides
DSpace-CRIS Workshop OR2015: SlidesDSpace-CRIS Workshop OR2015: Slides
DSpace-CRIS Workshop OR2015: Slides
 
CNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data CommonsCNI 2018: A Research Object Authoring Tool for the Data Commons
CNI 2018: A Research Object Authoring Tool for the Data Commons
 
Fedora
FedoraFedora
Fedora
 
Metadata lecture(9 17-14)
Metadata lecture(9 17-14)Metadata lecture(9 17-14)
Metadata lecture(9 17-14)
 
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
 
X.500 More Than a Global Directory
X.500 More Than a Global DirectoryX.500 More Than a Global Directory
X.500 More Than a Global Directory
 
DSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRISDSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRIS
 
DSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRISDSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRIS
 
Understanding data -latest
Understanding data  -latestUnderstanding data  -latest
Understanding data -latest
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout
 
DataFinder concepts and example: General (20100503)
DataFinder concepts and example: General (20100503)DataFinder concepts and example: General (20100503)
DataFinder concepts and example: General (20100503)
 
VRE Cancer Imaging BL RIC Workshop 22032011
VRE Cancer Imaging BL RIC Workshop 22032011VRE Cancer Imaging BL RIC Workshop 22032011
VRE Cancer Imaging BL RIC Workshop 22032011
 
Resource discovery and information sharing: reaching the 2.0 turn
Resource discovery and information sharing: reaching the 2.0 turnResource discovery and information sharing: reaching the 2.0 turn
Resource discovery and information sharing: reaching the 2.0 turn
 
EUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan BroederEUDAT data architecture and interoperability aspects – Daan Broeder
EUDAT data architecture and interoperability aspects – Daan Broeder
 
Metadata lecture 3, metadata schemes
Metadata lecture 3, metadata schemesMetadata lecture 3, metadata schemes
Metadata lecture 3, metadata schemes
 
Metadata and Tagging
Metadata and TaggingMetadata and Tagging
Metadata and Tagging
 
Hypatia for dlf 2011
Hypatia for dlf 2011Hypatia for dlf 2011
Hypatia for dlf 2011
 
The JISC Information Environment and collection description
The JISC Information Environment and collection descriptionThe JISC Information Environment and collection description
The JISC Information Environment and collection description
 
DSpace-CRIS: a CRIS enhanced repository platform
DSpace-CRIS: a CRIS enhanced repository platformDSpace-CRIS: a CRIS enhanced repository platform
DSpace-CRIS: a CRIS enhanced repository platform
 

Mehr von The HDF-EOS Tools and Information Center

STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...The HDF-EOS Tools and Information Center
 

Mehr von The HDF-EOS Tools and Information Center (20)

Cloud-Optimized HDF5 Files
Cloud-Optimized HDF5 FilesCloud-Optimized HDF5 Files
Cloud-Optimized HDF5 Files
 
Accessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDSAccessing HDF5 data in the cloud with HSDS
Accessing HDF5 data in the cloud with HSDS
 
The State of HDF
The State of HDFThe State of HDF
The State of HDF
 
Highly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance FeaturesHighly Scalable Data Service (HSDS) Performance Features
Highly Scalable Data Service (HSDS) Performance Features
 
Creating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 FilesCreating Cloud-Optimized HDF5 Files
Creating Cloud-Optimized HDF5 Files
 
HDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance DiscussionHDF5 OPeNDAP Handler Updates, and Performance Discussion
HDF5 OPeNDAP Handler Updates, and Performance Discussion
 
Hyrax: Serving Data from S3
Hyrax: Serving Data from S3Hyrax: Serving Data from S3
Hyrax: Serving Data from S3
 
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLABAccessing Cloud Data and Services Using EDL, Pydap, MATLAB
Accessing Cloud Data and Services Using EDL, Pydap, MATLAB
 
HDF - Current status and Future Directions
HDF - Current status and Future DirectionsHDF - Current status and Future Directions
HDF - Current status and Future Directions
 
HDFEOS.org User Analsys, Updates, and Future
HDFEOS.org User Analsys, Updates, and FutureHDFEOS.org User Analsys, Updates, and Future
HDFEOS.org User Analsys, Updates, and Future
 
HDF - Current status and Future Directions
HDF - Current status and Future Directions HDF - Current status and Future Directions
HDF - Current status and Future Directions
 
H5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only LibraryH5Coro: The Cloud-Optimized Read-Only Library
H5Coro: The Cloud-Optimized Read-Only Library
 
MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10MATLAB Modernization on HDF5 1.10
MATLAB Modernization on HDF5 1.10
 
HDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDFHDF for the Cloud - Serverless HDF
HDF for the Cloud - Serverless HDF
 
HDF5 <-> Zarr
HDF5 <-> ZarrHDF5 <-> Zarr
HDF5 <-> Zarr
 
HDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server FeaturesHDF for the Cloud - New HDF Server Features
HDF for the Cloud - New HDF Server Features
 
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
Apache Drill and Unidata THREDDS Data Server for NASA HDF-EOS on S3
 
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
STARE-PODS: A Versatile Data Store Leveraging the HDF Virtual Object Layer fo...
 
HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?HDF5 and Ecosystem: What Is New?
HDF5 and Ecosystem: What Is New?
 
HDF5 Roadmap 2019-2020
HDF5 Roadmap 2019-2020HDF5 Roadmap 2019-2020
HDF5 Roadmap 2019-2020
 

Kürzlich hochgeladen

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 

Kürzlich hochgeladen (20)

CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker

Digital Object Identifiers for EOSDIS data

  • 1. Digital Object Identifiers for EOSDIS data. HDF Workshop, April 17, 2012. John Moses, ESDIS, John.f.moses@nasa.gov
  • 2. Assessment of identification schemes
      • Study by ESIP Cluster on Preservation and Stewardship in 2009
      • Table (cell ratings not preserved in the text): each ID scheme is rated, for data sets and for items, as Unique Identifier, Unique Locator, Citable Locator, and Scientifically Unique ID
      • ID schemes assessed: URL/N/I, PURL, XRI, Handle, DOI, ARK, LSID, OID, UUID
      • Adapted from Duerr, R. E., et al. 2011 (submitted). On the utility of identification schemes for digital Earth science data: An assessment and recommendations. Earth Science Informatics.
  • 3. Digital Object Identifier for EOS products
      • The DOI® system and the Handle System provide an Internet resolution service for unique and persistent identifiers of digital objects
          – Internet infrastructure components owned by the International DOI Foundation (IDF), www.doi.org
      • A DOI is a two-part alphanumeric string
          – doi:[prefix]/[suffix]; for example, doi:10.5067/123
          – Prefix: 10 identifies the DOI registry; 5067 identifies the registrant
          – Suffix: the alphanumeric string 123 uniquely identifies the data item
      • The purpose in assigning DOIs to EOSDIS products is to provide a permanent data identifier for citation in publications
          – ESIP citation guideline using doi:
          – Doe, J. and R. Roe. 2001. The FOO Data Set. Version 2.3. The FOO Data Center. http://dx.doi.org/10.xxxx/notfoo.547983. Accessed 1 May 2011.
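The two-part structure on slide 3 can be illustrated with a minimal Python sketch. The names `Doi`, `parse_doi`, and `resolver_url` are hypothetical and chosen only for this example; the sketch assumes that the first slash separates prefix from suffix and that the resolver link uses the `http://dx.doi.org/` form shown in the ESIP citation example.

```python
from typing import NamedTuple


class Doi(NamedTuple):
    prefix: str  # e.g. "10.5067"
    suffix: str  # e.g. "123" or "Aura/HIRDLS/data1"


def parse_doi(text: str) -> Doi:
    """Split 'doi:[prefix]/[suffix]' into its two parts."""
    value = text.strip()
    if value.lower().startswith("doi:"):
        value = value[4:].strip()
    # Only the first slash separates prefix and suffix;
    # the suffix itself may contain further slashes.
    prefix, sep, suffix = value.partition("/")
    if not sep or not suffix or not prefix.startswith("10.") or len(prefix) <= 3:
        raise ValueError(f"not a DOI: {text!r}")
    return Doi(prefix, suffix)


def resolver_url(doi: Doi) -> str:
    """Build the link form used in the ESIP citation example."""
    return f"http://dx.doi.org/{doi.prefix}/{doi.suffix}"
```

For instance, `parse_doi("doi: 10.5067/123")` yields prefix `10.5067` and suffix `123`, and `resolver_url` turns that pair into a citable link.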
  • 4. Implementing DOIs for EOSDIS
      – Develop ops concept through pilot processes
          • Guidelines for DOI suffix, location & citation information
          • Request, assign, monitor DOIs, location & citation metadata
          • Add DOIs to DAAC product citation web pages
          • Embed DOIs into product metadata at next reprocessing
              – HIRDLS, GLAS, AMSR-E data providers are in final reprocessing
          • Add DOIs to GCMD and ECHO through metadata updates
          • Add DOI metadata to NTRS for searchable documentation
          • Set up metrics collection from journal citation reports
  • 6. Attributes for embedding DOIs
      • Framework structures in HDF and netCDF
          – HDF global attribute name and value versus naming an identifier group (which would allow discovery of identifier types)
          – ECS CoreMetadata Product Specific Attributes in the AdditionalAttributes group section
          – netCDF file-level attribute name: “Id” and “naming authority”
      • Consider attribute names for DOI value:
          – Advantage to having two parts: a key code to indicate that this is an identifier, and a namespace that indicates the type/application of the DOI; e.g., that it applies at the data product level (i.e., has the same value for all granules/files of the series, making it a series identifier)
      • Hypothetical DOI example
          – Attribute name: identifier_product_DOI
          – Attribute value: 10.5067/Aura/HIRDLS/data1
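As a rough illustration of the two-part naming idea on slide 6, the sketch below composes an attribute name from a key code and a namespace, then uses the key code to discover identifier attributes. A plain Python dict stands in for a file's global attributes; the function names and the `long_name` sample attribute are assumptions made for this example and are not part of any HDF or netCDF API.

```python
from typing import Dict


def doi_attribute_name(scope: str, key_code: str = "identifier",
                       id_type: str = "DOI") -> str:
    """Compose a name such as 'identifier_product_DOI' (hypothetical scheme)."""
    return f"{key_code}_{scope}_{id_type}"


def find_identifiers(attrs: Dict[str, str],
                     key_code: str = "identifier") -> Dict[str, str]:
    """Return only the attributes whose names start with the key code."""
    marker = key_code + "_"
    return {name: value for name, value in attrs.items()
            if name.startswith(marker)}


# A dict standing in for file-level (global) attributes.
global_attrs = {
    "long_name": "sample product",  # made-up, non-identifier attribute
    doi_attribute_name("product"): "10.5067/Aura/HIRDLS/data1",
}
```

The key code is what makes discovery possible: a reader that knows nothing about DOIs can still list every attribute that claims to be an identifier.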
  • 8. DOI Examples for Pilot Projects
      • Suffix model strings
          – [mission]/[instrument]/data[1-n]
          – [campaign]/[measurement group]/data[1-n]
          – [campaign]/[platform group]/data[1-n]
          – [program]/[measurement group]/data[1-n]
          – [measurement group]/data[1-n]
      • Examples
          – doi:10.5067/Aura/HIRDLS/data1234
          – doi:10.5067/ICESat/GLAS/data1234
          – doi:10.5067/Aqua/AMSR-E/data1234
          – doi:10.5067/BOREAS/Airborne/data1234
          – doi:10.5067/MEaSUREs/OceanFluxes/data1234
          – doi:10.5067/MEaSUREs/SnowExtent/data1234
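The three-part suffix models on slide 8 could be generated along the following lines. This is only a sketch: `pilot_doi` and its parameter names are hypothetical, and the default prefix `10.5067` is taken from the slide's examples.

```python
def pilot_doi(group: str, subgroup: str, number: int,
              prefix: str = "10.5067") -> str:
    """Build '[prefix]/[group]/[subgroup]/data[n]' as in the pilot examples.

    'group' may be a mission, campaign, or program; 'subgroup' an
    instrument, measurement group, or platform group.
    """
    if number < 1:
        raise ValueError("data number must be 1 or greater")
    for part in (group, subgroup):
        # Each component must be non-empty and free of slashes and spaces,
        # otherwise the three-level structure would be ambiguous.
        if not part or "/" in part or any(ch.isspace() for ch in part):
            raise ValueError(f"invalid suffix component: {part!r}")
    return f"{prefix}/{group}/{subgroup}/data{number}"
```

Calling `pilot_doi("Aura", "HIRDLS", 1234)` reproduces the first example string from the slide.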
  • 9. DOI Registration and Guidelines
      • A DOI will be assigned for each EOSDIS standard data product
      • The DOI subscription holder (ESDIS) will provide location & citation metadata to the DOI subscription provider (CDL EZID) and will be notified when the DOI has been registered
          – Ideally we want one DOI per data item, but the registry does not preclude multiple registrations of similar data
      • New DOI metadata can be uploaded as frequently as desired
          – Typically when location or citation information changes
      • A major new version of the data product would be assigned a new DOI. DOIs of old versions that are no longer available would have updated locators that point to the new version (with explanation)
  • 10. Guidelines for DOI suffix
      • The DOI itself should be a relatively short string so that users can read it from printed material or a display and key it into a browser with minimum error.
      • The DOI suffix (ASCII characters with no spaces):
          – Would be a descriptive name with a domain-specific structure that reflects the science data product contents
          – Should have some recognition by the research community, such as a semantic name or acronym, e.g., instrument/platform/campaign/investigation name or measurement parameter
          – Should help readers distinguish between a published paper and a dataset
          – Should not contain an organizational reference subject to change (i.e., publisher, archive, owner)
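The mechanical parts of the suffix guidelines on slide 10 (ASCII, no spaces, relatively short) lend themselves to an automated check; the semantic guidelines do not. The sketch below is illustrative only: `suffix_problems` is a hypothetical helper, and the 40-character limit is an assumed threshold, since the slide gives no number.

```python
from typing import List


def suffix_problems(suffix: str, max_length: int = 40) -> List[str]:
    """Return the guideline violations that can be checked mechanically."""
    problems = []
    if not suffix:
        problems.append("suffix is empty")
    if not suffix.isascii():
        problems.append("contains non-ASCII characters")
    if any(ch.isspace() for ch in suffix):
        problems.append("contains whitespace")
    if len(suffix) > max_length:  # assumed limit, not from the slides
        problems.append(f"longer than {max_length} characters")
    return problems
```

An empty list means the suffix passes these checks; whether it is descriptive, recognizable, and free of organizational references still needs human review.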
  • 11. Member Institute using DataCite (RA): California Digital Library and EZID
      • EZID is a service providing researchers a way to manage identifiers persistently for datasets, files, and resources of all types.
      • The service is available via a machine-to-machine programming interface (an API) and as a web user interface.
      • Core functions:
          – Create a persistent identifier: DOI
          – Add object location (URL landing page, separate from citation)
          – Add citation metadata (DataCite repository; mandatory elements shown below)
              • Creator (person or organization)
              • Title (long name of dataset)
              • Publisher (holder of the data: organization making it available)
              • Publication Year (year when data was, or will be, first available)
          – Update object location
          – Update object metadata
  • 13. Registration Agent: DataCite
      • DataCite established a scientific data application with IDF.
      • Service is run by an open membership organization of gov and edu libraries.
      • Focused on improving the scholarly infrastructure around datasets.
      • Most appropriate RA because of their focus on working with data centers to assign persistent identifiers to datasets, leveraging the Digital Object Identifier (DOI) infrastructure.
      • TIB: German National Library of Science and Technology
      • United States Member Institutes
          – California Digital Library (Founding Member): recommended subscription provider because of bulk pricing and EZID Web/API services
          – Office of Scientific and Technical Information, US Department of Energy (new Member Dec 2010)
          – Purdue University Libraries (Member)
          – Interuniversity Consortium for Political and Social Research - ICPSR (Associate Member)
          – Microsoft Research (Associate Member)

Editor's notes

  1. Debate in the LSID community weakens it: an LSID is a locator, but the ObjectID part of it is also an identifier, and most people use a UUID for the ObjectID part. OID problem. ARK is a bit better than the rest of the locators because it has additional trust value ... maybe the color should have been more orange than yellow, but I didn’t want to add more colors.
  2. The Handle System was developed by the Corporation for National Research Initiatives; the original version was developed with DARPA. A standard data locator accepted for use in published literature. Provides an actionable, interoperable, persistent link through the use of identifier syntax and a network resolution mechanism, i.e., the Handle System. Currently standardised in ISO (via ISO TC46/SC9, the home of ISBN, URI etc. “content identifiers”); syntax through ANSI/NISO standard Z39.84-2005. Implemented through a federation of Registration Agencies (RAs), under policies and common infrastructure provided by the IDF, which controls the system. US “not for profit” open membership (with $35k annual membership fee). RAs have obligations for persistence and back-up in the event of failure, and must sign an agreement for use of the DOI System. RAs pay operational fees to IDF’s technical operator for registering and maintaining DOI names (sliding scale per volume). Allows the transition of the management of persistent identifiers between RAs. Current best known applications are www.crossref.org and www.datacite.org. One DOI per unique data item; seems like a first-come, first-served approach. How to avoid multiple registrations for the same product is TBD. The suffix is an alphanumeric string and has no special significance to the DOI system other than uniqueness and permanence.
  3. Working with California Digital Library (Joan Starr), Dept. of Energy (Sharon Jordan), and NASA Scientific & Technical Information (Gerald Steeman)
  4. Metadata: descriptions that assign meaning to the data product. DOI added by DAAC to product metadata sent to GCMD and ECHO. DOI embedded into product metadata by the science product generation system in the next reprocessing campaign. RE: NSIDC chart, Metadata Evolution for NASA Data Systems (MENDS). Metadata for some instrument products is entirely contained in sections of the product data files. Extensions must be kept in separate files and linked to the product files. Provenance collection (e.g., DOI) could be accomplished at various places and times; support for provenance services is TBD. DOI could be inserted into granules or into collection- or granule-level metadata, and added to technical documentation in NTRS to allow DOI-based queries.
  6. The suffix is an alpha-numeric string and has no special significance to the DOI system other than uniqueness and permanence.
  7. Founding Members: the British Library; the Technical Information Center of Denmark; TU Delft Library; the National Research Council’s Canada Institute for Scientific and Technical Information (NRC-CISTI); California Digital Library; Purdue University; and the German National Library of Science and Technology. Membership: DataCite has two levels of participation: full membership and associate membership. Full membership is geared towards national libraries and data centers, while associate membership is open to a broader group of organizations who support the aims and interests of DataCite. Managing Agent: TIB defers to local DataCite Member who provides access to DataCite service for minting DOIs.