SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
Functional and Architectural
           Requirements for Metadata:
                  Supporting Discovery and
                 Management of Scientific Data

      Jian Qin                Alex Ball                Jane Greenberg
School of Information   Digital Curation Centre    School of Information and
       Studies                   UKOLN                  Library Science
 Syracuse University      University of Bath      University of North Caroline
         USA                       UK                  Chapel Hill, USA



          International Conference of Dublin Core Metadata
                Kuching, Malaysia, September 5, 2012
Metadata standards for scientific data

                     Ecological
                     Metadata
                     Language (EML)




CSDGM

                                Access to Biological
                                Collections Data –
                                ABCD

                                Darwin Core
Many tools have been developed…
Many tools have been developed…
Many tools have been developed…
Motivation for adoption
• Standardize the format and terminology of
  metadata
• Enable fast and effective discovery of datasets
  across different data repositories
• Enable data sharing, reuse, and preservation
• Provide information for obtaining datasets
  from the data owners
Hindering factors


• large numbers of           • steep learning curve
  elements                   • difficult to automate
• many layers in structure     metadata generation
   – “Unwieldy to apply”     • unnecessary duplicate
• been created for             data entry
  manual data entry,         • high costs in time,
  rather than for              resource, and
  automatic generation         personnel expertise
Same entity data repeated
                 in the same record…
Seamless Daily Precipitation for the Conterminous United States

Metadata:
Identification_Information
Data_Quality_Information
Spatial_Data_Organization_Information
Spatial_Reference_Information
Entity_and_Attribute_Information
Distribution_Information
Metadata_Reference_Information
…and they are already in…
Research questions

• What functions do metadata standards for
  scientific data serve?

• How should metadata standards for scientific
  data be modeled to support these functions
  by meeting the associated requirements?
Functions expected
• Resource discovery and use,
• Data interoperability,
• Automatic and semi-automatic metadata
  generation,
• Linking of publications and underlying
  datasets,
• Data/metadata quality control, and
• Data security.
Metadata requirements for
     scientific data
Architectural view
Identity metadata


• Person: researcherID,     • Name repositories
  URI                       • Linked data architecture
• Institution: ORCID, URI   • Customizable research
• Data object: DOI,           group/community
  Handle, URI                 member name lists
• Associated publication:
  DOI
Semantic metadata
• Large semantic resources available in linked data format, but
  usually not suitable for representing scientific data because
  they are designed for publications, especially books and
  journals (containers)
• Format is contemporary but the content is far from it



     Smaller, specialized semantic
resources are necessary for automatic
    semantic metadata generation
Contextual
                             metadata
• Provenance

               Provenance data model
               (W3C, http://www.w3.org/TR/prov-primer/)

               Provenance data represent the origins of
               digital objects and describe the entities and
               activities involved in producing and delivering
               or otherwise influencing a given object.
Geospatial metadata
                 Biological sciences                                    Climate
                                        Ecological
                       Darwin           Metadata
  Biological            Core                                             NetCDF
 Data Profile                           Language
                       (DwC)              (EML)                       Climate and
                                                                      Forecast (CF)
                                Georeferencing                         Metadata
  Shoreline                     elements                              Conventions
  Metadata
   Profile                        FGDC               Georeferencing
                                                     elements           Astronomy
CSDGM Profiles
                                 CSDGM
                                                                         Astronomy
                                                                        Visualization
                                                                         Metadata
                           ISO 19115: 2003                                Standard
                        Geographic information -
                              Metadata.
                                       18
Temporal metadata

•   Mean solar time    • Different measurement
•   Civil time           systems result in
•   GPS time             different units and
                         format
•   Terrestrial time
                       • Conversion between
•   Atomic time          systems

• Geologic time
• The least effort principle
• The infrastructure service principle
• The portable principle
TABLE 1. Mapping data user tasks with metadata functions and
                     architectural building blocks

Data          Metadata function       Architectural building block
user
tasks
Discover    Descriptive metadata Identity and semantic metadata
Identify    Descriptive metadata Identity metadata
Select      Descriptive, technical
                                 Identity, semantic, scientific
            metadata             context, geospatial, temporal,
                                 miscellany metadata
Obtain      Descriptive metadata Identify metadata
Verify      Descriptive metadata Scientific context metadata
Analyze                          Scientific context, geospatial, and
                                 temporal metadata
Manage      Descriptive,         Identify, semantic, scientific
            administrative,      context, geospatial, temporal,
            structural, and      miscellany metadata
            technical metadata
Archive     Descriptive,         Identify, semantic, scientific
            administrative,      context, geospatial, temporal,
            structural, and      miscellany metadata
            technical metadata
Publish     Descriptive metadata Identity, semantic, scientific
                                 context, geospatial, and temporal
                                 metadata
Cite        Descriptive metadata Identify metadata
Development potentials
• An infrastructure of metadata services
  – Entities as linked data
  – Tools for “slicing” members by research group,
    community, or institution to customize the entity
    set
  – Tools for grabbing entity data from existing
    resources through interoperability protocols
Application scenarios
• Cross-domain discovery and verification
• Automatically populating entity information
  from customized slices of entities into
  metadata records
• And more…
Conclusion
• Scientific data are inherently complex and diverse
• Functional metadata requirements should be
  translated into an effective and efficient architecture
   – Three principles for modeling metadata for
     scientific data
• Metadata for scientific data (or other domains at
  large) should adopt an infrastructure service approach
• Much to be explored, experimented, and evaluated
Thank you!

Questions?

Weitere ähnliche Inhalte

Was ist angesagt?

Research Data Management and Librarians
Research Data Management and LibrariansResearch Data Management and Librarians
Research Data Management and LibrariansJohann van Wyk
 
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...TERN Australia
 
The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...ManjulaPatel
 
UK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalfaceUK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalfaceLizLyon
 
D paul ecn2013
D paul ecn2013D paul ecn2013
D paul ecn2013ECNOfficer
 
ICPSR Data Exploration Tools
ICPSR Data Exploration ToolsICPSR Data Exploration Tools
ICPSR Data Exploration ToolsICPSR
 
Disciplinary RDM
Disciplinary RDMDisciplinary RDM
Disciplinary RDMSarah Jones
 
Best practices data collection
Best practices data collectionBest practices data collection
Best practices data collectionSherry Lake
 
An analysis and characterization of DMPs in NSF proposals from the University...
An analysis and characterization of DMPs in NSF proposals from the University...An analysis and characterization of DMPs in NSF proposals from the University...
An analysis and characterization of DMPs in NSF proposals from the University...Megan O'Donnell
 
RDAP14: Maryann Martone, Keynote, The Neuroscience Information Framework
RDAP14: Maryann Martone, Keynote, The Neuroscience Information FrameworkRDAP14: Maryann Martone, Keynote, The Neuroscience Information Framework
RDAP14: Maryann Martone, Keynote, The Neuroscience Information FrameworkASIS&T
 
Linking Data to Publications through Citation and Virtual Archives
Linking Data to Publications through Citation and Virtual ArchivesLinking Data to Publications through Citation and Virtual Archives
Linking Data to Publications through Citation and Virtual ArchivesMicah Altman
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycleMarieke Guy
 
Basics of Research Data Management
Basics of Research Data ManagementBasics of Research Data Management
Basics of Research Data ManagementOpenAIRE
 
The NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAGThe NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAGPhilip Bourne
 
Introduction to DATS v2.2 - NIH May 2017
Introduction to DATS v2.2 - NIH May 2017Introduction to DATS v2.2 - NIH May 2017
Introduction to DATS v2.2 - NIH May 2017Susanna-Assunta Sansone
 

Was ist angesagt? (20)

NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
NISO Virtual Conference Scientific Data Management: Caring for Your Instituti...
 
Research Data Management and Librarians
Research Data Management and LibrariansResearch Data Management and Librarians
Research Data Management and Librarians
 
Organising and Documenting Data
Organising and Documenting DataOrganising and Documenting Data
Organising and Documenting Data
 
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
Stuart Phinn_Many kinds of infrastructure: resolving and advancing ecosystem ...
 
The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...The Role of OAIS Representation Information in the Digital Curation of Crysta...
The Role of OAIS Representation Information in the Digital Curation of Crysta...
 
Research data life cycle
Research data life cycleResearch data life cycle
Research data life cycle
 
UK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalfaceUK Digital Curation Centre: enabling research data management at the coalface
UK Digital Curation Centre: enabling research data management at the coalface
 
D paul ecn2013
D paul ecn2013D paul ecn2013
D paul ecn2013
 
ICPSR Data Exploration Tools
ICPSR Data Exploration ToolsICPSR Data Exploration Tools
ICPSR Data Exploration Tools
 
Disciplinary RDM
Disciplinary RDMDisciplinary RDM
Disciplinary RDM
 
Best practices data collection
Best practices data collectionBest practices data collection
Best practices data collection
 
Putnam Data Quality and the IR
Putnam Data Quality and the IRPutnam Data Quality and the IR
Putnam Data Quality and the IR
 
An analysis and characterization of DMPs in NSF proposals from the University...
An analysis and characterization of DMPs in NSF proposals from the University...An analysis and characterization of DMPs in NSF proposals from the University...
An analysis and characterization of DMPs in NSF proposals from the University...
 
RDAP14: Maryann Martone, Keynote, The Neuroscience Information Framework
RDAP14: Maryann Martone, Keynote, The Neuroscience Information FrameworkRDAP14: Maryann Martone, Keynote, The Neuroscience Information Framework
RDAP14: Maryann Martone, Keynote, The Neuroscience Information Framework
 
Linking Data to Publications through Citation and Virtual Archives
Linking Data to Publications through Citation and Virtual ArchivesLinking Data to Publications through Citation and Virtual Archives
Linking Data to Publications through Citation and Virtual Archives
 
Managing data throughout the research lifecycle
Managing data throughout the research lifecycleManaging data throughout the research lifecycle
Managing data throughout the research lifecycle
 
Basics of Research Data Management
Basics of Research Data ManagementBasics of Research Data Management
Basics of Research Data Management
 
The NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAGThe NIH as a Digital Enterprise: Implications for PAG
The NIH as a Digital Enterprise: Implications for PAG
 
Working with Global Infrastructure at a National Level
Working with Global Infrastructure at a National LevelWorking with Global Infrastructure at a National Level
Working with Global Infrastructure at a National Level
 
Introduction to DATS v2.2 - NIH May 2017
Introduction to DATS v2.2 - NIH May 2017Introduction to DATS v2.2 - NIH May 2017
Introduction to DATS v2.2 - NIH May 2017
 

Andere mochten auch

Aligning library services with emerging research data needs
Aligning library services with emerging research data needsAligning library services with emerging research data needs
Aligning library services with emerging research data needsAndrew Sallans
 
DataUp Presentation at Cal Poly
DataUp Presentation at Cal PolyDataUp Presentation at Cal Poly
DataUp Presentation at Cal PolyCarly Strasser
 
Embrace The Chaos
Embrace The ChaosEmbrace The Chaos
Embrace The Chaosjonphipps
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management EcosystemJohn Kunze
 
MozCon 2013 Recap - Day One
MozCon 2013 Recap - Day OneMozCon 2013 Recap - Day One
MozCon 2013 Recap - Day OneKane Jamison
 
RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata
RDAP13 Jian Qin: Functional and Architectural Requirements for MetadataRDAP13 Jian Qin: Functional and Architectural Requirements for Metadata
RDAP13 Jian Qin: Functional and Architectural Requirements for MetadataASIS&T
 
DCMI/RDA Task Group Report, DC-2010 Pittsburgh
DCMI/RDA Task Group Report, DC-2010 PittsburghDCMI/RDA Task Group Report, DC-2010 Pittsburgh
DCMI/RDA Task Group Report, DC-2010 PittsburghDiane Hillmann
 
Dagstuhl FOAF history talk
Dagstuhl FOAF history talkDagstuhl FOAF history talk
Dagstuhl FOAF history talkDan Brickley
 
Digital Curation: Definitions, Tools, and Strategies
Digital Curation: Definitions, Tools, and StrategiesDigital Curation: Definitions, Tools, and Strategies
Digital Curation: Definitions, Tools, and StrategiesDavid Kelly
 
Unicorns and Other Wild Things
Unicorns and Other Wild ThingsUnicorns and Other Wild Things
Unicorns and Other Wild ThingsAlberta Soranzo
 

Andere mochten auch (20)

Chapter 3
Chapter 3Chapter 3
Chapter 3
 
Library Linked Data
Library Linked DataLibrary Linked Data
Library Linked Data
 
NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...
NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...
NISO/DCMI Webinar: International Bibliographic Standards, Linked Data, and th...
 
NISO/DCMI Webinar: Metadata Harmonization: Making Standards Work Together
NISO/DCMI Webinar: Metadata Harmonization: Making Standards Work TogetherNISO/DCMI Webinar: Metadata Harmonization: Making Standards Work Together
NISO/DCMI Webinar: Metadata Harmonization: Making Standards Work Together
 
Aligning library services with emerging research data needs
Aligning library services with emerging research data needsAligning library services with emerging research data needs
Aligning library services with emerging research data needs
 
DataUp Presentation at Cal Poly
DataUp Presentation at Cal PolyDataUp Presentation at Cal Poly
DataUp Presentation at Cal Poly
 
Embrace The Chaos
Embrace The ChaosEmbrace The Chaos
Embrace The Chaos
 
The Data Management Ecosystem
The Data Management EcosystemThe Data Management Ecosystem
The Data Management Ecosystem
 
MozCon 2013 Recap - Day One
MozCon 2013 Recap - Day OneMozCon 2013 Recap - Day One
MozCon 2013 Recap - Day One
 
RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata
RDAP13 Jian Qin: Functional and Architectural Requirements for MetadataRDAP13 Jian Qin: Functional and Architectural Requirements for Metadata
RDAP13 Jian Qin: Functional and Architectural Requirements for Metadata
 
DCMI/RDA Task Group Report, DC-2010 Pittsburgh
DCMI/RDA Task Group Report, DC-2010 PittsburghDCMI/RDA Task Group Report, DC-2010 Pittsburgh
DCMI/RDA Task Group Report, DC-2010 Pittsburgh
 
Dagstuhl FOAF history talk
Dagstuhl FOAF history talkDagstuhl FOAF history talk
Dagstuhl FOAF history talk
 
Digital Curation: Definitions, Tools, and Strategies
Digital Curation: Definitions, Tools, and StrategiesDigital Curation: Definitions, Tools, and Strategies
Digital Curation: Definitions, Tools, and Strategies
 
NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
 NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti... NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
NISO/DCMI May 22 Webinar: Semantic Mashups Across Large, Heterogeneous Insti...
 
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
NISO/DCMI September 25 Webinar: Implementing Linked Data in Developing Countr...
 
Unicorns and Other Wild Things
Unicorns and Other Wild ThingsUnicorns and Other Wild Things
Unicorns and Other Wild Things
 
NISO/DCMI Webinar: Metadata for Public Sector Administration
NISO/DCMI Webinar: Metadata for Public Sector AdministrationNISO/DCMI Webinar: Metadata for Public Sector Administration
NISO/DCMI Webinar: Metadata for Public Sector Administration
 
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
 
NISO DCMI Webinar bibframe-20130123
NISO DCMI Webinar bibframe-20130123NISO DCMI Webinar bibframe-20130123
NISO DCMI Webinar bibframe-20130123
 
April 24, 2013 NISO/DCMI Webinar: Deployment of RDA (Resource Description and...
April 24, 2013 NISO/DCMI Webinar: Deployment of RDA (Resource Description and...April 24, 2013 NISO/DCMI Webinar: Deployment of RDA (Resource Description and...
April 24, 2013 NISO/DCMI Webinar: Deployment of RDA (Resource Description and...
 

Ähnlich wie Functional and Architectural Requirements for Metadata: Supporting Discovery and Management of Scientific Data

SEAD Datanet and Sustainability Science
SEAD Datanet and Sustainability Science SEAD Datanet and Sustainability Science
SEAD Datanet and Sustainability Science Robert H. McDonald
 
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...SEAD
 
Building a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceBuilding a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceRobert H. McDonald
 
Linked Open data: CNR
Linked Open data: CNRLinked Open data: CNR
Linked Open data: CNRDatiGovIT
 
Technologies For Appraising and Managing Electronic Records
Technologies For Appraising and Managing Electronic RecordsTechnologies For Appraising and Managing Electronic Records
Technologies For Appraising and Managing Electronic Recordspbajcsy
 
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...ASIS&T
 
How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?Jian Qin
 
How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?Jian Qin
 
Libby Bishop, Ethics Of Data Sharing Ncess Jun 09 Final
Libby Bishop, Ethics Of Data Sharing Ncess Jun 09 FinalLibby Bishop, Ethics Of Data Sharing Ncess Jun 09 Final
Libby Bishop, Ethics Of Data Sharing Ncess Jun 09 Finala.carusi
 
Integrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesIntegrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesManjulaPatel
 
Emerging domain agnostic functionalities on the handle-centered networks
Emerging domain agnostic functionalities on the handle-centered networksEmerging domain agnostic functionalities on the handle-centered networks
Emerging domain agnostic functionalities on the handle-centered networksNational Institute of Informatics
 
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
Materials Data Facility: Streamlined and automated data sharing,  discovery, ...Materials Data Facility: Streamlined and automated data sharing,  discovery, ...
Materials Data Facility: Streamlined and automated data sharing, discovery, ...Ian Foster
 
Cni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferiesCni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferiesBDLSS
 
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...aceas13tern
 
Managing the research life cycle
Managing the research life cycleManaging the research life cycle
Managing the research life cycleSherry Lake
 

Ähnlich wie Functional and Architectural Requirements for Metadata: Supporting Discovery and Management of Scientific Data (20)

SEAD Datanet and Sustainability Science
SEAD Datanet and Sustainability Science SEAD Datanet and Sustainability Science
SEAD Datanet and Sustainability Science
 
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
CNI Fall 2011 Meeting Presentation Margaret Hedstrom & Robert McDonald (Dec. ...
 
NISO/DCMI Webinar: Metadata for Managing Scientific Research Data
NISO/DCMI Webinar: Metadata for Managing Scientific Research DataNISO/DCMI Webinar: Metadata for Managing Scientific Research Data
NISO/DCMI Webinar: Metadata for Managing Scientific Research Data
 
Building a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability ScienceBuilding a Data Discovery Network for Sustainability Science
Building a Data Discovery Network for Sustainability Science
 
Linked Open data: CNR
Linked Open data: CNRLinked Open data: CNR
Linked Open data: CNR
 
Role of Semantic Web in Health Informatics
Role of Semantic Web in Health InformaticsRole of Semantic Web in Health Informatics
Role of Semantic Web in Health Informatics
 
NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...
NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...
NISO Forum, Denver, Sept. 24, 2012: Scientific discovery and innovation in an...
 
Technologies For Appraising and Managing Electronic Records
Technologies For Appraising and Managing Electronic RecordsTechnologies For Appraising and Managing Electronic Records
Technologies For Appraising and Managing Electronic Records
 
NISO Forum, Denver, Sept. 24, 2012: Data Equivalence
NISO Forum, Denver, Sept. 24, 2012: Data EquivalenceNISO Forum, Denver, Sept. 24, 2012: Data Equivalence
NISO Forum, Denver, Sept. 24, 2012: Data Equivalence
 
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
RDAP 15: Beyond Metadata: Leveraging the “README” to support disciplinary Doc...
 
How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?
 
How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?
 
Libby Bishop, Ethics Of Data Sharing Ncess Jun 09 Final
Libby Bishop, Ethics Of Data Sharing Ncess Jun 09 FinalLibby Bishop, Ethics Of Data Sharing Ncess Jun 09 Final
Libby Bishop, Ethics Of Data Sharing Ncess Jun 09 Final
 
Integrated research data management in the Structural Sciences
Integrated research data management in the Structural SciencesIntegrated research data management in the Structural Sciences
Integrated research data management in the Structural Sciences
 
Emerging domain agnostic functionalities on the handle-centered networks
Emerging domain agnostic functionalities on the handle-centered networksEmerging domain agnostic functionalities on the handle-centered networks
Emerging domain agnostic functionalities on the handle-centered networks
 
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
Materials Data Facility: Streamlined and automated data sharing,  discovery, ...Materials Data Facility: Streamlined and automated data sharing,  discovery, ...
Materials Data Facility: Streamlined and automated data sharing, discovery, ...
 
CSHALS 2013
CSHALS 2013CSHALS 2013
CSHALS 2013
 
Cni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferiesCni research data_oxford_horstmann_jefferies
Cni research data_oxford_horstmann_jefferies
 
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
SPatially Explicit Data Discovery, Extraction and Evaluation Services (SPEDDE...
 
Managing the research life cycle
Managing the research life cycleManaging the research life cycle
Managing the research life cycle
 

Mehr von Jian Qin

Survey research
Survey research Survey research
Survey research Jian Qin
 
Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Jian Qin
 
Developing Data Services to Support Scientific Data Management (v3)
Developing Data Services to Support Scientific Data Management (v3)Developing Data Services to Support Scientific Data Management (v3)
Developing Data Services to Support Scientific Data Management (v3)Jian Qin
 
Developing Data Services to Support eScience/eResearch
Developing Data Services to Support eScience/eResearchDeveloping Data Services to Support eScience/eResearch
Developing Data Services to Support eScience/eResearchJian Qin
 
Scientific data management (v2)
Scientific data management (v2)Scientific data management (v2)
Scientific data management (v2)Jian Qin
 
Scientific Data Management
Scientific Data ManagementScientific Data Management
Scientific Data ManagementJian Qin
 
Research literature review
Research literature reviewResearch literature review
Research literature reviewJian Qin
 
Scholarly communication
Scholarly communicationScholarly communication
Scholarly communicationJian Qin
 
Linking Scientific Metadata (presented at DC2010)
Linking Scientific Metadata (presented at DC2010)Linking Scientific Metadata (presented at DC2010)
Linking Scientific Metadata (presented at DC2010)Jian Qin
 

Mehr von Jian Qin (9)

Survey research
Survey research Survey research
Survey research
 
Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08
 
Developing Data Services to Support Scientific Data Management (v3)
Developing Data Services to Support Scientific Data Management (v3)Developing Data Services to Support Scientific Data Management (v3)
Developing Data Services to Support Scientific Data Management (v3)
 
Developing Data Services to Support eScience/eResearch
Developing Data Services to Support eScience/eResearchDeveloping Data Services to Support eScience/eResearch
Developing Data Services to Support eScience/eResearch
 
Scientific data management (v2)
Scientific data management (v2)Scientific data management (v2)
Scientific data management (v2)
 
Scientific Data Management
Scientific Data ManagementScientific Data Management
Scientific Data Management
 
Research literature review
Research literature reviewResearch literature review
Research literature review
 
Scholarly communication
Scholarly communicationScholarly communication
Scholarly communication
 
Linking Scientific Metadata (presented at DC2010)
Linking Scientific Metadata (presented at DC2010)Linking Scientific Metadata (presented at DC2010)
Linking Scientific Metadata (presented at DC2010)
 

Kürzlich hochgeladen

Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESmohitsingh558521
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 

Kürzlich hochgeladen (20)

Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICESSALESFORCE EDUCATION CLOUD | FEXLE SERVICES
SALESFORCE EDUCATION CLOUD | FEXLE SERVICES
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 

Functional and Architectural Requirements for Metadata: Supporting Discovery and Management of Scientific Data

  • 1. Functional and Architectural Requirements for Metadata: Supporting Discovery and Management of Scientific Data Jian Qin Alex Ball Jane Greenberg School of Information Digital Curation Centre School of Information and Studies UKOLN Library Science Syracuse University University of Bath University of North Caroline USA UK Chapel Hill, USA International Conference of Dublin Core Metadata Kuching, Malaysia, September 5, 2012
  • 2. Metadata standards for scientific data Ecological Metadata Language (EML) CSDGM Access to Biological Collections Data – ABCD Darwin Core
  • 3. Many tools have been developed…
  • 4. Many tools have been developed…
  • 5. Many tools have been developed…
  • 6. Motivation for adoption • Standardize the format and terminology of metadata • Enable fast and effective discovery of datasets across different data repositories • Enable data sharing, reuse, and preservation • Provide information for obtaining datasets from the data owners
  • 7. Hindering factors • large numbers of • steep learning curve elements • difficult to automate • many layers in structure metadata generation – “Unwieldy to apply” • unnecessary duplicate • been created for data entry manual data entry, • high costs in time, rather than for resource, and automatic generation personnel expertise
  • 8. Same entity data repeated in the same record… Seamless Daily Precipitation for the Conterminous United States Metadata: Identification_Information Data_Quality_Information Spatial_Data_Organization_Information Spatial_Reference_Information Entity_and_Attribute_Information Distribution_Information Metadata_Reference_Information
  • 9. …and they are already in…
  • 10. Research questions • What functions do metadata standards for scientific data serve? • How should metadata standards for scientific data be modeled to support these functions by meeting the associated requirements?
  • 11. Functions expected • Resource discovery and use, • Data interoperability, • Automatic and semi-automatic metadata generation, • Linking of publications and underlying datasets, • Data/metadata quality control, and • Data security.
  • 12. Metadata requirements for scientific data
  • 13.
  • 15. Identity metadata • Person: researcherID, • Name repositories URI • Linked data architecture • Institution: ORCID, URI • Customizable research • Data object: DOI, group/community Handle, URI member name lists • Associated publication: DOI
  • 16. Semantic metadata • Large semantic resources available in linked data format, but usually not suitable for representing scientific data because they are designed for publications, especially books and journals (containers) • Format is contemporary but the content is far from it Smaller, specialized semantic resources are necessary for automatic semantic metadata generation
  • 17. Contextual metadata • Provenance Provenance data model (W3C, http://www.w3.org/TR/prov-primer/) Provenance data represent the origins of digital objects and describe the entities and activities involved in producing and delivering or otherwise influencing a given object.
  • 18. Geospatial metadata Biological sciences Climate Ecological Darwin Metadata Biological Core NetCDF Data Profile Language (DwC) (EML) Climate and Forecast (CF) Georeferencing Metadata Shoreline elements Conventions Metadata Profile FGDC Georeferencing elements Astronomy CSDGM Profiles CSDGM Astronomy Visualization Metadata ISO 19115: 2003 Standard Geographic information - Metadata. 18
  • 19. Temporal metadata • Mean solar time • Different measurement • Civil time systems result in • GPS time different units and format • Terrestrial time • Conversion between • Atomic time systems • Geologic time
  • 20. • The least effort principle • The infrastructure service principle • The portable principle
  • 21. TABLE 1. Mapping data user tasks with metadata functions and architectural building blocks Data Metadata function Architectural building block user tasks Discover Descriptive metadata Identity and semantic metadata Identify Descriptive metadata Identity metadata Select Descriptive, technical Identity, semantic, scientific metadata context, geospatial, temporal, miscellany metadata Obtain Descriptive metadata Identify metadata Verify Descriptive metadata Scientific context metadata Analyze Scientific context, geospatial, and temporal metadata Manage Descriptive, Identify, semantic, scientific administrative, context, geospatial, temporal, structural, and miscellany metadata technical metadata Archive Descriptive, Identify, semantic, scientific administrative, context, geospatial, temporal, structural, and miscellany metadata technical metadata Publish Descriptive metadata Identity, semantic, scientific context, geospatial, and temporal metadata Cite Descriptive metadata Identify metadata
  • 22. Development potentials • An infrastructure of metadata services – Entities as linked data – Tools for “slicing” members by research group, community, or institution to customize the entity set – Tools for grabbing entity data from existing resources through interoperability protocols
  • 23. Application scenarios • Cross-domain discovery and verification • Automatically populating entity information from customized slices of entities into metadata records • And more…
  • 24. Conclusion • Scientific data are inherently complex and diverse • Functional metadata requirements should be translated into an effective and efficient architecture – Three principles for modeling metadata for scientific data • Metadata for scientific data (or other domains at large) should adopt an infrastructure service approach • Much to be explored, experimented, and evaluated