SlideShare ist ein Scribd-Unternehmen logo
1 von 8
Downloaden Sie, um offline zu lesen
| ELAG 2019 Lightning Talk | 10 May 20191
Publishing Metadata Provenance
Jana Hentschke | German National Library
Our Problem
– Metadata from multiple sources
– Attribution needed - also in data exchange
– Example:
Users of our metadata should be able to recognise if a
GND Subject Heading was assigned to a publication by a
cataloguer or by an automated process
| ELAG 2019 Lightning Talk | 10 May 20192
Solutions
– MARC Bibliographic has field 883 (… digestible?)
– For RDF data we came up with
- PROV Ontology (PROV-O)
- Design pattern: Qualified Relations
- URIs and descriptions for new entities:
prov:Activity, prov:Plan, prov:SoftwareAgent, …
| ELAG 2019 Lightning Talk | 10 May 20193
| ELAG 2019 Lightning Talk | 10 May 20194
| ELAG 2019 Lightning Talk | 10 May 20195
dnb:1050604636
dnbt:hasQualifiedSubject [
agrelon:metadataConfidence "0.62493"^^xsd:float ;
dnbt:qualifiesSubject gnd:4530909-7 ;
a dnbt:QualifiedSubject, prov:Entity ;
prov:generatedAtTime "2018-06-13"^^xsd:date ;
prov:wasGeneratedBy dnb-prov-a:aep-gnd:2018-06-13
] .
dnb-prov-a:aep-gnd:2018-06-13 a prov:Activity, dnbt:AutomatedActivity ;
prov:qualifiedAssociation [
prov:hadPlan dnb-prov-p:aep-gnd ;
prov:agent [
a prov:Agent
]
] ;
prov:startedAtTime "2018-06-13"^^xsd:date ;
prov:endedAtTime "2018-06-13"^^xsd:date .
dnb-prov-p:aep-gnd rdfs:label "Machine assignment of subject headings"@en .
What‘s on offer
– Data dumps at https://data.dnb.de/opendata/
(„dnb-all_ldsprov…“ in .rdf, .ttl, .jsonld, .nt, .hdt)
– Details + schedule at https://wiki.dnb.de/x/46G5C (German)
- Next: Provenance information also on URI resolving of individual entities
– Temporary solution for provenance of outgoing links in GND RDF:
… a table on a wiki page: https://wiki.dnb.de/x/TZa5C
| ELAG 2019 Lightning Talk | 10 May 20196
Feedback + Exchange appreciated!
j.hentschke@dnb.de
OR
Discussion on public mailing list
lds@lists.dnb.de
| ELAG 2019 Lightning Talk | 10 May 20197
Prefixes
@prefix dnbt: <https://d-nb.info/standards/elementset/dnb#> .
@prefix dnb-prov-a: <https://d-nb.info/provenance/activity/> .
@prefix dnb-prov-p: <https://d-nb.info/provenance/plan#> .
@prefix dnb-prov-s: <https://d-nb.info/provenance/software#> .
@prefix isil: <https://ld.zdb-services.de/resource/organisations/> .
@prefix prov: <http://www.w3.org/ns/prov#> .
@prefix agrelon: <https://d-nb.info/standards/elementset/agrelon#> .
@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .
@prefix dnb: <https:d-nb.info/> .
@prefix gnd: <https:d-nb.info/gnd/> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
| ELAG 2019 Lightning Talk | 10 May 20198

Weitere ähnliche Inhalte

Was ist angesagt?

Intro To Graph Databases - Oxana Goriuc
Intro To Graph Databases - Oxana GoriucIntro To Graph Databases - Oxana Goriuc
Intro To Graph Databases - Oxana GoriucFraugster
 
Airline Reservations and Routing: A Graph Use Case
Airline Reservations and Routing: A Graph Use CaseAirline Reservations and Routing: A Graph Use Case
Airline Reservations and Routing: A Graph Use CaseJason Plurad
 
SC7 Webinar 5 13/12/2017 NCSR "Demokritos" Presentation "Event Detection"
SC7 Webinar 5 13/12/2017 NCSR "Demokritos" Presentation "Event Detection"SC7 Webinar 5 13/12/2017 NCSR "Demokritos" Presentation "Event Detection"
SC7 Webinar 5 13/12/2017 NCSR "Demokritos" Presentation "Event Detection"BigData_Europe
 
RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataGiorgos Santipantakis
 
Querying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge GraphQuerying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge GraphIoan Toma
 
SC7 Webinar 5 13/12/2017 UoA Presentation "Technical aspects of the 3rd secur...
SC7 Webinar 5 13/12/2017 UoA Presentation "Technical aspects of the 3rd secur...SC7 Webinar 5 13/12/2017 UoA Presentation "Technical aspects of the 3rd secur...
SC7 Webinar 5 13/12/2017 UoA Presentation "Technical aspects of the 3rd secur...BigData_Europe
 
Graph Computing with Apache TinkerPop
Graph Computing with Apache TinkerPopGraph Computing with Apache TinkerPop
Graph Computing with Apache TinkerPopJason Plurad
 
Organising principles
Organising principlesOrganising principles
Organising principlesIntranätverk
 
The Rhizomer Semantic Content Management System
The Rhizomer Semantic Content Management SystemThe Rhizomer Semantic Content Management System
The Rhizomer Semantic Content Management SystemRoberto García
 
Exploring Graph Use Cases with JanusGraph
Exploring Graph Use Cases with JanusGraphExploring Graph Use Cases with JanusGraph
Exploring Graph Use Cases with JanusGraphJason Plurad
 
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016Sergio Fernández
 
CHAOS Platform presentation, The Royal Library in Copenhagen.
CHAOS Platform presentation, The Royal Library in Copenhagen.CHAOS Platform presentation, The Royal Library in Copenhagen.
CHAOS Platform presentation, The Royal Library in Copenhagen.Peter Overgaard
 
On-the-fly Integration of Static and Dynamic Linked Data
On-the-fly Integration of Static and Dynamic Linked DataOn-the-fly Integration of Static and Dynamic Linked Data
On-the-fly Integration of Static and Dynamic Linked Dataaharth
 
MLSD18. Summary of Morning Sessions
MLSD18. Summary of Morning SessionsMLSD18. Summary of Morning Sessions
MLSD18. Summary of Morning SessionsBigML, Inc
 
Building an Enterprise Knowledge Graph @Uber: Lessons from Reality
Building an Enterprise Knowledge Graph @Uber: Lessons from RealityBuilding an Enterprise Knowledge Graph @Uber: Lessons from Reality
Building an Enterprise Knowledge Graph @Uber: Lessons from RealityJoshua Shinavier
 
Graph Computing with JanusGraph
Graph Computing with JanusGraphGraph Computing with JanusGraph
Graph Computing with JanusGraphJason Plurad
 
Text and Data Mining at Springer Nature
Text and Data Mining at Springer NatureText and Data Mining at Springer Nature
Text and Data Mining at Springer NatureSpringer Nature
 
Event Object Boundaries in RDF Streams - A Position Paper (slides)
Event Object Boundaries in RDF Streams - A Position Paper (slides)Event Object Boundaries in RDF Streams - A Position Paper (slides)
Event Object Boundaries in RDF Streams - A Position Paper (slides)RobinKeskisarkka
 

Was ist angesagt? (20)

Intro To Graph Databases - Oxana Goriuc
Intro To Graph Databases - Oxana GoriucIntro To Graph Databases - Oxana Goriuc
Intro To Graph Databases - Oxana Goriuc
 
Airline Reservations and Routing: A Graph Use Case
Airline Reservations and Routing: A Graph Use CaseAirline Reservations and Routing: A Graph Use Case
Airline Reservations and Routing: A Graph Use Case
 
SC7 Webinar 5 13/12/2017 NCSR "Demokritos" Presentation "Event Detection"
SC7 Webinar 5 13/12/2017 NCSR "Demokritos" Presentation "Event Detection"SC7 Webinar 5 13/12/2017 NCSR "Demokritos" Presentation "Event Detection"
SC7 Webinar 5 13/12/2017 NCSR "Demokritos" Presentation "Event Detection"
 
RDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival dataRDF-Gen: Generating RDF from streaming and archival data
RDF-Gen: Generating RDF from streaming and archival data
 
Querying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge GraphQuerying the Wikidata Knowledge Graph
Querying the Wikidata Knowledge Graph
 
SC7 Webinar 5 13/12/2017 UoA Presentation "Technical aspects of the 3rd secur...
SC7 Webinar 5 13/12/2017 UoA Presentation "Technical aspects of the 3rd secur...SC7 Webinar 5 13/12/2017 UoA Presentation "Technical aspects of the 3rd secur...
SC7 Webinar 5 13/12/2017 UoA Presentation "Technical aspects of the 3rd secur...
 
Graph Computing with Apache TinkerPop
Graph Computing with Apache TinkerPopGraph Computing with Apache TinkerPop
Graph Computing with Apache TinkerPop
 
Organising principles
Organising principlesOrganising principles
Organising principles
 
The Rhizomer Semantic Content Management System
The Rhizomer Semantic Content Management SystemThe Rhizomer Semantic Content Management System
The Rhizomer Semantic Content Management System
 
Exploring Graph Use Cases with JanusGraph
Exploring Graph Use Cases with JanusGraphExploring Graph Use Cases with JanusGraph
Exploring Graph Use Cases with JanusGraph
 
Timbuctoo 2 EASY
Timbuctoo 2 EASYTimbuctoo 2 EASY
Timbuctoo 2 EASY
 
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016Geospatial Querying in Apache Marmotta -  Apache Big Data North America 2016
Geospatial Querying in Apache Marmotta - Apache Big Data North America 2016
 
CHAOS Platform presentation, The Royal Library in Copenhagen.
CHAOS Platform presentation, The Royal Library in Copenhagen.CHAOS Platform presentation, The Royal Library in Copenhagen.
CHAOS Platform presentation, The Royal Library in Copenhagen.
 
On-the-fly Integration of Static and Dynamic Linked Data
On-the-fly Integration of Static and Dynamic Linked DataOn-the-fly Integration of Static and Dynamic Linked Data
On-the-fly Integration of Static and Dynamic Linked Data
 
MLSD18. Summary of Morning Sessions
MLSD18. Summary of Morning SessionsMLSD18. Summary of Morning Sessions
MLSD18. Summary of Morning Sessions
 
Building an Enterprise Knowledge Graph @Uber: Lessons from Reality
Building an Enterprise Knowledge Graph @Uber: Lessons from RealityBuilding an Enterprise Knowledge Graph @Uber: Lessons from Reality
Building an Enterprise Knowledge Graph @Uber: Lessons from Reality
 
Graph Computing with JanusGraph
Graph Computing with JanusGraphGraph Computing with JanusGraph
Graph Computing with JanusGraph
 
Text and Data Mining at Springer Nature
Text and Data Mining at Springer NatureText and Data Mining at Springer Nature
Text and Data Mining at Springer Nature
 
Event Object Boundaries in RDF Streams - A Position Paper (slides)
Event Object Boundaries in RDF Streams - A Position Paper (slides)Event Object Boundaries in RDF Streams - A Position Paper (slides)
Event Object Boundaries in RDF Streams - A Position Paper (slides)
 
Semantic Web talk TEMPLATE
Semantic Web talk TEMPLATESemantic Web talk TEMPLATE
Semantic Web talk TEMPLATE
 

Ähnlich wie Publishing metadata provenance

Graph Analytics in Spark
Graph Analytics in SparkGraph Analytics in Spark
Graph Analytics in SparkPaco Nathan
 
Learning Knowledge Rich User Models from the Semantic Web
Learning Knowledge Rich User Models from the Semantic WebLearning Knowledge Rich User Models from the Semantic Web
Learning Knowledge Rich User Models from the Semantic Webbutest
 
GraphX: Graph analytics for insights about developer communities
GraphX: Graph analytics for insights about developer communitiesGraphX: Graph analytics for insights about developer communities
GraphX: Graph analytics for insights about developer communitiesPaco Nathan
 
Connecting Stream Reasoners on the Web
Connecting Stream Reasoners on the WebConnecting Stream Reasoners on the Web
Connecting Stream Reasoners on the WebJean-Paul Calbimonte
 
The nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesThe nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesTony Hammond
 
The Nature.com ontologies portal - Linked Science 2015
The Nature.com ontologies portal - Linked Science 2015The Nature.com ontologies portal - Linked Science 2015
The Nature.com ontologies portal - Linked Science 2015Michele Pasin
 
Spark Summit East 2015 Advanced Devops Student Slides
Spark Summit East 2015 Advanced Devops Student SlidesSpark Summit East 2015 Advanced Devops Student Slides
Spark Summit East 2015 Advanced Devops Student SlidesDatabricks
 
Better {ML} Together: GraphLab Create + Spark
Better {ML} Together: GraphLab Create + Spark Better {ML} Together: GraphLab Create + Spark
Better {ML} Together: GraphLab Create + Spark Turi, Inc.
 
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph DatabaseBringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph DatabaseJimmy Angelakos
 
ArangoML Pipeline Cloud - Managed Machine Learning Metadata
ArangoML Pipeline Cloud - Managed Machine Learning MetadataArangoML Pipeline Cloud - Managed Machine Learning Metadata
ArangoML Pipeline Cloud - Managed Machine Learning MetadataArangoDB Database
 
Data Infrastructure for a World of Music
Data Infrastructure for a World of MusicData Infrastructure for a World of Music
Data Infrastructure for a World of MusicLars Albertsson
 
Graph Gurus Episode 1: Enterprise Graph
Graph Gurus Episode 1: Enterprise GraphGraph Gurus Episode 1: Enterprise Graph
Graph Gurus Episode 1: Enterprise GraphTigerGraph
 
Microservices, containers, and machine learning
Microservices, containers, and machine learningMicroservices, containers, and machine learning
Microservices, containers, and machine learningPaco Nathan
 
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE
 
RDF Stream Processing Tutorial: RSP implementations
RDF Stream Processing Tutorial: RSP implementationsRDF Stream Processing Tutorial: RSP implementations
RDF Stream Processing Tutorial: RSP implementationsJean-Paul Calbimonte
 
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"Paco Nathan
 
Intro to Spark development
 Intro to Spark development  Intro to Spark development
Intro to Spark development Spark Summit
 
Knowledge Graph for Cybersecurity: An Introduction By Kabul Kurniawan
Knowledge Graph for Cybersecurity: An Introduction By  Kabul KurniawanKnowledge Graph for Cybersecurity: An Introduction By  Kabul Kurniawan
Knowledge Graph for Cybersecurity: An Introduction By Kabul KurniawanKabul Kurniawan
 
Introduction to Spark Training
Introduction to Spark TrainingIntroduction to Spark Training
Introduction to Spark TrainingSpark Summit
 

Ähnlich wie Publishing metadata provenance (20)

Graph Analytics in Spark
Graph Analytics in SparkGraph Analytics in Spark
Graph Analytics in Spark
 
Learning Knowledge Rich User Models from the Semantic Web
Learning Knowledge Rich User Models from the Semantic WebLearning Knowledge Rich User Models from the Semantic Web
Learning Knowledge Rich User Models from the Semantic Web
 
GraphX: Graph analytics for insights about developer communities
GraphX: Graph analytics for insights about developer communitiesGraphX: Graph analytics for insights about developer communities
GraphX: Graph analytics for insights about developer communities
 
Connecting Stream Reasoners on the Web
Connecting Stream Reasoners on the WebConnecting Stream Reasoners on the Web
Connecting Stream Reasoners on the Web
 
The nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesThe nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologies
 
The Nature.com ontologies portal - Linked Science 2015
The Nature.com ontologies portal - Linked Science 2015The Nature.com ontologies portal - Linked Science 2015
The Nature.com ontologies portal - Linked Science 2015
 
Spark Summit East 2015 Advanced Devops Student Slides
Spark Summit East 2015 Advanced Devops Student SlidesSpark Summit East 2015 Advanced Devops Student Slides
Spark Summit East 2015 Advanced Devops Student Slides
 
Better {ML} Together: GraphLab Create + Spark
Better {ML} Together: GraphLab Create + Spark Better {ML} Together: GraphLab Create + Spark
Better {ML} Together: GraphLab Create + Spark
 
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph DatabaseBringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
Bringing the Semantic Web closer to reality: PostgreSQL as RDF Graph Database
 
ArangoML Pipeline Cloud - Managed Machine Learning Metadata
ArangoML Pipeline Cloud - Managed Machine Learning MetadataArangoML Pipeline Cloud - Managed Machine Learning Metadata
ArangoML Pipeline Cloud - Managed Machine Learning Metadata
 
Data Infrastructure for a World of Music
Data Infrastructure for a World of MusicData Infrastructure for a World of Music
Data Infrastructure for a World of Music
 
Graph Gurus Episode 1: Enterprise Graph
Graph Gurus Episode 1: Enterprise GraphGraph Gurus Episode 1: Enterprise Graph
Graph Gurus Episode 1: Enterprise Graph
 
Microservices, containers, and machine learning
Microservices, containers, and machine learningMicroservices, containers, and machine learning
Microservices, containers, and machine learning
 
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software ComponentsFIWARE Global Summit - IDS Implementation with FIWARE Software Components
FIWARE Global Summit - IDS Implementation with FIWARE Software Components
 
RDF Stream Processing Tutorial: RSP implementations
RDF Stream Processing Tutorial: RSP implementationsRDF Stream Processing Tutorial: RSP implementations
RDF Stream Processing Tutorial: RSP implementations
 
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"
July Clojure Users Group Meeting: "Using Cascalog with Palo Alto Open Data"
 
Towards a Linked Data Publishing Methodology
Towards a Linked Data Publishing MethodologyTowards a Linked Data Publishing Methodology
Towards a Linked Data Publishing Methodology
 
Intro to Spark development
 Intro to Spark development  Intro to Spark development
Intro to Spark development
 
Knowledge Graph for Cybersecurity: An Introduction By Kabul Kurniawan
Knowledge Graph for Cybersecurity: An Introduction By  Kabul KurniawanKnowledge Graph for Cybersecurity: An Introduction By  Kabul Kurniawan
Knowledge Graph for Cybersecurity: An Introduction By Kabul Kurniawan
 
Introduction to Spark Training
Introduction to Spark TrainingIntroduction to Spark Training
Introduction to Spark Training
 

Kürzlich hochgeladen

MEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptMEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptaigil2
 
RESEARCH METHODOLOGY FORMAT.docx
RESEARCH METHODOLOGY FORMAT.docxRESEARCH METHODOLOGY FORMAT.docx
RESEARCH METHODOLOGY FORMAT.docxkittustudy7
 
5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best PracticesDataArchiva
 
Task 3 - Model Presentation data an.pptx
Task 3 - Model Presentation data an.pptxTask 3 - Model Presentation data an.pptx
Task 3 - Model Presentation data an.pptxShristiKumari28
 
SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024Becky Burwell
 
YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.JasonViviers2
 
Create Data Model & Conduct Visualisation in Power BI Desktop
Create Data Model & Conduct Visualisation in Power BI DesktopCreate Data Model & Conduct Visualisation in Power BI Desktop
Create Data Model & Conduct Visualisation in Power BI DesktopThinkInnovation
 
Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Vladislav Solodkiy
 
ChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics InfrastructureChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics Infrastructuresonikadigital1
 
Mapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxMapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxVenkatasubramani13
 
Cyclistic Memberships Data Analysis Project
Cyclistic Memberships Data Analysis ProjectCyclistic Memberships Data Analysis Project
Cyclistic Memberships Data Analysis Projectdanielbell861
 
The Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerThe Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerPavel Šabatka
 
Master's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationMaster's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationGiorgio Carbone
 

Kürzlich hochgeladen (13)

MEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .pptMEASURES OF DISPERSION I BSc Botany .ppt
MEASURES OF DISPERSION I BSc Botany .ppt
 
RESEARCH METHODOLOGY FORMAT.docx
RESEARCH METHODOLOGY FORMAT.docxRESEARCH METHODOLOGY FORMAT.docx
RESEARCH METHODOLOGY FORMAT.docx
 
5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices5 Ds to Define Data Archiving Best Practices
5 Ds to Define Data Archiving Best Practices
 
Task 3 - Model Presentation data an.pptx
Task 3 - Model Presentation data an.pptxTask 3 - Model Presentation data an.pptx
Task 3 - Model Presentation data an.pptx
 
SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024SFBA Splunk Usergroup meeting March 13, 2024
SFBA Splunk Usergroup meeting March 13, 2024
 
YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.YourView Panel Book.pptx YourView Panel Book.
YourView Panel Book.pptx YourView Panel Book.
 
Create Data Model & Conduct Visualisation in Power BI Desktop
Create Data Model & Conduct Visualisation in Power BI DesktopCreate Data Model & Conduct Visualisation in Power BI Desktop
Create Data Model & Conduct Visualisation in Power BI Desktop
 
Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023Cash Is Still King: ATM market research '2023
Cash Is Still King: ATM market research '2023
 
ChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics InfrastructureChistaDATA Real-Time DATA Analytics Infrastructure
ChistaDATA Real-Time DATA Analytics Infrastructure
 
Mapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptxMapping the pubmed data under different suptopics using NLP.pptx
Mapping the pubmed data under different suptopics using NLP.pptx
 
Cyclistic Memberships Data Analysis Project
Cyclistic Memberships Data Analysis ProjectCyclistic Memberships Data Analysis Project
Cyclistic Memberships Data Analysis Project
 
The Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayerThe Universal GTM - how we design GTM and dataLayer
The Universal GTM - how we design GTM and dataLayer
 
Master's Thesis - Data Science - Presentation
Master's Thesis - Data Science - PresentationMaster's Thesis - Data Science - Presentation
Master's Thesis - Data Science - Presentation
 

Publishing metadata provenance

  • 1. | ELAG 2019 Lightning Talk | 10 May 20191 Publishing Metadata Provenance Jana Hentschke | German National Library
  • 2. Our Problem – Metadata from multiple sources – Attribution needed - also in data exchange – Example: Users of our metadata should be able to recognise if a GND Subject Heading was assigned to a publication by a cataloguer or by an automated process | ELAG 2019 Lightning Talk | 10 May 20192
  • 3. Solutions – MARC Bibliographic has field 883 (… digestible?) – For RDF data we came up with - PROV Ontology (PROV-O) - Design pattern: Qualified Relations - URIs and descriptions for new entities: prov:Activity, prov:Plan, prov:SoftwareAgent, … | ELAG 2019 Lightning Talk | 10 May 20193
  • 4. | ELAG 2019 Lightning Talk | 10 May 20194
  • 5. | ELAG 2019 Lightning Talk | 10 May 20195 dnb:1050604636 dnbt:hasQualifiedSubject [ agrelon:metadataConfidence "0.62493"^^xsd:float ; dnbt:qualifiesSubject gnd:4530909-7 ; a dnbt:QualifiedSubject, prov:Entity ; prov:generatedAtTime "2018-06-13"^^xsd:date ; prov:wasGeneratedBy dnb-prov-a:aep-gnd:2018-06-13 ] . dnb-prov-a:aep-gnd:2018-06-13 a prov:Activity, dnbt:AutomatedActivity ; prov:qualifiedAssociation [ prov:hadPlan dnb-prov-p:aep-gnd ; prov:agent [ a prov:Agent ] ] ; prov:startedAtTime "2018-06-13"^^xsd:date ; prov:endedAtTime "2018-06-13"^^xsd:date . dnb-prov-p:aep-gnd rdfs:label "Machine assignment of subject headings"@en .
  • 6. What‘s on offer – Data dumps at https://data.dnb.de/opendata/ („dnb-all_ldsprov…“ in .rdf, .ttl, .jsonld, .nt, .hdt) – Details + schedule at https://wiki.dnb.de/x/46G5C (German) - Next: Provenance information also on URI resolving of individual entities – Temporary solution for provenance of outgoing links in GND RDF: … a table on a wiki page: https://wiki.dnb.de/x/TZa5C | ELAG 2019 Lightning Talk | 10 May 20196
  • 7. Feedback + Exchange appreciated! j.hentschke@dnb.de OR Discussion on public mailing list lds@lists.dnb.de | ELAG 2019 Lightning Talk | 10 May 20197
  • 8. Prefixes @prefix dnbt: <https://d-nb.info/standards/elementset/dnb#> . @prefix dnb-prov-a: <https://d-nb.info/provenance/activity/> . @prefix dnb-prov-p: <https://d-nb.info/provenance/plan#> . @prefix dnb-prov-s: <https://d-nb.info/provenance/software#> . @prefix isil: <https://ld.zdb-services.de/resource/organisations/> . @prefix prov: <http://www.w3.org/ns/prov#> . @prefix agrelon: <https://d-nb.info/standards/elementset/agrelon#> . @prefix xsd: <http://www.w3.org/2001/XMLSchema#> . @prefix dnb: <https:d-nb.info/> . @prefix gnd: <https:d-nb.info/gnd/> . @prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> . | ELAG 2019 Lightning Talk | 10 May 20198