SlideShare ist ein Scribd-Unternehmen logo
1 von 34
Downloaden Sie, um offline zu lesen
KOS evolution in Linked Data 
Joachim Neubert 
ZBW – Leibniz Information Centre for Economics, Hamburg 
ZBW is member of the Leibniz Association 
SWIB14 
Bonn, Germany 
03.12.2014
Agenda 
 Introduction 
 Current versioning approach with STW 
 User questions and requirements 
 Getting a grip on changes: 
the dataset versioning and skos-history approach 
 Overview 
 Application 
 Selected useful reports 
 Outlook: Future work and the skos-history project 
Page 2
Page 3 
STW Thesaurus for Economics 
 Created in the 1990s, now maintained and enhanced by ZBW 
 More than 6,000 descriptors in English and German 
 Since 2009 published as Linked Data in SKOS 
 Roughly every year a new version 
 Major overhaul in progress – subject area by subject area
Short digression: SKOS as a RDF data format 
 Based on concepts (“units of thought”), which may bear labels in 
multiple languages 
 All semantic relations (hierarchies, mappings etc.) exist between 
concepts 
 Per language at most one skos:prefLabel (should be unique) 
 Additional properties for notations, notes, mappings, etc. 
Classes for ConceptSchemes and Collections of concepts 
 Widely in use today as a common interchange format 
Page 4
How did we handle KOS evolution in the past? 
Page 5
RDF statements about a particular version 
Page 6 
<http://zbw.eu/stw> 
a skos:ConceptScheme, void:Dataset ; 
dcterms:issued "2013-10-30"^^xsd:date ; 
owl:versionInfo "8.12" ; 
... 
Others do this in a similar, yet slightly different way (dcterms:modified, 
dcterms:hasVersion, …) – and sometimes, this changes over time
Page 7 
STW versions in URIs 
Stable URIs for skos:Concept (and similar for skos:ConceptScheme) 
 http://zbw.eu/stw/descriptor/19664-4 
303 redirect to versioned URLs (RDFa/rdf/ttl files) 
 http://zbw.eu/stw/versions/latest/descriptor/19664-4/about 
Archived RDFa/rdf/ttl files available 
 http://zbw.eu/stw/versions/8.06/descriptor/19664-4/about 
(Currently, search functions and web services always work on the latest 
version)
Page 8 
Deprecated concepts 
No deletion – URI is still defined, shown on a RDFa page like this: 
<http://zbw.eu/stw/descriptor/12257-3> 
a skos:Concept, zbwext:Descriptor ; 
skos:inScheme <http://zbw.eu/stw> ; 
rdfs:label "Real estate loan"@en, "Realkredit"@de ; 
owl:deprecated true ; 
dcterms:isReplacedBy <http://zbw.eu/stw/descriptor/13775-4> ; 
skos:historyNote "Deprecated (used at last in version 
8.04)"@en .
Page 9 
Pragmatic version history solution: 
Don‘t delete anything 
Changes are traceable 
only intellectually (but 
at all)
Page 10 
Detailed changelog 
From legacy maintance system (simple text file, in German):
How to handle this better? 
Page 11 
What users want to know when we publish a new KOS version: 
 What‘s new? 
 What has changed?
Use cases for extended change information 
 Human indexers wanting to learn about new and deprecated 
concepts 
 Human indexers (and supporting applications) re-indexing large sets 
of documents 
 People maintaining a derived subset of a KOS 
 People maintaining mappings to other vocabularies, and applications 
supporting them 
 Automatic or semi-automatic indexing applications which make use 
of the KOS and/or its mappings 
 Search applications which make use of the KOS and/or its mappings 
Page 12
Getting a grip on changes 
(Provided that we have no access to the KOS maintenance system 
where the changes take place originally, or can’t extend it to report this 
changes comprehensively.) 
Dataset versioning + skos-history 
- should basically work on every SKOS vocabulary 
Page 13
5 basic steps to an actionable skos-history 
1) Start with a sorted n-triple file per version. 
(This poses one triple on every single line.) 
2) Create a raw diff between two version files. 
(This gives you thousands and thousands of differences, even 
excluding bnodes.) 
3) Split the resulting diff into an insertions and a deletions file. 
4) Load the version files, the insertions and deletions files into a triple 
store as named graphs. 
5) Add metadata about the versions and the deltas in a separate 
„version history graph“. 
Page 14 
https://github.com/jneubert/skos-history/blob/master/bin/load_versions.sh
Version History Graph, discoverable via 
fix URI, e.g.: http://zbw.eu/stw/version 
Page 15 
Example endpoint:http://zbw.eu/beta/sparql/stwv/query
Vocabularies for the plumbing 
 dc:/dcterms: 
Dublin Core, as usual the base for everything 
 void: http://rdfs.org/ns/void# 
Vocabulary of interlinked datasets 
 sd: http://www.w3.org/ns/sparql-service-description# 
SPARQL service description 
 delta: http://www.w3.org/2004/delta# 
Differences between RDF graphs 
 dsv: http://purl.org/iso25964/DataSet/Versioning# 
Version history records (providing version identifier and date) and a 
pointer to the current version – outside the actual version data 
 sh: http://purl.org/skos-history/ 
Scheme and concept version deltas 
Page 16
What’s the benefit? 
A database of all versions of a KOS and all deltas between versions 
– which can be queried in parallel! 
Page 17
Query for added concepts 
Page 18 
http://zbw.eu/beta/sparql-lab/?queryRef=https://api.github.com/repos/jneubert/skos-history/contents/sparql/added_concepts.rq
Results: Newly inserted concepts 
Page 19
New concepts by subject category 
Page 20 
http://zbw.eu/beta/sparql-lab/?queryRef=https://api.github.com/repos/jneubert/skos-history/contents/sparql/stw/added_by_category.rq
Statistics via aggregation queries: STW 
* Computed column - deprecation and redirects for thsys will be introduced for STW v 8.14 (retrospectively) 
Page 21 
Version Date Added 
descriptors 
Deprecated 
descriptors redirected 
Added 
thsys 
Deprecated 
thsys* 
v 8.04 16.02.2009 
v 8.06 22.04.2010 224 4 4 3 
v 8.08 30.06.2011 131 57 54 14 1 
v 8.10 21.03.2012 105 141 110 7 4 
v 8.12 30.10.2013 260 487 485 12 26 
v 8.14 18.11.2014 227 342 342 ? ? 
https://github.com/jneubert/skos-history/blob/master/bin/create_change_statistics.pl
Statistics via aggregation queries: TheSoz 
Page 22 
Version Date Added concepts Deleted concepts 
v 0.7 11.01.2011 
v 0.86 08.11.2011 1 1 
v 0.91 30.04.2012 240 4 
v 0.92 19.09.2012 15 3 
v 0.93 25.02.2014 42 4 
Thesaurus for the Social Sciences 
http://www.gesis.org/en/services/research/thesauri-und-klassifikationen/social-science-thesaurus/ 
https://github.com/jneubert/skos-history/blob/master/bin/create_change_statistics.pl
Selected useful reports 
 Changed notations 
 Splits and merges of concepts 
 History of a single concept 
Page 23
Changed notations (general case) 
Page 24 
http://zbw.eu/beta/sparql-lab/?queryRef=https://api.github.com/repos/jneubert/skos-history/contents/sparql/changed_notations.rq
Changed notations (linking STW versioned pages) 
Page 25 
http://zbw.eu/beta/sparql-lab/?queryRef=https://api.github.com/repos/jneubert/skos-history/contents/sparql/stw/changed_notations_thsys.rq
Page 26
Page 27
Merges and splits of concepts 
… can be recognized by tracing the movement of labels 
Page 28
New concepts, split from old ones 
Page 29 
Labels moved to added concepts: 
http://zbw.eu/beta/sparql-lab/?queryRef=https://api.github.com/repos/jneubert/skos-history/contents/sparql/labels_moved_to_added_concepts.rq
Concept removed and merged into multiple 
Minor split-ups of concepts can be revealed by label movements, too: 
Page 30 
http://zbw.eu/beta/sparql-lab/?queryRef=https://api.github.com/repos/jneubert/skos-history/contents/sparql/stw/merged_partially.rq
Change history of a concept: “Personnel selection” 
Page 31 
http://zbw.eu/beta/sparql-lab/?queryRef=https://api.github.com/repos/jneubert/skos-history/contents/sparql/concept_deltas.rq
Future work 
 For STW: 
 Create a web service for concept history and link a history report 
to every concept 
 Provide drilldowns for new/deprecated/… concepts from the 
category level, perhaps visualizations / heat maps 
 For skos-history: 
 Apply to differing concept schemes 
 Distill general properties useful for human-readable change 
reports as well as machine-actionable data 
Page 32
Consider joining the skos-history project … 
… particularly if 
 you are in charge of a KOS and want to publish its change history 
 you are using one or several KOS in an application, or intellectually, 
and want to trace and re-apply upstream changes 
 just feel challenged by the task 
Code, issues, wiki pages etc.: https://github.com/jneubert/skos-history 
Currently, Johan DeSmedt (Tenforce) , Sini Pessala (National Library 
of Finland) and Agis Papantoniou (Tenforce) are involved in the project 
and in discussions on which this presentation was based. 
Page 33
Page 34 
Thanks for listening! 
Joachim Neubert 
ZBW – Leibniz Information Centre for Economics 
j.neubert@zbw.eu 
http://zbw.eu/stw 
https://github.com/jneubert/skos-history 
http://zbw.eu/labs

Weitere ähnliche Inhalte

Was ist angesagt?

Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...
Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...
Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...Equipex Biblissima
 
The Biblissima Authority File of Geographical Names
The Biblissima Authority File of Geographical NamesThe Biblissima Authority File of Geographical Names
The Biblissima Authority File of Geographical NamesEquipex Biblissima
 
Lodstats: The Data Web Census Dataset. Kobe, Japan, 2016
Lodstats: The Data Web Census Dataset. Kobe, Japan, 2016Lodstats: The Data Web Census Dataset. Kobe, Japan, 2016
Lodstats: The Data Web Census Dataset. Kobe, Japan, 2016Ivan Ermilov
 
Wherecamp Navigation Conference 2015 - SPOI SDI4pps: Points of Interest
Wherecamp Navigation Conference 2015 - SPOI SDI4pps: Points of InterestWherecamp Navigation Conference 2015 - SPOI SDI4pps: Points of Interest
Wherecamp Navigation Conference 2015 - SPOI SDI4pps: Points of InterestWhereCampBerlin
 

Was ist angesagt? (7)

Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...
Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...
Digital Manuscripts Without Borders: A Discovery Platform of Manuscripts and ...
 
The Biblissima Authority File of Geographical Names
The Biblissima Authority File of Geographical NamesThe Biblissima Authority File of Geographical Names
The Biblissima Authority File of Geographical Names
 
DBpedia ♥ Commons
DBpedia ♥ CommonsDBpedia ♥ Commons
DBpedia ♥ Commons
 
Lodstats: The Data Web Census Dataset. Kobe, Japan, 2016
Lodstats: The Data Web Census Dataset. Kobe, Japan, 2016Lodstats: The Data Web Census Dataset. Kobe, Japan, 2016
Lodstats: The Data Web Census Dataset. Kobe, Japan, 2016
 
Redis
RedisRedis
Redis
 
Wherecamp Navigation Conference 2015 - SPOI SDI4pps: Points of Interest
Wherecamp Navigation Conference 2015 - SPOI SDI4pps: Points of InterestWherecamp Navigation Conference 2015 - SPOI SDI4pps: Points of Interest
Wherecamp Navigation Conference 2015 - SPOI SDI4pps: Points of Interest
 
Dbms quiz
Dbms quiz Dbms quiz
Dbms quiz
 

Ähnlich wie KOS evolution in Linked Data

Large Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and FriendsLarge Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and Friendslucenerevolution
 
Large Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and FriendsLarge Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and FriendsJulien Nioche
 
Nosql-columbia-feb2011
Nosql-columbia-feb2011Nosql-columbia-feb2011
Nosql-columbia-feb2011siculars
 
BDE SC3.3 Workshop - BDE Platform: Technical overview
 BDE SC3.3 Workshop -  BDE Platform: Technical overview BDE SC3.3 Workshop -  BDE Platform: Technical overview
BDE SC3.3 Workshop - BDE Platform: Technical overviewBigData_Europe
 
From SKOS over SKOS-XL to Custom Ontologies
From SKOS over SKOS-XL to Custom OntologiesFrom SKOS over SKOS-XL to Custom Ontologies
From SKOS over SKOS-XL to Custom OntologiesSemantic Web Company
 
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...Making Distributed Data Persistent Services Elastic (Without Losing All Your ...
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...C4Media
 
Processing large-scale graphs with Google Pregel
Processing large-scale graphs with Google PregelProcessing large-scale graphs with Google Pregel
Processing large-scale graphs with Google PregelMax Neunhöffer
 
AAT LOD Microthesauri
AAT LOD MicrothesauriAAT LOD Microthesauri
AAT LOD MicrothesauriMarcia Zeng
 
Lessons learned from Semantic Wiki
Lessons learned from Semantic WikiLessons learned from Semantic Wiki
Lessons learned from Semantic WikiJie Bao
 
20100614 ISWSA Keynote
20100614 ISWSA Keynote20100614 ISWSA Keynote
20100614 ISWSA KeynoteAxel Polleres
 
Ceph Day SF 2015 - Community Update
Ceph Day SF 2015 - Community Update Ceph Day SF 2015 - Community Update
Ceph Day SF 2015 - Community Update Ceph Community
 
Towards an architecture and adoption process for Linked Data technologies in ...
Towards an architecture and adoption process for Linked Data technologies in ...Towards an architecture and adoption process for Linked Data technologies in ...
Towards an architecture and adoption process for Linked Data technologies in ...Jose Emilio Labra Gayo
 
BigDataEurope @BDVA Summit2016 2: Societal Pilots
BigDataEurope @BDVA Summit2016 2: Societal PilotsBigDataEurope @BDVA Summit2016 2: Societal Pilots
BigDataEurope @BDVA Summit2016 2: Societal PilotsBigData_Europe
 
Ceph Day Amsterdam 2015 - Ceph Ecosystem Update
Ceph Day Amsterdam 2015 - Ceph Ecosystem Update Ceph Day Amsterdam 2015 - Ceph Ecosystem Update
Ceph Day Amsterdam 2015 - Ceph Ecosystem Update Ceph Community
 
Digital Fabrication Studio.02 _Information @ Aalto Media Factory
Digital Fabrication Studio.02 _Information @ Aalto Media FactoryDigital Fabrication Studio.02 _Information @ Aalto Media Factory
Digital Fabrication Studio.02 _Information @ Aalto Media FactoryMassimo Menichinelli
 
Big Data Open Source Security LLC: Realtime log analysis with Mesos, Docker, ...
Big Data Open Source Security LLC: Realtime log analysis with Mesos, Docker, ...Big Data Open Source Security LLC: Realtime log analysis with Mesos, Docker, ...
Big Data Open Source Security LLC: Realtime log analysis with Mesos, Docker, ...DataStax Academy
 
Linking Folksonomies to Knowledge Organization Systems
Linking Folksonomies to Knowledge Organization SystemsLinking Folksonomies to Knowledge Organization Systems
Linking Folksonomies to Knowledge Organization SystemsJakob .
 
Un unbis-agrovoc 2010-09-03
Un unbis-agrovoc 2010-09-03Un unbis-agrovoc 2010-09-03
Un unbis-agrovoc 2010-09-03Johannes Keizer
 
Digital Fabrication Studio v.0.2: Information
Digital Fabrication Studio v.0.2: InformationDigital Fabrication Studio v.0.2: Information
Digital Fabrication Studio v.0.2: InformationMassimo Menichinelli
 
Change Data Capture to Data Lakes Using Apache Pulsar and Apache Hudi - Pulsa...
Change Data Capture to Data Lakes Using Apache Pulsar and Apache Hudi - Pulsa...Change Data Capture to Data Lakes Using Apache Pulsar and Apache Hudi - Pulsa...
Change Data Capture to Data Lakes Using Apache Pulsar and Apache Hudi - Pulsa...StreamNative
 

Ähnlich wie KOS evolution in Linked Data (20)

Large Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and FriendsLarge Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and Friends
 
Large Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and FriendsLarge Scale Crawling with Apache Nutch and Friends
Large Scale Crawling with Apache Nutch and Friends
 
Nosql-columbia-feb2011
Nosql-columbia-feb2011Nosql-columbia-feb2011
Nosql-columbia-feb2011
 
BDE SC3.3 Workshop - BDE Platform: Technical overview
 BDE SC3.3 Workshop -  BDE Platform: Technical overview BDE SC3.3 Workshop -  BDE Platform: Technical overview
BDE SC3.3 Workshop - BDE Platform: Technical overview
 
From SKOS over SKOS-XL to Custom Ontologies
From SKOS over SKOS-XL to Custom OntologiesFrom SKOS over SKOS-XL to Custom Ontologies
From SKOS over SKOS-XL to Custom Ontologies
 
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...Making Distributed Data Persistent Services Elastic (Without Losing All Your ...
Making Distributed Data Persistent Services Elastic (Without Losing All Your ...
 
Processing large-scale graphs with Google Pregel
Processing large-scale graphs with Google PregelProcessing large-scale graphs with Google Pregel
Processing large-scale graphs with Google Pregel
 
AAT LOD Microthesauri
AAT LOD MicrothesauriAAT LOD Microthesauri
AAT LOD Microthesauri
 
Lessons learned from Semantic Wiki
Lessons learned from Semantic WikiLessons learned from Semantic Wiki
Lessons learned from Semantic Wiki
 
20100614 ISWSA Keynote
20100614 ISWSA Keynote20100614 ISWSA Keynote
20100614 ISWSA Keynote
 
Ceph Day SF 2015 - Community Update
Ceph Day SF 2015 - Community Update Ceph Day SF 2015 - Community Update
Ceph Day SF 2015 - Community Update
 
Towards an architecture and adoption process for Linked Data technologies in ...
Towards an architecture and adoption process for Linked Data technologies in ...Towards an architecture and adoption process for Linked Data technologies in ...
Towards an architecture and adoption process for Linked Data technologies in ...
 
BigDataEurope @BDVA Summit2016 2: Societal Pilots
BigDataEurope @BDVA Summit2016 2: Societal PilotsBigDataEurope @BDVA Summit2016 2: Societal Pilots
BigDataEurope @BDVA Summit2016 2: Societal Pilots
 
Ceph Day Amsterdam 2015 - Ceph Ecosystem Update
Ceph Day Amsterdam 2015 - Ceph Ecosystem Update Ceph Day Amsterdam 2015 - Ceph Ecosystem Update
Ceph Day Amsterdam 2015 - Ceph Ecosystem Update
 
Digital Fabrication Studio.02 _Information @ Aalto Media Factory
Digital Fabrication Studio.02 _Information @ Aalto Media FactoryDigital Fabrication Studio.02 _Information @ Aalto Media Factory
Digital Fabrication Studio.02 _Information @ Aalto Media Factory
 
Big Data Open Source Security LLC: Realtime log analysis with Mesos, Docker, ...
Big Data Open Source Security LLC: Realtime log analysis with Mesos, Docker, ...Big Data Open Source Security LLC: Realtime log analysis with Mesos, Docker, ...
Big Data Open Source Security LLC: Realtime log analysis with Mesos, Docker, ...
 
Linking Folksonomies to Knowledge Organization Systems
Linking Folksonomies to Knowledge Organization SystemsLinking Folksonomies to Knowledge Organization Systems
Linking Folksonomies to Knowledge Organization Systems
 
Un unbis-agrovoc 2010-09-03
Un unbis-agrovoc 2010-09-03Un unbis-agrovoc 2010-09-03
Un unbis-agrovoc 2010-09-03
 
Digital Fabrication Studio v.0.2: Information
Digital Fabrication Studio v.0.2: InformationDigital Fabrication Studio v.0.2: Information
Digital Fabrication Studio v.0.2: Information
 
Change Data Capture to Data Lakes Using Apache Pulsar and Apache Hudi - Pulsa...
Change Data Capture to Data Lakes Using Apache Pulsar and Apache Hudi - Pulsa...Change Data Capture to Data Lakes Using Apache Pulsar and Apache Hudi - Pulsa...
Change Data Capture to Data Lakes Using Apache Pulsar and Apache Hudi - Pulsa...
 

Mehr von Joachim Neubert

Linking the 20th century paper history to the sum of all knowledge
Linking the 20th century paper history to the sum of all knowledgeLinking the 20th century paper history to the sum of all knowledge
Linking the 20th century paper history to the sum of all knowledgeJoachim Neubert
 
Exploring and mapping the category system of the world‘s largest public press...
Exploring and mapping the category system of the world‘s largest public press...Exploring and mapping the category system of the world‘s largest public press...
Exploring and mapping the category system of the world‘s largest public press...Joachim Neubert
 
Donating data to Wikidata: First experiences from the „20th Century Press Arc...
Donating data to Wikidata: First experiences from the „20th Century Press Arc...Donating data to Wikidata: First experiences from the „20th Century Press Arc...
Donating data to Wikidata: First experiences from the „20th Century Press Arc...Joachim Neubert
 
Wikidata as a hub for the linked data cloud
Wikidata as a hub for the linked data cloudWikidata as a hub for the linked data cloud
Wikidata as a hub for the linked data cloudJoachim Neubert
 
Wikidata as opportunity for special collections: the 20th Century Press Archi...
Wikidata as opportunity for special collections: the 20th Century Press Archi...Wikidata as opportunity for special collections: the 20th Century Press Archi...
Wikidata as opportunity for special collections: the 20th Century Press Archi...Joachim Neubert
 
20th Century Press Archives goes Wikidata
20th Century Press Archives goes Wikidata20th Century Press Archives goes Wikidata
20th Century Press Archives goes WikidataJoachim Neubert
 
Chancen und Herausforderungen einer komplementären Nutzung von GND und Wikidata
Chancen und Herausforderungen einer komplementären Nutzung von GND und WikidataChancen und Herausforderungen einer komplementären Nutzung von GND und Wikidata
Chancen und Herausforderungen einer komplementären Nutzung von GND und WikidataJoachim Neubert
 
Pressemappe 20. Jahrhundert: Personen- und Firmendossiers
Pressemappe 20. Jahrhundert: Personen- und FirmendossiersPressemappe 20. Jahrhundert: Personen- und Firmendossiers
Pressemappe 20. Jahrhundert: Personen- und FirmendossiersJoachim Neubert
 
20th Century Press Archives goes Wikidata
20th Century Press Archives goes Wikidata20th Century Press Archives goes Wikidata
20th Century Press Archives goes WikidataJoachim Neubert
 
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)Joachim Neubert
 
Making Wikidata fit as a Linking Hub for Knowledge Organization Systems
Making Wikidata fit as a Linking Hub for Knowledge Organization SystemsMaking Wikidata fit as a Linking Hub for Knowledge Organization Systems
Making Wikidata fit as a Linking Hub for Knowledge Organization SystemsJoachim Neubert
 
Linking authorities through Wikidata
Linking authorities through WikidataLinking authorities through Wikidata
Linking authorities through WikidataJoachim Neubert
 
Wikidata as a linking hub for knowledge organization systems? Integrating an ...
Wikidata as a linking hub for knowledge organization systems? Integrating an ...Wikidata as a linking hub for knowledge organization systems? Integrating an ...
Wikidata as a linking hub for knowledge organization systems? Integrating an ...Joachim Neubert
 
Wikidata as authority linking hub
Wikidata as authority linking hubWikidata as authority linking hub
Wikidata as authority linking hubJoachim Neubert
 
EconBiz Research Dataset (SWIB16 Lightning Talk)
EconBiz Research Dataset (SWIB16 Lightning Talk)EconBiz Research Dataset (SWIB16 Lightning Talk)
EconBiz Research Dataset (SWIB16 Lightning Talk)Joachim Neubert
 
Using Wikidata as an Authority for the SowiDataNet Research Data Repository
Using Wikidata as an Authority for the SowiDataNet Research Data RepositoryUsing Wikidata as an Authority for the SowiDataNet Research Data Repository
Using Wikidata as an Authority for the SowiDataNet Research Data RepositoryJoachim Neubert
 
Anforderungen an Thesauri im Semantic Web
Anforderungen an Thesauri im Semantic WebAnforderungen an Thesauri im Semantic Web
Anforderungen an Thesauri im Semantic WebJoachim Neubert
 
Exploiting the version history of SKOS files: skos-history (SWIB13 Lightning ...
Exploiting the version history of SKOS files: skos-history (SWIB13 Lightning ...Exploiting the version history of SKOS files: skos-history (SWIB13 Lightning ...
Exploiting the version history of SKOS files: skos-history (SWIB13 Lightning ...Joachim Neubert
 
Linked Data Publishing with Drupal (SWIB13 workshop)
Linked Data Publishing with Drupal (SWIB13 workshop)Linked Data Publishing with Drupal (SWIB13 workshop)
Linked Data Publishing with Drupal (SWIB13 workshop)Joachim Neubert
 

Mehr von Joachim Neubert (20)

Linking the 20th century paper history to the sum of all knowledge
Linking the 20th century paper history to the sum of all knowledgeLinking the 20th century paper history to the sum of all knowledge
Linking the 20th century paper history to the sum of all knowledge
 
Exploring and mapping the category system of the world‘s largest public press...
Exploring and mapping the category system of the world‘s largest public press...Exploring and mapping the category system of the world‘s largest public press...
Exploring and mapping the category system of the world‘s largest public press...
 
Donating data to Wikidata: First experiences from the „20th Century Press Arc...
Donating data to Wikidata: First experiences from the „20th Century Press Arc...Donating data to Wikidata: First experiences from the „20th Century Press Arc...
Donating data to Wikidata: First experiences from the „20th Century Press Arc...
 
Wikidata (für Archive)
Wikidata (für Archive)Wikidata (für Archive)
Wikidata (für Archive)
 
Wikidata as a hub for the linked data cloud
Wikidata as a hub for the linked data cloudWikidata as a hub for the linked data cloud
Wikidata as a hub for the linked data cloud
 
Wikidata as opportunity for special collections: the 20th Century Press Archi...
Wikidata as opportunity for special collections: the 20th Century Press Archi...Wikidata as opportunity for special collections: the 20th Century Press Archi...
Wikidata as opportunity for special collections: the 20th Century Press Archi...
 
20th Century Press Archives goes Wikidata
20th Century Press Archives goes Wikidata20th Century Press Archives goes Wikidata
20th Century Press Archives goes Wikidata
 
Chancen und Herausforderungen einer komplementären Nutzung von GND und Wikidata
Chancen und Herausforderungen einer komplementären Nutzung von GND und WikidataChancen und Herausforderungen einer komplementären Nutzung von GND und Wikidata
Chancen und Herausforderungen einer komplementären Nutzung von GND und Wikidata
 
Pressemappe 20. Jahrhundert: Personen- und Firmendossiers
Pressemappe 20. Jahrhundert: Personen- und FirmendossiersPressemappe 20. Jahrhundert: Personen- und Firmendossiers
Pressemappe 20. Jahrhundert: Personen- und Firmendossiers
 
20th Century Press Archives goes Wikidata
20th Century Press Archives goes Wikidata20th Century Press Archives goes Wikidata
20th Century Press Archives goes Wikidata
 
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
Linking Knowledge Organization Systems via Wikidata (DCMI conference 2018)
 
Making Wikidata fit as a Linking Hub for Knowledge Organization Systems
Making Wikidata fit as a Linking Hub for Knowledge Organization SystemsMaking Wikidata fit as a Linking Hub for Knowledge Organization Systems
Making Wikidata fit as a Linking Hub for Knowledge Organization Systems
 
Linking authorities through Wikidata
Linking authorities through WikidataLinking authorities through Wikidata
Linking authorities through Wikidata
 
Wikidata as a linking hub for knowledge organization systems? Integrating an ...
Wikidata as a linking hub for knowledge organization systems? Integrating an ...Wikidata as a linking hub for knowledge organization systems? Integrating an ...
Wikidata as a linking hub for knowledge organization systems? Integrating an ...
 
Wikidata as authority linking hub
Wikidata as authority linking hubWikidata as authority linking hub
Wikidata as authority linking hub
 
EconBiz Research Dataset (SWIB16 Lightning Talk)
EconBiz Research Dataset (SWIB16 Lightning Talk)EconBiz Research Dataset (SWIB16 Lightning Talk)
EconBiz Research Dataset (SWIB16 Lightning Talk)
 
Using Wikidata as an Authority for the SowiDataNet Research Data Repository
Using Wikidata as an Authority for the SowiDataNet Research Data RepositoryUsing Wikidata as an Authority for the SowiDataNet Research Data Repository
Using Wikidata as an Authority for the SowiDataNet Research Data Repository
 
Anforderungen an Thesauri im Semantic Web
Anforderungen an Thesauri im Semantic WebAnforderungen an Thesauri im Semantic Web
Anforderungen an Thesauri im Semantic Web
 
Exploiting the version history of SKOS files: skos-history (SWIB13 Lightning ...
Exploiting the version history of SKOS files: skos-history (SWIB13 Lightning ...Exploiting the version history of SKOS files: skos-history (SWIB13 Lightning ...
Exploiting the version history of SKOS files: skos-history (SWIB13 Lightning ...
 
Linked Data Publishing with Drupal (SWIB13 workshop)
Linked Data Publishing with Drupal (SWIB13 workshop)Linked Data Publishing with Drupal (SWIB13 workshop)
Linked Data Publishing with Drupal (SWIB13 workshop)
 

Kürzlich hochgeladen

Vip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac Room
Vip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac RoomVip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac Room
Vip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac Roommeghakumariji156
 
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...kajalverma014
 
一比一原版贝德福特大学毕业证学位证书
一比一原版贝德福特大学毕业证学位证书一比一原版贝德福特大学毕业证学位证书
一比一原版贝德福特大学毕业证学位证书F
 
一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理F
 
一比一原版犹他大学毕业证如何办理
一比一原版犹他大学毕业证如何办理一比一原版犹他大学毕业证如何办理
一比一原版犹他大学毕业证如何办理F
 
Down bad crying at the gym t shirtsDown bad crying at the gym t shirts
Down bad crying at the gym t shirtsDown bad crying at the gym t shirtsDown bad crying at the gym t shirtsDown bad crying at the gym t shirts
Down bad crying at the gym t shirtsDown bad crying at the gym t shirtsrahman018755
 
一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制
一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制
一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制pxcywzqs
 
Call girls Service Canacona - 8250092165 Our call girls are sure to provide y...
Call girls Service Canacona - 8250092165 Our call girls are sure to provide y...Call girls Service Canacona - 8250092165 Our call girls are sure to provide y...
Call girls Service Canacona - 8250092165 Our call girls are sure to provide y...MOHANI PANDEY
 
一比一原版帝国理工学院毕业证如何办理
一比一原版帝国理工学院毕业证如何办理一比一原版帝国理工学院毕业证如何办理
一比一原版帝国理工学院毕业证如何办理F
 
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查ydyuyu
 
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样ayvbos
 
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girlsRussian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girlsMonica Sydney
 
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency""Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency"growthgrids
 
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...meghakumariji156
 
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...APNIC
 
Real Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirtReal Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirtrahman018755
 
20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdfMatthew Sinclair
 
Trump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts SweatshirtTrump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts Sweatshirtrahman018755
 
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge GraphsEleniIlkou
 
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdfMatthew Sinclair
 

Kürzlich hochgeladen (20)

Vip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac Room
Vip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac RoomVip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac Room
Vip Firozabad Phone 8250092165 Escorts Service At 6k To 30k Along With Ac Room
 
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...
best call girls in Hyderabad Finest Escorts Service 📞 9352988975 📞 Available ...
 
一比一原版贝德福特大学毕业证学位证书
一比一原版贝德福特大学毕业证学位证书一比一原版贝德福特大学毕业证学位证书
一比一原版贝德福特大学毕业证学位证书
 
一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理一比一原版奥兹学院毕业证如何办理
一比一原版奥兹学院毕业证如何办理
 
一比一原版犹他大学毕业证如何办理
一比一原版犹他大学毕业证如何办理一比一原版犹他大学毕业证如何办理
一比一原版犹他大学毕业证如何办理
 
Down bad crying at the gym t shirtsDown bad crying at the gym t shirts
Down bad crying at the gym t shirtsDown bad crying at the gym t shirtsDown bad crying at the gym t shirtsDown bad crying at the gym t shirts
Down bad crying at the gym t shirtsDown bad crying at the gym t shirts
 
一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制
一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制
一比一原版(Offer)康考迪亚大学毕业证学位证靠谱定制
 
Call girls Service Canacona - 8250092165 Our call girls are sure to provide y...
Call girls Service Canacona - 8250092165 Our call girls are sure to provide y...Call girls Service Canacona - 8250092165 Our call girls are sure to provide y...
Call girls Service Canacona - 8250092165 Our call girls are sure to provide y...
 
一比一原版帝国理工学院毕业证如何办理
一比一原版帝国理工学院毕业证如何办理一比一原版帝国理工学院毕业证如何办理
一比一原版帝国理工学院毕业证如何办理
 
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查在线制作约克大学毕业证(yu毕业证)在读证明认证可查
在线制作约克大学毕业证(yu毕业证)在读证明认证可查
 
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
一比一原版(Flinders毕业证书)弗林德斯大学毕业证原件一模一样
 
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girlsRussian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
Russian Call girls in Abu Dhabi 0508644382 Abu Dhabi Call girls
 
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency""Boost Your Digital Presence: Partner with a Leading SEO Agency"
"Boost Your Digital Presence: Partner with a Leading SEO Agency"
 
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
Tadepalligudem Escorts Service Girl ^ 9332606886, WhatsApp Anytime Tadepallig...
 
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
APNIC Policy Roundup, presented by Sunny Chendi at the 5th ICANN APAC-TWNIC E...
 
Real Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirtReal Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirt
 
20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf20240508 QFM014 Elixir Reading List April 2024.pdf
20240508 QFM014 Elixir Reading List April 2024.pdf
 
Trump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts SweatshirtTrump Diapers Over Dems t shirts Sweatshirt
Trump Diapers Over Dems t shirts Sweatshirt
 
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
2nd Solid Symposium: Solid Pods vs Personal Knowledge Graphs
 
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
20240509 QFM015 Engineering Leadership Reading List April 2024.pdf
 

KOS evolution in Linked Data

  • 1. KOS evolution in Linked Data Joachim Neubert ZBW – Leibniz Information Centre for Economics, Hamburg ZBW is member of the Leibniz Association SWIB14 Bonn, Germany 03.12.2014
  • 2. Agenda  Introduction  Current versioning approach with STW  User questions and requirements  Getting a grip on changes: the dataset versioning and skos-history approach  Overview  Application  Selected useful reports  Outlook: Future work and the skos-history project Page 2
  • 3. Page 3 STW Thesaurus for Economics  Created in the 1990s, now maintained and enhanced by ZBW  More than 6,000 descriptors in English and German  Since 2009 published as Linked Data in SKOS  Roughly every year a new version  Major overhaul in progress – subject area by subject area
  • 4. Short digression: SKOS as a RDF data format  Based on concepts (“units of thought”), which may bear labels in multiple languages  All semantic relations (hierarchies, mappings etc.) exist between concepts  Per language at most one skos:prefLabel (should be unique)  Additional properties for notations, notes, mappings, etc. Classes for ConceptSchemes and Collections of concepts  Widely in use today as a common interchange format Page 4
  • 5. How did we handle KOS evolution in the past? Page 5
  • 6. RDF statements about a particular version Page 6 <http://zbw.eu/stw> a skos:ConceptScheme, void:Dataset ; dcterms:issued "2013-10-30"^^xsd:date ; owl:versionInfo "8.12" ; ... Others do this in a similar, yet slightly different way (dcterms:modified, dcterms:hasVersion, …) – and sometimes, this changes over time
  • 7. Page 7 STW versions in URIs Stable URIs for skos:Concept (and similar for skos:ConceptScheme)  http://zbw.eu/stw/descriptor/19664-4 303 redirect to versioned URLs (RDFa/rdf/ttl files)  http://zbw.eu/stw/versions/latest/descriptor/19664-4/about Archived RDFa/rdf/ttl files available  http://zbw.eu/stw/versions/8.06/descriptor/19664-4/about (Currently, search functions and web services always work on the latest version)
  • 8. Page 8 Deprecated concepts No deletion – URI is still defined, shown on a RDFa page like this: <http://zbw.eu/stw/descriptor/12257-3> a skos:Concept, zbwext:Descriptor ; skos:inScheme <http://zbw.eu/stw> ; rdfs:label "Real estate loan"@en, "Realkredit"@de ; owl:deprecated true ; dcterms:isReplacedBy <http://zbw.eu/stw/descriptor/13775-4> ; skos:historyNote "Deprecated (used at last in version 8.04)"@en .
  • 9. Page 9 Pragmatic version history solution: Don‘t delete anything Changes are traceable only intellectually (but at all)
  • 10. Page 10 Detailed changelog From legacy maintance system (simple text file, in German):
  • 11. How to handle this better? Page 11 What users want to know when we publish a new KOS version:  What‘s new?  What has changed?
  • 12. Use cases for extended change information  Human indexers wanting to learn about new and deprecated concepts  Human indexers (and supporting applications) re-indexing large sets of documents  People maintaining a derived subset of a KOS  People maintaining mappings to other vocabularies, and applications supporting them  Automatic or semi-automatic indexing applications which make use of the KOS and/or its mappings  Search applications which make use of the KOS and/or its mappings Page 12
  • 13. Getting a grip on changes (Provided that we have no access to the KOS maintenance system where the changes take place originally, or can’t extend it to report this changes comprehensively.) Dataset versioning + skos-history - should basically work on every SKOS vocabulary Page 13
  • 14. 5 basic steps to an actionable skos-history 1) Start with a sorted n-triple file per version. (This poses one triple on every single line.) 2) Create a raw diff between two version files. (This gives you thousands and thousands of differences, even excluding bnodes.) 3) Split the resulting diff into an insertions and a deletions file. 4) Load the version files, the insertions and deletions files into a triple store as named graphs. 5) Add metadata about the versions and the deltas in a separate „version history graph“. Page 14 https://github.com/jneubert/skos-history/blob/master/bin/load_versions.sh
  • 15. Version History Graph, discoverable via fix URI, e.g.: http://zbw.eu/stw/version Page 15 Example endpoint:http://zbw.eu/beta/sparql/stwv/query
  • 16. Vocabularies for the plumbing  dc:/dcterms: Dublin Core, as usual the base for everything  void: http://rdfs.org/ns/void# Vocabulary of interlinked datasets  sd: http://www.w3.org/ns/sparql-service-description# SPARQL service description  delta: http://www.w3.org/2004/delta# Differences between RDF graphs  dsv: http://purl.org/iso25964/DataSet/Versioning# Version history records (providing version identifier and date) and a pointer to the current version – outside the actual version data  sh: http://purl.org/skos-history/ Scheme and concept version deltas Page 16
  • 17. What’s the benefit? A database of all versions of a KOS and all deltas between versions – which can be queried in parallel! Page 17
  • 18. Query for added concepts Page 18 http://zbw.eu/beta/sparql-lab/?queryRef=https://api.github.com/repos/jneubert/skos-history/contents/sparql/added_concepts.rq
  • 19. Results: Newly inserted concepts Page 19
  • 20. New concepts by subject category Page 20 http://zbw.eu/beta/sparql-lab/?queryRef=https://api.github.com/repos/jneubert/skos-history/contents/sparql/stw/added_by_category.rq
  • 21. Statistics via aggregation queries: STW * Computed column - deprecation and redirects for thsys will be introduced for STW v 8.14 (retrospectively) Page 21 Version Date Added descriptors Deprecated descriptors redirected Added thsys Deprecated thsys* v 8.04 16.02.2009 v 8.06 22.04.2010 224 4 4 3 v 8.08 30.06.2011 131 57 54 14 1 v 8.10 21.03.2012 105 141 110 7 4 v 8.12 30.10.2013 260 487 485 12 26 v 8.14 18.11.2014 227 342 342 ? ? https://github.com/jneubert/skos-history/blob/master/bin/create_change_statistics.pl
  • 22. Statistics via aggregation queries: TheSoz Page 22 Version Date Added concepts Deleted concepts v 0.7 11.01.2011 v 0.86 08.11.2011 1 1 v 0.91 30.04.2012 240 4 v 0.92 19.09.2012 15 3 v 0.93 25.02.2014 42 4 Thesaurus for the Social Sciences http://www.gesis.org/en/services/research/thesauri-und-klassifikationen/social-science-thesaurus/ https://github.com/jneubert/skos-history/blob/master/bin/create_change_statistics.pl
  • 23. Selected useful reports  Changed notations  Splits and merges of concepts  History of a single concept Page 23
  • 24. Changed notations (general case) Page 24 http://zbw.eu/beta/sparql-lab/?queryRef=https://api.github.com/repos/jneubert/skos-history/contents/sparql/changed_notations.rq
  • 25. Changed notations (linking STW versioned pages) Page 25 http://zbw.eu/beta/sparql-lab/?queryRef=https://api.github.com/repos/jneubert/skos-history/contents/sparql/stw/changed_notations_thsys.rq
  • 28. Merges and splits of concepts … can be recognized by tracing the movement of labels Page 28
  • 29. New concepts, split from old ones Page 29 Labels moved to added concepts: http://zbw.eu/beta/sparql-lab/?queryRef=https://api.github.com/repos/jneubert/skos-history/contents/sparql/labels_moved_to_added_concepts.rq
  • 30. Concept removed and merged into multiple Minor split-ups of concepts can be revealed by label movements, too: Page 30 http://zbw.eu/beta/sparql-lab/?queryRef=https://api.github.com/repos/jneubert/skos-history/contents/sparql/stw/merged_partially.rq
  • 31. Change history of a concept: “Personnel selection” Page 31 http://zbw.eu/beta/sparql-lab/?queryRef=https://api.github.com/repos/jneubert/skos-history/contents/sparql/concept_deltas.rq
  • 32. Future work  For STW:  Create a web service for concept history and link a history report to every concept  Provide drilldowns for new/deprecated/… concepts from the category level, perhaps visualizations / heat maps  For skos-history:  Apply to differing concept schemes  Distill general properties useful for human-readable change reports as well as machine-actionable data Page 32
  • 33. Consider joining the skos-history project … … particularly if  you are in charge of a KOS and want to publish its change history  you are using one or several KOS in an application, or intellectually, and want to trace and re-apply upstream changes  just feel challenged by the task Code, issues, wiki pages etc.: https://github.com/jneubert/skos-history Currently, Johan DeSmedt (Tenforce) , Sini Pessala (National Library of Finland) and Agis Papantoniou (Tenforce) are involved in the project and in discussions on which this presentation was based. Page 33
  • 34. Page 34 Thanks for listening! Joachim Neubert ZBW – Leibniz Information Centre for Economics j.neubert@zbw.eu http://zbw.eu/stw https://github.com/jneubert/skos-history http://zbw.eu/labs