"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
Session 3: Vocabulary enrichment, Gerda Koch
1. local content in a Europeana cloud
Session 3: Vocabulary enrichment
Gerda Koch, kochg@ait.co.at
AIT Angewandte Informationstechnik
Forschungsgesellschaft mbH
LoCloud is funded by the
European Commission's ICT Policy Support Programme
2. WP3: Micro services for small and medium
institutions
• establishing a cloud-based collaborative testing
environment for tools and services
• developing cloud-based SaaS services (Software as a
Service) and applications suitable for use by small
and medium institutions
• providing the basis for a continuing process of
participative testing and validation of each of the
services and applications
Introduction
3. WP3: Service framework
• Geolocation enrichment
• Metadata enrichment
• Vocabularies and languages
• Historic place names
• Wikimedia applications
Introduction
4. This presentation will provide you:
A brief introduction to …
1. Web Services
2. Vocabulary Standards used by Task 3.4
3. Vocabulary Management Tool (Sample)
Contents
5. What is a Web Service?
A web service is a software function provided at a
network address over the web or the cloud (24/7)
• are application components
• communicate using open protocols
• are self-contained and self-describing
• can be used by other applications
• XML is the basis for Web services
1 Web Services
6. Query a Web Service
offered online
Results: Rivers in Germany
TGN – Getty Thesaurus of Geographic Names
Query
Result
1 Web Services
7. Xataface: editing a record
Integrate a Web Service
in a local application
Which
Descriptors
should I use?
1 Web Services
8. Integrate a Web Service in a
local application
The vocabulary
webservice is
directly addressed
within the entry field
of the application.
The user chooses
the vocabulary terms
that are taken over
into the application.
(auto-suggest)
Results: Music Genres
DISMARC Genres Vocabulary
1 Web Services
9. Task 3.4: Vocabularies and languages
• Experimental application to enable local cultural
institutions to collaborate in the development of
multilingual vocabularies for local history and
archaeology
2 Vocabularies
10. The application will be based on:
ISO 25964: standard for building thesauri
• Part 1: Thesauri for information retrieval
- published in 2011
- developing a thesaurus (mono- and
multilingual)
- replaced previous standards ISO 2788/5964
- includes data model and XML schema
• Part 2: Interoperability with other vocabularies
- published in 2013
- recommendations for the establishment and maintenance of
mappings between multiple thesauri, or between thesauri and
other types of vocabularies
2 Vocabularies
11. The application will use SKOS as exchange
format:
SKOS Simple Knowledge Organisation System
• is a W3C recommendation designed for
representation of controlled vocabularies
• main objective is to enable easy publication and use
of such vocabularies as linked data
2 Vocabularies
12. How this two relate…
• The SKOS metamodel is broadly compatible with the
data model of ISO 25964-1 - Thesauri for Information
Retrieval.
• ISO 25964-1 advises on the selection and fitting together
of concepts, terms and relationships to make a good
thesaurus
• SKOS addresses the next step - porting the thesaurus to
the Web.
2 Vocabularies
13. Using SKOS, concepts can be identified using URIs,
labeled with lexical strings in one or more natural
languages…
The SKOS Core Vocabulary is an application of the
Resource Description Framework (RDF), that can be
used to express a concept scheme as an RDF graph.
Using RDF allows data to be linked to and/or merged
with other data, enabling data sources to be
distributed across the web, but still be meaningfully
composed and integrated.
2 Vocabularies
14. Vocabulary examples (SKOS Format)
• DDC Dewey Decimal Classification
• Library of Congress’ vocabularies
• VIAF person authorities
• UKAT UK Archival Thesaurus
• UNESCO Thesaurus
• …..
Example….
2 Vocabularies
16. Using a vocabulary for classification
Semantic Net presentation
Graph
presentation
(Topic Map)
Tree
presentation
Object
Predicate
Subject
2 Vocabularies
17. Usage of SKOS within the
Europeana Data Model (EDM)
Contextual
Classes (EDM)ObjectPredicate
Subject
2 Vocabularies
22. Vocabulary Management – Business Process/Workflow
The process of editing and
managing a vocabulary
involves the communication
between different persons
and the technical system:
Eg. Finding a term,
Requesting a candidate term,
Accepting the new term,
Updating the vocabulary etc.
User uses web
service for finding a
vocabulary term
Term exists in the vocabulary
SaaS
Crowdsourcing
Term does not
exist in the
vocabulary
Use
crowdsourcing
for finding a new
candidate term
3 Voc. Tool
23. Outcomes of Copenhagen - WS1
Access to the vocabulary:
• Online access point (see TGN example)
• Integration into local application (see Genre example)
Import of existing vocabularies into the LoCloud experimental vocabulary
application (pre-requisite: skosified, multilingual, open access)
• Subject vocabularies
• Object Types
• Geographic Names
• Suggested vocabularies: UNESCO Thesaurus, UKAT
(http://www.heritagedata.org/blog/vocabularies-provided/ ? UK)
24. Outcomes of Copenhagen
General usage of the tool
• Everybody allowed to enter new terms vs. Suggest candidate terms
• Multilinguality: language translations assigned to partners
Other wishes?
• Crowdsourcing?
• Automated Vocabulary enrichment
26. LoCloud is funded by the European Commission's
ICT Policy Support Programme
The views and opinions expressed in this
presentation are the sole responsibility of the
authors and do not necessarily reflect the views of
the European Commission.
Funding