SlideShare ist ein Scribd-Unternehmen logo
1 von 30
Downloaden Sie, um offline zu lesen
SKOS:
Building taxonomies
with minimum ontological commitment
Szymon Klarman
http://klarman.synthasite.com
Grakn Labs
Semantic Web London Meetup
April 18, 2017
Simple Knowledge Organisation System
• W3C standard (2009)…
• …for sharing thesauri, taxonomies,
classification schemes and subject heading
systems…
• …using the Semantic Web data model RDF.
https://www.w3.org/TR/skos-reference/
Simple Knowledge Organisation System
Car
Motor vehicle
Motorcycle
"car" @en "automobile" @en "auto" @pl"samochod" @pl
skos:broader
Car washes
Services
skos:broaderskos:broader
skos:relatedMatch
Vehicles
skos:inScheme
Industry classification
skos:inScheme
skos:prefLabel skos:prefLabelskos:altLabel skos:altLabel
Knowledge organisation systems
controlled
vocabulary
Expressiveness
taxonomy
thesaurus
ontologyOntological
commitment
SKOS
OWL/RDFS
list of
terms
hierarchy
hierarchy
+ relations
classes
instances
properties
Ontological commitment & semantics
motor vehicle
a wheeled vehicle, powered by an internal combustion engine, that does
not operate on rails
car
a motor vehicle, typically with 4 wheels, able to carry a small number of
people
Ontological commitment & semantics
Ontological commitment & semantics
?x rdf:type ?y
?y rdfs:subClassOf ?z
---------------------------
?x rdf:type ?z
Car
Motor vehicle
has engine some
Combustion engine
myTesla
rdf:type
rdfs:subClassOf
rdfs:subClassOf
Semantics = logical inference
rdf:type
rdf:type
Ontological commitment & semantics
Car
Motor vehicle
myTeslaPhoto
subject
skos:broader
X skos:broader Y
=
„Y has a broader
meaning than X”
semantics = shared understanding
Ontological commitment & semantics
Domain: Real-world things
Questions:
-What are they?
-What are they like?
Semantics: Set-theoretic
OWL/RDFS ontologies
Domain: Information resources
(and concepts).
Question: What are they about?
Semantics: No formal semantics.
Integrity constraints imposed by
the data model specs.
SKOS taxonomies
RDF data model
SKOS specification
• concepts and labels
• semantic relations
• SKOS mappings
• integrity constraints
Concepts and labels
"car" @en "automobile" @en "auto" @pl"samochód" @pl
skos:prefLabel skos:prefLabelskos:altLabel skos:altLabel
skos:Concept
rdf:type
Car
"A motor vehicle, typically with 4 wheels, able to
carry a small number of people" @en
skos:definition
Semantic relations
Car
Motor vehicle
skos:broader
Electric car
skos:narrower
skos:related
Electric vehicle
skos:broader skos:narrower
SKOS mappings
Car Motor car
skos:exactMatch
Vehicles
skos:inScheme
Road traffic vocabulary
Car (automatic)
skos:inScheme
skos:narrowMatch
skos:inScheme
skos:broadMatch
Integrity constraints (entailments)
•skos:semanticRelation - domain & range = skos:Concept
•skos:related - symmetric
•skos:relatedMatch - symmetric
•skos:broaderTransitive - transitive, inverseOf skos:narrowerTransitive
•skos:broader - inverseOf skos:narrower
•skos:broadMatch - inverseOf skos:narrowMatch
•skos:narrowerTransitive - transitive, inverseOf skos:broaderTransitive
•skos:narrower - inverseOf skos:broader
•skos:narrowMatch - inverseOf skos:broadMatch
•skos:closeMatch - symmetric
•skos:exactMatch - symmetric, transitive
Integrity constraints (non-entailments)
Car
Motor vehicle
skos:broader
skos:broader
Vehicle
skos:broader
Car
Car wash
skos:related
skos:related
Cleaning products
skos:related
Integrity constraints (not consistent)
"car" @en
"automobile" @en
skos:prefLabel
Car
skos:prefLabel
"car" @en
skos:prefLabel
Car
skos:altLabel
Integrity constraints (not consistent)
Car
Motor vehicle
skos:broader
skos:related
Vehicle
skos:broader
Car Motor vehicle
skos:broader
skos:related
Integrity constraints (not consistent)
Car Motor car
skos:exactMatch
skos:broadMatch
Car Motor car
skos:exactMatch
skos:relatedMatch
SKOS + OWL/RDFS
Car
Motor vehicle
powered by some
Combustion engine
myCar
rdf:type
rdfs:subClassOf
rdfs:subClassOf
myCarPhoto
subject
skos:broader
skos:Concept
rdf:type
rdf:type
owl:Class
rdf:type
rdf:type
rdf:type
Curated SKOS taxonomies
-AGROVOC Multilingual agricultural thesaurus
http://aims.fao.org/vest-registry/vocabularies/agrovoc-multilingual-
agricultural-thesaurus
- STW Thesaurus for Economics
http://zbw.eu/stw/
-UK Public sector vocabularies
http://standards.esd.org.uk
-EUROVOC European Union multilingual thesaurus
http://eurovoc.europa.eu/
Crowdsourced SKOS taxonomies
Crowdsourced SKOS taxonomies
•Films_about_technology
•Films_about_artificial_intelligence
•Films_about_nuclear_war_and_weapons
•Robot_films
•Android_(robot)_films
•Terminator_films
•Documentary_films_about_robotics
•Films_about_computing
•Films_about_security_and_surveillance
•Films_about_telepresence
Crowdsourced SKOS taxonomies
skos:Concept 1,561,776
skos:broader 3,262,643
skos:related 23,483
skos:exactMatch 307
dct:subject 24,941,650
Crowdsourced SKOS taxonomies
crowdsourced taxonomy + crowdsourced tagging
=
very noisy data + highly imprecise categorisation
But: less ontological committment
 broader participation
 knowledge acquisition bottleneck reduced
 cost of building intelligent applications reduced
SKOS-based recommendations
DBpedia movie recommender: http://52.26.26.74:8890/dbmr
SKOS-based similarity
Horror
Movie
Comedy
skos:broaderskos:broader
Monster movie Haunted house movie
skos:broaderskos:broader
Slapstick
skos:broader
A B C
subject subject subject
taxonomy
taggedcontent
Boosting SKOS management
Taxonomical
knowledge
Information
resources
SKOS
taxonomy
Boosting SKOS management
Taxonomical
knowledge
Information
resources
SKOS
taxonomy
Discovery
•iterative
•interactive
•semi-automated
Pool Party Semantic Suite
(Semantic Web Company)
DBpedia subjects
reconciliation / import Concept extraction
Recurrent term suggestion
Context disambiguation
Movies
„jaws" @en
skos:altLabel
„jaws" @en
skos:altLabel
Anatomy
Summary
SKOS is an RDF-based standard for representing taxonomies, thesauri, etc.
•Conceptually simple with minimum ontological commitment:
•No complex formal semantics and logical implications
•Laverages NL semantics and facilitates sharing it
•Lower entry treshold for engaging experts/contributors/users
•Structurally simple:
•Easy to visualise and navigate
•Easy to exploit with graph-based algorithms
•Convenient bridge between structured and unstructured data.

Weitere ähnliche Inhalte

Mehr von Szymon Klarman

Formal Verification of Data Provenance Records
Formal Verification of Data Provenance RecordsFormal Verification of Data Provenance Records
Formal Verification of Data Provenance RecordsSzymon Klarman
 
Data driven approaches to empirical discovery
Data driven approaches to empirical discoveryData driven approaches to empirical discovery
Data driven approaches to empirical discoverySzymon Klarman
 
ABox Abduction in the Description Logic
ABox Abduction in the Description LogicABox Abduction in the Description Logic
ABox Abduction in the Description LogicSzymon Klarman
 
Judgment Aggregation as Maximization of Epistemic and Social Utility
Judgment Aggregation as Maximization of Epistemic and Social UtilityJudgment Aggregation as Maximization of Epistemic and Social Utility
Judgment Aggregation as Maximization of Epistemic and Social UtilitySzymon Klarman
 
Description Logics of Context
Description Logics of ContextDescription Logics of Context
Description Logics of ContextSzymon Klarman
 
Prediction and Explanation over DL-Lite Data Streams
Prediction and Explanation over DL-Lite Data StreamsPrediction and Explanation over DL-Lite Data Streams
Prediction and Explanation over DL-Lite Data StreamsSzymon Klarman
 
Querying Temporal Databases via OWL 2 QL
Querying Temporal Databases via OWL 2 QLQuerying Temporal Databases via OWL 2 QL
Querying Temporal Databases via OWL 2 QLSzymon Klarman
 
Ontology learning from interpretations in lightweight description logics
Ontology learning from interpretations in lightweight description logicsOntology learning from interpretations in lightweight description logics
Ontology learning from interpretations in lightweight description logicsSzymon Klarman
 
Knowledge Assembly at Scale with Semantic and Probabilistic Techniques
Knowledge Assembly at Scale with Semantic and Probabilistic TechniquesKnowledge Assembly at Scale with Semantic and Probabilistic Techniques
Knowledge Assembly at Scale with Semantic and Probabilistic TechniquesSzymon Klarman
 
What makes a linked data pattern interesting?
What makes a linked data pattern interesting?What makes a linked data pattern interesting?
What makes a linked data pattern interesting?Szymon Klarman
 

Mehr von Szymon Klarman (11)

HyperGraphQL
HyperGraphQLHyperGraphQL
HyperGraphQL
 
Formal Verification of Data Provenance Records
Formal Verification of Data Provenance RecordsFormal Verification of Data Provenance Records
Formal Verification of Data Provenance Records
 
Data driven approaches to empirical discovery
Data driven approaches to empirical discoveryData driven approaches to empirical discovery
Data driven approaches to empirical discovery
 
ABox Abduction in the Description Logic
ABox Abduction in the Description LogicABox Abduction in the Description Logic
ABox Abduction in the Description Logic
 
Judgment Aggregation as Maximization of Epistemic and Social Utility
Judgment Aggregation as Maximization of Epistemic and Social UtilityJudgment Aggregation as Maximization of Epistemic and Social Utility
Judgment Aggregation as Maximization of Epistemic and Social Utility
 
Description Logics of Context
Description Logics of ContextDescription Logics of Context
Description Logics of Context
 
Prediction and Explanation over DL-Lite Data Streams
Prediction and Explanation over DL-Lite Data StreamsPrediction and Explanation over DL-Lite Data Streams
Prediction and Explanation over DL-Lite Data Streams
 
Querying Temporal Databases via OWL 2 QL
Querying Temporal Databases via OWL 2 QLQuerying Temporal Databases via OWL 2 QL
Querying Temporal Databases via OWL 2 QL
 
Ontology learning from interpretations in lightweight description logics
Ontology learning from interpretations in lightweight description logicsOntology learning from interpretations in lightweight description logics
Ontology learning from interpretations in lightweight description logics
 
Knowledge Assembly at Scale with Semantic and Probabilistic Techniques
Knowledge Assembly at Scale with Semantic and Probabilistic TechniquesKnowledge Assembly at Scale with Semantic and Probabilistic Techniques
Knowledge Assembly at Scale with Semantic and Probabilistic Techniques
 
What makes a linked data pattern interesting?
What makes a linked data pattern interesting?What makes a linked data pattern interesting?
What makes a linked data pattern interesting?
 

Kürzlich hochgeladen

Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 

Kürzlich hochgeladen (20)

Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 

SKOS: Building taxonomies with minimum ontological commitment