A mini project by 'Don't move the plants' at the 8th Summer School on Ontology Engineering and the Semantic Web 2011. The project was completed by Andrea Nuzzolese, Esther Lozano, Ferdinand Dhombres, Luca Greco and Tim Hodson.
Ontology Alignment using Linked Data
1. Ontology Alignment Discovery
using Linked Open Data
Esther LOZANO, Andrea NUZZOLESE, Luca GRECO,
Tim HODSON, Ferdinand DHOMBRES
JE1: What linked data tells us about ontology relations?
2. Objective
• Explore dataset links in the LOD Cloud
to infer alignments between ontologies
[Figure: LOD cloud diagram, as of September 2010. Legend: Media, Geographic, Publications, User-generated content, Government, Cross-domain, Life sciences.]
http://richard.cyganiak.de/2007/10/lod/lod-datasets_2010-09-22_colored.pdf
http://www.webology.org/2006/v3n3/images/sample.JPG
3. Methods
• Find appropriate datasets in the LOD Cloud (with an ontology and a SPARQL endpoint)
• Retrieve/build linksets between datasets
• Generate new graph describing candidate alignments
• Infer mappings between ontologies
• Alignment evaluation
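The step from linksets to candidate alignments can be sketched in a few lines of Python. This is a minimal in-memory illustration, not the project's implementation: the function name `count_class_pairs`, the dictionary-based type lookups, and the toy identifiers are all assumptions made for the example. In practice the links and types would come from the datasets' SPARQL endpoints.

```python
from collections import Counter


def count_class_pairs(same_as_links, types_a, types_b):
    """Count class pairs that co-occur across owl:sameAs-linked individuals.

    same_as_links: iterable of (individual_in_A, individual_in_B) pairs
    types_a, types_b: dicts mapping an individual to the set of its classes
    (hypothetical data layout, chosen for this sketch only)
    """
    pairs = Counter()
    for ind_a, ind_b in same_as_links:
        for class_a in types_a.get(ind_a, ()):
            for class_b in types_b.get(ind_b, ()):
                pairs[(class_a, class_b)] += 1
    return pairs


# Toy data, invented for illustration (not taken from the project's datasets)
links = [("a:1", "b:1"), ("a:2", "b:2"), ("a:3", "b:3")]
types_a = {"a:1": {"A:Actor"}, "a:2": {"A:Actor"}, "a:3": {"A:Film"}}
types_b = {
    "b:1": {"B:Actor", "B:Person"},
    "b:2": {"B:Person"},
    "b:3": {"B:Film"},
}

candidates = count_class_pairs(links, types_a, types_b)
for (class_a, class_b), n_links in sorted(candidates.items()):
    print(class_a, class_b, n_links)
```

Each counted pair is only a candidate: the raw count still has to be normalised and compared against a threshold before a mapping is inferred.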
4. Principle
Class C1 owl:equivalentClass C2
Individual D1 owl:sameAs D2
6. Principle
[Diagram with three levels]
Property: isPossibleDrug, with rdfs:domain and rdfs:range
Class: Antalgic, disease, health condition, related by owl:equivalentClass and rdfs:subClassOf
Individual: Aspirin (500 mg) isPossibleDrug hangover
8. From links to candidates
A metric and a threshold to identify potential alignments
Class: C1 owl:equivalentClass C2
Individual: D1 owl:sameAs D2
            D1' owl:sameAs D2'
            D1" owl:sameAs D2"
Unlinked individuals shown: D1, D1, D1, D2
9. From links to candidates
A metric and a threshold to identify potential alignments
Class: C1 owl:equivalentClass C2, with P1 attached to C1 and P2 attached to C2
Individual: D1 owl:sameAs D2
            D1' owl:sameAs D2'
            D1" owl:sameAs D2"
Unlinked individuals shown: D1, D1, D1, D2
P1 = 0.5, P2 = 0.75
10. From links to candidates
A metric and a threshold to identify potential alignments
Class: C1 owl:equivalentClass C2 if (P1 + P2)/2 > x
Individual: D1 owl:sameAs D2
            D1' owl:sameAs D2'
            D1" owl:sameAs D2"
Unlinked individuals shown: D1, D1, D1, D2
P1 = 0.5, P2 = 0.75
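The threshold test (P1 + P2)/2 > x can be worked through in Python. The slides do not define P1 and P2, so this sketch assumes they are the fractions of each class's individuals that take part in an owl:sameAs link to the other class. That reading matches the diagram: three links, six individuals on the C1 side and four on the C2 side give 0.5 and 0.75. The function names and the threshold value 0.6 are hypothetical.

```python
def alignment_score(n_links, n_instances_c1, n_instances_c2):
    """Return (P1, P2, mean) for a candidate class pair.

    Assumption: P1 and P2 are the shares of each class's individuals that are
    linked by owl:sameAs to an individual of the other class, and each
    individual takes part in at most one link.
    """
    p1 = n_links / n_instances_c1
    p2 = n_links / n_instances_c2
    return p1, p2, (p1 + p2) / 2


def is_candidate(score, threshold):
    """Apply the slide's test (P1 + P2)/2 > x, with x passed as threshold."""
    return score > threshold


# Counts read off the diagram: 3 links, 6 individuals in C1, 4 in C2
p1, p2, score = alignment_score(3, 6, 4)
print(p1, p2, score)

# x = 0.6 is an arbitrary illustrative threshold, not a value from the slides
print(is_candidate(score, threshold=0.6))
```

If an individual can carry several links, the numerators should count distinct linked individuals rather than links; otherwise P1 or P2 could exceed 1.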
11. Results - 1
Material collection issues
[Figure: excerpt of the LOD cloud diagram]
• Datasets without ontology or with one class
• SPARQL endpoint unavailability
• Linksets without owl:sameAs
• Linksets with only one class
13. Result - 2
Linked Movie DB & DBpedia
[Figure: excerpt of the LOD cloud diagram]
• Datasets with good ontologies
• SPARQL endpoints available
• owl:sameAs links available
• Linked individuals belonging to different classes
14. Result - 3
Correlation
[Figure: Linked MDB and DBpedia classes, drawn over an excerpt of the LOD cloud diagram. Class labels shown: Artist, Actor (twice), Person, Comedian, Movie, Film. Values shown: 1869, 83847, 1735, 37751, 50603, 2027, 363751, 30, 675, ...]
15. Result - 3
Correlation
[Figure: Linked MDB and DBpedia classes, drawn over an excerpt of the LOD cloud diagram. Class labels shown: Actor (twice), Artist, Person, Comedian, Movie, Film. Values shown: 2.95, 2.2 %, 3.7 %, 3.4 %, 4, 4.6 %, 2.0 %, 2.3, 0.6 %, 0.06 %, 2.23, 4.4 %, ...]
18. Conclusion
• Quality issues in datasets
• Existing ontology matcher was not effective
• New alignments discovered using LOD
• Approach could be used where the natural languages of the ontologies differ
• The approach still has to be assessed (other datasets, ontologies, matchers)