Breaking the Kubernetes Kill Chain: Host Path Mount
gStore: A Graph-based SPARQL Query Engine
1. gStore: A Graph-based SPARQL Query Engine
¨
M. Tamer Ozsu
University of Waterloo
David R. Cheriton School of Computer Science
Joint work with Lei Zou, Peking University and Lei Chen, Hong
Kong University of Science and Technology
UPenn/2012-12-04 1
2. RDF and Semantic Web
RDF is a language for the conceptual modeling of information
about web resources
A building block of semantic web
Facilitates exchange of information
Search engines can retrieve more relevant information
Facilitates data integration (mashes)
Machine understandable
Understand the information on the web and the
interrelationships among them
UPenn/2012-12-04 2
3. RDF Uses
Yago and DBPedia extract facts from Wikipedia & represent
as RDF → structural queries
Communities build RDF data
E.g., biologists: Bio2RDF and Uniprot RDF
Web data integration
Linked Data Cloud
...
UPenn/2012-12-04 3
4. RDF Data Volumes . . .
. . . are growing – and fast
Linked data cloud currently consists of 325 datasets with
>25B triples
Size almost doubling every year
UPenn/2012-12-04 4
5. RDF Data Volumes . . .
. . . are growing – and fast
Linked data cloud currently consists of 325 datasets with
>25B triples
Size almost doubling every year
Sem- Wiki-
Surge Web- company
Radio LIBRIS Central RDF
ohloh
Doap-
Music- space Semantic Resex
brainz Audio- Eurécom
Flickr Web.org
MySpace Scrobbler QDOS SW
exporter
Wrapper
Conference IRIT
Corpus Toulouse
RAE
BBC BBC Crunch 2001
FOAF SIOC ACM
BBC Later + John Base Revyu
Jamendo Peel profiles Sites
Playcount TOTP Open- Buda-
Data Guides pest
DBLP BME
flickr RKB
Project
Pub Geo- Euro- wrappr Explorer
Guten- Virtuoso
Guide names stat berg Pisa
BBC Sponger eprints
Programm
Open
es
Calais New-
riese World Linked ECS
castle
Fact- MDB South-
IEEE
book ampton
Magna-
Gov- tune RDF Book
Track Mashup
DBpedia
US
Census
Data
W3C
WordNet
lingvoj Freebase
DBLP
Hannover
CiteSeer
UniRef
LAAS-
CNRS
IBM
March ’09:
GEO
Open
Cyc Yago
UMBEL
LinkedCT
Species DBLP
Berlin
Reactome
UniParc
Taxonomy
89 datasets
Drug
PROSITE
Daily Bank
Med
Pub GeneID
Homolo Chem
Gene KEGG UniProt
Pfam ProDom
Disea- CAS
Gene
some
ChEBI Ontology
Symbol OMIM
Inter
Pro
UniSTS PDB
HGNC
MGI
PubMed
As of March 2009
Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.
http://lod-cloud.net/
UPenn/2012-12-04 4
6. RDF Data Volumes . . .
. . . are growing – and fast
Linked data cloud currently consists of 325 datasets with
>25B triples
Size almost doubling every year
Sussex St.
Reading Andrews NDL
Audio- Lists Resource subjects t4gm
MySpace scrobbler Lists
Moseley (DBTune) (DBTune) RAMEAU
Folk NTU SH lobid
GTAA Plymouth Resource
Lists
Organi-
Reading
Lists
sations
Music The Open ECS
Magna- Brainz Music
DB tune Library LCSH South-
(Data Brainz LIBRIS ampton
Tropes lobid Ulm
Incubator) (zitgist) Man- EPrints
Resources
chester
Surge Reading
biz. Music RISKS
Radio Lists The Open ECS
data. John Brainz
Discogs Library PSH Gem. UB South-
gov.uk Peel (DBTune)
FanHubz (Data In- (Talis) Norm- Mann- ampton
(DB cubator) Jamendo datei heim RESEX
Tune)
Popula- Poké- DEPLOY
Last.fm
tion (En- pédia
Artists Last.FM Linked RDF
AKTing) research EUTC (DBTune) (rdfize) LCCN VIAF Book Wiki
data.gov Produc- Pisa Eurécom
P20 Mashup semantic
NHS .uk tions classical web.org
(EnAKTing) Pokedex
(DB
Mortality Tune) PBAC ECS
(En-
AKTing)
BBC MARC (RKB Budapest
Program Codes Explorer)
Energy education OpenEI BBC List Semantic Lotico Revyu OAI
(En- CO2 data.gov mes Music Crunch SW
AKTing) (En- .uk Chronic- Linked Dog
NSZL Base
AKTing) ling Event- MDB RDF Food IRIT
America Media Catalog
ohloh
BBC DBLP ACM IBM
Good- BibBase
Ord- Wildlife (RKB
Openly Recht- win
nance Finder Explorer)
Local spraak. Family DBLP
legislation Survey Tele- New VIVO UF
.gov.uk nl graphis York flickr (L3S) New-
VIVO castle
Times URI wrappr Open Indiana RAE2001
UK Post- Burner Calais DBLP
codes statistics (FU
VIVO CiteSeer Roma
September ’10:
data.gov LOIUS Taxon iServe Berlin) IEEE
.uk Cornell
Concept Geo
World data
ESD Fact- OS dcs
Names book dotAC
stan- reference Project
Linked Data NASA (FUB) Freebase
dards data.gov Guten-
.uk
for Intervals (Data GESIS Course-
transport DBpedia berg STW ePrints CORDIS
Incu- ware
data.gov bator) (FUB)
Fishes ERA UN/
.uk
203 datasets
of Texas Geo LOCODE
Uberblic
Euro- Species
The stat dbpedia TCM SIDER Pub KISTI
(FUB) lite Gene STITCH Chem JISC
London Geo KEGG
DIT LAAS
Gazette TWC LOGD Linked Daily OBO Drug
Eurostat Data UMBEL lingvoj Med
(es) Disea-
YAGO Medi some
Care ChEBI KEGG NSF
Linked KEGG KEGG
Linked Drug Cpd
GovTrack rdfabout Glycan
Sensor Data CT Bank Pathway
US SEC Open Reactome
(Kno.e.sis) riese Uni
Cyc Lexvo Path-
way PDB Media
Semantic totl.net Pfam
HGNC
XBRL
WordNet KEGG KEGG Geographic
(VUA) Linked Taxo- CAS Reaction
rdfabout Twarql UniProt Enzyme
EUNIS Open nomy
US Census Publications
Numbers PRO- ProDom
SITE Chem2
UniRef Bio2RDF User-generated content
Climbing WordNet SGD Homolo
Linked (W3C) Affy- Gene
GeoData
Cornetto
metrix Government
PubMed Gene
UniParc
Ontology
GeneID Cross-domain
Airports
Product
DB UniSTS MGI
Gen Life sciences
Bank OMIM InterPro
As of September 2010
Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.
http://lod-cloud.net/
UPenn/2012-12-04 4