Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data Access and Retrieval
1. Machine-Interpretable Dataset and Service Descriptions for Heterogeneous Data Access & Retrieval
Anastasia Dimou, Ruben Verborgh,
Miel Vander Sande, Erik Mannens, Rik Van de Walle
Anastasia.Dimou@UGent.be @natadimou
Ghent University – iMinds – Multimedia Lab
http://RML.io
11. Sets of triples of a dataset have repetitive patterns
[Figure: triple patterns built from templates: ex:{id}, “{firstname} {surname}”, ex:{lab}; ex:{lab} ex:located ex:{city}]
Triple-oriented mapping languages
formalize patterns into rules to map data to RDF
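The idea of formalizing repetitive triple patterns into rules can be sketched in a few lines of Python. This is a minimal illustration, not RML itself: the `expand` and `apply_rules` helpers, the sample record, and the `ex:name` and `ex:memberOf` predicates are assumptions made for the example. Only the templates `ex:{id}`, `ex:{lab}`, `ex:{city}`, “{firstname} {surname}” and the predicate `ex:located` come from the slide.

```python
import re


def expand(template, record):
    """Fill every {field} placeholder in a template with the record's value."""
    return re.sub(r"\{(\w+)\}", lambda m: str(record[m.group(1)]), template)


# Each rule is a (subject, predicate, object) pattern.
# ex:name and ex:memberOf are hypothetical predicates chosen for illustration.
RULES = [
    ("ex:{id}", "ex:name", '"{firstname} {surname}"'),
    ("ex:{id}", "ex:memberOf", "ex:{lab}"),
    ("ex:{lab}", "ex:located", "ex:{city}"),
]


def apply_rules(records, rules=RULES):
    """Apply every rule to every record and return the resulting triples."""
    triples = []
    for record in records:
        for s, p, o in rules:
            triples.append((expand(s, record), expand(p, record), expand(o, record)))
    return triples


if __name__ == "__main__":
    # Made-up sample record, not taken from the slides.
    people = [{"id": "1", "firstname": "Ada", "surname": "Lovelace",
               "lab": "mmlab", "city": "Ghent"}]
    for triple in apply_rules(people):
        print(" ".join(triple), ".")
```

Because the rules are data rather than code, the same three patterns apply unchanged to every record of the input.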
12. RDF Mapping Language (RML)
map any data to RDF
uniform, integrable, interoperable, extensible
extends the W3C-recommended R2RML
http://RML.io
A. Dimou, M. Vander Sande, P. Colpaert, R. Verborgh, E. Mannens, and R. Van de Walle.
RML: A Generic Language for Integrated RDF Mappings of Heterogeneous Data.
In Proceedings of the 7th Workshop on Linked Data on the Web (LDOW2014), 2014.
13. RML describes
rules to map any structured data to RDF
RML supports any data independently of
which structure and format they have
where they originally reside
how they are accessed & retrieved
14. Data access and retrieval
is performed manually
remains hard-coded
15. Mapping data
any data to RDF with RML
Specifying data
which data form a data input
how to reference extracts of that data input
Accessing & Retrieving data
the data input from its original source(s)
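The three concerns listed here (mapping, specifying, accessing and retrieving) can be kept apart in a data model. The sketch below is one hypothetical way to do so in Python; the class and field names are assumptions that only loosely echo RML notions such as logical source, reference formulation and iterator, and they are not the RML vocabulary itself.

```python
from dataclasses import dataclass, field, replace
from typing import List, Optional


@dataclass(frozen=True)
class Access:
    """Accessing & retrieving: where the data input lives."""
    kind: str       # e.g. "file", "database", "web", "sparql" (assumed labels)
    location: str   # path, connection string, or URL


@dataclass(frozen=True)
class LogicalSource:
    """Specifying: which data form the input and how extracts are referenced."""
    access: Access
    reference_formulation: str       # e.g. "CSV", "JSONPath", "XPath"
    iterator: Optional[str] = None   # e.g. "$.people[*]" for JSON input


@dataclass(frozen=True)
class TermRule:
    """Mapping: one predicate plus a reference to a data extract."""
    predicate: str
    reference: str


@dataclass(frozen=True)
class Mapping:
    source: LogicalSource
    subject_template: str
    rules: List[TermRule] = field(default_factory=list)


def retarget(mapping, access):
    """Return a copy of the mapping that reads from a different access point."""
    return replace(mapping, source=replace(mapping.source, access=access))


if __name__ == "__main__":
    local = Mapping(
        source=LogicalSource(Access("file", "labs.csv"), "CSV"),
        subject_template="ex:{lab}",
        rules=[TermRule("ex:located", "city")],
    )
    # Hypothetical URL; only the access description changes, the rules do not.
    remote = retarget(local, Access("web", "http://example.org/labs.csv"))
    print(remote.source.access)
```

Keeping the access description in its own object means a mapping can be pointed at another location without touching the mapping rules.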
24. Support data in Heterogeneous Structures and Formats
tabular-structured
tables in DBs or CSV files …
hierarchical-structured
JSON or XML …
(semi-)structured
HTML …
… … …
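To illustrate what supporting heterogeneous structures and formats can mean in practice, the sketch below reads the same made-up record from CSV, JSON and XML and exposes it through one uniform interface (an iterator of dictionaries). The helper names and the sample data are assumptions for this example; an RML processor is not claimed to work this way internally.

```python
import csv
import io
import json
import xml.etree.ElementTree as ET


def iter_csv(text):
    """Yield one dict per row of a CSV document with a header line."""
    for row in csv.DictReader(io.StringIO(text)):
        yield dict(row)


def iter_json(text, key):
    """Yield each object of the array stored under a top-level key."""
    yield from json.loads(text)[key]


def iter_xml(text, tag):
    """Yield one dict per <tag> element, built from its child elements."""
    for element in ET.fromstring(text).iter(tag):
        yield {child.tag: child.text for child in element}


# Made-up sample inputs carrying the same record in three formats.
CSV_DATA = "lab,city\nmmlab,Ghent\n"
JSON_DATA = '{"labs": [{"lab": "mmlab", "city": "Ghent"}]}'
XML_DATA = "<labs><entry><lab>mmlab</lab><city>Ghent</city></entry></labs>"

if __name__ == "__main__":
    inputs = (iter_csv(CSV_DATA),
              iter_json(JSON_DATA, "labs"),
              iter_xml(XML_DATA, "entry"))
    for records in inputs:
        for record in records:
            # The same rule is applied whatever the original format was.
            print(f"ex:{record['lab']} ex:located ex:{record['city']} .")
```

Once every format yields records of the same shape, a single mapping rule covers tabular and hierarchical inputs alike.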
37. Support different Locations and Access Interfaces
Local File(s)
Database connectivity
Web source(s)
RDF source(s)
38. Dataset and Service Vocabularies
advertise, in a machine-interpretable fashion,
how to access the underlying data
can also be used in combination with RML
to retrieve the data input to be mapped
from its original source
39. Support different Locations and Access Interfaces
Local File(s)
Database connectivity
D2RQ
Web source(s) (Web API/service)
DCAT, CSVW, Hydra, VOiD (Dataset)
RDF source(s)
VOiD (Endpoint), SPARQL-SD
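A processor that consumes such descriptions could dispatch on the kind of access interface instead of hard-coding retrieval. The following sketch assumes a simplified description given as a plain dictionary; the keys `kind`, `path`, `dsn`, `query`, `url` and `endpoint` are hypothetical stand-ins, not the actual terms of D2RQ, DCAT, CSVW, Hydra, VOiD or SPARQL-SD, and `sqlite3` merely stands in for general database connectivity.

```python
import sqlite3
import urllib.parse
import urllib.request
from contextlib import closing
from pathlib import Path


def from_file(desc):
    """Local file: read the whole file as text."""
    return Path(desc["path"]).read_text(encoding="utf-8")


def from_database(desc):
    """Database: run the described query and return all rows."""
    with closing(sqlite3.connect(desc["dsn"])) as connection:
        return connection.execute(desc["query"]).fetchall()


def from_web(desc):
    """Web source: fetch the document behind a URL."""
    with urllib.request.urlopen(desc["url"]) as response:
        return response.read().decode("utf-8")


def from_sparql(desc):
    """RDF source: send a query to a SPARQL endpoint via HTTP GET."""
    params = urllib.parse.urlencode({"query": desc["query"]})
    request = urllib.request.Request(
        f"{desc['endpoint']}?{params}",
        headers={"Accept": "application/sparql-results+json"},
    )
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")


RETRIEVERS = {
    "file": from_file,
    "database": from_database,
    "web": from_web,
    "sparql": from_sparql,
}


def retrieve(desc):
    """Pick the retrieval strategy named by the access description."""
    try:
        retriever = RETRIEVERS[desc["kind"]]
    except KeyError:
        raise ValueError(f"unsupported access kind: {desc.get('kind')!r}") from None
    return retriever(desc)
```

With this shape, supporting a further access interface amounts to registering one more function in `RETRIEVERS`; the mapping rules stay untouched.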
57. Data access, retrieval and mapping descriptions
are machine-interpretable
A granular, robust solution based on RML
which further automates and facilitates
the generation of RDF representations