1. Building arguments on open data
philippe duchesne
phd@highlatitud.es
@pduchesne
OpenBelgium 2018
Louvain-la-Neuve
March 12th, 2018
how annotated open data fragments can rest your case
4. Ultimately...
Intro
the goal is to model and share a thought process based on open data
it has to be
shareable -> open data
verifiable -> assessment of authenticity of sources
reproducible -> standards-based, open description of process
5. Preamble : Data standards
Tech
as the 3rd star of open data,
it should be obvious by now... but considering the current state
of open data, it's better to rub it in
XLS sheets
ZIP files
a link to a webpage that gathers some
data
....
are not open data standards!
6. Verifiability & reproducibility
--> capture data lineage
Tech
Towards a Mosaic standard : Metadata standards
data provenance metadata
origin
authenticity
based on Dublin Core vocabulary
data processing modelling
need for data process description vocabulary
Considered ontologies: OntoDM, DMOP
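As an illustration of the provenance bullets above, here is a minimal sketch of a Dublin Core based record for one data file. The `dcterms:` properties are real DCMI terms; the `ex:sha256` field, the `provenance_record` and `is_authentic` helpers and the example.org values are assumptions made for this sketch, not part of any Mosaic specification.

```python
import hashlib
import json

DCTERMS = "http://purl.org/dc/terms/"


def provenance_record(source_url: str, publisher: str, issued: str, content: bytes) -> dict:
    """Describe the origin of a data file and keep a digest to check it later."""
    return {
        "@context": {"dcterms": DCTERMS, "ex": "http://example.org/ns#"},
        "dcterms:source": source_url,       # origin: where the data was fetched from
        "dcterms:publisher": publisher,     # origin: who published it
        "dcterms:issued": issued,           # origin: publication date
        # Hypothetical field: digest of the bytes that were actually used.
        "ex:sha256": hashlib.sha256(content).hexdigest(),
    }


def is_authentic(record: dict, content: bytes) -> bool:
    """Return True when the content still matches the recorded digest."""
    return hashlib.sha256(content).hexdigest() == record["ex:sha256"]


if __name__ == "__main__":
    data = b"year,value\n2017,42\n"  # made-up sample data
    record = provenance_record(
        "http://example.org/data.csv", "Example Agency", "2018-03-12", data
    )
    print(json.dumps(record, indent=2))
    print(is_authentic(record, data))
```

A digest only shows that the bytes are unchanged since the record was made; assessing the authenticity of the source itself needs more, for instance a signature from the publisher.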
7. qualifying annotations : per-domain controlled vocabularies
Reproducibility : need for a data process description vocabulary
http://www.ontodm.com/doku.php?id=ontodm-core
http://www.e-lico.eu/DMOP.html
Tech
Towards a Mosaic standard : Controlled Vocabularies
8. largely based on Open Annotation Model
Tech
Towards a Mosaic standard : Annotation Model
consolidates URL fragment syntax
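To make the idea of annotating a data fragment concrete, here is a minimal sketch that turns a URL with a fragment into an annotation. It assumes the JSON-LD shape of the W3C Web Annotation Data Model (the successor of the Open Annotation model) and the CSV fragment syntax of RFC 7111; whether Mosaic serializes annotations exactly this way is an assumption, and `annotate_fragment` and the example.org URL are hypothetical.

```python
import json
from urllib.parse import urldefrag

ANNO_CONTEXT = "http://www.w3.org/ns/anno.jsonld"
CSV_FRAGMENTS = "http://tools.ietf.org/html/rfc7111"  # fragment syntax for text/csv


def annotate_fragment(fragment_url: str, comment: str) -> dict:
    """Build an annotation whose target is a fragment of a dataset."""
    source, fragment = urldefrag(fragment_url)
    if not fragment:
        raise ValueError("expected a URL with a #fragment part")
    return {
        "@context": ANNO_CONTEXT,
        "type": "Annotation",
        "body": {"type": "TextualBody", "value": comment},
        "target": {
            "source": source,
            "selector": {
                "type": "FragmentSelector",
                "conformsTo": CSV_FRAGMENTS,
                "value": fragment,
            },
        },
    }


if __name__ == "__main__":
    annotation = annotate_fragment(
        "http://example.org/data.csv#row=5-7",  # hypothetical dataset, rows 5 to 7
        "these rows support the argument",
    )
    print(json.dumps(annotation, indent=2))
```

Keeping the fragment in a selector, separate from the source URL, lets the same dataset carry many annotations that can be grouped by source.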
9. standard modelling language for visualization
Tech
Towards a Mosaic standard : Visualization model
Vega from Interactive Data Lab
http://idl.cs.washington.edu
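For readers who have not seen Vega, a visualization is described as a declarative JSON document. The sketch below builds a minimal Vega bar chart specification as a Python dictionary; the `bar_chart_spec` helper, the field names and the sample rows are made up for illustration, and how Mosaic embeds such a specification is not shown here.

```python
import json


def bar_chart_spec(rows: list) -> dict:
    """Return a minimal Vega bar chart spec for rows with 'category' and 'amount'."""
    return {
        "$schema": "https://vega.github.io/schema/vega/v5.json",
        "width": 300,
        "height": 150,
        "data": [{"name": "table", "values": rows}],
        "scales": [
            {"name": "x", "type": "band", "range": "width", "padding": 0.1,
             "domain": {"data": "table", "field": "category"}},
            {"name": "y", "type": "linear", "range": "height", "nice": True,
             "domain": {"data": "table", "field": "amount"}},
        ],
        "axes": [
            {"orient": "bottom", "scale": "x"},
            {"orient": "left", "scale": "y"},
        ],
        "marks": [{
            "type": "rect",
            "from": {"data": "table"},
            "encode": {"enter": {
                "x": {"scale": "x", "field": "category"},
                "width": {"scale": "x", "band": 1},
                "y": {"scale": "y", "field": "amount"},
                "y2": {"scale": "y", "value": 0},
            }},
        }],
    }


if __name__ == "__main__":
    sample = [{"category": "A", "amount": 28}, {"category": "B", "amount": 55}]
    print(json.dumps(bar_chart_spec(sample), indent=2))
```

Because the specification is plain JSON, it can be stored and shared next to the annotations it illustrates, and rendered by any Vega runtime.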
10. Towards a Mosaic standard
Intro
Open
Annotation
Model
Dublin
Core
Metadata
Vega viz
grammar
+ Domain controlled vocabularies
Process
ML?
15. But also...
open science
shareable annotations -->
http://demo.highlatitud.es/api/mosaics/openbelgium18
Scenarios
16. ROI : Enriching back the data
What's next
17. Grounding Data
the annotation model is RDF-based and queryable with SPARQL
--> graph of all annotations constitutes grounding material for
data itself, and gives context and semantics to data
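As a rough illustration of querying the annotation graph, the sketch below finds every annotation body attached to a given dataset. It assumes the graph uses the `oa:hasTarget`, `oa:hasSource` and `oa:hasBody` properties of the Web Annotation vocabulary. The triples are made-up sample data and the small Python matcher only stands in for a real SPARQL endpoint; the query string shows what would be sent to one.

```python
OA = "http://www.w3.org/ns/oa#"


def sparql_for_source(source: str) -> str:
    """Build a SPARQL query selecting the bodies of annotations on a dataset."""
    return (
        "PREFIX oa: <http://www.w3.org/ns/oa#>\n"
        "SELECT ?body WHERE {\n"
        "  ?annotation oa:hasTarget ?target ;\n"
        "              oa:hasBody ?body .\n"
        f"  ?target oa:hasSource <{source}> .\n"
        "}"
    )


def bodies_for_source(triples: list, source: str) -> list:
    """Evaluate the same pattern over an in-memory list of (s, p, o) triples."""
    targets = {s for s, p, o in triples if p == OA + "hasSource" and o == source}
    annotations = {s for s, p, o in triples if p == OA + "hasTarget" and o in targets}
    return sorted(o for s, p, o in triples if p == OA + "hasBody" and s in annotations)


# Made-up sample graph: two annotations on two different datasets.
SAMPLE_TRIPLES = [
    ("http://example.org/anno/1", OA + "hasTarget", "http://example.org/target/1"),
    ("http://example.org/anno/1", OA + "hasBody", "http://example.org/body/1"),
    ("http://example.org/target/1", OA + "hasSource", "http://example.org/data.csv"),
    ("http://example.org/anno/2", OA + "hasTarget", "http://example.org/target/2"),
    ("http://example.org/anno/2", OA + "hasBody", "http://example.org/body/2"),
    ("http://example.org/target/2", OA + "hasSource", "http://example.org/other.csv"),
]

if __name__ == "__main__":
    print(sparql_for_source("http://example.org/data.csv"))
    print(bodies_for_source(SAMPLE_TRIPLES, "http://example.org/data.csv"))
```

In this reading, the set of bodies returned for a dataset is the context that the annotation graph contributes back to the data.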
What's next
19. Grounding Data : integration
What's next
20. philippe duchesne
phd@highlatitud.es
@pduchesne
thank you
questions?
more information
http://demo.highlatitud.es/#/doc
this demo material
http://demo.highlatitud.es/api/mosaics/openbelgium18