This is the presentation showed during ISWC 2014 at Riva del Garda. The session was titled "Developers Workshop", and the focus was on how you solved practical problems for Linked Data. We presented dandelion platform and our data curation workflow, and the overall idea of dataGEM APIs.
ISWC 2014 - Dandelion: from raw data to dataGEMs for developers
1. Dandelion: from raw data
to dataGEMs for
developers
Stefano Parmesan
Tatiana Tarasova
Ugo Scaiella
Michele Barbera
2. A bit of context
• SpazioDati s.r.l.
• Italian startup: Pisa & Trento
• Members of the DBpedia Association
• Manage the italian DBpedia
3. Goal
• Close the gap between getting the data and
using it
• Build a Knowledge Graph as-a-service:
• Make it querable
• Make it stable, make it scale
• Support different access levels
4. How?
• Phase #1: PUT the data in
• Data normalization
• Entity deduplication
• Phase #2: GET the data out
• Slices
5. How?
Data Normalisation Entity Deduplication Data Storage Data Access
Sample
Raw Data
Reconciliation Services
Source 1
Source N
Azkaban Silk
Framework Titan Graph dandelion.eu
Linked Data
Slices
dataGEM
6. Why…
• … slices?
• SQL-like APIs
• Common knowledge, linked data
• … a graph at all?
• Traversals
• Data is centralized
• Different sources, different access levels
8. And now what?
• Still a prototype:
• Private beta access to slices (demo)
• English and italian DBpedia
• Corporate private data
9. Future?
• Phase #1b: PUT the data in
• Scalable entity deduplication
• Phase #2b: GET the data out
• API for graph traversal
• Text analysis tools (dataTXT)
• Customizations