SlideShare ist ein Scribd-Unternehmen logo
1 von 36
Downloaden Sie, um offline zu lesen
Initial Usage Analysis

of DBpedia's

Triple Pattern Fragments
Ruben Verborgh,

Erik Mannens & Rik Van de Walle
What role will the Semantic Web play

for the future generation?
Will it be remotely as important

as the Web now is to us?
There used to be no applications

because there was no data.
Linked Data more of less solved

this chicken-and-egg problem.
There are no applications

because data is not queryable.
SPARQL endpoints are unreliable.

Data dumps are not live.
We analyzed DBpedia’s low-cost

Triple Pattern Fragments interface

between Nov 2014 and Feb 2015.
Over 4M requests were made.

There was 1 minute of downtime.
Web interfaces to triples
Four months of fragments
Extending the analysis
Web interfaces to triples
Four months of fragments
Extending the analysis
Web interfaces act as gateways

between clients and databases.
Database Client
Web
interface
The interface hides the database schema.
The interface restricts the kind of queries.
No sane Web developer or admin

would give direct database access.
Database Client
Web
interface
The client must know the database schema.
The client can ask any query.
SPARQL endpoints happily give

direct access to the database.
Triple

store
Client
SPARQL
protocol
The client must know the database schema.
The client can ask any query.
SPARQL interfaces are expensive,

so we have an availability problem.
There are few SPARQL endpoints

because hosting them is expensive.
Many of the endpoints that exist

suffer from low availability.
You already give data for free.

Do you have to pay for query time as well?
Data dumps allow you to set up

your own private SPARQL endpoint.
But then we no longer query the Web…
No usage statistics whatsoever.
Not everybody can do this:

mobile devices, non-technical users, …
The interface hides the database schema.
The interface restricts the kind of queries.
A Triple Pattern Fragments interface

acts as a gateway to an RDF source.
RDF

source
Client
TPF

interface
A Triple Pattern Fragments interface

acts as a gateway to an RDF source.
Client can only ask ?s ?p ?o patterns.
Decompose complex SPARQL queries

on the client client-side.
Low server cost, highly cacheable,

but higher bandwidth and query time.
Web interfaces to triples
Four months of fragments
Extending the analysis
In mid-October 2014, we started

an official TPF interface for DBpedia.
Will this interface be used?
How will clients use it?
Will the availability be sufficient

for live application usage?
The server is deployed virtually,
availability monitored externally.
Amazon Elastic Compute Cloud

c3.2xlarge machine (8 CPU, 15GB RAM)
Compressed HDT format as backend
Pingdom for analysis
4.5 million Triple Pattern Fragments

of DBpedia 2014@en were requested.
The TPF client library consumed most,

followed by crawlers and Chrome.
Turtle as content type is most popular,

but being surpassed by TriG.
Most requests come from Europe,

the US and China are following.
The “type” fragment was popular,

but it’s hard to conclude anything.
A quarter of all requests was cached

(but we could cache everything).
During four months,

the API had 99.9994% availability.
We deeply apologize for

that one minute of downtime

in November.
Web interfaces to triples
Four months of fragments
Extending the analysis
We don’t know exactly

which clients executed queries.
Was the TPF client used standalone?
As a library of another application?
Also hard for SPARQL endpoints.
The analysis did not give insights

in which queries clients executed.
Good for privacy!
We can try reconstructing SPARQL queries,

but maybe clients did something else.
We only know with SPARQL endpoints,

not with data dumps or LD documents.
We could learn from the human Web:

can clients give explicit feedback?
“This is the query I executed.

It took me 5 seconds.”
Potential source of insights,

but clients need a gain.
Will this be representative/truthful?
Web interfaces to triples
Four months of fragments
Extending the analysis
We have a >99.999% available API

to the most popular RDF datasource.
No more excuses not to build apps.
So where are they?
Is something else holding us back?
We need to think differently

on how to build Linked Data apps.
The paradigm of querying a database

and waiting for the results

does not scale to the Web.
Live data requires new interfaces

and new visualizations.
We need developers to build bridges

from data to end users.
Now that the chicken-and-egg problem

and the availability problems are solved,

we need to tackle fundamental questions.
Where are the killer apps

the next generation is waiting for?
Initial Usage Analysis

of DBpedia's

Triple Pattern Fragments
@RubenVerborgh

ruben.verborgh.org

linkeddatafragments.org

Weitere ähnliche Inhalte

Was ist angesagt?

Querying federations 
of Triple Pattern Fragments
Querying federations 
of Triple Pattern FragmentsQuerying federations 
of Triple Pattern Fragments
Querying federations 
of Triple Pattern FragmentsRuben Verborgh
 
Linking media, data, and services
Linking media, data, and servicesLinking media, data, and services
Linking media, data, and servicesRuben Verborgh
 
Hypermedia APIs that make sense
Hypermedia APIs that make senseHypermedia APIs that make sense
Hypermedia APIs that make senseRuben Verborgh
 
The web – A hypermedia story
The web – A hypermedia storyThe web – A hypermedia story
The web – A hypermedia storyRuben Verborgh
 
The Lonesome LOD Cloud
The Lonesome LOD CloudThe Lonesome LOD Cloud
The Lonesome LOD CloudRuben Verborgh
 
Distributed Affordance
Distributed AffordanceDistributed Affordance
Distributed AffordanceRuben Verborgh
 
Functional Composition of Sensor Web APIs
Functional Composition of Sensor Web APIsFunctional Composition of Sensor Web APIs
Functional Composition of Sensor Web APIsRuben Verborgh
 
RESTdesc – Efficient runtime service discovery and consumption
RESTdesc – Efficient runtime service discovery and consumptionRESTdesc – Efficient runtime service discovery and consumption
RESTdesc – Efficient runtime service discovery and consumptionRuben Verborgh
 
(Re-)Discovering Lost Web Pages
(Re-)Discovering Lost Web Pages(Re-)Discovering Lost Web Pages
(Re-)Discovering Lost Web PagesMichael Nelson
 
On the Persistence of Persistent Identifiers of the Scholarly Web
On the Persistence of Persistent Identifiers of the Scholarly WebOn the Persistence of Persistent Identifiers of the Scholarly Web
On the Persistence of Persistent Identifiers of the Scholarly WebMartin Klein
 
Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web o...
Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web o...Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web o...
Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web o...Benjamin Adrian
 
Synchronicity: Just-In-Time Discovery of Lost Web Pages
Synchronicity: Just-In-Time Discovery of Lost Web PagesSynchronicity: Just-In-Time Discovery of Lost Web Pages
Synchronicity: Just-In-Time Discovery of Lost Web PagesMichael Nelson
 
Scraping data from the web and documents
Scraping data from the web and documentsScraping data from the web and documents
Scraping data from the web and documentsTommy Tavenner
 
Semantic framework for web scraping.
Semantic framework for web scraping.Semantic framework for web scraping.
Semantic framework for web scraping.Shyjal Raazi
 
Getting Started With The Talis Platform
Getting Started With The Talis PlatformGetting Started With The Talis Platform
Getting Started With The Talis PlatformLeigh Dodds
 

Was ist angesagt? (20)

Querying federations 
of Triple Pattern Fragments
Querying federations 
of Triple Pattern FragmentsQuerying federations 
of Triple Pattern Fragments
Querying federations 
of Triple Pattern Fragments
 
Linked Data Fragments
Linked Data FragmentsLinked Data Fragments
Linked Data Fragments
 
Linking media, data, and services
Linking media, data, and servicesLinking media, data, and services
Linking media, data, and services
 
Hypermedia APIs that make sense
Hypermedia APIs that make senseHypermedia APIs that make sense
Hypermedia APIs that make sense
 
The web – A hypermedia story
The web – A hypermedia storyThe web – A hypermedia story
The web – A hypermedia story
 
The Lonesome LOD Cloud
The Lonesome LOD CloudThe Lonesome LOD Cloud
The Lonesome LOD Cloud
 
Distributed Affordance
Distributed AffordanceDistributed Affordance
Distributed Affordance
 
Functional Composition of Sensor Web APIs
Functional Composition of Sensor Web APIsFunctional Composition of Sensor Web APIs
Functional Composition of Sensor Web APIs
 
RESTdesc – Efficient runtime service discovery and consumption
RESTdesc – Efficient runtime service discovery and consumptionRESTdesc – Efficient runtime service discovery and consumption
RESTdesc – Efficient runtime service discovery and consumption
 
(Re-)Discovering Lost Web Pages
(Re-)Discovering Lost Web Pages(Re-)Discovering Lost Web Pages
(Re-)Discovering Lost Web Pages
 
2010 Sopac Cosugi
2010 Sopac Cosugi2010 Sopac Cosugi
2010 Sopac Cosugi
 
On the Persistence of Persistent Identifiers of the Scholarly Web
On the Persistence of Persistent Identifiers of the Scholarly WebOn the Persistence of Persistent Identifiers of the Scholarly Web
On the Persistence of Persistent Identifiers of the Scholarly Web
 
Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web o...
Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web o...Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web o...
Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web o...
 
Introducing Placemaker
Introducing PlacemakerIntroducing Placemaker
Introducing Placemaker
 
Synchronicity: Just-In-Time Discovery of Lost Web Pages
Synchronicity: Just-In-Time Discovery of Lost Web PagesSynchronicity: Just-In-Time Discovery of Lost Web Pages
Synchronicity: Just-In-Time Discovery of Lost Web Pages
 
Scraping data from the web and documents
Scraping data from the web and documentsScraping data from the web and documents
Scraping data from the web and documents
 
Semantic framework for web scraping.
Semantic framework for web scraping.Semantic framework for web scraping.
Semantic framework for web scraping.
 
Getting Started With The Talis Platform
Getting Started With The Talis PlatformGetting Started With The Talis Platform
Getting Started With The Talis Platform
 
Christian Jakenfelds
Christian JakenfeldsChristian Jakenfelds
Christian Jakenfelds
 
Web Scraping Basics
Web Scraping BasicsWeb Scraping Basics
Web Scraping Basics
 

Andere mochten auch

Aligning Web Services with the Semantic Web to Create a Global Read-Write Gra...
Aligning Web Services with the Semantic Web to Create a Global Read-Write Gra...Aligning Web Services with the Semantic Web to Create a Global Read-Write Gra...
Aligning Web Services with the Semantic Web to Create a Global Read-Write Gra...Markus Lanthaler
 
Adrs Presentation March 2008
Adrs Presentation March 2008Adrs Presentation March 2008
Adrs Presentation March 2008guestabd20
 
A Semantic Description Language for RESTful Data Services to Combat Semaphobia
A Semantic Description Language for RESTful Data Services to Combat SemaphobiaA Semantic Description Language for RESTful Data Services to Combat Semaphobia
A Semantic Description Language for RESTful Data Services to Combat SemaphobiaMarkus Lanthaler
 
SAPS - Semantic AtomPub-based Services
SAPS - Semantic AtomPub-based ServicesSAPS - Semantic AtomPub-based Services
SAPS - Semantic AtomPub-based ServicesMarkus Lanthaler
 
Semantic Web Services: State of the Art
Semantic Web Services: State of the ArtSemantic Web Services: State of the Art
Semantic Web Services: State of the ArtMarkus Lanthaler
 
Hypermedia Cannot be the Engine
Hypermedia Cannot be the EngineHypermedia Cannot be the Engine
Hypermedia Cannot be the EngineRuben Verborgh
 
End-to-end W3C APIs - tpac 2012
End-to-end W3C APIs - tpac 2012End-to-end W3C APIs - tpac 2012
End-to-end W3C APIs - tpac 2012Alexandre Morgaut
 
A Deep Dive into JSON-LD and Hydra
A Deep Dive into JSON-LD and HydraA Deep Dive into JSON-LD and Hydra
A Deep Dive into JSON-LD and HydraMarkus Lanthaler
 
Web Standards adoption in the AR market
Web Standards adoption in the AR marketWeb Standards adoption in the AR market
Web Standards adoption in the AR marketRob Manson
 
LODeX: Schema Summarization and automatic SPARQL query generation for Linked ...
LODeX: Schema Summarization and automatic SPARQL query generation for Linked ...LODeX: Schema Summarization and automatic SPARQL query generation for Linked ...
LODeX: Schema Summarization and automatic SPARQL query generation for Linked ...Fabio Benedetti
 
Linked Data Generation Process
Linked Data Generation ProcessLinked Data Generation Process
Linked Data Generation ProcessLD4SC
 
What is Hydra?
What is Hydra?What is Hydra?
What is Hydra?Findwise
 
Lisp Macros in 20 Minutes (Featuring Clojure)
Lisp Macros in 20 Minutes (Featuring Clojure)Lisp Macros in 20 Minutes (Featuring Clojure)
Lisp Macros in 20 Minutes (Featuring Clojure)Phil Calçado
 
HTTP and Your Angry Dog
HTTP and Your Angry DogHTTP and Your Angry Dog
HTTP and Your Angry DogRoss Tuck
 
A Web of Things to Reduce Energy Wastage
A Web of Things to Reduce Energy WastageA Web of Things to Reduce Energy Wastage
A Web of Things to Reduce Energy WastageMarkus Lanthaler
 
Rest and the hypermedia constraint
Rest and the hypermedia constraintRest and the hypermedia constraint
Rest and the hypermedia constraintInviqa
 
The Web 3.0 is just around the corner. Be prepared!
The Web 3.0 is just around the corner. Be prepared!The Web 3.0 is just around the corner. Be prepared!
The Web 3.0 is just around the corner. Be prepared!Markus Lanthaler
 
Exploiting the query structure for efficient join ordering in SPARQL queries
Exploiting the query structure for efficient join ordering in SPARQL queriesExploiting the query structure for efficient join ordering in SPARQL queries
Exploiting the query structure for efficient join ordering in SPARQL queriesLuiz Henrique Zambom Santana
 

Andere mochten auch (19)

Aligning Web Services with the Semantic Web to Create a Global Read-Write Gra...
Aligning Web Services with the Semantic Web to Create a Global Read-Write Gra...Aligning Web Services with the Semantic Web to Create a Global Read-Write Gra...
Aligning Web Services with the Semantic Web to Create a Global Read-Write Gra...
 
Adrs Presentation March 2008
Adrs Presentation March 2008Adrs Presentation March 2008
Adrs Presentation March 2008
 
A Semantic Description Language for RESTful Data Services to Combat Semaphobia
A Semantic Description Language for RESTful Data Services to Combat SemaphobiaA Semantic Description Language for RESTful Data Services to Combat Semaphobia
A Semantic Description Language for RESTful Data Services to Combat Semaphobia
 
SAPS - Semantic AtomPub-based Services
SAPS - Semantic AtomPub-based ServicesSAPS - Semantic AtomPub-based Services
SAPS - Semantic AtomPub-based Services
 
Semantic Web Services: State of the Art
Semantic Web Services: State of the ArtSemantic Web Services: State of the Art
Semantic Web Services: State of the Art
 
Hypermedia Cannot be the Engine
Hypermedia Cannot be the EngineHypermedia Cannot be the Engine
Hypermedia Cannot be the Engine
 
F-interop Meetup
F-interop MeetupF-interop Meetup
F-interop Meetup
 
End-to-end W3C APIs - tpac 2012
End-to-end W3C APIs - tpac 2012End-to-end W3C APIs - tpac 2012
End-to-end W3C APIs - tpac 2012
 
A Deep Dive into JSON-LD and Hydra
A Deep Dive into JSON-LD and HydraA Deep Dive into JSON-LD and Hydra
A Deep Dive into JSON-LD and Hydra
 
Web Standards adoption in the AR market
Web Standards adoption in the AR marketWeb Standards adoption in the AR market
Web Standards adoption in the AR market
 
LODeX: Schema Summarization and automatic SPARQL query generation for Linked ...
LODeX: Schema Summarization and automatic SPARQL query generation for Linked ...LODeX: Schema Summarization and automatic SPARQL query generation for Linked ...
LODeX: Schema Summarization and automatic SPARQL query generation for Linked ...
 
Linked Data Generation Process
Linked Data Generation ProcessLinked Data Generation Process
Linked Data Generation Process
 
What is Hydra?
What is Hydra?What is Hydra?
What is Hydra?
 
Lisp Macros in 20 Minutes (Featuring Clojure)
Lisp Macros in 20 Minutes (Featuring Clojure)Lisp Macros in 20 Minutes (Featuring Clojure)
Lisp Macros in 20 Minutes (Featuring Clojure)
 
HTTP and Your Angry Dog
HTTP and Your Angry DogHTTP and Your Angry Dog
HTTP and Your Angry Dog
 
A Web of Things to Reduce Energy Wastage
A Web of Things to Reduce Energy WastageA Web of Things to Reduce Energy Wastage
A Web of Things to Reduce Energy Wastage
 
Rest and the hypermedia constraint
Rest and the hypermedia constraintRest and the hypermedia constraint
Rest and the hypermedia constraint
 
The Web 3.0 is just around the corner. Be prepared!
The Web 3.0 is just around the corner. Be prepared!The Web 3.0 is just around the corner. Be prepared!
The Web 3.0 is just around the corner. Be prepared!
 
Exploiting the query structure for efficient join ordering in SPARQL queries
Exploiting the query structure for efficient join ordering in SPARQL queriesExploiting the query structure for efficient join ordering in SPARQL queries
Exploiting the query structure for efficient join ordering in SPARQL queries
 

Ähnlich wie Initial Usage Analysis of DBpedia's Triple Pattern Fragments

Semantic Web Servers
Semantic Web ServersSemantic Web Servers
Semantic Web Serverswebhostingguy
 
SemTech 2010: Pelorus Platform
SemTech 2010: Pelorus PlatformSemTech 2010: Pelorus Platform
SemTech 2010: Pelorus PlatformClark & Parsia LLC
 
Explaining The Semantic Web
Explaining The Semantic WebExplaining The Semantic Web
Explaining The Semantic WebAditya Tuli
 
Spivack Blogtalk 2008
Spivack Blogtalk 2008Spivack Blogtalk 2008
Spivack Blogtalk 2008Blogtalk 2008
 
Sigma EE: Reaping low-hanging fruits in RDF-based data integration
Sigma EE: Reaping low-hanging fruits in RDF-based data integrationSigma EE: Reaping low-hanging fruits in RDF-based data integration
Sigma EE: Reaping low-hanging fruits in RDF-based data integrationRichard Cyganiak
 
GoodRelations & RDFa for Deep Comparison Shopping on a Web Scale
GoodRelations & RDFa for Deep Comparison Shopping on a Web ScaleGoodRelations & RDFa for Deep Comparison Shopping on a Web Scale
GoodRelations & RDFa for Deep Comparison Shopping on a Web ScaleMartin Hepp
 
Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011Juan Sequeda
 
apidays Australia 2023 - The Playful Bond Between REST And Data Streams, Warr...
apidays Australia 2023 - The Playful Bond Between REST And Data Streams, Warr...apidays Australia 2023 - The Playful Bond Between REST And Data Streams, Warr...
apidays Australia 2023 - The Playful Bond Between REST And Data Streams, Warr...apidays
 
The Playful Bond Between REST And Data Streams
The Playful Bond Between REST And Data StreamsThe Playful Bond Between REST And Data Streams
The Playful Bond Between REST And Data Streamsconfluent
 
Phalcon 2 High Performance APIs - DevWeekPOA 2015
Phalcon 2 High Performance APIs - DevWeekPOA 2015Phalcon 2 High Performance APIs - DevWeekPOA 2015
Phalcon 2 High Performance APIs - DevWeekPOA 2015Jackson F. de A. Mafra
 
Sem tech 2011 v8
Sem tech 2011 v8Sem tech 2011 v8
Sem tech 2011 v8dallemang
 
How Linked Data Can Speed Information Discovery
How Linked Data Can Speed Information DiscoveryHow Linked Data Can Speed Information Discovery
How Linked Data Can Speed Information DiscoveryAlex Meadows
 
Flickr Services
Flickr ServicesFlickr Services
Flickr Servicesroyans
 
Flickr Services
Flickr ServicesFlickr Services
Flickr Servicesroyans
 
Question Answering over Linked Data - Reasoning Issues
Question Answering over Linked Data - Reasoning IssuesQuestion Answering over Linked Data - Reasoning Issues
Question Answering over Linked Data - Reasoning IssuesMichael Petychakis
 
Things you must know on ruby on rails single page application
Things you must know on ruby on rails single page applicationThings you must know on ruby on rails single page application
Things you must know on ruby on rails single page applicationAndolasoft Inc
 
A sweet affordable combo for Linked Data Archives
A sweet affordable combo for Linked Data ArchivesA sweet affordable combo for Linked Data Archives
A sweet affordable combo for Linked Data ArchivesMiel Vander Sande
 

Ähnlich wie Initial Usage Analysis of DBpedia's Triple Pattern Fragments (20)

Semantic Web Servers
Semantic Web ServersSemantic Web Servers
Semantic Web Servers
 
SemTech 2010: Pelorus Platform
SemTech 2010: Pelorus PlatformSemTech 2010: Pelorus Platform
SemTech 2010: Pelorus Platform
 
Graphql
GraphqlGraphql
Graphql
 
Explaining The Semantic Web
Explaining The Semantic WebExplaining The Semantic Web
Explaining The Semantic Web
 
Spivack Blogtalk 2008
Spivack Blogtalk 2008Spivack Blogtalk 2008
Spivack Blogtalk 2008
 
Sigma EE: Reaping low-hanging fruits in RDF-based data integration
Sigma EE: Reaping low-hanging fruits in RDF-based data integrationSigma EE: Reaping low-hanging fruits in RDF-based data integration
Sigma EE: Reaping low-hanging fruits in RDF-based data integration
 
GoodRelations & RDFa for Deep Comparison Shopping on a Web Scale
GoodRelations & RDFa for Deep Comparison Shopping on a Web ScaleGoodRelations & RDFa for Deep Comparison Shopping on a Web Scale
GoodRelations & RDFa for Deep Comparison Shopping on a Web Scale
 
Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011Consuming Linked Data 4/5 Semtech2011
Consuming Linked Data 4/5 Semtech2011
 
apidays Australia 2023 - The Playful Bond Between REST And Data Streams, Warr...
apidays Australia 2023 - The Playful Bond Between REST And Data Streams, Warr...apidays Australia 2023 - The Playful Bond Between REST And Data Streams, Warr...
apidays Australia 2023 - The Playful Bond Between REST And Data Streams, Warr...
 
The Playful Bond Between REST And Data Streams
The Playful Bond Between REST And Data StreamsThe Playful Bond Between REST And Data Streams
The Playful Bond Between REST And Data Streams
 
Phalcon 2 High Performance APIs - DevWeekPOA 2015
Phalcon 2 High Performance APIs - DevWeekPOA 2015Phalcon 2 High Performance APIs - DevWeekPOA 2015
Phalcon 2 High Performance APIs - DevWeekPOA 2015
 
Sem tech 2011 v8
Sem tech 2011 v8Sem tech 2011 v8
Sem tech 2011 v8
 
How Linked Data Can Speed Information Discovery
How Linked Data Can Speed Information DiscoveryHow Linked Data Can Speed Information Discovery
How Linked Data Can Speed Information Discovery
 
Flickr Services
Flickr ServicesFlickr Services
Flickr Services
 
Flickr Services
Flickr ServicesFlickr Services
Flickr Services
 
Linked data and voyager
Linked data and voyagerLinked data and voyager
Linked data and voyager
 
Question Answering over Linked Data - Reasoning Issues
Question Answering over Linked Data - Reasoning IssuesQuestion Answering over Linked Data - Reasoning Issues
Question Answering over Linked Data - Reasoning Issues
 
Things you must know on ruby on rails single page application
Things you must know on ruby on rails single page applicationThings you must know on ruby on rails single page application
Things you must know on ruby on rails single page application
 
Semantic web
Semantic webSemantic web
Semantic web
 
A sweet affordable combo for Linked Data Archives
A sweet affordable combo for Linked Data ArchivesA sweet affordable combo for Linked Data Archives
A sweet affordable combo for Linked Data Archives
 

Initial Usage Analysis of DBpedia's Triple Pattern Fragments

  • 1. Initial Usage Analysis
 of DBpedia's
 Triple Pattern Fragments Ruben Verborgh,
 Erik Mannens & Rik Van de Walle
  • 2.
  • 3. What role will the Semantic Web play
 for the future generation? Will it be remotely as important
 as the Web now is to us?
  • 4. There used to be no applications
 because there was no data. Linked Data more of less solved
 this chicken-and-egg problem.
  • 5. There are no applications
 because data is not queryable. SPARQL endpoints are unreliable.
 Data dumps are not live.
  • 6. We analyzed DBpedia’s low-cost
 Triple Pattern Fragments interface
 between Nov 2014 and Feb 2015. Over 4M requests were made.
 There was 1 minute of downtime.
  • 7. Web interfaces to triples Four months of fragments Extending the analysis
  • 8. Web interfaces to triples Four months of fragments Extending the analysis
  • 9. Web interfaces act as gateways
 between clients and databases. Database Client Web interface The interface hides the database schema. The interface restricts the kind of queries.
  • 10. No sane Web developer or admin
 would give direct database access. Database Client Web interface The client must know the database schema. The client can ask any query.
  • 11. SPARQL endpoints happily give
 direct access to the database. Triple
 store Client SPARQL protocol The client must know the database schema. The client can ask any query.
  • 12. SPARQL interfaces are expensive,
 so we have an availability problem. There are few SPARQL endpoints
 because hosting them is expensive. Many of the endpoints that exist
 suffer from low availability. You already give data for free.
 Do you have to pay for query time as well?
  • 13. Data dumps allow you to set up
 your own private SPARQL endpoint. But then we no longer query the Web… No usage statistics whatsoever. Not everybody can do this:
 mobile devices, non-technical users, …
  • 14. The interface hides the database schema. The interface restricts the kind of queries. A Triple Pattern Fragments interface
 acts as a gateway to an RDF source. RDF
 source Client TPF
 interface
  • 15. A Triple Pattern Fragments interface
 acts as a gateway to an RDF source. Client can only ask ?s ?p ?o patterns. Decompose complex SPARQL queries
 on the client client-side. Low server cost, highly cacheable,
 but higher bandwidth and query time.
  • 16.
  • 17.
  • 18. Web interfaces to triples Four months of fragments Extending the analysis
  • 19. In mid-October 2014, we started
 an official TPF interface for DBpedia. Will this interface be used? How will clients use it? Will the availability be sufficient
 for live application usage?
  • 20. The server is deployed virtually, availability monitored externally. Amazon Elastic Compute Cloud
 c3.2xlarge machine (8 CPU, 15GB RAM) Compressed HDT format as backend Pingdom for analysis
  • 21. 4.5 million Triple Pattern Fragments
 of DBpedia 2014@en were requested.
  • 22. The TPF client library consumed most,
 followed by crawlers and Chrome.
  • 23. Turtle as content type is most popular,
 but being surpassed by TriG.
  • 24. Most requests come from Europe,
 the US and China are following.
  • 25. The “type” fragment was popular,
 but it’s hard to conclude anything.
  • 26. A quarter of all requests was cached
 (but we could cache everything).
  • 27. During four months,
 the API had 99.9994% availability. We deeply apologize for
 that one minute of downtime
 in November.
  • 28. Web interfaces to triples Four months of fragments Extending the analysis
  • 29. We don’t know exactly
 which clients executed queries. Was the TPF client used standalone? As a library of another application? Also hard for SPARQL endpoints.
  • 30. The analysis did not give insights
 in which queries clients executed. Good for privacy! We can try reconstructing SPARQL queries,
 but maybe clients did something else. We only know with SPARQL endpoints,
 not with data dumps or LD documents.
  • 31. We could learn from the human Web:
 can clients give explicit feedback? “This is the query I executed.
 It took me 5 seconds.” Potential source of insights,
 but clients need a gain. Will this be representative/truthful?
  • 32. Web interfaces to triples Four months of fragments Extending the analysis
  • 33. We have a >99.999% available API
 to the most popular RDF datasource. No more excuses not to build apps. So where are they? Is something else holding us back?
  • 34. We need to think differently
 on how to build Linked Data apps. The paradigm of querying a database
 and waiting for the results
 does not scale to the Web. Live data requires new interfaces
 and new visualizations.
  • 35. We need developers to build bridges
 from data to end users. Now that the chicken-and-egg problem
 and the availability problems are solved,
 we need to tackle fundamental questions. Where are the killer apps
 the next generation is waiting for?
  • 36. Initial Usage Analysis
 of DBpedia's
 Triple Pattern Fragments @RubenVerborgh
 ruben.verborgh.org
 linkeddatafragments.org