SlideShare ist ein Scribd-Unternehmen logo
1 von 11
The data behind the HuisKluis
Christophe Guéret (@cgueret)
Researcher
PiLOD closing event
July 3rd
2013
DANS is een instituut van KNAW en NWO
Getting the open data in the app.
BAG
Top10
Photo
archive
CBS
data
https://data.overheid.nl/ http://huiskluis.nl
?
90's style: server side black box
● Server side data store and page rendering
BAG
Top10
Photo
archive
CBS
data
Big DB
Data integration process
(creation of links)
Resource intensive: server load, data integration
Site engine Web browser
ClientServer
Modern: Use APIs and Javascript
● Split the data from the application
BAG
Top10
Photo
archive
CBS
data
Big DB Site engine Web browser
ClientServer
Soon: Full Web applications
● Client side application that use only APIs
BAG
Top10
Photo
archive
CBS
data
Site engine Web browser
ClientServer
Why APIs are so cool...
● No need to get all the data into some local store
● Application send targeted information needs
– Leaflet: draw a map showing N 52° 22' 26''E 4° 53' 30''
– CBS: return the statistics for around 'Gravenstraat'
– BeeldBank: list the pictures about 'Nieuwe Kerk'
– BAG: get the contour of the home at 1012NL-17
– ...
The problem with APIs
● Some data sources do not have one
– Provide data dumps instead
● There are few standards for APIs
– Need to learn how to dialog with the API
● Do not tackle the data integration issue
– Need to know what to ask
APIs with Linked Data : links
● The links guide the client through the different
APIs to find more information
● Thanks to URIs, data integration can be
outsourced
BAG
Site engine
WOZ
BAG ↔
Wikidata
Tip: make a business
out of linking data
APIs with Linked Data : SPARQL
● SPARQL is a query language for RDF data
– In practice, a universal API that generate views over
the data
– Every data need = 1 SPARQL query
● Example: HuisKluis data need from the BAG
– “Give me the geolocation, the contour and the
construction year of the building at 1012NL-17
along with the name of the street it is on”
To use with http://data.resc.info/bag/sparql
PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX xsd: <http://www.w3.org/2001/XMLSchema#>
PREFIX vocab: <http://data.resc.info/bag/resource/vocab/>
SELECT ?straat ?bouwjaar ?lat ?lng ?polygon WHERE {
?huis <http://www.w3.org/ns/locn#postCode> "{POSTCODE}".
?huis <http://www.w3.org/ns/locn#locatorDesignator> "{NUMMER}"^^xsd:decimal.
?huis <http://www.w3.org/ns/locn#thoroughfare> ?straat.
?huis <http://www.w3.org/2006/vcard/ns#longitude> ?lng.
?huis <http://www.w3.org/2006/vcard/ns#latitude> ?lat.
?huis vocab:adres_adresseerbaarobject ?obj.
?obj vocab:verblijfsobjectactueel_pand ?pand.
?pand vocab:pandactueel_bouwjaar ?bouwjaar.
?pand vocab:geovlak ?polygon.
}
If you own a data set...
● Provide APIs, not only a data dump!
● Consider following the W3C standards for
publishing data on the Web
– Makes your data easier to use
– Facilitates doing external linking

Weitere ähnliche Inhalte

Was ist angesagt?

Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
BigData_Europe
 
Sys711 spatial-apps, presentation slide for Advanced Database Technology Subject
Sys711 spatial-apps, presentation slide for Advanced Database Technology SubjectSys711 spatial-apps, presentation slide for Advanced Database Technology Subject
Sys711 spatial-apps, presentation slide for Advanced Database Technology Subject
Syahman Mohamad
 

Was ist angesagt? (20)

Web mining
Web miningWeb mining
Web mining
 
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
Apache Big_Data Europe event: "Integrators at work! Real-life applications of...
 
Digitisation and Digital Humanities - what is the role of Libraries?
Digitisation and Digital Humanities - what is the role of Libraries?Digitisation and Digital Humanities - what is the role of Libraries?
Digitisation and Digital Humanities - what is the role of Libraries?
 
Öppen Upphandling short presentation 5.2020
Öppen Upphandling short presentation 5.2020Öppen Upphandling short presentation 5.2020
Öppen Upphandling short presentation 5.2020
 
Visualizing large datasets with elasticsearch and kibana
Visualizing large datasets with elasticsearch and kibanaVisualizing large datasets with elasticsearch and kibana
Visualizing large datasets with elasticsearch and kibana
 
.Net Distributed Caching
.Net Distributed Caching.Net Distributed Caching
.Net Distributed Caching
 
Multimodal Perspectives for Digitised Historical Newspapers
Multimodal Perspectives for Digitised Historical NewspapersMultimodal Perspectives for Digitised Historical Newspapers
Multimodal Perspectives for Digitised Historical Newspapers
 
Exposing the data from NARCIS with VIVO
Exposing the data from NARCIS with VIVOExposing the data from NARCIS with VIVO
Exposing the data from NARCIS with VIVO
 
Emerging Trends in Data Engineering
Emerging Trends in Data EngineeringEmerging Trends in Data Engineering
Emerging Trends in Data Engineering
 
Linked base Registries | The Scottish Government - Webinar 2017
Linked base Registries | The Scottish Government - Webinar 2017Linked base Registries | The Scottish Government - Webinar 2017
Linked base Registries | The Scottish Government - Webinar 2017
 
Exploring Graph Use Cases with JanusGraph
Exploring Graph Use Cases with JanusGraphExploring Graph Use Cases with JanusGraph
Exploring Graph Use Cases with JanusGraph
 
Active archives @SBB
Active archives @SBBActive archives @SBB
Active archives @SBB
 
SC5 Hangout2 pilot 1 description
SC5 Hangout2  pilot 1 descriptionSC5 Hangout2  pilot 1 description
SC5 Hangout2 pilot 1 description
 
Forestry Commission - Smart Collaboration - Esri UK Annual Conference 2017
Forestry Commission - Smart Collaboration - Esri UK Annual Conference 2017Forestry Commission - Smart Collaboration - Esri UK Annual Conference 2017
Forestry Commission - Smart Collaboration - Esri UK Annual Conference 2017
 
SC1 Workshop 2 Technical overview
SC1 Workshop 2 Technical overviewSC1 Workshop 2 Technical overview
SC1 Workshop 2 Technical overview
 
Sys711 spatial-apps, presentation slide for Advanced Database Technology Subject
Sys711 spatial-apps, presentation slide for Advanced Database Technology SubjectSys711 spatial-apps, presentation slide for Advanced Database Technology Subject
Sys711 spatial-apps, presentation slide for Advanced Database Technology Subject
 
2018 GIS in Government: Publishing BLM Data On the Web
2018 GIS in Government: Publishing BLM Data On the Web2018 GIS in Government: Publishing BLM Data On the Web
2018 GIS in Government: Publishing BLM Data On the Web
 
GeoSEO and Map Series - Discovery Integrated With Geographical Search in Map ...
GeoSEO and Map Series - Discovery Integrated With Geographical Search in Map ...GeoSEO and Map Series - Discovery Integrated With Geographical Search in Map ...
GeoSEO and Map Series - Discovery Integrated With Geographical Search in Map ...
 
( Big ) Data Management - Collect - Global concepts in 5 slides
( Big ) Data Management - Collect - Global concepts in 5 slides( Big ) Data Management - Collect - Global concepts in 5 slides
( Big ) Data Management - Collect - Global concepts in 5 slides
 
Nyc web perf-final-july-23
Nyc web perf-final-july-23Nyc web perf-final-july-23
Nyc web perf-final-july-23
 

Ähnlich wie The data behind the HuisKluis

ArangoDB – A different approach to NoSQL
ArangoDB – A different approach to NoSQLArangoDB – A different approach to NoSQL
ArangoDB – A different approach to NoSQL
ArangoDB Database
 
[Databeers] 06/05/2014 - Boris Villazon: “Data Integration - A Linked Data ap...
[Databeers] 06/05/2014 - Boris Villazon: “Data Integration - A Linked Data ap...[Databeers] 06/05/2014 - Boris Villazon: “Data Integration - A Linked Data ap...
[Databeers] 06/05/2014 - Boris Villazon: “Data Integration - A Linked Data ap...
Data Beers
 
SFScon22 - Francesco Corcoglioniti - Integrating Dynamically-Computed Data an...
SFScon22 - Francesco Corcoglioniti - Integrating Dynamically-Computed Data an...SFScon22 - Francesco Corcoglioniti - Integrating Dynamically-Computed Data an...
SFScon22 - Francesco Corcoglioniti - Integrating Dynamically-Computed Data an...
South Tyrol Free Software Conference
 
Wed roman tut_open_datapub
Wed roman tut_open_datapubWed roman tut_open_datapub
Wed roman tut_open_datapub
eswcsummerschool
 
Automatic Query-Centric API for Routine Access to Linked Data
Automatic Query-Centric API for Routine Access to Linked DataAutomatic Query-Centric API for Routine Access to Linked Data
Automatic Query-Centric API for Routine Access to Linked Data
Albert Meroño-Peñuela
 

Ähnlich wie The data behind the HuisKluis (20)

SC7 Workshop 2: Big Data Technologies and Scenarios
SC7 Workshop 2: Big Data Technologies and ScenariosSC7 Workshop 2: Big Data Technologies and Scenarios
SC7 Workshop 2: Big Data Technologies and Scenarios
 
Intro to Exhibit Workshop
Intro to Exhibit WorkshopIntro to Exhibit Workshop
Intro to Exhibit Workshop
 
Visualising statistical Linked Data with Plone
Visualising statistical Linked Data with PloneVisualising statistical Linked Data with Plone
Visualising statistical Linked Data with Plone
 
ArangoDB – A different approach to NoSQL
ArangoDB – A different approach to NoSQLArangoDB – A different approach to NoSQL
ArangoDB – A different approach to NoSQL
 
[Databeers] 06/05/2014 - Boris Villazon: “Data Integration - A Linked Data ap...
[Databeers] 06/05/2014 - Boris Villazon: “Data Integration - A Linked Data ap...[Databeers] 06/05/2014 - Boris Villazon: “Data Integration - A Linked Data ap...
[Databeers] 06/05/2014 - Boris Villazon: “Data Integration - A Linked Data ap...
 
SFScon22 - Francesco Corcoglioniti - Integrating Dynamically-Computed Data an...
SFScon22 - Francesco Corcoglioniti - Integrating Dynamically-Computed Data an...SFScon22 - Francesco Corcoglioniti - Integrating Dynamically-Computed Data an...
SFScon22 - Francesco Corcoglioniti - Integrating Dynamically-Computed Data an...
 
A BASILar Approach for Building Web APIs on top of SPARQL Endpoints
A BASILar Approach for Building Web APIs on top of SPARQL EndpointsA BASILar Approach for Building Web APIs on top of SPARQL Endpoints
A BASILar Approach for Building Web APIs on top of SPARQL Endpoints
 
Upstream consultancy and Ceph RadosGW/S3 - Javier Munhoz
Upstream consultancy and Ceph RadosGW/S3 - Javier MunhozUpstream consultancy and Ceph RadosGW/S3 - Javier Munhoz
Upstream consultancy and Ceph RadosGW/S3 - Javier Munhoz
 
Upstream Consultancy and Ceph RadosGW/S3 (AMTEGA Ceph Day 2018)
Upstream Consultancy and Ceph RadosGW/S3 (AMTEGA Ceph Day 2018)Upstream Consultancy and Ceph RadosGW/S3 (AMTEGA Ceph Day 2018)
Upstream Consultancy and Ceph RadosGW/S3 (AMTEGA Ceph Day 2018)
 
Wed roman tut_open_datapub
Wed roman tut_open_datapubWed roman tut_open_datapub
Wed roman tut_open_datapub
 
Linked data tooling XML
Linked data tooling XMLLinked data tooling XML
Linked data tooling XML
 
Linked data-tooling-xml
Linked data-tooling-xmlLinked data-tooling-xml
Linked data-tooling-xml
 
Automatic Query-Centric API for Routine Access to Linked Data
Automatic Query-Centric API for Routine Access to Linked DataAutomatic Query-Centric API for Routine Access to Linked Data
Automatic Query-Centric API for Routine Access to Linked Data
 
Lightning Talk: Why and How to Integrate MongoDB and NoSQL into Hadoop Big Da...
Lightning Talk: Why and How to Integrate MongoDB and NoSQL into Hadoop Big Da...Lightning Talk: Why and How to Integrate MongoDB and NoSQL into Hadoop Big Da...
Lightning Talk: Why and How to Integrate MongoDB and NoSQL into Hadoop Big Da...
 
Solutions for bi-directional integration between Oracle RDBMS and Apache Kafka
Solutions for bi-directional integration between Oracle RDBMS and Apache KafkaSolutions for bi-directional integration between Oracle RDBMS and Apache Kafka
Solutions for bi-directional integration between Oracle RDBMS and Apache Kafka
 
Simple APIs for Spatial Data (OGC API - Features)
Simple APIs for Spatial Data (OGC API - Features)Simple APIs for Spatial Data (OGC API - Features)
Simple APIs for Spatial Data (OGC API - Features)
 
Stop making tools! Nobody likes them anyway...
Stop making tools! Nobody likes them anyway...Stop making tools! Nobody likes them anyway...
Stop making tools! Nobody likes them anyway...
 
Lodstats: The Data Web Census Dataset. Kobe, Japan, 2016
Lodstats: The Data Web Census Dataset. Kobe, Japan, 2016Lodstats: The Data Web Census Dataset. Kobe, Japan, 2016
Lodstats: The Data Web Census Dataset. Kobe, Japan, 2016
 
Culture Geeks Feb talk: Adventures in Linked Data Land
Culture Geeks Feb talk: Adventures in Linked Data LandCulture Geeks Feb talk: Adventures in Linked Data Land
Culture Geeks Feb talk: Adventures in Linked Data Land
 
Offline-First Mobile Web Apps with PouchDB, IBM Cloudant, and IBM Bluemix
Offline-First Mobile Web Apps with PouchDB, IBM Cloudant, and IBM BluemixOffline-First Mobile Web Apps with PouchDB, IBM Cloudant, and IBM Bluemix
Offline-First Mobile Web Apps with PouchDB, IBM Cloudant, and IBM Bluemix
 

Mehr von Christophe Guéret

ICT4D course 2013 - Low resources infrastructure
ICT4D course 2013 - Low resources infrastructureICT4D course 2013 - Low resources infrastructure
ICT4D course 2013 - Low resources infrastructure
Christophe Guéret
 
ICT4D course 2013 - OLPC deployments
ICT4D course 2013 - OLPC deploymentsICT4D course 2013 - OLPC deployments
ICT4D course 2013 - OLPC deployments
Christophe Guéret
 

Mehr von Christophe Guéret (20)

HHAI June 2022 - KGs and Hybrid Intelligence
HHAI June 2022 - KGs and Hybrid IntelligenceHHAI June 2022 - KGs and Hybrid Intelligence
HHAI June 2022 - KGs and Hybrid Intelligence
 
Informal presentation about RES
Informal presentation about RESInformal presentation about RES
Informal presentation about RES
 
The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...
The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...
The Entity Registry System: Collaborative Editing of Entity Data in Poorly Co...
 
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
 
The Entity Registry System (ERS)
The Entity Registry System (ERS)The Entity Registry System (ERS)
The Entity Registry System (ERS)
 
Your next data viz gear should be a Wii-U
Your next data viz gear should be a Wii-UYour next data viz gear should be a Wii-U
Your next data viz gear should be a Wii-U
 
Linking knowledge spaces
Linking knowledge spacesLinking knowledge spaces
Linking knowledge spaces
 
Digital archiving 3.0
Digital archiving 3.0Digital archiving 3.0
Digital archiving 3.0
 
The road towards a Web-based data ecosystem
The road towards a Web-based data ecosystemThe road towards a Web-based data ecosystem
The road towards a Web-based data ecosystem
 
Linked Open Data for Digital Humanities
Linked Open Data for Digital HumanitiesLinked Open Data for Digital Humanities
Linked Open Data for Digital Humanities
 
Downscaling information systems for education
Downscaling information systems for educationDownscaling information systems for education
Downscaling information systems for education
 
ICT4D course 2013 - Low resources infrastructure
ICT4D course 2013 - Low resources infrastructureICT4D course 2013 - Low resources infrastructure
ICT4D course 2013 - Low resources infrastructure
 
ICT4D course 2013 - OLPC deployments
ICT4D course 2013 - OLPC deploymentsICT4D course 2013 - OLPC deployments
ICT4D course 2013 - OLPC deployments
 
ICT4D course 2013 - Sugar
ICT4D course 2013 - SugarICT4D course 2013 - Sugar
ICT4D course 2013 - Sugar
 
Clarifier le sens de vos données publiques avec le Web de données
Clarifier le sens de vos données publiques avec le Web de donnéesClarifier le sens de vos données publiques avec le Web de données
Clarifier le sens de vos données publiques avec le Web de données
 
Embedding young learners into the information society
Embedding young learners into the information societyEmbedding young learners into the information society
Embedding young learners into the information society
 
Is linked data something for me?
Is linked data something for me?Is linked data something for me?
Is linked data something for me?
 
Decentralised entity registry “WikiReg”
Decentralised entity registry “WikiReg”Decentralised entity registry “WikiReg”
Decentralised entity registry “WikiReg”
 
Evolutionary and Swarm Computing for scaling up the Semantic Web
Evolutionary and Swarm Computing for scaling up the Semantic WebEvolutionary and Swarm Computing for scaling up the Semantic Web
Evolutionary and Swarm Computing for scaling up the Semantic Web
 
Decentralised Open Data for World Citizens
Decentralised Open Data  for World CitizensDecentralised Open Data  for World Citizens
Decentralised Open Data for World Citizens
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Kürzlich hochgeladen (20)

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
A Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source MilvusA Beginners Guide to Building a RAG App Using Open Source Milvus
A Beginners Guide to Building a RAG App Using Open Source Milvus
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

The data behind the HuisKluis

  • 1. The data behind the HuisKluis Christophe Guéret (@cgueret) Researcher PiLOD closing event July 3rd 2013 DANS is een instituut van KNAW en NWO
  • 2. Getting the open data in the app. BAG Top10 Photo archive CBS data https://data.overheid.nl/ http://huiskluis.nl ?
  • 3. 90's style: server side black box ● Server side data store and page rendering BAG Top10 Photo archive CBS data Big DB Data integration process (creation of links) Resource intensive: server load, data integration Site engine Web browser ClientServer
  • 4. Modern: Use APIs and Javascript ● Split the data from the application BAG Top10 Photo archive CBS data Big DB Site engine Web browser ClientServer
  • 5. Soon: Full Web applications ● Client side application that use only APIs BAG Top10 Photo archive CBS data Site engine Web browser ClientServer
  • 6. Why APIs are so cool... ● No need to get all the data into some local store ● Application send targeted information needs – Leaflet: draw a map showing N 52° 22' 26''E 4° 53' 30'' – CBS: return the statistics for around 'Gravenstraat' – BeeldBank: list the pictures about 'Nieuwe Kerk' – BAG: get the contour of the home at 1012NL-17 – ...
  • 7. The problem with APIs ● Some data sources do not have one – Provide data dumps instead ● There are few standards for APIs – Need to learn how to dialog with the API ● Do not tackle the data integration issue – Need to know what to ask
  • 8. APIs with Linked Data : links ● The links guide the client through the different APIs to find more information ● Thanks to URIs, data integration can be outsourced BAG Site engine WOZ BAG ↔ Wikidata Tip: make a business out of linking data
  • 9. APIs with Linked Data : SPARQL ● SPARQL is a query language for RDF data – In practice, a universal API that generate views over the data – Every data need = 1 SPARQL query ● Example: HuisKluis data need from the BAG – “Give me the geolocation, the contour and the construction year of the building at 1012NL-17 along with the name of the street it is on”
  • 10. To use with http://data.resc.info/bag/sparql PREFIX rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#> PREFIX xsd: <http://www.w3.org/2001/XMLSchema#> PREFIX vocab: <http://data.resc.info/bag/resource/vocab/> SELECT ?straat ?bouwjaar ?lat ?lng ?polygon WHERE { ?huis <http://www.w3.org/ns/locn#postCode> "{POSTCODE}". ?huis <http://www.w3.org/ns/locn#locatorDesignator> "{NUMMER}"^^xsd:decimal. ?huis <http://www.w3.org/ns/locn#thoroughfare> ?straat. ?huis <http://www.w3.org/2006/vcard/ns#longitude> ?lng. ?huis <http://www.w3.org/2006/vcard/ns#latitude> ?lat. ?huis vocab:adres_adresseerbaarobject ?obj. ?obj vocab:verblijfsobjectactueel_pand ?pand. ?pand vocab:pandactueel_bouwjaar ?bouwjaar. ?pand vocab:geovlak ?polygon. }
  • 11. If you own a data set... ● Provide APIs, not only a data dump! ● Consider following the W3C standards for publishing data on the Web – Makes your data easier to use – Facilitates doing external linking
  • 12.
  • 13. 10 minutes overview ● Warning! Techy keywords ahead