KIT – University of the State of Baden-Wuerttemberg and
National Research Center of the Helmholtz Association
1) INSTITUTE AIFB, KARLSRUHE INSTITUTE OF TECHNOLOGY, GERMANY; 2) DERI, NATIONAL UNIVERSITY OF IRELAND, GALWAY
http://swse.deri.org/dyldo/
Observing Linked Data Dynamics
Tobias Käfer 1), Ahmed Abdelrahman 2), Patrick O’Byrne 2), Jürgen Umbrich 2), Aidan Hogan 2)
May 30, 2013
Extended Semantic Web Conference (ESWC 2013), Montpellier, France
Linked Data Dynamics
… more than the growth of the LOD-Cloud
Why you might care:
As a publisher: versioning, link maintenance
As a consumer: reasoning, hybrid Linked Data warehouses
Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed
Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013
May 30, 2013
The Dynamic Linked Data Observatory – Part of a Bigger Movement (Web Observatories)
“[…] in order to study the Web, you need to observe what happens on the Web. To do this, one has to study it every day to understand the dynamics of the Web and the interaction with technology, and what people do with it.”
Prof. Dame Wendy Hall, 2013, http://www.thehindu.com/sci-tech/internet/web-observatory-for-cybergazing/article4386613.ece
“[…] to create a distributed archive of data on the Web and its activity, and […] mechanisms and tools that will be able to explore its development in the past, to examine its present condition and to establish potential developments in the future.”
WebScience Trust: definition of a Web Observatory
The Dynamic Linked Data Observatory
Mission: To capture the dynamics of Linked Data
[Diagram: Billion Triple Challenge dataset of 2010 + LOD cloud; fixed URI list; the Linked Data Web]
Core part: combination of LOD/CKAN and BTC
220 example URIs from the datasets in the LOD cloud
220 top-PageRanked URIs from the BTC 2010 dataset
Crawled from there to get approx. 100k URIs (union of 10 crawls)
Weekly snapshots of a URI list derived from the LOD cloud and the 2010 Billion Triple Challenge dataset, chosen for coverage and variety.
[Timeline: one snapshot per week, from May 6, 2012 until today]
This presentation: Findings from the first half year of observation
Nominal size of a snapshot: 95,737 URIs (kernel) / 191,474 URIs (extended)
May to November 2012: 6 months, 29 (weekly) snapshots
Statistics on the underlying data:

Statistic                 Kernel                    Extended
Mean pay-level domains    573.6 ± 16.6              1,738.6 ± 218
Mean documents            68,996.9 ± 5,555.2        152,355.7 ± 2,356.3
Mean quadruples           16,001,671 ± 988,820      94,725,595 ± 10,279,806
Sum quadruples            464,048,460               2,747,042,282
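The mean and sum rows for quadruples can be cross-checked against each other, given the 29 snapshots stated on the slide. The following is a minimal Python sketch of that arithmetic; the figures are copied from the statistics above, while the variable and function names are illustrative and not part of the original work.

```python
# Cross-check: mean quadruples per snapshot = sum of quadruples / number of snapshots.
# Figures are copied from the slide; all names here are illustrative assumptions.
SNAPSHOTS = 29

sum_quads = {"kernel": 464_048_460, "extended": 2_747_042_282}
reported_mean = {"kernel": 16_001_671, "extended": 94_725_595}


def mean_quads(total: int, snapshots: int = SNAPSHOTS) -> float:
    """Arithmetic mean of quadruples per snapshot."""
    return total / snapshots


for name, total in sum_quads.items():
    computed = mean_quads(total)
    # The reported means differ from the computed ones by less than one quadruple.
    print(f"{name}: computed {computed:,.1f}, reported {reported_mean[name]:,}")
```

Both reported means agree with sum / 29 to within one quadruple, which suggests the means were taken over all 29 snapshots.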
Secret questions of a Linked Data geek
Call for observations on different levels of abstraction.
Levels of granularity: RDF graphs, documents, hosts (PLD)
Document-level dynamics: Life (Availability)…
[Histogram: x-axis: snapshots (0 to 25); y-axis: % documents of 87k *)]
Mean = 23.1 (~80%)
26% of URIs available in all snapshots
*) 86,696 RDF documents ever appeared in ≥ 1 kernel snapshot
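The availability figures above can be derived from a simple record of which snapshots each document appeared in. The following Python sketch shows one way to compute such a histogram, the mean, and the share of documents present in every snapshot. The input layout (`seen_in`, mapping a document URI to the set of snapshot indices in which it was retrieved) and the toy data are assumptions for illustration, not the observatory's actual data format.

```python
from collections import Counter

# Assumed input: document URI -> indices of the snapshots in which it was available.
Availability = dict[str, set[int]]


def availability_histogram(seen_in: Availability) -> Counter:
    """Map 'number of snapshots a document appeared in' to a document count."""
    return Counter(len(snaps) for snaps in seen_in.values())


def mean_availability(seen_in: Availability) -> float:
    """Mean number of snapshots per document."""
    return sum(len(snaps) for snaps in seen_in.values()) / len(seen_in)


def share_always_available(seen_in: Availability, total_snapshots: int) -> float:
    """Fraction of documents that appeared in every snapshot."""
    always = sum(1 for snaps in seen_in.values() if len(snaps) == total_snapshots)
    return always / len(seen_in)


# Toy example with 3 documents and 4 snapshots (hypothetical URIs).
toy = {
    "http://example.org/a": {0, 1, 2, 3},
    "http://example.org/b": {0, 2},
    "http://example.org/c": {3},
}
print(availability_histogram(toy))
print(mean_availability(toy))
print(share_always_available(toy, total_snapshots=4))

# Slide figure: a mean of 23.1 out of 29 snapshots is roughly 80%.
print(round(23.1 / 29, 2))
```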
Document-level dynamics: … and Death
Last heart-beat: overestimates death…
… and death certificate filed: underestimates death
[Plot label: HTTP 500 etc.]
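The slide does not spell out how the two estimates are computed, so the following Python sketch is only one possible reading, with every definition an assumption. "Last heart-beat" is taken to mean the last snapshot with an HTTP 200 response, and a document counts as dead if that is not the final snapshot (an overestimate, since it may come back). A "death certificate" is taken to mean an explicit HTTP 4xx/5xx answer in the final snapshot (an underestimate, since silent failures such as timeouts are not counted).

```python
from typing import Optional, Sequence

# Assumed input: one HTTP status code per snapshot for a document,
# or None if no response was received at all (e.g. a timeout).
Status = Optional[int]


def last_heartbeat(statuses: Sequence[Status]) -> Optional[int]:
    """Index of the last snapshot with a 200 response, or None if there is none."""
    for i in range(len(statuses) - 1, -1, -1):
        if statuses[i] == 200:
            return i
    return None


def dead_by_heartbeat(statuses: Sequence[Status]) -> bool:
    """Assumed upper estimate: no 200 response in the final snapshot."""
    return last_heartbeat(statuses) != len(statuses) - 1


def dead_by_certificate(statuses: Sequence[Status]) -> bool:
    """Assumed lower estimate: explicit 4xx/5xx answer in the final snapshot."""
    final = statuses[-1]  # assumes at least one snapshot
    return final is not None and 400 <= final < 600


# Hypothetical observation histories.
silent = [200, 200, None, None]   # stopped answering: heartbeat says dead
gone = [200, 404, 404]            # explicit error: both say dead
flaky = [200, None, 200]          # temporarily offline: both say alive
for history in (silent, gone, flaky):
    print(history, dead_by_heartbeat(history), dead_by_certificate(history))
```

Under these assumed definitions every "death certificate" case is also a "last heart-beat" case, so the two estimates bracket the true number of dead documents.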
Document-level dynamics: Changes
Document-level changes clustered by host (PLD)
[Plot axes: avg. # snapshots with changes in documents with changes; share of documents with changes on the host (PLD)]
Document-level changes per topic and party
Grouping domains by metadata from the LOD cloud and the DataHub
[Figure: the LOD cloud colour-coded by topic]
[Plot panels: LOD-cloud topic; party]
RDF-level dynamics: triples
Only 27.6% of the documents updated values for terms (i.e. one per triple) *
24% monotonic additions *
* given there are changes at all
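The change categories named above can be illustrated by comparing the triple sets of two consecutive snapshots of one document. The following Python sketch makes several assumptions: triples are plain tuples of strings, blank nodes are not treated specially, and the category names and helper functions are illustrative rather than the authors' implementation.

```python
# Assumed representation: a triple is a (subject, predicate, object) tuple of strings.
Triple = tuple[str, str, str]


def classify_change(old: set[Triple], new: set[Triple]) -> str:
    """Classify the difference between two versions of a document."""
    added = new - old
    removed = old - new
    if not added and not removed:
        return "unchanged"
    if not removed:
        return "additions only"  # a monotonic addition
    if not added:
        return "deletions only"
    return "additions and deletions"


def differs_in_one_term(a: Triple, b: Triple) -> bool:
    """True if exactly one of subject, predicate, object differs (a term update)."""
    return sum(x != y for x, y in zip(a, b)) == 1


# Hypothetical example data.
v1 = {("ex:doc", "dc:title", "Old title")}
v2 = {("ex:doc", "dc:title", "Old title"), ("ex:doc", "dc:creator", "ex:alice")}
v3 = {("ex:doc", "dc:title", "New title")}

print(classify_change(v1, v2))
print(classify_change(v1, v3))
print(differs_in_one_term(("ex:doc", "dc:title", "Old title"),
                          ("ex:doc", "dc:title", "New title")))
```

In this reading, a value update shows up as one removed and one added triple that differ in a single term, which is why the set-based classification alone reports it as "additions and deletions".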
RDF-level dynamics: terms
RDF-level dynamics: The most dynamic predicates
Indicating a timestamp
*) provenance time updated, and provenance time added respectively
Dynamics of the RDF link structure
Outward links from the kernel to other documents
Low-volume but constant stream of fresh outward links: sec.gov, identi.ca, zitgist.com, dbtropes.org, ontologycentral.com, freebase.com
New links in batches: bbc.co.uk, bnf.fr, dbpedia.org, linkedct.org, bio2rdf.org
Cf. Ntoulas et al. (2004): 25% new links each week (in a growing HTML data set)
Summary and Q&A
Analyses from the first half year
Data collection is continuing
Future work: more sources and analyses, results as RDF
We appreciate your feedback and speculations
What would you look for in the data?
Thanks for your attention
This presentation is CC BY SA – picture credits
Picture on title slide based on a picture by A. Sparrow, http://www.flickr.com/photos/49937157@N03/, CC BY 2.0
Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch, http://lod-cloud.net/, CC BY SA
Evolution: http://commons.wikimedia.org/wiki/File:Human_evolution_scheme.svg, CC BY SA
Death: http://commons.wikimedia.org/wiki/File:Death.jpg, CC BY SA 3.0
Seismogram: http://www.flickr.com/photos/brettneilson/2281403809/, CC BY
Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed,
Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013
May 30, 2013

Weitere ähnliche Inhalte

Was ist angesagt?

The Semantic Web – A Vision Come True, or Giving Up the Great Plan?
The Semantic Web – A Vision Come True, or Giving Up the Great Plan?The Semantic Web – A Vision Come True, or Giving Up the Great Plan?
The Semantic Web – A Vision Come True, or Giving Up the Great Plan?Martin Hepp
 
To the Rescue of the Orphans of Scholarly Communication
To the Rescue of the Orphans of Scholarly CommunicationTo the Rescue of the Orphans of Scholarly Communication
To the Rescue of the Orphans of Scholarly CommunicationMartin Klein
 
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...Werner Leyh
 
Adoption of the Linked Data Best Practices in Different Topical Domains
Adoption of the Linked Data Best Practices in Different Topical DomainsAdoption of the Linked Data Best Practices in Different Topical Domains
Adoption of the Linked Data Best Practices in Different Topical DomainsChris Bizer
 
Webtracks at JISC Managing Research Data Meeting
Webtracks at JISC Managing Research Data MeetingWebtracks at JISC Managing Research Data Meeting
Webtracks at JISC Managing Research Data MeetingCameron Neylon
 
Flagis linked open_data_stijn_goedertier
Flagis linked open_data_stijn_goedertierFlagis linked open_data_stijn_goedertier
Flagis linked open_data_stijn_goedertierFlagis VZW
 
20180226 data driven smart governance
20180226 data driven smart governance20180226 data driven smart governance
20180226 data driven smart governanceDongpo Deng
 
The methods and practices of Linked Open Data
The methods and practices of Linked Open DataThe methods and practices of Linked Open Data
The methods and practices of Linked Open DataDongpo Deng
 
Answers to usual issues in getting started with consuming Linked Data
Answers to usual issues in getting started with consuming Linked DataAnswers to usual issues in getting started with consuming Linked Data
Answers to usual issues in getting started with consuming Linked DataOlaf Hartig
 
Linked Data and Services
Linked Data and ServicesLinked Data and Services
Linked Data and ServicesBarry Norton
 
Open Data - The Fingal Perspective
Open Data - The Fingal PerspectiveOpen Data - The Fingal Perspective
Open Data - The Fingal PerspectiveFingal Open Data
 
OpenDataHK Meetup 13 June 2013 What is Open Data?
OpenDataHK Meetup 13 June 2013 What is Open Data? OpenDataHK Meetup 13 June 2013 What is Open Data?
OpenDataHK Meetup 13 June 2013 What is Open Data? Mr. Bill Proudfit
 
Nanopublications and Decentralized Publishing
Nanopublications and Decentralized PublishingNanopublications and Decentralized Publishing
Nanopublications and Decentralized PublishingTobias Kuhn
 
Impact of URI Canonicalization on Memento Count
Impact of URI Canonicalization on Memento Count Impact of URI Canonicalization on Memento Count
Impact of URI Canonicalization on Memento Count Mat Kelly
 
Mining the Web of Linked Data with RapidMiner
Mining the Web of Linked Data with RapidMinerMining the Web of Linked Data with RapidMiner
Mining the Web of Linked Data with RapidMinerHeiko Paulheim
 
Data mining on social networks for students learning experiences
Data mining on social networks for students learning experiences Data mining on social networks for students learning experiences
Data mining on social networks for students learning experiences Biplab Debnath
 
Integrating Covid-19 Bioassays in the Open Research Knowledge Graph
Integrating Covid-19 Bioassays in the Open Research Knowledge GraphIntegrating Covid-19 Bioassays in the Open Research Knowledge Graph
Integrating Covid-19 Bioassays in the Open Research Knowledge GraphJennifer D'Souza
 

Was ist angesagt? (20)

The Web We Want
The Web We WantThe Web We Want
The Web We Want
 
The Semantic Web – A Vision Come True, or Giving Up the Great Plan?
The Semantic Web – A Vision Come True, or Giving Up the Great Plan?The Semantic Web – A Vision Come True, or Giving Up the Great Plan?
The Semantic Web – A Vision Come True, or Giving Up the Great Plan?
 
To the Rescue of the Orphans of Scholarly Communication
To the Rescue of the Orphans of Scholarly CommunicationTo the Rescue of the Orphans of Scholarly Communication
To the Rescue of the Orphans of Scholarly Communication
 
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
Interlinking Standardized OpenStreetMap Data and Citizen Science Data in the ...
 
Adoption of the Linked Data Best Practices in Different Topical Domains
Adoption of the Linked Data Best Practices in Different Topical DomainsAdoption of the Linked Data Best Practices in Different Topical Domains
Adoption of the Linked Data Best Practices in Different Topical Domains
 
Webtracks at JISC Managing Research Data Meeting
Webtracks at JISC Managing Research Data MeetingWebtracks at JISC Managing Research Data Meeting
Webtracks at JISC Managing Research Data Meeting
 
PID Signposting Pattern
PID Signposting PatternPID Signposting Pattern
PID Signposting Pattern
 
Flagis linked open_data_stijn_goedertier
Flagis linked open_data_stijn_goedertierFlagis linked open_data_stijn_goedertier
Flagis linked open_data_stijn_goedertier
 
20180226 data driven smart governance
20180226 data driven smart governance20180226 data driven smart governance
20180226 data driven smart governance
 
The methods and practices of Linked Open Data
The methods and practices of Linked Open DataThe methods and practices of Linked Open Data
The methods and practices of Linked Open Data
 
Linking Open Data
Linking Open DataLinking Open Data
Linking Open Data
 
Answers to usual issues in getting started with consuming Linked Data
Answers to usual issues in getting started with consuming Linked DataAnswers to usual issues in getting started with consuming Linked Data
Answers to usual issues in getting started with consuming Linked Data
 
Linked Data and Services
Linked Data and ServicesLinked Data and Services
Linked Data and Services
 
Open Data - The Fingal Perspective
Open Data - The Fingal PerspectiveOpen Data - The Fingal Perspective
Open Data - The Fingal Perspective
 
OpenDataHK Meetup 13 June 2013 What is Open Data?
OpenDataHK Meetup 13 June 2013 What is Open Data? OpenDataHK Meetup 13 June 2013 What is Open Data?
OpenDataHK Meetup 13 June 2013 What is Open Data?
 
Nanopublications and Decentralized Publishing
Nanopublications and Decentralized PublishingNanopublications and Decentralized Publishing
Nanopublications and Decentralized Publishing
 
Impact of URI Canonicalization on Memento Count
Impact of URI Canonicalization on Memento Count Impact of URI Canonicalization on Memento Count
Impact of URI Canonicalization on Memento Count
 
Mining the Web of Linked Data with RapidMiner
Mining the Web of Linked Data with RapidMinerMining the Web of Linked Data with RapidMiner
Mining the Web of Linked Data with RapidMiner
 
Data mining on social networks for students learning experiences
Data mining on social networks for students learning experiences Data mining on social networks for students learning experiences
Data mining on social networks for students learning experiences
 
Integrating Covid-19 Bioassays in the Open Research Knowledge Graph
Integrating Covid-19 Bioassays in the Open Research Knowledge GraphIntegrating Covid-19 Bioassays in the Open Research Knowledge Graph
Integrating Covid-19 Bioassays in the Open Research Knowledge Graph
 

Ähnlich wie Observing Linked Data Dynamics

From Open Linked Data towards an Ecosystem of Interlinked Knowledge
From Open Linked Data towards an Ecosystem of Interlinked KnowledgeFrom Open Linked Data towards an Ecosystem of Interlinked Knowledge
From Open Linked Data towards an Ecosystem of Interlinked KnowledgeSören Auer
 
Jesse Xiao at CODATA2017: Updates to the GigaDB open access data publishing p...
Jesse Xiao at CODATA2017: Updates to the GigaDB open access data publishing p...Jesse Xiao at CODATA2017: Updates to the GigaDB open access data publishing p...
Jesse Xiao at CODATA2017: Updates to the GigaDB open access data publishing p...GigaScience, BGI Hong Kong
 
KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013Stefan Dietze
 
Linked Data Tutorial (Florianópolis)
Linked Data Tutorial (Florianópolis)Linked Data Tutorial (Florianópolis)
Linked Data Tutorial (Florianópolis)Oscar Corcho
 
Web at 25 - Ontos Linked Open Data
Web at 25 - Ontos Linked Open DataWeb at 25 - Ontos Linked Open Data
Web at 25 - Ontos Linked Open DataAI4BD GmbH
 
Dataset Sources Repositories.pptx
Dataset Sources Repositories.pptxDataset Sources Repositories.pptx
Dataset Sources Repositories.pptxmantatheralyasriy
 
Dynamic Data Center concept
Dynamic Data Center concept  Dynamic Data Center concept
Dynamic Data Center concept Miha Ahronovitz
 
Data accessibility and the role of informatics in predicting the biosphere
Data accessibility and the role of informatics in predicting the biosphereData accessibility and the role of informatics in predicting the biosphere
Data accessibility and the role of informatics in predicting the biosphereAlex Hardisty
 
Linked Data Overview - AGI Technical SIG
Linked Data Overview - AGI Technical SIGLinked Data Overview - AGI Technical SIG
Linked Data Overview - AGI Technical SIGChris Ewing
 
Visualizing linkeddata aall2012d-ss
Visualizing linkeddata aall2012d-ssVisualizing linkeddata aall2012d-ss
Visualizing linkeddata aall2012d-ssF. Tim Knight
 
Experiences as a producer, consumer and observer of open data
Experiences as a producer, consumer and observer of open dataExperiences as a producer, consumer and observer of open data
Experiences as a producer, consumer and observer of open dataProgCity
 
Modeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVModeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVEUDAT
 
KESW2012 Hackathon St Petersburg
KESW2012 Hackathon St PetersburgKESW2012 Hackathon St Petersburg
KESW2012 Hackathon St PetersburgAI4BD GmbH
 
The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?Anna Fensel
 
Internet2 Support for Biomedical Research
Internet2 Support for Biomedical ResearchInternet2 Support for Biomedical Research
Internet2 Support for Biomedical ResearchEd Dodds
 

Ähnlich wie Observing Linked Data Dynamics (20)

Cornell 2011 05-13
Cornell 2011 05-13Cornell 2011 05-13
Cornell 2011 05-13
 
Ciard Initiative and a Global Infrastructure for Linked Open Data
Ciard Initiative and a Global Infrastructure for Linked Open Data Ciard Initiative and a Global Infrastructure for Linked Open Data
Ciard Initiative and a Global Infrastructure for Linked Open Data
 
From Open Linked Data towards an Ecosystem of Interlinked Knowledge
From Open Linked Data towards an Ecosystem of Interlinked KnowledgeFrom Open Linked Data towards an Ecosystem of Interlinked Knowledge
From Open Linked Data towards an Ecosystem of Interlinked Knowledge
 
Jesse Xiao at CODATA2017: Updates to the GigaDB open access data publishing p...
Jesse Xiao at CODATA2017: Updates to the GigaDB open access data publishing p...Jesse Xiao at CODATA2017: Updates to the GigaDB open access data publishing p...
Jesse Xiao at CODATA2017: Updates to the GigaDB open access data publishing p...
 
KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013KnowEscape workshop, OKCon 2013
KnowEscape workshop, OKCon 2013
 
Linked Data Tutorial (Florianópolis)
Linked Data Tutorial (Florianópolis)Linked Data Tutorial (Florianópolis)
Linked Data Tutorial (Florianópolis)
 
Web at 25 - Ontos Linked Open Data
Web at 25 - Ontos Linked Open DataWeb at 25 - Ontos Linked Open Data
Web at 25 - Ontos Linked Open Data
 
Dataset Sources Repositories.pptx
Dataset Sources Repositories.pptxDataset Sources Repositories.pptx
Dataset Sources Repositories.pptx
 
Open Science - Global Perspectives/Simon Hodson
Open Science - Global Perspectives/Simon HodsonOpen Science - Global Perspectives/Simon Hodson
Open Science - Global Perspectives/Simon Hodson
 
Dynamic Data Center concept
Dynamic Data Center concept  Dynamic Data Center concept
Dynamic Data Center concept
 
Data accessibility and the role of informatics in predicting the biosphere
Data accessibility and the role of informatics in predicting the biosphereData accessibility and the role of informatics in predicting the biosphere
Data accessibility and the role of informatics in predicting the biosphere
 
Linked Data Overview - AGI Technical SIG
Linked Data Overview - AGI Technical SIGLinked Data Overview - AGI Technical SIG
Linked Data Overview - AGI Technical SIG
 
Visualizing linkeddata aall2012d-ss
Visualizing linkeddata aall2012d-ssVisualizing linkeddata aall2012d-ss
Visualizing linkeddata aall2012d-ss
 
Experiences as a producer, consumer and observer of open data
Experiences as a producer, consumer and observer of open dataExperiences as a producer, consumer and observer of open data
Experiences as a producer, consumer and observer of open data
 
LOD2 webinar series: Virtuoso by OpenLink Software
LOD2 webinar series: Virtuoso by OpenLink SoftwareLOD2 webinar series: Virtuoso by OpenLink Software
LOD2 webinar series: Virtuoso by OpenLink Software
 
Modeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROVModeling Data Life Cycles with PROV
Modeling Data Life Cycles with PROV
 
LOD2 Webinar Series FOX
LOD2 Webinar Series FOXLOD2 Webinar Series FOX
LOD2 Webinar Series FOX
 
KESW2012 Hackathon St Petersburg
KESW2012 Hackathon St PetersburgKESW2012 Hackathon St Petersburg
KESW2012 Hackathon St Petersburg
 
The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?The Semantic Web Exists. What Next?
The Semantic Web Exists. What Next?
 
Internet2 Support for Biomedical Research
Internet2 Support for Biomedical ResearchInternet2 Support for Biomedical Research
Internet2 Support for Biomedical Research
 

Kürzlich hochgeladen

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Bhuvaneswari Subramani
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontologyjohnbeverley2021
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxRemote DBA Services
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 

Kürzlich hochgeladen (20)

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​

Observing Linked Data Dynamics

  • 1. Observing Linked Data Dynamics. Tobias Käfer¹, Ahmed Abdelrahman², Patrick O’Byrne², Jürgen Umbrich², Aidan Hogan². 1) Institute AIFB, Karlsruhe Institute of Technology, Germany; 2) DERI, National University of Ireland, Galway. May 30, 2013, Extended Semantic Web Conference (ESWC 2013), Montpellier, France. http://swse.deri.org/dyldo/ KIT – University of the State of Baden-Wuerttemberg and National Research Center of the Helmholtz Association
  • 2. Linked Data Dynamics … more than the growth of the LOD-Cloud. Why you might care: As a publisher: versioning, link maintenance. As a consumer: reasoning, hybrid Linked Data warehouses. Observing Linked Data Dynamics // TOBIAS KÄFER, Ahmed Abdelgayed, Patrick O'Byrne, Jürgen Umbrich, Aidan Hogan // ESWC 2013 May 30, 2013
  • 3. The Dynamic Linked Data Observatory – Part of a Bigger Movement (Web Observatories). “[…] in order to study the Web, you need to observe what happens on the Web. To do this, one has to study it every day to understand the dynamics of the Web and the interaction with technology, and what people do with it.” “[…] to create a distributed archive of data on the Web and its activity, and […] mechanisms and tools that will be able to explore its development in the past, to examine its present condition and to establish potential developments in the future.” Sources: Prof. Dame Wendy Hall, 2013, http://www.thehindu.com/sci-tech/internet/web-observatory-for-cybergazing/article4386613.ece; WebScience Trust: definition of a Web Observatory
  • 4. Mission: To capture the dynamics of Linked Data. Diagram labels: The Linked Data Web; Billion Triple Challenge Dataset of 2010 + LOD cloud; Fixed URI list; The Dynamic Linked Data Observatory
  • 5. Mission: To capture the dynamics of Linked Data. Diagram labels: The Linked Data Web; Billion Triple Challenge Dataset of 2010 + LOD cloud; Fixed URI list; The Dynamic Linked Data Observatory. Core part: combination of LOD/CKAN and BTC: 220 example URIs from the data sets in the LOD cloud; 220 top PageRanked URIs from the BTC 2010 dataset; crawled from there to get approx. 100k URIs (union of 10 crawls)
  • 6. Mission: To capture the dynamics of Linked Data. Weekly snapshots of a URI list derived from the LOD cloud and 2010's Billion Triple Challenge dataset, chosen for coverage and variety. Diagram labels: The Linked Data Web; Billion Triple Challenge Dataset of 2010 + LOD cloud; Fixed URI list; The Dynamic Linked Data Observatory. Timeline labels: May 6, 2012; today; 1 week
  • 7. Statistics on the data basis. Nominal size of a snapshot: 95,737 (Kernel) / 191,474 URIs (Extended). May to November 2012: 6 months, 29 (weekly) snapshots. This presentation: findings from the first half year of observation.
      Statistic                 Kernel                  Extended
      Mean pay-level domains    573.6 ± 16.6            1,738.6 ± 218
      Mean documents            68,996.9 ± 5,555.2      152,355.7 ± 2,356.3
      Mean quadruples           16,001,671 ± 988,820    94,725,595 ± 10,279,806
      Sum quadruples            464,048,460             2,747,042,282
  • 8. Secret questions of a Linked Data geek. Call for observations on different levels of abstraction (granularity): RDF graphs; documents; hosts (PLD)
  • 9. Document-level dynamics: Life (Availability)… Figure: % documents of 87k *) against snapshots. Mean = 23.1 (~80%). 26% URIs available in all snapshots. *) 86,696 RDF documents ever appeared in ≥1 kernel snapshot
  • 10. Document-level dynamics: … and Death. Last Heart-Beat: overestimates death … and death certificate filled: underestimates death. Figure label: HTTP 500 etc.
  • 11. Document-level dynamics: Changes
  • 12. Document-level changes clustered by host (PLD). Figure axes: avg. # snapshots with changes in documents with changes; share of documents with changes on the host (PLD)
  • 13. Document-level changes per topic and party. Grouping domains by metadata from the LOD cloud and the DataHub. Figure: the LOD cloud colour-coded by topic. Figure labels: LOD-cloud topic; party
  • 14. RDF-level dynamics: triples. Only 27.6% of the documents updated values for terms (i.e. one per triple). 24% monotonic additions *). *) given there are changes at all
  • 15. RDF-level dynamics: terms
  • 16. RDF-level dynamics: The most dynamic predicates. Indicating a timestamp *). *) provenance time updated, and provenance time added respectively
  • 17. Dynamics of the RDF link structure. Outward links from the kernel to other documents. Low-volume but constant stream of fresh outward links: sec.gov, identi.ca, zitgist.com, dbtropes.org, ontologycentral.com, freebase.com. New links in batches: bbc.co.uk, bnf.fr, dbpedia.org, linkedct.org, bio2rdf.org. Cf. Ntoulas et al. (2004): 25% new links each week (in a growing HTML data set)
  • 18. Summary and Q&A. Analyses from first half year. Data collection is continuing. Future work: more sources & analyses, results as RDF. We appreciate your feedback and speculations. What would you look for in the data? Thanks for your attention. Figure: % documents of the 87k against snapshots. http://swse.deri.org/dyldo/
  • 19. This presentation is CC BY SA – picture credits. Picture on title slide based on a picture by A. Sparrow, http://www.flickr.com/photos/49937157@N03/, CC BY 2.0; Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch, http://lod-cloud.net/, CC BY SA; Evolution, http://commons.wikimedia.org/wiki/File:Human_evolution_scheme.svg, CC BY SA; Death, http://commons.wikimedia.org/wiki/File:Death.jpg, CC BY SA 3.0; Seismogram, http://www.flickr.com/photos/brettneilson/2281403809/, CC BY
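
The document-level measures on slides 9 and 14 (availability across snapshots, and whether a change only adds triples) can be illustrated with a minimal Python sketch. The slides do not say how these values were computed, so everything here is an assumption: the function names `availability` and `classify_change`, the representation of a snapshot as a set of triples, and the plain set difference used to compare two snapshots are all hypothetical.

```python
from typing import Dict, List, Set, Tuple

# A triple as (subject, predicate, object); an assumed, simplified representation.
Triple = Tuple[str, str, str]


def availability(seen: Dict[str, List[bool]]) -> Dict[str, float]:
    """Fraction of snapshots in which each document was successfully retrieved."""
    return {uri: sum(flags) / len(flags) for uri, flags in seen.items()}


def classify_change(old: Set[Triple], new: Set[Triple]) -> str:
    """Classify the change between two consecutive snapshots of one document."""
    added = new - old      # triples present only in the newer snapshot
    removed = old - new    # triples present only in the older snapshot
    if not added and not removed:
        return "unchanged"
    if added and not removed:
        return "monotonic addition"
    if removed and not added:
        return "monotonic deletion"
    return "mixed"


# Slide 9 arithmetic: a mean of 23.1 out of 29 weekly snapshots is roughly 80%.
mean_share = 23.1 / 29

# Hypothetical example data, not taken from the observatory.
week1 = {("ex:a", "ex:p", "1")}
week2 = {("ex:a", "ex:p", "1"), ("ex:a", "ex:q", "2")}
kind = classify_change(week1, week2)  # only an addition happened
```

Treating each snapshot as a set makes the comparison independent of triple order within a document; handling of blank nodes, which a real comparison would need, is left out of this sketch.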