SlideShare ist ein Scribd-Unternehmen logo
1 von 40
Downloaden Sie, um offline zu lesen
Sebastian Hellmann
http://dbpedia.org
1
DBpedia and the Custody of Linked Open Data
https://tinyurl.com/dbpedia-kedl-2019
Outline
DBpedia History and Challenges
● The beginning phase (2007 - 2011)
● The manifestation phase (2011 - 2016)
● Glass ceiling phase (2016 - 2019)
● Lessons learned
The future brings solutions
● Skyrocketing phase (2020 - 2025)
○ Databus
○ Knowledge Library
○ FlexiFusion
https://tinyurl.com/dbpedia-kedl-2019
Beginning Phase of DBpedia 2007-2011
2007 - First Extraction of Wikipedia’s Infoboxes
● https://en.wikipedia.org/wiki/National_Digital_Library_of_India
● (en) https:/dbpedia.org/resource/National_Digital_Library_of_India
● (new, global) https://global.dbpedia.org/?s=https%3A%2F%2Fen.wikipedia.org%2Fwiki%2FNational_Digital_Library_of_India
● (new, global) https://global.dbpedia.org/id/3FdBE
https://tinyurl.com/dbpedia-kedl-2019
Beginning Phase of DBpedia 2007-2011
2007 - First Extraction of Wikipedia’s Infoboxes
Query Wikipedia Like a Database (query working for 12 years)
soccer players, who are born in a country with more than 10 million inhabitants, who played as goalkeeper for a club that has a stadium
with more than 30.000 seats and the club country is different from the birth country
Starting members:
● Uni Leipzig, Open Link Software, FU Berlin (affiliation change of key persons)
https://tinyurl.com/dbpedia-kedl-2019
Beginning Phase of DBpedia 2007-2011
Exceptional boost of research and industrial innovation
● DBpedia Dataset became the foundation for around 25000 scientific papers
● High industry adoption: BBC, New York Times, Yahoo, Watson (Jeopardy)
● Emergence of semantic technologies:
○ Knowledge Extraction
○ Entity Linking (Databases)
○ Entity Linking (Natural Language Processing) - DBpedia Spotlight
○ Graph Databases
https://tinyurl.com/dbpedia-kedl-2019
Beginning Phase of DBpedia 2007-2011
LOD = Linked Open Data
See full slides here with history:
https://slidewiki.org/presentation/661/_/661/5475-3/#/slide-5475-3
Recent: http://lod-cloud.net
https://tinyurl.com/dbpedia-kedl-2019
Beginning Phase of DBpedia 2007-2011
DBpedia shaped Linked Data Best Practices
1. Use URIs as names for things
2. Use HTTP URIs so that people can look up those names.
3. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL)
4. Include links to other URIs. so that they can discover more things. [Identifier Linking Paradigm]
https://tinyurl.com/dbpedia-kedl-2019
Manifestation phase 2011-2016
Growth phase
● Internationalization
○ Covering all 140 Wikipedia languages
○ 22 language chapters, worldwide DBpedia embassies and institutional collaborators
● http://mappings.dbpedia.org
○ ~300 editors
○ DBpedia Ontology incl. links to other ontologies
○ Data cleaning rules
● Data extensions such as Wikimedia Commons, Wikidata, NIF
○ 14 Billion statements
● Ontology contributions YAGO, SUMO, Umbel, LHD, DBTax
○ 8 taxonomies to query DBpedia
https://tinyurl.com/dbpedia-kedl-2019
Manifestation phase 2011-2015
Foundation of the DBpedia Association in 2014:
● Reliable core of organisational supporters for sustainability and network multiplication
● A network around a non-profit coordinator (DBpedia Association, Primus inter pares)
https://tinyurl.com/dbpedia-kedl-2019
Glass ceiling phase 2016 - 2019
Glass ceiling is a term from gender inequality
Appropriate metaphor:
● Women have to work harder than men for their career
● Pushing harder yields less and less results
https://wiki.ubc.ca/Glass_Ceiling
https://tinyurl.com/dbpedia-kedl-2019
Glass ceiling phase 2016 - 2019
Glass ceiling is a term from gender inequality
Appropriate metaphor:
● Women have to work harder than men for their career
● Pushing harder yields less and less results
Totally applies to all data projects
https://wiki.ubc.ca/Glass_Ceiling
https://tinyurl.com/dbpedia-kedl-2019
Glass ceiling phase 2016 - 2019
What did DBpedia do?
● We formed a think tank with our members
and community (engineers and technology
leaders) and discussed and re-designed
Results:
● Open data needs a scalable business model
● Identified problems of decentralization and
innovated to provide solutions to boost
decentralized approaches (LOD)
Only coordinated decentralisation is able to
break the glass ceiling
What happened in the world?
● Fierce competition over editorial workforce
○ Orcid
○ Wikidata
○ Microsoft Academic
○ Google
○ Thousands more
● Identifier wars (Identifier Linking Paradigm)
○ everybody wants you to include their IDs
in order to gain attention and workforce
● Plethora of data graveyards (project end,
running out of funding)
https://tinyurl.com/dbpedia-kedl-2019
Foundations
ALIGNED: Aligning Software & Data Engineering 2015 - 2018
http://aligned-project.eu
Engineering Agile Big-Data Systems
defines three dimensions to evaluate systems:
● productivity
● quality
● agility
https://www.riverpublishers.com/book_details.php?book_id=659
https://tinyurl.com/dbpedia-kedl-2019
4 riders of the datacalypse
1. Law of Diminishing Returns
-> intrinsic to data quality, the glass ceiling
2. Copying data
-> Duplication of effort by copy
3. Non-collaboration of data publishers
-> Multiplication of effort by individual non-synced data sets
4. Network Disaster of Linking / Mapping
-> Multiplication of effort at consumer side
https://tinyurl.com/dbpedia-kedl-2019
Law of Diminishing Returns
Data Quality is pareto-efficient
20/80 rule
Law totally applies
-> No exception to manual curation or
AI-generated datasets
More data (coverage) means lower
quality
More quality means remaining errors
are harder to fix
Update is more difficult than Create
https://tinyurl.com/dbpedia-kedl-2019
Law of Diminishing Returns
Online communities aka users are great
-> they are easy to motivate by the “greater good”, “better data”
-> they work for free, i.e. zero or low cost on the budget
If you are already in the “Diminishing Returns” phase, what happens if you
double community activity (workforce)?
Twice as much data?
-> No, maybe 25% more
Better overall quality?
-> Maybe not, if you increase data size
https://tinyurl.com/dbpedia-kedl-2019
Law of Diminishing Returns
No exceptions.
Test-driven Evaluation of Linked Data Quality by Dimitris Kontokostas et al.
(2014, WWW) in a joint project with Dutch Libraries (Enno Meijers)
Dimitris Kontokostas was the former CTO of DBpedia
and editor of the W3C standard Shapes Constraint Language
(SHACL) in 2017, implemented as OS in RDFUnit
Test-driven data engineering follows the 20/80 rule
Small set of initial test cases is efficient, then you hit the glass ceiling
https://tinyurl.com/dbpedia-kedl-2019
Rider no. 2: Copying data
● 10,000 downloads of a dataset dump
-> If one unparseable line needs 15 minutes to find and fix, we are
talking about 104 days of work
● publishers are struggling with data quality, but their
consumers have invested 50-5000 times their effort in cleaning
● The ~600k yearly file downloads and 20 million API hits daily of DBpedia
re-incarnate as local data quality problems
Step 1: Download data
Step 2: Clean and integrate
If we could just capture consumer-invested time 

https://tinyurl.com/dbpedia-kedl-2019
Rider no. 3: Non-collaboration
● Overlapping dataspaces
● Open licenses (compatible)
● Each organisation/project pushes against their own glass ceiling
● Wikipedia/Wikidata create yet another glass ceiling, since they
aggregate from above sources
What have the following organisations in common (could be thousands more)?
https://tinyurl.com/dbpedia-kedl-2019
Rider no. 2 & 3: Syncing upstream solves it
https://tinyurl.com/dbpedia-kedl-2019
Rider no. 4: Network Disaster of Linking / Mapping
21
O(n2
/2) -> O(n) with identifier linking paradigm, but actually
O (n) + Client-side created links + work for crawling (no standards)
Solved by Re-use via DBpedia Knowledge Library
https://tinyurl.com/dbpedia-kedl-2019
http://tinyurl.com/dbpedia-xml-2019
Less work, more benefits
Source:http://artpictures.club/shans-february-5-13.html
“Open Data” is too altruistic
(for others)
Less work by innovation
- Reusability / Deduplication of effort
- AI assistance (human in the loop)
- Off-the-shelf apps
- Power tools
Benefits aka Incentive Models
- Effective Third-Party contributions
- Open Data business models
- Attention / attribution
DBpedia Strategy Overview
Starting point
DBpedia is the most successful
open knowledge graph (OKG),
established 2008
24
Medium term goals
● 200 orgs share value via platform
● 10% of public IT projects curate data
● 1 million user, high contribution rate
● thousands of new businesses and initiatives
around the platform
Global DBpedia Platform
● Communication
& collaboration
● Share efforts and results
● Maximise societal valuetake DBpedia to a global level
DBpedia Strategy Overview
25
DBpedia bootstrapped Linked Open Data
LOD is the largest knowledge graph on earth
Migration from P2P to an efficient platform
- lower entry barriers
- improved discovery and cooperation
DBpedia LOD Custody
data silo
Open Decentral Platform
- file access and processing in network
- FAIR principles and W3C standards (DCAT)
- agile data engineering (Gitflow)
- high degree of automation (Maven)
Knowledge Library
- Semantic layer on data
- Semantic interoperability
- AI assistance
Custody:
Coordinate decentralisation
Digital Factory Platform
https://databus.dbpedia.org/
https://databus.dbpedia.org/repo/sparql
https://databus.dbpedia.org/yasgui
Inspired by
Databus - Digital Factory Platform
Registry of files on the Web
● Virtual file warehouse
● Decentralised file storage
● Storage is cheap
● Downloadable
● File format doesn’t matter
○ RDF
○ CSV, XML
○ PDF or PDF collection
28
Databus - Digital Factory Platform
29
Rejected Approved

 but very strict metadata
● Provenance (who? - you!)
● Machine-readable licenses
● Private key signature, X509 (Trust)
● Dataset identity
● Versioning
Minimal metadata core
Everybody can add more metadata systematically (no user tagging)
Databus - Digital Factory Platform
Build automation tool based on Maven
● Dataset Identity (ArtifactId)
○ Variance in content/format/compression
● Optimized for re-releasing the same files
○ ~ 3 days to learn and setup the tool (once)
○ 10 minutes to publish an update
https://github.com/dbpedia/databus-maven-plugin
30
Time versioned: 2018.04.10
https://tinyurl.com/dbpedia-kedl-2019
Databus Demo
Download all data on the bus
Stable Ids for collections with fixed or dynamic versions:
https://databus.dbpedia.org/dbpedia/collections/pre-release-2019-08-30
Dynamic versions:
- Latest
- Passed test suite x
- Most popular / other criteria
https://tinyurl.com/dbpedia-kedl-2019
Databus-Client (alpha)
Download
Compression
File format
Mapping
Integration of existing frameworks
RML, R2RML, XSLT, R2R
(sameAs/equivalentClass), SPARQL Construct
{ RDF/XML, TTL, JSON-LD, N-Triples },
{ CSV, TSV }
gz, bz2, snappy-framed, xz,
deflate, lzma, zstd
any file
LVL 0
LVL 1
LVL 2
LVL 3
sha256 checksum
Featured formats Implementation
Unify with
Apache Compress
RDF Formats now,
all media types planned
CSV to RDF (tarql)
https://tinyurl.com/dbpedia-kedl-2019
Databus-Client (alpha)
Download
Compression
File format
Mapping
Integration of existing frameworks
RML, R2RML, XSLT, R2R
(sameAs/equivalentClass), SPARQL Construct
{ RDF/XML, TTL, JSON-LD, N-Triples },
{ CSV, TSV }
gz, bz2, snappy-framed, xz,
deflate, lzma, zstd
any file
LVL 0
LVL 1
LVL 2
LVL 3
sha256 checksum
Featured formats Implementation
Unify with
Apache Compress
RDF Formats now,
all media types planned
CSV to RDF (tarql)
Application Replication and Deployment Layer (load data into software or databases automatically)
https://tinyurl.com/dbpedia-kedl-2019
Summary and outlook
● 4 riders of datacalypse
○ rethink data processes
● Databus as a technical platform to version, access and process files in an automated manner
○ own file server required
● Knowledge library as a central tool of the integration platform
○ Third-party consumers
● FlexiFusion - PreFusion
○ Synchronisation and comparison of tangible data records
● FlexiFusion - Fusion
○ Second largest open knowledge graph
https://tinyurl.com/dbpedia-kedl-2019
Debugging - publish first, then test
Test results
https://tinyurl.com/dbpedia-kedl-2019
DBpedia Knowledge Library
Mission is to improve external data
● Caches the core fraction of digital data (master records)
○ Aggregated from external data
○ Centrally indexed, linked and structured
● Curative focus
○ Ontologies → yes, all ontologies
○ Mappings → manage connection between all schema
○ Links → global view on entities
● AI assistance
○ Guides effective curation (Symbolic ILP & active learning)
○ Mitigates the gap between stability and evolution
○ Powerful discovery
https://tinyurl.com/dbpedia-kedl-2019
Disambiguation of Master Records
Tangible Entities (clear identification):
Persons, Video games, Power Plants,
Bibliographic References
Tangible Facts:
Height of Basketball Players
Barack Obama was the 44th president of the
United States from 2009 to 2017
----------------------------------------------------------
Excellent cost/benefit ratio
Useful for 90% of use cases
Elusive Entities (vague Identification)
Books, Songs, Products, Events
Elusive Facts:
Leipzig has 600,000 citizens
Trump is the current US president
----------------------------------------------------------------
Burn thousands of hours of discussion
with minimal results
OntoGrate Methodology will produce global and ultimate lists for tangible entities and facts
https://tinyurl.com/dbpedia-kedl-2019
Use Cases - Flexi Fusion
Third Party effort by
DBpedia Data Wranglers & Users
PreFusion Fusion Source Enrichment Export
Aggregation, incl.
Provenance
Comparison of Data
“Best of”
Like DBpedia KG, just
bigger & better
Improve sources
(sync)
Lower curation effort
Custom fusion for use
cases and applications
Applications
Frey et al. (ISWC 2019):
DBpedia FlexiFusion the Best of Wikipedia > Wikidata > Your Data
https://tinyurl.com/dbpedia-kedl-2019
Demo of Fusion Result
Scalable global id management will allow growth into the second largest open knowledge graph
https://databus.dbpedia.org/vehnem/flexifusion/fusion/2019.11.15
3.8 million birthdates
Three targets at the moment:
- Ultimate lists of European authors (French, German, Dutch, Swiss library data)
- Power plants and energy sector
- Company data
https://tinyurl.com/dbpedia-kedl-2019
Next steps
We are looking for two PhD students (female or other) to help us build the second largest open
knowledge graph with AI-assistance
We are looking for strong partners to help us set up national data infrastructure that sync with the
global DBpedia infrastructure
Stay informed via http://forum.dbpedia.org and http://blog.dbpedia.org

Weitere Àhnliche Inhalte

Was ist angesagt?

Building COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science ProjectBuilding COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science Projectvty
 
Fighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial IntelligenceFighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial Intelligencevty
 
Discovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data PortalsDiscovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data PortalsPeter Haase
 
Dataverse SSHOC enrichment of DDI support at EDDI'19 2
Dataverse SSHOC enrichment of DDI support at EDDI'19 2Dataverse SSHOC enrichment of DDI support at EDDI'19 2
Dataverse SSHOC enrichment of DDI support at EDDI'19 2vty
 
Dataverse in the European Open Science Cloud
Dataverse in the European Open Science CloudDataverse in the European Open Science Cloud
Dataverse in the European Open Science Cloudvty
 
SeaDataCloud - Developing the SeaDataCloud Virtual Research Environment (VRE)
SeaDataCloud - Developing the SeaDataCloud Virtual Research Environment (VRE)SeaDataCloud - Developing the SeaDataCloud Virtual Research Environment (VRE)
SeaDataCloud - Developing the SeaDataCloud Virtual Research Environment (VRE)EUDAT
 
Graph databases & data integration - the case of RDF
Graph databases & data integration - the case of RDFGraph databases & data integration - the case of RDF
Graph databases & data integration - the case of RDFDimitris Kontokostas
 
ODFPlugfest 2015 Kolab, WebODF, Pleio
ODFPlugfest 2015 Kolab, WebODF, PleioODFPlugfest 2015 Kolab, WebODF, Pleio
ODFPlugfest 2015 Kolab, WebODF, Pleiohcderaad
 
8th DBpedia meeting / California 2016
8th DBpedia meeting /  California 20168th DBpedia meeting /  California 2016
8th DBpedia meeting / California 2016Dimitris Kontokostas
 
Building a Canadian National Research Data Management Framework - Mark Leggott
Building a Canadian National Research Data Management Framework - Mark LeggottBuilding a Canadian National Research Data Management Framework - Mark Leggott
Building a Canadian National Research Data Management Framework - Mark LeggottCASRAI
 
DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...
DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...
DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...Andrea Bollini
 
Setting up Dataverse repository for research data
Setting up Dataverse repository for research dataSetting up Dataverse repository for research data
Setting up Dataverse repository for research datavty
 
Museum LOD (Ontotext, 1 May 2019, Doha, Qatar)
Museum LOD (Ontotext, 1 May 2019, Doha, Qatar)Museum LOD (Ontotext, 1 May 2019, Doha, Qatar)
Museum LOD (Ontotext, 1 May 2019, Doha, Qatar)Vladimir Alexiev, PhD, PMP
 
Research Data Alliance Member Statistics October 2015
Research Data Alliance Member Statistics October 2015Research Data Alliance Member Statistics October 2015
Research Data Alliance Member Statistics October 2015Research Data Alliance
 
Research Data Alliance Member Statistics August 2015
Research Data Alliance Member Statistics August 2015Research Data Alliance Member Statistics August 2015
Research Data Alliance Member Statistics August 2015Research Data Alliance
 
5.15.17 Powering Linked Data and Hosted Solutions with Fedora Webinar Slides
5.15.17 Powering Linked Data and Hosted Solutions with Fedora Webinar Slides5.15.17 Powering Linked Data and Hosted Solutions with Fedora Webinar Slides
5.15.17 Powering Linked Data and Hosted Solutions with Fedora Webinar SlidesDuraSpace
 
Linguistic Linked Open Data, Challenges, Approaches, Future Work
Linguistic Linked Open Data, Challenges, Approaches, Future WorkLinguistic Linked Open Data, Challenges, Approaches, Future Work
Linguistic Linked Open Data, Challenges, Approaches, Future WorkSebastian Hellmann
 
Research Data Alliance Member Statistics July 2015
Research Data Alliance Member Statistics July 2015Research Data Alliance Member Statistics July 2015
Research Data Alliance Member Statistics July 2015Research Data Alliance
 
Research Data Alliance Member Statistics June 2015
Research Data Alliance Member Statistics June 2015Research Data Alliance Member Statistics June 2015
Research Data Alliance Member Statistics June 2015Research Data Alliance
 

Was ist angesagt? (20)

Building COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science ProjectBuilding COVID-19 Museum as Open Science Project
Building COVID-19 Museum as Open Science Project
 
Fighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial IntelligenceFighting COVID-19 with Artificial Intelligence
Fighting COVID-19 with Artificial Intelligence
 
Discovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data PortalsDiscovering Related Data Sources in Data Portals
Discovering Related Data Sources in Data Portals
 
Dataverse SSHOC enrichment of DDI support at EDDI'19 2
Dataverse SSHOC enrichment of DDI support at EDDI'19 2Dataverse SSHOC enrichment of DDI support at EDDI'19 2
Dataverse SSHOC enrichment of DDI support at EDDI'19 2
 
Dataverse in the European Open Science Cloud
Dataverse in the European Open Science CloudDataverse in the European Open Science Cloud
Dataverse in the European Open Science Cloud
 
SeaDataCloud - Developing the SeaDataCloud Virtual Research Environment (VRE)
SeaDataCloud - Developing the SeaDataCloud Virtual Research Environment (VRE)SeaDataCloud - Developing the SeaDataCloud Virtual Research Environment (VRE)
SeaDataCloud - Developing the SeaDataCloud Virtual Research Environment (VRE)
 
Graph databases & data integration - the case of RDF
Graph databases & data integration - the case of RDFGraph databases & data integration - the case of RDF
Graph databases & data integration - the case of RDF
 
ODFPlugfest 2015 Kolab, WebODF, Pleio
ODFPlugfest 2015 Kolab, WebODF, PleioODFPlugfest 2015 Kolab, WebODF, Pleio
ODFPlugfest 2015 Kolab, WebODF, Pleio
 
8th DBpedia meeting / California 2016
8th DBpedia meeting /  California 20168th DBpedia meeting /  California 2016
8th DBpedia meeting / California 2016
 
Building a Canadian National Research Data Management Framework - Mark Leggott
Building a Canadian National Research Data Management Framework - Mark LeggottBuilding a Canadian National Research Data Management Framework - Mark Leggott
Building a Canadian National Research Data Management Framework - Mark Leggott
 
DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...
DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...
DSpace-CRIS: an open source solution - Cineca euroCRIS membership meeting Por...
 
Setting up Dataverse repository for research data
Setting up Dataverse repository for research dataSetting up Dataverse repository for research data
Setting up Dataverse repository for research data
 
Museum LOD (Ontotext, 1 May 2019, Doha, Qatar)
Museum LOD (Ontotext, 1 May 2019, Doha, Qatar)Museum LOD (Ontotext, 1 May 2019, Doha, Qatar)
Museum LOD (Ontotext, 1 May 2019, Doha, Qatar)
 
Research Data Alliance Member Statistics October 2015
Research Data Alliance Member Statistics October 2015Research Data Alliance Member Statistics October 2015
Research Data Alliance Member Statistics October 2015
 
Research Data Alliance Member Statistics August 2015
Research Data Alliance Member Statistics August 2015Research Data Alliance Member Statistics August 2015
Research Data Alliance Member Statistics August 2015
 
5.15.17 Powering Linked Data and Hosted Solutions with Fedora Webinar Slides
5.15.17 Powering Linked Data and Hosted Solutions with Fedora Webinar Slides5.15.17 Powering Linked Data and Hosted Solutions with Fedora Webinar Slides
5.15.17 Powering Linked Data and Hosted Solutions with Fedora Webinar Slides
 
Linguistic Linked Open Data, Challenges, Approaches, Future Work
Linguistic Linked Open Data, Challenges, Approaches, Future WorkLinguistic Linked Open Data, Challenges, Approaches, Future Work
Linguistic Linked Open Data, Challenges, Approaches, Future Work
 
Research Data Alliance Member Statistics July 2015
Research Data Alliance Member Statistics July 2015Research Data Alliance Member Statistics July 2015
Research Data Alliance Member Statistics July 2015
 
euroCRIS DSpace-CRIS presentation at #CRIS2014 conference
euroCRIS DSpace-CRIS presentation at #CRIS2014 conferenceeuroCRIS DSpace-CRIS presentation at #CRIS2014 conference
euroCRIS DSpace-CRIS presentation at #CRIS2014 conference
 
Research Data Alliance Member Statistics June 2015
Research Data Alliance Member Statistics June 2015Research Data Alliance Member Statistics June 2015
Research Data Alliance Member Statistics June 2015
 

Ähnlich wie KEDL DBpedia 2019

Easy SPARQLing for the Building Performance Professional
Easy SPARQLing for the Building Performance ProfessionalEasy SPARQLing for the Building Performance Professional
Easy SPARQLing for the Building Performance ProfessionalMartin Kaltenböck
 
DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016Sebastian Hellmann
 
20191210 NDLI KEDL2019 Building the dutch digital heritage network
20191210 NDLI KEDL2019 Building the dutch digital heritage network20191210 NDLI KEDL2019 Building the dutch digital heritage network
20191210 NDLI KEDL2019 Building the dutch digital heritage networkEnno Meijers
 
Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011Dublinked .
 
COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015Comsode - FP7 project
 
How to build and run a big data platform in the 21st century
How to build and run a big data platform in the 21st centuryHow to build and run a big data platform in the 21st century
How to build and run a big data platform in the 21st centuryAli Dasdan
 
Linked Open Data Principles, benefits of LOD for sustainable development
Linked Open Data Principles, benefits of LOD for sustainable developmentLinked Open Data Principles, benefits of LOD for sustainable development
Linked Open Data Principles, benefits of LOD for sustainable developmentMartin Kaltenböck
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationDenodo
 
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...Artefactual Systems - AtoM
 
Linked Data for Biopharma
Linked Data for BiopharmaLinked Data for Biopharma
Linked Data for BiopharmaTom Plasterer
 
Semantische Technologien (nicht nur) fĂŒr die verbesserte Suche in SharePoint
Semantische Technologien (nicht nur) fĂŒr die verbesserte Suche in SharePointSemantische Technologien (nicht nur) fĂŒr die verbesserte Suche in SharePoint
Semantische Technologien (nicht nur) fĂŒr die verbesserte Suche in SharePointDIQA Projektmanagement GmbH
 
Video game controlled vocabulary in wikidata
Video game controlled vocabulary in wikidataVideo game controlled vocabulary in wikidata
Video game controlled vocabulary in wikidatapeterchanws
 
BigDataEurope @BDVA Summit2016 1: The BDE Platform
BigDataEurope @BDVA Summit2016 1: The BDE PlatformBigDataEurope @BDVA Summit2016 1: The BDE Platform
BigDataEurope @BDVA Summit2016 1: The BDE PlatformBigData_Europe
 
The Web of data and web data commons
The Web of data and web data commonsThe Web of data and web data commons
The Web of data and web data commonsJesse Wang
 
Planetdata simpda
Planetdata simpdaPlanetdata simpda
Planetdata simpdaElena Simperl
 

Ähnlich wie KEDL DBpedia 2019 (20)

Sebastian Hellmann
Sebastian HellmannSebastian Hellmann
Sebastian Hellmann
 
Easy SPARQLing for the Building Performance Professional
Easy SPARQLing for the Building Performance ProfessionalEasy SPARQLing for the Building Performance Professional
Easy SPARQLing for the Building Performance Professional
 
LOD2 Webinar Series: CubeViz
LOD2 Webinar Series: CubeViz LOD2 Webinar Series: CubeViz
LOD2 Webinar Series: CubeViz
 
DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016DBpedia/association Introduction The Hague 12.2.2016
DBpedia/association Introduction The Hague 12.2.2016
 
20191210 NDLI KEDL2019 Building the dutch digital heritage network
20191210 NDLI KEDL2019 Building the dutch digital heritage network20191210 NDLI KEDL2019 Building the dutch digital heritage network
20191210 NDLI KEDL2019 Building the dutch digital heritage network
 
Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011
 
COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015
 
How to build and run a big data platform in the 21st century
How to build and run a big data platform in the 21st centuryHow to build and run a big data platform in the 21st century
How to build and run a big data platform in the 21st century
 
Linked Open Data Principles, benefits of LOD for sustainable development
Linked Open Data Principles, benefits of LOD for sustainable developmentLinked Open Data Principles, benefits of LOD for sustainable development
Linked Open Data Principles, benefits of LOD for sustainable development
 
Unlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data VirtualizationUnlock Your Data for ML & AI using Data Virtualization
Unlock Your Data for ML & AI using Data Virtualization
 
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
Technologie Proche: Imagining the Archival Systems of Tomorrow With the Tools...
 
Linked Data for Biopharma
Linked Data for BiopharmaLinked Data for Biopharma
Linked Data for Biopharma
 
Semantische Technologien (nicht nur) fĂŒr die verbesserte Suche in SharePoint
Semantische Technologien (nicht nur) fĂŒr die verbesserte Suche in SharePointSemantische Technologien (nicht nur) fĂŒr die verbesserte Suche in SharePoint
Semantische Technologien (nicht nur) fĂŒr die verbesserte Suche in SharePoint
 
Video game controlled vocabulary in wikidata
Video game controlled vocabulary in wikidataVideo game controlled vocabulary in wikidata
Video game controlled vocabulary in wikidata
 
BigDataEurope @BDVA Summit2016 1: The BDE Platform
BigDataEurope @BDVA Summit2016 1: The BDE PlatformBigDataEurope @BDVA Summit2016 1: The BDE Platform
BigDataEurope @BDVA Summit2016 1: The BDE Platform
 
The Web of data and web data commons
The Web of data and web data commonsThe Web of data and web data commons
The Web of data and web data commons
 
Linked Open Data
Linked Open DataLinked Open Data
Linked Open Data
 
Planetdata simpda
Planetdata simpdaPlanetdata simpda
Planetdata simpda
 
PlanetData: Consuming Structured Data at Web Scale
PlanetData: Consuming Structured Data at Web ScalePlanetData: Consuming Structured Data at Web Scale
PlanetData: Consuming Structured Data at Web Scale
 
Open Data is not Enough
Open Data is not EnoughOpen Data is not Enough
Open Data is not Enough
 

Mehr von Sebastian Hellmann

Lider Reference Model ld4lt session March, 3rd, 2015
Lider Reference Model ld4lt session  March, 3rd, 2015Lider Reference Model ld4lt session  March, 3rd, 2015
Lider Reference Model ld4lt session March, 3rd, 2015Sebastian Hellmann
 
LD4LT Roadmap session 19_02_2015
LD4LT Roadmap session 19_02_2015LD4LT Roadmap session 19_02_2015
LD4LT Roadmap session 19_02_2015Sebastian Hellmann
 
Integrating NLP using Linked Data
Integrating NLP using Linked DataIntegrating NLP using Linked Data
Integrating NLP using Linked DataSebastian Hellmann
 
NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web  NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web Sebastian Hellmann
 
Linked Data for Abbreviations and Segmentation
Linked Data for Abbreviations and SegmentationLinked Data for Abbreviations and Segmentation
Linked Data for Abbreviations and SegmentationSebastian Hellmann
 
NIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportNIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportSebastian Hellmann
 
Navigation-induced Knowledge Engineering by Example
 Navigation-induced Knowledge Engineering by Example Navigation-induced Knowledge Engineering by Example
Navigation-induced Knowledge Engineering by ExampleSebastian Hellmann
 
Improving the Performance of the DL-Learner SPARQL Component for Semantic We...
Improving the Performance of the  DL-Learner SPARQL Component for Semantic We...Improving the Performance of the  DL-Learner SPARQL Component for Semantic We...
Improving the Performance of the DL-Learner SPARQL Component for Semantic We...Sebastian Hellmann
 
Linked Data in Linguistics for NLP and Web Annotation
Linked Data in Linguistics for NLP and Web AnnotationLinked Data in Linguistics for NLP and Web Annotation
Linked Data in Linguistics for NLP and Web AnnotationSebastian Hellmann
 
Introduction to LDL 2012
Introduction to LDL 2012Introduction to LDL 2012
Introduction to LDL 2012Sebastian Hellmann
 
NIF - Version 1.0 - 2011/10/23
NIF - Version 1.0 - 2011/10/23NIF - Version 1.0 - 2011/10/23
NIF - Version 1.0 - 2011/10/23Sebastian Hellmann
 
NIF - NLP Interchange Format
NIF - NLP Interchange FormatNIF - NLP Interchange Format
NIF - NLP Interchange FormatSebastian Hellmann
 
Tool collection as linkeddata
Tool collection as linkeddataTool collection as linkeddata
Tool collection as linkeddataSebastian Hellmann
 
NLP2RDF Wortschatz and Linguistic LOD draft
NLP2RDF Wortschatz and Linguistic LOD draftNLP2RDF Wortschatz and Linguistic LOD draft
NLP2RDF Wortschatz and Linguistic LOD draftSebastian Hellmann
 

Mehr von Sebastian Hellmann (16)

Lider Reference Model ld4lt session March, 3rd, 2015
Lider Reference Model ld4lt session  March, 3rd, 2015Lider Reference Model ld4lt session  March, 3rd, 2015
Lider Reference Model ld4lt session March, 3rd, 2015
 
LD4LT Roadmap session 19_02_2015
LD4LT Roadmap session 19_02_2015LD4LT Roadmap session 19_02_2015
LD4LT Roadmap session 19_02_2015
 
Integrating NLP using Linked Data
Integrating NLP using Linked DataIntegrating NLP using Linked Data
Integrating NLP using Linked Data
 
NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web  NIF 2.0 Tutorial: Content Analysis and the Semantic Web
NIF 2.0 Tutorial: Content Analysis and the Semantic Web
 
Linked Data for Abbreviations and Segmentation
Linked Data for Abbreviations and SegmentationLinked Data for Abbreviations and Segmentation
Linked Data for Abbreviations and Segmentation
 
NIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportNIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate report
 
Navigation-induced Knowledge Engineering by Example
 Navigation-induced Knowledge Engineering by Example Navigation-induced Knowledge Engineering by Example
Navigation-induced Knowledge Engineering by Example
 
Improving the Performance of the DL-Learner SPARQL Component for Semantic We...
Improving the Performance of the  DL-Learner SPARQL Component for Semantic We...Improving the Performance of the  DL-Learner SPARQL Component for Semantic We...
Improving the Performance of the DL-Learner SPARQL Component for Semantic We...
 
NIF 2.0 draft for Pisa
NIF 2.0 draft for PisaNIF 2.0 draft for Pisa
NIF 2.0 draft for Pisa
 
Linked Data in Linguistics for NLP and Web Annotation
Linked Data in Linguistics for NLP and Web AnnotationLinked Data in Linguistics for NLP and Web Annotation
Linked Data in Linguistics for NLP and Web Annotation
 
Introduction to LDL 2012
Introduction to LDL 2012Introduction to LDL 2012
Introduction to LDL 2012
 
Thesis presentation
Thesis presentationThesis presentation
Thesis presentation
 
NIF - Version 1.0 - 2011/10/23
NIF - Version 1.0 - 2011/10/23NIF - Version 1.0 - 2011/10/23
NIF - Version 1.0 - 2011/10/23
 
NIF - NLP Interchange Format
NIF - NLP Interchange FormatNIF - NLP Interchange Format
NIF - NLP Interchange Format
 
Tool collection as linkeddata
Tool collection as linkeddataTool collection as linkeddata
Tool collection as linkeddata
 
NLP2RDF Wortschatz and Linguistic LOD draft
NLP2RDF Wortschatz and Linguistic LOD draftNLP2RDF Wortschatz and Linguistic LOD draft
NLP2RDF Wortschatz and Linguistic LOD draft
 

KĂŒrzlich hochgeladen

Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
 
Call Girls đŸ«€ Dwarka âžĄïž 9711199171 âžĄïž Delhi đŸ«Š Two shot with one girl
Call Girls đŸ«€ Dwarka âžĄïž 9711199171 âžĄïž Delhi đŸ«Š Two shot with one girlCall Girls đŸ«€ Dwarka âžĄïž 9711199171 âžĄïž Delhi đŸ«Š Two shot with one girl
Call Girls đŸ«€ Dwarka âžĄïž 9711199171 âžĄïž Delhi đŸ«Š Two shot with one girlkumarajju5765
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptDr. Soumendra Kumar Patra
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxolyaivanovalion
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxolyaivanovalion
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130Suhani Kapoor
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔Body to body massage wit...shivangimorya083
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 

KĂŒrzlich hochgeladen (20)

Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
 
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
(PARI) Call Girls Wanowrie ( 7001035870 ) HI-Fi Pune Escorts Service
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
 
Call Girls đŸ«€ Dwarka âžĄïž 9711199171 âžĄïž Delhi đŸ«Š Two shot with one girl
Call Girls đŸ«€ Dwarka âžĄïž 9711199171 âžĄïž Delhi đŸ«Š Two shot with one girlCall Girls đŸ«€ Dwarka âžĄïž 9711199171 âžĄïž Delhi đŸ«Š Two shot with one girl
Call Girls đŸ«€ Dwarka âžĄïž 9711199171 âžĄïž Delhi đŸ«Š Two shot with one girl
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Ravak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptxRavak dropshipping via API with DroFx.pptx
Ravak dropshipping via API with DroFx.pptx
 
Zuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptxZuja dropshipping via API with DroFx.pptx
Zuja dropshipping via API with DroFx.pptx
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
꧁❀ Aerocity Call Girls Service Aerocity Delhi ❀꧂ 9999965857 ☎ Hard And Sexy ...
꧁❀ Aerocity Call Girls Service Aerocity Delhi ❀꧂ 9999965857 ☎ Hard And Sexy ...꧁❀ Aerocity Call Girls Service Aerocity Delhi ❀꧂ 9999965857 ☎ Hard And Sexy ...
꧁❀ Aerocity Call Girls Service Aerocity Delhi ❀꧂ 9999965857 ☎ Hard And Sexy ...
 
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
VIP Call Girls Service Miyapur Hyderabad Call +91-8250192130
 
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔Body to body massage wit...
Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔Body to body massage wit...Vip Model  Call Girls (Delhi) Karol Bagh 9711199171✔Body to body massage wit...
Vip Model Call Girls (Delhi) Karol Bagh 9711199171✔Body to body massage wit...
 
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 

KEDL DBpedia 2019

  • 1. Sebastian Hellmann http://dbpedia.org 1 DBpedia and the Custody of Linked Open Data
  • 2. https://tinyurl.com/dbpedia-kedl-2019 Outline DBpedia History and Challenges ● The beginning phase (2007 - 2011) ● The manifestation phase (2011 - 2016) ● Glass ceiling phase (2016 - 2019) ● Lessons learned The future brings solutions ● Skyrocketing phase (2020 - 2025) ○ Databus ○ Knowledge Library ○ FlexiFusion
  • 3. https://tinyurl.com/dbpedia-kedl-2019 Beginning Phase of DBpedia 2007-2011 2007 - First Extraction of Wikipedia’s Infoboxes ● https://en.wikipedia.org/wiki/National_Digital_Library_of_India ● (en) https:/dbpedia.org/resource/National_Digital_Library_of_India ● (new, global) https://global.dbpedia.org/?s=https%3A%2F%2Fen.wikipedia.org%2Fwiki%2FNational_Digital_Library_of_India ● (new, global) https://global.dbpedia.org/id/3FdBE
  • 4. https://tinyurl.com/dbpedia-kedl-2019 Beginning Phase of DBpedia 2007-2011 2007 - First Extraction of Wikipedia’s Infoboxes Query Wikipedia Like a Database (query working for 12 years) soccer players, who are born in a country with more than 10 million inhabitants, who played as goalkeeper for a club that has a stadium with more than 30.000 seats and the club country is different from the birth country Starting members: ● Uni Leipzig, Open Link Software, FU Berlin (affiliation change of key persons)
  • 5. https://tinyurl.com/dbpedia-kedl-2019 Beginning Phase of DBpedia 2007-2011 Exceptional boost of research and industrial innovation ● DBpedia Dataset became the foundation for around 25000 scientific papers ● High industry adoption: BBC, New York Times, Yahoo, Watson (Jeopardy) ● Emergence of semantic technologies: ○ Knowledge Extraction ○ Entity Linking (Databases) ○ Entity Linking (Natural Language Processing) - DBpedia Spotlight ○ Graph Databases
  • 6. https://tinyurl.com/dbpedia-kedl-2019 Beginning Phase of DBpedia 2007-2011 LOD = Linked Open Data See full slides here with history: https://slidewiki.org/presentation/661/_/661/5475-3/#/slide-5475-3 Recent: http://lod-cloud.net
  • 7. https://tinyurl.com/dbpedia-kedl-2019 Beginning Phase of DBpedia 2007-2011 DBpedia shaped Linked Data Best Practices 1. Use URIs as names for things 2. Use HTTP URIs so that people can look up those names. 3. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL) 4. Include links to other URIs. so that they can discover more things. [Identifier Linking Paradigm]
  • 8. https://tinyurl.com/dbpedia-kedl-2019 Manifestation phase 2011-2016 Growth phase ● Internationalization ○ Covering all 140 Wikipedia languages ○ 22 language chapters, worldwide DBpedia embassies and institutional collaborators ● http://mappings.dbpedia.org ○ ~300 editors ○ DBpedia Ontology incl. links to other ontologies ○ Data cleaning rules ● Data extensions such as Wikimedia Commons, Wikidata, NIF ○ 14 Billion statements ● Ontology contributions YAGO, SUMO, Umbel, LHD, DBTax ○ 8 taxonomies to query DBpedia
  • 9. https://tinyurl.com/dbpedia-kedl-2019 Manifestation phase 2011-2015 Foundation of the DBpedia Association in 2014: ● Reliable core of organisational supporters for sustainability and network multiplication ● A network around a non-profit coordinator (DBpedia Association, Primus inter pares)
  • 10. https://tinyurl.com/dbpedia-kedl-2019 Glass ceiling phase 2016 - 2019 Glass ceiling is a term from gender inequality Appropriate metaphor: ● Women have to work harder than men for their career ● Pushing harder yields less and less results https://wiki.ubc.ca/Glass_Ceiling
  • 11. https://tinyurl.com/dbpedia-kedl-2019 Glass ceiling phase 2016 - 2019 Glass ceiling is a term from gender inequality Appropriate metaphor: ● Women have to work harder than men for their career ● Pushing harder yields less and less results Totally applies to all data projects https://wiki.ubc.ca/Glass_Ceiling
  • 12. https://tinyurl.com/dbpedia-kedl-2019 Glass ceiling phase 2016 - 2019 What did DBpedia do? ● We formed a think tank with our members and community (engineers and technology leaders) and discussed and re-designed Results: ● Open data needs a scalable business model ● Identified problems of decentralization and innovated to provide solutions to boost decentralized approaches (LOD) Only coordinated decentralisation is able to break the glass ceiling What happened in the world? ● Fierce competition over editorial workforce ○ Orcid ○ Wikidata ○ Microsoft Academic ○ Google ○ Thousands more ● Identifier wars (Identifier Linking Paradigm) ○ everybody wants you to include their IDs in order to gain attention and workforce ● Plethora of data graveyards (project end, running out of funding)
  • 13. https://tinyurl.com/dbpedia-kedl-2019 Foundations ALIGNED: Aligning Software & Data Engineering 2015 - 2018 http://aligned-project.eu Engineering Agile Big-Data Systems defines three dimensions to evaluate systems: ● productivity ● quality ● agility https://www.riverpublishers.com/book_details.php?book_id=659
  • 14. https://tinyurl.com/dbpedia-kedl-2019 4 riders of the datacalypse 1. Law of Diminishing Returns -> intrinsic to data quality, the glass ceiling 2. Copying data -> Duplication of effort by copy 3. Non-collaboration of data publishers -> Multiplication of effort by individual non-synced data sets 4. Network Disaster of Linking / Mapping -> Multiplication of effort at consumer side
  • 15. https://tinyurl.com/dbpedia-kedl-2019 Law of Diminishing Returns Data Quality is pareto-efficient 20/80 rule Law totally applies -> No exception to manual curation or AI-generated datasets More data (coverage) means lower quality More quality means remaining errors are harder to fix Update is more difficult than Create
  • 16. https://tinyurl.com/dbpedia-kedl-2019 Law of Diminishing Returns Online communities aka users are great -> they are easy to motivate by the “greater good”, “better data” -> they work for free, i.e. zero or low cost on the budget If you are already in the “Diminishing Returns” phase, what happens if you double community activity (workforce)? Twice as much data? -> No, maybe 25% more Better overall quality? -> Maybe not, if you increase data size
  • 17. https://tinyurl.com/dbpedia-kedl-2019 Law of Diminishing Returns No exceptions. Test-driven Evaluation of Linked Data Quality by Dimitris Kontokostas et al. (2014, WWW) in a joint project with Dutch Libraries (Enno Meijers) Dimitris Kontokostas was the former CTO of DBpedia and editor of the W3C standard Shapes Constraint Language (SHACL) in 2017, implemented as OS in RDFUnit Test-driven data engineering follows the 20/80 rule Small set of initial test cases is efficient, then you hit the glass ceiling
  • 18. https://tinyurl.com/dbpedia-kedl-2019 Rider no. 2: Copying data ● 10,000 downloads of a dataset dump -> If one unparseable line needs 15 minutes to find and fix, we are talking about 104 days of work ● publishers are struggling with data quality, but their consumers have invested 50-5000 times their effort in cleaning ● The ~600k yearly file downloads and 20 million API hits daily of DBpedia re-incarnate as local data quality problems Step 1: Download data Step 2: Clean and integrate If we could just capture consumer-invested time 

  • 19. https://tinyurl.com/dbpedia-kedl-2019 Rider no. 3: Non-collaboration ● Overlapping dataspaces ● Open licenses (compatible) ● Each organisation/project pushes against their own glass ceiling ● Wikipedia/Wikidata create yet another glass ceiling, since they aggregate from above sources What have the following organisations in common (could be thousands more)?
  • 20. https://tinyurl.com/dbpedia-kedl-2019 Rider no. 2 & 3: Syncing upstream solves it
  • 21. https://tinyurl.com/dbpedia-kedl-2019 Rider no. 4: Network Disaster of Linking / Mapping 21 O(n2 /2) -> O(n) with identifier linking paradigm, but actually O (n) + Client-side created links + work for crawling (no standards) Solved by Re-use via DBpedia Knowledge Library
  • 23. http://tinyurl.com/dbpedia-xml-2019 Less work, more benefits Source:http://artpictures.club/shans-february-5-13.html “Open Data” is too altruistic (for others) Less work by innovation - Reusability / Deduplication of effort - AI assistance (human in the loop) - Off-the-shelf apps - Power tools Benefits aka Incentive Models - Effective Third-Party contributions - Open Data business models - Attention / attribution
  • 24. DBpedia Strategy Overview Starting point DBpedia is the most successful open knowledge graph (OKG), established 2008 24 Medium term goals ● 200 orgs share value via platform ● 10% of public IT projects curate data ● 1 million user, high contribution rate ● thousands of new businesses and initiatives around the platform Global DBpedia Platform ● Communication & collaboration ● Share efforts and results ● Maximise societal valuetake DBpedia to a global level
  • 25. DBpedia Strategy Overview 25 DBpedia bootstrapped Linked Open Data LOD is the largest knowledge graph on earth Migration from P2P to an efficient platform - lower entry barriers - improved discovery and cooperation
  • 26. DBpedia LOD Custody data silo Open Decentral Platform - file access and processing in network - FAIR principles and W3C standards (DCAT) - agile data engineering (Gitflow) - high degree of automation (Maven) Knowledge Library - Semantic layer on data - Semantic interoperability - AI assistance Custody: Coordinate decentralisation
  • 28. Databus - Digital Factory Platform Registry of files on the Web ● Virtual file warehouse ● Decentralised file storage ● Storage is cheap ● Downloadable ● File format doesn’t matter ○ RDF ○ CSV, XML ○ PDF or PDF collection 28
  • 29. Databus - Digital Factory Platform 29 Rejected Approved 
 but very strict metadata ● Provenance (who? - you!) ● Machine-readable licenses ● Private key signature, X509 (Trust) ● Dataset identity ● Versioning Minimal metadata core Everybody can add more metadata systematically (no user tagging)
  • 30. Databus - Digital Factory Platform Build automation tool based on Maven ● Dataset Identity (ArtifactId) ○ Variance in content/format/compression ● Optimized for re-releasing the same files ○ ~ 3 days to learn and setup the tool (once) ○ 10 minutes to publish an update https://github.com/dbpedia/databus-maven-plugin 30 Time versioned: 2018.04.10
  • 31. https://tinyurl.com/dbpedia-kedl-2019 Databus Demo Download all data on the bus Stable Ids for collections with fixed or dynamic versions: https://databus.dbpedia.org/dbpedia/collections/pre-release-2019-08-30 Dynamic versions: - Latest - Passed test suite x - Most popular / other criteria
  • 32. https://tinyurl.com/dbpedia-kedl-2019 Databus-Client (alpha) Download Compression File format Mapping Integration of existing frameworks RML, R2RML, XSLT, R2R (sameAs/equivalentClass), SPARQL Construct { RDF/XML, TTL, JSON-LD, N-Triples }, { CSV, TSV } gz, bz2, snappy-framed, xz, deflate, lzma, zstd any file LVL 0 LVL 1 LVL 2 LVL 3 sha256 checksum Featured formats Implementation Unify with Apache Compress RDF Formats now, all media types planned CSV to RDF (tarql)
  • 33. https://tinyurl.com/dbpedia-kedl-2019 Databus-Client (alpha) Download Compression File format Mapping Integration of existing frameworks RML, R2RML, XSLT, R2R (sameAs/equivalentClass), SPARQL Construct { RDF/XML, TTL, JSON-LD, N-Triples }, { CSV, TSV } gz, bz2, snappy-framed, xz, deflate, lzma, zstd any file LVL 0 LVL 1 LVL 2 LVL 3 sha256 checksum Featured formats Implementation Unify with Apache Compress RDF Formats now, all media types planned CSV to RDF (tarql) Application Replication and Deployment Layer (load data into software or databases automatically)
  • 34. https://tinyurl.com/dbpedia-kedl-2019 Summary and outlook ● 4 riders of datacalypse ○ rethink data processes ● Databus as a technical platform to version, access and process files in an automated manner ○ own file server required ● Knowledge library as a central tool of the integration platform ○ Third-party consumers ● FlexiFusion - PreFusion ○ Synchronisation and comparison of tangible data records ● FlexiFusion - Fusion ○ Second largest open knowledge graph
  • 36. https://tinyurl.com/dbpedia-kedl-2019 DBpedia Knowledge Library Mission is to improve external data ● Caches the core fraction of digital data (master records) ○ Aggregated from external data ○ Centrally indexed, linked and structured ● Curative focus ○ Ontologies → yes, all ontologies ○ Mappings → manage connection between all schema ○ Links → global view on entities ● AI assistance ○ Guides effective curation (Symbolic ILP & active learning) ○ Mitigates the gap between stability and evolution ○ Powerful discovery
  • 37. https://tinyurl.com/dbpedia-kedl-2019 Disambiguation of Master Records Tangible Entities (clear identification): Persons, Video games, Power Plants, Bibliographic References Tangible Facts: Height of Basketball Players Barack Obama was the 44th president of the United States from 2009 to 2017 ---------------------------------------------------------- Excellent cost/benefit ratio Useful for 90% of use cases Elusive Entities (vague Identification) Books, Songs, Products, Events Elusive Facts: Leipzig has 600,000 citizens Trump is the current US president ---------------------------------------------------------------- Burn thousands of hours of discussion with minimal results OntoGrate Methodology will produce global and ultimate lists for tangible entities and facts
  • 38. https://tinyurl.com/dbpedia-kedl-2019 Use Cases - Flexi Fusion Third Party effort by DBpedia Data Wranglers & Users PreFusion Fusion Source Enrichment Export Aggregation, incl. Provenance Comparison of Data “Best of” Like DBpedia KG, just bigger & better Improve sources (sync) Lower curation effort Custom fusion for use cases and applications Applications Frey et al. (ISWC 2019): DBpedia FlexiFusion the Best of Wikipedia > Wikidata > Your Data
  • 39. https://tinyurl.com/dbpedia-kedl-2019 Demo of Fusion Result Scalable global id management will allow growth into the second largest open knowledge graph https://databus.dbpedia.org/vehnem/flexifusion/fusion/2019.11.15 3.8 million birthdates Three targets at the moment: - Ultimate lists of European authors (French, German, Dutch, Swiss library data) - Power plants and energy sector - Company data
  • 40. https://tinyurl.com/dbpedia-kedl-2019 Next steps We are looking for two PhD students (female or other) to help us build the second largest open knowledge graph with AI-assistance We are looking for strong partners to help us set up national data infrastructure that sync with the global DBpedia infrastructure Stay informed via http://forum.dbpedia.org and http://blog.dbpedia.org