DBPedia past, present and future - Dimitris Kontokostas. Reveals recent developments in the Linked Data and knowledge graphs field and how DBPedia progress with wikipedia data.
3. Get me all soccer players, who played as goalkeeper for a
club that has a stadium with more than 40.000 seats and
who are born in a country with more than 10 million
inhabitants
6. How it all started
- 2006 - Sören Auer (busy with his PhD) asking people: “Wikipedia
fact tables look like triples, don’t you want to write some
extractor?”
- 6 months later: Sören wrote the extractor himself and asked Jens
Lehmann to help with writing a paper
- Chris Bizer : “We are extracting people and place information from
Wikipedia too – lets join efforts and call it DBpedia.”
- Kingsley Idehen: “I need a showcase for my Virtuoso triple store.”
8. Taking a closer look
at heterogeneity…
- DBpedia Mappings wiki
9. Milestones
- 2008: DBpedia Live
- 2009: Scala-Based framework
- 2009: Mappings wiki
- 2011: Internationalization
- 2011: DBpedia Spotlight
- 2014: DBpedia Association
10. Now
DBpedia 2014 (English):
4.58 mio. entities and 583 mio. triples
131,2 mio. fact assertions (derived from infoboxes)
168,5 mio. triples representing Wikipedia structure
57,1 mio. links to external datasets
Localized DBpedia version for 125 languages, built from
corresponding Wikipedia versions
12 DBpedia language chapters
16. NLP
- Exploit the text…
- Let different NLP tools & approaches
compete for the best quality (in a certain
language)
- Need to define the interface (help needed)
17. Every Enterprise needs its DBpedia
- Represent common sense knowledge (DBpedia and
other LOD datasets) as well as the specific enterprise
knowledge
- Crystallization points for Linked Data intranets – an
addition to SOA facilitating enterprise-wide data linking
& integration
- Slicing & Dicing