The document summarizes Wikidata, which is presented as the next big thing for Wikipedia. Wikidata aims to provide a database of the world's knowledge that is machine-readable and can be edited by anyone. It seeks to collect references for data items, engage a global community to collect data, increase Wikipedia quality, and enable other data collection projects. The presentation outlines Wikidata's goals and a three-phase project plan to link language editions, augment info boxes, and enable inline queries in Wikipedia.
Unblocking The Main Thread Solving ANRs and Frozen Frames
Wikidata presentation at SemTechBiz Berlin 2012
1. Wikidata
The next big thing for Wikipedia
SemTechBiz Berlin, February 2012
Denny Vrandečić
KIT Karlsruhe Institute of Technology / Wikimedia Deutschland
Institut AIFB – Angewandte Informatik und Formale Beschreibungsverfahren
KIT – University of the State of Baden-Württemberg and
National Large-scale Research Center of the Helmholtz Association www.kit.edu
14. Coverage by language
English, German, French, Dutch: 1 Mio+
40 languages: 100,000+
107 languages: 10.000+
But what about other languages?
07.02.2012 Wikidata Denny Vrandečić
14
15. English
07.02.2012 Wikidata Denny Vrandečić
15
21. Chinese
07.02.2012 Wikidata Denny Vrandečić
21
22. What Wikipedia knows
Wikipedia has articles about…
… all cities
… their populations
… their mayors
So can I ask for a list of the world’s ten largest
cities with a female mayor?
07.02.2012 Wikidata Denny Vrandečić
22
23. Let’s see what happens…
07.02.2012 Wikidata Denny Vrandečić
23
43. What humans see
Berlin
... has a population of 3,490,445
... is located inGermany
... has mayorKlaus Wowereit
... has an area of 892 km2
07.02.2012 Wikidata Denny Vrandečić
43
48. Berlin edit
From Wikidata
Capital of Germany edit
Main page
Also known as: City of Berlin edit |x
Contents
Access the API
Random page Continent Europe [3 sources]
Donate to Wikidata
Country Germany [2 sources]
Interaction
Help Population 3,490,445 [1 source]
About Wikidata
Community portal
3,500,000 [2 sources]
Recent changes
[other values]
Languages
Catalá
Cesky Calling code 030 [2 sources]
Dansk
Deutsch Mayor Klaus W| [0 sources]
Eesti Klaus Wowereit
Español Vehicle registration BGerman politician [1 source]
Esperanto Klaus Wunderlich
Français German musician
Area 891.85km” [2 sources]
Hrvatski Klaus Waldeck
Italiano Austrian musician and former lawyer
O’zbek Twin city Los Angeles [3 sources]
Klaus Wagner
Complete list German mathematician
[new fact] Klaus Wagner
Stalker of the British Royal Family
07.02.2012 Wikidata Denny Vrandečić
48
49. Berlin edit
From Wikidata
Hauptstadtvon Deutschland edit
Hauptseite
Auchbekanntals:Stadt Berlin edit |x
Inhalt
API
ZufälligeSeite Kontinent Europa [3 sources]
Spende an Wikidata
Land Deutschland [2 sources]
Interaktion
Hilfe Einwohner 3.490.445 [1 source]
ÜberWikidata
Benutzerportal
3.500.000 [2 sources]
LetzeÄnderungen
[weitereWerte]
Sprachen
Catalá
Cesky Telefonvorwahl 030 [2 sources]
Dansk
Eesti Bürgermeister Klaus Wowereit [2 sources]
English
Español AmtlichesKennzeichen B [1 source]
Esperanto
Français
Fläche 891,85 km” [2 sources]
Hrvatski
Italiano
O’zbek Parnerstadt Los Angeles [3 sources]
Complete list
[new fact]
07.02.2012 Wikidata Denny Vrandečić
49
50. Berlin Continent Europe.
Berlin Country Germany.
Berlin Population 3490445.
Berlin Calling_code 030.
Berlin Vehicle_registration B.
Berlin Mayor Klaus_Wowereit.
Berlin Twin_cityLos_Angeles.
07.02.2012 Wikidata Denny Vrandečić
50
51. Klaus
Wowereit
Mayor
Berlin
07.02.2012 Wikidata Denny Vrandečić
51
52. WikiData
Provide a database of the world’s knowledge that anyone can edit
Collect references and quotes for millions of data items
Engage a sustainable community that collects data from everywhere in
a machine-readable way
Increase the quality and lower the maintenance costs of Wikipedia and
related projects
Deliver software and community best practices enabling others to
engage in projects of data collection and provisioning
07.02.2012 Wikidata Denny Vrandečić
52
53. Extracts facts from Wikipedia infoboxes
Publishes them in RDF
Shows potential of machine-readable data
07.02.2012 Wikidata Denny Vrandečić
53
54. WikiData
Provide a database of the world’s knowledge that anyone can edit
Collect references and quotes for millions of data items
Engage a sustainable community that collects data from everywhere in
a machine-readable way
Increase the quality and lower the maintenance costs of Wikipedia and
related projects
Deliver software and community best practices enabling others to
engage in projects of data collection and provisioning
07.02.2012 Wikidata Denny Vrandečić
54
55. Secondary database
Sources for every fact
Reflect diversity
07.02.2012 Wikidata Denny Vrandečić
55
56. WikiData
Provide a database of the world’s knowledge that anyone can edit
Collect references and quotes for millions of data items
Engage a sustainable community that collects data from everywhere in
a machine-readable way
Increase the quality and lower the maintenance costs of Wikipedia and
related projects
Deliver software and community best practices enabling others to
engage in projects of data collection and provisioning
07.02.2012 Wikidata Denny Vrandečić
56
58. Phase 1: Language links
Current: every language links to every other
In Wikidata: create one page for each entity,
list representations in each language
In Wikipedias: pull language links from
Wikidata
07.02.2012 Wikidata Denny Vrandečić
58
59. Phase 2: Infobox augmentation
Current: each article calls an infobox with
values
In Wikidata: centralize the values
In Wikipedias: just call the infobox and
populate it with values from Wikidata
07.02.2012 Wikidata Denny Vrandečić
59
60. Phase 3: Inline queries
Enable inline queries in Wikipedias
With several formats
07.02.2012 Wikidata Denny Vrandečić
60
61. Open source project
400+ users
NASA, Europeana, Deut
sche Telekom, …
20+ languages
World-wide community
Commercial support
Many extensions
semantic-mediawiki.org
07.02.2012 Wikidata Denny Vrandečić
61
63. Conclusions
Editable, common resource for data
Enables much smaller contribution size
Freely reusable, machine-readable data
Able to answer question
Available in 280+ languages
07.02.2012 Wikidata Denny Vrandečić
63
64. Imagine a world
in which
every single person
is given free access
to the sum of
all human knowledge.
07.02.2012 Wikidata
64 Denny Vrandečić
64
65. Thank you!
http://meta.wikipedia.org/wiki/Wikidata_WMDE
Institut AIFB – Angewandte Informatik und Formale Beschreibungsverfahren
presenting work done by Markus Krötzsch, YaronKoren, Daniel Kinzler,
QamarnisoIsmoilova, Sergey Chernishev, Max Völkel, Heiko Haller,
Sebastian Blohm, Philipp Sorg, Peter Haase, Than Tran, Basil Ell, Daniel
Herzig, BenediktKämpgen, Elena Simperl, Delia Rusu, Marko Grobelnik,
Michael Cariaso, AmélieCordier, Jean Lieber, Emmanuel Nauer, Yannick
Toussaint, Pascal Molli, HalaSkaf-Molli, Joel Natividad, Daniel Hansch
and the Ontoprise team, and many others
KIT – University of the State of Baden-Württemberg and
National Large-scale Research Center of the Helmholtz Association www.kit.edu