SlideShare ist ein Scribd-Unternehmen logo
1 von 36
The Semantic Web and
why Wikipedia should bother
          Jakob Voß




                       Wikimania 2007
            Taipei, Taiwan, 2007-08-03
Agenda

(1) The Semantic Web
(2) Wikipedia’s contribution
(3) Examples and problems
(4) Possible solutions
The Semantic Web

    Everything can be linked via its URI
●



    Every data in triples with typed links
●




                      Image taken from: Semantic Wikipedia (2006)
The Semantic Web

    Ontologies define
●


    common structures and rules
    More data is generated by aggregation
●


    and reasoning on distributed data from
    several sources
    Software agents understand your
●


    commands, aggregate, reason, decide
    and act independently (at least in theory)
Wikipedia’s contribution

    Largest source of freely available
●


    non-specialized data
    Templates and categories
●


    contain structured data
        Persondata
    –

        DBpedia.org
    –

        Geodata
    –

        ...
    –

    Semantic MediaWiki
●


    adds typed links and attributes
Aggregating and Reasoning
Aggregating and Reasoning

Which polish authors are currently
   most published in Germany?
Aggregating and Reasoning

    Which polish authors are currently
       most published in Germany?
    Currently published in Germany
●


        List of published books by book vendors
    –
        or by the German National Library
Aggregating and Reasoning

    Which polish authors are currently
       most published in Germany?
    Currently published in Germany
●



    Authors
●


        National Library catalouge contains author
    –
        and uniquely identifies author by PND-ID
Aggregating and Reasoning

    Which polish authors are currently
       most published in Germany?
    Currently published in Germany
●



    Authors
●



    Polish authors
●


        German Wikipedia contains PND => article
    –

        Article linked via Interwiki => more articles
    –

        Biographical articles contain place of birth
    –

        Place of birth linked to country via category
    –
Aggregating and Reasoning

subject       predicate       object
Publication   published-in   Germany
Publication   has-author      Person
Person          born-in         Town
Town            place-in      Poland
Where is Poland?
Where is Poland?


          Somewhere here
Where is Poland?


                        Somewhere here
Or five times here in
Maine, Ohia, or NY
Where is Poland?


                        Somewhere here
Or five times here in
Maine, Ohia, or NY




                               Or did you mean
                               Poland, Kiribati?
Poland around 1619




Polish-Lithuanian Commonwealth
Poland 1772...1793..1795
Poland 1945–
Where is Poland?

    Reality is complex, confusing, and fuzzy
●



    What’s the »default« Poland?
●



    Humans can look up context in Wikipedia
●



    Semantic Web only consists of statements
●
Example #2

    Presidents of the United States
    Bill Clinton       1993-01-20 – 2001-01-20
●
Example #2

    Presidents of the United States
    Bill Clinton       1993-01-20 – 2001-01-20
●



    George W. Bush     2001-01-20 – 2009-01-20
●
Example #2

    Presidents of the United States
    Bill Clinton       1993-01-20 – 2001-01-20
●



    George W. Bush     2001-01-20 – 2009-01-20
●



    Barack Obama       2009-01-20 –
●
Example #2

    Presidents of the United States
    Bill Clinton        1993-01-20 – 2001-01-20
●



    George W. Bush      2001-01-20 – 2009-01-20
●



    Barack Obama        2009-01-20 – 2013-01-20
●



    A. Schwarzenegger   2013-01-20 –
●
Presidents of the United States

    George W. Bush       2001-01-20 – 2002-06-29
●



    Dick Cheney               07:09 – 09:24 a.m.
●



    George W. Bush       2002-06-29 – 2007-07-21
●



    Dick Cheney               07:14 – 09:21 a.m.
●



    George W. Bush      2007–07-21 –
●




                  Twice president of the US
                     (see 25th amendment)
Presidents of the United States

    The devil is in the details ;-)
●



    Automatic reasoning will
●


    give you inconvenient results
Example #3

 Finally a clear division




                        女性
男性
                        XX
XY
So let’s formalize...

owl:disjointWith
”Classes may be stated to be disjoint from
 each other. For example, Man and Woman
 can be stated to be disjoint classes. [...] a
 reasoner can deduce that if A is an
 instance of Man, then A is not an instance
 of Woman.“
OWL Web Ontology Language Guide
 http://www.w3.org/TR/owl-guide/
A clear division?

     Other chromosal sexes (karotype)
    Turner syndrome (X_), Trisomy X...
●



    Klinefelter syndrome (XXY), XYY-Syndrome ...
●
A clear division?

     Other chromosal sexes (karotype)
    Turner syndrome (X_), Trisomy X...
●



    Klinefelter syndrome (XXY), XYY-Syndrome ...
●




       Intersexuality, Hermaproditism
    Chromosomal sex inconsistent with phenotypic
●


    sex or phenotype is not just male or female
A clear division?

     Other chromosal sexes (karotype)
    Turner syndrome (X_), Trisomy X...
●



    Klinefelter syndrome (XXY), XYY-Syndrome ...
●




       Intersexuality, Hermaproditism
    Chromosomal sex inconsistent with phenotypic
●


    sex or phenotype is not just male or female

                Gender identity
    Gender with which a person identifies
●


    independent from biological sex.
A clear division?

    Reality is far more complicated
●



    Many kinds of exceptions
●
Problems

    Clear divisions discriminate
●



    Discussion and context gets lost
●



    Example #4
●


     IF  your name = X
     AND X on a list of suspected terrorists
    THEN you have a problem
Not our problem?

    Ẁikipedia is already used as
●


    source by millions of people
    People can think, judge and ask,
●


    computers cannot
    We create definitions that will be used in
●


    thousands of applications
    Statistics lie
●


    Aggragation/resoning even lies better
Possible Solutions

    More of all (data, aggregation, reasoning)
●



    Less of all
●



    Statements about statements
●



    Fuzzy logic
●



    Data provenance / data lineage
●



    Allow exceptions
●



    Teach people to be careful
●



    Do not expect or believe simple answers
●



    It’s just dirty data
●
Summary

    Semantic Web is great
●



    Reality is based on exceptions
●



    Simplification is useful but dangerous
●



    Data POV != NPOV
●



    We also bear responsability for
●


    stupid use of Wikipedia data
    Never stop analyzing and thinking
●


    instead of relying on computers
More to read

    Shadbolt, Berners-Lee, and Hall: The Semantic Web
●


    Revisited. IEEE Intelligent Systems 21 (3) pp. 96-101.
    May/June 2006.
    http://eprints.ecs.soton.ac.uk/12614/01/Semantic_Web_Revisted.pdf

    Völkel, Krötzsch, Vrandecic, Haller, and Studer:
●


    Semantic Wikipedia. Proceedings of the WWW2006.
    http://www.aifb.uni-karlsruhe.de/Publikationen/showPublikation_english?publ_id=

    Doctorow: Metacrap: Putting the torch to seven straw-
●


    men of the meta-utopia. August 2001.
    http://www.well.com/~doctorow/metacrap.htm

    Geoffrey and Star: Sorting Things Out: Classification
●


    and Its Consequences. MIT Press, 1999.

Weitere ähnliche Inhalte

Ähnlich wie Jakob Voss Wikipedia2007

University of California, Berkeley: iSchool Nov, 2009
University of California, Berkeley: iSchool Nov, 2009University of California, Berkeley: iSchool Nov, 2009
University of California, Berkeley: iSchool Nov, 2009Tom Moritz
 
Describing Everything - Open Web standards and classification
Describing Everything - Open Web standards and classificationDescribing Everything - Open Web standards and classification
Describing Everything - Open Web standards and classificationDan Brickley
 
Semantic engagement
Semantic engagementSemantic engagement
Semantic engagementSTIinnsbruck
 
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...Cartegraph
 
Argumentative Essay Structure
Argumentative Essay StructureArgumentative Essay Structure
Argumentative Essay StructureVeronica Withers
 
I want to know more about compuerized text analysis
I want to know more about   compuerized text analysisI want to know more about   compuerized text analysis
I want to know more about compuerized text analysisLuke Czarnecki
 
Wikipedia and Civic Engagement
Wikipedia and Civic EngagementWikipedia and Civic Engagement
Wikipedia and Civic EngagementAndrew Lih
 
The Future of Social Networks on the Internet: The Need for Semantics
The Future of Social Networks on the Internet: The Need for SemanticsThe Future of Social Networks on the Internet: The Need for Semantics
The Future of Social Networks on the Internet: The Need for SemanticsJohn Breslin
 
Who's In Charge: ?: Text, cognition, socialization and the freedom of spirit
Who's In Charge: ?: Text, cognition, socialization and the freedom of spiritWho's In Charge: ?: Text, cognition, socialization and the freedom of spirit
Who's In Charge: ?: Text, cognition, socialization and the freedom of spiritDominik Lukes
 
The Potential of Web 3.0
The Potential of Web 3.0The Potential of Web 3.0
The Potential of Web 3.0Carsten Ullrich
 
Linked data for knowledge curation in humanities research
Linked data for knowledge curation in humanities researchLinked data for knowledge curation in humanities research
Linked data for knowledge curation in humanities researchEnrico Daga
 
The Persuasive Speech.ppt
The Persuasive Speech.pptThe Persuasive Speech.ppt
The Persuasive Speech.pptSupreethaS8
 
From Hyperlinks to Semantic Web Properties using Open Knowledge Extraction
From Hyperlinks to Semantic Web Properties using Open Knowledge ExtractionFrom Hyperlinks to Semantic Web Properties using Open Knowledge Extraction
From Hyperlinks to Semantic Web Properties using Open Knowledge ExtractionSTLab
 
Developmental Psychology Theoretical Approaches Essay
 Developmental Psychology Theoretical Approaches Essay Developmental Psychology Theoretical Approaches Essay
Developmental Psychology Theoretical Approaches EssayPatty Buckley
 
(Slideshare Version) 2. Emergence, Priming, And Understanding
(Slideshare Version) 2. Emergence, Priming, And Understanding(Slideshare Version) 2. Emergence, Priming, And Understanding
(Slideshare Version) 2. Emergence, Priming, And UnderstandingAlexandre Linhares
 
Content - Cory Doctorow
Content - Cory DoctorowContent - Cory Doctorow
Content - Cory DoctorowGeorge Grayson
 

Ähnlich wie Jakob Voss Wikipedia2007 (20)

University of California, Berkeley: iSchool Nov, 2009
University of California, Berkeley: iSchool Nov, 2009University of California, Berkeley: iSchool Nov, 2009
University of California, Berkeley: iSchool Nov, 2009
 
Describing Everything - Open Web standards and classification
Describing Everything - Open Web standards and classificationDescribing Everything - Open Web standards and classification
Describing Everything - Open Web standards and classification
 
Semantic engagement
Semantic engagementSemantic engagement
Semantic engagement
 
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...
Loras College 2014 Business Analytics Symposium | Andy Stevens: Big Data Anal...
 
Argumentative Essay Structure
Argumentative Essay StructureArgumentative Essay Structure
Argumentative Essay Structure
 
Infooverload
InfooverloadInfooverload
Infooverload
 
I want to know more about compuerized text analysis
I want to know more about   compuerized text analysisI want to know more about   compuerized text analysis
I want to know more about compuerized text analysis
 
Essay Writing Examples Uk
Essay Writing Examples UkEssay Writing Examples Uk
Essay Writing Examples Uk
 
Wikipedia and Civic Engagement
Wikipedia and Civic EngagementWikipedia and Civic Engagement
Wikipedia and Civic Engagement
 
Coincidences
CoincidencesCoincidences
Coincidences
 
The Future of Social Networks on the Internet: The Need for Semantics
The Future of Social Networks on the Internet: The Need for SemanticsThe Future of Social Networks on the Internet: The Need for Semantics
The Future of Social Networks on the Internet: The Need for Semantics
 
Who's In Charge: ?: Text, cognition, socialization and the freedom of spirit
Who's In Charge: ?: Text, cognition, socialization and the freedom of spiritWho's In Charge: ?: Text, cognition, socialization and the freedom of spirit
Who's In Charge: ?: Text, cognition, socialization and the freedom of spirit
 
The Potential of Web 3.0
The Potential of Web 3.0The Potential of Web 3.0
The Potential of Web 3.0
 
Linked data for knowledge curation in humanities research
Linked data for knowledge curation in humanities researchLinked data for knowledge curation in humanities research
Linked data for knowledge curation in humanities research
 
The Persuasive Speech.ppt
The Persuasive Speech.pptThe Persuasive Speech.ppt
The Persuasive Speech.ppt
 
From Hyperlinks to Semantic Web Properties using Open Knowledge Extraction
From Hyperlinks to Semantic Web Properties using Open Knowledge ExtractionFrom Hyperlinks to Semantic Web Properties using Open Knowledge Extraction
From Hyperlinks to Semantic Web Properties using Open Knowledge Extraction
 
Developmental Psychology Theoretical Approaches Essay
 Developmental Psychology Theoretical Approaches Essay Developmental Psychology Theoretical Approaches Essay
Developmental Psychology Theoretical Approaches Essay
 
Class 4
Class 4Class 4
Class 4
 
(Slideshare Version) 2. Emergence, Priming, And Understanding
(Slideshare Version) 2. Emergence, Priming, And Understanding(Slideshare Version) 2. Emergence, Priming, And Understanding
(Slideshare Version) 2. Emergence, Priming, And Understanding
 
Content - Cory Doctorow
Content - Cory DoctorowContent - Cory Doctorow
Content - Cory Doctorow
 

Mehr von Bertalan Mesko, MD

Medical Social Media Guide to Webicina.com
Medical Social Media Guide to Webicina.comMedical Social Media Guide to Webicina.com
Medical Social Media Guide to Webicina.comBertalan Mesko, MD
 
Webicina Open access social media guidelines for pharma
Webicina Open access social media guidelines for pharma Webicina Open access social media guidelines for pharma
Webicina Open access social media guidelines for pharma Bertalan Mesko, MD
 
Medicine in Second Life, the virtual world
Medicine in Second Life, the virtual worldMedicine in Second Life, the virtual world
Medicine in Second Life, the virtual worldBertalan Mesko, MD
 
Practicing Medicine in the Web 2.0 Era
Practicing Medicine in the Web 2.0 EraPracticing Medicine in the Web 2.0 Era
Practicing Medicine in the Web 2.0 EraBertalan Mesko, MD
 
Jason Young: Improving Communication With Cognitively Impaired Patients
Jason Young: Improving Communication With Cognitively Impaired PatientsJason Young: Improving Communication With Cognitively Impaired Patients
Jason Young: Improving Communication With Cognitively Impaired PatientsBertalan Mesko, MD
 
Medical education and building an online reputation in the world of web 2.0
Medical education and building an online reputation in the world of web 2.0Medical education and building an online reputation in the world of web 2.0
Medical education and building an online reputation in the world of web 2.0Bertalan Mesko, MD
 
Medicine 2.0 with the eye of a medical student blogger
Medicine 2.0 with the eye of a medical student bloggerMedicine 2.0 with the eye of a medical student blogger
Medicine 2.0 with the eye of a medical student bloggerBertalan Mesko, MD
 
The impact of web 2.0 on medicine and healthcare
The impact of web 2.0 on medicine and healthcareThe impact of web 2.0 on medicine and healthcare
The impact of web 2.0 on medicine and healthcareBertalan Mesko, MD
 

Mehr von Bertalan Mesko, MD (9)

Medical Social Media Guide to Webicina.com
Medical Social Media Guide to Webicina.comMedical Social Media Guide to Webicina.com
Medical Social Media Guide to Webicina.com
 
Webicina Open access social media guidelines for pharma
Webicina Open access social media guidelines for pharma Webicina Open access social media guidelines for pharma
Webicina Open access social media guidelines for pharma
 
Medicine in Second Life, the virtual world
Medicine in Second Life, the virtual worldMedicine in Second Life, the virtual world
Medicine in Second Life, the virtual world
 
Practicing Medicine in the Web 2.0 Era
Practicing Medicine in the Web 2.0 EraPracticing Medicine in the Web 2.0 Era
Practicing Medicine in the Web 2.0 Era
 
Jason Young: Improving Communication With Cognitively Impaired Patients
Jason Young: Improving Communication With Cognitively Impaired PatientsJason Young: Improving Communication With Cognitively Impaired Patients
Jason Young: Improving Communication With Cognitively Impaired Patients
 
Medical education and building an online reputation in the world of web 2.0
Medical education and building an online reputation in the world of web 2.0Medical education and building an online reputation in the world of web 2.0
Medical education and building an online reputation in the world of web 2.0
 
Medicine 2.0 with the eye of a medical student blogger
Medicine 2.0 with the eye of a medical student bloggerMedicine 2.0 with the eye of a medical student blogger
Medicine 2.0 with the eye of a medical student blogger
 
The impact of web 2.0 on medicine and healthcare
The impact of web 2.0 on medicine and healthcareThe impact of web 2.0 on medicine and healthcare
The impact of web 2.0 on medicine and healthcare
 
Medicine 2.0
Medicine 2.0Medicine 2.0
Medicine 2.0
 

Kürzlich hochgeladen

Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 

Kürzlich hochgeladen (20)

Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 

Jakob Voss Wikipedia2007

  • 1. The Semantic Web and why Wikipedia should bother Jakob Voß Wikimania 2007 Taipei, Taiwan, 2007-08-03
  • 2. Agenda (1) The Semantic Web (2) Wikipedia’s contribution (3) Examples and problems (4) Possible solutions
  • 3. The Semantic Web Everything can be linked via its URI ● Every data in triples with typed links ● Image taken from: Semantic Wikipedia (2006)
  • 4. The Semantic Web Ontologies define ● common structures and rules More data is generated by aggregation ● and reasoning on distributed data from several sources Software agents understand your ● commands, aggregate, reason, decide and act independently (at least in theory)
  • 5. Wikipedia’s contribution Largest source of freely available ● non-specialized data Templates and categories ● contain structured data Persondata – DBpedia.org – Geodata – ... – Semantic MediaWiki ● adds typed links and attributes
  • 7. Aggregating and Reasoning Which polish authors are currently most published in Germany?
  • 8. Aggregating and Reasoning Which polish authors are currently most published in Germany? Currently published in Germany ● List of published books by book vendors – or by the German National Library
  • 9. Aggregating and Reasoning Which polish authors are currently most published in Germany? Currently published in Germany ● Authors ● National Library catalouge contains author – and uniquely identifies author by PND-ID
  • 10. Aggregating and Reasoning Which polish authors are currently most published in Germany? Currently published in Germany ● Authors ● Polish authors ● German Wikipedia contains PND => article – Article linked via Interwiki => more articles – Biographical articles contain place of birth – Place of birth linked to country via category –
  • 11. Aggregating and Reasoning subject predicate object Publication published-in Germany Publication has-author Person Person born-in Town Town place-in Poland
  • 13. Where is Poland? Somewhere here
  • 14. Where is Poland? Somewhere here Or five times here in Maine, Ohia, or NY
  • 15. Where is Poland? Somewhere here Or five times here in Maine, Ohia, or NY Or did you mean Poland, Kiribati?
  • 19. Where is Poland? Reality is complex, confusing, and fuzzy ● What’s the »default« Poland? ● Humans can look up context in Wikipedia ● Semantic Web only consists of statements ●
  • 20. Example #2 Presidents of the United States Bill Clinton 1993-01-20 – 2001-01-20 ●
  • 21. Example #2 Presidents of the United States Bill Clinton 1993-01-20 – 2001-01-20 ● George W. Bush 2001-01-20 – 2009-01-20 ●
  • 22. Example #2 Presidents of the United States Bill Clinton 1993-01-20 – 2001-01-20 ● George W. Bush 2001-01-20 – 2009-01-20 ● Barack Obama 2009-01-20 – ●
  • 23. Example #2 Presidents of the United States Bill Clinton 1993-01-20 – 2001-01-20 ● George W. Bush 2001-01-20 – 2009-01-20 ● Barack Obama 2009-01-20 – 2013-01-20 ● A. Schwarzenegger 2013-01-20 – ●
  • 24. Presidents of the United States George W. Bush 2001-01-20 – 2002-06-29 ● Dick Cheney 07:09 – 09:24 a.m. ● George W. Bush 2002-06-29 – 2007-07-21 ● Dick Cheney 07:14 – 09:21 a.m. ● George W. Bush 2007–07-21 – ● Twice president of the US (see 25th amendment)
  • 25. Presidents of the United States The devil is in the details ;-) ● Automatic reasoning will ● give you inconvenient results
  • 26. Example #3 Finally a clear division 女性 男性 XX XY
  • 27. So let’s formalize... owl:disjointWith ”Classes may be stated to be disjoint from each other. For example, Man and Woman can be stated to be disjoint classes. [...] a reasoner can deduce that if A is an instance of Man, then A is not an instance of Woman.“ OWL Web Ontology Language Guide http://www.w3.org/TR/owl-guide/
  • 28. A clear division? Other chromosal sexes (karotype) Turner syndrome (X_), Trisomy X... ● Klinefelter syndrome (XXY), XYY-Syndrome ... ●
  • 29. A clear division? Other chromosal sexes (karotype) Turner syndrome (X_), Trisomy X... ● Klinefelter syndrome (XXY), XYY-Syndrome ... ● Intersexuality, Hermaproditism Chromosomal sex inconsistent with phenotypic ● sex or phenotype is not just male or female
  • 30. A clear division? Other chromosal sexes (karotype) Turner syndrome (X_), Trisomy X... ● Klinefelter syndrome (XXY), XYY-Syndrome ... ● Intersexuality, Hermaproditism Chromosomal sex inconsistent with phenotypic ● sex or phenotype is not just male or female Gender identity Gender with which a person identifies ● independent from biological sex.
  • 31. A clear division? Reality is far more complicated ● Many kinds of exceptions ●
  • 32. Problems Clear divisions discriminate ● Discussion and context gets lost ● Example #4 ● IF your name = X AND X on a list of suspected terrorists THEN you have a problem
  • 33. Not our problem? Ẁikipedia is already used as ● source by millions of people People can think, judge and ask, ● computers cannot We create definitions that will be used in ● thousands of applications Statistics lie ● Aggragation/resoning even lies better
  • 34. Possible Solutions More of all (data, aggregation, reasoning) ● Less of all ● Statements about statements ● Fuzzy logic ● Data provenance / data lineage ● Allow exceptions ● Teach people to be careful ● Do not expect or believe simple answers ● It’s just dirty data ●
  • 35. Summary Semantic Web is great ● Reality is based on exceptions ● Simplification is useful but dangerous ● Data POV != NPOV ● We also bear responsability for ● stupid use of Wikipedia data Never stop analyzing and thinking ● instead of relying on computers
  • 36. More to read Shadbolt, Berners-Lee, and Hall: The Semantic Web ● Revisited. IEEE Intelligent Systems 21 (3) pp. 96-101. May/June 2006. http://eprints.ecs.soton.ac.uk/12614/01/Semantic_Web_Revisted.pdf Völkel, Krötzsch, Vrandecic, Haller, and Studer: ● Semantic Wikipedia. Proceedings of the WWW2006. http://www.aifb.uni-karlsruhe.de/Publikationen/showPublikation_english?publ_id= Doctorow: Metacrap: Putting the torch to seven straw- ● men of the meta-utopia. August 2001. http://www.well.com/~doctorow/metacrap.htm Geoffrey and Star: Sorting Things Out: Classification ● and Its Consequences. MIT Press, 1999.