SlideShare ist ein Scribd-Unternehmen logo
1 von 32
Hacking with Semantic Web


                           Tom Praison
                   Developer @ Yahoo!
         http://twitter.com/tompraison
What’s in here?
• Evolution of the web
• Poorly Solved Information Needs
• Semantic Web Technologies
• Linked Data
• Demo of confhopper.in, a site built using open
  datasets
• Some techniques for getting Structured
  Information from Web.
• Demo of Yahoo! Contextual Analysis Platform and
  Open Dapper
I just had to take the hypertext
                   idea and connect it to the
                   Transmission Control Protocol
                   and domain name system ideas
                   and—ta-da!—the World Wide
                   Web.




Tim Berners Lee – Inventor of the WWW
WEB 1.0
Few Content Creators! Majority Consumers!




                         http://www.flickr.com/photos/leandrociuffo/3665883373/
WEB 2.0




          Web as a platform
          http://www.flickr.com/photos/lambertwm/4737580179/
WEB 1.0 vs WEB 2.0

       Ofoto                                  Flickr



  Personal Website                          Blogging



  Britannica Online                        Wikipedia



Directories(taxonomy)                 Tagging(“folksonomy”)



Content Management                            Wikis
      Systems
WEB 3.0




      Which direction will it take?

                          http://www.flickr.com/photos/markhillary/337685031
Semantic Web




Virtual Web        WEB 3.0               Pervasive Web



                Could be anything!



                                    Artificial
      Personalization
                                  Intelligence
Today’s Web




A Web of Documents rather than Data!
Poorly Solved Information Needs
• Multiple interpretations
   – Apple
• Long tail queries
   – Roja (I meant a south indian actress)
• Imprecise or overly precise searches
   – jim hendler
   – pictures of strong adventures people
• Searches for descriptions
   – countries in africa
   – 25 year old computer engineer living in Bangalore
   – Reliable smart phone under 15,000 rupees
THE SOLUTION




               Semantic Web
Publish data on the Web
• Linked Data: linking data similar to how we link
  documents on the Web
• Query databases over the Web
Architectural Challenges
• A common format for sharing data
• Sharing the meaning of data
• Infrastructure
Semantic Web standards from W3C
• Data and schema
  languages
  (RDF, OWL, RIF)
• Document formats
  (RDF/XML, RDFa)
• Protocols
  (SPARQL, HTTP)
Current Researches & Other Efforts
• Semantic Web research into knowledge
  representation and reasoning, data
  integration, data quality and many other
  topics
• Community effort (Linked Data movement)
RDF (Resource Description
                Framework)
• The basic data model of the Semantic Web
   – A universal model to capture all sorts of data:
     networks, relational, object-oriented…
• Basic unit of information is a triple
   – A tuple of (subject, predicate, object)
   – Example: (Joe, loves, Mary)
   – Each triple gives the value of a property for a given
     resource or relates two objects to one another
      • Object is either a resource or a literal
• An RDF model is a set of triples
   – Ordering of statements in an RDF document is irrelevant
     (unlike XML)
Graphical and textual notation
                              foaf:Person
                    type

     my:Joe
                       name

                              “Joe A.”



A number of ways to serialize an RDF model into an
                RDF document
          RDF/XML, Turtle, N3, N-Triples
RDF is designed for the Web
• URIs provide web-wide global identification across datasets
   – A resource may be described by multiple
     documents
   – URIs are intended to be reused
   – Unique, but not single identifiers: two URIs may
     denote the same thing
RDF is designed for the Web
• URIs can be retrieved from the Web
   – A well-behaved URI returns a description of the
     resource
   – Provides authority: the definition of foaf:Person
     lives at that URI
• Ontologies can be looked up as well
   – Typically at the root of the URIs, also known as the
     namespace
   – Example: http://xmlns.com/foaf/0.1/Person
     redirects to the specification
URIs implicitly link data together

                                        (#joe, #loves, #mary)

(#joe, #name, “Joe A.”)
(#joe, #email, mailto:joe@joe.com)          A social networking site
                                                                       (#mary, name, “Mary B.”)
   Joe’s homepage                                                      (#mary, gender, “female”)

                                                                           Mary’s homepage

                              (#name, #type, #Property)
                              (#name, #domain, #Person)

                      Schema doc
Put together, triples form a single
           ‘global’ graph
               #name          “Joe A.”
#joe
                    #email

                              “joe@joe.com”

               #loves

                             #name       “Mary B.”

            #mary
                                 #gender

                                         “female”
RDF Example
Linked Data cloud: interlinked RDF
          datasets on the Web
http://linkeddata.org/
DBPedia
• Dbpedia is dataset that contains much of the
  structured data in Wikipedia
  – Data from the info-boxes
  – Links between Wikipedia pages
  – Categories
  – Disambiguation and redirect pages
• Links to other datasets
Fetching individual resources
• Use your web browser
  • http://dbpedia.org/resource/Yahoo redirects to
    http://dbpedia.org/page/Yahoo
  • You can plug in this URI into other Linked Data browsers
• HTTP GET to fetch data
  – Using curl: add Accept: application/rdf+xml for RDF
    and enable redirect
      • curl -L -H 'Accept:application/rdf+xml'
        'http://dbpedia.org/resource/Berlin’
• Data dumps
  – http://wiki.dbpedia.org/Datasets
Querying using SPARQL
• Interactive query builders
     • SPARQL Explorer: http://dbpedia.org/snorql/
     • Examples at: http://wiki.dbpedia.org/OnlineAccess
• Using HTTP GET
  – GET /sparql/?query=EncodedQuery HTTP/1.1
  – Example:
     • SELECT ?film ?x WHERE {
       ?film <http://dbpedia.org/ontology/language>
       <http://dbpedia.org/resource/French_language> . ?film
       <http://www.w3.org/1999/02/22-rdf-syntax-ns#type>
       <http://dbpedia.org/ontology/Film>}
     • curl 'http://dbpedia.org/sparql?query=encodedQuery’
ConfHopper.in
• Award winning app in WWW2012 Metadata
  Challenge.
• Confhopper.in is a desktop / mobile HTML5 based
  application designed for conference attendees.
• Built with the help of open datasets from
  http://data.semanticweb.org/ and various other
  sources.
Some Techniques for getting
   Structured Information from Web
• Semantic Markup
• NER
• Extraction Tools (Dapper)
Semantic Markup
•   Microdata (Schema.org)
•   RDFa
•   Open Graph Protocol (ogp.me)
•   Example:
    http://getschema.org/microdataextractor?url
    =http://www.tompraison.com&out=json
NER – Named Entity Recognition
• Yahoo! Content Analysis API
• http://developer.yahoo.com/contentanalysis/
Dapper




http://open.dapper.net

Dapper is a tool that enables users to create update feeds for
their favorite sites and website owners to optimize and
distribute their content in new ways.
References
• http://www.slideshare.net/tompraison
• http://inkdroid.org/journal/2010/06/04/the-
  5-stars-of-open-linked-data/
• http://www.freebase.com/
• http://dbpedia.org/About

Weitere ähnliche Inhalte

Was ist angesagt?

Linked Data at ISAW: How and Why
Linked Data at ISAW: How and WhyLinked Data at ISAW: How and Why
Linked Data at ISAW: How and Whyparegorios
 
Linked Open Data and Digital Curation (Islandora)
Linked Open Data and Digital Curation (Islandora)Linked Open Data and Digital Curation (Islandora)
Linked Open Data and Digital Curation (Islandora)Hong (Jenny) Jing
 
Exploring the Semantic Web
Exploring the Semantic WebExploring the Semantic Web
Exploring the Semantic WebRoberto García
 
Metadata Training for Staff and Librarians for the New Data Environment
Metadata Training for Staff and Librarians for the New Data EnvironmentMetadata Training for Staff and Librarians for the New Data Environment
Metadata Training for Staff and Librarians for the New Data EnvironmentDiane Hillmann
 
Libraries and Linked Data: Looking to the Future (3)
Libraries and Linked Data: Looking to the Future (3)Libraries and Linked Data: Looking to the Future (3)
Libraries and Linked Data: Looking to the Future (3)ALATechSource
 
when the link makes sense
when the link makes sensewhen the link makes sense
when the link makes senseFabien Gandon
 
Linked Data Modeling for Beginner
Linked Data Modeling for BeginnerLinked Data Modeling for Beginner
Linked Data Modeling for BeginnerMyungjin Lee
 
Deep Web Presentation April 25
Deep Web Presentation April 25Deep Web Presentation April 25
Deep Web Presentation April 25nagold
 
TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22jodischneider
 
Linked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and MuseumsLinked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and Museumstrevorthornton
 
Forging New Links: Libraries in the Semantic Web
Forging New Links: Libraries in the Semantic WebForging New Links: Libraries in the Semantic Web
Forging New Links: Libraries in the Semantic WebGillian Byrne
 
The Buzz About BIBFRAME, by Angela Kroeger
The Buzz About BIBFRAME, by Angela KroegerThe Buzz About BIBFRAME, by Angela Kroeger
The Buzz About BIBFRAME, by Angela KroegerAngela Kroeger
 
Libraries and Linked Data: Looking to the Future (1)
Libraries and Linked Data: Looking to the Future (1)Libraries and Linked Data: Looking to the Future (1)
Libraries and Linked Data: Looking to the Future (1)ALATechSource
 
Semantic Social Web
Semantic Social WebSemantic Social Web
Semantic Social WebSabin Buraga
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsJon Voss
 
Libraries and Linked Data: Looking to the Future (2)
Libraries and Linked Data: Looking to the Future (2)Libraries and Linked Data: Looking to the Future (2)
Libraries and Linked Data: Looking to the Future (2)ALATechSource
 
Microformats I: What & Why
Microformats I: What & WhyMicroformats I: What & Why
Microformats I: What & WhyRachael L Moore
 
Semantic Web: A web that is not the Web
Semantic Web: A web that is not the WebSemantic Web: A web that is not the Web
Semantic Web: A web that is not the WebBruce Esrig
 

Was ist angesagt? (20)

Linked Data at ISAW: How and Why
Linked Data at ISAW: How and WhyLinked Data at ISAW: How and Why
Linked Data at ISAW: How and Why
 
Linked Open Data and Digital Curation (Islandora)
Linked Open Data and Digital Curation (Islandora)Linked Open Data and Digital Curation (Islandora)
Linked Open Data and Digital Curation (Islandora)
 
Exploring the Semantic Web
Exploring the Semantic WebExploring the Semantic Web
Exploring the Semantic Web
 
Metadata Training for Staff and Librarians for the New Data Environment
Metadata Training for Staff and Librarians for the New Data EnvironmentMetadata Training for Staff and Librarians for the New Data Environment
Metadata Training for Staff and Librarians for the New Data Environment
 
Libraries and Linked Data: Looking to the Future (3)
Libraries and Linked Data: Looking to the Future (3)Libraries and Linked Data: Looking to the Future (3)
Libraries and Linked Data: Looking to the Future (3)
 
when the link makes sense
when the link makes sensewhen the link makes sense
when the link makes sense
 
Linked Data Modeling for Beginner
Linked Data Modeling for BeginnerLinked Data Modeling for Beginner
Linked Data Modeling for Beginner
 
Deep Web Presentation April 25
Deep Web Presentation April 25Deep Web Presentation April 25
Deep Web Presentation April 25
 
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
NISO/DCMI Webinar: Schema.org and Linked Data: Complementary Approaches to Pu...
 
TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22TPDL2013 tutorial linked data for digital libraries 2013-10-22
TPDL2013 tutorial linked data for digital libraries 2013-10-22
 
Linked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and MuseumsLinked Open Data Fundamentals for Libraries, Archives and Museums
Linked Open Data Fundamentals for Libraries, Archives and Museums
 
Forging New Links: Libraries in the Semantic Web
Forging New Links: Libraries in the Semantic WebForging New Links: Libraries in the Semantic Web
Forging New Links: Libraries in the Semantic Web
 
The Buzz About BIBFRAME, by Angela Kroeger
The Buzz About BIBFRAME, by Angela KroegerThe Buzz About BIBFRAME, by Angela Kroeger
The Buzz About BIBFRAME, by Angela Kroeger
 
Libraries and Linked Data: Looking to the Future (1)
Libraries and Linked Data: Looking to the Future (1)Libraries and Linked Data: Looking to the Future (1)
Libraries and Linked Data: Looking to the Future (1)
 
Semantic Social Web
Semantic Social WebSemantic Social Web
Semantic Social Web
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
 
Libraries and Linked Data: Looking to the Future (2)
Libraries and Linked Data: Looking to the Future (2)Libraries and Linked Data: Looking to the Future (2)
Libraries and Linked Data: Looking to the Future (2)
 
Library Linked Data
Library Linked DataLibrary Linked Data
Library Linked Data
 
Microformats I: What & Why
Microformats I: What & WhyMicroformats I: What & Why
Microformats I: What & Why
 
Semantic Web: A web that is not the Web
Semantic Web: A web that is not the WebSemantic Web: A web that is not the Web
Semantic Web: A web that is not the Web
 

Ähnlich wie Hacking with Semantic Web

Publishing data on the Semantic Web
Publishing data on the Semantic WebPublishing data on the Semantic Web
Publishing data on the Semantic WebPeter Mika
 
Year of the Monkey: Lessons from the first year of SearchMonkey
Year of the Monkey: Lessons from the first year of SearchMonkeyYear of the Monkey: Lessons from the first year of SearchMonkey
Year of the Monkey: Lessons from the first year of SearchMonkeyPeter Mika
 
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...cmitch41
 
Hack U Barcelona 2011
Hack U Barcelona 2011Hack U Barcelona 2011
Hack U Barcelona 2011Peter Mika
 
What happened to the Semantic Web?
What happened to the Semantic Web?What happened to the Semantic Web?
What happened to the Semantic Web?Peter Mika
 
WTF is Semantic Web?
WTF is Semantic Web?WTF is Semantic Web?
WTF is Semantic Web?milesw
 
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...Cory Lampert
 
A review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic WebA review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic WebSimon Price
 
Semantic Web: introduction & overview
Semantic Web: introduction & overviewSemantic Web: introduction & overview
Semantic Web: introduction & overviewAmit Sheth
 
Deepak semantic web_iitd
Deepak semantic web_iitdDeepak semantic web_iitd
Deepak semantic web_iitdDeepak Shevani
 
Consuming Linked Data by Humans - WWW2010
Consuming Linked Data by Humans - WWW2010Consuming Linked Data by Humans - WWW2010
Consuming Linked Data by Humans - WWW2010Juan Sequeda
 
ISWC GoodRelations Tutorial Part 2
ISWC GoodRelations Tutorial Part 2ISWC GoodRelations Tutorial Part 2
ISWC GoodRelations Tutorial Part 2Martin Hepp
 
GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2guestecacad2
 
Publishing and Using Linked Open Data - Day 2
Publishing and Using Linked Open Data - Day 2Publishing and Using Linked Open Data - Day 2
Publishing and Using Linked Open Data - Day 2Richard Urban
 
One day workshop Linked Data and Semantic Web
One day workshop Linked Data and Semantic WebOne day workshop Linked Data and Semantic Web
One day workshop Linked Data and Semantic WebVictor de Boer
 
Introduction to Linked Data 1/5
Introduction to Linked Data 1/5Introduction to Linked Data 1/5
Introduction to Linked Data 1/5Juan Sequeda
 
Linked Data as an enabling framework for resource discovery across libraries,...
Linked Data as an enabling framework for resource discovery across libraries,...Linked Data as an enabling framework for resource discovery across libraries,...
Linked Data as an enabling framework for resource discovery across libraries,...Andy Powell
 

Ähnlich wie Hacking with Semantic Web (20)

Publishing data on the Semantic Web
Publishing data on the Semantic WebPublishing data on the Semantic Web
Publishing data on the Semantic Web
 
Year of the Monkey: Lessons from the first year of SearchMonkey
Year of the Monkey: Lessons from the first year of SearchMonkeyYear of the Monkey: Lessons from the first year of SearchMonkey
Year of the Monkey: Lessons from the first year of SearchMonkey
 
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
Semantic Web (IS 535 presentation) by ITRL students Deborah Ratliff and Maril...
 
Hack U Barcelona 2011
Hack U Barcelona 2011Hack U Barcelona 2011
Hack U Barcelona 2011
 
Webofdata
WebofdataWebofdata
Webofdata
 
What happened to the Semantic Web?
What happened to the Semantic Web?What happened to the Semantic Web?
What happened to the Semantic Web?
 
WTF is Semantic Web?
WTF is Semantic Web?WTF is Semantic Web?
WTF is Semantic Web?
 
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
Linked data demystified:Practical efforts to transform CONTENTDM metadata int...
 
A review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic WebA review of the state of the art in Machine Learning on the Semantic Web
A review of the state of the art in Machine Learning on the Semantic Web
 
Semantic Web: introduction & overview
Semantic Web: introduction & overviewSemantic Web: introduction & overview
Semantic Web: introduction & overview
 
Deepak semantic web_iitd
Deepak semantic web_iitdDeepak semantic web_iitd
Deepak semantic web_iitd
 
Consuming Linked Data by Humans - WWW2010
Consuming Linked Data by Humans - WWW2010Consuming Linked Data by Humans - WWW2010
Consuming Linked Data by Humans - WWW2010
 
ISWC GoodRelations Tutorial Part 2
ISWC GoodRelations Tutorial Part 2ISWC GoodRelations Tutorial Part 2
ISWC GoodRelations Tutorial Part 2
 
GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2GoodRelations Tutorial Part 2
GoodRelations Tutorial Part 2
 
Publishing and Using Linked Open Data - Day 2
Publishing and Using Linked Open Data - Day 2Publishing and Using Linked Open Data - Day 2
Publishing and Using Linked Open Data - Day 2
 
One day workshop Linked Data and Semantic Web
One day workshop Linked Data and Semantic WebOne day workshop Linked Data and Semantic Web
One day workshop Linked Data and Semantic Web
 
Introduction to Linked Data 1/5
Introduction to Linked Data 1/5Introduction to Linked Data 1/5
Introduction to Linked Data 1/5
 
Linked Data
Linked DataLinked Data
Linked Data
 
Linked Data as an enabling framework for resource discovery across libraries,...
Linked Data as an enabling framework for resource discovery across libraries,...Linked Data as an enabling framework for resource discovery across libraries,...
Linked Data as an enabling framework for resource discovery across libraries,...
 
Linked Data Basics
Linked Data BasicsLinked Data Basics
Linked Data Basics
 

Kürzlich hochgeladen

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfSeasiaInfotech2
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 

Kürzlich hochgeladen (20)

DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
The Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdfThe Future of Software Development - Devin AI Innovative Approach.pdf
The Future of Software Development - Devin AI Innovative Approach.pdf
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 

Hacking with Semantic Web

  • 1. Hacking with Semantic Web Tom Praison Developer @ Yahoo! http://twitter.com/tompraison
  • 2. What’s in here? • Evolution of the web • Poorly Solved Information Needs • Semantic Web Technologies • Linked Data • Demo of confhopper.in, a site built using open datasets • Some techniques for getting Structured Information from Web. • Demo of Yahoo! Contextual Analysis Platform and Open Dapper
  • 3. I just had to take the hypertext idea and connect it to the Transmission Control Protocol and domain name system ideas and—ta-da!—the World Wide Web. Tim Berners Lee – Inventor of the WWW
  • 4. WEB 1.0 Few Content Creators! Majority Consumers! http://www.flickr.com/photos/leandrociuffo/3665883373/
  • 5. WEB 2.0 Web as a platform http://www.flickr.com/photos/lambertwm/4737580179/
  • 6. WEB 1.0 vs WEB 2.0 Ofoto Flickr Personal Website Blogging Britannica Online Wikipedia Directories(taxonomy) Tagging(“folksonomy”) Content Management Wikis Systems
  • 7. WEB 3.0 Which direction will it take? http://www.flickr.com/photos/markhillary/337685031
  • 8. Semantic Web Virtual Web WEB 3.0 Pervasive Web Could be anything! Artificial Personalization Intelligence
  • 9. Today’s Web A Web of Documents rather than Data!
  • 10. Poorly Solved Information Needs • Multiple interpretations – Apple • Long tail queries – Roja (I meant a south indian actress) • Imprecise or overly precise searches – jim hendler – pictures of strong adventures people • Searches for descriptions – countries in africa – 25 year old computer engineer living in Bangalore – Reliable smart phone under 15,000 rupees
  • 11. THE SOLUTION Semantic Web
  • 12. Publish data on the Web • Linked Data: linking data similar to how we link documents on the Web • Query databases over the Web
  • 13. Architectural Challenges • A common format for sharing data • Sharing the meaning of data • Infrastructure
  • 14. Semantic Web standards from W3C • Data and schema languages (RDF, OWL, RIF) • Document formats (RDF/XML, RDFa) • Protocols (SPARQL, HTTP)
  • 15. Current Researches & Other Efforts • Semantic Web research into knowledge representation and reasoning, data integration, data quality and many other topics • Community effort (Linked Data movement)
  • 16. RDF (Resource Description Framework) • The basic data model of the Semantic Web – A universal model to capture all sorts of data: networks, relational, object-oriented… • Basic unit of information is a triple – A tuple of (subject, predicate, object) – Example: (Joe, loves, Mary) – Each triple gives the value of a property for a given resource or relates two objects to one another • Object is either a resource or a literal • An RDF model is a set of triples – Ordering of statements in an RDF document is irrelevant (unlike XML)
  • 17. Graphical and textual notation foaf:Person type my:Joe name “Joe A.” A number of ways to serialize an RDF model into an RDF document RDF/XML, Turtle, N3, N-Triples
  • 18. RDF is designed for the Web • URIs provide web-wide global identification across datasets – A resource may be described by multiple documents – URIs are intended to be reused – Unique, but not single identifiers: two URIs may denote the same thing
  • 19. RDF is designed for the Web • URIs can be retrieved from the Web – A well-behaved URI returns a description of the resource – Provides authority: the definition of foaf:Person lives at that URI • Ontologies can be looked up as well – Typically at the root of the URIs, also known as the namespace – Example: http://xmlns.com/foaf/0.1/Person redirects to the specification
  • 20. URIs implicitly link data together (#joe, #loves, #mary) (#joe, #name, “Joe A.”) (#joe, #email, mailto:joe@joe.com) A social networking site (#mary, name, “Mary B.”) Joe’s homepage (#mary, gender, “female”) Mary’s homepage (#name, #type, #Property) (#name, #domain, #Person) Schema doc
  • 21. Put together, triples form a single ‘global’ graph #name “Joe A.” #joe #email “joe@joe.com” #loves #name “Mary B.” #mary #gender “female”
  • 23. Linked Data cloud: interlinked RDF datasets on the Web http://linkeddata.org/
  • 24. DBPedia • Dbpedia is dataset that contains much of the structured data in Wikipedia – Data from the info-boxes – Links between Wikipedia pages – Categories – Disambiguation and redirect pages • Links to other datasets
  • 25. Fetching individual resources • Use your web browser • http://dbpedia.org/resource/Yahoo redirects to http://dbpedia.org/page/Yahoo • You can plug in this URI into other Linked Data browsers • HTTP GET to fetch data – Using curl: add Accept: application/rdf+xml for RDF and enable redirect • curl -L -H 'Accept:application/rdf+xml' 'http://dbpedia.org/resource/Berlin’ • Data dumps – http://wiki.dbpedia.org/Datasets
  • 26. Querying using SPARQL • Interactive query builders • SPARQL Explorer: http://dbpedia.org/snorql/ • Examples at: http://wiki.dbpedia.org/OnlineAccess • Using HTTP GET – GET /sparql/?query=EncodedQuery HTTP/1.1 – Example: • SELECT ?film ?x WHERE { ?film <http://dbpedia.org/ontology/language> <http://dbpedia.org/resource/French_language> . ?film <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://dbpedia.org/ontology/Film>} • curl 'http://dbpedia.org/sparql?query=encodedQuery’
  • 27. ConfHopper.in • Award winning app in WWW2012 Metadata Challenge. • Confhopper.in is a desktop / mobile HTML5 based application designed for conference attendees. • Built with the help of open datasets from http://data.semanticweb.org/ and various other sources.
  • 28. Some Techniques for getting Structured Information from Web • Semantic Markup • NER • Extraction Tools (Dapper)
  • 29. Semantic Markup • Microdata (Schema.org) • RDFa • Open Graph Protocol (ogp.me) • Example: http://getschema.org/microdataextractor?url =http://www.tompraison.com&out=json
  • 30. NER – Named Entity Recognition • Yahoo! Content Analysis API • http://developer.yahoo.com/contentanalysis/
  • 31. Dapper http://open.dapper.net Dapper is a tool that enables users to create update feeds for their favorite sites and website owners to optimize and distribute their content in new ways.
  • 32. References • http://www.slideshare.net/tompraison • http://inkdroid.org/journal/2010/06/04/the- 5-stars-of-open-linked-data/ • http://www.freebase.com/ • http://dbpedia.org/About