Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web of Data
Benjamin Adrian, Jörn Hees, Ivan Herman, Michael Sintek, Andreas Dengel
Outline
  - Web of Documents vs. Web of Data
  - RDFa, the glue combining both worlds
  - Use Linked Data for RDFa generation
  - Extract RDF from web pages
  - Visualize RDFa via Linked Data
  - Evaluation and comparison with Open Calais
Web of Documents: the World Wide Web (WWW)
  Features:
  - Distributed textual content
  - Addressed by URLs
  - Layout in HTML, CSS
  - Connected with hyperlinks
  - Access via HTTP
  Made for human readers!
Web of Data: Linked Open Data
  Features:
  - Distributed data sets
  - Addressed by URIs
  - Format is RDF
  - Connected with RDF links
  - Access via HTTP
  Made for machine readers!
RDFa: Adding machine-readable data to web documents (RDF in HTML attributes)
  Plain HTML:
    <BODY>
      <H1>Epiphany</H1>
      <UL>
        <LI>Ben</LI>
        <LI>Jörn</LI>
        <LI>Ivan</LI>
        <LI>Michael</LI>
        <LI>Andreas</LI>
      </UL>
    </BODY>
RDFa: Adding machine-readable data to web documents (RDF in HTML attributes)
  HTML + RDFa:
    <BODY vocab="http://xmlns.com/foaf/0.1/"
          prefix="dc: http://purl.org/dc/terms/">
      <H1 about="#epiphany" property="dc:title">Epiphany</H1>
      <UL rev="member">
        <LI typeof="Person" property="name">Ben</LI>
        <LI typeof="Person" property="name">Jörn</LI>
        <LI typeof="Person" property="name">Ivan</LI>
        <LI typeof="Person" property="name">Michael</LI>
        <LI typeof="Person" property="name">Andreas</LI>
      </UL>
    </BODY>
RDFa: Adding machine-readable data to web documents (RDF in Turtle notation)
  RDF:
    @prefix rdf:  <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
    @prefix foaf: <http://xmlns.com/foaf/0.1/> .
    @prefix dc:   <http://purl.org/dc/terms/> .

    <#epiphany> dc:title "Epiphany" .
    [] foaf:name "Ben" ;     rdf:type foaf:Person ; foaf:member <#epiphany> .
    [] foaf:name "Jörn" ;    rdf:type foaf:Person ; foaf:member <#epiphany> .
    [] foaf:name "Ivan" ;    rdf:type foaf:Person ; foaf:member <#epiphany> .
    [] foaf:name "Michael" ; rdf:type foaf:Person ; foaf:member <#epiphany> .
    [] foaf:name "Andreas" ; rdf:type foaf:Person ; foaf:member <#epiphany> .
RDFa: A bridge from document to data
  Benefits:
  - RDFa is easy to generate with a CMS (e.g., Drupal) and other dynamic content providers.
  - It is easy to annotate well-structured data with RDFa.
  Open problems:
  - How to annotate unstructured plain-text content?
  - How to annotate the same documents differently based on different databases?
Epiphany: How to consume Linked Data for RDFa generation
  - Generate RDFa-annotated versions of web pages on the fly.
  - Generate different RDFa annotations depending on the Linked Data model used.
  - Create interactive boxes filled with additional information about annotated resources.
  [Diagram: HTML plus a Linked Data Model go into Epiphany RDFa Generation; HTML+RDFa comes out]
Epiphany - Example: How to consume Linked Data for RDFa generation
  Input:
    Tim Burton is a movie maker.
  Output:
    <span about="http://dbpedia.org/resource/Tim_Burton" property="foaf:name">Tim Burton</span> is a movie maker.
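The transformation on this slide can be sketched in a few lines of Python. This is only an illustration under simple assumptions, not Epiphany's implementation: the function name `annotate` is hypothetical, and the label and its URI are passed in by hand rather than looked up in a Linked Data model.

```python
from html import escape


def annotate(text, label, uri, prop="foaf:name"):
    """Wrap the first occurrence of `label` in `text` with an RDFa span.

    Hypothetical helper: label and URI are supplied by the caller here;
    in Epiphany they would come from the Linked Data model.
    """
    start = text.find(label)
    if start < 0:
        # No match: return the (escaped) text unchanged.
        return escape(text)
    end = start + len(label)
    span = '<span about="{}" property="{}">{}</span>'.format(
        escape(uri, quote=True), escape(prop, quote=True), escape(label)
    )
    return escape(text[:start]) + span + escape(text[end:])


print(annotate(
    "Tim Burton is a movie maker.",
    "Tim Burton",
    "http://dbpedia.org/resource/Tim_Burton",
))
```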
Epiphany: How to extract RDF from web pages
  Ontology-based Information Extraction
  [Architecture diagram; components: HTML, Preprocessing, Extraction Pipeline, Linked Data Model, RDF graph, RDF graph store, Cache, RDFa Generator, HTML+RDFa]
Ontology-based Information Extraction: RDF-based Information Extraction
  Ontology-based information extraction pipeline (each step with its output):
  1. Text Normalization
       "Ben is member of RDFa WG."
  2. Text Segmentation
       "Ben", "is", "member", "of", "RDFa WG", "."
  3. Symbol Recognition
       [] foaf:name "Ben" .
       [] foaf:name "RDFa WG" .
  4. Instance Recognition
       <#me>     foaf:name "Ben" .
       <#RDFaWG> foaf:name "RDFa WG" .
  5. Contextual Fact Recognition
       <#me> foaf:member <#RDFaWG> .
  6. RDF Generation
       <#me>     foaf:name   "Ben" .
       <#RDFaWG> foaf:name   "RDFa WG" .
       <#me>     foaf:member <#RDFaWG> .
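A toy version of these pipeline stages, assuming a tiny hand-written knowledge base, might look as follows. The dictionaries `LABELS` and `RELATIONS` and all function names are hypothetical stand-ins; the real pipeline draws labels and properties from a Linked Data model, and this sketch matches labels directly on the normalized text instead of running a separate segmentation step.

```python
import re

# Hypothetical knowledge base: literal label -> instance URI.
LABELS = {"Ben": "<#me>", "RDFa WG": "<#RDFaWG>"}
# Hypothetical mapping from connecting phrases to properties.
RELATIONS = {"is member of": "foaf:member"}


def normalize(text):
    """Text normalization: collapse whitespace."""
    return re.sub(r"\s+", " ", text).strip()


def recognize_symbols(text):
    """Symbol recognition: find known labels, longest labels first."""
    hits = []
    for label in sorted(LABELS, key=len, reverse=True):
        for m in re.finditer(r"\b" + re.escape(label) + r"\b", text):
            # Skip matches that overlap an already accepted hit.
            if not any(s < m.end() and m.start() < e for _, s, e in hits):
                hits.append((label, m.start(), m.end()))
    return sorted(hits, key=lambda h: h[1])


def recognize_instances(symbols):
    """Instance recognition: resolve each label to an instance URI."""
    return [(LABELS[label], start, end) for label, start, end in symbols]


def recognize_facts(text, instances):
    """Contextual fact recognition: relate neighbouring instances."""
    facts = []
    for (subj, _, s_end), (obj, o_start, _) in zip(instances, instances[1:]):
        between = text[s_end:o_start].strip()
        if between in RELATIONS:
            facts.append((subj, RELATIONS[between], obj))
    return facts


def extract(text):
    """RDF generation: return all recognized triples."""
    text = normalize(text)
    symbols = recognize_symbols(text)
    instances = recognize_instances(symbols)
    triples = [(LABELS[l], "foaf:name", '"%s"' % l) for l, _, _ in symbols]
    return triples + recognize_facts(text, instances)


for triple in extract("Ben is member of RDFa WG."):
    print(" ".join(triple), ".")
```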
RDFa Generation: How to create semantic annotations
  Steps:
  - Request the HTML page.
  - Tidy it to XHTML.
  - Traverse the DOM nodes, using the list of RDF triples with literal object values.
  - For each text node, create an html:SPAN element with RDFa attributes around matches.
  - Add a link to the RDF graph to the header.
  - Add a js:onclick listener to elements with RDFa about attributes.
  Example triples with literal object values:
    am:Burton foaf:name  "Tim Burton" .
    am:august rdfs:label "August" .
    am:autor  foaf:name  "Autor" .
    am:film   foaf:name  "Film" .
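The DOM traversal step can be sketched with Python's standard `xml.dom.minidom`. This is an assumption-laden illustration rather than Epiphany's code: it expects input that is already well-formed XHTML, the `LITERALS` table is a hypothetical stand-in for the list of triples with literal object values, and the header link and onclick listener are left out.

```python
from xml.dom import minidom

# Hypothetical lookup: literal value -> (subject URI, property).
LITERALS = {
    "Tim Burton": ("http://dbpedia.org/resource/Tim_Burton", "foaf:name"),
}


def annotate_text(doc, text_node, literals):
    """Wrap the earliest literal match in a text node with an RDFa span."""
    data = text_node.data
    hits = [(data.find(label), label) for label in literals if label in data]
    if not hits:
        return
    pos, label = min(hits)  # earliest match first
    uri, prop = literals[label]

    span = doc.createElement("span")
    span.setAttribute("about", uri)
    span.setAttribute("property", prop)
    span.appendChild(doc.createTextNode(label))

    before = doc.createTextNode(data[:pos])
    after = doc.createTextNode(data[pos + len(label):])
    parent = text_node.parentNode
    parent.insertBefore(before, text_node)
    parent.insertBefore(span, text_node)
    parent.replaceChild(after, text_node)
    # Continue with the remaining text after the match.
    annotate_text(doc, after, literals)


def walk(doc, node, literals):
    """Depth-first traversal over a snapshot of the child nodes."""
    for child in list(node.childNodes):
        if child.nodeType == child.TEXT_NODE:
            annotate_text(doc, child, literals)
        elif child.nodeType == child.ELEMENT_NODE:
            walk(doc, child, literals)


doc = minidom.parseString("<p>Tim Burton is a movie maker.</p>")
walk(doc, doc.documentElement, LITERALS)
print(doc.documentElement.toxml())
```

Iterating over `list(node.childNodes)` keeps the traversal from descending into spans it has just created.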
RDFa Visualization: Use RDF to generate Epiphanies
  [Diagram: Browser, Epiphany, Linked Data]
  - An onclick event on a <SPAN/> element triggers an AJAX call to the information provider.
  - The resource is requested as RDF:
      GET /resource/Tim_Burton HTTP/1.1
      Host: dbpedia.org
      Accept: application/rdf+xml
  - The RDF data is rendered in HTML as a lightbox.
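The content-negotiated request on this slide could be built as follows with Python's standard library. The function name `build_rdf_request` is hypothetical, and `application/rdf+xml` is assumed as the concrete RDF media type. The sketch only constructs the request; it does not send it.

```python
import urllib.request


def build_rdf_request(resource_uri, media_type="application/rdf+xml"):
    """Build (but do not send) a GET request for an RDF representation."""
    return urllib.request.Request(
        resource_uri,
        headers={"Accept": media_type},
        method="GET",
    )


req = build_rdf_request("http://dbpedia.org/resource/Tim_Burton")
print(req.get_method(), req.selector)
print("Host:", req.host)
print("Accept:", req.get_header("Accept"))
# To actually fetch the data: urllib.request.urlopen(req).read()
```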
Evaluation: Epiphany compared with Open Calais
  Goal: Epiphany is at least as good as Open Calais.
  But:
Evaluation: Epiphany compared with Open Calais
  Linked Data model: 12,462 pages plus RDF graphs from BBC Music Artists.
  Example:
    Page: http://www.bbc.co.uk/music/artists/0383dadf-2a4e-4d10-a46a-e9e041da8eb3
    RDF:  http://www.bbc.co.uk/music/artists/0383dadf-2a4e-4d10-a46a-e9e041da8eb3.rdf

    </music/artists/0383dadf-2a4e-4d10-a46a-e9e041da8eb3#artist>
        rdf:type  mo:MusicGroup ;
        rdf:type  mo:MusicArtist ;
        foaf:name "Queen" .
    …
Evaluation - BBC Corpus: Epiphany compared with Open Calais
  To compare the results generated by Open Calais and Epiphany, we had to align Open Calais' results to the BBC's vocabulary.

    BBC / Epiphany        Open Calais
    mo:SoloMusicArtist    oc:Person
    mo:MusicGroup         oc:MusicGroup
    foaf:name             oc:match, oc:name
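One simple way to apply such an alignment is a lookup table over triples. The mapping below mirrors the terms listed on the slide; treating the alignment as a plain dictionary lookup, and the names `CLASS_MAP`, `PROPERTY_MAP` and `align`, are assumptions made for illustration.

```python
# Term alignment as listed on the slide (Open Calais -> BBC vocabulary).
CLASS_MAP = {
    "oc:Person": "mo:SoloMusicArtist",
    "oc:MusicGroup": "mo:MusicGroup",
}
PROPERTY_MAP = {
    "oc:match": "foaf:name",
    "oc:name": "foaf:name",
}


def align(triples):
    """Rewrite Open Calais terms into the BBC vocabulary."""
    aligned = []
    for subj, pred, obj in triples:
        pred = PROPERTY_MAP.get(pred, pred)
        if pred == "rdf:type":
            # Only class names in object position are rewritten.
            obj = CLASS_MAP.get(obj, obj)
        aligned.append((subj, pred, obj))
    return aligned


# Hypothetical Open Calais output for one entity.
calais = [
    ("_:e1", "rdf:type", "oc:MusicGroup"),
    ("_:e1", "oc:name", '"Queen"'),
]
for triple in align(calais):
    print(" ".join(triple), ".")
```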
Evaluation: Named Entities
  How well did Open Calais and Epiphany retrieve instances with known foaf:name values?
  e.g., [] foaf:name "Queen" .
Evaluation: Solo Music Artists
  How well did they retrieve mo:SoloMusicArtist instances with known foaf:name values?
  e.g., [] foaf:name "Brian May" ;
           a mo:SoloMusicArtist .
Evaluation: Music Groups
  How well did they retrieve mo:MusicGroup instances with known foaf:name values?
  e.g., [] foaf:name "Queen" ;
           a mo:MusicGroup .
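The slides do not say which measures were used. A common way to score this kind of retrieval task is precision and recall over the set of known names, sketched below under that assumption. The example names are made up for illustration and are not results from the evaluation.

```python
def precision_recall(extracted, gold):
    """Compare extracted names with the known (gold) names.

    Precision: share of extracted names that are correct.
    Recall:    share of gold names that were found.
    """
    extracted, gold = set(extracted), set(gold)
    true_positives = len(extracted & gold)
    precision = true_positives / len(extracted) if extracted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical toy data, not taken from the BBC corpus results.
gold_names = {"Queen", "Brian May"}
found_names = {"Queen", "Freddie"}
print(precision_recall(found_names, gold_names))
```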

Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web of Data

  • 1. Epiphany: Adaptable RDFa Generation Linking the Web of Documents to the Web of Data. Benjamin Adrian, Jörn Hees, Ivan Herman, Michael Sintek, Andreas Dengel
  • 2. Outline: Web of Documents vs. Web of Data; RDFa, the glue combining both worlds; Use Linked Data for RDFa generation; Extract RDF from web pages; Visualize RDFa via Linked Data; Evaluation and comparison with Open Calais
  • 3. Web of Documents (World Wide Web, WWW). Features: distributed textual content; addressed by URLs; layout in HTML, CSS; connected with hyperlinks; access via HTTP. Made for human readers!
  • 4. Web of Data (Linked Open Data). Features: distributed data sets; addressed by URIs; format is RDF; connected with RDF links; access via HTTP. Made for machine readers!
  • 5. RDFa: Adding machine-readable data to web documents (RDF in HTML attributes). Plain HTML: <BODY> <H1>Epiphany</H1> <UL> <LI>Ben</LI> <LI>Jörn</LI> <LI>Ivan</LI> <LI>Michael</LI> <LI>Andreas</LI> </UL> </BODY>
  • 6. HTML + RDFa: <BODY vocab="http://xmlns.com/foaf/0.1/" prefix="dc=http://purl.org/dc/terms/"> <H1 about="#epiphany" property="dc:title">Epiphany</H1> <UL inv="member"> <LI typeof="Person" property="name">Ben</LI> <LI typeof="Person" property="name">Jörn</LI> <LI typeof="Person" property="name">Ivan</LI> <LI typeof="Person" property="name">Michael</LI> <LI typeof="Person" property="name">Andreas</LI> </UL> </BODY>
  • 7. RDF in Turtle notation: @prefix foaf: <http://xmlns.com/foaf/0.1/> . @prefix dc: <http://purl.org/dc/terms/> . <#epiphany> dc:title "Epiphany" . [] foaf:name "Ben" ; rdf:type foaf:Person ; foaf:member <#epiphany> . [] foaf:name "Jörn" ; rdf:type foaf:Person ; foaf:member <#epiphany> . [] foaf:name "Ivan" ; rdf:type foaf:Person ; foaf:member <#epiphany> . [] foaf:name "Michael" ; rdf:type foaf:Person ; foaf:member <#epiphany> . [] foaf:name "Andreas" ; rdf:type foaf:Person ; foaf:member <#epiphany> .
  • 8. A bridge from document to data (figure labels: RDFa, WWW)
  • 9. A bridge from document to data. RDFa benefits: RDFa is easy to generate by CMSs (e.g., Drupal) and other dynamic content providers. It is easy to annotate well-structured data with RDFa. Open problems: How to annotate unstructured plain-text content? How to annotate the same documents differently based on different databases?
  • 10. Epiphany: How to consume Linked Data for RDFa generation. Generate RDFa-annotated versions of web pages on the fly. Generate different RDFa annotations for different Linked Data models. Create interactive boxes filled with additional information about annotated resources. (Diagram labels: HTML, Linked Data Model, Epiphany RDFa Generation, HTML+RDFa)
  • 11. Epiphany example: How to consume Linked Data for RDFa generation. Plain text: Tim Burton is a movie maker. With RDFa: <span about="http://dbpedia.org/resource/Tim_Burton" property="foaf:name">Tim Burton</span> is a movie maker.
  • 12. Epiphany: How to extract RDF from web pages. (Architecture diagram labels: HTML, Preprocessing, Ontology-based Information Extraction, Extraction Pipeline, RDF graph, RDF graph store, Cache, Linked Data Model, RDFa Generator, HTML+RDFa)
  • 13. Ontology-based Information Extraction: the RDF-based information extraction pipeline, shown as step and result. Text Normalization: "Ben is member of RDFa WG." Text Segmentation: "Ben", "is", "member", "of", "RDFa WG", "." Symbol Recognition: [] foaf:name "Ben" . [] foaf:name "RDFa WG" . Instance Recognition: <#me> foaf:name "Ben" . <#RDFaWG> foaf:name "RDFa WG" . Contextual Fact Recognition: <#me> foaf:member <#RDFaWG> . RDF Generation: <#me> foaf:name "Ben" . <#RDFaWG> foaf:name "RDFa WG" . <#me> foaf:member <#RDFaWG> .
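The six pipeline steps on this slide can be illustrated with a toy sketch. It is not Epiphany's implementation: the dictionaries `INSTANCES` and `RELATIONS` and every function name are hypothetical stand-ins for a real Linked Data model, and plain dictionary lookup replaces whatever recognition techniques the actual system uses.

```python
import re

# Hypothetical stand-ins for a Linked Data model (not from the slides' system).
INSTANCES = {"Ben": "#me", "RDFa WG": "#RDFaWG"}   # label -> instance URI
RELATIONS = {"is member of": "foaf:member"}        # trigger phrase -> property


def normalize(text):
    """Text normalization: collapse runs of whitespace."""
    return re.sub(r"\s+", " ", text).strip()


def segment(text, labels):
    """Text segmentation: known multi-word labels stay one token."""
    known = "|".join(re.escape(l) for l in sorted(labels, key=len, reverse=True))
    return re.findall(rf"\b(?:{known})\b|\w+|[^\w\s]", text)


def recognize_symbols(tokens, labels):
    """Symbol recognition: keep tokens that match a known label."""
    return [t for t in tokens if t in labels]


def recognize_instances(symbols, instances):
    """Instance recognition: resolve each label to an instance URI."""
    return {s: instances[s] for s in symbols}


def recognize_facts(text, resolved, relations):
    """Contextual fact recognition: '<subject> <phrase> <object>' patterns."""
    facts = []
    for phrase, prop in relations.items():
        for s_label, s_uri in resolved.items():
            for o_label, o_uri in resolved.items():
                if f"{s_label} {phrase} {o_label}" in text:
                    facts.append((s_uri, prop, o_uri))
    return facts


def extract(text):
    """RDF generation: name triples plus recognized facts."""
    text = normalize(text)
    tokens = segment(text, INSTANCES)
    resolved = recognize_instances(recognize_symbols(tokens, INSTANCES), INSTANCES)
    triples = [(uri, "foaf:name", label) for label, uri in resolved.items()]
    return triples + recognize_facts(text, resolved, RELATIONS)


for triple in extract("Ben is member of RDFa WG."):
    print(triple)
```

Each function corresponds to one step name on the slide, so a step can be swapped out without touching the others.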
  • 14. RDFa Generation: How to create semantic annotations. Steps: request HTML; tidy to XHTML; DOM node traversal. Input: list of RDF triples with literal object values: am:Burton foaf:name "Tim Burton" . am:august rdfs:label "August" . am:autor foaf:name "Autor" . am:film foaf:name "Film" .
  • 15. RDFa Generation: How to create semantic annotations. Steps: request HTML; tidy to XHTML; DOM node traversal; for each text node, create an html:SPAN element with RDFa attributes around matches. Input: list of RDF triples with literal object values: am:Burton foaf:name "Tim Burton" . am:august rdfs:label "August" . am:autor foaf:name "Autor" . am:film foaf:name "Film" .
  • 16. RDFa Generation: How to create semantic annotations. Steps: request HTML; tidy to XHTML; DOM node traversal; for each text node, create an html:SPAN element with RDFa attributes around matches; add a link to the RDF graph to the header; add a js:onclick listener to elements with RDFa:about attributes. Input: list of RDF triples with literal object values: am:Burton foaf:name "Tim Burton" . am:august rdfs:label "August" . am:autor foaf:name "Autor" . am:film foaf:name "Film" .
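The span-wrapping step of the DOM traversal might look roughly like the sketch below. It uses Python's standard `xml.etree.ElementTree` on an already well-formed XHTML fragment; the function names are illustrative, the header link and onclick listener steps are left out, and nothing here is taken from Epiphany's code.

```python
import xml.etree.ElementTree as ET


def make_span(label, about, prop, tail):
    """Build <span about=... property=...>label</span> followed by `tail`."""
    span = ET.Element("span", {"about": about, "property": prop})
    span.text = label
    span.tail = tail
    return span


def annotate_element(el, label, about, prop):
    """Wrap every occurrence of `label` in the text nodes directly under `el`."""
    # Text before the first child lives in el.text.
    if el.text and label in el.text:
        before, _, after = el.text.partition(label)
        el.text = before
        el.insert(0, make_span(label, about, prop, after))
    # Text after each child lives in child.tail; inserted spans are
    # revisited by this loop, so repeated matches are wrapped too.
    i = 0
    while i < len(el):
        child = el[i]
        if child.tail and label in child.tail:
            before, _, after = child.tail.partition(label)
            child.tail = before
            el.insert(i + 1, make_span(label, about, prop, after))
        i += 1


def annotate_tree(root, triples):
    """For each (subject, property, literal) triple, annotate all text nodes."""
    for about, prop, literal in triples:
        # Snapshot the elements first so new spans are not traversed again.
        for el in list(root.iter()):
            annotate_element(el, literal, about, prop)


xhtml = "<p>Tim Burton is a movie maker. <b>Film</b> by Tim Burton.</p>"
root = ET.fromstring(xhtml)
annotate_tree(root, [("http://dbpedia.org/resource/Tim_Burton", "foaf:name", "Tim Burton")])
print(ET.tostring(root, encoding="unicode"))
```

Because only `text` and `tail` strings are split, the visible text of the page stays unchanged; the annotation adds markup only.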
  • 17. RDFa Visualization: Use RDF to generate Epiphanies. (Diagram labels: Browser, Epiphany, Linked Data.) An onclick event on <SPAN/> elements triggers an AJAX call to the information provider: GET /resource/Tim_Burton HTTP/1.1, Host: dbpedia.org, Accept: RDF. The RDF data is rendered in HTML as a lightbox.
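The request on this slide asks a Linked Data server for RDF through content negotiation. The sketch below builds the same request with Python's standard `urllib` without sending it. The slide only says "Accept: RDF"; the concrete media type `application/rdf+xml` and the function name are assumptions made for illustration.

```python
from urllib.request import Request


def build_linked_data_request(resource_uri, accept="application/rdf+xml"):
    """Prepare a GET request that asks for an RDF representation.

    The default media type is an assumption; the slide does not name one.
    """
    return Request(resource_uri, headers={"Accept": accept}, method="GET")


req = build_linked_data_request("http://dbpedia.org/resource/Tim_Burton")
# Show the request line and headers that would go over the wire.
print(req.get_method(), req.selector, "HTTP/1.1")
print("Host:", req.host)
print("Accept:", req.get_header("Accept"))
```

Passing the prepared request to `urllib.request.urlopen` would perform the actual lookup; the response body could then be parsed and rendered in the lightbox.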
  • 18. Evaluation: Compared Epiphany and Open Calais. Goal: Epiphany is at least as good as Open Calais. But:
  • 19. Evaluation: Compared Epiphany and Open Calais. Linked Data model: 12,462 pages + RDF graphs by BBC Music Artists, e.g. </music/artists/0383dadf-2a4e-4d10-a46a-e9e041da8eb3#artist> rdf:type mo:MusicGroup ; rdf:type mo:MusicArtist ; foaf:name "Queen" . … http://www.bbc.co.uk/music/artists/0383dadf-2a4e-4d10-a46a-e9e041da8eb3 http://www.bbc.co.uk/music/artists/0383dadf-2a4e-4d10-a46a-e9e041da8eb3.rdf
  • 20. Evaluation, BBC Corpus: Compared Epiphany and Open Calais. To compare the results generated by Open Calais and Epiphany, we had to align Open Calais' results to BBC's vocabulary. BBC / Epiphany: mo:SoloMusicArtist, mo:MusicGroup, foaf:name. Open Calais: oc:Person, oc:MusicGroup, oc:match, oc:name.
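A vocabulary alignment of this kind can be sketched as a simple term mapping applied to triples. The pairing in `ALIGNMENT` is an assumption inferred from the order in which the slide lists the terms (the slide does not spell out the pairs, and `oc:match` is left unmapped); the function name is hypothetical as well.

```python
# Assumed pairing of Open Calais terms with BBC / Music Ontology terms.
ALIGNMENT = {
    "oc:Person": "mo:SoloMusicArtist",
    "oc:MusicGroup": "mo:MusicGroup",
    "oc:name": "foaf:name",
}


def align(triples, mapping=ALIGNMENT):
    """Rewrite predicates, and the objects of rdf:type triples, via `mapping`."""
    aligned = []
    for s, p, o in triples:
        if p == "rdf:type":
            o = mapping.get(o, o)  # map class names only
        aligned.append((s, mapping.get(p, p), o))
    return aligned


calais_output = [
    ("_:b0", "rdf:type", "oc:Person"),
    ("_:b0", "oc:name", "Brian May"),
]
for triple in align(calais_output):
    print(triple)
```

Literal values are left untouched, so a name that happens to equal a vocabulary term is not rewritten by accident.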
  • 21. Evaluation: Named Entities. How did Open Calais and Epiphany retrieve any instances with known foaf:name values? E.g., [] foaf:name "Queen" .
  • 22. Evaluation: Solo Music Artists. mo:SoloMusicArtist with known foaf:name values? E.g., [] foaf:name "Brian May" ; a mo:SoloMusicArtist .
  • 23. Evaluation: Music Groups. mo:MusicGroup with known foaf:name values? E.g., [] foaf:name "Queen" ; a mo:MusicGroup .
  • 24. Discussion: Disambiguation problems
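The transcript does not preserve the result charts or state which measures were used. Assuming a standard precision and recall comparison against the known foaf:name values, the computation could be sketched as follows; the example sets are hypothetical and are not results from the talk.

```python
def precision_recall(extracted, gold):
    """Return (precision, recall) of extracted names against known names."""
    extracted, gold = set(extracted), set(gold)
    true_positives = len(extracted & gold)
    precision = true_positives / len(extracted) if extracted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    return precision, recall


# Hypothetical data for one page, for illustration only.
gold_names = {"Queen", "Brian May"}
extracted_names = {"Queen", "May"}
print(precision_recall(extracted_names, gold_names))
```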
  • 25. At a glance: Epiphany is an RDFa generator that enriches web pages with information from Linked Data models. Models listed: Personal Information Models (PIMO); BBC Music Artists (Music Ontology); Amazon (Good Relations). Demo version: http://projects.dfki.uni-kl.de/epiphany/
  • 26. Next steps: Increase precision by adding context analyses. Deploy Epiphany for DBpedia content on a scalable server. Integrate existing RDFa widgets and visualisations.
  • 27. Thank you for listening! Contributors: Jörn Hees (University of Kaiserslautern), Benjamin Adrian (DFKI), Andreas Dengel (DFKI), Michael Sintek (DFKI), Ivan Herman (W3C)