Linkator: enriching web pages by automatically adding dereferenceable semantic annotations
Samur Araujo, Geert-Jan Houben, Daniel Schwabe
Web Information Systems, Delft University of Technology, the Netherlands
Summary – dereferencing semantic annotations: What is dereferencing semantic annotations about? Automatically linking web pages. The talk covers an overview of the problem and motivation, our approach for solving the problem, and one example of use.
Motivation: Links between HTML pages are the main mechanism for navigating the web. However, many pages are unlinked or poorly linked. Terms on pages have meaning and are intrinsically associated with concepts or entities that the user is interested in. These terms can be interpreted by machines and automatically linked to relevant resources on the web.
Problem Statement: The problem of automatic linking can be divided into 3 sub-problems:
1. How to identify candidate terms (anchors) for adding links? Candidate terms denote concepts in which the user is interested.
2. Which concept does a candidate term represent? This requires disambiguating the candidate term.
3. How to identify a web resource to be the link target? How to select a source of data for finding the destination of the link?
State-of-the-Art in Automatic Linking: Candidate terms: existing work focuses on term disambiguation using an auxiliary knowledge base or dictionaries (e.g. Wikipedia and WordNet). Link target: it is selected from a specific knowledge base [1] or from a collection of target documents [2]. Limitation: these approaches do not serve users interested in a broader range of domains well.
[1] Mihalcea, R. and Csomai, A. Wikify!: linking documents to encyclopedic knowledge. In Proceedings of the 16th ACM Conference on Information and Knowledge Management (CIKM 07), Lisbon, Portugal, pp. 233-242, 2007.
[2] Gardner, J. J., Krowne, A. and Xiong, L. NNexus: An Automatic Linker for Collaborative Web-Based Corpora. IEEE Trans. Knowl. Data Eng. 21(6), pp. 829-839, 2009.
Linkator Approach: Linkator (1) extracts terms from web pages, (2) associates terms with concepts, and (3) finds resources that represent these concepts. Diagram components: Core Linkator, Information Extraction Engine, Semantic Annotator.
[Flow diagram with two entry points. Page accessed: page is accessed; terms are extracted; page is semantically annotated; semantic links are created. Link clicked: annotation is extracted from the annotated page; endpoint is chosen; query is formulated; search for a resource, with an "if not found" branch.]
Linkator Approach [architecture diagram; labels: Web Browser; Linkator Client (Firefox plugin); Annotator; RDFa Annotator; Information Extraction Engine; HTTP; Linkator Server; Linked Data; Endpoint Resolution; SPARQL Query Formulation]
Semantic Link – Definition: A semantic link is an HTML anchor element (tag A) that is semantically annotated with RDFa, so that RDF triples are associated with it. Clicking a semantic link triggers a query over Linked Data.
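To make the definition concrete, here is a minimal sketch of what such an annotated anchor might look like when generated. The function name, the example subject URI, the `foaf:name` property and the placeholder `href="#"` are assumptions for illustration; the slides do not show the exact markup Linkator emits.

```python
from html import escape


def render_semantic_link(label, subject_uri, type_curie, property_curie, prefixes):
    """Render an <a> element carrying RDFa attributes (hypothetical helper)."""
    # RDFa 1.0 style namespace declarations, one per prefix.
    namespaces = " ".join(
        f'xmlns:{prefix}="{escape(uri)}"' for prefix, uri in prefixes.items()
    )
    # about/typeof/property yield two triples: an rdf:type and a literal value.
    # href="#" is a placeholder: the destination is only resolved on click.
    return (
        f'<a {namespaces} about="{escape(subject_uri)}" '
        f'typeof="{escape(type_curie)}" property="{escape(property_curie)}" '
        f'href="#">{escape(label)}</a>'
    )


link = render_semantic_link(
    label="Geert-Jan Houben",
    subject_uri="http://example.org/page#author1",  # assumed example URI
    type_curie="swrc:Author",
    property_curie="foaf:name",  # assumed property
    prefixes={
        "swrc": "http://ontoware.org/swrc/swrc_v0.3.owl#",
        "foaf": "http://xmlns.com/foaf/0.1/",
    },
)
print(link)
```

An RDFa parser reading this anchor would associate the type `swrc:Author` and the name literal with the subject, which is the information the later resolution steps work from.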
[Figure: semantic links and the RDF triples associated with them]
Dereferencing Semantic Links: Linkator uses the Linked Data cloud to discover a destination for the semantic link, as opposed to querying search engines or a fixed knowledge base. It relies on two algorithms: one for endpoint resolution and one for query formulation.
Endpoint Resolution: Linkator selects available endpoints based on the vocabularies used in the semantic links, using voiD (Vocabulary of Interlinked Datasets).
Endpoint Resolution: Select the vocabularies of all RDF types associated with the annotation. If no endpoint is found for them, select the vocabularies of all predicates associated with the annotation instead.
Endpoint Resolution: The SelectEndpoint function finds the resource <http://ontoware.org/swrc/swrc_v0.3.owl#Author> and extracts the vocabulary associated with it, <http://ontoware.org/swrc/swrc_v0.3.owl>. It then queries the voiD descriptors of the available SPARQL endpoints, looking for that vocabulary.
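The two steps on this slide can be sketched as follows. This is an illustration under assumptions, not Linkator's code: the function names are hypothetical, the handling of slash-terminated namespaces is a guess (the slide only shows a hash URI), and the query assumes the endpoints are described with the standard voiD properties `void:vocabulary` and `void:sparqlEndpoint`.

```python
from urllib.parse import urldefrag


def extract_vocabulary(resource_uri):
    """Return the vocabulary (namespace) part of a resource URI."""
    base, fragment = urldefrag(resource_uri)
    if fragment:
        # Hash namespace, as in the slide: drop everything after '#'.
        return base
    # Assumption: for slash namespaces, drop the last path segment.
    return resource_uri.rsplit("/", 1)[0] + "/"


def void_lookup_query(vocabulary_uri):
    """Build a SPARQL query over voiD descriptions for a given vocabulary."""
    return (
        "PREFIX void: <http://rdfs.org/ns/void#>\n"
        "SELECT DISTINCT ?endpoint WHERE {\n"
        "  ?dataset a void:Dataset ;\n"
        f"           void:vocabulary <{vocabulary_uri}> ;\n"
        "           void:sparqlEndpoint ?endpoint .\n"
        "}"
    )


vocabulary = extract_vocabulary("http://ontoware.org/swrc/swrc_v0.3.owl#Author")
print(vocabulary)
print(void_lookup_query(vocabulary))
```

Running the query against a store of voiD descriptions would return the SPARQL endpoints of datasets that declare the vocabulary.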
Query Formulation: The query is based on the object of the triple. It tries to find a human-readable representation of the resource, i.e., it tries to match predicates such as foaf:homepage, akt:has-web-address, and rdfs:seeAlso.
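One possible reading of this step, sketched as a query builder. The shape of the query is an assumption: it looks for any resource that has the triple's object as a literal value and asks for a target through one of the three predicates named on the slide. The `akt:` namespace URI, the function names, and the use of the SPARQL 1.1 `VALUES` clause are also assumptions.

```python
# Predicates treated as pointing to a human-readable page (from the slide).
READABLE_PREDICATES = (
    "http://xmlns.com/foaf/0.1/homepage",
    "http://www.aktors.org/ontology/portal#has-web-address",  # assumed akt: namespace
    "http://www.w3.org/2000/01/rdf-schema#seeAlso",
)


def escape_literal(value):
    """Escape backslashes and double quotes for a SPARQL string literal."""
    return value.replace("\\", "\\\\").replace('"', '\\"')


def formulate_query(object_literal, limit=1):
    """Build a SPARQL query that looks up a readable target for a literal."""
    values = " ".join(f"<{predicate}>" for predicate in READABLE_PREDICATES)
    return (
        "SELECT ?target WHERE {\n"
        f'  ?resource ?anyPredicate "{escape_literal(object_literal)}" .\n'
        f"  VALUES ?linkPredicate {{ {values} }}\n"
        "  ?resource ?linkPredicate ?target .\n"
        "}\n"
        f"LIMIT {limit}"
    )


print(formulate_query("Geert-Jan Houben"))
```

The first binding of `?target`, if any, would then serve as the destination of the semantic link.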
Proof of Concept: Semantic links for pages that contain bibliographic citations, using an extended version of the FreeCite parsing engine. Example of a bibliographic citation: Kees van der Sluijs, Geert-Jan Houben, Erwin Leonardi, Jan Hidders. Hera: Engineering Web Applications Using Semantic Web-Based Models. Book chapter: Semantic Web Information Management: A Model-Based Perspective, De Virgilio, Roberto; Giunchiglia, Fausto; Tanca, Letizia (Eds.), Chapter 22, 2010, Springer.
Linkator: extract terms from web pages, associate terms with concepts, find resources that represent these concepts. [Pipeline diagram; labels: Core Linkator; Information Extraction Engine (FreeCite Extraction Engine); Semantic Annotator; HTML page; markup removed; plain text; entity extraction; text semantically annotated; semantic annotation; insert annotations on the page; HTML page semantically annotated; semantic link clicked; SPARQL endpoint discovery and selection; endpoint querying; URL generation]
Example – HTML Page without Links
Example – Page annotated with RDFa
Example – Page with Semantic Links
Conclusion and Future Work: For the specific scenario of linking bibliographic citations, Linkator provides a reasonable solution. Combining Semantic Web technologies can provide a reasonable solution to the problem of automatic linking, and Linkator is a concrete application that uses these technologies. Future work: use Linkator in a broader scenario; enhance the Linkator algorithms; evaluate the precision and recall of the linking.
Questions? Thank you for your attention! Samur Araujo (s.f.cardosodearaujo@tudelft.nl). You can download Linkator at: http://www.wis.ewi.tudelft.nl/
Annotations on the page are used to find the link destination. [Diagram; labels: HTML page; page is annotated; annotated HTML page; link is clicked; RDF]
State-of-the-Art in Automatic Linking, examples: Wikify! [1] focuses on linking keywords on web pages to Wikipedia articles. NNexus [2] focuses on linking keywords obtained from an index extracted from target documents.
Endpoint Resolution

FUNCTION SelectEndpoint
    E := Array
    R := select all rdf:type objects associated to the semantic link
    T := ExtractVocabulary(R)
    FOR EACH vocabulary in T DO
    {
        E.add (select endpoints that contain this vocabulary)
    }
    IF E = Empty
    {
        R := select all predicates associated to the semantic link
        T := ExtractVocabulary(R)
        FOR EACH vocabulary in T DO
        {
            E.add (select endpoints that contain this vocabulary)
        }
    }
    RETURN E

FUNCTION ExtractVocabulary(R)
    V := Array
    FOR EACH resource in R DO
    {
        V.add (extract the vocabulary from the resource)
    }
    RETURN V
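The SelectEndpoint pseudocode above could be written in Python roughly as follows. Several things here are assumptions made for the sketch: triples are plain `(subject, predicate, object)` tuples, the voiD lookup is replaced by an in-memory `registry` dictionary mapping vocabulary URIs to endpoint URLs, slash-terminated namespaces are handled by dropping the last path segment, and duplicates are removed, which the pseudocode does not specify.

```python
from urllib.parse import urldefrag

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def extract_vocabulary(resources):
    """Return the distinct vocabularies of the given resource URIs."""
    vocabularies = []
    for uri in resources:
        base, fragment = urldefrag(uri)
        # Hash namespace: strip the fragment. Slash namespace (assumption):
        # strip the last path segment.
        vocabulary = base if fragment else uri.rsplit("/", 1)[0] + "/"
        if vocabulary not in vocabularies:
            vocabularies.append(vocabulary)
    return vocabularies


def endpoints_for(vocabularies, registry):
    """Collect endpoints that declare any of the vocabularies."""
    found = []
    for vocabulary in vocabularies:
        for endpoint in registry.get(vocabulary, []):
            if endpoint not in found:
                found.append(endpoint)
    return found


def select_endpoint(triples, registry):
    """Try the vocabularies of rdf:type objects first, then of all predicates."""
    types = [o for (_, p, o) in triples if p == RDF_TYPE]
    endpoints = endpoints_for(extract_vocabulary(types), registry)
    if not endpoints:
        predicates = [p for (_, p, _) in triples]
        endpoints = endpoints_for(extract_vocabulary(predicates), registry)
    return endpoints


# Hypothetical data for illustration only.
triples = [
    ("http://example.org/page#author1", RDF_TYPE,
     "http://ontoware.org/swrc/swrc_v0.3.owl#Author"),
    ("http://example.org/page#author1", "http://xmlns.com/foaf/0.1/name",
     "Geert-Jan Houben"),
]
registry = {"http://ontoware.org/swrc/swrc_v0.3.owl": ["http://example.org/sparql"]}
print(select_endpoint(triples, registry))
```

With a registry that only knows the FOAF namespace, the type lookup comes back empty and the function falls back to the predicate vocabularies, mirroring the `IF E = Empty` branch.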
Semantic Link – Example Triples associated with the semantic link.

Linkator: enriching web pages by automatically adding dereferenceable semantic annotations

  • 1. Linkator: enriching web pages by automatically adding dereferenceable semantic annotations Samur Araujo, Geert-Jan Houben, Daniel Schwabe Web Information Systems Delft University of Technology, the Netherlands
  • 2. Summary – dereferencing semantic annotations What dereferencing semantic annotations is about? Automatic linking web pages. Summary Overview of the problem and motivation. Our approach for solving the problem. One example of use.
  • 3. Motivation Links between HTML pages are the main mechanism to navigate on web pages. However, a lot of pages are unlinked or poorly linked. Terms on pages have meaning and are intrinsically associated to concepts or entities that the user is interested in. These terms can be interpreted by machines and automatically linked to relevant resources on the web.
  • 4.
  • 5. Problem Statement The problem of automatic linking can be divided in 3 sub-problems: How to identify candidate terms (anchors) for adding links? It denotes concepts in which the user is interested. Which concept does a candidate term represent? Disambiguate a candidate term. How to identify a web resource to be the link target? How to select a source of data for finding the destination of the link?
  • 6. State-of-the-Art in Automatic Linking Candidate Terms: Focused on term disambiguation using an auxiliary knowledge base or dictionaries (e.g. wikipedia and wordnet). Link Target: It is selected from a specific knowledge base [1] or from a collection [2] of target documents. Limitations Does not support well users interested in a broader range of domains. [1] Mihalcea, R. and Csomai, A. Wikify!: linking documents to encyclopedic knowledge. In Proceedings of the 16th ACM Conference on Information and Knowledge management (CIKM 07), Lisbon, Portugal, pp. 233-242, 2007. [2] Gardner JJ, Krowne A, Xiong L. NNexus: An Automatic Linker for Collaborative Web-Based Corpora. IEEE Trans. Knowl. Data Eng. 21(6). 829-839. 2009.
  • 7. Linkator Approach Linkator Extract Terms from Web Pages Associate Terms to Concepts Find Resources that Represents these Concepts Core Linkator Information Extraction Engine Semantic Annotator
  • 8. Link Clicked Page Accessed Page is accessed Annotated page Term are extracted Annotation is extracted Page is semantically annotated Endpoint is chosen Semantic Links created Query is formulated If not found Search for a resource
  • 9. Linkator Approach Web Browser Linkator Client - Firefox Plugin Annotator RDFa Annotator Information Extraction Engine HTTP HTTP Linkator Server Linked Data Endpoint Resolution Sparql Query Formulation
  • 10. Semantic Link – Definition A semantic link is an HTML tag A that is semantically annotated with RDFa. It contains RDF triples associated to it. Semantic Link causes a query over Linked Data.
  • 11. RDF Triples associated to the Semantic Link Semantic Links
  • 12. Dereferencing Semantic Links Linkator uses the Linked Data cloudfor discovering a destination for the semantic link as opposed to querying search engines or a fixed knowledge base. Algorithm for Endpoint Resolution Algorithm for Query Formulation
  • 13.
  • 14. Endpoint Resolution Select the vocabulary of all RDF types associated with the annotation. Or select the vocabularies of all predicates associated with the annotation.
  • 15. Endpoint Resolution The SelectEndpoint function find the resource: http://ontoware.org/swrc/swrc_v0.3.owl#Author It extracts the vocabulary associated with this resource:http://ontoware.org/swrc/swrc_v0.3.owl It queries the voiDdescriptor of the available SPARQL endpoints, looking for such a vocabulary.
  • 16. Query Formulation Query is based on the object of the triple. Try to find a human-readable representation of the resource, i.e., try to match predicates such as: foaf:homepage, akt:has-web-address, rdfs:seeAlso.
  • 17. Proof of Concept Semantic links for pages that contain bibliographic citations. Extended version of FreeCite parsing engine. Example of bibliographic citation: Keesvan derSluijs, Geert-Jan Houben, Erwin Leonardi, Jan Hidders. Hera: Engineering Web Applications Using Semantic Web-Based Models. Book chapter: Semantic Web Information Management: A Model-Based Perspective, De Virgilio, Roberto; Giunchiglia, Fausto; Tanca, Letizia (Eds.), Chapter 22, 2010, Springer.
  • 18. Linkator: Extract Terms from Web Pages; Associate Terms to Concepts; Find Resources that Represent these Concepts. [Pipeline diagram] Labels: Information Extraction Engine; Semantic Annotator; Core Linkator; HTML Page; Markup Removed; Plain Text; Entity Extraction; FreeCite Extraction Engine; Text Semantically Annotated; Semantic Annotation; Insert annotations on the page; HTML Page Semantically Annotated; Semantic link clicked; SPARQL Endpoint Discovering and Selection; Endpoint Querying; URL Generation.
  • 19. Example – HTML Page without Links
  • 20.
  • 21. Example – Page annotated with RDFa
  • 22. Example – Page with Semantic Links
  • 23.
  • 24.
  • 25. Conclusion and Future Work For a specific scenario of linking bibliographic citations Linkator provides a reasonable solution. The composition of the Semantic Web technologies can provide a reasonable solution for the problem of automatic linking. Linkator is a concrete application that uses Semantic Web technologies. Future Work: Use Linkator in a broader scenario. Enhance the Linkator algorithms. Evaluate the precision and recall of the linking.
  • 26. Questions? Thank you for your attention! Samur Araujo s.f.cardosodearaujo@tudelft.nl You can download Linkator at: http://www.wis.ewi.tudelft.nl/
  • 27. Annotations on the page are used to find the link destination. [Diagram] Labels: HTML Page; page is annotated; Annotated HTML Page; RDF; link is clicked.
  • 28. State-of-the-Art in Automatic Linking Examples: Wikify! [1] focuses on linking keywords on web pages to Wikipedia articles. NNexus [2] focuses on linking keywords obtained from an index extracted from target documents. [1] Mihalcea, R. and Csomai, A. Wikify!: linking documents to encyclopedic knowledge. In Proceedings of the 16th ACM Conference on Information and Knowledge management (CIKM 07), Lisbon, Portugal, pp. 233-242, 2007. [2] Gardner JJ, Krowne A, Xiong L. NNexus: An Automatic Linker for Collaborative Web-Based Corpora. IEEE Trans. Knowl. Data Eng. 21(6). 829-839. 2009.
  • 29. Endpoint Resolution
      FUNCTION SelectEndpoint
        E := Array
        R := select all rdf:type objects associated to the semantic link
        T := ExtractVocabulary(R)
        FOR EACH vocabulary in T DO {
          E.add (select endpoints that contain this vocabulary)
        }
        IF E = Empty {
          R := select all predicates associated to the semantic link
          T := ExtractVocabulary(R)
          FOR EACH vocabulary in T DO {
            E.add (select endpoints that contain this vocabulary)
          }
        }
        RETURN E

      FUNCTION ExtractVocabulary(R)
        V := Array
        FOR EACH resource in R DO {
          V.add (extract the vocabulary from the resource)
        }
        RETURN V
  • 30. Semantic Link – Example Triples associated with the semantic link.
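
The SelectEndpoint and ExtractVocabulary pseudocode from slide 29 can be sketched in Python roughly as follows. This is a minimal illustration, not Linkator's actual implementation. It makes several assumptions that are not in the slides: triples are plain (subject, predicate, object) tuples of strings; the voiD descriptors are replaced by an in-memory dictionary (endpoint_vocabularies) mapping a hypothetical endpoint URL to the set of vocabularies it declares; a vocabulary URI is obtained by cutting the resource URI at "#" or, failing that, at the last "/"; and duplicates are dropped, which the pseudocode does not specify.

```python
from urllib.parse import urldefrag

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"


def extract_vocabulary(resources):
    """Return the vocabulary URI of each resource, in order, without duplicates."""
    vocabularies = []
    for uri in resources:
        base, fragment = urldefrag(uri)
        if not fragment:
            # Assumption: slash namespaces end at the last "/" of the URI.
            base = uri.rsplit("/", 1)[0] + "/"
        if base not in vocabularies:
            vocabularies.append(base)
    return vocabularies


def endpoints_for(vocabularies, endpoint_vocabularies):
    """Collect endpoints whose (stand-in) voiD description lists a vocabulary."""
    found = []
    for vocabulary in vocabularies:
        for endpoint, declared in endpoint_vocabularies.items():
            if vocabulary in declared and endpoint not in found:
                found.append(endpoint)
    return found


def select_endpoint(triples, endpoint_vocabularies):
    """Try the rdf:type objects first; fall back to the predicates."""
    types = [o for (_, p, o) in triples if p == RDF_TYPE]
    endpoints = endpoints_for(extract_vocabulary(types), endpoint_vocabularies)
    if not endpoints:
        predicates = [p for (_, p, _) in triples]
        endpoints = endpoints_for(
            extract_vocabulary(predicates), endpoint_vocabularies
        )
    return endpoints


# Hypothetical data: the endpoint URLs and the blank-node subject are made up.
triples = [
    ("_:link", RDF_TYPE, "http://ontoware.org/swrc/swrc_v0.3.owl#Author"),
    ("_:link", "http://xmlns.com/foaf/0.1/name", "Geert-Jan Houben"),
]
endpoint_vocabularies = {
    "http://example.org/sparql-a": {"http://ontoware.org/swrc/swrc_v0.3.owl"},
    "http://example.org/sparql-b": {"http://xmlns.com/foaf/0.1/"},
}

print(select_endpoint(triples, endpoint_vocabularies))
```

With the sample data, the rdf:type object resolves to the SWRC vocabulary from slide 15, so only the first hypothetical endpoint is selected; the predicate fallback is used only when no endpoint matches a type vocabulary. In a real deployment the dictionary lookup would presumably be replaced by queries against the endpoints' voiD descriptions.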

Editor's notes

  1. I am in the starting phase of my PhD research. In this presentation, I will outline our initial vision of the research problem, which is building trust in web content, and our approach to solving it. I will also give a brief plan of my PhD research.
  2. We focus on content trust and formulate our main research questions. The first key issue is to investigate what kinds of factors can influence trust in content. Following from that, we also need to know how to capture and represent information about these factors. The third key issue is how to assess or compute content trust based on the information obtained in the second step. Ideally, we want a trust value assigned to every piece of content. In contrast to propagating trust through a network of people, we now have more information and semantics about the content, so we want to build metrics that assess trustworthiness based on the content itself and on the connections between different pieces of content, especially their semantic similarity and relations.