SlideShare ist ein Scribd-Unternehmen logo
1 von 37
 Copyright 2011 Digital Enterprise Research Institute. All rights reserved.
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Named Entity Recognition: Fallacies,
Challenges & Opportunities
Authors: Mónica Marrero, Julián Urbano, Sonia Sánchez-
Cuadrado, Jorge Morato, Juan Miguel Gómez-Berbís
Presented by: Bianca Pereira
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Alchemy API Raises $2M
“Alchemy, which launched in 2009, processes 3 billion
API calls per month. It is used in 36 countries (…)”
http://semanticweb.com/alchemy-api-raises-2m_b35276
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
“FOX can generate RDF out of natural language with
improved accuracy. FOX has been shown to be up to
15% more accurate than other frameworks, including
commercial software.”
http://semanticweb.com/aksw-announces-federated-knowledge-extraction_b21399
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
“There are many open-source and commercial products
out there that attempt to determine sentiment in
tweets, but what is interesting to find out is what entity
is that sentiment attached to.”
http://semanticweb.com/introducing-semanticweb-com-innovation-spotlight-series-with-pingar_b30106
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
“DBPedia Spotlight’s ability (…) to support (…) faceted
browsing, customized web feeds (…) enrich blog
content.”
“Many (…) relationship extraction algorithms rely on
entity identification beforehand(…)”
http://semanticweb.com/the-spotlight%E2%80%99s-on-dbpedia_b17942
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
“People and places (…) are only a small part of this
wider project (…) around entities that Bing embarked on
a while back.”
http://techcrunch.com/2013/03/21/bing-just-got-a-lot-smarter-now-knows-more-about-people-and-places/
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Agenda
 What is a (Named) Entity?
 Named Entity Recognition evolution
 Named Entity Recognition evaluation
 Conclusions
 How is it related to my PhD?
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Named Entity Recognition
 What is Named Entity Recognition?
“Identification of mentions to real world entities
in a natural language text. “
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Named Entity Recognition
 What is Named Entity Recognition?
“Identification of mentions to real world entities
in a natural language text. “
(my words)
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Named Entity Recognition
 The term “named entity” was coined for the Named
Entity task at the 6th Message Understanding
Conference (MUC-6).
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Named Entity Recognition
 The term “named entity” was coined for the Named
Entity task at the 6th Message Understanding
Conference (MUC-6).
“Unique identifiers of entities (organizations, persons,
locations), times (dates, times), and quantities
(monetary values, percentages).”
(http://cs.nyu.edu/faculty/grishman/NEtask20.book_2.html#HEADING1)
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Named Entity Recognition
 In the next year, the definition changed a little bit.
“Named Entities (NE) were defined as proper names
and quantities of interest. Person, organization, and
location names were marked as well as dates, times,
percentages, and monetary amounts.”
(http://www.itl.nist.gov/iaui/894.02/related_projects/muc/proceedings
/muc_7_proceedings/overview.html)
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
MUC-7 Results
 The results for the MUC-7 Named Entity task are
very promising
(http://www.itl.nist.gov/iaui/894.02/related_projects/muc/proceedings/muc_7_
proceedings/marsh_slides.pdf)
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Challenges
 There were no Message Understanding Conference
anymore…
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Challenges
 But there was..
 Automatic Content Extraction (ACE - 1999)
 Computational Natural Language Learning (CoNLL – 2002)
 INEX Entity Ranking Track (2007)
 TREC Entity Track (2009)
 TAC Knowledge Base Population (TAC-KBP – 2009)
 …
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
ACE
“ Recognition of entities, not just names. In the ACE
entity detection and tracking (EDT) task, all mentions
of an entity, whether a name, a description, or a
pronoun, are to be found and collected into
equivalence classes based on reference to the same
entity.”
(Doddington et al. 2004)
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
CoNLL
“ Named entities are phrases that contain names of
persons, organizations, locations, times and
quantities. (…) We will concentrate on four types of
named entities: persons, locations, organizations and
names of miscellaneous entities that do not belong
to the previous three groups.(…)”
(http://www.clips.ua.ac.be/conll2002/ner/)
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
INEX Entity Ranking Track
“ (…) entities (such as countries, people and dates)
requires the estimation of relevance of items (i.e.,
instances of entities) (…) we restricted candidate
items to those entities that have their own
Wikipedia article.”
(De Vries et al. 2007)
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
TREC Entity Track
“ A web entity is uniquely identifiable by one of its
primary homepages. Real-world entities can be
represented by multiple homepages.”
(Balog et al. 2009)
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
TAC-KBP
“ The tasks will be structured by having participants
process a list of target entities. The list will contain
entity types of Person, Organization and Geo-Political
Entity.”
(http://apl.jhu.edu/~paulmac/kbp/090601-KBPTaskGuidelines.pdf)
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
What is a Named Entity?
 Proper nouns
 Water? Whale? Twelve o’clock?
 Rigid designator
 Richard Nixon (V) vs President of the United States (X)
 Unique identifier
 “(…) virtually everything could be referred to uniquely,
depending on the context or the previous knowledge of
the receiver, although a unique identifier for one receiver
might not be so for another one, either because of lack of
shared knowledge or the ambiguity of the context.”
 Purpose and domain of application
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Evaluation
 As the definition changes the evaluation changes.
Each challenge has different..
 .. types of Named Entity to identify
 .. identification and annotation criteria
 .. valid boundaries of a Named Entity
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Other problems
 How to evaluate current tools with different
definitions of Named Entities?
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Other problems
 How to evaluate current tools with different
definitions of Named Entities?
 Using only Person, Organization and Place.
 Using only those tools which work with numbers and
dates.
 Using current annotated corpora (and see what happens).
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Other problems
 How to evaluate current tools with different
definitions of Named Entities?
 Using only Person, Organization and Place.
 Using only those tools which work with numbers and
dates.
 Using current annotated corpora (and see what happens).
 How to choose the best tool?
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Other problems
 How to evaluate current tools with different
definitions of Named Entities?
 Using only Person, Organization and Place.
 Using only those tools which work with numbers and
dates.
 Using current annotated corpora (and see what happen).
 How to choose the best tool?
 It depends on the application.
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Conclusions
 Is NER really solved?
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Conclusions
 Is NER really solved?
 Content Validity
– Reflect the needs of the real user.
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Conclusions
 Is NER really solved?
 Content Validity
 External Validity
– The experiments can be generalized to other populations and
experimental settings.
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Conclusions
 Is NER really solved?
 Content Validity
 External Validity
 Convergent Validity
– The results agree with other results, theoretical or
experimental.
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Conclusions
 Is NER really solved?
 Content Validity
 External Validity
 Convergent Validity
 Conclusion Validity
– The conclusions drawn from the results are justified.
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Conclusions
 Is NER really solved?
 Content Validity
 External Validity
 Convergent Validity
 Conclusion Validity
“There is not enough evidence to support the statement that
NER is solved: it rather suggests the opposite”
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Conclusions
 What about..
 genes and diseases?
 entities identified by the same name as their classes?
(ambulance, airplane, and so on)
 entities identified by their attributes and description
 entities…
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Conclusions
What is an entity?
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
My PhD thesis
 How is it related to my PhD topic?
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
My PhD thesis
 How is it related to my PhD topic?
 Entity Linking is the identification and disambiguation of
entities using a background knowledge base.
 Entity Recognition is the first step.
 What is an entity?
 And more.. What is an entity in different domains?
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
References
 (Balog et al 2009)
 Balog, Krisztian, et al. “Overview of the TREC 2009 Entity
Track.” 2009
 (Doddington et al 2004)
 Doddington, George, et al. “The automatic content
extraction (ACE) program-tasks, data, and evaluation.”
Proceedings of LREC. Vol 4. 2004.
 (De Vries et al 2007)
 De Vries, Arjen P., et al. “Overview of the INEX 2007 entity
ranking track.” Focused Access to XML Documents.
Springer Berlin Heidelberg, 2008. 245-251.

Weitere ähnliche Inhalte

Was ist angesagt?

WatchDox CDCR CaseStudy Dec2014
WatchDox CDCR CaseStudy Dec2014WatchDox CDCR CaseStudy Dec2014
WatchDox CDCR CaseStudy Dec2014joepanora
 
Folksonomy hated word but loved resource
Folksonomy hated word but loved resourceFolksonomy hated word but loved resource
Folksonomy hated word but loved resourceThomas Vander Wal
 
A Look at CESSDA and Data Re-use Licenses
A Look at CESSDA and Data Re-use LicensesA Look at CESSDA and Data Re-use Licenses
A Look at CESSDA and Data Re-use LicensesCESSDA Training
 
Building Knowledge Graphs in DIG
Building Knowledge Graphs in DIGBuilding Knowledge Graphs in DIG
Building Knowledge Graphs in DIGPalak Modi
 
Discovering Semantic Equivalence of People behind Online Profiles (RED 2012 -...
Discovering Semantic Equivalence of People behind Online Profiles (RED 2012 -...Discovering Semantic Equivalence of People behind Online Profiles (RED 2012 -...
Discovering Semantic Equivalence of People behind Online Profiles (RED 2012 -...kcortis
 
lawTechCamp - Knowledge Management Panel
lawTechCamp - Knowledge Management PanellawTechCamp - Knowledge Management Panel
lawTechCamp - Knowledge Management Panellawtechcamp
 
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITYSEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITYAmit Sheth
 
Information Architecture class1 01 09
Information Architecture class1 01 09Information Architecture class1 01 09
Information Architecture class1 01 09Marti Gukeisen
 

Was ist angesagt? (9)

WatchDox CDCR CaseStudy Dec2014
WatchDox CDCR CaseStudy Dec2014WatchDox CDCR CaseStudy Dec2014
WatchDox CDCR CaseStudy Dec2014
 
Folksonomy hated word but loved resource
Folksonomy hated word but loved resourceFolksonomy hated word but loved resource
Folksonomy hated word but loved resource
 
A Look at CESSDA and Data Re-use Licenses
A Look at CESSDA and Data Re-use LicensesA Look at CESSDA and Data Re-use Licenses
A Look at CESSDA and Data Re-use Licenses
 
Building Knowledge Graphs in DIG
Building Knowledge Graphs in DIGBuilding Knowledge Graphs in DIG
Building Knowledge Graphs in DIG
 
Bottom Up Tagging
Bottom Up TaggingBottom Up Tagging
Bottom Up Tagging
 
Discovering Semantic Equivalence of People behind Online Profiles (RED 2012 -...
Discovering Semantic Equivalence of People behind Online Profiles (RED 2012 -...Discovering Semantic Equivalence of People behind Online Profiles (RED 2012 -...
Discovering Semantic Equivalence of People behind Online Profiles (RED 2012 -...
 
lawTechCamp - Knowledge Management Panel
lawTechCamp - Knowledge Management PanellawTechCamp - Knowledge Management Panel
lawTechCamp - Knowledge Management Panel
 
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITYSEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
SEMANTIC CONTENT MANAGEMENT FOR ENTERPRISES AND NATIONAL SECURITY
 
Information Architecture class1 01 09
Information Architecture class1 01 09Information Architecture class1 01 09
Information Architecture class1 01 09
 

Ähnlich wie Reading Group 2013 (DERI NUIG)

Questions On The And Football
Questions On The And FootballQuestions On The And Football
Questions On The And FootballAmanda Gray
 
PhD Day: Entity Linking using Generic Linked Data Datasets
PhD Day: Entity Linking using Generic Linked Data DatasetsPhD Day: Entity Linking using Generic Linked Data Datasets
PhD Day: Entity Linking using Generic Linked Data DatasetsBianca Pereira
 
Successfully Kickstarting Data Governance's Social Dynamics: Define, Collabor...
Successfully Kickstarting Data Governance's Social Dynamics: Define, Collabor...Successfully Kickstarting Data Governance's Social Dynamics: Define, Collabor...
Successfully Kickstarting Data Governance's Social Dynamics: Define, Collabor...Stijn (Stan) Christiaens
 
Querying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data WebQuerying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data WebEdward Curry
 
Profiling information sources and services for discovery
Profiling information sources and services for discoveryProfiling information sources and services for discovery
Profiling information sources and services for discoveryMathieu d'Aquin
 
What Business Innovators Need to Know about Content Analytics
What Business Innovators Need to Know about Content AnalyticsWhat Business Innovators Need to Know about Content Analytics
What Business Innovators Need to Know about Content AnalyticsSeth Grimes
 
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...Amit Sheth
 
Search Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignSearch Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignMarianne Sweeny
 
Coping with Data Variety in the Big Data Era: The Semantic Computing Approach
Coping with Data Variety in the Big Data Era: The Semantic Computing ApproachCoping with Data Variety in the Big Data Era: The Semantic Computing Approach
Coping with Data Variety in the Big Data Era: The Semantic Computing ApproachAndre Freitas
 
Digital Reasoning at AirSummit 2014
Digital Reasoning at AirSummit 2014Digital Reasoning at AirSummit 2014
Digital Reasoning at AirSummit 2014Marten den Haring
 
Co-creation of Learning and Social CRM
Co-creation of Learning and Social CRMCo-creation of Learning and Social CRM
Co-creation of Learning and Social CRMDarshan Desai
 
ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Kn...
ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Kn...ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Kn...
ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Kn...eswcsummerschool
 
Smart Content = Smart Business
Smart Content = Smart BusinessSmart Content = Smart Business
Smart Content = Smart BusinessSeth Grimes
 
AI, Search, and the Disruption of Knowledge Management
AI, Search, and the Disruption of Knowledge ManagementAI, Search, and the Disruption of Knowledge Management
AI, Search, and the Disruption of Knowledge ManagementTrey Grainger
 
Leaning Lab il Living Lab di Pisa
Leaning Lab il Living Lab di PisaLeaning Lab il Living Lab di Pisa
Leaning Lab il Living Lab di PisaDaniele Mazzei
 
Self-Sovereign Identity: Lightening Talk at RightsCon
Self-Sovereign Identity: Lightening Talk at RightsCon Self-Sovereign Identity: Lightening Talk at RightsCon
Self-Sovereign Identity: Lightening Talk at RightsCon Kaliya "Identity Woman" Young
 
Challenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial DataChallenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial DataEdward Curry
 

Ähnlich wie Reading Group 2013 (DERI NUIG) (20)

Questions On The And Football
Questions On The And FootballQuestions On The And Football
Questions On The And Football
 
PhD Day: Entity Linking using Generic Linked Data Datasets
PhD Day: Entity Linking using Generic Linked Data DatasetsPhD Day: Entity Linking using Generic Linked Data Datasets
PhD Day: Entity Linking using Generic Linked Data Datasets
 
Successfully Kickstarting Data Governance's Social Dynamics: Define, Collabor...
Successfully Kickstarting Data Governance's Social Dynamics: Define, Collabor...Successfully Kickstarting Data Governance's Social Dynamics: Define, Collabor...
Successfully Kickstarting Data Governance's Social Dynamics: Define, Collabor...
 
Querying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data WebQuerying Heterogeneous Datasets on the Linked Data Web
Querying Heterogeneous Datasets on the Linked Data Web
 
Hope x talk
Hope x talkHope x talk
Hope x talk
 
Identity Talk at Net Squared 2008
Identity Talk at Net Squared 2008Identity Talk at Net Squared 2008
Identity Talk at Net Squared 2008
 
Profiling information sources and services for discovery
Profiling information sources and services for discoveryProfiling information sources and services for discovery
Profiling information sources and services for discovery
 
What Business Innovators Need to Know about Content Analytics
What Business Innovators Need to Know about Content AnalyticsWhat Business Innovators Need to Know about Content Analytics
What Business Innovators Need to Know about Content Analytics
 
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...
Semantic Web & Information Brokering: Opportunities, Commercialization and Ch...
 
Search Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignSearch Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By Design
 
Coping with Data Variety in the Big Data Era: The Semantic Computing Approach
Coping with Data Variety in the Big Data Era: The Semantic Computing ApproachCoping with Data Variety in the Big Data Era: The Semantic Computing Approach
Coping with Data Variety in the Big Data Era: The Semantic Computing Approach
 
Digital Reasoning at AirSummit 2014
Digital Reasoning at AirSummit 2014Digital Reasoning at AirSummit 2014
Digital Reasoning at AirSummit 2014
 
Co-creation of Learning and Social CRM
Co-creation of Learning and Social CRMCo-creation of Learning and Social CRM
Co-creation of Learning and Social CRM
 
ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Kn...
ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Kn...ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Kn...
ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Kn...
 
Smart Content = Smart Business
Smart Content = Smart BusinessSmart Content = Smart Business
Smart Content = Smart Business
 
AI, Search, and the Disruption of Knowledge Management
AI, Search, and the Disruption of Knowledge ManagementAI, Search, and the Disruption of Knowledge Management
AI, Search, and the Disruption of Knowledge Management
 
Leaning Lab il Living Lab di Pisa
Leaning Lab il Living Lab di PisaLeaning Lab il Living Lab di Pisa
Leaning Lab il Living Lab di Pisa
 
Self-Sovereign Identity: Lightening Talk at RightsCon
Self-Sovereign Identity: Lightening Talk at RightsCon Self-Sovereign Identity: Lightening Talk at RightsCon
Self-Sovereign Identity: Lightening Talk at RightsCon
 
Challenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial DataChallenges Ahead for Converging Financial Data
Challenges Ahead for Converging Financial Data
 
Gic2011 aula10-ingles
Gic2011 aula10-inglesGic2011 aula10-ingles
Gic2011 aula10-ingles
 

Mehr von Bianca Pereira

Dealing with writer's block
Dealing with writer's blockDealing with writer's block
Dealing with writer's blockBianca Pereira
 
HCI Challenges in Crowd4Access Citizen Science project
HCI Challenges in Crowd4Access Citizen Science projectHCI Challenges in Crowd4Access Citizen Science project
HCI Challenges in Crowd4Access Citizen Science projectBianca Pereira
 
Taxonomy Extraction for Customer Service Knowledge Base Construction
Taxonomy Extraction for Customer Service Knowledge Base ConstructionTaxonomy Extraction for Customer Service Knowledge Base Construction
Taxonomy Extraction for Customer Service Knowledge Base ConstructionBianca Pereira
 
How to build your topic?
How to build your topic?How to build your topic?
How to build your topic?Bianca Pereira
 
Dealing with writer's block
Dealing with writer's blockDealing with writer's block
Dealing with writer's blockBianca Pereira
 
Smart Futures presentation at St. Raphael's College
Smart Futures presentation at St. Raphael's CollegeSmart Futures presentation at St. Raphael's College
Smart Futures presentation at St. Raphael's CollegeBianca Pereira
 
Compreensão de Linguagem Natural no Insight: Construindo a Ponte entre Texto ...
Compreensão de Linguagem Natural no Insight: Construindo a Ponte entre Texto ...Compreensão de Linguagem Natural no Insight: Construindo a Ponte entre Texto ...
Compreensão de Linguagem Natural no Insight: Construindo a Ponte entre Texto ...Bianca Pereira
 
Tutorial de Web Semântica - CompSem 2015
Tutorial de Web Semântica - CompSem 2015Tutorial de Web Semântica - CompSem 2015
Tutorial de Web Semântica - CompSem 2015Bianca Pereira
 
DBpedia as Gaeilge Chapter
DBpedia as Gaeilge ChapterDBpedia as Gaeilge Chapter
DBpedia as Gaeilge ChapterBianca Pereira
 
Entity Linking with Multiple Knowledge Bases: an Ontology Modularization Appr...
Entity Linking with Multiple Knowledge Bases: an Ontology Modularization Appr...Entity Linking with Multiple Knowledge Bases: an Ontology Modularization Appr...
Entity Linking with Multiple Knowledge Bases: an Ontology Modularization Appr...Bianca Pereira
 
PhD Day: Adaptive Entity Linking
PhD Day: Adaptive Entity LinkingPhD Day: Adaptive Entity Linking
PhD Day: Adaptive Entity LinkingBianca Pereira
 
PhD Day: Entity Linking using Ontology Modularization
PhD Day: Entity Linking using Ontology ModularizationPhD Day: Entity Linking using Ontology Modularization
PhD Day: Entity Linking using Ontology ModularizationBianca Pereira
 
NUIG Research Showcase 2014
NUIG Research Showcase 2014NUIG Research Showcase 2014
NUIG Research Showcase 2014Bianca Pereira
 
AELA: An Adaptive Entity Linking Approach
AELA: An Adaptive Entity Linking ApproachAELA: An Adaptive Entity Linking Approach
AELA: An Adaptive Entity Linking ApproachBianca Pereira
 
How to Make Your Content Smarter
How to Make Your Content SmarterHow to Make Your Content Smarter
How to Make Your Content SmarterBianca Pereira
 
Reading Group 2014 (Insight NUIG)
Reading Group 2014 (Insight NUIG)Reading Group 2014 (Insight NUIG)
Reading Group 2014 (Insight NUIG)Bianca Pereira
 

Mehr von Bianca Pereira (16)

Dealing with writer's block
Dealing with writer's blockDealing with writer's block
Dealing with writer's block
 
HCI Challenges in Crowd4Access Citizen Science project
HCI Challenges in Crowd4Access Citizen Science projectHCI Challenges in Crowd4Access Citizen Science project
HCI Challenges in Crowd4Access Citizen Science project
 
Taxonomy Extraction for Customer Service Knowledge Base Construction
Taxonomy Extraction for Customer Service Knowledge Base ConstructionTaxonomy Extraction for Customer Service Knowledge Base Construction
Taxonomy Extraction for Customer Service Knowledge Base Construction
 
How to build your topic?
How to build your topic?How to build your topic?
How to build your topic?
 
Dealing with writer's block
Dealing with writer's blockDealing with writer's block
Dealing with writer's block
 
Smart Futures presentation at St. Raphael's College
Smart Futures presentation at St. Raphael's CollegeSmart Futures presentation at St. Raphael's College
Smart Futures presentation at St. Raphael's College
 
Compreensão de Linguagem Natural no Insight: Construindo a Ponte entre Texto ...
Compreensão de Linguagem Natural no Insight: Construindo a Ponte entre Texto ...Compreensão de Linguagem Natural no Insight: Construindo a Ponte entre Texto ...
Compreensão de Linguagem Natural no Insight: Construindo a Ponte entre Texto ...
 
Tutorial de Web Semântica - CompSem 2015
Tutorial de Web Semântica - CompSem 2015Tutorial de Web Semântica - CompSem 2015
Tutorial de Web Semântica - CompSem 2015
 
DBpedia as Gaeilge Chapter
DBpedia as Gaeilge ChapterDBpedia as Gaeilge Chapter
DBpedia as Gaeilge Chapter
 
Entity Linking with Multiple Knowledge Bases: an Ontology Modularization Appr...
Entity Linking with Multiple Knowledge Bases: an Ontology Modularization Appr...Entity Linking with Multiple Knowledge Bases: an Ontology Modularization Appr...
Entity Linking with Multiple Knowledge Bases: an Ontology Modularization Appr...
 
PhD Day: Adaptive Entity Linking
PhD Day: Adaptive Entity LinkingPhD Day: Adaptive Entity Linking
PhD Day: Adaptive Entity Linking
 
PhD Day: Entity Linking using Ontology Modularization
PhD Day: Entity Linking using Ontology ModularizationPhD Day: Entity Linking using Ontology Modularization
PhD Day: Entity Linking using Ontology Modularization
 
NUIG Research Showcase 2014
NUIG Research Showcase 2014NUIG Research Showcase 2014
NUIG Research Showcase 2014
 
AELA: An Adaptive Entity Linking Approach
AELA: An Adaptive Entity Linking ApproachAELA: An Adaptive Entity Linking Approach
AELA: An Adaptive Entity Linking Approach
 
How to Make Your Content Smarter
How to Make Your Content SmarterHow to Make Your Content Smarter
How to Make Your Content Smarter
 
Reading Group 2014 (Insight NUIG)
Reading Group 2014 (Insight NUIG)Reading Group 2014 (Insight NUIG)
Reading Group 2014 (Insight NUIG)
 

Kürzlich hochgeladen

Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Enterprise Knowledge
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUK Journal
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessPixlogix Infotech
 

Kürzlich hochgeladen (20)

Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Advantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your BusinessAdvantages of Hiring UIUX Design Service Providers for Your Business
Advantages of Hiring UIUX Design Service Providers for Your Business
 

Reading Group 2013 (DERI NUIG)

  • 1.  Copyright 2011 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Named Entity Recognition: Fallacies, Challenges & Opportunities Authors: Mónica Marrero, Julián Urbano, Sonia Sánchez- Cuadrado, Jorge Morato, Juan Miguel Gómez-Berbís Presented by: Bianca Pereira
  • 2. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Alchemy API Raises $2M “Alchemy, which launched in 2009, processes 3 billion API calls per month. It is used in 36 countries (…)” http://semanticweb.com/alchemy-api-raises-2m_b35276
  • 3. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge “FOX can generate RDF out of natural language with improved accuracy. FOX has been shown to be up to 15% more accurate than other frameworks, including commercial software.” http://semanticweb.com/aksw-announces-federated-knowledge-extraction_b21399
  • 4. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge “There are many open-source and commercial products out there that attempt to determine sentiment in tweets, but what is interesting to find out is what entity is that sentiment attached to.” http://semanticweb.com/introducing-semanticweb-com-innovation-spotlight-series-with-pingar_b30106
  • 5. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge “DBPedia Spotlight’s ability (…) to support (…) faceted browsing, customized web feeds (…) enrich blog content.” “Many (…) relationship extraction algorithms rely on entity identification beforehand(…)” http://semanticweb.com/the-spotlight%E2%80%99s-on-dbpedia_b17942
  • 6. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge “People and places (…) are only a small part of this wider project (…) around entities that Bing embarked on a while back.” http://techcrunch.com/2013/03/21/bing-just-got-a-lot-smarter-now-knows-more-about-people-and-places/
  • 7. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Agenda  What is a (Named) Entity?  Named Entity Recognition evolution  Named Entity Recognition evaluation  Conclusions  How is it related to my PhD?
  • 8. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Named Entity Recognition  What is Named Entity Recognition? “Identification of mentions to real world entities in a natural language text. “
  • 9. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Named Entity Recognition  What is Named Entity Recognition? “Identification of mentions to real world entities in a natural language text. “ (my words)
  • 10. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Named Entity Recognition  The term “named entity” was coined for the Named Entity task at the 6th Message Understanding Conference (MUC-6).
  • 11. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Named Entity Recognition  The term “named entity” was coined for the Named Entity task at the 6th Message Understanding Conference (MUC-6). “Unique identifiers of entities (organizations, persons, locations), times (dates, times), and quantities (monetary values, percentages).” (http://cs.nyu.edu/faculty/grishman/NEtask20.book_2.html#HEADING1)
  • 12. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Named Entity Recognition  In the next year, the definition changed a little bit. “Named Entities (NE) were defined as proper names and quantities of interest. Person, organization, and location names were marked as well as dates, times, percentages, and monetary amounts.” (http://www.itl.nist.gov/iaui/894.02/related_projects/muc/proceedings /muc_7_proceedings/overview.html)
  • 13. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge MUC-7 Results  The results for the MUC-7 Named Entity task are very promising (http://www.itl.nist.gov/iaui/894.02/related_projects/muc/proceedings/muc_7_ proceedings/marsh_slides.pdf)
  • 14. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Challenges  There were no Message Understanding Conference anymore…
  • 15. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Challenges  But there was..  Automatic Content Extraction (ACE - 1999)  Computational Natural Language Learning (CoNLL – 2002)  INEX Entity Ranking Track (2007)  TREC Entity Track (2009)  TAC Knowledge Base Population (TAC-KBP – 2009)  …
  • 16. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge ACE “ Recognition of entities, not just names. In the ACE entity detection and tracking (EDT) task, all mentions of an entity, whether a name, a description, or a pronoun, are to be found and collected into equivalence classes based on reference to the same entity.” (Doddington et al. 2004)
  • 17. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge CoNLL “ Named entities are phrases that contain names of persons, organizations, locations, times and quantities. (…) We will concentrate on four types of named entities: persons, locations, organizations and names of miscellaneous entities that do not belong to the previous three groups.(…)” (http://www.clips.ua.ac.be/conll2002/ner/)
  • 18. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge INEX Entity Ranking Track “ (…) entities (such as countries, people and dates) requires the estimation of relevance of items (i.e., instances of entities) (…) we restricted candidate items to those entities that have their own Wikipedia article.” (De Vries et al. 2007)
  • 19. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge TREC Entity Track “ A web entity is uniquely identifiable by one of its primary homepages. Real-world entities can be represented by multiple homepages.” (Balog et al. 2009)
  • 20. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge TAC-KBP “ The tasks will be structured by having participants process a list of target entities. The list will contain entity types of Person, Organization and Geo-Political Entity.” (http://apl.jhu.edu/~paulmac/kbp/090601-KBPTaskGuidelines.pdf)
  • 21. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge What is a Named Entity?  Proper nouns  Water? Whale? Twelve o’clock?  Rigid designator  Richard Nixon (V) vs President of the United States (X)  Unique identifier  “(…) virtually everything could be referred to uniquely, depending on the context or the previous knowledge of the receiver, although a unique identifier for one receiver might not be so for another one, either because of lack of shared knowledge or the ambiguity of the context.”  Purpose and domain of application
  • 22. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Evaluation  As the definition changes the evaluation changes. Each challenge has different..  .. types of Named Entity to identify  .. identification and annotation criteria  .. valid boundaries of a Named Entity
  • 23. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Other problems  How to evaluate current tools with different definitions of Named Entities?
  • 24. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Other problems  How to evaluate current tools with different definitions of Named Entities?  Using only Person, Organization and Place.  Using only those tools which work with numbers and dates.  Using current annotated corpora (and see what happens).
  • 25. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Other problems  How to evaluate current tools with different definitions of Named Entities?  Using only Person, Organization and Place.  Using only those tools which work with numbers and dates.  Using current annotated corpora (and see what happens).  How to choose the best tool?
  • 26. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Other problems  How to evaluate current tools with different definitions of Named Entities?  Using only Person, Organization and Place.  Using only those tools which work with numbers and dates.  Using current annotated corpora (and see what happen).  How to choose the best tool?  It depends on the application.
  • 27. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Conclusions  Is NER really solved?
  • 28. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Conclusions  Is NER really solved?  Content Validity – Reflect the needs of the real user.
  • 29. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Conclusions  Is NER really solved?  Content Validity  External Validity – The experiments can be generalized to other populations and experimental settings.
  • 30. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Conclusions  Is NER really solved?  Content Validity  External Validity  Convergent Validity – The results agree with other results, theoretical or experimental.
  • 31. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Conclusions  Is NER really solved?  Content Validity  External Validity  Convergent Validity  Conclusion Validity – The conclusions drawn from the results are justified.
  • 32. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Conclusions  Is NER really solved?  Content Validity  External Validity  Convergent Validity  Conclusion Validity “There is not enough evidence to support the statement that NER is solved: it rather suggests the opposite”
  • 33. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Conclusions  What about..  genes and diseases?  entities identified by the same name as their classes? (ambulance, airplane, and so on)  entities identified by their attributes and description  entities…
  • 34. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge Conclusions What is an entity?
  • 35. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge My PhD thesis  How is it related to my PhD topic?
  • 36. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge My PhD thesis  How is it related to my PhD topic?  Entity Linking is the identification and disambiguation of entities using a background knowledge base.  Entity Recognition is the first step.  What is an entity?  And more.. What is an entity in different domains?
  • 37. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge References  (Balog et al 2009)  Balog, Krisztian, et al. “Overview of the TREC 2009 Entity Track.” 2009  (Doddington et al 2004)  Doddington, George, et al. “The automatic content extraction (ACE) program-tasks, data, and evaluation.” Proceedings of LREC. Vol 4. 2004.  (De Vries et al 2007)  De Vries, Arjen P., et al. “Overview of the INEX 2007 entity ranking track.” Focused Access to XML Documents. Springer Berlin Heidelberg, 2008. 245-251.