SlideShare ist ein Scribd-Unternehmen logo
1 von 40
27 maj 2014
Overview of scenarios
Scenarios | Benefits of using entity extraction
Explore your content
Explore the enterprise graph
Discover insights about
your products
Monitor trends
Discover new expertise
inside your organization
Find the people with the right
competences
Enhance search
navigation
Filter unstructured data
Scenarios | Benefits of using entity extraction
Prevent duplicate work
Find similar content
Help your users find their
dream home
Extract potential decision criteria
from natural language
Visualize your content in
a new way
Enrich documents with metadata
Discover new expertise
inside your organization
Find the people with the right competences
Motivation
• Search for “usability”
• Only people that have tagged
themselves with “usability” will be
returned
• If we rely only on standard
category types, database
information, we get only what is in
that person database
• But what if you could find also
those that write, blog, or tweet
about “usability”, without them
being explicitly tagged with this
category?
Enhanced search index
• The search index is enhanced with information about what topics,
keywords, people, places, etc. authors write about
• Search for “usability”
• Get improved search results
 Discover competences people
have
 Discover interests people have
and share
 Gather all people writing about the
same topic
Enhanced expertise search
Enhance search navigation
Filter unstructured data
Motivation
• Search for “yoga”
• Lots of semi-structured
documents (HTML, Word,
PDF, etc)
• Some are missing
administrative metadata such
as author, date last saved
• Some are missing
descriptive metadata such as
title, topic, tags, category
No proper title
Will you go through
all results to find
the relevant ones?
Extract named entities and metadata
• Identity and add to document information such as title, keywords,
author, summary, subsection titles
New filters and improved metadata
• Search for “yoga”
• The newly created data is
used to filter documents and
improve relevance
 Improved visual results
(documents have titles)
 Improved relevance (titles
and subsection titles are
ranked higher than body text)
 Possibility to filter on authors,
topics, places, etc (use the
filter rather than pagination)
Explore your content
Explore the enterprise graph
Motivation
• Search for ‘Copenhagen’ on
your intranet
• Ambiguous query
• Lots of results
• Missing context
• What is the user intent with
this query?
Relationship Extraction for Entities
• Extract relations from unstructured data
• Built upon named entity recognition
• Relationship extraction enables us to do build a graph search
solution with unstructured data
Lorem ipsum dolor sit amet Sarah Jensen, consectetur adipiscing elit Philadelphia et
Copenhagen. Fusce nec placerat libero. Suspendisse nibh quam, sodales in posuere
ac, porttitor non erat. Sed semper sodales varius. Fusce elementum Findwise, enim
sed semper ultrices Carl Sorensen, nisl ligula consectetur sapien, non feugiat sapien
enim id quam. Class aptent taciti sociosqu ad litora torquent per conubia nostra, per
inceptos himenaeos. Nullam egestas non velit nec accumsan. Google at orci augue.
Proin tempus tristique arcu, a lobortis diam tempus ut. Nam arcu risus, tempor nec elit
eu, Anders Anderson posuere viverra mauris. Donec tempor in magna in mollis.
Suspendisse in elementum magna. Findwise in faucibus sapien, et Microsoft. Fusce
ullamcorper malesuada sapien, sit amet viverra odio bibendum sed. Fusce molestie
vel tortor nec eleifend. Nullam et leo ac felis iaculis convallis.
Lorem ipsum dolor sit amet Sarah Jensen, consectetur adipiscing elit Philadelphia et
Copenhagen. Fusce nec placerat libero. Suspendisse nibh quam, sodales in posuere
ac, porttitor non erat. Sed semper sodales varius. Fusce elementum Findwise, enim
sed semper ultrices Carl Sorensen, nisl ligula consectetur sapien, non feugiat sapien
enim id quam. Class aptent taciti sociosqu ad litora torquent per conubia nostra, per
inceptos himenaeos. Nullam egestas non velit nec accumsan. Google at orci augue.
Proin tempus tristique arcu, a lobortis diam tempus ut. Nam arcu risus, tempor nec elit
eu, Anders Anderson posuere viverra mauris. Donec tempor in magna in mollis.
Suspendisse in elementum magna. Findwise in faucibus sapien, et Microsoft. Fusce
ullamcorper malesuada sapien, sit amet viverra odio bibendum sed. Fusce molestie
vel tortor nec eleifend. Nullam et leo ac felis iaculis convallis.
Lorem ipsum dolor sit amet Sarah Jensen, consectetur adipiscing elit Philadelphia et
Copenhagen. Fusce nec placerat libero. Suspendisse nibh quam, sodales in posuere
ac, porttitor non erat. Sed semper sodales varius. Fusce elementum Findwise, enim
sed semper ultrices Carl Sorensen, nisl ligula consectetur sapien, non feugiat sapien
enim id quam. Class aptent taciti sociosqu ad litora torquent per conubia nostra, per
inceptos himenaeos. Nullam egestas non velit nec accumsan. Google at orci augue.
Proin tempus tristique arcu, a lobortis diam tempus ut. Nam arcu risus, tempor nec elit
eu, Anders Anderson posuere viverra mauris. Donec tempor in magna in mollis.
Suspendisse in elementum magna. Findwise in faucibus sapien, et Microsoft. Fusce
ullamcorper malesuada sapien, sit amet viverra odio bibendum sed. Fusce molestie
vel tortor nec eleifend. Nullam et leo ac felis iaculis convallis.
Sarah Jensen
Philadelphia
Copenhagen
Google
Anders Anderson
Findwise
Microsoft
Carl Sorensen
Sarah Jensen
Philadelphia
Copenhagen
Google
Anders Anderson
Findwise
Microsoft
Carl Sorensen
Suggestions as you type, using the
graph
• Search for ‘Copenhagen’ on
your intranet
 Narrow down search results
directly from the search box
 Disambiguate the query by
selecting one of the different
type of suggestions
(consultants, projects,
partners)
 Navigate directly to 2nd or
higher level connections on
the graph
Business Intelligence, using the graph
• Search for: ’Customers where we have done Projects based on
Google technology with at least 1000 hour consulting time and a revenue
of more than 1 MDKK and the word ”e-commerce” is mentioned many
times in the Project Documentation’
Business Intelligence
Project numbers
(worked hours)
Financial
numbers
(revenue,
profits)
Project
Documen
tation
How would this
query look like
in SQL?
Discover insights
about your products
Monitor trends
Motivation
• Search for the product name ‘Tusin’
• Product is mentioned in different
sources, under different contexts (user
feedback, marketing material, internal
specifications), and using different
terminologies (on social media
compared to website)
• How to keep track of all information?
• How easy is it to identify trends?
Identify the same product in different contexts
• Identify the entity denoting the same product from different
sources
Tusin
Azure
Internal name
for the same
product
Word
Doc
Internal
Production
Specification
PDF
Doc
Product
Marketing
Material from
Website
User
Comment
Feedback
about the
marketing
material / the
experience of
the user
User
Tweet
Mentions the
product
Product
Video
Video
View
Video
comment
User
feedback
Metric
Task
item
Internal
Issues
Management
System
Internal
News
Monitor trends on your products
• Search for ‘Tusin’
or
• Remember it as a search
term and create a
dashboard with content
driven by search
 Monitor trends
 Reduce time for replying
customers or users
 Stay competitive
Prevent duplicate work
Find similar content
Motivation
• Just started working on a new
material in a construction
company
• What is the cost of
duplicating the work?
• Will you perform a search on
previous work?
• What if another team has a
similar initiative?
Enhanced Search Index
• Automatically extract entities and representative keywords from
content
Documents
Announcements
Public EmailsNewsfeed
Steel Structures
Glass Type 1.A
Project ANSATorso Tower
Polyethylene Terephthalate
Prevent duplicate work
• Get suggestions of
similar work based
on extracted entities
 Identify similar work
early in the project
 Identify potential
collaborations
 Prevent duplicate
work
Visualize your content
in a new way
Enrich documents with metadata
Motivation
• Search for “financial results
Copenhagen”
• Search results: documents
• Clicking on a result opens
the document
• Does this search answer
the user question?
Identify entities in documents
• Identify locations, revenues, departments, etc from semi-
unstructured data
• Combine with data in spreadsheets or databases
Documents
Database
Spreadsheets
Answer
Visualize your content in a new way
• Search for “financial results
Copenhagen”
• Additional information shown
• Can show computed results
 Enrich documents with
metadata
 Visualise the content
 Compute answers
 Make comparisons
 Create dashboards based on
searches
Help your users find
their dream home
Extract potential decision criteria from
natural language
Motivation
• Searching for an ‘apartment with a
good view, located in central Copenhagen,
well sized bathroom, close to shopping
outlets, preferably with 3 rooms’
• The apartment information
consists of mostly structured data
(m2, number of rooms, post
number, floor)
• Can we improve the search
experience?
Long list of static
filters
Search query consists
of an area (post code,
street etc.)
Understanding what the users want
• Here’s how Facebook helps users define their queries:
• Can we interpret the query ‘apartment with a good view, located in
central Copenhagen, well sized bathroom, close to shopping outlets,
preferably with 3 rooms’ ?
Understanding what the users want
• Searching for ‘apartment with a good
view, located in central Copenhagen, well
sized bathroom, close to shopping
outlets, preferably with 3 rooms’
• Apartments with 3 rooms are shown
in search results but those with less
are not excluded
• Those that mention shopping outlets
(such as Netto or Fakta) are boosted
 Interpret natural language
 Boost results based on ‘preferences’
 Better search experience
 Increase user satisfaction
Boost those with 3 rooms
(boost on map can be
represented by a bigger
pointer)
Free text search
Behind the scene
Entity Extraction
Entity extraction is the process of identifying named entities (such as locations,
people, companies) in a block of text
Add structure to
unstructured data
New possibilities of
interpreting the data
Improve data quality and
findability of documents
Reduce time spent by
users manually
structuring content
Entity Extraction Framework
Combines dictionaries
with trained model and
regular expressions
based on needs
Scalable, adaptable and
extendable framework
Automatically enrich
documents with named
entities
Iterative approach to
continuously improve
accuracy
Built by Findwise as a reply to our customer requirements and vision
Entity Extraction Framework
Autotag
Edit
Evaluate Incremental
train
90% accuracy
The Danish and Swedish
entity extractors can
reach 90% accuracy
Graphical Annotation Tool
Visual representation of
annotated documents
Annotate more
documents to improve
precision
Easy-to-use, point and
click interaction
Built by Findwise as a reply to our customer requirements and visions
Graphical Annotation Tool
Anders Häggdahl
anders.haggdahl@findwise.com

Weitere ähnliche Inhalte

Was ist angesagt?

Semantic search and the 'new' seo
Semantic search and the 'new' seoSemantic search and the 'new' seo
Semantic search and the 'new' seoRichard Hussey
 
How to search the Internet, a guide to save time and effort
How to search the Internet, a guide to save time and effortHow to search the Internet, a guide to save time and effort
How to search the Internet, a guide to save time and effortPete S
 
Maximizing Your SEO Results Seminar 2-14-2013
Maximizing Your SEO Results Seminar 2-14-2013Maximizing Your SEO Results Seminar 2-14-2013
Maximizing Your SEO Results Seminar 2-14-2013Top Floor Technologies
 
How to Use A Search Engine Effectively
How to Use A Search Engine EffectivelyHow to Use A Search Engine Effectively
How to Use A Search Engine EffectivelyBianca King
 
Maximizing Your SEO Results - June 2013
Maximizing Your SEO Results - June 2013Maximizing Your SEO Results - June 2013
Maximizing Your SEO Results - June 2013Top Floor Technologies
 
Tips and tools for effective SEO and brand recognition - eCommerce Expo Melbo...
Tips and tools for effective SEO and brand recognition - eCommerce Expo Melbo...Tips and tools for effective SEO and brand recognition - eCommerce Expo Melbo...
Tips and tools for effective SEO and brand recognition - eCommerce Expo Melbo...Bespoke Agency
 
SEO Tips, Tactics & Strategies for Outdoor Writers, Authors and Bloggers
SEO Tips, Tactics & Strategies for Outdoor Writers, Authors and BloggersSEO Tips, Tactics & Strategies for Outdoor Writers, Authors and Bloggers
SEO Tips, Tactics & Strategies for Outdoor Writers, Authors and BloggersPaul Krupin
 
Arts Marketing Association North-East Network Meeting: The Evolution of Searc...
Arts Marketing Association North-East Network Meeting: The Evolution of Searc...Arts Marketing Association North-East Network Meeting: The Evolution of Searc...
Arts Marketing Association North-East Network Meeting: The Evolution of Searc...Marty Hayes
 
An Introduction to Python and Machine Learning for Technical SEO | All New Di...
An Introduction to Python and Machine Learning for Technical SEO | All New Di...An Introduction to Python and Machine Learning for Technical SEO | All New Di...
An Introduction to Python and Machine Learning for Technical SEO | All New Di...Ruth Everett
 
Why Can't Anyone Find My Website?
Why Can't Anyone Find My Website?Why Can't Anyone Find My Website?
Why Can't Anyone Find My Website?The URL Dr.
 
Linking SEO & PR
Linking SEO & PRLinking SEO & PR
Linking SEO & PRPegasus
 
Advanced operator-cheat-sheet
Advanced operator-cheat-sheetAdvanced operator-cheat-sheet
Advanced operator-cheat-sheetTanveer Razwan
 
Keyword research in Google AdWords for organic
Keyword research in Google AdWords for organicKeyword research in Google AdWords for organic
Keyword research in Google AdWords for organicJake Aull
 
How to use contexts for content - Put your site on a diet
How to use contexts for content - Put your site on a dietHow to use contexts for content - Put your site on a diet
How to use contexts for content - Put your site on a dietKoen Verbrugge
 
Boosting Traffic to Your Site
Boosting Traffic to Your SiteBoosting Traffic to Your Site
Boosting Traffic to Your SiteNavneet Kaushal
 

Was ist angesagt? (20)

Semantic search and the 'new' seo
Semantic search and the 'new' seoSemantic search and the 'new' seo
Semantic search and the 'new' seo
 
Seo basics
Seo basicsSeo basics
Seo basics
 
How to search the Internet, a guide to save time and effort
How to search the Internet, a guide to save time and effortHow to search the Internet, a guide to save time and effort
How to search the Internet, a guide to save time and effort
 
Maximizing Your SEO Results Seminar 2-14-2013
Maximizing Your SEO Results Seminar 2-14-2013Maximizing Your SEO Results Seminar 2-14-2013
Maximizing Your SEO Results Seminar 2-14-2013
 
How to Use A Search Engine Effectively
How to Use A Search Engine EffectivelyHow to Use A Search Engine Effectively
How to Use A Search Engine Effectively
 
Maximizing Your SEO Results - June 2013
Maximizing Your SEO Results - June 2013Maximizing Your SEO Results - June 2013
Maximizing Your SEO Results - June 2013
 
Tips and tools for effective SEO and brand recognition - eCommerce Expo Melbo...
Tips and tools for effective SEO and brand recognition - eCommerce Expo Melbo...Tips and tools for effective SEO and brand recognition - eCommerce Expo Melbo...
Tips and tools for effective SEO and brand recognition - eCommerce Expo Melbo...
 
SEO Tips, Tactics & Strategies for Outdoor Writers, Authors and Bloggers
SEO Tips, Tactics & Strategies for Outdoor Writers, Authors and BloggersSEO Tips, Tactics & Strategies for Outdoor Writers, Authors and Bloggers
SEO Tips, Tactics & Strategies for Outdoor Writers, Authors and Bloggers
 
Arts Marketing Association North-East Network Meeting: The Evolution of Searc...
Arts Marketing Association North-East Network Meeting: The Evolution of Searc...Arts Marketing Association North-East Network Meeting: The Evolution of Searc...
Arts Marketing Association North-East Network Meeting: The Evolution of Searc...
 
An Introduction to Python and Machine Learning for Technical SEO | All New Di...
An Introduction to Python and Machine Learning for Technical SEO | All New Di...An Introduction to Python and Machine Learning for Technical SEO | All New Di...
An Introduction to Python and Machine Learning for Technical SEO | All New Di...
 
Why Can't Anyone Find My Website?
Why Can't Anyone Find My Website?Why Can't Anyone Find My Website?
Why Can't Anyone Find My Website?
 
5 Search Engine Marketing Tips (Carmarthen Tourism Event)
5 Search Engine Marketing Tips (Carmarthen Tourism Event)5 Search Engine Marketing Tips (Carmarthen Tourism Event)
5 Search Engine Marketing Tips (Carmarthen Tourism Event)
 
Seo and Content Presentation
Seo and Content PresentationSeo and Content Presentation
Seo and Content Presentation
 
SEO Presentation at Introbiz Business Show 2013
SEO Presentation at Introbiz Business Show 2013SEO Presentation at Introbiz Business Show 2013
SEO Presentation at Introbiz Business Show 2013
 
Linking SEO & PR
Linking SEO & PRLinking SEO & PR
Linking SEO & PR
 
Advanced operator-cheat-sheet
Advanced operator-cheat-sheetAdvanced operator-cheat-sheet
Advanced operator-cheat-sheet
 
Class 6 seo class
Class 6   seo class Class 6   seo class
Class 6 seo class
 
Keyword research in Google AdWords for organic
Keyword research in Google AdWords for organicKeyword research in Google AdWords for organic
Keyword research in Google AdWords for organic
 
How to use contexts for content - Put your site on a diet
How to use contexts for content - Put your site on a dietHow to use contexts for content - Put your site on a diet
How to use contexts for content - Put your site on a diet
 
Boosting Traffic to Your Site
Boosting Traffic to Your SiteBoosting Traffic to Your Site
Boosting Traffic to Your Site
 

Ähnlich wie Learn more about Entity Extraction May 2014

Hvordan få søk til å fungere effektivt
Hvordan få søk til å fungere effektivtHvordan få søk til å fungere effektivt
Hvordan få søk til å fungere effektivtKristian Norling
 
The art of intranet search
The art of intranet searchThe art of intranet search
The art of intranet searchSam Marshall
 
Search Analytics for Fun and Profit
Search Analytics for Fun and ProfitSearch Analytics for Fun and Profit
Search Analytics for Fun and ProfitLouis Rosenfeld
 
Keyword Research For SEO 2022 | How To Do Keyword Research? | Keyword Researc...
Keyword Research For SEO 2022 | How To Do Keyword Research? | Keyword Researc...Keyword Research For SEO 2022 | How To Do Keyword Research? | Keyword Researc...
Keyword Research For SEO 2022 | How To Do Keyword Research? | Keyword Researc...Simplilearn
 
Introduction to enterprise search for intranets and websites
Introduction to enterprise search for intranets and websitesIntroduction to enterprise search for intranets and websites
Introduction to enterprise search for intranets and websitesKristian Norling
 
Te damos la bienvenida a una nueva forma de realizar búsquedas
Te damos la bienvenida a una nueva forma de realizar búsquedasTe damos la bienvenida a una nueva forma de realizar búsquedas
Te damos la bienvenida a una nueva forma de realizar búsquedasImma Valls Bernaus
 
Te damos la bienvenida a una nueva forma de realizar búsquedas
Te damos la bienvenida a una nueva forma de realizar búsquedasTe damos la bienvenida a una nueva forma de realizar búsquedas
Te damos la bienvenida a una nueva forma de realizar búsquedasImma Valls Bernaus
 
Te damos la bienvenida a una nueva forma de realizar búsquedas
Te damos la bienvenida a una nueva forma de realizar búsquedas Te damos la bienvenida a una nueva forma de realizar búsquedas
Te damos la bienvenida a una nueva forma de realizar búsquedas Elasticsearch
 
Optimising Your Content for findability
Optimising Your Content for findabilityOptimising Your Content for findability
Optimising Your Content for findabilityKristian Norling
 
How to Run LinkedIn Searches Like a Pro [Webcast]
How to Run LinkedIn Searches Like a Pro [Webcast]How to Run LinkedIn Searches Like a Pro [Webcast]
How to Run LinkedIn Searches Like a Pro [Webcast]LinkedIn Talent Solutions
 
Introduction to Enterprise Search
Introduction to Enterprise SearchIntroduction to Enterprise Search
Introduction to Enterprise SearchFindwise
 
Search Quality Management
Search Quality ManagementSearch Quality Management
Search Quality ManagementAgnes Molnar
 
Lean Analytics & Analytics Dashboards
Lean Analytics & Analytics DashboardsLean Analytics & Analytics Dashboards
Lean Analytics & Analytics DashboardsYves Ferket
 
The evolution of Search spscinci
The evolution of Search spscinciThe evolution of Search spscinci
The evolution of Search spscinciJohnny Lopez
 
Search Analytics: Conversations with Your Customers
Search Analytics: Conversations with Your CustomersSearch Analytics: Conversations with Your Customers
Search Analytics: Conversations with Your Customersrichwig
 
SharePoint Saturday Belgium - Contextual Search and More..
SharePoint Saturday Belgium - Contextual Search and More..SharePoint Saturday Belgium - Contextual Search and More..
SharePoint Saturday Belgium - Contextual Search and More..Mikael Svenson
 
Keyword research tools for Search Engine Optimisation (SEO)
Keyword research tools for Search Engine Optimisation (SEO)Keyword research tools for Search Engine Optimisation (SEO)
Keyword research tools for Search Engine Optimisation (SEO)Duncan MacGruer
 

Ähnlich wie Learn more about Entity Extraction May 2014 (20)

Hvordan få søk til å fungere effektivt
Hvordan få søk til å fungere effektivtHvordan få søk til å fungere effektivt
Hvordan få søk til å fungere effektivt
 
The art of intranet search
The art of intranet searchThe art of intranet search
The art of intranet search
 
Search Analytics for Fun and Profit
Search Analytics for Fun and ProfitSearch Analytics for Fun and Profit
Search Analytics for Fun and Profit
 
Getting started with Microsoft Search
Getting started with Microsoft Search Getting started with Microsoft Search
Getting started with Microsoft Search
 
Starting a search application
Starting a search applicationStarting a search application
Starting a search application
 
Keyword Research For SEO 2022 | How To Do Keyword Research? | Keyword Researc...
Keyword Research For SEO 2022 | How To Do Keyword Research? | Keyword Researc...Keyword Research For SEO 2022 | How To Do Keyword Research? | Keyword Researc...
Keyword Research For SEO 2022 | How To Do Keyword Research? | Keyword Researc...
 
Introduction to enterprise search for intranets and websites
Introduction to enterprise search for intranets and websitesIntroduction to enterprise search for intranets and websites
Introduction to enterprise search for intranets and websites
 
Te damos la bienvenida a una nueva forma de realizar búsquedas
Te damos la bienvenida a una nueva forma de realizar búsquedasTe damos la bienvenida a una nueva forma de realizar búsquedas
Te damos la bienvenida a una nueva forma de realizar búsquedas
 
Te damos la bienvenida a una nueva forma de realizar búsquedas
Te damos la bienvenida a una nueva forma de realizar búsquedasTe damos la bienvenida a una nueva forma de realizar búsquedas
Te damos la bienvenida a una nueva forma de realizar búsquedas
 
Te damos la bienvenida a una nueva forma de realizar búsquedas
Te damos la bienvenida a una nueva forma de realizar búsquedas Te damos la bienvenida a una nueva forma de realizar búsquedas
Te damos la bienvenida a una nueva forma de realizar búsquedas
 
Optimising Your Content for findability
Optimising Your Content for findabilityOptimising Your Content for findability
Optimising Your Content for findability
 
How to Run LinkedIn Searches Like a Pro [Webcast]
How to Run LinkedIn Searches Like a Pro [Webcast]How to Run LinkedIn Searches Like a Pro [Webcast]
How to Run LinkedIn Searches Like a Pro [Webcast]
 
Introduction to Enterprise Search
Introduction to Enterprise SearchIntroduction to Enterprise Search
Introduction to Enterprise Search
 
Search Quality Management
Search Quality ManagementSearch Quality Management
Search Quality Management
 
Everything You Wish You Knew About Search
Everything You Wish You Knew About SearchEverything You Wish You Knew About Search
Everything You Wish You Knew About Search
 
Lean Analytics & Analytics Dashboards
Lean Analytics & Analytics DashboardsLean Analytics & Analytics Dashboards
Lean Analytics & Analytics Dashboards
 
The evolution of Search spscinci
The evolution of Search spscinciThe evolution of Search spscinci
The evolution of Search spscinci
 
Search Analytics: Conversations with Your Customers
Search Analytics: Conversations with Your CustomersSearch Analytics: Conversations with Your Customers
Search Analytics: Conversations with Your Customers
 
SharePoint Saturday Belgium - Contextual Search and More..
SharePoint Saturday Belgium - Contextual Search and More..SharePoint Saturday Belgium - Contextual Search and More..
SharePoint Saturday Belgium - Contextual Search and More..
 
Keyword research tools for Search Engine Optimisation (SEO)
Keyword research tools for Search Engine Optimisation (SEO)Keyword research tools for Search Engine Optimisation (SEO)
Keyword research tools for Search Engine Optimisation (SEO)
 

Kürzlich hochgeladen

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationRadu Cotescu
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxKatpro Technologies
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxMalak Abu Hammad
 

Kürzlich hochgeladen (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure serviceWhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptxFactors to Consider When Choosing Accounts Payable Services Providers.pptx
Factors to Consider When Choosing Accounts Payable Services Providers.pptx
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
The Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptxThe Codex of Business Writing Software for Real-World Solutions 2.pptx
The Codex of Business Writing Software for Real-World Solutions 2.pptx
 

Learn more about Entity Extraction May 2014

  • 3. Scenarios | Benefits of using entity extraction Explore your content Explore the enterprise graph Discover insights about your products Monitor trends Discover new expertise inside your organization Find the people with the right competences Enhance search navigation Filter unstructured data
  • 4. Scenarios | Benefits of using entity extraction Prevent duplicate work Find similar content Help your users find their dream home Extract potential decision criteria from natural language Visualize your content in a new way Enrich documents with metadata
  • 5. Discover new expertise inside your organization Find the people with the right competences
  • 6. Motivation • Search for “usability” • Only people that have tagged themselves with “usability” will be returned • If we rely only on standard category types, database information, we get only what is in that person database • But what if you could find also those that write, blog, or tweet about “usability”, without them being explicitly tagged with this category?
  • 7. Enhanced search index • The search index is enhanced with information about what topics, keywords, people, places, etc. authors write about
  • 8. • Search for “usability” • Get improved search results  Discover competences people have  Discover interests people have and share  Gather all people writing about the same topic Enhanced expertise search
  • 10. Motivation • Search for “yoga” • Lots of semi-structured documents (HTML, Word, PDF, etc) • Some are missing administrative metadata such as author, date last saved • Some are missing descriptive metadata such as title, topic, tags, category No proper title Will you go through all results to find the relevant ones?
  • 11. Extract named entities and metadata • Identity and add to document information such as title, keywords, author, summary, subsection titles
  • 12. New filters and improved metadata • Search for “yoga” • The newly created data is used to filter documents and improve relevance  Improved visual results (documents have titles)  Improved relevance (titles and subsection titles are ranked higher than body text)  Possibility to filter on authors, topics, places, etc (use the filter rather than pagination)
  • 13. Explore your content Explore the enterprise graph
  • 14. Motivation • Search for ‘Copenhagen’ on your intranet • Ambiguous query • Lots of results • Missing context • What is the user intent with this query?
  • 15. Relationship Extraction for Entities • Extract relations from unstructured data • Built upon named entity recognition • Relationship extraction enables us to do build a graph search solution with unstructured data Lorem ipsum dolor sit amet Sarah Jensen, consectetur adipiscing elit Philadelphia et Copenhagen. Fusce nec placerat libero. Suspendisse nibh quam, sodales in posuere ac, porttitor non erat. Sed semper sodales varius. Fusce elementum Findwise, enim sed semper ultrices Carl Sorensen, nisl ligula consectetur sapien, non feugiat sapien enim id quam. Class aptent taciti sociosqu ad litora torquent per conubia nostra, per inceptos himenaeos. Nullam egestas non velit nec accumsan. Google at orci augue. Proin tempus tristique arcu, a lobortis diam tempus ut. Nam arcu risus, tempor nec elit eu, Anders Anderson posuere viverra mauris. Donec tempor in magna in mollis. Suspendisse in elementum magna. Findwise in faucibus sapien, et Microsoft. Fusce ullamcorper malesuada sapien, sit amet viverra odio bibendum sed. Fusce molestie vel tortor nec eleifend. Nullam et leo ac felis iaculis convallis. Lorem ipsum dolor sit amet Sarah Jensen, consectetur adipiscing elit Philadelphia et Copenhagen. Fusce nec placerat libero. Suspendisse nibh quam, sodales in posuere ac, porttitor non erat. Sed semper sodales varius. Fusce elementum Findwise, enim sed semper ultrices Carl Sorensen, nisl ligula consectetur sapien, non feugiat sapien enim id quam. Class aptent taciti sociosqu ad litora torquent per conubia nostra, per inceptos himenaeos. Nullam egestas non velit nec accumsan. Google at orci augue. Proin tempus tristique arcu, a lobortis diam tempus ut. Nam arcu risus, tempor nec elit eu, Anders Anderson posuere viverra mauris. Donec tempor in magna in mollis. Suspendisse in elementum magna. Findwise in faucibus sapien, et Microsoft. Fusce ullamcorper malesuada sapien, sit amet viverra odio bibendum sed. Fusce molestie vel tortor nec eleifend. Nullam et leo ac felis iaculis convallis. Lorem ipsum dolor sit amet Sarah Jensen, consectetur adipiscing elit Philadelphia et Copenhagen. Fusce nec placerat libero. Suspendisse nibh quam, sodales in posuere ac, porttitor non erat. Sed semper sodales varius. Fusce elementum Findwise, enim sed semper ultrices Carl Sorensen, nisl ligula consectetur sapien, non feugiat sapien enim id quam. Class aptent taciti sociosqu ad litora torquent per conubia nostra, per inceptos himenaeos. Nullam egestas non velit nec accumsan. Google at orci augue. Proin tempus tristique arcu, a lobortis diam tempus ut. Nam arcu risus, tempor nec elit eu, Anders Anderson posuere viverra mauris. Donec tempor in magna in mollis. Suspendisse in elementum magna. Findwise in faucibus sapien, et Microsoft. Fusce ullamcorper malesuada sapien, sit amet viverra odio bibendum sed. Fusce molestie vel tortor nec eleifend. Nullam et leo ac felis iaculis convallis. Sarah Jensen Philadelphia Copenhagen Google Anders Anderson Findwise Microsoft Carl Sorensen Sarah Jensen Philadelphia Copenhagen Google Anders Anderson Findwise Microsoft Carl Sorensen
  • 16. Suggestions as you type, using the graph • Search for ‘Copenhagen’ on your intranet  Narrow down search results directly from the search box  Disambiguate the query by selecting one of the different type of suggestions (consultants, projects, partners)  Navigate directly to 2nd or higher level connections on the graph
  • 17. Business Intelligence, using the graph • Search for: ’Customers where we have done Projects based on Google technology with at least 1000 hour consulting time and a revenue of more than 1 MDKK and the word ”e-commerce” is mentioned many times in the Project Documentation’ Business Intelligence Project numbers (worked hours) Financial numbers (revenue, profits) Project Documen tation How would this query look like in SQL?
  • 18. Discover insights about your products Monitor trends
  • 19. Motivation • Search for the product name ‘Tusin’ • Product is mentioned in different sources, under different contexts (user feedback, marketing material, internal specifications), and using different terminologies (on social media compared to website) • How to keep track of all information? • How easy is it to identify trends?
  • 20. Identify the same product in different contexts • Identify the entity denoting the same product from different sources Tusin Azure Internal name for the same product Word Doc Internal Production Specification PDF Doc Product Marketing Material from Website User Comment Feedback about the marketing material / the experience of the user User Tweet Mentions the product Product Video Video View Video comment User feedback Metric Task item Internal Issues Management System Internal News
  • 21. Monitor trends on your products • Search for ‘Tusin’ or • Remember it as a search term and create a dashboard with content driven by search  Monitor trends  Reduce time for replying customers or users  Stay competitive
  • 22. Prevent duplicate work Find similar content
  • 23. Motivation • Just started working on a new material in a construction company • What is the cost of duplicating the work? • Will you perform a search on previous work? • What if another team has a similar initiative?
  • 24. Enhanced Search Index • Automatically extract entities and representative keywords from content Documents Announcements Public EmailsNewsfeed Steel Structures Glass Type 1.A Project ANSATorso Tower Polyethylene Terephthalate
  • 25. Prevent duplicate work • Get suggestions of similar work based on extracted entities  Identify similar work early in the project  Identify potential collaborations  Prevent duplicate work
  • 26. Visualize your content in a new way Enrich documents with metadata
  • 27. Motivation • Search for “financial results Copenhagen” • Search results: documents • Clicking on a result opens the document • Does this search answer the user question?
  • 28. Identify entities in documents • Identify locations, revenues, departments, etc from semi- unstructured data • Combine with data in spreadsheets or databases Documents Database Spreadsheets Answer
  • 29. Visualize your content in a new way • Search for “financial results Copenhagen” • Additional information shown • Can show computed results  Enrich documents with metadata  Visualise the content  Compute answers  Make comparisons  Create dashboards based on searches
  • 30. Help your users find their dream home Extract potential decision criteria from natural language
  • 31. Motivation • Searching for an ‘apartment with a good view, located in central Copenhagen, well sized bathroom, close to shopping outlets, preferably with 3 rooms’ • The apartment information consists of mostly structured data (m2, number of rooms, post number, floor) • Can we improve the search experience? Long list of static filters Search query consists of an area (post code, street etc.)
  • 32. Understanding what the users want • Here’s how Facebook helps users define their queries: • Can we interpret the query ‘apartment with a good view, located in central Copenhagen, well sized bathroom, close to shopping outlets, preferably with 3 rooms’ ?
  • 33. Understanding what the users want • Searching for ‘apartment with a good view, located in central Copenhagen, well sized bathroom, close to shopping outlets, preferably with 3 rooms’ • Apartments with 3 rooms are shown in search results but those with less are not excluded • Those that mention shopping outlets (such as Netto or Fakta) are boosted  Interpret natural language  Boost results based on ‘preferences’  Better search experience  Increase user satisfaction Boost those with 3 rooms (boost on map can be represented by a bigger pointer) Free text search
  • 35. Entity Extraction Entity extraction is the process of identifying named entities (such as locations, people, companies) in a block of text Add structure to unstructured data New possibilities of interpreting the data Improve data quality and findability of documents Reduce time spent by users manually structuring content
  • 36. Entity Extraction Framework Combines dictionaries with trained model and regular expressions based on needs Scalable, adaptable and extendable framework Automatically enrich documents with named entities Iterative approach to continuously improve accuracy Built by Findwise as a reply to our customer requirements and vision
  • 37. Entity Extraction Framework Autotag Edit Evaluate Incremental train 90% accuracy The Danish and Swedish entity extractors can reach 90% accuracy
  • 38. Graphical Annotation Tool Visual representation of annotated documents Annotate more documents to improve precision Easy-to-use, point and click interaction Built by Findwise as a reply to our customer requirements and visions