SlideShare ist ein Scribd-Unternehmen logo
1 von 13
Downloaden Sie, um offline zu lesen
How Text Analytics
Increases Search
Relevance
1
Users care about findability, not search
Findability is the ease of which someone can locate
the information they want. Often, it is confused
with search – but search is just one method of
achieving findability. Search allows people to enter
in words that they hope are contained in the
content they want to retrieve. Findability includes
any method of locating this content, including but
not limited to searching. Pingar DiscoveryOne
improves findability.
2
Significantly, findability includes Facetted Search. Facetted
search allows people to filter a search by various categories
and topics to remove irrelevant search results and more
rapidly spot the content they are looking for. Facets can also
be used to filter lists and views as well as search results.
Studies1 show that users evaluated facetted search as the
most desirable feature to improve findability.
Example of a facetted search by Category
1 Divoli, A. and Medelyan, A. Search interface feature evaluation in biosciences, HCIR 2011,
Google Mountain View, CA, USA Workshops
3
By removing the irrelevant content, facetted search
improves search relevance – the number of useful
documents on the first page of search results.
Without facetted search, your investment in
enterprise search cannot deliver its full potential.
Facetted search however relies on documents being
categorized and tagged with keywords and phrases
associated with them – this is called metadata.
Without metadata, there can be no facetted search.
Unfortunately your staff do not enter metadata.
Some systems, such as email, may not even allow
users to enter metadata. This is why we created
DiscoveryOne – it’s an automated way to add
metadata.
4
Users get most benefit from facetted
search with key phrases
Key words and phrases are the most beneficial metadata
for facetted search. When searching to gather specific
information or to find facts, people prefer a few relevant
facets of Pingar generated keyphrases. If you have
facetted search in your Enterprise Content Management
System (ECMS) or Enterprise Search engine, then facetted
search on keywords is critical.
As employees are unlikely to record keyphrases, they
must be automatically identified by a machine system
such as Pingar DiscoveryOne. DiscoveryOne reads a
document and identifies the words and phrases that best
describes the topics inside a document.
5
Document categories can also be useful
In addition to keyphrases, organizations define metadata
such as what project a document belongs to or which
client or product line, etc. Matching these to a document
allows this metadata to be used with facetted search as
well.
Unlike traditional technology, DiscoveryOne has two
advanced forms of categorizing content automatically:
• By topic (e.g. product or projects or known issues)
• By content-type (e.g. employment contract or financial
statement)
6
Categorizing by topic with taxonomies
Categorizing content by topic uses taxonomies. Taxonomies are a
pre-defined set of categories.
Taxonomies can be flat lists Taxonomies can have hierarchy
7
Taxonomy categorization works well when you:
• Have a clear idea of the categories you want
• Can determine the words and phrases that a
document will have to indicate what category it
matches
This is where Pingar text analytics expertise
becomes useful. Pingar does more than match the
names of the categories when it categorizes by
topic.
8
It also:
9
Traditional systems tried to use arcane rules that
your employees would have to learn and enter in,
however the modern text analytics developed by
Pingar does not require that, so it’s faster and less
expensive to implement.
10
Categorizing by content type with
statistical models
Taxonomy categorization does not work well with
determining what the nature of the content is – is
it a letter or a brochure or a contract or financial
statement? Statistical models are far superior
when categorizing documents by content type and
traditional technologies do not allow for this.
Statistical models are also useful when you don’t
know in advance what words are going to occur.
11
Text categorization uses a statistical model
built specially for the categories and Pingar
tools enable this. In order to generate the
model, example documents of each category
are fed into the tool and the tool learns what
makes documents in each category the same.
12
w w w . p i n g a r. c o m
North America
440 N. Wolfe Rd, CA 94085
Sunnyvale, USA
+1 408 663 2328
Asia Pacific
55 Anzac Ave, 1010
Auckland, New Zealand
+64 9 950 3299
Thank you
13

Weitere ähnliche Inhalte

Was ist angesagt?

Effective Strategies for Searching Oracle UCM
Effective Strategies for Searching Oracle UCMEffective Strategies for Searching Oracle UCM
Effective Strategies for Searching Oracle UCM
Fishbowl Solutions
 
Volume 2-issue-6-2016-2020
Volume 2-issue-6-2016-2020Volume 2-issue-6-2016-2020
Volume 2-issue-6-2016-2020
Editor IJARCET
 
SharePoint User Group Meeting- SharePoint 2013 Search
SharePoint User Group Meeting- SharePoint 2013 SearchSharePoint User Group Meeting- SharePoint 2013 Search
SharePoint User Group Meeting- SharePoint 2013 Search
C/D/H Technology Consultants
 
Implementing Enterprise Search in SharePoint 2010
Implementing Enterprise Search in SharePoint 2010Implementing Enterprise Search in SharePoint 2010
Implementing Enterprise Search in SharePoint 2010
Agnes Molnar
 
ANALYSIS OF ENTERPRISE SHARED RESOURCE INVOCATION SCHEME BASED ON HADOOP AND R
ANALYSIS OF ENTERPRISE SHARED RESOURCE INVOCATION SCHEME BASED ON HADOOP AND RANALYSIS OF ENTERPRISE SHARED RESOURCE INVOCATION SCHEME BASED ON HADOOP AND R
ANALYSIS OF ENTERPRISE SHARED RESOURCE INVOCATION SCHEME BASED ON HADOOP AND R
ijaia
 

Was ist angesagt? (14)

Chest TermSet GDPR ScanR Presentation
Chest TermSet GDPR ScanR PresentationChest TermSet GDPR ScanR Presentation
Chest TermSet GDPR ScanR Presentation
 
Effective Strategies for Searching Oracle UCM
Effective Strategies for Searching Oracle UCMEffective Strategies for Searching Oracle UCM
Effective Strategies for Searching Oracle UCM
 
Using metadata repositories with search
Using metadata repositories with searchUsing metadata repositories with search
Using metadata repositories with search
 
Structured data and metadata evaluation methodology for organizations looking...
Structured data and metadata evaluation methodology for organizations looking...Structured data and metadata evaluation methodology for organizations looking...
Structured data and metadata evaluation methodology for organizations looking...
 
Modeling & managing metadata for greater productivity
Modeling & managing metadata for greater productivityModeling & managing metadata for greater productivity
Modeling & managing metadata for greater productivity
 
Leveraging Analytics for Dynamic Review Strategies
Leveraging Analytics for Dynamic Review StrategiesLeveraging Analytics for Dynamic Review Strategies
Leveraging Analytics for Dynamic Review Strategies
 
A scalable hybrid research paper recommender system for micro
A scalable hybrid research paper recommender system for microA scalable hybrid research paper recommender system for micro
A scalable hybrid research paper recommender system for micro
 
Volume 2-issue-6-2016-2020
Volume 2-issue-6-2016-2020Volume 2-issue-6-2016-2020
Volume 2-issue-6-2016-2020
 
BPC10 BuckleyMigration-share
BPC10 BuckleyMigration-shareBPC10 BuckleyMigration-share
BPC10 BuckleyMigration-share
 
Starting a search application
Starting a search applicationStarting a search application
Starting a search application
 
SharePoint User Group Meeting- SharePoint 2013 Search
SharePoint User Group Meeting- SharePoint 2013 SearchSharePoint User Group Meeting- SharePoint 2013 Search
SharePoint User Group Meeting- SharePoint 2013 Search
 
Graduation Thesis Sample
Graduation Thesis SampleGraduation Thesis Sample
Graduation Thesis Sample
 
Implementing Enterprise Search in SharePoint 2010
Implementing Enterprise Search in SharePoint 2010Implementing Enterprise Search in SharePoint 2010
Implementing Enterprise Search in SharePoint 2010
 
ANALYSIS OF ENTERPRISE SHARED RESOURCE INVOCATION SCHEME BASED ON HADOOP AND R
ANALYSIS OF ENTERPRISE SHARED RESOURCE INVOCATION SCHEME BASED ON HADOOP AND RANALYSIS OF ENTERPRISE SHARED RESOURCE INVOCATION SCHEME BASED ON HADOOP AND R
ANALYSIS OF ENTERPRISE SHARED RESOURCE INVOCATION SCHEME BASED ON HADOOP AND R
 

Andere mochten auch

Andere mochten auch (6)

Solving SharePoint: Introducing Adhere for Alfresco
Solving SharePoint: Introducing Adhere for AlfrescoSolving SharePoint: Introducing Adhere for Alfresco
Solving SharePoint: Introducing Adhere for Alfresco
 
Turners & Growers (case study)
Turners & Growers (case study)Turners & Growers (case study)
Turners & Growers (case study)
 
Dynamics day 2015: Unlocking the Intelligence of Connected Platforms
Dynamics day 2015: Unlocking the Intelligence of Connected PlatformsDynamics day 2015: Unlocking the Intelligence of Connected Platforms
Dynamics day 2015: Unlocking the Intelligence of Connected Platforms
 
TechEd AU 2014: Microsoft Azure DocumentDB Deep Dive
TechEd AU 2014: Microsoft Azure DocumentDB Deep DiveTechEd AU 2014: Microsoft Azure DocumentDB Deep Dive
TechEd AU 2014: Microsoft Azure DocumentDB Deep Dive
 
Dynamics Day 2015: Dynamics AX and Enterprise Cloud Computing
Dynamics Day 2015: Dynamics AX and Enterprise Cloud ComputingDynamics Day 2015: Dynamics AX and Enterprise Cloud Computing
Dynamics Day 2015: Dynamics AX and Enterprise Cloud Computing
 
ICT and sustainability presentation
ICT and sustainability presentationICT and sustainability presentation
ICT and sustainability presentation
 

Ähnlich wie How Text Analytics Increase Search Relevance

How to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right WebinarHow to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right Webinar
Concept Searching, Inc
 
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Findwise
 

Ähnlich wie How Text Analytics Increase Search Relevance (20)

AMCTO presentation on moving from records managment to information management
AMCTO presentation on moving from records managment to information managementAMCTO presentation on moving from records managment to information management
AMCTO presentation on moving from records managment to information management
 
Five fast ways to improve search and findability across enterprise networks
Five fast ways to improve search and findability across enterprise networksFive fast ways to improve search and findability across enterprise networks
Five fast ways to improve search and findability across enterprise networks
 
Optimising Your Content for Findability
Optimising Your Content for FindabilityOptimising Your Content for Findability
Optimising Your Content for Findability
 
How to be successful with search in your organisation
How to be successful with search in your organisationHow to be successful with search in your organisation
How to be successful with search in your organisation
 
How to be Successful with Search in YOUR Organization
How to be Successful with Search in YOUR OrganizationHow to be Successful with Search in YOUR Organization
How to be Successful with Search in YOUR Organization
 
Optimising Your Content for findability
Optimising Your Content for findabilityOptimising Your Content for findability
Optimising Your Content for findability
 
Expert Webinar Series 2: Designing Information Architecture for SharePoint: M...
Expert Webinar Series 2: Designing Information Architecture for SharePoint: M...Expert Webinar Series 2: Designing Information Architecture for SharePoint: M...
Expert Webinar Series 2: Designing Information Architecture for SharePoint: M...
 
Search Behavior Patterns
Search Behavior PatternsSearch Behavior Patterns
Search Behavior Patterns
 
FAST Search-webinar-06-29-2010
FAST Search-webinar-06-29-2010FAST Search-webinar-06-29-2010
FAST Search-webinar-06-29-2010
 
How to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right WebinarHow to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right Webinar
 
Elqano - Where Knowledge Finds People
Elqano - Where Knowledge Finds PeopleElqano - Where Knowledge Finds People
Elqano - Where Knowledge Finds People
 
The Digital Workplace Powered by Intelligent Search
The Digital Workplace Powered by Intelligent SearchThe Digital Workplace Powered by Intelligent Search
The Digital Workplace Powered by Intelligent Search
 
Hvordan få søk til å fungere effektivt
Hvordan få søk til å fungere effektivtHvordan få søk til å fungere effektivt
Hvordan få søk til å fungere effektivt
 
Introduction to Enterprise Search
Introduction to Enterprise SearchIntroduction to Enterprise Search
Introduction to Enterprise Search
 
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
 
7 tips for better enterprise search
7 tips for better enterprise search7 tips for better enterprise search
7 tips for better enterprise search
 
Search Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignSearch Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By Design
 
Five creative search solutions using text analytics
Five creative search solutions using text analyticsFive creative search solutions using text analytics
Five creative search solutions using text analytics
 
Taxonomy 101
Taxonomy 101Taxonomy 101
Taxonomy 101
 
Enhance the way people collaborate with documents in SharePoint
Enhance the way people collaborate with documents in SharePoint Enhance the way people collaborate with documents in SharePoint
Enhance the way people collaborate with documents in SharePoint
 

Mehr von Simona Galdikaite, B.B.A, MBA

Mehr von Simona Galdikaite, B.B.A, MBA (6)

Avoid Expensive Electronic Dumping Grounds by Auto-tagging Content
Avoid Expensive Electronic Dumping Grounds by Auto-tagging ContentAvoid Expensive Electronic Dumping Grounds by Auto-tagging Content
Avoid Expensive Electronic Dumping Grounds by Auto-tagging Content
 
Controlling Content Cost in and Enterprise Content Management System
Controlling Content Cost in and Enterprise Content Management SystemControlling Content Cost in and Enterprise Content Management System
Controlling Content Cost in and Enterprise Content Management System
 
Reduce time pressure in mergers and acquisitions
Reduce time pressure in mergers and acquisitionsReduce time pressure in mergers and acquisitions
Reduce time pressure in mergers and acquisitions
 
Infographic: The Cost of Not Finding Documents
Infographic: The Cost of Not Finding DocumentsInfographic: The Cost of Not Finding Documents
Infographic: The Cost of Not Finding Documents
 
Search Interface Feature Evaluation
Search Interface Feature EvaluationSearch Interface Feature Evaluation
Search Interface Feature Evaluation
 
Will the Improvement in Sharepoint 2016 Search Increase User Adoption?
Will the Improvement in Sharepoint 2016 Search Increase User Adoption?Will the Improvement in Sharepoint 2016 Search Increase User Adoption?
Will the Improvement in Sharepoint 2016 Search Increase User Adoption?
 

Kürzlich hochgeladen

一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
pyhepag
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
cyebo
 
edited gordis ebook sixth edition david d.pdf
edited gordis ebook sixth edition david d.pdfedited gordis ebook sixth edition david d.pdf
edited gordis ebook sixth edition david d.pdf
great91
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
acoha1
 
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
ppy8zfkfm
 
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotecAbortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
dq9vz1isj
 
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
acoha1
 
Audience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptxAudience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptx
Stephen266013
 
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
fztigerwe
 

Kürzlich hochgeladen (20)

Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)
 
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
一比一原版加利福尼亚大学尔湾分校毕业证成绩单如何办理
 
Seven tools of quality control.slideshare
Seven tools of quality control.slideshareSeven tools of quality control.slideshare
Seven tools of quality control.slideshare
 
一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理一比一原版麦考瑞大学毕业证成绩单如何办理
一比一原版麦考瑞大学毕业证成绩单如何办理
 
Generative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdfGenerative AI for Trailblazers_ Unlock the Future of AI.pdf
Generative AI for Trailblazers_ Unlock the Future of AI.pdf
 
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam DunksNOAM AAUG Adobe Summit 2024: Summit Slam Dunks
NOAM AAUG Adobe Summit 2024: Summit Slam Dunks
 
edited gordis ebook sixth edition david d.pdf
edited gordis ebook sixth edition david d.pdfedited gordis ebook sixth edition david d.pdf
edited gordis ebook sixth edition david d.pdf
 
Heaps & its operation -Max Heap, Min Heap
Heaps & its operation -Max Heap, Min  HeapHeaps & its operation -Max Heap, Min  Heap
Heaps & its operation -Max Heap, Min Heap
 
2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call2024 Q1 Tableau User Group Leader Quarterly Call
2024 Q1 Tableau User Group Leader Quarterly Call
 
How to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data AnalyticsHow to Transform Clinical Trial Management with Advanced Data Analytics
How to Transform Clinical Trial Management with Advanced Data Analytics
 
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(WashU毕业证书)圣路易斯华盛顿大学毕业证成绩单本科硕士学位证留信学历认证
 
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
1:1原版定制利物浦大学毕业证(Liverpool毕业证)成绩单学位证书留信学历认证
 
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotecAbortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
Abortion pills in Dammam Saudi Arabia// +966572737505 // buy cytotec
 
2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting
 
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
 
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
如何办理(UPenn毕业证书)宾夕法尼亚大学毕业证成绩单本科硕士学位证留信学历认证
 
Audience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptxAudience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptx
 
社内勉強会資料  Mamba - A new era or ephemeral
社内勉強会資料   Mamba - A new era or ephemeral社内勉強会資料   Mamba - A new era or ephemeral
社内勉強会資料  Mamba - A new era or ephemeral
 
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
如何办理哥伦比亚大学毕业证(Columbia毕业证)成绩单原版一比一
 
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
Statistics Informed Decisions Using Data 5th edition by Michael Sullivan solu...
 

How Text Analytics Increase Search Relevance

  • 1. How Text Analytics Increases Search Relevance 1
  • 2. Users care about findability, not search Findability is the ease of which someone can locate the information they want. Often, it is confused with search – but search is just one method of achieving findability. Search allows people to enter in words that they hope are contained in the content they want to retrieve. Findability includes any method of locating this content, including but not limited to searching. Pingar DiscoveryOne improves findability. 2
  • 3. Significantly, findability includes Facetted Search. Facetted search allows people to filter a search by various categories and topics to remove irrelevant search results and more rapidly spot the content they are looking for. Facets can also be used to filter lists and views as well as search results. Studies1 show that users evaluated facetted search as the most desirable feature to improve findability. Example of a facetted search by Category 1 Divoli, A. and Medelyan, A. Search interface feature evaluation in biosciences, HCIR 2011, Google Mountain View, CA, USA Workshops 3
  • 4. By removing the irrelevant content, facetted search improves search relevance – the number of useful documents on the first page of search results. Without facetted search, your investment in enterprise search cannot deliver its full potential. Facetted search however relies on documents being categorized and tagged with keywords and phrases associated with them – this is called metadata. Without metadata, there can be no facetted search. Unfortunately your staff do not enter metadata. Some systems, such as email, may not even allow users to enter metadata. This is why we created DiscoveryOne – it’s an automated way to add metadata. 4
  • 5. Users get most benefit from facetted search with key phrases Key words and phrases are the most beneficial metadata for facetted search. When searching to gather specific information or to find facts, people prefer a few relevant facets of Pingar generated keyphrases. If you have facetted search in your Enterprise Content Management System (ECMS) or Enterprise Search engine, then facetted search on keywords is critical. As employees are unlikely to record keyphrases, they must be automatically identified by a machine system such as Pingar DiscoveryOne. DiscoveryOne reads a document and identifies the words and phrases that best describes the topics inside a document. 5
  • 6. Document categories can also be useful In addition to keyphrases, organizations define metadata such as what project a document belongs to or which client or product line, etc. Matching these to a document allows this metadata to be used with facetted search as well. Unlike traditional technology, DiscoveryOne has two advanced forms of categorizing content automatically: • By topic (e.g. product or projects or known issues) • By content-type (e.g. employment contract or financial statement) 6
  • 7. Categorizing by topic with taxonomies Categorizing content by topic uses taxonomies. Taxonomies are a pre-defined set of categories. Taxonomies can be flat lists Taxonomies can have hierarchy 7
  • 8. Taxonomy categorization works well when you: • Have a clear idea of the categories you want • Can determine the words and phrases that a document will have to indicate what category it matches This is where Pingar text analytics expertise becomes useful. Pingar does more than match the names of the categories when it categorizes by topic. 8
  • 10. Traditional systems tried to use arcane rules that your employees would have to learn and enter in, however the modern text analytics developed by Pingar does not require that, so it’s faster and less expensive to implement. 10
  • 11. Categorizing by content type with statistical models Taxonomy categorization does not work well with determining what the nature of the content is – is it a letter or a brochure or a contract or financial statement? Statistical models are far superior when categorizing documents by content type and traditional technologies do not allow for this. Statistical models are also useful when you don’t know in advance what words are going to occur. 11
  • 12. Text categorization uses a statistical model built specially for the categories and Pingar tools enable this. In order to generate the model, example documents of each category are fed into the tool and the tool learns what makes documents in each category the same. 12
  • 13. w w w . p i n g a r. c o m North America 440 N. Wolfe Rd, CA 94085 Sunnyvale, USA +1 408 663 2328 Asia Pacific 55 Anzac Ave, 1010 Auckland, New Zealand +64 9 950 3299 Thank you 13