SlideShare a Scribd company logo
1 of 13
Download to read offline
How Text Analytics
Increases Search
Relevance
1
Users care about findability, not search
Findability is the ease of which someone can locate
the information they want. Often, it is confused
with search – but search is just one method of
achieving findability. Search allows people to enter
in words that they hope are contained in the
content they want to retrieve. Findability includes
any method of locating this content, including but
not limited to searching. Pingar DiscoveryOne
improves findability.
2
Significantly, findability includes Facetted Search. Facetted
search allows people to filter a search by various categories
and topics to remove irrelevant search results and more
rapidly spot the content they are looking for. Facets can also
be used to filter lists and views as well as search results.
Studies1 show that users evaluated facetted search as the
most desirable feature to improve findability.
Example of a facetted search by Category
1 Divoli, A. and Medelyan, A. Search interface feature evaluation in biosciences, HCIR 2011,
Google Mountain View, CA, USA Workshops
3
By removing the irrelevant content, facetted search
improves search relevance – the number of useful
documents on the first page of search results.
Without facetted search, your investment in
enterprise search cannot deliver its full potential.
Facetted search however relies on documents being
categorized and tagged with keywords and phrases
associated with them – this is called metadata.
Without metadata, there can be no facetted search.
Unfortunately your staff do not enter metadata.
Some systems, such as email, may not even allow
users to enter metadata. This is why we created
DiscoveryOne – it’s an automated way to add
metadata.
4
Users get most benefit from facetted
search with key phrases
Key words and phrases are the most beneficial metadata
for facetted search. When searching to gather specific
information or to find facts, people prefer a few relevant
facets of Pingar generated keyphrases. If you have
facetted search in your Enterprise Content Management
System (ECMS) or Enterprise Search engine, then facetted
search on keywords is critical.
As employees are unlikely to record keyphrases, they
must be automatically identified by a machine system
such as Pingar DiscoveryOne. DiscoveryOne reads a
document and identifies the words and phrases that best
describes the topics inside a document.
5
Document categories can also be useful
In addition to keyphrases, organizations define metadata
such as what project a document belongs to or which
client or product line, etc. Matching these to a document
allows this metadata to be used with facetted search as
well.
Unlike traditional technology, DiscoveryOne has two
advanced forms of categorizing content automatically:
• By topic (e.g. product or projects or known issues)
• By content-type (e.g. employment contract or financial
statement)
6
Categorizing by topic with taxonomies
Categorizing content by topic uses taxonomies. Taxonomies are a
pre-defined set of categories.
Taxonomies can be flat lists Taxonomies can have hierarchy
7
Taxonomy categorization works well when you:
• Have a clear idea of the categories you want
• Can determine the words and phrases that a
document will have to indicate what category it
matches
This is where Pingar text analytics expertise
becomes useful. Pingar does more than match the
names of the categories when it categorizes by
topic.
8
It also:
9
Traditional systems tried to use arcane rules that
your employees would have to learn and enter in,
however the modern text analytics developed by
Pingar does not require that, so it’s faster and less
expensive to implement.
10
Categorizing by content type with
statistical models
Taxonomy categorization does not work well with
determining what the nature of the content is – is
it a letter or a brochure or a contract or financial
statement? Statistical models are far superior
when categorizing documents by content type and
traditional technologies do not allow for this.
Statistical models are also useful when you don’t
know in advance what words are going to occur.
11
Text categorization uses a statistical model
built specially for the categories and Pingar
tools enable this. In order to generate the
model, example documents of each category
are fed into the tool and the tool learns what
makes documents in each category the same.
12
w w w . p i n g a r. c o m
North America
440 N. Wolfe Rd, CA 94085
Sunnyvale, USA
+1 408 663 2328
Asia Pacific
55 Anzac Ave, 1010
Auckland, New Zealand
+64 9 950 3299
Thank you
13

More Related Content

What's hot

Effective Strategies for Searching Oracle UCM
Effective Strategies for Searching Oracle UCMEffective Strategies for Searching Oracle UCM
Effective Strategies for Searching Oracle UCM
Fishbowl Solutions
 
Volume 2-issue-6-2016-2020
Volume 2-issue-6-2016-2020Volume 2-issue-6-2016-2020
Volume 2-issue-6-2016-2020
Editor IJARCET
 
SharePoint User Group Meeting- SharePoint 2013 Search
SharePoint User Group Meeting- SharePoint 2013 SearchSharePoint User Group Meeting- SharePoint 2013 Search
SharePoint User Group Meeting- SharePoint 2013 Search
C/D/H Technology Consultants
 
Implementing Enterprise Search in SharePoint 2010
Implementing Enterprise Search in SharePoint 2010Implementing Enterprise Search in SharePoint 2010
Implementing Enterprise Search in SharePoint 2010
Agnes Molnar
 
ANALYSIS OF ENTERPRISE SHARED RESOURCE INVOCATION SCHEME BASED ON HADOOP AND R
ANALYSIS OF ENTERPRISE SHARED RESOURCE INVOCATION SCHEME BASED ON HADOOP AND RANALYSIS OF ENTERPRISE SHARED RESOURCE INVOCATION SCHEME BASED ON HADOOP AND R
ANALYSIS OF ENTERPRISE SHARED RESOURCE INVOCATION SCHEME BASED ON HADOOP AND R
ijaia
 

What's hot (14)

Chest TermSet GDPR ScanR Presentation
Chest TermSet GDPR ScanR PresentationChest TermSet GDPR ScanR Presentation
Chest TermSet GDPR ScanR Presentation
 
Effective Strategies for Searching Oracle UCM
Effective Strategies for Searching Oracle UCMEffective Strategies for Searching Oracle UCM
Effective Strategies for Searching Oracle UCM
 
Using metadata repositories with search
Using metadata repositories with searchUsing metadata repositories with search
Using metadata repositories with search
 
Structured data and metadata evaluation methodology for organizations looking...
Structured data and metadata evaluation methodology for organizations looking...Structured data and metadata evaluation methodology for organizations looking...
Structured data and metadata evaluation methodology for organizations looking...
 
Modeling & managing metadata for greater productivity
Modeling & managing metadata for greater productivityModeling & managing metadata for greater productivity
Modeling & managing metadata for greater productivity
 
Leveraging Analytics for Dynamic Review Strategies
Leveraging Analytics for Dynamic Review StrategiesLeveraging Analytics for Dynamic Review Strategies
Leveraging Analytics for Dynamic Review Strategies
 
A scalable hybrid research paper recommender system for micro
A scalable hybrid research paper recommender system for microA scalable hybrid research paper recommender system for micro
A scalable hybrid research paper recommender system for micro
 
Volume 2-issue-6-2016-2020
Volume 2-issue-6-2016-2020Volume 2-issue-6-2016-2020
Volume 2-issue-6-2016-2020
 
BPC10 BuckleyMigration-share
BPC10 BuckleyMigration-shareBPC10 BuckleyMigration-share
BPC10 BuckleyMigration-share
 
Starting a search application
Starting a search applicationStarting a search application
Starting a search application
 
SharePoint User Group Meeting- SharePoint 2013 Search
SharePoint User Group Meeting- SharePoint 2013 SearchSharePoint User Group Meeting- SharePoint 2013 Search
SharePoint User Group Meeting- SharePoint 2013 Search
 
Graduation Thesis Sample
Graduation Thesis SampleGraduation Thesis Sample
Graduation Thesis Sample
 
Implementing Enterprise Search in SharePoint 2010
Implementing Enterprise Search in SharePoint 2010Implementing Enterprise Search in SharePoint 2010
Implementing Enterprise Search in SharePoint 2010
 
ANALYSIS OF ENTERPRISE SHARED RESOURCE INVOCATION SCHEME BASED ON HADOOP AND R
ANALYSIS OF ENTERPRISE SHARED RESOURCE INVOCATION SCHEME BASED ON HADOOP AND RANALYSIS OF ENTERPRISE SHARED RESOURCE INVOCATION SCHEME BASED ON HADOOP AND R
ANALYSIS OF ENTERPRISE SHARED RESOURCE INVOCATION SCHEME BASED ON HADOOP AND R
 

Viewers also liked

E-commerce Recommender Systems for Small Business
E-commerce Recommender Systems for Small BusinessE-commerce Recommender Systems for Small Business
E-commerce Recommender Systems for Small Business
Stavros Apostolou
 
Unidad I - Curso IV - Escritura Científica
Unidad I - Curso IV - Escritura CientíficaUnidad I - Curso IV - Escritura Científica
Unidad I - Curso IV - Escritura Científica
María Carreras
 

Viewers also liked (19)

Stb gost r 51140 2000
Stb gost r 51140 2000Stb gost r 51140 2000
Stb gost r 51140 2000
 
Soldado desconocido (Lurgio Gavilan)
Soldado desconocido  (Lurgio Gavilan)Soldado desconocido  (Lurgio Gavilan)
Soldado desconocido (Lurgio Gavilan)
 
Stb 11407
Stb 11407Stb 11407
Stb 11407
 
Bean n Gone Ltd
Bean n Gone LtdBean n Gone Ltd
Bean n Gone Ltd
 
Year up JAX IT Council - February 2016
Year up   JAX IT Council - February 2016Year up   JAX IT Council - February 2016
Year up JAX IT Council - February 2016
 
La influencia de las tic
La influencia de las ticLa influencia de las tic
La influencia de las tic
 
Etiqueta de jabon "JC"
Etiqueta de jabon "JC"Etiqueta de jabon "JC"
Etiqueta de jabon "JC"
 
Martes 04 11
Martes 04 11Martes 04 11
Martes 04 11
 
E-commerce Recommender Systems for Small Business
E-commerce Recommender Systems for Small BusinessE-commerce Recommender Systems for Small Business
E-commerce Recommender Systems for Small Business
 
Comunismocarlos
ComunismocarlosComunismocarlos
Comunismocarlos
 
Presentación Construyendo Ciudadanía 2013
Presentación Construyendo Ciudadanía 2013Presentación Construyendo Ciudadanía 2013
Presentación Construyendo Ciudadanía 2013
 
Simple past
Simple pastSimple past
Simple past
 
Trabajo final grupo 404085A
Trabajo final grupo 404085ATrabajo final grupo 404085A
Trabajo final grupo 404085A
 
Chemistryproject 111111111111111111
Chemistryproject 111111111111111111Chemistryproject 111111111111111111
Chemistryproject 111111111111111111
 
Atividades comte, marx, weber, durkheim
Atividades comte, marx, weber, durkheimAtividades comte, marx, weber, durkheim
Atividades comte, marx, weber, durkheim
 
Pas3 Tenancy Design Patterns (Predix Transform 2016)
Pas3 Tenancy Design Patterns (Predix Transform 2016)Pas3 Tenancy Design Patterns (Predix Transform 2016)
Pas3 Tenancy Design Patterns (Predix Transform 2016)
 
Unidad I - Curso IV - Escritura Científica
Unidad I - Curso IV - Escritura CientíficaUnidad I - Curso IV - Escritura Científica
Unidad I - Curso IV - Escritura Científica
 
Caga tió
Caga tióCaga tió
Caga tió
 
IPMA - Cel PMO? - Samolikwidacja, The goal of the PMO? Self-Destruction
IPMA - Cel PMO? - Samolikwidacja, The goal of the PMO? Self-DestructionIPMA - Cel PMO? - Samolikwidacja, The goal of the PMO? Self-Destruction
IPMA - Cel PMO? - Samolikwidacja, The goal of the PMO? Self-Destruction
 

Similar to How Text Analytics Increases Search Relevance

How to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right WebinarHow to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right Webinar
Concept Searching, Inc
 
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Findwise
 

Similar to How Text Analytics Increases Search Relevance (20)

AMCTO presentation on moving from records managment to information management
AMCTO presentation on moving from records managment to information managementAMCTO presentation on moving from records managment to information management
AMCTO presentation on moving from records managment to information management
 
Five fast ways to improve search and findability across enterprise networks
Five fast ways to improve search and findability across enterprise networksFive fast ways to improve search and findability across enterprise networks
Five fast ways to improve search and findability across enterprise networks
 
Optimising Your Content for Findability
Optimising Your Content for FindabilityOptimising Your Content for Findability
Optimising Your Content for Findability
 
How to be successful with search in your organisation
How to be successful with search in your organisationHow to be successful with search in your organisation
How to be successful with search in your organisation
 
How to be Successful with Search in YOUR Organization
How to be Successful with Search in YOUR OrganizationHow to be Successful with Search in YOUR Organization
How to be Successful with Search in YOUR Organization
 
Optimising Your Content for findability
Optimising Your Content for findabilityOptimising Your Content for findability
Optimising Your Content for findability
 
Expert Webinar Series 2: Designing Information Architecture for SharePoint: M...
Expert Webinar Series 2: Designing Information Architecture for SharePoint: M...Expert Webinar Series 2: Designing Information Architecture for SharePoint: M...
Expert Webinar Series 2: Designing Information Architecture for SharePoint: M...
 
Search Behavior Patterns
Search Behavior PatternsSearch Behavior Patterns
Search Behavior Patterns
 
FAST Search-webinar-06-29-2010
FAST Search-webinar-06-29-2010FAST Search-webinar-06-29-2010
FAST Search-webinar-06-29-2010
 
How to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right WebinarHow to Get Enterprise Search Right Webinar
How to Get Enterprise Search Right Webinar
 
Elqano - Where Knowledge Finds People
Elqano - Where Knowledge Finds PeopleElqano - Where Knowledge Finds People
Elqano - Where Knowledge Finds People
 
The Digital Workplace Powered by Intelligent Search
The Digital Workplace Powered by Intelligent SearchThe Digital Workplace Powered by Intelligent Search
The Digital Workplace Powered by Intelligent Search
 
Hvordan få søk til å fungere effektivt
Hvordan få søk til å fungere effektivtHvordan få søk til å fungere effektivt
Hvordan få søk til å fungere effektivt
 
Introduction to Enterprise Search
Introduction to Enterprise SearchIntroduction to Enterprise Search
Introduction to Enterprise Search
 
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
Enterprise Search White Paper: Increase Your Competitiveness - Make a Knowled...
 
7 tips for better enterprise search
7 tips for better enterprise search7 tips for better enterprise search
7 tips for better enterprise search
 
Search Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignSearch Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By Design
 
Five creative search solutions using text analytics
Five creative search solutions using text analyticsFive creative search solutions using text analytics
Five creative search solutions using text analytics
 
Taxonomy 101
Taxonomy 101Taxonomy 101
Taxonomy 101
 
Enhance the way people collaborate with documents in SharePoint
Enhance the way people collaborate with documents in SharePoint Enhance the way people collaborate with documents in SharePoint
Enhance the way people collaborate with documents in SharePoint
 

More from Zanda Mark

More from Zanda Mark (11)

Conducting Content Inventory
Conducting Content InventoryConducting Content Inventory
Conducting Content Inventory
 
Improving digital transfomation process
Improving digital transfomation processImproving digital transfomation process
Improving digital transfomation process
 
Search Interface Feature Evaluation in Biosciences
Search Interface Feature Evaluation in BiosciencesSearch Interface Feature Evaluation in Biosciences
Search Interface Feature Evaluation in Biosciences
 
Avoid expensive electronic dumping grounds by auto-tagging content
Avoid expensive electronic dumping grounds by auto-tagging contentAvoid expensive electronic dumping grounds by auto-tagging content
Avoid expensive electronic dumping grounds by auto-tagging content
 
Content Management Statistics
Content Management StatisticsContent Management Statistics
Content Management Statistics
 
The cost of not finding documents
The cost of not finding documentsThe cost of not finding documents
The cost of not finding documents
 
Control the Cost of too Much Content
Control the Cost of too Much ContentControl the Cost of too Much Content
Control the Cost of too Much Content
 
How to improve search?
How to improve search? How to improve search?
How to improve search?
 
Data management
Data managementData management
Data management
 
Will the improvement in Sharepoint 2016 search increase user adaption?
Will the improvement in Sharepoint 2016 search increase user adaption?Will the improvement in Sharepoint 2016 search increase user adaption?
Will the improvement in Sharepoint 2016 search increase user adaption?
 
What is metadata?
What is metadata?What is metadata?
What is metadata?
 

Recently uploaded

Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Valters Lauzums
 
Toko Jual Viagra Asli Di Malang 081229400522 COD Obat Kuat Viagra Malang
Toko Jual Viagra Asli Di Malang 081229400522 COD Obat Kuat Viagra MalangToko Jual Viagra Asli Di Malang 081229400522 COD Obat Kuat Viagra Malang
Toko Jual Viagra Asli Di Malang 081229400522 COD Obat Kuat Viagra Malang
adet6151
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptx
DilipVasan
 
Toko Jual Viagra Asli Di Salatiga 081229400522 Obat Kuat Viagra
Toko Jual Viagra Asli Di Salatiga 081229400522 Obat Kuat ViagraToko Jual Viagra Asli Di Salatiga 081229400522 Obat Kuat Viagra
Toko Jual Viagra Asli Di Salatiga 081229400522 Obat Kuat Viagra
adet6151
 
如何办理澳洲悉尼大学毕业证(USYD毕业证书)学位证书成绩单原版一比一
如何办理澳洲悉尼大学毕业证(USYD毕业证书)学位证书成绩单原版一比一如何办理澳洲悉尼大学毕业证(USYD毕业证书)学位证书成绩单原版一比一
如何办理澳洲悉尼大学毕业证(USYD毕业证书)学位证书成绩单原版一比一
w7jl3eyno
 
Audience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptxAudience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptx
Stephen266013
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
cyebo
 
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
ju0dztxtn
 
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
dq9vz1isj
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
pyhepag
 

Recently uploaded (20)

basics of data science with application areas.pdf
basics of data science with application areas.pdfbasics of data science with application areas.pdf
basics of data science with application areas.pdf
 
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
Data Analytics for Digital Marketing Lecture for Advanced Digital & Social Me...
 
Pre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptxPre-ProductionImproveddsfjgndflghtgg.pptx
Pre-ProductionImproveddsfjgndflghtgg.pptx
 
2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting2024 Q2 Orange County (CA) Tableau User Group Meeting
2024 Q2 Orange County (CA) Tableau User Group Meeting
 
Toko Jual Viagra Asli Di Malang 081229400522 COD Obat Kuat Viagra Malang
Toko Jual Viagra Asli Di Malang 081229400522 COD Obat Kuat Viagra MalangToko Jual Viagra Asli Di Malang 081229400522 COD Obat Kuat Viagra Malang
Toko Jual Viagra Asli Di Malang 081229400522 COD Obat Kuat Viagra Malang
 
社内勉強会資料  Mamba - A new era or ephemeral
社内勉強会資料   Mamba - A new era or ephemeral社内勉強会資料   Mamba - A new era or ephemeral
社内勉強会資料  Mamba - A new era or ephemeral
 
Formulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdfFormulas dax para power bI de microsoft.pdf
Formulas dax para power bI de microsoft.pdf
 
Exploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptxExploratory Data Analysis - Dilip S.pptx
Exploratory Data Analysis - Dilip S.pptx
 
Toko Jual Viagra Asli Di Salatiga 081229400522 Obat Kuat Viagra
Toko Jual Viagra Asli Di Salatiga 081229400522 Obat Kuat ViagraToko Jual Viagra Asli Di Salatiga 081229400522 Obat Kuat Viagra
Toko Jual Viagra Asli Di Salatiga 081229400522 Obat Kuat Viagra
 
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...Genuine love spell caster )! ,+27834335081)   Ex lover back permanently in At...
Genuine love spell caster )! ,+27834335081) Ex lover back permanently in At...
 
ℂall Girls Balbir Nagar ℂall Now Chhaya ☎ 9899900591 WhatsApp Number 24/7
ℂall Girls Balbir Nagar ℂall Now Chhaya ☎ 9899900591 WhatsApp  Number 24/7ℂall Girls Balbir Nagar ℂall Now Chhaya ☎ 9899900591 WhatsApp  Number 24/7
ℂall Girls Balbir Nagar ℂall Now Chhaya ☎ 9899900591 WhatsApp Number 24/7
 
如何办理澳洲悉尼大学毕业证(USYD毕业证书)学位证书成绩单原版一比一
如何办理澳洲悉尼大学毕业证(USYD毕业证书)学位证书成绩单原版一比一如何办理澳洲悉尼大学毕业证(USYD毕业证书)学位证书成绩单原版一比一
如何办理澳洲悉尼大学毕业证(USYD毕业证书)学位证书成绩单原版一比一
 
Audience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptxAudience Researchndfhcvnfgvgbhujhgfv.pptx
Audience Researchndfhcvnfgvgbhujhgfv.pptx
 
一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理一比一原版纽卡斯尔大学毕业证成绩单如何办理
一比一原版纽卡斯尔大学毕业证成绩单如何办理
 
123.docx. .
123.docx.                                 .123.docx.                                 .
123.docx. .
 
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
Data Visualization Exploring and Explaining with Data 1st Edition by Camm sol...
 
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
如何办理英国卡迪夫大学毕业证(Cardiff毕业证书)成绩单留信学历认证
 
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
1:1原版定制伦敦政治经济学院毕业证(LSE毕业证)成绩单学位证书留信学历认证
 
Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)Atlantic Grupa Case Study (Mintec Data AI)
Atlantic Grupa Case Study (Mintec Data AI)
 
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
一比一原版(Monash毕业证书)莫纳什大学毕业证成绩单如何办理
 

How Text Analytics Increases Search Relevance

  • 1. How Text Analytics Increases Search Relevance 1
  • 2. Users care about findability, not search Findability is the ease of which someone can locate the information they want. Often, it is confused with search – but search is just one method of achieving findability. Search allows people to enter in words that they hope are contained in the content they want to retrieve. Findability includes any method of locating this content, including but not limited to searching. Pingar DiscoveryOne improves findability. 2
  • 3. Significantly, findability includes Facetted Search. Facetted search allows people to filter a search by various categories and topics to remove irrelevant search results and more rapidly spot the content they are looking for. Facets can also be used to filter lists and views as well as search results. Studies1 show that users evaluated facetted search as the most desirable feature to improve findability. Example of a facetted search by Category 1 Divoli, A. and Medelyan, A. Search interface feature evaluation in biosciences, HCIR 2011, Google Mountain View, CA, USA Workshops 3
  • 4. By removing the irrelevant content, facetted search improves search relevance – the number of useful documents on the first page of search results. Without facetted search, your investment in enterprise search cannot deliver its full potential. Facetted search however relies on documents being categorized and tagged with keywords and phrases associated with them – this is called metadata. Without metadata, there can be no facetted search. Unfortunately your staff do not enter metadata. Some systems, such as email, may not even allow users to enter metadata. This is why we created DiscoveryOne – it’s an automated way to add metadata. 4
  • 5. Users get most benefit from facetted search with key phrases Key words and phrases are the most beneficial metadata for facetted search. When searching to gather specific information or to find facts, people prefer a few relevant facets of Pingar generated keyphrases. If you have facetted search in your Enterprise Content Management System (ECMS) or Enterprise Search engine, then facetted search on keywords is critical. As employees are unlikely to record keyphrases, they must be automatically identified by a machine system such as Pingar DiscoveryOne. DiscoveryOne reads a document and identifies the words and phrases that best describes the topics inside a document. 5
  • 6. Document categories can also be useful In addition to keyphrases, organizations define metadata such as what project a document belongs to or which client or product line, etc. Matching these to a document allows this metadata to be used with facetted search as well. Unlike traditional technology, DiscoveryOne has two advanced forms of categorizing content automatically: • By topic (e.g. product or projects or known issues) • By content-type (e.g. employment contract or financial statement) 6
  • 7. Categorizing by topic with taxonomies Categorizing content by topic uses taxonomies. Taxonomies are a pre-defined set of categories. Taxonomies can be flat lists Taxonomies can have hierarchy 7
  • 8. Taxonomy categorization works well when you: • Have a clear idea of the categories you want • Can determine the words and phrases that a document will have to indicate what category it matches This is where Pingar text analytics expertise becomes useful. Pingar does more than match the names of the categories when it categorizes by topic. 8
  • 10. Traditional systems tried to use arcane rules that your employees would have to learn and enter in, however the modern text analytics developed by Pingar does not require that, so it’s faster and less expensive to implement. 10
  • 11. Categorizing by content type with statistical models Taxonomy categorization does not work well with determining what the nature of the content is – is it a letter or a brochure or a contract or financial statement? Statistical models are far superior when categorizing documents by content type and traditional technologies do not allow for this. Statistical models are also useful when you don’t know in advance what words are going to occur. 11
  • 12. Text categorization uses a statistical model built specially for the categories and Pingar tools enable this. In order to generate the model, example documents of each category are fed into the tool and the tool learns what makes documents in each category the same. 12
  • 13. w w w . p i n g a r. c o m North America 440 N. Wolfe Rd, CA 94085 Sunnyvale, USA +1 408 663 2328 Asia Pacific 55 Anzac Ave, 1010 Auckland, New Zealand +64 9 950 3299 Thank you 13