SlideShare ist ein Scribd-Unternehmen logo
1 von 38
Enabling Exploration through Text Analytics Daniel Tunkelang Chief Scientist, Endeca
overview ,[object Object],[object Object],[object Object],[object Object]
real-world information seeking examples ,[object Object],[object Object],[object Object],[object Object],[object Object]
example 1: looking for health information ,[object Object],[object Object],[object Object]
google: the default option for most
in government we trust: fda.gov
maybe the private sector knows best: webmd powered by
success – and a sticky site powered by
example 2: looking for work-related information ,[object Object],[object Object],[object Object]
let’s try google again
google: the gateway to wikipedia?
the library of congress (loc.gov)
triangle research libraries: next-gen catalog powered by
faceted search enables query refinement powered by
take-away #1 ,[object Object],[object Object]
text analytics ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
newssift: text analytics enabling exploration powered by categorization named entity detection term extraction sentiment analysis
exploring the news about facebook powered by
facebook: the good powered by Social Utility Iphone Application
facebook: the bad powered by Criminal Behavior Litigation And Settlement
take-away #2 ,[object Object],[object Object]
text analytics is here and now ? ? ?
lots of off-the-shelf options and more!
caveats ,[object Object],[object Object],[object Object],[object Object]
problems with entity extraction ,[object Object],[object Object],[object Object],Arrest (1) Asia (1) ALTOONA, PA (1) Abe Lincoln (1) Bob Dole (1) Boston Tea Party (1) Abraham Lincoln (1) Budweiser (1) Australia (1) Adlai Stephenson (1) Boston Tea Party (1) Austin, Texas (1) Abraham Lincoln (1) Boston Globe (1) Austin (1) Abe Weiss (1) Bocuse d’Or World Cuisine Contest (1) Atlanta (2) Abe Lincoln (1) Bob Dole (1) Asia (1) Abbie Hoffman (1) Bloomberg LP (3) Arrest (1) Aaron Sorkin (1) BioDiversity Research Institute (1) Arlington, Va. (2) ARYE BARAK (1) Big Apple Companies (1) Arkansas (7) ANTONIN SCALIA (1) Bear Stearns (2) Arizona (11) ANTHONY MWANGI (1) Bad News Bears (1) Argentina (1) ANDREW LLOYD WEBBER (1) Australian Liberal Party (1) Appalachia (1) ANDERS ERICSSON (1) Arianna Huffington (1) Americas (17) AMY WINEHOUSE (1) Arctic National Wildlife Refuge (1) Allegheny (1) AMANDA MARCOTTE (1) Apple (1) Alaska (3) ALI HASSAN AL (1) American Airlines Inc. (1) Akihabara (1) ALEX TREBEK (1) Amazon.com Inc. (1) Africa (5) AL GORE (1) Air Force (1) Afghanistan (7) ABDULRAHMAN ABDULLAH (1) ABC News Inc. (1) ALTOONA, PA (1) ABDUL-KARIM KHALAF (1) Organization Location Person
look for ways to cheat! recall precision
division of labor people supply vocabulary machine annotates documents http://www.precolumbianwomen.com/images/inca-labor.10.gif
example: ACM digital library ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
solution ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
example: a search for boeing powered by
it’s a HITS!
if you prefer sports to computer science ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
roger clemens, then and now powered by
pivoting to a different view powered by
take-away #3 ,[object Object],[object Object],[object Object]
looking forward ,[object Object],[object Object],[object Object],[object Object]
in closing ,[object Object],[object Object],[object Object]
thank you…and come to SIGIR! ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Weitere ähnliche Inhalte

Was ist angesagt?

Plagiarism work sheet
Plagiarism work sheetPlagiarism work sheet
Plagiarism work sheet
Vjames12
 
Search Strings
Search StringsSearch Strings
Search Strings
Erin Sees
 
Meabe speeches 2nd sem rev22712
Meabe speeches 2nd sem rev22712Meabe speeches 2nd sem rev22712
Meabe speeches 2nd sem rev22712
Ms. D
 
Research Skills for Level 6 (Follow Up)
Research Skills for Level 6 (Follow Up)Research Skills for Level 6 (Follow Up)
Research Skills for Level 6 (Follow Up)
Swilley Library
 

Was ist angesagt? (19)

Sociology 462
Sociology 462Sociology 462
Sociology 462
 
Electronic Research: Sources and Strategies
Electronic Research: Sources and StrategiesElectronic Research: Sources and Strategies
Electronic Research: Sources and Strategies
 
ENG 101
ENG 101ENG 101
ENG 101
 
Doing Literature Review
Doing Literature ReviewDoing Literature Review
Doing Literature Review
 
Who's citing whom?
Who's citing whom?Who's citing whom?
Who's citing whom?
 
10 easy ways to increase your citation count a checklist
10 easy ways to increase your citation count  a checklist10 easy ways to increase your citation count  a checklist
10 easy ways to increase your citation count a checklist
 
Subject Searching
Subject Searching Subject Searching
Subject Searching
 
What google scholar can do for you
What google scholar can do for youWhat google scholar can do for you
What google scholar can do for you
 
Law1 ppl journal articles
Law1 ppl journal articlesLaw1 ppl journal articles
Law1 ppl journal articles
 
Humanities international complete
Humanities international completeHumanities international complete
Humanities international complete
 
Plagiarism work sheet
Plagiarism work sheetPlagiarism work sheet
Plagiarism work sheet
 
Reflection on web2.0
Reflection on web2.0Reflection on web2.0
Reflection on web2.0
 
How people search the library from a single search box
How people search the library from a single search boxHow people search the library from a single search box
How people search the library from a single search box
 
Search Strings
Search StringsSearch Strings
Search Strings
 
Meabe speeches 2nd sem rev22712
Meabe speeches 2nd sem rev22712Meabe speeches 2nd sem rev22712
Meabe speeches 2nd sem rev22712
 
Research Skills for Level 6 (Follow Up)
Research Skills for Level 6 (Follow Up)Research Skills for Level 6 (Follow Up)
Research Skills for Level 6 (Follow Up)
 
Finding newspaper articles in factiva ppl2015
Finding newspaper articles in factiva ppl2015Finding newspaper articles in factiva ppl2015
Finding newspaper articles in factiva ppl2015
 
Searching databaseswelshci
Searching databaseswelshciSearching databaseswelshci
Searching databaseswelshci
 
National latina researchers network supercharge your search 2015 webinar
National latina researchers network supercharge your search 2015 webinarNational latina researchers network supercharge your search 2015 webinar
National latina researchers network supercharge your search 2015 webinar
 

Andere mochten auch

Andere mochten auch (10)

The Future of Text Analytics
The Future of Text AnalyticsThe Future of Text Analytics
The Future of Text Analytics
 
Predictive Text Analytics
Predictive Text AnalyticsPredictive Text Analytics
Predictive Text Analytics
 
singley+mackie Capabilities Deck
singley+mackie Capabilities Decksingley+mackie Capabilities Deck
singley+mackie Capabilities Deck
 
Text Mining Analytics 101
Text Mining Analytics 101Text Mining Analytics 101
Text Mining Analytics 101
 
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
 
Log Data Mining
Log Data MiningLog Data Mining
Log Data Mining
 
Text Analytics for Dummies 2010
Text Analytics for Dummies 2010Text Analytics for Dummies 2010
Text Analytics for Dummies 2010
 
Elements of Text Mining Part - I
Elements of Text Mining Part - IElements of Text Mining Part - I
Elements of Text Mining Part - I
 
Log Mining: Beyond Log Analysis
Log Mining: Beyond Log AnalysisLog Mining: Beyond Log Analysis
Log Mining: Beyond Log Analysis
 
Data Science - Part XI - Text Analytics
Data Science - Part XI - Text AnalyticsData Science - Part XI - Text Analytics
Data Science - Part XI - Text Analytics
 

Ähnlich wie Enabling Exploration Through Text Analytics

Search Engine Strategies
Search  Engine  StrategiesSearch  Engine  Strategies
Search Engine Strategies
jsotir
 
Academic Skills 4
Academic Skills 4Academic Skills 4
Academic Skills 4
Hala Nur
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
lynroe
 
Information Literacy Orientation (Fall, 2011)
Information Literacy Orientation (Fall, 2011)Information Literacy Orientation (Fall, 2011)
Information Literacy Orientation (Fall, 2011)
sbishoptcl
 

Ähnlich wie Enabling Exploration Through Text Analytics (20)

Database Basics
Database BasicsDatabase Basics
Database Basics
 
Workshop on Systematic Searching (Oslo)
Workshop on Systematic Searching (Oslo)Workshop on Systematic Searching (Oslo)
Workshop on Systematic Searching (Oslo)
 
Internet searching
Internet searchingInternet searching
Internet searching
 
Reproducibility Analytics Lab
Reproducibility Analytics Lab Reproducibility Analytics Lab
Reproducibility Analytics Lab
 
June 1st Library Presentation for CCTS Summer Fellowship
June 1st Library Presentation for CCTS Summer FellowshipJune 1st Library Presentation for CCTS Summer Fellowship
June 1st Library Presentation for CCTS Summer Fellowship
 
Google for Life Science Researchers
Google for Life Science ResearchersGoogle for Life Science Researchers
Google for Life Science Researchers
 
Libguide powerpoint
Libguide powerpointLibguide powerpoint
Libguide powerpoint
 
Introductory Literature Searching Session
Introductory Literature Searching SessionIntroductory Literature Searching Session
Introductory Literature Searching Session
 
Search Engine Strategies
Search  Engine  StrategiesSearch  Engine  Strategies
Search Engine Strategies
 
Academic Skills 4
Academic Skills 4Academic Skills 4
Academic Skills 4
 
FSU SLIS InfoSvcs Wk 3 - Web Search & Evaluation
FSU SLIS InfoSvcs Wk 3 - Web Search & EvaluationFSU SLIS InfoSvcs Wk 3 - Web Search & Evaluation
FSU SLIS InfoSvcs Wk 3 - Web Search & Evaluation
 
Databasics
DatabasicsDatabasics
Databasics
 
Hinari basic course_module_2_workbook_2014_07
Hinari basic course_module_2_workbook_2014_07Hinari basic course_module_2_workbook_2014_07
Hinari basic course_module_2_workbook_2014_07
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
 
Big 6 Research Skills
Big 6 Research SkillsBig 6 Research Skills
Big 6 Research Skills
 
Information Literacy Orientation (Fall, 2011)
Information Literacy Orientation (Fall, 2011)Information Literacy Orientation (Fall, 2011)
Information Literacy Orientation (Fall, 2011)
 
TSEM Spring 2017 Fath Class1
TSEM Spring 2017 Fath Class1TSEM Spring 2017 Fath Class1
TSEM Spring 2017 Fath Class1
 
TSEM Fall 2016 Fath Class1
TSEM Fall 2016 Fath Class1TSEM Fall 2016 Fath Class1
TSEM Fall 2016 Fath Class1
 
Usability Testing a Public ERM: Worth the Effort?
Usability Testing a Public ERM: Worth the Effort?Usability Testing a Public ERM: Worth the Effort?
Usability Testing a Public ERM: Worth the Effort?
 
A Gentle Introduction to Text Analysis :)
A Gentle Introduction to Text Analysis :)A Gentle Introduction to Text Analysis :)
A Gentle Introduction to Text Analysis :)
 

Mehr von Daniel Tunkelang

Enterprise Intelligence
Enterprise IntelligenceEnterprise Intelligence
Enterprise Intelligence
Daniel Tunkelang
 
My Three Ex’s: A Data Science Approach for Applied Machine Learning
My Three Ex’s: A Data Science Approach for Applied Machine LearningMy Three Ex’s: A Data Science Approach for Applied Machine Learning
My Three Ex’s: A Data Science Approach for Applied Machine Learning
Daniel Tunkelang
 
Web science - How is it different?
Web science - How is it different?Web science - How is it different?
Web science - How is it different?
Daniel Tunkelang
 
Find and be Found: Information Retrieval at LinkedIn
Find and be Found: Information Retrieval at LinkedInFind and be Found: Information Retrieval at LinkedIn
Find and be Found: Information Retrieval at LinkedIn
Daniel Tunkelang
 
Search as Communication: Lessons from a Personal Journey
Search as Communication: Lessons from a Personal JourneySearch as Communication: Lessons from a Personal Journey
Search as Communication: Lessons from a Personal Journey
Daniel Tunkelang
 
Enterprise Search: How do we get there from here?
Enterprise Search: How do we get there from here?Enterprise Search: How do we get there from here?
Enterprise Search: How do we get there from here?
Daniel Tunkelang
 
Big Data, We Have a Communication Problem
Big Data, We Have a Communication Problem Big Data, We Have a Communication Problem
Big Data, We Have a Communication Problem
Daniel Tunkelang
 
Data By The People, For The People
Data By The People, For The PeopleData By The People, For The People
Data By The People, For The People
Daniel Tunkelang
 

Mehr von Daniel Tunkelang (20)

Query Understanding and Ecommerce
Query Understanding and EcommerceQuery Understanding and Ecommerce
Query Understanding and Ecommerce
 
Semantic Equivalence of e-Commerce Queries
Semantic Equivalence of e-Commerce QueriesSemantic Equivalence of e-Commerce Queries
Semantic Equivalence of e-Commerce Queries
 
Helping Searchers Satisfice through Query Understanding
Helping Searchers Satisfice through Query UnderstandingHelping Searchers Satisfice through Query Understanding
Helping Searchers Satisfice through Query Understanding
 
MMM, Search!
MMM, Search!MMM, Search!
MMM, Search!
 
Enterprise Intelligence
Enterprise IntelligenceEnterprise Intelligence
Enterprise Intelligence
 
Query Understanding: A Manifesto
Query Understanding: A ManifestoQuery Understanding: A Manifesto
Query Understanding: A Manifesto
 
Where should you put your data scientists?
Where should you put your data scientists?Where should you put your data scientists?
Where should you put your data scientists?
 
Data Science: A Mindset for Productivity
Data Science: A Mindset for ProductivityData Science: A Mindset for Productivity
Data Science: A Mindset for Productivity
 
My Three Ex’s: A Data Science Approach for Applied Machine Learning
My Three Ex’s: A Data Science Approach for Applied Machine LearningMy Three Ex’s: A Data Science Approach for Applied Machine Learning
My Three Ex’s: A Data Science Approach for Applied Machine Learning
 
Web science - How is it different?
Web science - How is it different?Web science - How is it different?
Web science - How is it different?
 
Better Search Through Query Understanding
Better Search Through Query UnderstandingBetter Search Through Query Understanding
Better Search Through Query Understanding
 
Social Search in a Professional Context
Social Search in a Professional ContextSocial Search in a Professional Context
Social Search in a Professional Context
 
Find and be Found: Information Retrieval at LinkedIn
Find and be Found: Information Retrieval at LinkedInFind and be Found: Information Retrieval at LinkedIn
Find and be Found: Information Retrieval at LinkedIn
 
Search as Communication: Lessons from a Personal Journey
Search as Communication: Lessons from a Personal JourneySearch as Communication: Lessons from a Personal Journey
Search as Communication: Lessons from a Personal Journey
 
Enterprise Search: How do we get there from here?
Enterprise Search: How do we get there from here?Enterprise Search: How do we get there from here?
Enterprise Search: How do we get there from here?
 
Big Data, We Have a Communication Problem
Big Data, We Have a Communication Problem Big Data, We Have a Communication Problem
Big Data, We Have a Communication Problem
 
How to Interview a Data Scientist
How to Interview a Data ScientistHow to Interview a Data Scientist
How to Interview a Data Scientist
 
Information, Attention, and Trust: A Hierarchy of Needs
Information, Attention, and Trust: A Hierarchy of NeedsInformation, Attention, and Trust: A Hierarchy of Needs
Information, Attention, and Trust: A Hierarchy of Needs
 
Data By The People, For The People
Data By The People, For The PeopleData By The People, For The People
Data By The People, For The People
 
Content, Connections, and Context
Content, Connections, and ContextContent, Connections, and Context
Content, Connections, and Context
 

Kürzlich hochgeladen

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 

Kürzlich hochgeladen (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Corporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptxCorporate and higher education May webinar.pptx
Corporate and higher education May webinar.pptx
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 

Enabling Exploration Through Text Analytics