Taxonomies: Tools or People?
by Dave Clarke & Paula McCoy

When would one favor human indexing over machine indexing? An example of the human indexing effort is presented along with tools that can help with the process. An example of autocategorization is illustrated with a discussion of the reciprocal flow of information between the taxonomy management tool and the autocategorization tool. Speakers then discuss how structured vocabularies help refine categorizers and how feedback from the categorizer tool to the human editorial team contributes to the continual improvement of the vocabularies.

TBC; Taxonomies: Tools or People? By Dave Clarke & Paula McCoy. Copyright © Synaptica, LLC, 2009. www.synapticasoftware.com. 12/09/09
HUMAN VS. MACHINE & THE HUMAN OPTION
Humans will invent almost anything to save time
Human or machine indexing – depends on the data and the user
Humans:
  • Machine-readability: non-textual, e.g. images, sounds
  • Conceptual-abstraction: subtle & abstract concepts
  • Expert users: mission-critical precision & recall
Machines:
  • Size: very high volume
  • Time-sensitivity: very quick turnaround
  • Generalist users: noisy or incomplete results tolerable
Either:
  • Data-structure: highly structured
  • Homogeneity: homogeneous topics
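As a rough illustration, the factors on this slide can be written down as a checklist. The sketch below is not from the talk: the `DataSetProfile` fields and the `leanings` function are hypothetical names chosen for the example. The function only reports which factors point which way and does not make the decision, since the speakers stress that this is not an exact science and that business factors also apply.

```python
from dataclasses import dataclass


@dataclass
class DataSetProfile:
    """Hypothetical description of a data set and its users (illustrative only)."""
    very_high_volume: bool = False    # Size
    time_sensitive: bool = False      # Time-sensitivity
    generalist_users: bool = False    # noisy or incomplete results tolerable
    machine_readable: bool = True     # False for e.g. images, sounds
    abstract_concepts: bool = False   # Conceptual-abstraction
    expert_users: bool = False        # mission-critical precision & recall
    highly_structured: bool = False   # Data-structure
    homogeneous_topics: bool = False  # Homogeneity


def leanings(p: DataSetProfile) -> dict:
    """Group the factors that apply by the direction they lean on the slide."""
    result = {"machines": [], "humans": [], "either": []}
    if p.very_high_volume:
        result["machines"].append("size")
    if p.time_sensitive:
        result["machines"].append("time-sensitivity")
    if p.generalist_users:
        result["machines"].append("generalist users")
    if not p.machine_readable:
        result["humans"].append("machine-readability")
    if p.abstract_concepts:
        result["humans"].append("conceptual abstraction")
    if p.expert_users:
        result["humans"].append("expert users")
    if p.highly_structured:
        result["either"].append("data structure")
    if p.homogeneous_topics:
        result["either"].append("homogeneity")
    return result


# Example: an untagged photo archive used by specialists.
photo_archive = DataSetProfile(machine_readable=False, expert_users=True)
print(leanings(photo_archive))
```

The result is a plain grouping of factors, left for a person to weigh alongside cost and access to indexers or IT resources.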
Human indexing – the process
1. Review the content (Data Set)
2. Consult the vocabularies (Controlled Vocabularies)
3. Either tag the content item or build an index table (Index Table)
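The three steps can be sketched in code. This is a minimal illustration and not Synaptica's implementation: the vocabulary contents, the `find_terms` and `assign` helpers, and the record ID are all invented for the example. It shows the index-table variant of step 3, where indexing decisions are stored separately from the content.

```python
from collections import defaultdict

# Hypothetical controlled vocabulary: preferred term -> entry terms (synonyms).
VOCABULARY = {
    "Automobiles": ["cars", "motor vehicles"],
    "Nuclear physics": ["atomic physics"],
}

# Index table kept separate from the content: record ID -> set of preferred terms.
index_table = defaultdict(set)


def find_terms(query: str) -> list:
    """Step 2: look up preferred terms whose label or synonyms match the query."""
    q = query.strip().lower()
    return [
        preferred
        for preferred, synonyms in VOCABULARY.items()
        if q == preferred.lower() or q in (s.lower() for s in synonyms)
    ]


def assign(record_id: str, preferred_term: str) -> None:
    """Step 3: record the indexing decision in the index table."""
    if preferred_term not in VOCABULARY:
        raise ValueError(f"{preferred_term!r} is not in the controlled vocabulary")
    index_table[record_id].add(preferred_term)


# Step 1 (reviewing the content) is the human part. Here the indexer has read
# the hypothetical record "doc-001" and decided it is about cars.
for term in find_terms("cars"):
    assign("doc-001", term)
```

Rejecting terms that are not in the vocabulary is what keeps the index controlled: the indexer searches with any wording, but only preferred terms reach the table.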
Human indexing – a wish list of time-saving tools
Human indexing – Synaptica’s “IMS” Toolbox
Human indexing – IMS Workflow Detail
Human indexing – profile setup screenshot
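The speaker notes for this slide list what an indexing profile can configure: user-access permissions, term expansion, access to particular vocabularies and facets, and the selection of individual sub-branches within a hierarchy. A profile of that kind might be modeled as below. The field names and the `can_use` check are assumptions made for illustration and do not reflect the actual IMS schema.

```python
from dataclasses import dataclass, field


@dataclass
class IndexingProfile:
    """Hypothetical indexing profile; field names are illustrative, not IMS's."""
    name: str
    allowed_users: set = field(default_factory=set)  # user-access permissions
    expand_terms: bool = False                       # term expansion on or off
    vocabularies: set = field(default_factory=set)   # vocabularies/facets exposed
    branches: dict = field(default_factory=dict)     # vocabulary -> allowed sub-branch roots

    def can_use(self, user: str, vocabulary: str, path: tuple) -> bool:
        """True if `user` may index with the term at `path` (root first)."""
        if user not in self.allowed_users or vocabulary not in self.vocabularies:
            return False
        roots = self.branches.get(vocabulary)
        # No branch restriction means the whole vocabulary is available.
        return roots is None or any(node in roots for node in path)


# Example: a profile that limits one indexer to the Physics branch.
profile = IndexingProfile(
    name="physics-journals",
    allowed_users={"indexer1"},
    vocabularies={"Subjects"},
    branches={"Subjects": {"Physics"}},
)
print(profile.can_use("indexer1", "Subjects", ("Science", "Physics", "Nuclear physics")))
```

Narrowing a profile to a sub-branch means the indexer only ever sees terms relevant to the content set, which is the streamlining the notes describe.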
Human indexing – examples
Human indexing – conclusions
AUTOCATEGORIZATION: A CASE STUDY USING SYNAPTICA
TBC; Taxonomies: Tools or People? By Dave Clarke & Paula McCoy. Copyright © Proquest, Inc., 2009. www.proquest.com. 12/09/09
ProQuest Search Interface
The Autocategorization Solution
The ProQuest Approach
Thesaurus and Autocat Management
Synaptica-TME Interaction
Synaptica & Autocat: Benefits
Benefits for Synaptica Thesaurus Control
Questions?

Synaptica Proquest Talk Taxonomy Boot Camp 2009

Editor's notes

  1–3. People will go to extraordinary lengths to invent tools to save time. All of us would like to be in a place where machines can take over information indexing for us. But when is this possible, and when should it be avoided?
  4. This chart examines when it is appropriate to let machines or people perform indexing. It is not an exact science, so individual circumstances require an evaluation of all these factors, plus business factors such as ease of access to human indexers and to IT resources and funds. Broadly speaking, however, certain factors steer one in the direction of certain solutions.
     Factors that lean toward machine indexing:
     • If the data set is so large that humans could not possibly process it, then machine indexing may be the only solution, regardless of any qualitative factors.
     • If the data set is fast moving and access to it is time-sensitive, then machine indexing can also be the preferred solution, although small sets of fast-moving data may be processed by humans.
     • If the users are generalists or in pursuit of information for recreational purposes, then they are likely to be more tolerant of noisy or incomplete results.
     Factors that lean toward human indexing:
     • If the data is not at all machine-readable, then human indexing may be the only solution. For example, photographs and video without any metadata or embedded speech may require human review.
     • If the data contains subtle or abstract concepts, then these may elude even the most finely tuned machines. For example, the ideas behind the "To be or not to be" soliloquy in Hamlet are too subtle to be identified from textual analysis alone.
     • If the users are experts for whom data is a mission-critical resource, then they may require exceedingly high precision and recall, which would demand either human indexing or an extremely high degree of human training and QC of the machine process.
     Factors that benefit either indexing method:
     • If data is well structured within identifiable fields or metadata attributes, then this structure provides context that will greatly assist machine indexing and also help with human indexing.
     • If data is on a homogeneous topic, such as a database of articles all about nuclear physics, it will be easier to index than a database covering all disciplines and topics.
  5. The human indexing process essentially involves three simple steps:
     • Review the content one article / record at a time.
     • Search the controlled vocabularies to find the terms that best describe the content.
     • Either tag the content directly by adding the terms as metadata values within the CMS, or assign the indexing terms to the content item by using a separate index table / interface.
  6–7. Most of our user base creates its taxonomies in Synaptica and then integrates them with third-party automatic indexing tools. Others have determined that they need to perform human indexing, and over the years they have developed a wish list of time-saving tools. (See bullets for wish list.)
  8. Ten years ago the Synaptica software team productized this wish list and bundled all these features into a Synaptica package called IMS. IMS – the Indexing Management System – acts as an integration toolset between the taxonomy management system and content management system. It provides ready-made GUI screens, and also a suite of web services components that allow indexing functionality to be custom crafted into the CMS screenflow.
  9. This slide illustrates the workflow for IMS as a component that sits between a CMS and a taxonomy management system to assist the human indexing process.
  10. This screenshot illustrates how indexing profiles can be created to streamline the indexing operation for particular sets of content. Many parameters can be configured, such as user-access permissions, term expansion, access to particular vocabularies and facets, and even the selection of individual sub-branches within a hierarchy.
  11. We are actively working with a number of clients who are performing human indexing for selected data sets. Following are three “hypothetical” but realistic examples.
  12. Conclusions: (see bullets for conclusions)
  13. Questions?