Tools for Taxonomies Enterprise Search Summit, May 11, 2010 Heather Hedden Taxonomy Consultant Earley & Associates
About Heather Hedden Taxonomy consultant, Earley & Associates Indexer, Hedden Information Management Instructor, Continuing Education, Simmons College Graduate School of Library & Information Science Formerly taxonomist at Viziant Corporation and Thomson Learning (Gale) Author, The Accidental Taxonomist (Information Today, Inc., May 2010)
What are Taxonomies? Controlled list of terms (words or phrases) for concepts Defined/scoped meaning for each concept Equivalent words or phrases (as synonyms) for each concept Terms are usually arranged in a structured hierarchy and/or grouped by type (facet) to facilitate navigation to find the right term
What are Taxonomy Tools? No authoritative industry list of taxonomy software “Taxonomy software” can mean different things Auto-categorization vs. taxonomy management Existing Web lists are miscellaneous taxonomy-related tools or out-of-date http://taxocop.wikispaces.com/TaxoTools www.taxotips.com/resources/tools www.searchtools.com/info/classifiers-tools.html www.willpowerinfo.co.uk/thessoft.htm
Taxonomy Tool Types Thesaurus/ontology management software Other software with thesaurus/taxonomy modules Auto-categorization/text mining software Other software supporting creating taxonomies mindmapping or concept modeling tools Cardsorting tools Web analytics
Tools Used by “Taxonomists”
Thesaurus Management Software Basics Maintains terms and their relationships (equivalencies, hierarchical, and associative) As reciprocals When renaming, merging, subsuming, or deleting terms Disallows invalid relationships (according to standards) Supports term notes and other attributes for terms Supports candidate/approved terms; includes term creation and update dates Generates reports in various thesaurus display formats (hierarchical, alphabetical) Exports data in interoperable formats for importing into a content management, indexing, search, retrieval system Supports thesaurus standards: ANSI/NISO Z39.19 or ISO 2788
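The two core behaviors on this slide, keeping relationships reciprocal and rejecting invalid ones, can be illustrated with a minimal Python sketch. The class and method names (`Thesaurus`, `add`, `ancestors`) are hypothetical and not taken from any product named in this deck; the relationship codes BT/NT/RT/USE/UF follow conventional thesaurus notation, and the validity checks shown (no self-reference, no conflicting relationship between the same pair, no hierarchical cycles) are only a small, assumed subset of what the standards describe.

```python
from collections import defaultdict

# Each relationship type and its reciprocal:
# BT/NT (hierarchical), RT (associative), USE/UF (equivalence).
RECIPROCAL = {"BT": "NT", "NT": "BT", "RT": "RT", "USE": "UF", "UF": "USE"}


class Thesaurus:
    """Hypothetical in-memory thesaurus that stores relationships as reciprocals."""

    def __init__(self):
        # term -> relationship type -> set of target terms
        self.rels = defaultdict(lambda: defaultdict(set))

    def ancestors(self, term):
        """Return every term reachable from `term` by following BT links."""
        seen, stack = set(), [term]
        while stack:
            for parent in self.rels[stack.pop()]["BT"]:
                if parent not in seen:
                    seen.add(parent)
                    stack.append(parent)
        return seen

    def add(self, source, rel, target):
        """Add `source rel target` plus its reciprocal, or raise ValueError."""
        if rel not in RECIPROCAL:
            raise ValueError(f"unknown relationship type: {rel}")
        if source == target:
            raise ValueError("a term cannot be related to itself")
        # Reject a second, different relationship between the same two terms.
        for existing, targets in self.rels[source].items():
            if target in targets and existing != rel:
                raise ValueError(f"{source} already has {existing} {target}")
        # Reject hierarchical cycles: a child may not be an ancestor of its parent.
        if rel in ("BT", "NT"):
            child, parent = (source, target) if rel == "BT" else (target, source)
            if child in self.ancestors(parent):
                raise ValueError(f"{child} is already broader than {parent}")
        self.rels[source][rel].add(target)
        self.rels[target][RECIPROCAL[rel]].add(source)
```

Adding `Dogs BT Mammals` through `add` also records `Mammals NT Dogs`, so the editor never has to maintain both directions by hand.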
Thesaurus Management Software Feature Comparisons interface design and ease of use multiple taxonomy display options term searching spell-checking speed (limited mouse clicks) for repeated term and relationship additions single-step new term & relationship creation single-step branch (term and narrower terms) moving drag & drop relationship adding user-defined (customizable) relationships user-defined term notes and term attributes bilingual or multilingual taxonomy support importing and exporting formats connectors to enterprise search systems
Thesaurus vs. Ontology Software Ontologies additionally have: Classes for terms  Customizable semantic relationships between terms (hierarchical and associative), dependent on class W3C or RDF standard outputs (for OWL)
Ontology software
Thesaurus Management Software MultiTes Pro Multisystems (Miami, FL) www.multites.com Since 1983. Hector Echeverria, president. $295 single user; $1295 for 5 users; $2495 for 10 users; $3950 enterprise deployment Add-on products: web development kit, enterprise development kit Imports text; exports text, HTML (as a web page), XML, CSV  Free limited-time downloadable trial and online tutorial Online discussion group for tips
Thesaurus Management Software Cognatrix LGOSystems Pty. Ltd. (Australia) www.cognatrix.com For Mac OS X 10.4.5 and later  US $499, or $199 for an “Education” version limited to 500 terms. Imports from plain text with tab separations. With CognatrixImporter add-on, imports from various XML schemas: Cognatrix, MultiTes, Term Tree, and Zthes. Exports to XML and HTML. Free limited-time downloadable trial and manual
Thesaurus Management Software One-2-One This to That Pty. Ltd. t/a A.C.S. Active Classification Solutions (Australia) www.acs121.com Price: $800 Australian (approximately $700-750) For thesauri and classification systems, but also has features and connectors for records management Replaces a previous thesaurus-only product, Term Tree Drag-and-drop hierarchy feature Free limited-use and limited-size trial
Thesaurus Management Software TheW32 Tim Craven Freeware (Ontario, Canada) http://publish.uwo.ca/~craven/freeware.htm Started as thesaurus feature in his TEXNET auto-abstracting tool in early 1990s.  Free Also provides web site indexing software XRefHT and machine-aided indexing and abstracting software
Thesaurus Management Software TheW32 Interface
Thesaurus Management Software High-end, multi-user client-server, large-scale systems ($3000/single user - $75,000+; or annual hosted options): Data Harmony, Synaptica, Smartlogic, Wordmap, SchemaLogic, STAR/Thesaurus, SoutronTHESAURUS, Mondeca ITM T3, PoolParty, TheMa, a.k.a.
Thesaurus Management Software Data Harmony Thesaurus Master Access Innovations (Albuquerque, NM) www.dataharmony.com Indexing services since 1978, commercial software (originally used in-house) offered since 1998.  Multi-platform Java-based (used on Windows, Mac, Solaris, Linux). Client software allows remote access. All standard thesaurus display types as view options User-defined associative and equivalence relationships, but no user-defined hierarchical relationships Sold separately or combined with M.A.I. (Machine Aided Indexer) as MAIstro. Other software extensions available.
Thesaurus Management Software Synaptica Synaptica Software LLC (Franktown, CO) http://synapticasoftware.com Since 1995. Owned by Dow Jones 2005-2009. Web browser-based, priced per user, per year, per vocabulary 12 graduations of permission levels Can assign relationship weights Global term and relationships editor, creating a list of terms to edit Side-by-side editor with drag-and-drop Imports: CSV, text, MS Excel, XML (including schemas of ZThes, RDF, SKOS, and OWL)  Exports: CSV, HTML, MS Word, MS Excel, XML (including schemas of ZThes, RDF, SKOS, and OWL)
There are also browse options including a Tree Browse and Alpha-numeric browse option to review terms.
Thesaurus Management Software Semaphore Ontology Manager Smartlogic Semaphore Ltd. (London, UK) + US office www.smartlogic.com Supports creating thesauri according to ISO 2788 standard Supports creating ontologies, through customizable relationships and user-created classes User-defined term attributes and metadata Multiple user access/privilege levels Imports/exports in CSV, XML, Zthes, SQL databases, and MultiTes files Related products: Classification Server for automated classification Ontology Service for a navigation system
Thesaurus Management Software Wordmap Taxonomy Manager Wordmap Inc. (Concord, MA) www.wordmap.com In UK since 1998. Acquired by Earley & Associates in 2007. Multi-platform java-based One of a suite of products including Wordmap Intelligent Text Classifier, Taxonomy Connectors for SharePoint and Endeca. User-defined relationships; can also turn on or off relationship name display. Can display two taxonomies side by side and drag and drop. User access/privileges can be set at the individual node level. Imports: CSV, Excel, XML; Exports: XML Real-time access: Java API
Thesaurus Management Software SchemaLogic Enterprise Suite SchemaLogic Inc. (Kirkland, WA) www.schemalogic.com Provides thesaurus management according to ANSI/NISO standards, plus broader structural metadata support Can create customizable relationships Can assign various permission levels to vocabularies or terms Classification module supports 3rd party auto-indexing Connectors to SharePoint, EMC Documentum, and FAST ESP Can import CSV or XML files
Thesaurus Management Software STAR/Thesaurus Cuadra Associates, Inc. (Los Angeles) www.cuadra.com  Stand-alone or integrates with STAR family of products for records management, collections management, archives management, DAM Supports standard thesaurus relationships but not customizable relationships Supports unlimited user-defined notes and categories  Various output report display formats Import/export ASCII text and CSV, but not XML
Thesaurus Management Software SoutronTHESAURUS Soutron Ltd. (United Kingdom) www.soutron.com Markets in the U.S. through partnership with InMagic Stand-alone or integrates with SoutronGLOBAL or SoutronSOLO library management systems, or with InMagic Presto social knowledge management  software Supports standard thesaurus and user-defined relationships Supports term merging Imports from XML; exports to XML or CSV
Thesaurus Management Software Mondeca ITM T3 (Intelligent Topic Manager: Thesaurus, Taxonomies, Terminologies) Mondeca S.A. (Paris, France) www.mondeca.com Since 2008, addition to Intelligent Topic Manager set of products for knowledge management, semantic portals, and e-catalogs web-based collaborative application conforms to both SKOS vocabularies and OWL-standard ontologies  connectors to text mining, classification, and search tools Imports/exports XML, RDF, and SKOS
Thesaurus Management Software PoolParty punkt. netServices GmbH (Vienna, Austria) http://poolparty.punkt.at A new thesaurus tool built on W3C Semantic Web Standards: SKOS, RDF, OWL, SPARQL Installed server or web-hosted options Can link domain-specific thesauri to Linked Open Data Wordpress plugin to build glossaries for blogs Imports ZThes XML, CSV Integrated text extraction and semi-automatic tagging to enable semantic search
Ontology Software Tools for ontologies, not thesauri TopBraid Composer, www.topquadrant.com Altova SemanticWorks, www.altova.com/semanticworks.html Protégé, http://protege.stanford.edu SMORE, www.mindswap.org/2005/SMORE/ SWOOP, http://code.google.com/p/swoop CMAP Tools Ontology Editor, http://cmap.ihmc.us
Other Software with Taxonomy/thesaurus Creation & Editing Components Metadata or cataloging software, especially for archives and libraries Adlib Information Systems, www.adlibsoft.com Content management and document management systems Open Text Collections Server Webtop Thesaurus Manager, www.opentext.com Records management systems a.k.a. from Synercon Management Consulting, www.a-k-a.com.au Auto-categorization and enterprise search software
Auto-categorization and Text Mining Auto-categorization Algorithms, statistics, and training documents – utilize a large set of sample documents per taxonomy term to “train” the system to learn to index Rules base – generate and edit or write “rules” for each term based on co-existing words, proximity, Boolean logic, etc. Text mining Extracts relevant terms from texts to generate a candidate taxonomy or supplement an existing taxonomy
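The rules-base approach can be sketched in a few lines of Python, assuming each taxonomy term carries one hand-written rule that combines co-occurring words, proximity, and Boolean logic. The terms, the rules, and the helper names (`tokens`, `near`, `categorize`) are invented for illustration and do not reflect the rule syntax of any product in this deck.

```python
import re


def tokens(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def near(words, a, b, window):
    """True if words `a` and `b` occur within `window` positions of each other."""
    pos_a = [i for i, w in enumerate(words) if w == a]
    pos_b = [i for i, w in enumerate(words) if w == b]
    return any(abs(i - j) <= window for i in pos_a for j in pos_b)


# Hypothetical rules: one Boolean test per taxonomy term.
RULES = {
    # co-occurrence with Boolean AND / OR
    "Taxonomy software": lambda w: "taxonomy" in w and ("software" in w or "tool" in w),
    # proximity: "search" within two words of "engine"
    "Search engines": lambda w: near(w, "search", "engine", 2),
    # Boolean NOT to exclude an unwanted sense
    "Indexing": lambda w: "indexing" in w and "database" not in w,
}


def categorize(text):
    """Return the taxonomy terms whose rule fires for this text."""
    words = tokens(text)
    return sorted(term for term, rule in RULES.items() if rule(words))
```

For example, `categorize("This taxonomy tool feeds an enterprise search engine.")` fires the rules for "Search engines" and "Taxonomy software". A training-based categorizer would replace these hand-written tests with a statistical model learned from sample documents per term.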
Auto-categorization/Search Software Auto-categorization, text mining, and search systems that utilize taxonomies handle these taxonomies in different ways: With pre-installed taxonomies that cannot be edited With pre-installed taxonomies that the user may edit and extend through the user interface Automatically generate a taxonomy that can be edited Support the import of taxonomies but do not support the editing of those taxonomies Support the import of taxonomies and then the editing of those taxonomies Various combinations of above
Auto-categorization/Search Software Software that can import and use taxonomies but lacking user-interface features to edit those taxonomies includes: Microsoft SharePoint IBM Classification Module Fast Endeca Temis Vivisimo Mindbreeze Exalead PerfectSearch They collaborate with other vendors that develop taxonomies and/or have taxonomy editing capabilities.
Auto-categorization/Search Software Examples of tools with some taxonomy management capabilities: Inxight SmartDiscovery Analysis Server	www.inxightfedsys.com Autonomy Collaborative Classifier	www.autonomy.com Autonomy Interwoven MetaTagger	www.interwoven.com Lexalytics Classifier	www.lexalytics.com Conceptsearching	www.conceptsearching.com
Auto-categorization/Search Software Examples of auto-categorization tools with full thesaurus management capabilities: Data Harmony MAIstro www.dataharmony.com/products/maistro.html  Smartlogic Semaphore Classification Server www.smartlogic.com  Wordmap Intelligent Text Classifier www.wordmap.com  SAS Enterprise Content Categorization (formerly Teragram TK240) www.sas.com/text-analytics Nstein Text Mining Engine (part of Open Text) www.nstein.com
Auto-categorization Software combined with Thesaurus Management Data Harmony MAIstro (combines Data Harmony Thesaurus Master and Machine-Aided Indexer) Automatically creates a basic rule for every term and its variants in the Thesaurus Master’s thesaurus Rules may be edited and additional rules can be manually written Statistics module tracks the editor’s term choices and compares them with M.A.I. term suggestions, sorting them as hits, misses, and noise to guide and prioritize the editor’s fine-tuning of rules Can be used for machine-aided indexing or fully automated indexing Connectors to SharePoint and search engines
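The hits/misses/noise comparison can be sketched as simple set arithmetic. This is an assumed reading of the three labels, not the product's documented logic: a hit is a term both the editor and the system chose, a miss is a term the editor chose that the system did not suggest, and noise is a suggestion the editor did not accept. The function name `compare` and the sample terms are hypothetical.

```python
def compare(editor_terms, suggested_terms):
    """Sort index terms into hits, misses, and noise (assumed definitions)."""
    editor, suggested = set(editor_terms), set(suggested_terms)
    return {
        "hits": sorted(editor & suggested),    # suggested and accepted
        "misses": sorted(editor - suggested),  # editor had to add these
        "noise": sorted(suggested - editor),   # suggested but rejected
    }


result = compare(
    editor_terms=["Thesauri", "Indexing", "Metadata"],
    suggested_terms=["Thesauri", "Indexing", "Libraries"],
)
```

Under these definitions, the terms that most often appear under misses or noise are the ones whose rules are the first candidates for fine-tuning.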
Auto-categorization Software combined with Thesaurus Management Smartlogic Semaphore Classification Server (connects with Semaphore Ontology Manager) Creates classification rulebases directly from a taxonomy/thesaurus/ontology, and applies these rules to content as it is received to automatically classify content. Rules are based on the term, its equivalencies, and broader/ narrower/related terms Employs 20 different kinds of rules Rules have weights and scores Variants based on spelling, plurals, and stemming may also be considered. Manual rules can take precedence over generated rules.
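The idea of generating a weighted rule from a term, its equivalents, and its broader/narrower/related terms can be illustrated with a rough sketch. Everything here is assumed for illustration: the weight values, the entry layout, and the function names `build_rule` and `score` are hypothetical, and the single phrase-match rule type shown stands in for the many rule kinds the slide mentions.

```python
import re

# Hypothetical weights for each kind of evidence drawn from the thesaurus.
WEIGHTS = {
    "preferred": 1.0,
    "equivalent": 0.75,
    "narrower": 0.5,
    "broader": 0.25,
    "related": 0.25,
}


def build_rule(entry):
    """Turn one thesaurus entry into a list of (phrase, weight) pairs."""
    rule = [(entry["preferred"].lower(), WEIGHTS["preferred"])]
    for kind in ("equivalent", "narrower", "broader", "related"):
        for phrase in entry.get(kind, []):
            rule.append((phrase.lower(), WEIGHTS[kind]))
    return rule


def score(rule, text):
    """Sum the weights of every whole-phrase match found in the text."""
    text = text.lower()
    total = 0.0
    for phrase, weight in rule:
        pattern = r"\b" + re.escape(phrase) + r"\b"
        total += weight * len(re.findall(pattern, text))
    return total


entry = {
    "preferred": "Thesauri",
    "equivalent": ["controlled vocabularies"],
    "narrower": ["multilingual thesauri"],
}
rule = build_rule(entry)
```

A document would then be assigned the term when its score passes a chosen threshold. A manually written rule could be given precedence by checking it before the generated one.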
Auto-categorization Software combined with Thesaurus Management Wordmap Intelligent Text Classifier (connects with Wordmap Taxonomy Manager for leveraging thesaurus) Auto-classification based on a statistical method using Support Vector Machine algorithms and machine learning with training documents Pre-packaged with statistical algorithms based on a generic taxonomy, the U.K.’s Integrated Public Services Vocabulary (IPSV), for which each of hundreds of terms has already been “trained” with representative documents  Wordmap also offers Taxonomy Connectors for taxonomy-driven tagging and search within SharePoint and Endeca
Auto-categorization Software combined with Thesaurus Management SAS Enterprise Content Categorization (ECC) (Formerly Teragram TK240 Taxonomy Manager) Supports taxonomy building or connects with SAS Ontology Manager ECC supports equivalent, hierarchical & related relationships Ontology Manager supports customized relationships and attributes Utilizes both auto-categorization and entity/concept extraction Auto-categorization is based on rules Rules-writing supported with a graphical tree view of Boolean operators and commands User can define weighting of terms
Auto-categorization Software combined with Thesaurus Management Open Text Nstein Text Mining Engine (TME) Modules include concept extractor, entity extractor, auto-categorizer, automated abstract creation, sentiment analysis Taxonomy Manager module (not sold separately) supports creating & editing hierarchical, associative and equivalence relationships according to ANSI/NISO standard Auto-categorization technology based on use of training sets for taxonomy terms, combined with concept extraction technology Ships with pre-installed taxonomies already “trained” for auto-categorization
Concluding Remarks Some “taxonomy tools” are stronger in taxonomy/thesaurus/ontology management. Some “taxonomy tools” are stronger in auto-categorization. A few tools combine both, but vendor partnerships and connectors can also achieve high results.
Questions/Contact/More Info Heather Hedden Earley & Associates 978-371-0822 (direct) 978-467-5195 (mobile) heatherh@earley.com www.earley.com
Tools for Taxonomies

Weitere ähnliche Inhalte

Was ist angesagt? (20)

Numerical Taxonomy
Numerical Taxonomy Numerical Taxonomy
Numerical Taxonomy
 
Icbn
IcbnIcbn
Icbn
 
APG system of classification.pptx
APG system of classification.pptxAPG system of classification.pptx
APG system of classification.pptx
 
PLANT TAXONOMY/ PLANT SYSTEMATIC
PLANT TAXONOMY/ PLANT SYSTEMATICPLANT TAXONOMY/ PLANT SYSTEMATIC
PLANT TAXONOMY/ PLANT SYSTEMATIC
 
Numerical taxonomy_Plant Taxonomy
Numerical taxonomy_Plant TaxonomyNumerical taxonomy_Plant Taxonomy
Numerical taxonomy_Plant Taxonomy
 
TAKHTAJAN SYSTEM OF CLASSIFICATION OF PLANTS
TAKHTAJAN SYSTEM OF CLASSIFICATION OF PLANTSTAKHTAJAN SYSTEM OF CLASSIFICATION OF PLANTS
TAKHTAJAN SYSTEM OF CLASSIFICATION OF PLANTS
 
Family Magnoliaceae
Family MagnoliaceaeFamily Magnoliaceae
Family Magnoliaceae
 
Botany:Pentoxylales
Botany:PentoxylalesBotany:Pentoxylales
Botany:Pentoxylales
 
Chemotaxonomy
ChemotaxonomyChemotaxonomy
Chemotaxonomy
 
Fossil angiosperms
Fossil angiospermsFossil angiosperms
Fossil angiosperms
 
Herbarium Techniques
Herbarium TechniquesHerbarium Techniques
Herbarium Techniques
 
Angiosperm phylogeny grouping I (APG I)
Angiosperm phylogeny grouping I (APG I)Angiosperm phylogeny grouping I (APG I)
Angiosperm phylogeny grouping I (APG I)
 
Numerical taxonomy
Numerical taxonomyNumerical taxonomy
Numerical taxonomy
 
Numerical taxonomy
Numerical taxonomyNumerical taxonomy
Numerical taxonomy
 
Valid publication & principle of priority
Valid publication & principle of priorityValid publication & principle of priority
Valid publication & principle of priority
 
Flora, Revision and Monograph
Flora, Revision and  MonographFlora, Revision and  Monograph
Flora, Revision and Monograph
 
Angiosperm phylogenic group(apg) iii
Angiosperm phylogenic group(apg) iiiAngiosperm phylogenic group(apg) iii
Angiosperm phylogenic group(apg) iii
 
Adaptations of epiphytes and halophytes
Adaptations of       epiphytes and halophytesAdaptations of       epiphytes and halophytes
Adaptations of epiphytes and halophytes
 
Angiosperms
AngiospermsAngiosperms
Angiosperms
 
Numerical taxonomy
Numerical taxonomyNumerical taxonomy
Numerical taxonomy
 

Ähnlich wie Tools for Taxonomies

SharePoint Connections Coast to Coast Overview of Enterprise Content Management
SharePoint Connections Coast to Coast Overview of Enterprise Content ManagementSharePoint Connections Coast to Coast Overview of Enterprise Content Management
SharePoint Connections Coast to Coast Overview of Enterprise Content ManagementIvan Sanders
 
ECM And Enterprise Metadata in SharePoint 2010
ECM And Enterprise Metadata in SharePoint 2010ECM And Enterprise Metadata in SharePoint 2010
ECM And Enterprise Metadata in SharePoint 2010Phuong Nguyen
 
Semantics In Declarative Systems
Semantics In Declarative SystemsSemantics In Declarative Systems
Semantics In Declarative SystemsOptum
 
Essential Tools Of An Xml Workflow2003comp
Essential Tools Of An Xml Workflow2003compEssential Tools Of An Xml Workflow2003comp
Essential Tools Of An Xml Workflow2003compljnd
 
Semantic Web in Action: Ontology-driven information search, integration and a...
Semantic Web in Action: Ontology-driven information search, integration and a...Semantic Web in Action: Ontology-driven information search, integration and a...
Semantic Web in Action: Ontology-driven information search, integration and a...Amit Sheth
 
Office 2.0 at GSA OCIO Offsite
Office 2.0 at GSA OCIO OffsiteOffice 2.0 at GSA OCIO Offsite
Office 2.0 at GSA OCIO OffsiteGeorge Thomas
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Looking Under the Hood -- Australia SharePoint Conference
Looking Under the Hood -- Australia SharePoint ConferenceLooking Under the Hood -- Australia SharePoint Conference
Looking Under the Hood -- Australia SharePoint ConferenceChristian Buckley
 
Business Strategies for Content Management - Part 3: Publishing Web Content U...
Business Strategies for Content Management - Part 3: Publishing Web Content U...Business Strategies for Content Management - Part 3: Publishing Web Content U...
Business Strategies for Content Management - Part 3: Publishing Web Content U...TJ O'Connor
 
Using metadata repositories with search
Using metadata repositories with searchUsing metadata repositories with search
Using metadata repositories with searchJean Graef
 
2010 tool forum ata handout
2010 tool forum ata handout2010 tool forum ata handout
2010 tool forum ata handoutascetlan
 
TSPUG: Content Management in SharePoint 2010
TSPUG: Content Management in SharePoint 2010TSPUG: Content Management in SharePoint 2010
TSPUG: Content Management in SharePoint 2010Eli Robillard
 
How your metadata strategy impacts everything you do
How your metadata strategy impacts everything you doHow your metadata strategy impacts everything you do
How your metadata strategy impacts everything you doChristian Buckley
 
Looking Under the Hood: How Your Metadata Strategy Impacts Everything You Do
Looking Under the Hood: How Your Metadata Strategy Impacts Everything You DoLooking Under the Hood: How Your Metadata Strategy Impacts Everything You Do
Looking Under the Hood: How Your Metadata Strategy Impacts Everything You DoChristian Buckley
 
Eol Drupal Dman Presentation
Eol   Drupal   Dman PresentationEol   Drupal   Dman Presentation
Eol Drupal Dman PresentationDavid Shorthouse
 

Ähnlich wie Tools for Taxonomies (20)

SharePoint Connections Coast to Coast Overview of Enterprise Content Management
SharePoint Connections Coast to Coast Overview of Enterprise Content ManagementSharePoint Connections Coast to Coast Overview of Enterprise Content Management
SharePoint Connections Coast to Coast Overview of Enterprise Content Management
 
ECM And Enterprise Metadata in SharePoint 2010
ECM And Enterprise Metadata in SharePoint 2010ECM And Enterprise Metadata in SharePoint 2010
ECM And Enterprise Metadata in SharePoint 2010
 
Semantics In Declarative Systems
Semantics In Declarative SystemsSemantics In Declarative Systems
Semantics In Declarative Systems
 
User-Driven Taxonomies
User-Driven TaxonomiesUser-Driven Taxonomies
User-Driven Taxonomies
 
Essential Tools Of An Xml Workflow2003comp
Essential Tools Of An Xml Workflow2003compEssential Tools Of An Xml Workflow2003comp
Essential Tools Of An Xml Workflow2003comp
 
Semantic Web in Action: Ontology-driven information search, integration and a...
Semantic Web in Action: Ontology-driven information search, integration and a...Semantic Web in Action: Ontology-driven information search, integration and a...
Semantic Web in Action: Ontology-driven information search, integration and a...
 
Office 2.0 at GSA OCIO Offsite
Office 2.0 at GSA OCIO OffsiteOffice 2.0 at GSA OCIO Offsite
Office 2.0 at GSA OCIO Offsite
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Looking Under the Hood -- Australia SharePoint Conference
Looking Under the Hood -- Australia SharePoint ConferenceLooking Under the Hood -- Australia SharePoint Conference
Looking Under the Hood -- Australia SharePoint Conference
 
Ontology
OntologyOntology
Ontology
 
Business Strategies for Content Management - Part 3: Publishing Web Content U...
Business Strategies for Content Management - Part 3: Publishing Web Content U...Business Strategies for Content Management - Part 3: Publishing Web Content U...
Business Strategies for Content Management - Part 3: Publishing Web Content U...
 
Data Harmony Version 3.9 Features Update
Data Harmony Version 3.9 Features UpdateData Harmony Version 3.9 Features Update
Data Harmony Version 3.9 Features Update
 
Using metadata repositories with search
Using metadata repositories with searchUsing metadata repositories with search
Using metadata repositories with search
 
2010 tool forum ata handout
2010 tool forum ata handout2010 tool forum ata handout
2010 tool forum ata handout
 
TSPUG: Content Management in SharePoint 2010
TSPUG: Content Management in SharePoint 2010TSPUG: Content Management in SharePoint 2010
TSPUG: Content Management in SharePoint 2010
 
How your metadata strategy impacts everything you do
How your metadata strategy impacts everything you doHow your metadata strategy impacts everything you do
How your metadata strategy impacts everything you do
 
Looking Under the Hood: How Your Metadata Strategy Impacts Everything You Do
Looking Under the Hood: How Your Metadata Strategy Impacts Everything You DoLooking Under the Hood: How Your Metadata Strategy Impacts Everything You Do
Looking Under the Hood: How Your Metadata Strategy Impacts Everything You Do
 
Share point summit_2010_lemieux-toc
Share point summit_2010_lemieux-tocShare point summit_2010_lemieux-toc
Share point summit_2010_lemieux-toc
 
KMA Webinar: Managed Metadata Services in SharePoint 2010
KMA Webinar: Managed Metadata Services in SharePoint 2010KMA Webinar: Managed Metadata Services in SharePoint 2010
KMA Webinar: Managed Metadata Services in SharePoint 2010
 
Eol Drupal Dman Presentation
Eol   Drupal   Dman PresentationEol   Drupal   Dman Presentation
Eol Drupal Dman Presentation
 

Mehr von Earley Information Science

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
EIS-Webinar-Info-Governance-Age-AI-2024-02-27-for-distr.pdf
EIS-Webinar-Info-Governance-Age-AI-2024-02-27-for-distr.pdfEIS-Webinar-Info-Governance-Age-AI-2024-02-27-for-distr.pdf
EIS-Webinar-Info-Governance-Age-AI-2024-02-27-for-distr.pdfEarley Information Science
 
Reducing Returns to Increase Margin Through Better Product Data
Reducing Returns to Increase Margin Through Better Product DataReducing Returns to Increase Margin Through Better Product Data
Reducing Returns to Increase Margin Through Better Product DataEarley Information Science
 
EIS-Webinar-Silabs-KM-Content-Program-2023-06-07.pdf
EIS-Webinar-Silabs-KM-Content-Program-2023-06-07.pdfEIS-Webinar-Silabs-KM-Content-Program-2023-06-07.pdf
EIS-Webinar-Silabs-KM-Content-Program-2023-06-07.pdfEarley Information Science
 
EIS-Webinar-MDM-Personalization-2023-03-15.pdf
EIS-Webinar-MDM-Personalization-2023-03-15.pdfEIS-Webinar-MDM-Personalization-2023-03-15.pdf
EIS-Webinar-MDM-Personalization-2023-03-15.pdfEarley Information Science
 
Accelerating Product Data Programs with Pre-PIM Software
Accelerating Product Data Programs with Pre-PIM SoftwareAccelerating Product Data Programs with Pre-PIM Software
Accelerating Product Data Programs with Pre-PIM SoftwareEarley Information Science
 
What is PIM and Why Your Ecommerce Business Needs It
What is PIM and Why Your Ecommerce Business Needs ItWhat is PIM and Why Your Ecommerce Business Needs It
What is PIM and Why Your Ecommerce Business Needs ItEarley Information Science
 
How Successful B2B Brands Deliver Next-Level Digital Experiences
How Successful B2B Brands Deliver Next-Level Digital ExperiencesHow Successful B2B Brands Deliver Next-Level Digital Experiences
How Successful B2B Brands Deliver Next-Level Digital ExperiencesEarley Information Science
 
Unlock the Value of Data Discovery Using Knowledge Graphs and Hybrid AI
Unlock the Value of Data Discovery Using Knowledge Graphs and Hybrid AIUnlock the Value of Data Discovery Using Knowledge Graphs and Hybrid AI
Unlock the Value of Data Discovery Using Knowledge Graphs and Hybrid AIEarley Information Science
 
Webinar: Powering Personalized Search with Knowledge Graphs
Webinar: Powering Personalized Search with Knowledge GraphsWebinar: Powering Personalized Search with Knowledge Graphs
Webinar: Powering Personalized Search with Knowledge GraphsEarley Information Science
 
EIS Webinar: Building the AI Powered Enterprise
EIS Webinar: Building the AI Powered EnterpriseEIS Webinar: Building the AI Powered Enterprise
EIS Webinar: Building the AI Powered EnterpriseEarley Information Science
 
EIS Webinar: The Knowledge Management Imperative - KM Essential to AI
EIS Webinar: The Knowledge Management Imperative - KM Essential to AIEIS Webinar: The Knowledge Management Imperative - KM Essential to AI
EIS Webinar: The Knowledge Management Imperative - KM Essential to AIEarley Information Science
 
Using Product Data to Drive Chatbot Dialogs - GS1 2019
Using Product Data to Drive Chatbot Dialogs - GS1 2019Using Product Data to Drive Chatbot Dialogs - GS1 2019
Using Product Data to Drive Chatbot Dialogs - GS1 2019Earley Information Science
 
Prerequisites for Effective and Meaningful Automation
Prerequisites for Effective and Meaningful AutomationPrerequisites for Effective and Meaningful Automation
Prerequisites for Effective and Meaningful AutomationEarley Information Science
 
There's No AI Without IA (Information Architecture)
There's No AI Without IA (Information Architecture)There's No AI Without IA (Information Architecture)
There's No AI Without IA (Information Architecture)Earley Information Science
 
Streamlining Information Flows In The Digital Workplace
Streamlining Information Flows In The Digital WorkplaceStreamlining Information Flows In The Digital Workplace
Streamlining Information Flows In The Digital WorkplaceEarley Information Science
 

Mehr von Earley Information Science (20)

EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
EIS-Webinar-Info-Governance-Age-AI-2024-02-27-for-distr.pdf
EIS-Webinar-Info-Governance-Age-AI-2024-02-27-for-distr.pdfEIS-Webinar-Info-Governance-Age-AI-2024-02-27-for-distr.pdf
EIS-Webinar-Info-Governance-Age-AI-2024-02-27-for-distr.pdf
 
Reducing Returns to Increase Margin Through Better Product Data
Tools for Taxonomies

  • 1. Tools for Taxonomies. Enterprise Search Summit, May 11, 2010. Heather Hedden, Taxonomy Consultant, Earley & Associates
  • 2. About Heather Hedden Taxonomy consultant, Earley & Associates Indexer, Hedden Information Management Instructor, Continuing Education, Simmons College Graduate School of Library & Information Science Formerly taxonomist at Viziant Corporation and Thomson Learning (Gale) Author, The Accidental Taxonomist (Information Today, Inc., May 2010)
  • 4. What are Taxonomy Tools? No authoritative industry list of taxonomy software “Taxonomy software” can mean different things Auto-categorization vs. taxonomy management Existing Web lists are miscellaneous taxonomy-related tools or out-of-date http://taxocop.wikispaces.com/TaxoTools www.taxotips.com/resources/tools www.searchtools.com/info/classifiers-tools.html www.willpowerinfo.co.uk/thessoft.htm
  • 5. Taxonomy Tool Types Thesaurus/ontology management software Other software with thesaurus/taxonomy modules Auto-categorization/text mining software Other software supporting creating taxonomies mindmapping or concept modeling tools Cardsorting tools Web analytics
  • 6. Tools Used by “Taxonomists”
  • 7. Thesaurus Management Software Basics Maintains terms and their relationships (equivalencies, hierarchical, and associative) As reciprocals When renaming, merging, subsuming, or deleting terms Disallows invalid relationships (according to standards) Supports term notes and other attributes for terms Supports candidate/approved terms; includes term creation and update dates Generates reports in various thesaurus display formats (hierarchical, alphabetical) Exports data in interoperable formats for importing into a content management, indexing, search, retrieval system Supports thesaurus standards: ANSI/NISO Z39.19 or ISO 2788
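A minimal sketch of what "maintains relationships as reciprocals" and "disallows invalid relationships" can mean in practice. The `Thesaurus` class, its method names, and the example terms are hypothetical and do not reflect any vendor's data model; only the relationship labels (BT, NT, RT, USE, UF) follow common thesaurus usage.

```python
from collections import defaultdict

# Each relationship type and its reciprocal: BT = broader term, NT = narrower term,
# RT = related term, USE/UF = equivalence (non-preferred -> preferred and back).
RECIPROCAL = {"BT": "NT", "NT": "BT", "RT": "RT", "USE": "UF", "UF": "USE"}


class Thesaurus:
    """Hypothetical in-memory thesaurus, for illustration only."""

    def __init__(self):
        # term -> relationship type -> set of target terms
        self.rels = defaultdict(lambda: defaultdict(set))

    def ancestors(self, term):
        """Return every term reachable from `term` by following BT links."""
        seen, stack = set(), [term]
        while stack:
            for parent in self.rels[stack.pop()]["BT"]:
                if parent not in seen:
                    seen.add(parent)
                    stack.append(parent)
        return seen

    def add(self, source, rel, target):
        """Record `source rel target` together with its reciprocal."""
        if rel not in RECIPROCAL:
            raise ValueError(f"unknown relationship type: {rel}")
        if source == target:
            raise ValueError("a term cannot be related to itself")
        # Reject a second, different relationship between the same pair of terms.
        for existing, targets in self.rels[source].items():
            if target in targets and existing != rel:
                raise ValueError(f"{source} already has {existing} {target}")
        # Reject hierarchical relationships that would create a cycle.
        if rel in ("BT", "NT"):
            child, parent = (source, target) if rel == "BT" else (target, source)
            if child in self.ancestors(parent):
                raise ValueError(f"{parent} is already narrower than {child}")
        self.rels[source][rel].add(target)
        self.rels[target][RECIPROCAL[rel]].add(source)

    def delete(self, term):
        """Remove a term and clean up the reciprocal side of each relationship."""
        for rel, targets in self.rels.pop(term, {}).items():
            for target in targets:
                self.rels[target][RECIPROCAL[rel]].discard(term)
```

Adding `("Poodles", "BT", "Dogs")` with this sketch also records `Dogs NT Poodles`, and deleting `Dogs` removes the dangling `BT` link from `Poodles`.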
  • 8. Thesaurus Management Software Feature Comparisons interface design and ease of use multiple taxonomy display options term searching spell-checking speed (limited mouse clicks) for repeated term and relationship additions single-step new term & relationship creation single-step branch (term and narrower terms) moving drag & drop relationship adding user-defined (customizable) relationships user-defined term notes and term attributes bilingual or multilingual taxonomy support importing and exporting formats connectors to enterprise search systems
  • 11. Thesaurus Management Software MultiTes Pro Multisystems (Miami, FL) www.multites.com Since 1983. Hector Echeverria, president. $295 single user; $1295 for 5 users; $2495 for 10 users; $3950 enterprise deployment Add-on products: web development kit, enterprise development kit Imports text; exports text, HTML (as a web page), XML, CSV Free limited-time downloadable trial and online tutorial Online discussion group for tips
  • 13. Thesaurus Management Software Cognatrix LGOSystems Pty. Ltd. (Australia) www.cognatrix.com For Mac OS X 10.4.5 and later US $499, or $199 for an “Education” version limited to 500 terms. Imports from plain text with tab separations. With CognatrixImporter add-on, imports from various XML schemas: Cognatrix, MultiTes, Term Tree, and Zthes. Exports to XML and HTML. Free limited-time downloadable trial and manual
  • 15. Thesaurus Management Software One-2-One This to That Pty. Ltd. t/a A.C.S. Active Classification Solutions (Australia) www.acs121.com Price: $800 Australian (approximately $700-750) For thesauri and classification systems, but also has features and connectors for records management Replaces a previous thesaurus-only product, Term Tree Drag-and-drop hierarchy feature Free limited-use and limited-size trial
  • 17. Thesaurus Management Software TheW32 Tim Craven Freeware (Ontario, Canada) http://publish.uwo.ca/~craven/freeware.htm Started as thesaurus feature in his TEXNET auto-abstracting tool in early 1990s. Free Also provides web site indexing software XRefHT and machine-aided indexing and abstracting software
  • 18. Thesaurus Management Software TheW32 Interface
  • 19. Thesaurus Management Software High-end, multi-user client-server, large scale systems ($3000/single user - $75,000+; or annual hosted options): Data Harmony, Synaptica, Smartlogic, Wordmap, SchemaLogic, STAR/Thesaurus, SoutronTHESAURUS, Mondeca ITM T3, PoolParty, TheMa, a.k.a.
  • 20. Thesaurus Management Software Data Harmony Thesaurus Master Access Innovations (Albuquerque, NM) www.dataharmony.com Indexing services since 1978, commercial software (originally used in-house) offered since 1998. Multi-platform Java-based (used on Windows, Mac, Solaris, Linux). Client software allows remote access. All standard thesaurus display types as view options User-defined associative and equivalence relationships, but no user-defined hierarchical relationships Sold separately or combined with M.A.I. (Machine Aided Indexer) as MAIstro. Other software extensions available.
  • 23. Thesaurus Management Software Synaptica Synaptica Software LLC (Franktown, CO) http://synapticasoftware.com Since 1995. Owned by Dow Jones 2005-2009. Web browser-based, priced per user, per year, per vocabulary 12 graduations of permission levels Can assign relationship weights Global term and relationships editor, creating a list of terms to edit Side-by-side editor with drag-and-drop Imports: CSV, text, MS Excel, XML (including schemas of ZThes, RDF, SKOS, and OWL) Exports: CSV, HTML, MS Word, MS Excel, XML (including schemas of ZThes, RDF, SKOS, and OWL)
  • 24. There are also browse options including a Tree Browse and Alpha-numeric browse option to review terms.
  • 26. Thesaurus Management Software Semaphore Ontology Manager Smartlogic Semaphore Ltd. (London, UK) + US office www.smartlogic.com Supports creating thesauri according to ISO 2788 standard Supports creating ontologies, through customizable relationships and user-created classes User-defined term attributes and metadata Multiple user access/privilege levels Imports/exports in CSV, XML, Zthes, SQL databases, and MultiTes files Related products: Classification Server for automated classification; Ontology Service for a navigation system
  • 28. Thesaurus Management Software Wordmap Taxonomy Manager Wordmap Inc. (Concord, MA) www.wordmap.com In UK since 1998. Acquired by Earley & Associates in 2007. Multi-platform Java-based One of a suite of products including Wordmap Intelligent Text Classifier and Taxonomy Connectors for SharePoint and Endeca. User-defined relationships; can also turn on or off relationship name display. Can display two taxonomies side by side and drag and drop. User access/privileges can be set at the individual node level. Imports: CSV, Excel, XML; Exports: XML Real-time access: Java API
  • 31. Thesaurus Management Software SchemaLogic Enterprise Suite SchemaLogic Inc. (Kirkland, WA) www.schemalogic.com Provides thesaurus management according to ANSI/NISO standards, plus broader structural metadata support Can create customizable relationships Can assign various permission levels to vocabularies or terms Classification module supports 3rd party auto-indexing Connectors to SharePoint, EMC Documentum, and FAST ESP Can import CSV or XML files
  • 32. Thesaurus Management Software STAR/Thesaurus Cuadra Associates, Inc. (Los Angeles) www.cuadra.com Stand-alone or integrates with STAR family of products for records management, collections management, archives management, and DAM Supports standard thesaurus relationships but not customizable relationships Supports unlimited user-defined notes and categories Various output report display formats Import/export ASCII text and CSV, but not XML
  • 33. Thesaurus Management Software SoutronTHESAURUS Soutron Ltd. (United Kingdom) www.soutron.com Markets in the U.S. through partnership with InMagic Stand-alone or integrates with SoutronGLOBAL or SoutronSOLO library management systems, or with InMagic Presto social knowledge management software Supports standard thesaurus and user-defined relationships Supports term merging Imports from XML; exports to XML or CSV
  • 34. Thesaurus Management Software Mondeca ITM T3 (Intelligent Topic Manager: Thesaurus, Taxonomies, Terminologies) Mondeca S.A. (Paris, France) www.mondeca.com Since 2008, addition to Intelligent Topic Manager set of products for knowledge management, semantic portals, and e-catalogs web-based collaborative application conforms to both SKOS vocabularies and OWL-standard ontologies connectors to text mining, classification, and search tools Imports/exports XML, RDF, and SKOS
  • 35. Thesaurus Management Software PoolParty punkt. netServices GmbH (Vienna, Austria) http://poolparty.punkt.at A new thesaurus tool built on W3C Semantic Web Standards: SKOS, RDF, OWL, SPARQL Installed server or web-hosted options Can link domain-specific thesauri to Linked Open Data Wordpress plugin to build glossaries for blogs Imports ZThes XML, CSV Integrated text extraction and semi-automatic tagging to enable semantic search
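Several of the tools above import or export SKOS. As a rough illustration of what such an export contains, the sketch below writes a small concept set as SKOS in Turtle syntax using only string formatting. The function name, the input dictionary layout, and the `http://example.org/taxonomy/` namespace are assumptions made for this example; the `skos:` properties themselves are from the W3C SKOS vocabulary. Concept identifiers are assumed to be valid Turtle local names.

```python
def escape(label):
    """Escape backslashes and double quotes for a Turtle string literal."""
    return label.replace("\\", "\\\\").replace('"', '\\"')


def to_skos_turtle(concepts, base="http://example.org/taxonomy/"):
    """Serialize {id: {"pref": str, "alt": [...], "broader": [...]}} as Turtle.

    Hypothetical input layout; `base` is a placeholder namespace.
    """
    lines = [
        "@prefix skos: <http://www.w3.org/2004/02/skos/core#> .",
        f"@prefix ex: <{base}> .",
        "",
    ]
    for concept_id, concept in concepts.items():
        props = ["a skos:Concept", f'skos:prefLabel "{escape(concept["pref"])}"@en']
        # Equivalent (non-preferred) terms become alternative labels.
        props += [f'skos:altLabel "{escape(alt)}"@en' for alt in concept.get("alt", [])]
        # Broader terms become skos:broader links to other concepts.
        props += [f"skos:broader ex:{parent}" for parent in concept.get("broader", [])]
        lines.append(f"ex:{concept_id} " + " ;\n    ".join(props) + " .")
    return "\n".join(lines)


example = {
    "dogs": {"pref": "Dogs", "alt": ["Canines"]},
    "poodles": {"pref": "Poodles", "broader": ["dogs"]},
}
print(to_skos_turtle(example))
```

A production export would normally go through an RDF library rather than string formatting; the sketch only shows how preferred terms, equivalents, and broader terms map onto SKOS properties.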
  • 36. Ontology Software Tools for ontologies, not thesauri TopBraid Composer, www.topquadrant.com Altova SemanticWorks, www.altova.com/semanticworks.html Protégé, http://protege.stanford.edu SMORE, www.mindswap.org/2005/SMORE/ SWOOP, http://code.google.com/p/swoop CMAP Tools Ontology Editor, http://cmap.ihmc.us
  • 37. Other Software with Taxonomy/thesaurus Creation & Editing Components Metadata or cataloging software, especially for archives and libraries Adlib Information Systems, www.adlibsoft.com Content management and document management systems Open Text Collections Server Webtop Thesaurus Manager, www.opentext.com Records management systems a.k.a. from Synercon Management Consulting, www.a-k-a.com.au Auto-categorization and enterprise search software
  • 38. Auto-categorization and Text Mining Auto-categorization Algorithms, statistics, and training documents – utilize a large set of sample documents per taxonomy term to “train” the system to learn to index Rules base – generate and edit or write “rules” for each term based on co-existing words, proximity, Boolean logic, etc. Text mining Extracts relevant terms from texts to generate a candidate taxonomy or supplement an existing taxonomy
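To make the rules-based approach concrete, here is a minimal sketch of Boolean rules per taxonomy term: words that must all appear, words of which at least one must appear, and words that must not appear. The rule format, the term names, and the word lists are invented for illustration and are not taken from any product; proximity conditions are left out to keep the example short.

```python
import re

# Hypothetical rule base: one rule per taxonomy term.
RULES = {
    "Mergers & Acquisitions": {
        "any_of": {"merger", "acquisition", "acquired"},
        "none_of": {"rumor"},
    },
    "Taxonomies": {
        "all_of": {"taxonomy"},
        "any_of": {"thesaurus", "term", "hierarchy"},
    },
}


def tokenize(text):
    """Lower-case the text and return its set of alphabetic words."""
    return set(re.findall(r"[a-z]+", text.lower()))


def categorize(text, rules=RULES):
    """Return the taxonomy terms whose rule is satisfied by the text."""
    words = tokenize(text)
    matched = []
    for term, rule in rules.items():
        if not rule.get("all_of", set()) <= words:
            continue  # a required word is missing
        if rule.get("any_of") and not (rule["any_of"] & words):
            continue  # none of the alternative words is present
        if rule.get("none_of", set()) & words:
            continue  # an excluded word is present
        matched.append(term)
    return matched
```

With these example rules, a sentence mentioning "merger" is assigned to "Mergers & Acquisitions" unless it also contains "rumor".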
  • 39. Auto-categorization/Search Software Auto-categorization, text mining, and search systems that utilize taxonomies handle these taxonomies in different ways: With pre-installed taxonomies that cannot be edited With pre-installed taxonomies that the user may edit and extend through the user interface Automatically generate a taxonomy that can be edited Support the import of taxonomies but do not support the editing of those taxonomies Support the import of taxonomies and then the editing of those taxonomies Various combinations of above
  • 40. Auto-categorization/Search Software Software that can import and use taxonomies but lacking user-interface features to edit those taxonomies includes: Microsoft SharePoint, IBM Classification Module, FAST, Endeca, Temis, Vivisimo, Mindbreeze, Exalead, PerfectSearch. They collaborate with other vendors that develop taxonomies and/or have taxonomy editing capabilities.
  • 41. Auto-categorization/Search Software Examples of tools with some taxonomy management capabilities: Inxight SmartDiscovery Analysis Server www.inxightfedsys.com Autonomy Collaborative Classifier www.autonomy.com Autonomy Interwoven MetaTagger www.interwoven.com Lexalytics Classifier www.lexalytics.com Conceptsearching www.conceptsearching.com
  • 42. Auto-categorization/Search Software Examples of auto-categorization tools with full thesaurus management capabilities: Data Harmony MAIstro www.dataharmony.com/products/maistro.html Smartlogic Semaphore Classification Server www.smartlogic.com Wordmap Intelligent Text Classifier www.wordmap.com SAS Enterprise Content Categorization (formerly Teragram TK240) www.sas.com/text-analytics Nstein Text Mining Engine (part of Open Text) www.nstein.com
  • 43. Auto-categorization Software combined with Thesaurus Management Data Harmony MAIstro (combines Data Harmony Thesaurus Master and Machine-Aided Indexer) Automatically creates a basic rule for every term and its variants in the Thesaurus Master’s thesaurus Rules may be edited and additional rules can be manually written Statistics module tracks the editor’s term choices and compares them with M.A.I. term suggestions, sorting them as hits, misses, and noise to guide and prioritize the editor’s fine-tuning of rules Can be used for machine-aided indexing or fully automated indexing Connectors to SharePoint and search engines
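The hits/misses/noise comparison can be sketched with plain set operations. The definitions used here are an assumption about how the three categories are meant: a hit is a term both suggested and chosen by the editor, a miss is a term the editor chose that was not suggested, and noise is a suggested term the editor rejected. The function name is hypothetical and this is not the product's implementation.

```python
def compare_indexing(editor_terms, suggested_terms):
    """Sort terms into hits, misses, and noise (assumed definitions, see lead-in)."""
    editor, suggested = set(editor_terms), set(suggested_terms)
    return {
        "hits": sorted(editor & suggested),    # suggested and accepted
        "misses": sorted(editor - suggested),  # chosen by the editor, never suggested
        "noise": sorted(suggested - editor),   # suggested but rejected
    }


result = compare_indexing(
    editor_terms=["Taxonomies", "Thesauri", "Indexing"],
    suggested_terms=["Taxonomies", "Indexing", "Libraries"],
)
print(result)
```

Terms that show up repeatedly under misses or noise across many documents are natural candidates for rule fine-tuning.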
  • 44. Auto-categorization Software combined with Thesaurus Management Smartlogic Semaphore Classification Server (connects with Semaphore Ontology Manager) Creates classification rulebases directly from a taxonomy/thesaurus/ontology, and applies these rules to content as it is received to automatically classify content. Rules are based on the term, its equivalencies, and broader/ narrower/related terms Employs 20 different kinds of rules Rules have weights and scores Variants based on spelling, plurals, and stemming may also be considered. Manual rules can take precedence over generated rules.
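As a rough sketch of generating weighted rules directly from a taxonomy, the example below derives one rule set per term from its preferred label, equivalents, and narrower terms, then scores a text against those rules. The weights, the threshold, the function names, and the sample taxonomy are all hypothetical; the product's actual rule types and scoring are not described in enough detail here to reproduce.

```python
import re

# Hypothetical weights for evidence drawn from each part of a taxonomy entry.
WEIGHTS = {"preferred": 1.0, "equivalent": 0.8, "narrower": 0.5}


def build_rules(taxonomy):
    """Turn {term: {"equivalents": [...], "narrower": [...]}} into weighted phrases."""
    rules = {}
    for term, entry in taxonomy.items():
        phrases = {term.lower(): WEIGHTS["preferred"]}
        for phrase in entry.get("equivalents", []):
            phrases.setdefault(phrase.lower(), WEIGHTS["equivalent"])
        for phrase in entry.get("narrower", []):
            phrases.setdefault(phrase.lower(), WEIGHTS["narrower"])
        rules[term] = phrases
    return rules


def classify(text, rules, threshold=1.0):
    """Score each term by weighted whole-phrase matches; keep scores >= threshold."""
    lowered = text.lower()
    scores = {}
    for term, phrases in rules.items():
        score = sum(
            weight * len(re.findall(r"\b" + re.escape(phrase) + r"\b", lowered))
            for phrase, weight in phrases.items()
        )
        if score >= threshold:
            scores[term] = score
    return scores


taxonomy = {
    "Dogs": {"equivalents": ["canines"], "narrower": ["Poodles"]},
    "Cats": {"equivalents": ["felines"]},
}
rules = build_rules(taxonomy)
print(classify("Poodles and other canines need daily walks.", rules))
```

Manually written rules could be layered on top by letting them overwrite entries in the generated `rules` dictionary, which mirrors the idea that manual rules take precedence over generated ones.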
  • 45. Auto-categorization Software combined with Thesaurus Management Wordmap Intelligent Text Classifier (connects with Wordmap Taxonomy Manager for leveraging thesaurus) Auto-classification based on a statistical method using Support Vector Machine algorithms and machine learning with training documents Pre-packaged with statistical algorithms based on a generic taxonomy, the U.K.’s Integrated Public Services Vocabulary (IPSV), for which each of hundreds of terms has already been “trained” with representative documents Wordmap also offers Taxonomy Connectors for taxonomy-driven tagging and search within SharePoint and Endeca
  • 46. Auto-categorization Software combined with Thesaurus Management SAS Enterprise Content Categorization (ECC) (Formerly Teragram TK240 Taxonomy Manager) Supports taxonomy building or connects with SAS Ontology Manager ECC supports equivalent, hierarchical & related relationships Ontology Manager supports customized relationships and attributes Utilizes both auto-categorization and entity/concept extraction Auto-categorization is based on rules Rules-writing supported with a graphical tree view of Boolean operators and commands User can define weighting of terms
  • 47. Auto-categorization Software combined with Thesaurus Management Open Text Nstein Text Mining Engine (TME) Modules include concept extractor, entity extractor, auto-categorizer, automated abstract creation, sentiment analysis Taxonomy Manager module (not sold separately) supports creating & editing hierarchical, associative and equivalence relationships according to ANSI/NISO standard Auto-categorization technology based on use of training sets for taxonomy terms, combined with concept extraction technology Ships with pre-installed taxonomies already “trained” for auto-categorization
  • 48. Concluding Remarks Some “taxonomy tools” are stronger in taxonomy/thesaurus/ontology management. Some “taxonomy tools” are stronger in auto-categorization. A few tools combine both, but vendor partnerships and connectors can also achieve high results.
  • 49. Questions/Contact/More Info Heather Hedden Earley & Associates 978-371-0822 (direct) 978-467-5195 (mobile) heatherh@earley.com www.earley.com