Daniela Barbosa, Synaptica Business Development Manager, Dow Jones Client Solutions, Dow Jones & Company
Paula R McCoy, Manager, Taxonomy Development, ProQuest
Now that you have built your taxonomies, you need to manage and maintain them in a centralized environment that can be leveraged by all of your enterprise applications including search tools, portals, and CMS/DMS systems. This session will review some best practices in centralized taxonomy management and go through the implementation of a thesaurus management tool at ProQuest, which enabled them to create a common language to connect disparate information assets using large and varied vocabularies and authority files linked to new and existing editorial systems.
Centralized Taxonomy Management for Enterprise Information Systems
1. Centralized Taxonomy Management for Enterprise Information Systems Enterprise Search Summit Wednesday, September 24th, 2:00 pm – 2:30 pm Dow Jones Client Solutions ProQuest Synaptica Manager, Taxonomy Development [email_address] [email_address]
2.
3. Some Definitions A taxonomy is a hierarchical topic structure to which information can be assigned through the dual processes of classification (filing to a location) and categorisation (tagging with relevant metadata ). A taxonomy provides browsable navigation and supports filtered search ing A thesaurus is a controlled vocabulary linking an organisation’s common language to its taxonomy structure. It accommodates synonyms, acronyms, language variants and other near equivalences. It also signposts non-hierarchical linkages within and across the taxonomy facets. A thesaurus is usually employed to interpret and guide user search queries An ontology is the working model of entities and interactions in a particular domain of knowledge or content set. It is a set of concepts - such as things, events, and relations - that are specified in some way in order to create an agreed-upon vocabulary for exchanging information. An ontology is increasingly used to visualise (or map) a set of search results and discover new or hidden connections
4. Classic taxonomy … groups things or concepts into families SIDEWAYS Traditional thesaurus … captures the different names of the family members and explores some more distant associations (cousins & close friends) Multi- Directional Emerging ontology … shows a network of multi-dimensional relationships and properties both within and outside the family groups UP DOWN
5. Telephones Is a broader term than Mobile Phones SIDEWAYS Mobile Phones AKA as Cell Phones & Hand Phones And Similar to Hand Held Devices & PDAs Multi- Directional Mobile Phones Are made by Phone Manufacturers And use the networks of Telecoms Service Providers UP DOWN
6.
7.
8. Centralized Taxonomy and Metadata Management As a centralized repository for multi-lingual semantic management that is: - Independent from systems like web-portal search and categorization systems - Scalable ; capable of evolving with emerging corporate semantic standards HTML CSV XML ZThes SKOS OWL Web Services Centralized Taxonomy Management System Synaptica ® Portals Portals Categorizers Portals Portals Search Engines Portals Portals Content Portals Multiple users working in collaborative and compartmentalized space P e r m i s s i o n s
9.
10.
11. Where taxonomy fits with Search DMS CMS Shared Docs News & Research Data Search Engine Taxonomy & Metadata Platform Information Processing, Management and Storage
17. Document, Content & Records Management Synaptica ® Vocabulary & Metadata Management Thesauri Ontologies Filing & Storage Metadata Tagging (Categorisation) Process Search Engine Visualisation Navigation Intranet / Portal User Interface Back End Information Structure Front End Information Intelligence Librarians; Taxonomists; Indexers; Knowledge & Information Managers Information Creators; Records Managers; Content Managers; Librarians; Indexers Information Users (the business; the public) Taxonomies CIOs; CTOs; IT Architects
18. Paula R. McCoy Manager, Taxonomy Development ProQuest [email_address] Centralized Taxonomy Management for Enterprise Information Systems
19.
20. Access to over 125 billion digital pages of content from magazine, trade, & scholarly publications, current & historical newspapers, original materials such as annual reports & civil war pamphlets, and daily wire feeds Subscription-based ProQuest® online information service available in academic and public libraries
21.
22.
23.
24.
25.
26.
27. The Taxonomy Manager’s Job To ensure that indexers and searchers alike have access to a complete and accurate Thesaurus that they can use to maximize the discoverability of documents in ProQuest OBJECTIVE:
28. Sample Subject Term Chronic obstructive pulmonary disease SN: Any lung disease, such as chronic bronchitis or emphysema, causing obstruction of bronchial airflow UF COPD BT Disease BT Respiratory diseases NT Asthma NT Bronchitis NT Emphysema RT Airway management RT Lungs Preferred, or main term Scope note defining term and how it is used Non-preferred term: points to term used to index Terms broader in nature to main term: COPD is a disease, and specifically, a respiratory disease Terms narrower in nature to main term: these are chronic lung diseases Terms related to main term that might be used to narrow the search
32. Adding Terms Today: 3 Easy Steps 2. Export report of new terms into Word 1. Enter term and relationships into Synaptica “ Item Details” window 3. Send Word document to editors