SlideShare a Scribd company logo
1 of 20
easyDITA How-To Series:
Taxonomy 101: Classifying DITA Tasks



    Paul Wlodarczyk
    CEO, Jorsek LLC
    June 28, 2012
Poll: Please complete while folks arrive

 How are you delivering DITA Tasks or other
 procedural / how-to content?
 •    Portal with advanced / faceted search
 •    Static web pages or web help
 •    Print / PDF
 •    Windows Help
 •    Other



  6/28/2012              © Jorsek, LLC. All Rights Reserved.   2
Why talk about task oriented content?
 Task-oriented content is
 valuable:
 • It is versatile and can be
    reused in more
    deliverables than
    conceptual content
        –     Product user guides
        –     Context-sensitive help
        –     Knowledge base
        –     Support
        –     Training
 • It’s what most users are
   searching for in a
                                                                       A DITA Task published to MindTouch
   knowledge base or help

  6/28/2012                            © Jorsek, LLC. All Rights Reserved.                                  3
Benefits of using DTA for authoring tasks
 • Task authored in DITA are
        –     Concise
        –     Consistent
        –     Modular
        –     Semantic
 • DITA Tasks make good
   templates for content
   contributed by SMEs (like
   product engineers)
 • For software UA in
   particular, task-oriented
   content is perfect for QA. The
   task becomes the Test Case.

                                                                  The XML Source for a DITA Task
  6/28/2012                 © Jorsek, LLC. All Rights Reserved.                                    4
Anatomy of a DITA Task
 •    Title
 •    Short description
 •    Context
 •    Prerequisite
 •    Step section
 •    Step
 •    Command
 •    Sub Step
 •    Step Info
 •    Step Result
 •    Step Example
 •    Choice and Choice Table
 •    Example
 •    Post-requisite
 •    Result                                                          A DITA Task in easyDITA
  6/28/2012                     © Jorsek, LLC. All Rights Reserved.                             5
DITA Tasks are semantic

 • DITA tasks are inherently semantic
        – Not simple ordered lists
        – Not simple paragraphs
 • This is useful for
        – Dynamic rendition, e.g.
              • Expand / collapse steps
              • Interactive UI controls
        – Semantic Search in the context of the structure, e.g.
              • find STEPS that contain MENU CASCADES
              • Find STEP INFORMATION that contains IMAGES tagged with [text]
              • Find PREREQUISITES that contain [text]




  6/28/2012                          © Jorsek, LLC. All Rights Reserved.        6
Making tasks more findable with metadata

 • Q: How can we make content
   even more findable
        – For authors and content
          managers?
        – For end users in a dynamic
          delivery system?
 • A: Tag tasks with semantic
   metadata
        – Semantic = “meaning”
        – Metadata can be set with terms
          from controlled vocabularies
          defined and managed in a
          taxonomy
  6/28/2012                   © Jorsek, LLC. All Rights Reserved.   7
What is Metadata?
  • Literally “Data about the data”
  • Also known as “tags”
             – Not to be confused with the content itself (e.g.
               XML structure)
             – Can be embedded in a file (e.g. the DITA Prolog
               or attributes; JPEG image data) or associated in
               a CMS
  • Two main flavors:
             – Administrative metadata
                 • e.g. Content Type, Author, Date
                   Modified, Version, Title, etc.,
                 • Usually system-generated
                 • What the content is
             – Descriptive metadata
                 • Subject classification, keywords, etc.
                 • Usually manually authored
                 • What the content is about

 6/28/2012                                   © Jorsek, LLC. All Rights Reserved.   8
Key Concept: Taxonomy
   taxonomy n. A categorization
   scheme for concepts, often
   hierarchical
   • Most often, taxonomies show “is a”
     relationships, e.g. A mammal is a
     vertebrate, A rodent is a mammal, etc.
   • Navigation up and down the tree yields
     broader than (BT) and narrower than
     (NT) classification
             – Can be used to adjust search scope
   • Can also show related terms (RT)
             – Can be used to suggest related searches / “see
               also”
   • Can manage synonyms (UF – Use For)
             – Can be used to find content when search
               terms are not the preferred terms



 6/28/2012                                 © Jorsek, LLC. All Rights Reserved.   9
Using Taxonomy for controlled vocabularies
 • A taxonomy is the “source of truth” for what terms to use for
   various concepts – so terms are consistent.
 • Taxonomy terms can be used as controlled vocabularies (“pick-
   lists”) for metadata, so authors simply select preferred terms
        – Avoids typos, duplicates, word form variations, use of non-preferred terms
 • Some content management systems enable controlled
   vocabularies from taxonomies to be used for setting attribute
   values in DITA (e.g. selectatts like Audience, Product, Platform etc.).
 • Relationships between terms in a Taxonomy can improve search
        – CMS search and site search indexing tools can use equivalent and related
          terms to find content that does not contain the search term
        – Relationships between terms can be expressed as RDF in HTML content for
          improving web search indexing



  6/28/2012                        © Jorsek, LLC. All Rights Reserved.            10
Simple framework for tagging tasks
• In any industry, we’re all trying to help people do something to
  something in a context:
      – Who is doing what to what (+ other important context or condition)
• Examples
     Junior Service Technician doing preventive maintenance on Acme Jetpack
     XR7 that uses nitrous oxide injection technology
     Casual User clearing paper jam on MFD100 Copier with envelope tray option
     Case Worker performing an intake interview for a recently unemployed
     person in New York State
     Intermediate User publishing a DITA Map using DITA OT to PDF format
     Financial analyst calculating a WACC for a publicly traded company located in
     a country using GAPP accounting
     Registered Nurse administering medication to patient in the ICU and drug is a
     controlled substance
     Contract Service Technician doing diagnosis on P1000 Printer showing missing
     sections of the printed image
  6/28/2012                       © Jorsek, LLC. All Rights Reserved.            11
What metadata do you need?
 Information about the
 Performer, Activity, Object, and Context will help
 narrow search results for a user or author (see our
 blog post on Metadata 101: A Search First Approach)
 • Performer metadata:
     – Types of users (roles, experience, education
       level, etc.)
     – Types of employees
       (title, training, certifications, clearance, departmen
       t, skill level etc.)
     – Types of customers
 • Activity metadata:
       – Broad Task Types (e.g. for service:
           maintenance, diagnosis, repair, calibration, startup,
            etc.)
       – High Level Task names from a performance analysis
           / instructional design
       – Competencies from a model
       –
 6/28/2012 Commercial Services listing © Jorsek, LLC. All Rights Reserved.   12
What metadata do you need?
 Information about the
 Performer, Activity, Object, and Context will help
 narrow search results for a user or author (see our
 blog post on Metadata 101: A Search First Approach)
 • Object (i.e. “To what / to whom”) metadata:
       – Things: Product, product components, product
         subsystems
       – People: Types of customers or clients
 • Context metadata:
       –     Market / locale
       –     Product options
       –     Technologies
       –     Special situations
       –     Tools required
       –     Security classification
       –     Symptoms / Fault codes


 6/28/2012                             © Jorsek, LLC. All Rights Reserved.   13
Do we have to create these terms from scratch?
 No! You are surrounded by free sources for term lists, many are
 governed and authoritative. Don’t reinvent – borrow!
 Here are some common sources of terms:
 •       Corporate ECM or Web taxonomy (from IT or marketing)
 •       Industry-specific taxonomies (e.g. MeSH for life sciences, DSM for mental health)
 •       Government taxonomies (e.g. UK IPSV - Integrated Public Sector Vocabulary)
 •       Generic public domain taxonomies (e.g. People, Places, and Cultures; AP News)
 •       Other corporate sources:
           –     Training group (competency models, task analyses)
           –     HR (Job codes and Job Titles)
           –     Support / field service systems (Parts, fault classifications, failure modes, tools used)
           –     CRM data (Customer names, Customer categories, SKUs, Products & Services)
           –     Product data (Product BOMs, platforms, parts, subsystems, options)
           –     Organization Charts (Divisions, departments, locations, budget centers)
           –     Business Process Analysis (process names and steps, inputs and outputs)



     6/28/2012                                       © Jorsek, LLC. All Rights Reserved.                     14
Taxonomy Tools
 • You can build and manage a simple taxonomy in Microsoft Excel
 • Even if authors manually tag metadata, the Excel taxonomy can be a useful
   guide and source of terms to copy/paste
 • Each row is a term and each column is a level in the hierarchy




 • Put other data required for related and equivalent terms in columns to right of
   preferred term hierarchy
 • Add a column for scope notes
 • Use Grouping to help expand / collapse sections of a long taxonomy
 • If you have a CMS or other tool that consumes taxonomy, you can export a CSV
   file from Excel and import it to the CMS (see Mary Garcia’s excellent blog posts at
     TaxoDiary.com to learn how)
 6/28/2012                         © Jorsek, LLC. All Rights Reserved.              15
Taxonomy Tools
 • Consider using a Taxonomy Management System if:
    – You have a large taxonomy (over 500 terms)
    – The taxonomy changes often
    – You have a complex governance process for approving new terms
    – The taxonomy needs to be consumed by more than one system
    – You are using term relationships to improve search indexing




 6/28/2012                    © Jorsek, LLC. All Rights Reserved.     16
Guidelines for taxonomy quality
 • The hierarchy should reflect any of three relationships:
        – Generic (e.g. VehicleCar)
        – Instance (e.g. Mountain regionsRockies)
        – Whole-Part (e.g. HouseRoof)
 • Terms should be nouns or noun phrases.
 • Activities should be nouns or gerunds.
 • Avoid adjectives and prepositions unless integral to the term.
 • When in doubt singular vs. plural, choose plural; these are
   categories. Singular is OK for instances at the narrow end.
 • Named entities should be proper nouns.
 • Avoid punctuation and ampersands. Eliminate hyphens except
   where the term is confusing or unclear without them.
 • Make the most commonly used term the preferred term, even if it
   is an acronym (e.g. NASA). Make other forms Equivalent Terms.
  6/28/2012                       © Jorsek, LLC. All Rights Reserved.   17
Poll:

 Are you currently using controlled
 vocabularies for any of the following?
 •    CMS Metadata
 •    DITA Attributes
 •    Prolog Metadata and Keywords
 •    Other
 •    Not using controlled vocabularies



  6/28/2012             © Jorsek, LLC. All Rights Reserved.   18
Resources
 • LinkedIn Taxonomy Community of Practice
 • ANSI/NISO Z39.19-2005 - Guidelines on
   Construction, Format, and Management of Monolingual
   Controlled Vocabularies
 • IBM Presentation: Writing Effective DITA Task Topics
       – http://svdig.ditamap.com/DITATaskTopics_090310SR.ppt
 • TaxoDiary blog posts by Mary Garcia:
   Maintaining a Thesaurus in an Excel Workbook (two parts)
       – http://taxodiary.com/2012/04/maintaining-a-thesaurus-in-an-excel-
         workbook/
       – http://taxodiary.com/2012/05/maintaining-a-thesaurus-in-an-excel-
         workbook-part-2/
 • easyDITA blog posts and Twitter
       – easyDITA.com/blog and @easydita

 6/28/2012                       © Jorsek, LLC. All Rights Reserved.         19
Thank you!
 • Questions?
 • Recorded webcast will be available soon through our website –
   you will get an email with the link
 • Anyone can register after the event to view the recording
 • Slides will be available on SlideShare
       – www.slideshare.net/easydita
 • Next webcast July 25, featuring Amber Swope of DITA Strategies
   discussing Using Taxonomy for DITA Content. Please join us!




 6/28/2012                      © Jorsek, LLC. All Rights Reserved.   20

More Related Content

What's hot

What's hot (20)

XSLT Magic Tricks with DITA and FrameMaker
XSLT Magic Tricks with DITA and FrameMakerXSLT Magic Tricks with DITA and FrameMaker
XSLT Magic Tricks with DITA and FrameMaker
 
Visual Data Vault
Visual Data VaultVisual Data Vault
Visual Data Vault
 
Activate Data Governance Using the Data Catalog
Activate Data Governance Using the Data CatalogActivate Data Governance Using the Data Catalog
Activate Data Governance Using the Data Catalog
 
DITA and S1000D Two Paths to Structured Documentation
DITA and S1000D   Two Paths to Structured DocumentationDITA and S1000D   Two Paths to Structured Documentation
DITA and S1000D Two Paths to Structured Documentation
 
Taxonomies for Users
Taxonomies for UsersTaxonomies for Users
Taxonomies for Users
 
You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?You Need a Data Catalog. Do You Know Why?
You Need a Data Catalog. Do You Know Why?
 
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
 
DITA Quick Start for Authors - Part I
DITA Quick Start for Authors - Part IDITA Quick Start for Authors - Part I
DITA Quick Start for Authors - Part I
 
Data Architecture Best Practices for Advanced Analytics
Data Architecture Best Practices for Advanced AnalyticsData Architecture Best Practices for Advanced Analytics
Data Architecture Best Practices for Advanced Analytics
 
Data modelling 101
Data modelling 101Data modelling 101
Data modelling 101
 
Data Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and GovernanceData Catalog for Better Data Discovery and Governance
Data Catalog for Better Data Discovery and Governance
 
Taxonomy: Do I Need One
Taxonomy: Do I Need OneTaxonomy: Do I Need One
Taxonomy: Do I Need One
 
Taxonomies and Metadata in Information Architecture
Taxonomies and Metadata in Information ArchitectureTaxonomies and Metadata in Information Architecture
Taxonomies and Metadata in Information Architecture
 
RWDG Slides: What is a Data Steward to do?
RWDG Slides: What is a Data Steward to do?RWDG Slides: What is a Data Steward to do?
RWDG Slides: What is a Data Steward to do?
 
Data Governance Takes a Village (So Why is Everyone Hiding?)
Data Governance Takes a Village (So Why is Everyone Hiding?)Data Governance Takes a Village (So Why is Everyone Hiding?)
Data Governance Takes a Village (So Why is Everyone Hiding?)
 
Building and using ontologies
Building and using ontologies Building and using ontologies
Building and using ontologies
 
DAS Slides: Data Governance - Combining Data Management with Organizational ...
DAS Slides: Data Governance -  Combining Data Management with Organizational ...DAS Slides: Data Governance -  Combining Data Management with Organizational ...
DAS Slides: Data Governance - Combining Data Management with Organizational ...
 
Keith Schengili-Roberts - DITA Worst Practices
Keith Schengili-Roberts - DITA Worst PracticesKeith Schengili-Roberts - DITA Worst Practices
Keith Schengili-Roberts - DITA Worst Practices
 
MDM for Customer data with Talend
MDM for Customer data with Talend MDM for Customer data with Talend
MDM for Customer data with Talend
 
Modern Metadata Strategies
Modern Metadata StrategiesModern Metadata Strategies
Modern Metadata Strategies
 

Viewers also liked

Understanding Information Architecture
Understanding Information ArchitectureUnderstanding Information Architecture
Understanding Information Architecture
Scott Abel
 
Multiplying the Power of Taxonomy with Granular, Structured Content
Multiplying the Power of Taxonomy with Granular, Structured ContentMultiplying the Power of Taxonomy with Granular, Structured Content
Multiplying the Power of Taxonomy with Granular, Structured Content
Joe Pairman
 

Viewers also liked (20)

Creating Documentation With A Wiki: The DITA Storm Project
Creating Documentation With A Wiki: The DITA Storm ProjectCreating Documentation With A Wiki: The DITA Storm Project
Creating Documentation With A Wiki: The DITA Storm Project
 
Surviving the Transition to DITA: Trusted Partners can Ease the Pain
Surviving the Transition to DITA: Trusted Partners can Ease the PainSurviving the Transition to DITA: Trusted Partners can Ease the Pain
Surviving the Transition to DITA: Trusted Partners can Ease the Pain
 
Localization and DITA: What you Need to Know - LocWorld32
Localization and DITA: What you Need to Know - LocWorld32Localization and DITA: What you Need to Know - LocWorld32
Localization and DITA: What you Need to Know - LocWorld32
 
Converting Unstructured Docs to XML/DITA/ePub
Converting Unstructured Docs to XML/DITA/ePubConverting Unstructured Docs to XML/DITA/ePub
Converting Unstructured Docs to XML/DITA/ePub
 
Pat Farrell, Migrating Legacy Documentation to XML and DITA
Pat Farrell, Migrating Legacy Documentation to XML and DITAPat Farrell, Migrating Legacy Documentation to XML and DITA
Pat Farrell, Migrating Legacy Documentation to XML and DITA
 
DITA Interoperability
DITA InteroperabilityDITA Interoperability
DITA Interoperability
 
Metadata: Queen to King Content?
Metadata: Queen to King Content?Metadata: Queen to King Content?
Metadata: Queen to King Content?
 
The Elusive Promise of Reuse
The Elusive Promise of ReuseThe Elusive Promise of Reuse
The Elusive Promise of Reuse
 
Joe Gelb: Taxonomy and Delivery
Joe Gelb: Taxonomy and DeliveryJoe Gelb: Taxonomy and Delivery
Joe Gelb: Taxonomy and Delivery
 
Easy steps to convert your content to structured (frame maker and xml)
Easy steps to convert your content to structured (frame maker and xml)Easy steps to convert your content to structured (frame maker and xml)
Easy steps to convert your content to structured (frame maker and xml)
 
How to Optimize Your Metadata and Taxonomy
How to Optimize Your Metadata and TaxonomyHow to Optimize Your Metadata and Taxonomy
How to Optimize Your Metadata and Taxonomy
 
Reports and DITA Metrics IXIASOFT User Conference 2016
Reports and DITA Metrics IXIASOFT User Conference 2016Reports and DITA Metrics IXIASOFT User Conference 2016
Reports and DITA Metrics IXIASOFT User Conference 2016
 
Optimizing Content Reuse with DITA
Optimizing Content Reuse with DITAOptimizing Content Reuse with DITA
Optimizing Content Reuse with DITA
 
Developing training websites in multiple languages with (mostly) open-source ...
Developing training websites in multiple languages with (mostly) open-source ...Developing training websites in multiple languages with (mostly) open-source ...
Developing training websites in multiple languages with (mostly) open-source ...
 
Blurring the Lines between ECM and CCMS
Blurring the Lines between ECM and CCMSBlurring the Lines between ECM and CCMS
Blurring the Lines between ECM and CCMS
 
Understanding Information Architecture
Understanding Information ArchitectureUnderstanding Information Architecture
Understanding Information Architecture
 
DITA Quick Start
DITA Quick StartDITA Quick Start
DITA Quick Start
 
Multiplying the Power of Taxonomy with Granular, Structured Content
Multiplying the Power of Taxonomy with Granular, Structured ContentMultiplying the Power of Taxonomy with Granular, Structured Content
Multiplying the Power of Taxonomy with Granular, Structured Content
 
Wireframing, Mockups, and Prototyping Made Easy
Wireframing, Mockups, and Prototyping Made EasyWireframing, Mockups, and Prototyping Made Easy
Wireframing, Mockups, and Prototyping Made Easy
 
10 Million Dita Topics Can't Be Wrong
10 Million Dita Topics Can't Be Wrong10 Million Dita Topics Can't Be Wrong
10 Million Dita Topics Can't Be Wrong
 

Similar to Taxonomy 101: Classifying DITA Tasks

Monica Crocker Implementing Ecm Aiim 2009
Monica Crocker   Implementing Ecm Aiim 2009Monica Crocker   Implementing Ecm Aiim 2009
Monica Crocker Implementing Ecm Aiim 2009
AIIM Minnesota
 
Streamlining Your Path to Metadata Charlotte Robidoux Stacey Swart
Streamlining Your Path to Metadata Charlotte Robidoux Stacey SwartStreamlining Your Path to Metadata Charlotte Robidoux Stacey Swart
Streamlining Your Path to Metadata Charlotte Robidoux Stacey Swart
Hewlett Packard Enterprise Services
 
Dita webinar 20th march
Dita webinar 20th marchDita webinar 20th march
Dita webinar 20th march
Metapercept
 
Pavankumar_TeraData_DBA_8yrsExp
Pavankumar_TeraData_DBA_8yrsExpPavankumar_TeraData_DBA_8yrsExp
Pavankumar_TeraData_DBA_8yrsExp
pavankumar akula
 

Similar to Taxonomy 101: Classifying DITA Tasks (20)

Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
Playing Tag: Managed Metadata and Taxonomies in SharePoint 2010
 
[AIIM17] Data Categorization You Can Live With - Monica Crocker
[AIIM17]  Data Categorization You Can Live With - Monica Crocker [AIIM17]  Data Categorization You Can Live With - Monica Crocker
[AIIM17] Data Categorization You Can Live With - Monica Crocker
 
Monica Crocker Implementing Ecm Aiim 2009
Monica Crocker   Implementing Ecm Aiim 2009Monica Crocker   Implementing Ecm Aiim 2009
Monica Crocker Implementing Ecm Aiim 2009
 
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
Tools and Techniques for Creating, Maintaining, and Distributing Shareable Me...
 
chapter11-220725121546-671fc36c.pdf
chapter11-220725121546-671fc36c.pdfchapter11-220725121546-671fc36c.pdf
chapter11-220725121546-671fc36c.pdf
 
‏‏‏‏‏‏‏‏Chapter 11: Meta-data Management
‏‏‏‏‏‏‏‏Chapter 11: Meta-data Management‏‏‏‏‏‏‏‏Chapter 11: Meta-data Management
‏‏‏‏‏‏‏‏Chapter 11: Meta-data Management
 
Streamlining Your Path to Metadata Charlotte Robidoux Stacey Swart
Streamlining Your Path to Metadata Charlotte Robidoux Stacey SwartStreamlining Your Path to Metadata Charlotte Robidoux Stacey Swart
Streamlining Your Path to Metadata Charlotte Robidoux Stacey Swart
 
Designing High Quality Data Driven Solutions 110520
Designing High Quality Data Driven Solutions 110520Designing High Quality Data Driven Solutions 110520
Designing High Quality Data Driven Solutions 110520
 
Nlp model
Nlp modelNlp model
Nlp model
 
Metastudio DRM. Product presentation (en)
Metastudio DRM. Product presentation (en)Metastudio DRM. Product presentation (en)
Metastudio DRM. Product presentation (en)
 
Making Meaning with Metadata
Making Meaning with MetadataMaking Meaning with Metadata
Making Meaning with Metadata
 
Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...
Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...
Product Keynote: Advancing Denodo’s Logical Data Fabric with AI and Advanced ...
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchers
 
Dita webinar 20th march
Dita webinar 20th marchDita webinar 20th march
Dita webinar 20th march
 
Why You Need Intelligent Metadata and Auto-classification in Records Management
Why You Need Intelligent Metadata and Auto-classification in Records ManagementWhy You Need Intelligent Metadata and Auto-classification in Records Management
Why You Need Intelligent Metadata and Auto-classification in Records Management
 
Pavankumar_TeraData_DBA_8yrsExp
Pavankumar_TeraData_DBA_8yrsExpPavankumar_TeraData_DBA_8yrsExp
Pavankumar_TeraData_DBA_8yrsExp
 
1 d.1
1 d.11 d.1
1 d.1
 
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
 
Data Systems Integration & Business Value Pt. 1: Metadata
Data Systems Integration & Business Value Pt. 1: MetadataData Systems Integration & Business Value Pt. 1: Metadata
Data Systems Integration & Business Value Pt. 1: Metadata
 
Data-Ed: Data Systems Integration & Business Value PT. 1: Metadata
Data-Ed: Data Systems Integration & Business Value PT. 1: MetadataData-Ed: Data Systems Integration & Business Value PT. 1: Metadata
Data-Ed: Data Systems Integration & Business Value PT. 1: Metadata
 

Recently uploaded

+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Recently uploaded (20)

Cyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdfCyberprint. Dark Pink Apt Group [EN].pdf
Cyberprint. Dark Pink Apt Group [EN].pdf
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 

Taxonomy 101: Classifying DITA Tasks

  • 1. easyDITA How-To Series: Taxonomy 101: Classifying DITA Tasks Paul Wlodarczyk CEO, Jorsek LLC June 28, 2012
  • 2. Poll: Please complete while folks arrive How are you delivering DITA Tasks or other procedural / how-to content? • Portal with advanced / faceted search • Static web pages or web help • Print / PDF • Windows Help • Other 6/28/2012 © Jorsek, LLC. All Rights Reserved. 2
  • 3. Why talk about task oriented content? Task-oriented content is valuable: • It is versatile and can be reused in more deliverables than conceptual content – Product user guides – Context-sensitive help – Knowledge base – Support – Training • It’s what most users are searching for in a A DITA Task published to MindTouch knowledge base or help 6/28/2012 © Jorsek, LLC. All Rights Reserved. 3
  • 4. Benefits of using DTA for authoring tasks • Task authored in DITA are – Concise – Consistent – Modular – Semantic • DITA Tasks make good templates for content contributed by SMEs (like product engineers) • For software UA in particular, task-oriented content is perfect for QA. The task becomes the Test Case. The XML Source for a DITA Task 6/28/2012 © Jorsek, LLC. All Rights Reserved. 4
  • 5. Anatomy of a DITA Task • Title • Short description • Context • Prerequisite • Step section • Step • Command • Sub Step • Step Info • Step Result • Step Example • Choice and Choice Table • Example • Post-requisite • Result A DITA Task in easyDITA 6/28/2012 © Jorsek, LLC. All Rights Reserved. 5
  • 6. DITA Tasks are semantic • DITA tasks are inherently semantic – Not simple ordered lists – Not simple paragraphs • This is useful for – Dynamic rendition, e.g. • Expand / collapse steps • Interactive UI controls – Semantic Search in the context of the structure, e.g. • find STEPS that contain MENU CASCADES • Find STEP INFORMATION that contains IMAGES tagged with [text] • Find PREREQUISITES that contain [text] 6/28/2012 © Jorsek, LLC. All Rights Reserved. 6
  • 7. Making tasks more findable with metadata • Q: How can we make content even more findable – For authors and content managers? – For end users in a dynamic delivery system? • A: Tag tasks with semantic metadata – Semantic = “meaning” – Metadata can be set with terms from controlled vocabularies defined and managed in a taxonomy 6/28/2012 © Jorsek, LLC. All Rights Reserved. 7
  • 8. What is Metadata? • Literally “Data about the data” • Also known as “tags” – Not to be confused with the content itself (e.g. XML structure) – Can be embedded in a file (e.g. the DITA Prolog or attributes; JPEG image data) or associated in a CMS • Two main flavors: – Administrative metadata • e.g. Content Type, Author, Date Modified, Version, Title, etc., • Usually system-generated • What the content is – Descriptive metadata • Subject classification, keywords, etc. • Usually manually authored • What the content is about 6/28/2012 © Jorsek, LLC. All Rights Reserved. 8
  • 9. Key Concept: Taxonomy taxonomy n. A categorization scheme for concepts, often hierarchical • Most often, taxonomies show “is a” relationships, e.g. A mammal is a vertebrate, A rodent is a mammal, etc. • Navigation up and down the tree yields broader than (BT) and narrower than (NT) classification – Can be used to adjust search scope • Can also show related terms (RT) – Can be used to suggest related searches / “see also” • Can manage synonyms (UF – Use For) – Can be used to find content when search terms are not the preferred terms 6/28/2012 © Jorsek, LLC. All Rights Reserved. 9
  • 10. Using Taxonomy for controlled vocabularies • A taxonomy is the “source of truth” for what terms to use for various concepts – so terms are consistent. • Taxonomy terms can be used as controlled vocabularies (“pick- lists”) for metadata, so authors simply select preferred terms – Avoids typos, duplicates, word form variations, use of non-preferred terms • Some content management systems enable controlled vocabularies from taxonomies to be used for setting attribute values in DITA (e.g. selectatts like Audience, Product, Platform etc.). • Relationships between terms in a Taxonomy can improve search – CMS search and site search indexing tools can use equivalent and related terms to find content that does not contain the search term – Relationships between terms can be expressed as RDF in HTML content for improving web search indexing 6/28/2012 © Jorsek, LLC. All Rights Reserved. 10
  • 11. Simple framework for tagging tasks • In any industry, we’re all trying to help people do something to something in a context: – Who is doing what to what (+ other important context or condition) • Examples Junior Service Technician doing preventive maintenance on Acme Jetpack XR7 that uses nitrous oxide injection technology Casual User clearing paper jam on MFD100 Copier with envelope tray option Case Worker performing an intake interview for a recently unemployed person in New York State Intermediate User publishing a DITA Map using DITA OT to PDF format Financial analyst calculating a WACC for a publicly traded company located in a country using GAPP accounting Registered Nurse administering medication to patient in the ICU and drug is a controlled substance Contract Service Technician doing diagnosis on P1000 Printer showing missing sections of the printed image 6/28/2012 © Jorsek, LLC. All Rights Reserved. 11
  • 12. What metadata do you need? Information about the Performer, Activity, Object, and Context will help narrow search results for a user or author (see our blog post on Metadata 101: A Search First Approach) • Performer metadata: – Types of users (roles, experience, education level, etc.) – Types of employees (title, training, certifications, clearance, departmen t, skill level etc.) – Types of customers • Activity metadata: – Broad Task Types (e.g. for service: maintenance, diagnosis, repair, calibration, startup, etc.) – High Level Task names from a performance analysis / instructional design – Competencies from a model – 6/28/2012 Commercial Services listing © Jorsek, LLC. All Rights Reserved. 12
  • 13. What metadata do you need? Information about the Performer, Activity, Object, and Context will help narrow search results for a user or author (see our blog post on Metadata 101: A Search First Approach) • Object (i.e. “To what / to whom”) metadata: – Things: Product, product components, product subsystems – People: Types of customers or clients • Context metadata: – Market / locale – Product options – Technologies – Special situations – Tools required – Security classification – Symptoms / Fault codes 6/28/2012 © Jorsek, LLC. All Rights Reserved. 13
  • 14. Do we have to create these terms from scratch? No! You are surrounded by free sources for term lists, many are governed and authoritative. Don’t reinvent – borrow! Here are some common sources of terms: • Corporate ECM or Web taxonomy (from IT or marketing) • Industry-specific taxonomies (e.g. MeSH for life sciences, DSM for mental health) • Government taxonomies (e.g. UK IPSV - Integrated Public Sector Vocabulary) • Generic public domain taxonomies (e.g. People, Places, and Cultures; AP News) • Other corporate sources: – Training group (competency models, task analyses) – HR (Job codes and Job Titles) – Support / field service systems (Parts, fault classifications, failure modes, tools used) – CRM data (Customer names, Customer categories, SKUs, Products & Services) – Product data (Product BOMs, platforms, parts, subsystems, options) – Organization Charts (Divisions, departments, locations, budget centers) – Business Process Analysis (process names and steps, inputs and outputs) 6/28/2012 © Jorsek, LLC. All Rights Reserved. 14
  • 15. Taxonomy Tools • You can build and manage a simple taxonomy in Microsoft Excel • Even if authors manually tag metadata, the Excel taxonomy can be a useful guide and source of terms to copy/paste • Each row is a term and each column is a level in the hierarchy • Put other data required for related and equivalent terms in columns to right of preferred term hierarchy • Add a column for scope notes • Use Grouping to help expand / collapse sections of a long taxonomy • If you have a CMS or other tool that consumes taxonomy, you can export a CSV file from Excel and import it to the CMS (see Mary Garcia’s excellent blog posts at TaxoDiary.com to learn how) 6/28/2012 © Jorsek, LLC. All Rights Reserved. 15
  • 16. Taxonomy Tools • Consider using a Taxonomy Management System if: – You have a large taxonomy (over 500 terms) – The taxonomy changes often – You have a complex governance process for approving new terms – The taxonomy needs to be consumed by more than one system – You are using term relationships to improve search indexing 6/28/2012 © Jorsek, LLC. All Rights Reserved. 16
  • 17. Guidelines for taxonomy quality • The hierarchy should reflect any of three relationships: – Generic (e.g. VehicleCar) – Instance (e.g. Mountain regionsRockies) – Whole-Part (e.g. HouseRoof) • Terms should be nouns or noun phrases. • Activities should be nouns or gerunds. • Avoid adjectives and prepositions unless integral to the term. • When in doubt singular vs. plural, choose plural; these are categories. Singular is OK for instances at the narrow end. • Named entities should be proper nouns. • Avoid punctuation and ampersands. Eliminate hyphens except where the term is confusing or unclear without them. • Make the most commonly used term the preferred term, even if it is an acronym (e.g. NASA). Make other forms Equivalent Terms. 6/28/2012 © Jorsek, LLC. All Rights Reserved. 17
  • 18. Poll: Are you currently using controlled vocabularies for any of the following? • CMS Metadata • DITA Attributes • Prolog Metadata and Keywords • Other • Not using controlled vocabularies 6/28/2012 © Jorsek, LLC. All Rights Reserved. 18
  • 19. Resources • LinkedIn Taxonomy Community of Practice • ANSI/NISO Z39.19-2005 - Guidelines on Construction, Format, and Management of Monolingual Controlled Vocabularies • IBM Presentation: Writing Effective DITA Task Topics – http://svdig.ditamap.com/DITATaskTopics_090310SR.ppt • TaxoDiary blog posts by Mary Garcia: Maintaining a Thesaurus in an Excel Workbook (two parts) – http://taxodiary.com/2012/04/maintaining-a-thesaurus-in-an-excel- workbook/ – http://taxodiary.com/2012/05/maintaining-a-thesaurus-in-an-excel- workbook-part-2/ • easyDITA blog posts and Twitter – easyDITA.com/blog and @easydita 6/28/2012 © Jorsek, LLC. All Rights Reserved. 19
  • 20. Thank you! • Questions? • Recorded webcast will be available soon through our website – you will get an email with the link • Anyone can register after the event to view the recording • Slides will be available on SlideShare – www.slideshare.net/easydita • Next webcast July 25, featuring Amber Swope of DITA Strategies discussing Using Taxonomy for DITA Content. Please join us! 6/28/2012 © Jorsek, LLC. All Rights Reserved. 20