SlideShare ist ein Scribd-Unternehmen logo
1 von 26
Migrating to DITA:
Defining Your Conversion
Requirements
Yehudit Lindblom & Joe Gelb
February 11, 2014
Who are we?
Yehudit Lindblom
• Project Manager and cat herder
Joe Gelb
• Founder and President of Suite Solutions
Suite Solutions
Our Vision: Enable you to engage your customers by providing quick access to
relevant information: DITA provides the foundation
• Help companies get it right the first time
• XML-based Authoring/Publishing Solutions
• Enterprise Intelligent Dynamic Content: SuiteShare Social KB
• Consultancy, Systems Integration, Application Development
• Cross-Industry Expertise
• High Tech, Aerospace & Defense, Discrete Manufacturing
• Healthcare, Government
Main Topics






Goals of this webinar
Key components of a DITA solution
Why do a conversion?
Review the process
Defining your requirements
Goals of this Webinar
Primary Goal: Empower (not overwhelm) you
• Understand the process, details and dependencies involved
• Understand the possibilities
• Build a solid plan based on experience
• Schedule accordingly
• Understand the skills required and the help you should seek
• Manage expectations
Key Components of a DITA Solution
1.
2.
3.
4.
5.

Staff
Content
Translation
Publishing
Content and configuration management

Your mission is to develop or acquire each of these
Why should you do a professional,
automated conversion?
1. Quickly attain a large sampling of valid DITA to configure and test
your toolset: authoring, publishing, CMS
2. Reap the benefits of your DITA investment and hit your target return
on investment (ROI ) more quickly
3. Provide authors with a solid foundation and benchmark from which
to create new content, based on familiar pre-existing material
4. Converting content manually is grueling and time-consuming
• May burn-out your authors
• Adds even more pressure on production schedules
• Your authors may be inexperienced with DITA and their lack of
expertise will tinge large amounts of content and sow
inconsistency right from the beginning…
Why should you do a professional, automated
conversion?
“Having done this from the other side, it was very helpful for us to see our
content, which we were familiar with, already set up in DITA format and
publishing.”
- Quote from Yours truly…
• Burning out the authors: Because it’s so mind-numbing, it winds up
taking much longer than it should and affects the scheduling.
• Politics: show good progress in a “reasonable” timeframe.
• “What? We just spent $@@@,### on this system and you can’t even
show me a PDF with our current stuff?”
What formats can you convert from?
In order of increasing difficulty…
1. Docbook
2. FrameMaker (structured)
3. FrameMaker (un-structured)
4. Word, RTF (remember Winhelp?)
5. HTML
• Web content
• HTML Help
• Webhelp (e.g. Robohelp)
6. Adobe InDesign
7. PDF
High-Level Conversion Process
1. Prepare
• Requirements: based on content audit and information architecture
• Pre-conversion checklist – preparing content for conversion
• Post-conversion checklist – “clean-up”
• Representative content sample: first large batch for conversion
2. Configure conversion tools
3. Convert first batch
• Run through pre-conversion checklist
• Analyze and generate conversion topic list
• Review topic list
• Convert
• Review, feedback, tweak tools, re-convert
• Run through post-conversion checklist
4. Convert additional content according to a schedule
Conversion Process
•

•
•
•

Compile the most representative sample you can. Otherwise, you risk
missing elements that need to be added later.
For example:
• Include content from all the various templates in use
• Include all possible test elements you will need to convert
Involve your style sheet developer: consider order of elements,
outputclasses
Involve your CMS vendor: understand how the tool deals with
metadata, conditionalization, variables, etc.
Schedule:
• It can take 1-2 months to get first set of content properly converted
• Subsequent conversion batches can go very quickly
• Schedule migrations as needed, allowing enough time to freeze
changes to the content and go through the conversion process
Conversion Requirements
Output structure and file conventions
• Which DTDs to reference
• Which encoding: UTF-8?
• Folder structure of the output
• File naming convention, for example:
• Use lower case for filenames
• Replace all spaces and non-alphanumeric characters with
underscores
• Indicate file-type/topic-type with the first letter in the filename
• Use .dita or .ditamap extensions, or .xml
Conversion Requirements
Mapping content elements to DITA tagging
• Map paragraph and character styles to tagging structures
Examples of mapping rules:
• Short description: first paragraph that is not a list item
• Tasks
• Tagging of context, pre-requisite, step info, etc.
• Only one task to be included per topic. If multiple tasks are
encountered within the same topic, wrap the second task/procedure
in <postreq><required-cleanup>
• If there is a <para> with “Example” text as an inline heading, use the
<example> element
• Do not insert empty <info> or other empty tags in the content
Conversion Requirements
Graphics
•
•

Create linked files from embedded graphics
• Determine naming convention
Graphic formats
• JPG, GIF, PNG: generally fine as-is
• BMP – generally converted to PNG
• EMF – often converted to SVG
• EPS – consider that you will need special processing in the style
sheets to handle
 PDF: Antenna House + GhostScript
 HTML: auto conversion to PNG

• TIFF – OK for PDF, but require handling for HTML formats, since
not all browsers support TIFF viewing out of the box
Conversion Requirements
Graphics
•

Types that need special processing
• Callouts
 Convert to SVG?
 Use numbered callouts with a list of labels underneath the graphic

• Hotspots
• Visio
Conversion Requirements
Publications / Maps
•
•

•
•
•
•
•

Maps versus bookmaps
Modularizing maps
• Creation of submaps for chapters, parts, appendices, etc.
• Use of maprefs versus topicrefs versus conrefs to link submaps
Creation of container topics for chapters, parts, etc.
Insertion of front-matter, back-matter, titles, metadata
List of tables, figures
Glossary
Index
Conversion Requirements
Conditional Tags
•
•

Convert FrameMaker, Robohelp conditions into attribute / value pairs
Combine and discontinue conditionals
Conversion Requirements
Other Requirements
Variables
• Auto-generation of conrefs
• Special tagging for CMS, e.g. @varref for SDL Live Content
Index entries
• Whether to leave inline or move to prolog/metadata/keywords
Context sensitivity
• Retain markers or other tagging used to produce context sensitive help
• Which element to code them in: <data>, <resourceid>?
Cross-references and relationship tables
• Copy all or some xrefs automatically to relationship tables
Conversion Requirements
Mapping
Example of style mapping:
PARAGRAPHS
p

<p>

Para

<p>

Body

<p>

BodyFirst

<p>

this tag has special formatting, may need to add outputclass

Para1Indent

<p>

2nd level paragraph

Indented

<p>

2nd level paragraph

Indented2

<p>

3rd level paragraph

Para2Indent

<p>

3rd level paragraph

Preface

<title>

heading 1

<title>

Head1

<title>

heading 2

<title>

Head2

<title>

heading 3

<title>

TITLES

heading level 2 always starts a new topic

heading level 3 starts a new topic for tasks only
Pre-conversion Checklist
Preparing the content for conversion
•
•

•

•

Some changes are best done before the conversion, some can be done
later
Check style usage
• Example: a heading styled as “Body” will not convert as a title, or
trigger separation into a separate topic
Apply character styles to get semantic domain tagging
• uicontrol, wintitle, filenames
• Example rule: all bold text should become a uicontrol
Other examples:
• Verify there is only one set of steps in each task
• Remove stem sentences that precede steps; in DITA the stem
sentences may be awkward as part of the <context> element
• If a task contains an ordered list that are not steps, assign it a different
style than the style used for steps
Pre-conversion Checklist
Reality Check:
• The content will need work to get into the proper structure
• Time spent preparing for the conversion will have huge impact on the
rest of your process and time to achieve production-ready results
• The conversion process essentially infers structure based on styles –
paragraph and character
• Be rigorous with the styles used in the input documents
• It is irritating grunt work to check and re-apply styles, but it will
save huge effort later
Conversion Topic List
During the first step, a topic list is generated for the conversion batch
• Original File Name – source file for the topic
• Heading Level – level 1 is a chapter level topic
• Topic filename – auto-generated, may be changed
• Topic title – the title taken directly from the heading
• Standalone (Y/N) – by default, each heading is converted into its own
topic. You may choose to combine concept or reference topics
• Topic type –verify the topic type and change if necessary
Conversion Topic List
•
•
•
•
•

A topic-type must be provided for each topic
The conversion tool suggests a topic-type based on heuristics
The list should be reviewed and modified where necessary
The conversion is executed based on the conversion topic list
You can use the topic list to insert index entries, metadata, navtitles,
etc.
Conversion Output
Deliverables:
• For each document, you get back:
• Valid bookmap file with all submaps
• Valid DITA XML files
• Original source images referenced in the XML files
Post-conversion Checklist
Examples:
• Edit top-level maps or bookmaps to include front matter, back matter,
and metadata as needed.
• Review content tagged with <required-cleanup>
• Fix any cross references that the conversion was unable to resolve
• Import content to your CMS, publish, and review the converted content
Conversion Schedule
•

Develop a plan and schedule for migration
• Freeze content
• Apply pre-conversion checklist
• Send batch for conversion
• Review conversion topic list
• Convert
• Apply post-conversion checklist
• Import to CMS and publish
Keep in Touch! Let us know how we can
help you.
For additional information, contact:

Yehudit Lindblom
Joe Gelb
solutions@suite-sol.com
U.S. Office
(609) 360-0650

EMEA Office
+972-2-993-8054

www.suite-sol.com
Follow us on Linked-In
http://www.linkedin.com/company/527916

Weitere ähnliche Inhalte

Mehr von Suite Solutions

SuiteHelp 4.0: Latest Features in Enterprise Webhelp
SuiteHelp 4.0: Latest Features in Enterprise WebhelpSuiteHelp 4.0: Latest Features in Enterprise Webhelp
SuiteHelp 4.0: Latest Features in Enterprise WebhelpSuite Solutions
 
Moving your Organization up the Knowledge Value Chain (Proposal for Lavacon 2...
Moving your Organization up the Knowledge Value Chain (Proposal for Lavacon 2...Moving your Organization up the Knowledge Value Chain (Proposal for Lavacon 2...
Moving your Organization up the Knowledge Value Chain (Proposal for Lavacon 2...Suite Solutions
 
SuiteHelp 3.2.5 Latest Features
SuiteHelp 3.2.5 Latest FeaturesSuiteHelp 3.2.5 Latest Features
SuiteHelp 3.2.5 Latest FeaturesSuite Solutions
 
Using Taxonomy for Customer-centric Dynamic Publishing
Using Taxonomy for Customer-centric Dynamic PublishingUsing Taxonomy for Customer-centric Dynamic Publishing
Using Taxonomy for Customer-centric Dynamic PublishingSuite Solutions
 
DITA Quick Start Webinar Series: Building a Project Plan
DITA Quick Start Webinar Series: Building a Project PlanDITA Quick Start Webinar Series: Building a Project Plan
DITA Quick Start Webinar Series: Building a Project PlanSuite Solutions
 
Suite Labs: Generating SuiteHelp Output
Suite Labs: Generating SuiteHelp OutputSuite Labs: Generating SuiteHelp Output
Suite Labs: Generating SuiteHelp OutputSuite Solutions
 
Overview of SuiteHelp 3.1 for DITA
Overview of SuiteHelp 3.1 for DITAOverview of SuiteHelp 3.1 for DITA
Overview of SuiteHelp 3.1 for DITASuite Solutions
 
C:\Users\User\Desktop\Eclipse Infocenter
C:\Users\User\Desktop\Eclipse InfocenterC:\Users\User\Desktop\Eclipse Infocenter
C:\Users\User\Desktop\Eclipse InfocenterSuite Solutions
 
LocalizingStyleSheetsForHTMLOutputs
LocalizingStyleSheetsForHTMLOutputsLocalizingStyleSheetsForHTMLOutputs
LocalizingStyleSheetsForHTMLOutputsSuite Solutions
 
ImplementingChangeTrackingAndFlagging
ImplementingChangeTrackingAndFlaggingImplementingChangeTrackingAndFlagging
ImplementingChangeTrackingAndFlaggingSuite Solutions
 
Debugging and Error handling
Debugging and Error handlingDebugging and Error handling
Debugging and Error handlingSuite Solutions
 
Dita ot pipeline webinar
Dita ot pipeline webinarDita ot pipeline webinar
Dita ot pipeline webinarSuite Solutions
 
CustomizingStyleSheetsForHTMLOutputs
CustomizingStyleSheetsForHTMLOutputsCustomizingStyleSheetsForHTMLOutputs
CustomizingStyleSheetsForHTMLOutputsSuite Solutions
 

Mehr von Suite Solutions (20)

SuiteHelp 4.0: Latest Features in Enterprise Webhelp
SuiteHelp 4.0: Latest Features in Enterprise WebhelpSuiteHelp 4.0: Latest Features in Enterprise Webhelp
SuiteHelp 4.0: Latest Features in Enterprise Webhelp
 
Moving your Organization up the Knowledge Value Chain (Proposal for Lavacon 2...
Moving your Organization up the Knowledge Value Chain (Proposal for Lavacon 2...Moving your Organization up the Knowledge Value Chain (Proposal for Lavacon 2...
Moving your Organization up the Knowledge Value Chain (Proposal for Lavacon 2...
 
SuiteHelp 3.2.5 Latest Features
SuiteHelp 3.2.5 Latest FeaturesSuiteHelp 3.2.5 Latest Features
SuiteHelp 3.2.5 Latest Features
 
Using Taxonomy for Customer-centric Dynamic Publishing
Using Taxonomy for Customer-centric Dynamic PublishingUsing Taxonomy for Customer-centric Dynamic Publishing
Using Taxonomy for Customer-centric Dynamic Publishing
 
DITA Quick Start Webinar Series: Building a Project Plan
DITA Quick Start Webinar Series: Building a Project PlanDITA Quick Start Webinar Series: Building a Project Plan
DITA Quick Start Webinar Series: Building a Project Plan
 
Introduction to S1000D
Introduction to S1000DIntroduction to S1000D
Introduction to S1000D
 
Suite Labs: Generating SuiteHelp Output
Suite Labs: Generating SuiteHelp OutputSuite Labs: Generating SuiteHelp Output
Suite Labs: Generating SuiteHelp Output
 
Overview of SuiteHelp 3.1 for DITA
Overview of SuiteHelp 3.1 for DITAOverview of SuiteHelp 3.1 for DITA
Overview of SuiteHelp 3.1 for DITA
 
Svg and graphics
Svg and graphicsSvg and graphics
Svg and graphics
 
C:\Users\User\Desktop\Eclipse Infocenter
C:\Users\User\Desktop\Eclipse InfocenterC:\Users\User\Desktop\Eclipse Infocenter
C:\Users\User\Desktop\Eclipse Infocenter
 
LocalizingStyleSheetsForHTMLOutputs
LocalizingStyleSheetsForHTMLOutputsLocalizingStyleSheetsForHTMLOutputs
LocalizingStyleSheetsForHTMLOutputs
 
ImplementingChangeTrackingAndFlagging
ImplementingChangeTrackingAndFlaggingImplementingChangeTrackingAndFlagging
ImplementingChangeTrackingAndFlagging
 
AdvancedXPath
AdvancedXPathAdvancedXPath
AdvancedXPath
 
Ot performance webinar
Ot performance webinarOt performance webinar
Ot performance webinar
 
PDF Localization
PDF  LocalizationPDF  Localization
PDF Localization
 
Debugging and Error handling
Debugging and Error handlingDebugging and Error handling
Debugging and Error handling
 
Dost.jar and fo.jar
Dost.jar and fo.jarDost.jar and fo.jar
Dost.jar and fo.jar
 
Dita ot pipeline webinar
Dita ot pipeline webinarDita ot pipeline webinar
Dita ot pipeline webinar
 
Developing Plugins
Developing PluginsDeveloping Plugins
Developing Plugins
 
CustomizingStyleSheetsForHTMLOutputs
CustomizingStyleSheetsForHTMLOutputsCustomizingStyleSheetsForHTMLOutputs
CustomizingStyleSheetsForHTMLOutputs
 

Kürzlich hochgeladen

WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 

Kürzlich hochgeladen (20)

WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 

DITA Quick Start Webinar: Defining Your Conversion Requirements

  • 1. Migrating to DITA: Defining Your Conversion Requirements Yehudit Lindblom & Joe Gelb February 11, 2014
  • 2. Who are we? Yehudit Lindblom • Project Manager and cat herder Joe Gelb • Founder and President of Suite Solutions Suite Solutions Our Vision: Enable you to engage your customers by providing quick access to relevant information: DITA provides the foundation • Help companies get it right the first time • XML-based Authoring/Publishing Solutions • Enterprise Intelligent Dynamic Content: SuiteShare Social KB • Consultancy, Systems Integration, Application Development • Cross-Industry Expertise • High Tech, Aerospace & Defense, Discrete Manufacturing • Healthcare, Government
  • 3. Main Topics      Goals of this webinar Key components of a DITA solution Why do a conversion? Review the process Defining your requirements
  • 4. Goals of this Webinar Primary Goal: Empower (not overwhelm) you • Understand the process, details and dependencies involved • Understand the possibilities • Build a solid plan based on experience • Schedule accordingly • Understand the skills required and the help you should seek • Manage expectations
  • 5. Key Components of a DITA Solution 1. 2. 3. 4. 5. Staff Content Translation Publishing Content and configuration management Your mission is to develop or acquire each of these
  • 6. Why should you do a professional, automated conversion? 1. Quickly attain a large sampling of valid DITA to configure and test your toolset: authoring, publishing, CMS 2. Reap the benefits of your DITA investment and hit your target return on investment (ROI ) more quickly 3. Provide authors with a solid foundation and benchmark from which to create new content, based on familiar pre-existing material 4. Converting content manually is grueling and time-consuming • May burn-out your authors • Adds even more pressure on production schedules • Your authors may be inexperienced with DITA and their lack of expertise will tinge large amounts of content and sow inconsistency right from the beginning…
  • 7. Why should you do a professional, automated conversion? “Having done this from the other side, it was very helpful for us to see our content, which we were familiar with, already set up in DITA format and publishing.” - Quote from Yours truly… • Burning out the authors: Because it’s so mind-numbing, it winds up taking much longer than it should and affects the scheduling. • Politics: show good progress in a “reasonable” timeframe. • “What? We just spent $@@@,### on this system and you can’t even show me a PDF with our current stuff?”
  • 8. What formats can you convert from? In order of increasing difficulty… 1. Docbook 2. FrameMaker (structured) 3. FrameMaker (un-structured) 4. Word, RTF (remember Winhelp?) 5. HTML • Web content • HTML Help • Webhelp (e.g. Robohelp) 6. Adobe InDesign 7. PDF
  • 9. High-Level Conversion Process 1. Prepare • Requirements: based on content audit and information architecture • Pre-conversion checklist – preparing content for conversion • Post-conversion checklist – “clean-up” • Representative content sample: first large batch for conversion 2. Configure conversion tools 3. Convert first batch • Run through pre-conversion checklist • Analyze and generate conversion topic list • Review topic list • Convert • Review, feedback, tweak tools, re-convert • Run through post-conversion checklist 4. Convert additional content according to a schedule
  • 10. Conversion Process • • • • Compile the most representative sample you can. Otherwise, you risk missing elements that need to be added later. For example: • Include content from all the various templates in use • Include all possible test elements you will need to convert Involve your style sheet developer: consider order of elements, outputclasses Involve your CMS vendor: understand how the tool deals with metadata, conditionalization, variables, etc. Schedule: • It can take 1-2 months to get first set of content properly converted • Subsequent conversion batches can go very quickly • Schedule migrations as needed, allowing enough time to freeze changes to the content and go through the conversion process
  • 11. Conversion Requirements Output structure and file conventions • Which DTDs to reference • Which encoding: UTF-8? • Folder structure of the output • File naming convention, for example: • Use lower case for filenames • Replace all spaces and non-alphanumeric characters with underscores • Indicate file-type/topic-type with the first letter in the filename • Use .dita or .ditamap extensions, or .xml
  • 12. Conversion Requirements Mapping content elements to DITA tagging • Map paragraph and character styles to tagging structures Examples of mapping rules: • Short description: first paragraph that is not a list item • Tasks • Tagging of context, pre-requisite, step info, etc. • Only one task to be included per topic. If multiple tasks are encountered within the same topic, wrap the second task/procedure in <postreq><required-cleanup> • If there is a <para> with “Example” text as an inline heading, use the <example> element • Do not insert empty <info> or other empty tags in the content
  • 13. Conversion Requirements Graphics • • Create linked files from embedded graphics • Determine naming convention Graphic formats • JPG, GIF, PNG: generally fine as-is • BMP – generally converted to PNG • EMF – often converted to SVG • EPS – consider that you will need special processing in the style sheets to handle  PDF: Antenna House + GhostScript  HTML: auto conversion to PNG • TIFF – OK for PDF, but require handling for HTML formats, since not all browsers support TIFF viewing out of the box
  • 14. Conversion Requirements Graphics • Types that need special processing • Callouts  Convert to SVG?  Use numbered callouts with a list of labels underneath the graphic • Hotspots • Visio
  • 15. Conversion Requirements Publications / Maps • • • • • • • Maps versus bookmaps Modularizing maps • Creation of submaps for chapters, parts, appendices, etc. • Use of maprefs versus topicrefs versus conrefs to link submaps Creation of container topics for chapters, parts, etc. Insertion of front-matter, back-matter, titles, metadata List of tables, figures Glossary Index
  • 16. Conversion Requirements Conditional Tags • • Convert FrameMaker, Robohelp conditions into attribute / value pairs Combine and discontinue conditionals
  • 17. Conversion Requirements Other Requirements Variables • Auto-generation of conrefs • Special tagging for CMS, e.g. @varref for SDL Live Content Index entries • Whether to leave inline or move to prolog/metadata/keywords Context sensitivity • Retain markers or other tagging used to produce context sensitive help • Which element to code them in: <data>, <resourceid>? Cross-references and relationship tables • Copy all or some xrefs automatically to relationship tables
  • 18. Conversion Requirements Mapping Example of style mapping: PARAGRAPHS p <p> Para <p> Body <p> BodyFirst <p> this tag has special formatting, may need to add outputclass Para1Indent <p> 2nd level paragraph Indented <p> 2nd level paragraph Indented2 <p> 3rd level paragraph Para2Indent <p> 3rd level paragraph Preface <title> heading 1 <title> Head1 <title> heading 2 <title> Head2 <title> heading 3 <title> TITLES heading level 2 always starts a new topic heading level 3 starts a new topic for tasks only
  • 19. Pre-conversion Checklist Preparing the content for conversion • • • • Some changes are best done before the conversion, some can be done later Check style usage • Example: a heading styled as “Body” will not convert as a title, or trigger separation into a separate topic Apply character styles to get semantic domain tagging • uicontrol, wintitle, filenames • Example rule: all bold text should become a uicontrol Other examples: • Verify there is only one set of steps in each task • Remove stem sentences that precede steps; in DITA the stem sentences may be awkward as part of the <context> element • If a task contains an ordered list that are not steps, assign it a different style than the style used for steps
  • 20. Pre-conversion Checklist Reality Check: • The content will need work to get into the proper structure • Time spent preparing for the conversion will have huge impact on the rest of your process and time to achieve production-ready results • The conversion process essentially infers structure based on styles – paragraph and character • Be rigorous with the styles used in the input documents • It is irritating grunt work to check and re-apply styles, but it will save huge effort later
  • 21. Conversion Topic List During the first step, a topic list is generated for the conversion batch • Original File Name – source file for the topic • Heading Level – level 1 is a chapter level topic • Topic filename – auto-generated, may be changed • Topic title – the title taken directly from the heading • Standalone (Y/N) – by default, each heading is converted into its own topic. You may choose to combine concept or reference topics • Topic type –verify the topic type and change if necessary
  • 22. Conversion Topic List • • • • • A topic-type must be provided for each topic The conversion tool suggests a topic-type based on heuristics The list should be reviewed and modified where necessary The conversion is executed based on the conversion topic list You can use the topic list to insert index entries, metadata, navtitles, etc.
  • 23. Conversion Output Deliverables: • For each document, you get back: • Valid bookmap file with all submaps • Valid DITA XML files • Original source images referenced in the XML files
  • 24. Post-conversion Checklist Examples: • Edit top-level maps or bookmaps to include front matter, back matter, and metadata as needed. • Review content tagged with <required-cleanup> • Fix any cross references that the conversion was unable to resolve • Import content to your CMS, publish, and review the converted content
  • 25. Conversion Schedule • Develop a plan and schedule for migration • Freeze content • Apply pre-conversion checklist • Send batch for conversion • Review conversion topic list • Convert • Apply post-conversion checklist • Import to CMS and publish
  • 26. Keep in Touch! Let us know how we can help you. For additional information, contact: Yehudit Lindblom Joe Gelb solutions@suite-sol.com U.S. Office (609) 360-0650 EMEA Office +972-2-993-8054 www.suite-sol.com Follow us on Linked-In http://www.linkedin.com/company/527916