SlideShare ist ein Scribd-Unternehmen logo
1 von 33
How automated conversion works
  and why it matters
      LavaCon 2012
      Tuesday, October 9, 2012

      How automated conversion works
      and why it matters
      Patrick Baker
      VP, development and professional services
      Stilo International




Copyright © Stilo International plc 2011
introduction

How automated conversion works                              how it works
                                                                patterns / context / guidance
                                                            the secret to successful conversion
and why it matters                                          managing a conversion project
                                                            conclusion




 ►   You‟ve decide to adopt the DITA standard
      ► CMS has been selected
      ► Publishing scripts are in development



 ►   But your content is not yet in DITA
      ► It may be the least exciting part of the project,
         but conversion is necessary
      ► (But it‟s not evil!)
introduction

How automated conversion works                                        how it works
                                                                          patterns / context / guidance
                                                                      the secret to successful conversion
and why it matters                                                    managing a conversion project
                                                                      conclusion




 ►   Automated conversion is an option
      ►   You should consider it

 ►   But why does it matter how it works?
      ►To fully appreciate what‟s possible
      ► So you can pick the best conversion solution
      ► It will enable you to better manage your conversion project


 ►   But, in the end, what really matters?
      ► You do!
      ► You are the content owner, the subject matter expert (SME)
      ► You must be able to control the conversion
introduction

How automated conversion works                 how it works
                                                   patterns / context / guidance
                                               the secret to successful conversion
and why it matters                             managing a conversion project
                                               conclusion




 Agenda
    ►   How does automated conversion work
        ►   Patterns / context / guidance
    ►   The secret to successful conversion revealed
    ►   Managing a conversion project
    ►   Conclusion
introduction

How automated conversion works                                                            how it works
                                                                                              patterns / context / guidance
                                                                                          the secret to successful conversion
and why it matters                                                                        managing a conversion project
                                                                                          conclusion




 ►   The challenge of conversion
      ►   to create semantically rich XML from presentational documents
            semantic
            adjective: relating to meaning in language or logic
            origin: mid 17th century: from French sémantique, from Greek sēmantikos 'significant',
            from sēmainein 'signify', from sēma 'sign„
            (from the oxford dictionary)


      ►   How can we determine the meaning of the content in documents?

      ►   There are three essential mechanisms that content conversion
          technology may leverage - they are:
            ►   patterns
            ►   context
            ►   guided conversion
introduction

How automated conversion works                                            how it works
                                                                              patterns / context / guidance
                                                                          the secret to successful conversion
and why it matters                                                        managing a conversion project
                                                                          conclusion




                                   9/30/12
                                   30/9/12
                                   +1 (613) 745-4242
                                   $65.12
                                   hello

 ►   The meaning of the above are easily understood because humans are
     very good at recognizing patterns

 ►   Computers are also good at this. They use a formalism known as
     regular expressions to recognize such textual patterns

 ►   Regular expressions are a notation for regular grammars. A regular grammar
     describes a regular language. These notions comes from work done by
     linguists and mathematicians in the 1950s

 ►   Though not all sequences of characters can be expressed as a regular
     language, in practice they are extremely useful for recognizing patterns at the
     micro-level (i.e. text of a few to several characters in length)
introduction

How automated conversion works                                     how it works
                                                                       patterns / context / guidance
                                                                   the secret to successful conversion
and why it matters                                                 managing a conversion project
                                                                   conclusion




    Example regular expressions
    9/30/12             digit{1-2} “/” digit{1-2} “/” digit{1-2}


    30/9/12             digit{1-2} “/” digit{1-2} “/” digit{1-2}


    +1 (613) 745-4242   (“+1” white-space)?
                        (“(“ digit{3} “)”)?
                        digit{3} “-” digit{4}


    $65.12              “$” digit+ “.” digit{2}?


    Hello               white-space letter+ (white-space | [“.,?;:”])
introduction

How automated conversion works                                        how it works
                                                                          patterns / context / guidance
                                                                      the secret to successful conversion
and why it matters                                                    managing a conversion project
                                                                      conclusion




 ►   While regular expressions can identify textual patterns, they do not
     understand the meaning of the text they recognize

      7/4/12               digit{1-2} “/” digit{1-2} “/” digit{1-2}

                                    day or month?


      1/7/12               digit{1-2} “/” digit{1-2} “/” digit{1-2}
introduction

How automated conversion works                                              how it works
                                                                                patterns / context / guidance
                                                                            the secret to successful conversion
and why it matters                                                          managing a conversion project
                                                                            conclusion




 ►   Adequate context allows us (and computers) to evolve a deeper
     understanding of the meaning of text

      Context: A newspaper article authored in the USA
      7/4/12               digit{1-2} “/” digit{1-2} “/”      digit{1-2}

                              month               day

                                      Independence Day!!



      Context: A newspaper article authored in Canada
      1/7/12               digit{1-2} “/” digit{1-2}       “/” digit{1-2}

                               day              month

                                        Canada Day!!
introduction

How automated conversion works                                 how it works
                                                                   patterns / context / guidance
                                                               the secret to successful conversion
and why it matters                                             managing a conversion project
                                                               conclusion




 ►   Being Bold -- another example to demonstrate the value of context

          It was really cold out

          Click on the Enable Javascript checkbox

          Select: Start > Programs > Accessories > Notepad
introduction

How automated conversion works                                    how it works
                                                                      patterns / context / guidance
                                                                  the secret to successful conversion
and why it matters                                                managing a conversion project
                                                                  conclusion




  ►   Being Bold -- another example to demonstrate the value of context

            It was really cold out

      Not much context here, maybe this is just plain old bold,
      meaning emphasis
introduction

How automated conversion works                                       how it works
                                                                         patterns / context / guidance
                                                                     the secret to successful conversion
and why it matters                                                   managing a conversion project
                                                                     conclusion




 ►   Being Bold -- another example to demonstrate the value of context

           Click on the Enable Javascript checkbox

     Perhaps we are inside a DITA task at this point.

     The word Click hints that a user interface element may be nearby.
     That fact combined with the bold formatting suggest we have a uicontrol.

           Click on the Enable Javascript checkbox

                            uicontrol
introduction

How automated conversion works                                           how it works
                                                                             patterns / context / guidance
                                                                         the secret to successful conversion
and why it matters                                                       managing a conversion project
                                                                         conclusion




 ►   Being Bold -- another example to demonstrate the value of context

           Select: Start > Programs > Accessories > Notepad

     Perhaps we are inside a DITA task at this point

     The word Select hints that a user interface element may be nearby

     Plus having a sequence of bold tokens with a common separator in between
     each bold fragment suggests that we have a cascade of uicontrols

           <menucascade>
              <uicontrol>Start</uicontrol>
              <uicontrol>Programs</uicontrol>
              <uicontrol>Accessories</uicontrol>
              <uicontrol>Notepad</uicontrol>
           </menucascade>
introduction

How automated conversion works                                        how it works
                                                                          patterns / context / guidance
                                                                      the secret to successful conversion
and why it matters                                                    managing a conversion project
                                                                      conclusion




 ►   Topic type provides quite a useful contextual clue

          Quick-drop cookies

          1. Prepare the dough.
             a. Beat the egg in a large bowl.
             b. Add flour.
             c. Stir in milk.
          2. Prepare the topping.
             a. Mix brown sugar and cinnamon in another bowl.
          3. Form 1-inch round balls of dough.
             It is helpful to use a spoon when forming these balls.
          4. Roll each ball in the topping.
          5. Place each ball on an ungreased cookie sheet.

          Bake at 425 F for 12 to 15 minutes.
introduction

How automated conversion works                                               how it works
                                                                                 patterns / context / guidance
                                                                             the secret to successful conversion
and why it matters                                                           managing a conversion project
                                                                             conclusion




 ►   If we are in a plain topic or concept
      <p>Quick-drop cookies</p>
      <ol>
         <li>Prepare the dough.</li>
         <ol>
            <li>Beat the egg in a large bowl.</li>
            <li>Add flour.</li>
            <li>Stir in milk.</li>
         </ol>

        <li>Prepare the topping.</li>
        <ol>
           <li>Mix brown sugar and cinnamon in another bowl.</li>
        </ol>

         <li><p>Form 1-inch round balls of dough.</p>
             <p>It is helpful to use a spoon when forming these balls.</p>
         </li>
         <li>Use a spoon to make 1-inch round balls of dough.</li>
         <li>Roll each ball in the topping.</li>
         <li>Place each ball on an ungreased cookie sheet.</li>
      </ol>
      <p>Bake at 425 degrees Fahrenheit for 12 to 15 minutes.</p>
introduction

How automated conversion works                                                                         how it works
                                                                                                           patterns / context / guidance
                                                                                                       the secret to successful conversion
and why it matters                                                                                     managing a conversion project
                                                                                                       conclusion




►   If we are in a task
     <task>
        <title>Quick-drop cookies</title>
        <taskbody>
           <steps>
              <step>
                 <cmd>Prepare the dough.</cmd>
                 <substeps>
                    <substep><cmd>Beat the egg in a large bowl.</cmd>
                    </substep>
                    <substep><cmd>Add flour.</cmd></substep>
                    <substep><cmd>Stir in milk.</cmd></substep>
                 </substeps>
              </step>
              <step>
                 <cmd>Prepare the topping.</cmd>
                 <substeps>
                    <substep>
                       <cmd>Mix brown sugar and cinnamon in another bowl.
                       </cmd>
                    </substep>
                 </substeps>
              </step>
              <step>
                 <cmd>Form 1-inch round balls of dough.</cmd>
                 <info><note type="tip">It is helpful to use a spoon when forming these balls.</note></info>
              </step>
              <step><cmd>Roll each ball in the topping.</cmd></step>
              <step><cmd>Place each ball on an ungreased cookie sheet.</cmd>
              </step>
           </steps>
           <postreq>Bake at 425 degrees Fahrenheit for 12 to 15 minutes.
           </postreq>
        </taskbody>
     </task>
introduction

How automated conversion works                                              how it works
                                                                                patterns / context / guidance
                                                                            the secret to successful conversion
and why it matters                                                          managing a conversion project
                                                                            conclusion




 ►   Guiding the automated conversion

      ►   Who?
           The most reliable authority on this is the content owner who is familiar
           with the content

      ►   How?
            A mechanism (user interface) is required which enables the content
            owner to easily express what the correct context is for different portions
            of a document
introduction

How automated conversion works                                               how it works
                                                                                 patterns / context / guidance
                                                                             the secret to successful conversion
and why it matters                                                           managing a conversion project
                                                                             conclusion




 ►   A short list of things automated conversion can handle
        generation of specialized DITA
        generation of hierarchical maps and book maps
        configurable file bursting granularity and file naming
        tables
        lists
        extraction of images and other embedded objects
        procedures authored as tables converted to tasks
        definition lists authored as tables converted to definition lists
        identification of conditional text and generation of ditaval file
        domain specialization (e.g. ui controls and menu cascades)
        relationship tables
        use of conref and conkeyref
introduction

How automated conversion works                                                    how it works
                                                                                      patterns / context / guidance
                                                                                  the secret to successful conversion
and why it matters                                                                managing a conversion project
                                                                                  conclusion




            Knowledge (as defined by Joe Gollner)
            Knowledge is the meaningful organization of information, expressing
            an evolving understanding of a subject and establishing a basis for
            judgment and the potential for action.

     The level of success that an automated conversion technology can
     hope to achieve is bounded by the depth of knowledge it can attain
     of the content to be converted. Context, supported by guided
     conversion, provides for the meaningful organization of the
     information revealed by patterns. The conversion software can act on
     this evolved understanding of your content to produce the richest
     XML possible.

           Knowledge is the key to intelligent content conversion
introduction

How automated conversion works                                            how it works
                                                                              patterns / context / guidance
                                                                          the secret to successful conversion
and why it matters                                                        managing a conversion project
                                                                          conclusion




 ►   The secret to successful conversion is…
      ►   You!

 ►   How come?
      ►   Because knowledge it the key to successful conversion
          (note: This is true for manual as well as for automated conversion)
      ►   It is the Subject Matter Experts (SMEs) that have this knowledge
introduction

How automated conversion works                             how it works
                                                               patterns / context / guidance
                                                           the secret to successful conversion
and why it matters                                         managing a conversion project
                                                           conclusion




 ►   What‟s the best way to run a conversion project?

      ►   Let‟s have a look at some lessons learned from
          software development methodologies
introduction

How automated conversion works                                                   how it works
                                                                                     patterns / context / guidance
                                                                                 the secret to successful conversion
and why it matters                                                               managing a conversion project
                                                                                 conclusion




 Waterfall

 ►   Approach
      ►   Each phase of the project is executed in
          turn - phases are discontinuous

 ►   Downsides
      ►   Requirements must be known and
          completely specified up front
      ►   Lack of feed back. The rigid structure does
          not allow backtracking to incorporate
          knowledge acquired as the project
                                                        Image: Paulsmith99 at en.wikipedia
          progresses
introduction

How automated conversion works   how it works
                                     patterns / context / guidance
                                 the secret to successful conversion
and why it matters               managing a conversion project
                                 conclusion




 ►   We need a steering wheel!
introduction

How automated conversion works             how it works
                                               patterns / context / guidance
                                           the secret to successful conversion
and why it matters                         managing a conversion project
                                           conclusion




 Spiral

 ►   Approach
      ►   An iterative approach aimed at
          minimizing risk
      ►   Project is broken down into
          pieces which are handled as
          waterfall lifecycle

 ►   Advantages
      ►   manage risk
      ►   ability to adapt plan based on
          previous experience

 ►   Downsides
      ►   higher management overhead
introduction

How automated conversion works                 how it works
                                                   patterns / context / guidance
                                               the secret to successful conversion
and why it matters                             managing a conversion project
                                               conclusion




 Agile

 “ We are uncovering better ways of
 developing software by doing it and helping
 others do it. Through this work we have
 come to value ...

  ► Individuals and interactions over
    processes and tools
  ► Working software over comprehensive

    documentation
  ► Customer collaboration over contract

    negotiation
  ► Responding to change over following

    a plan”
introduction

How automated conversion works   how it works
                                     patterns / context / guidance
                                 the secret to successful conversion
and why it matters               managing a conversion project
                                 conclusion




 ►   We need to work together!
introduction

How automated conversion works                                  how it works
                                                                    patterns / context / guidance
                                                                the secret to successful conversion
and why it matters                                              managing a conversion project
                                                                conclusion




 ►   A well-managed (conversion) project benefits from
      ►   the incremental and iterative approach offered by the
          spiral model
      ►   the contribution of individuals that work as a team by
          effectively sharing their expertise, leading to an optimal
          understanding of goals
introduction

How automated conversion works                                               how it works
                                                                                 patterns / context / guidance
                                                                             the secret to successful conversion
and why it matters                                                           managing a conversion project
                                                                             conclusion




 ►   Automated conversion if done right enables these best practices
      ►   use an iterative schedule that you control
      ►   learn from previous iteration
      ►   put your expertise to full use by being directly involved in the actual
          conversion
      ►   share knowledge and insight with team members via the guidance
          interface

 ►   To support this, the conversion solution must
      ►   run on-demand
      ►   provide an interface which allows content owners to guide the conversion
introduction

How automated conversion works                                          how it works
                                                                            patterns / context / guidance
                                                                        the secret to successful conversion
and why it matters                                                      managing a conversion project
                                                                        conclusion




 ►   How?
       ►   knowledge is the key to intelligent conversion
       ►   patterns, context, guided conversion

 ►   Why?
     It‟s good to have a general appreciation of the how
       ►   for setting expectations
           ► not just on perceived limits
           ► but also to demand the best

       ►   to understand that you, as SME, are critical to the effort
       ►   to understand that an automated conversion solution may enable you
           to effectively manage your conversion project
introduction

How automated conversion works                                      how it works
                                                                        patterns / context / guidance
                                                                    the secret to successful conversion
and why it matters                                                  managing a conversion project
                                                                    conclusion




 ►   Look for a conversion solution that
      1. is automated
      2. has the basics in place
          ► supports input formats that you are interested in
          ► can generate high quality, validated output

      3. can convert on-demand
          ► enabling the feedback loop
          ► cutting down on pre-conversion preparation time

      4. provides an interface for SME participation
          ► to guide the conversion
          ► to share and evolve understanding of needs with colleagues
          ► cutting down on post-conversion cleanup time
introduction

How automated conversion works                                    how it works
                                                                      patterns / context / guidance
                                                                  the secret to successful conversion
and why it matters                                                managing a conversion project
                                                                  conclusion




 ►   These benefits can be realized if the automation is done right
      ► quality output

      ► high throughput

      ► reduction of turnaround time
          ► convert on-demand
          ► can manage conversion according to project timelines

      ► stay in control

      ► minimal pre-conversion prep and post-conversion cleanup

      ► reduced cost
introduction

How automated conversion works                            how it works
                                                              patterns / context / guidance
                                                          the secret to successful conversion
and why it matters                                        managing a conversion project
                                                          conclusion




 ►   This will save time and money, because…

      ►   you will get the DITA that you want, because…

      ►   you will understand what you want.
How automated conversion works
  and why it matters
   How automated conversion works
   and why it matters



      Questions?
      Patrick Baker pbaker@stilo.com




Copyright © Stilo International plc 2011
                                           www.stilo.com

Weitere ähnliche Inhalte

Ähnlich wie Migrating to DITA: How Automated Conversion Works and Why it Matters

An Inside Look At Extreme Programming Essay
An Inside Look At Extreme Programming EssayAn Inside Look At Extreme Programming Essay
An Inside Look At Extreme Programming Essay
Sharon Roberts
 
Business Need And Current Situation Essay
Business Need And Current Situation EssayBusiness Need And Current Situation Essay
Business Need And Current Situation Essay
Jill Lyons
 
It's all about efficiency and cost savings
It's all about efficiency and cost savingsIt's all about efficiency and cost savings
It's all about efficiency and cost savings
Stephane Haelterman
 

Ähnlich wie Migrating to DITA: How Automated Conversion Works and Why it Matters (20)

Optimizing Innovation: Modular Toolchains that Enable Digital Transformations
Optimizing Innovation: Modular Toolchains that Enable Digital TransformationsOptimizing Innovation: Modular Toolchains that Enable Digital Transformations
Optimizing Innovation: Modular Toolchains that Enable Digital Transformations
 
Optimizing Innovation- Modular Toolchains that Enable Digital Transformations
Optimizing Innovation-  Modular Toolchains that Enable Digital TransformationsOptimizing Innovation-  Modular Toolchains that Enable Digital Transformations
Optimizing Innovation- Modular Toolchains that Enable Digital Transformations
 
An Inside Look At Extreme Programming Essay
An Inside Look At Extreme Programming EssayAn Inside Look At Extreme Programming Essay
An Inside Look At Extreme Programming Essay
 
Business Processes and Complexity
Business Processes and ComplexityBusiness Processes and Complexity
Business Processes and Complexity
 
BPM in Telecoms
BPM in TelecomsBPM in Telecoms
BPM in Telecoms
 
Event-Driven Architecture (EDA)
Event-Driven Architecture (EDA)Event-Driven Architecture (EDA)
Event-Driven Architecture (EDA)
 
OTM DELIVERED : How Business Process Outsourcing and Preconfigured Solutions...
OTM DELIVERED: How Business Process Outsourcing and Preconfigured Solutions...OTM DELIVERED: How Business Process Outsourcing and Preconfigured Solutions...
OTM DELIVERED : How Business Process Outsourcing and Preconfigured Solutions...
 
PROCESS elementary
PROCESS elementaryPROCESS elementary
PROCESS elementary
 
Business Process Excellence For Financial Services
Business Process Excellence For Financial ServicesBusiness Process Excellence For Financial Services
Business Process Excellence For Financial Services
 
A 5-step methodology for complex E&P data management
A 5-step methodology for complex E&P data managementA 5-step methodology for complex E&P data management
A 5-step methodology for complex E&P data management
 
What machine translation developers are doing to make post-editors happy
What machine translation developers are doing to make post-editors happyWhat machine translation developers are doing to make post-editors happy
What machine translation developers are doing to make post-editors happy
 
Business Need And Current Situation Essay
Business Need And Current Situation EssayBusiness Need And Current Situation Essay
Business Need And Current Situation Essay
 
Lighthouse thinking (TM), Small, Simple Steps to Extraordinary Success
Lighthouse thinking (TM), Small, Simple Steps to Extraordinary SuccessLighthouse thinking (TM), Small, Simple Steps to Extraordinary Success
Lighthouse thinking (TM), Small, Simple Steps to Extraordinary Success
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Sándor Sojnóczky, Hunne...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Sándor Sojnóczky, Hunne...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Sándor Sojnóczky, Hunne...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Sándor Sojnóczky, Hunne...
 
ОЛЕКСАНДР ВІЛЬЧИНСЬКИЙ «DevOps culture» Lviv DevOps Conference 2019
ОЛЕКСАНДР ВІЛЬЧИНСЬКИЙ «DevOps culture» Lviv DevOps Conference 2019ОЛЕКСАНДР ВІЛЬЧИНСЬКИЙ «DevOps culture» Lviv DevOps Conference 2019
ОЛЕКСАНДР ВІЛЬЧИНСЬКИЙ «DevOps culture» Lviv DevOps Conference 2019
 
It's all about efficiency and cost savings
It's all about efficiency and cost savingsIt's all about efficiency and cost savings
It's all about efficiency and cost savings
 
Enterprise Architecture Governance for an Enterprise Transformation Journey: ...
Enterprise Architecture Governance for an Enterprise Transformation Journey: ...Enterprise Architecture Governance for an Enterprise Transformation Journey: ...
Enterprise Architecture Governance for an Enterprise Transformation Journey: ...
 
Service-Oriented Modeling Strategy and Approach
Service-Oriented Modeling Strategy and ApproachService-Oriented Modeling Strategy and Approach
Service-Oriented Modeling Strategy and Approach
 
Tackling Sprawl in Microsoft 365 #MWCP21
Tackling Sprawl in Microsoft 365 #MWCP21Tackling Sprawl in Microsoft 365 #MWCP21
Tackling Sprawl in Microsoft 365 #MWCP21
 
T bone brochure en(1)
T bone brochure en(1)T bone brochure en(1)
T bone brochure en(1)
 

Kürzlich hochgeladen

Kürzlich hochgeladen (20)

Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 

Migrating to DITA: How Automated Conversion Works and Why it Matters

  • 1. How automated conversion works and why it matters LavaCon 2012 Tuesday, October 9, 2012 How automated conversion works and why it matters Patrick Baker VP, development and professional services Stilo International Copyright © Stilo International plc 2011
  • 2. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► You‟ve decide to adopt the DITA standard ► CMS has been selected ► Publishing scripts are in development ► But your content is not yet in DITA ► It may be the least exciting part of the project, but conversion is necessary ► (But it‟s not evil!)
  • 3. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► Automated conversion is an option ► You should consider it ► But why does it matter how it works? ►To fully appreciate what‟s possible ► So you can pick the best conversion solution ► It will enable you to better manage your conversion project ► But, in the end, what really matters? ► You do! ► You are the content owner, the subject matter expert (SME) ► You must be able to control the conversion
  • 4. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion Agenda ► How does automated conversion work ► Patterns / context / guidance ► The secret to successful conversion revealed ► Managing a conversion project ► Conclusion
  • 5. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► The challenge of conversion ► to create semantically rich XML from presentational documents semantic adjective: relating to meaning in language or logic origin: mid 17th century: from French sémantique, from Greek sēmantikos 'significant', from sēmainein 'signify', from sēma 'sign„ (from the oxford dictionary) ► How can we determine the meaning of the content in documents? ► There are three essential mechanisms that content conversion technology may leverage - they are: ► patterns ► context ► guided conversion
  • 6. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion 9/30/12 30/9/12 +1 (613) 745-4242 $65.12 hello ► The meaning of the above are easily understood because humans are very good at recognizing patterns ► Computers are also good at this. They use a formalism known as regular expressions to recognize such textual patterns ► Regular expressions are a notation for regular grammars. A regular grammar describes a regular language. These notions comes from work done by linguists and mathematicians in the 1950s ► Though not all sequences of characters can be expressed as a regular language, in practice they are extremely useful for recognizing patterns at the micro-level (i.e. text of a few to several characters in length)
  • 7. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion Example regular expressions 9/30/12 digit{1-2} “/” digit{1-2} “/” digit{1-2} 30/9/12 digit{1-2} “/” digit{1-2} “/” digit{1-2} +1 (613) 745-4242 (“+1” white-space)? (“(“ digit{3} “)”)? digit{3} “-” digit{4} $65.12 “$” digit+ “.” digit{2}? Hello white-space letter+ (white-space | [“.,?;:”])
  • 8. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► While regular expressions can identify textual patterns, they do not understand the meaning of the text they recognize 7/4/12 digit{1-2} “/” digit{1-2} “/” digit{1-2} day or month? 1/7/12 digit{1-2} “/” digit{1-2} “/” digit{1-2}
  • 9. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► Adequate context allows us (and computers) to evolve a deeper understanding of the meaning of text Context: A newspaper article authored in the USA 7/4/12 digit{1-2} “/” digit{1-2} “/” digit{1-2} month day Independence Day!! Context: A newspaper article authored in Canada 1/7/12 digit{1-2} “/” digit{1-2} “/” digit{1-2} day month Canada Day!!
  • 10. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► Being Bold -- another example to demonstrate the value of context It was really cold out Click on the Enable Javascript checkbox Select: Start > Programs > Accessories > Notepad
  • 11. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► Being Bold -- another example to demonstrate the value of context It was really cold out Not much context here, maybe this is just plain old bold, meaning emphasis
  • 12. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► Being Bold -- another example to demonstrate the value of context Click on the Enable Javascript checkbox Perhaps we are inside a DITA task at this point. The word Click hints that a user interface element may be nearby. That fact combined with the bold formatting suggest we have a uicontrol. Click on the Enable Javascript checkbox uicontrol
  • 13. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► Being Bold -- another example to demonstrate the value of context Select: Start > Programs > Accessories > Notepad Perhaps we are inside a DITA task at this point The word Select hints that a user interface element may be nearby Plus having a sequence of bold tokens with a common separator in between each bold fragment suggests that we have a cascade of uicontrols <menucascade> <uicontrol>Start</uicontrol> <uicontrol>Programs</uicontrol> <uicontrol>Accessories</uicontrol> <uicontrol>Notepad</uicontrol> </menucascade>
  • 14. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► Topic type provides quite a useful contextual clue Quick-drop cookies 1. Prepare the dough. a. Beat the egg in a large bowl. b. Add flour. c. Stir in milk. 2. Prepare the topping. a. Mix brown sugar and cinnamon in another bowl. 3. Form 1-inch round balls of dough. It is helpful to use a spoon when forming these balls. 4. Roll each ball in the topping. 5. Place each ball on an ungreased cookie sheet. Bake at 425 F for 12 to 15 minutes.
  • 15. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► If we are in a plain topic or concept <p>Quick-drop cookies</p> <ol> <li>Prepare the dough.</li> <ol> <li>Beat the egg in a large bowl.</li> <li>Add flour.</li> <li>Stir in milk.</li> </ol> <li>Prepare the topping.</li> <ol> <li>Mix brown sugar and cinnamon in another bowl.</li> </ol> <li><p>Form 1-inch round balls of dough.</p> <p>It is helpful to use a spoon when forming these balls.</p> </li> <li>Use a spoon to make 1-inch round balls of dough.</li> <li>Roll each ball in the topping.</li> <li>Place each ball on an ungreased cookie sheet.</li> </ol> <p>Bake at 425 degrees Fahrenheit for 12 to 15 minutes.</p>
  • 16. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► If we are in a task <task> <title>Quick-drop cookies</title> <taskbody> <steps> <step> <cmd>Prepare the dough.</cmd> <substeps> <substep><cmd>Beat the egg in a large bowl.</cmd> </substep> <substep><cmd>Add flour.</cmd></substep> <substep><cmd>Stir in milk.</cmd></substep> </substeps> </step> <step> <cmd>Prepare the topping.</cmd> <substeps> <substep> <cmd>Mix brown sugar and cinnamon in another bowl. </cmd> </substep> </substeps> </step> <step> <cmd>Form 1-inch round balls of dough.</cmd> <info><note type="tip">It is helpful to use a spoon when forming these balls.</note></info> </step> <step><cmd>Roll each ball in the topping.</cmd></step> <step><cmd>Place each ball on an ungreased cookie sheet.</cmd> </step> </steps> <postreq>Bake at 425 degrees Fahrenheit for 12 to 15 minutes. </postreq> </taskbody> </task>
  • 17. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► Guiding the automated conversion ► Who? The most reliable authority on this is the content owner who is familiar with the content ► How? A mechanism (user interface) is required which enables the content owner to easily express what the correct context is for different portions of a document
  • 18. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► A short list of things automated conversion can handle  generation of specialized DITA  generation of hierarchical maps and book maps  configurable file bursting granularity and file naming  tables  lists  extraction of images and other embedded objects  procedures authored as tables converted to tasks  definition lists authored as tables converted to definition lists  identification of conditional text and generation of ditaval file  domain specialization (e.g. ui controls and menu cascades)  relationship tables  use of conref and conkeyref
  • 19. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion Knowledge (as defined by Joe Gollner) Knowledge is the meaningful organization of information, expressing an evolving understanding of a subject and establishing a basis for judgment and the potential for action. The level of success that an automated conversion technology can hope to achieve is bounded by the depth of knowledge it can attain of the content to be converted. Context, supported by guided conversion, provides for the meaningful organization of the information revealed by patterns. The conversion software can act on this evolved understanding of your content to produce the richest XML possible. Knowledge is the key to intelligent content conversion
  • 20. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► The secret to successful conversion is… ► You! ► How come? ► Because knowledge it the key to successful conversion (note: This is true for manual as well as for automated conversion) ► It is the Subject Matter Experts (SMEs) that have this knowledge
  • 21. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► What‟s the best way to run a conversion project? ► Let‟s have a look at some lessons learned from software development methodologies
  • 22. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion Waterfall ► Approach ► Each phase of the project is executed in turn - phases are discontinuous ► Downsides ► Requirements must be known and completely specified up front ► Lack of feed back. The rigid structure does not allow backtracking to incorporate knowledge acquired as the project Image: Paulsmith99 at en.wikipedia progresses
  • 23. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► We need a steering wheel!
  • 24. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion Spiral ► Approach ► An iterative approach aimed at minimizing risk ► Project is broken down into pieces which are handled as waterfall lifecycle ► Advantages ► manage risk ► ability to adapt plan based on previous experience ► Downsides ► higher management overhead
  • 25. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion Agile “ We are uncovering better ways of developing software by doing it and helping others do it. Through this work we have come to value ... ► Individuals and interactions over processes and tools ► Working software over comprehensive documentation ► Customer collaboration over contract negotiation ► Responding to change over following a plan”
  • 26. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► We need to work together!
  • 27. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► A well-managed (conversion) project benefits from ► the incremental and iterative approach offered by the spiral model ► the contribution of individuals that work as a team by effectively sharing their expertise, leading to an optimal understanding of goals
  • 28. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► Automated conversion if done right enables these best practices ► use an iterative schedule that you control ► learn from previous iteration ► put your expertise to full use by being directly involved in the actual conversion ► share knowledge and insight with team members via the guidance interface ► To support this, the conversion solution must ► run on-demand ► provide an interface which allows content owners to guide the conversion
  • 29. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► How? ► knowledge is the key to intelligent conversion ► patterns, context, guided conversion ► Why? It‟s good to have a general appreciation of the how ► for setting expectations ► not just on perceived limits ► but also to demand the best ► to understand that you, as SME, are critical to the effort ► to understand that an automated conversion solution may enable you to effectively manage your conversion project
  • 30. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► Look for a conversion solution that 1. is automated 2. has the basics in place ► supports input formats that you are interested in ► can generate high quality, validated output 3. can convert on-demand ► enabling the feedback loop ► cutting down on pre-conversion preparation time 4. provides an interface for SME participation ► to guide the conversion ► to share and evolve understanding of needs with colleagues ► cutting down on post-conversion cleanup time
  • 31. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► These benefits can be realized if the automation is done right ► quality output ► high throughput ► reduction of turnaround time ► convert on-demand ► can manage conversion according to project timelines ► stay in control ► minimal pre-conversion prep and post-conversion cleanup ► reduced cost
  • 32. introduction How automated conversion works how it works patterns / context / guidance the secret to successful conversion and why it matters managing a conversion project conclusion ► This will save time and money, because… ► you will get the DITA that you want, because… ► you will understand what you want.
  • 33. How automated conversion works and why it matters How automated conversion works and why it matters Questions? Patrick Baker pbaker@stilo.com Copyright © Stilo International plc 2011 www.stilo.com