SlideShare ist ein Scribd-Unternehmen logo
1 von 25
Downloaden Sie, um offline zu lesen
Word, MathType, DITA,
MathML, and FOP
Doing Math With DITA and Word
10/27/2014 Contrext, LLC 1
About the Author
• Independent consultant focusing on DITA
analysis, design, and implementation
• Doing SGML and XML for cough 30 years cough
• Founding member of the DITA Technical
Committee
• Founding member of the XML Working Group
• Co-editor of HyTime standard (ISO/IEC 10744)
• Primary developer and founder of the DITA for
Publishers project
• Author of DITA for Practitioners, Vol 1 (XML Press)
10/27/2014 Contrext, LLC 2
Agenda
• The challenge: authoring math and publishing
using DITA
• Demonstration
• The tools
• The process
10/27/2014 Contrext, LLC 3
Executive Summary
• DITA 1.3 integrates MathML out of the box
• FOP and Antenna House XSL Formatter both
support MathML rendering
• MathType for Word is a low-cost equation editor
that supports MathML
• The DITA for Publishers Word-to-DITA framework
supports getting MathML from Word
• All together it means low-cost, high-quality math
publishing with DITA
10/27/2014 Contrext, LLC 4
THE CHALLENGE:
AUTHORING MATH AND
PUBLISHING USING DITA
10/27/2014 Contrext, LLC 5
Use Case: Authoring in Word,
Generating DITA
• Authors use Microsoft Word to author the
content:
– Academic papers
– Technical standards
– Technical documentation with math
• Convert the Word to DITA using the DITA for
Publishers Word-to-DITA framework
• Need to get MathML equations from authors
10/27/2014 Contrext, LLC 6
Math is Hard
• Many ways to author math
• MathML is the only XML standard for math
• MathML not practical to author directly:
requires an authoring tool of some sort
• For XML use need a way to get MathML from
authors along with the XML
10/27/2014 Contrext, LLC 7
Rendering MathML
• Requires MathML-aware tools
• May be production quality issues depending
on the renderer chosen
• May need to generate images for some
delivery targets (e.g., EPUB and Kindle)
10/27/2014 Contrext, LLC 8
Word and MathType
• Word has a built-in equation editor
– It does not produce MathML directly
• The Design Science MathType Word plugin can
produce MathML directly
– Can cut and paste from MathType as MathML
– Can convert MathType binary equations to “inline
MathML”
• No other automated way to get MathML from
the MathType equations
10/27/2014 Contrext, LLC 9
DITA and MathML
• Before DITA 1.3, no out-of-the-box MathML
integration in DITA
• Integration doable, e.g. DITA for Publishers,
Design Science solutions
• Need to implement MathML handling in the
DITA Open Toolkit or other DITA tools
10/27/2014 Contrext, LLC 10
DEMONSTRATION
10/27/2014 Contrext, LLC 11
Demonstration
• Real process for scientific journal articles
10/27/2014 Contrext, LLC 12
THE TOOLS
10/27/2014 Contrext, LLC 13
PDF Rendering: FOP and jEuclid
• Apache FOP is open-source XSL-FO engine
• When integrated with Apache jEuclid engine
renders MathML directly
• Rendition quality is good but may not be good
enough for some publications or equations
• To configure, just add jEuclid jar files to the
FOP installation
10/27/2014 Contrext, LLC 14
MathML in DITA
• DITA 1.3 integrates MathML out of the box
• This integration can be used with existing DITA 1.2 or
1.1 systems (not dependent on any other 1.3 features)
• Provides the <mathml> container for containing inline
MathML markup.
• Provides the <mathmlref> element for using MathML
markup by reference
• Provides elements for representing semantic equations
separate from the data format of the equations
• oXygenXML supports MathML in the editor and can
integrate with MathType
10/27/2014 Contrext, LLC 15
MS Word and MathType
• MathType is a low-cost Word plugin
– Visual editing of equations
– Can generate MathML
– Can convert binary MathType equations to “inline
MathML” in the word
– This conversion is not reversible
• Could also use Word’s proprietary equation
markup
– Not currently supported by the D4P Word-to-DITA
process
– But technically possible to convert to MathML
10/27/2014 Contrext, LLC 16
Word-to-DITA Framework
• Part of DITA for Publishers project
• Uses a style-to-tag-map to map Word styles to
DITA markup
• Supports translation of “inline” MathML to
MathML in the DITA
• Requires using MathType to convert binary
equations to inline MathML
10/27/2014 Contrext, LLC 17
THE PROCESS
10/27/2014 Contrext, LLC 18
0. Set Up
• Set up the style-to-tag mapping for converting
Word to DITA
– D4P provides an out-of-the-box mapping for built-
in Word styles
– See DITA for Publishers User Guide for details
• Add the jEuclid libraries to your FOP
installation
• Add the DITA 1.3 MathML and equation
domains to your local topic type shells
10/27/2014 Contrext, LLC 19
1. Author in Word
• Use Word with styles
• Use MathType to create equations
10/27/2014 Contrext, LLC 20
2. Generate Inline MathML
• Use MathType to convert
MathType equations to inline
MathML
• May want to save DOCX file to
a new location
– Conversion is not reversable
– Makes the Word largely
unusable
10/27/2014 Contrext, LLC 21
3. Generate DITA from The Word
Doc
• Run the D4P Word-to-DITA process
• Results in DITA XML
with MathML inline
10/27/2014 Contrext, LLC 22
4. Produce PDF with FOP
• Use normal DITA Open Toolkit process to
generate PDF using FOP
• Equations should
be rendered
10/27/2014 Contrext, LLC 23
Bonus: MathML In HTML
• The open-source MathJax package renders inline
MathML in any Javascript-capable browser
• The DITA 1.3 MathML support includes
generation of HTML with MathJax references
• Some browsers render MathML directly:
– Firefox
– IE 10+ (I think)
• Google has dropped MathML support from
Chrome
10/27/2014 Contrext, LLC 24
Resources
• DITA 1.3 spec: ???
• DITA for Publishers:
http://www.dita4publishers.org
• MathType: http://www.dessci.com
• FOP: http://xmlgraphics.apache.org/fop/
• Me: ekimber@contrext.com,
http://contrext.com
10/27/2014 Contrext, LLC 25

Weitere ähnliche Inhalte

Mehr von soapconf

Mehr von soapconf (14)

Karen Mardahl - Desiring accessibility, soap! 2015
Karen Mardahl - Desiring accessibility, soap! 2015Karen Mardahl - Desiring accessibility, soap! 2015
Karen Mardahl - Desiring accessibility, soap! 2015
 
Emilie Boillat - Whatchamacallit: Controlled Vocabularies for Technical Write...
Emilie Boillat - Whatchamacallit: Controlled Vocabularies for Technical Write...Emilie Boillat - Whatchamacallit: Controlled Vocabularies for Technical Write...
Emilie Boillat - Whatchamacallit: Controlled Vocabularies for Technical Write...
 
Erin Vang - Rockstars, not typists! Expanding your influence in tech organiza...
Erin Vang - Rockstars, not typists! Expanding your influence in tech organiza...Erin Vang - Rockstars, not typists! Expanding your influence in tech organiza...
Erin Vang - Rockstars, not typists! Expanding your influence in tech organiza...
 
Anton Bollen - What Makes a Video Effective?; soap! 2015
Anton Bollen - What Makes a Video Effective?; soap! 2015Anton Bollen - What Makes a Video Effective?; soap! 2015
Anton Bollen - What Makes a Video Effective?; soap! 2015
 
Adam Sanyo - Conref, conkeyref, conrefpush: Reuse strategies when working on ...
Adam Sanyo - Conref, conkeyref, conrefpush: Reuse strategies when working on ...Adam Sanyo - Conref, conkeyref, conrefpush: Reuse strategies when working on ...
Adam Sanyo - Conref, conkeyref, conrefpush: Reuse strategies when working on ...
 
Ray Gallon - Complexity, Nemetics, and Wicked Tech Comm; soap! 2015
Ray Gallon - Complexity, Nemetics, and Wicked Tech Comm; soap! 2015Ray Gallon - Complexity, Nemetics, and Wicked Tech Comm; soap! 2015
Ray Gallon - Complexity, Nemetics, and Wicked Tech Comm; soap! 2015
 
Rick Yagodich - Onramp: Making the case for author experience; soapconf 2014
Rick Yagodich - Onramp: Making the case for author experience; soapconf 2014Rick Yagodich - Onramp: Making the case for author experience; soapconf 2014
Rick Yagodich - Onramp: Making the case for author experience; soapconf 2014
 
Ray Gallon - Your most important business asset - Build a better end-to-end c...
Ray Gallon - Your most important business asset - Build a better end-to-end c...Ray Gallon - Your most important business asset - Build a better end-to-end c...
Ray Gallon - Your most important business asset - Build a better end-to-end c...
 
Felix Sasaki - Value beyond content creation - Introducing ITS 2.0; soapconf ...
Felix Sasaki - Value beyond content creation - Introducing ITS 2.0; soapconf ...Felix Sasaki - Value beyond content creation - Introducing ITS 2.0; soapconf ...
Felix Sasaki - Value beyond content creation - Introducing ITS 2.0; soapconf ...
 
Kasia Mrowca - How to defeat feature gluttony; soapconf 2014
Kasia Mrowca - How to defeat feature gluttony; soapconf 2014Kasia Mrowca - How to defeat feature gluttony; soapconf 2014
Kasia Mrowca - How to defeat feature gluttony; soapconf 2014
 
Noz Urbina - Messages for your manager about content; soapconf 2014
Noz Urbina - Messages for your manager about content; soapconf 2014Noz Urbina - Messages for your manager about content; soapconf 2014
Noz Urbina - Messages for your manager about content; soapconf 2014
 
Agnieszka Tkaczyk - Using infographics in technical communication; soapconf 2014
Agnieszka Tkaczyk - Using infographics in technical communication; soapconf 2014Agnieszka Tkaczyk - Using infographics in technical communication; soapconf 2014
Agnieszka Tkaczyk - Using infographics in technical communication; soapconf 2014
 
Monika Konieczny - Gamification & storytelling: how to turn boring technical ...
Monika Konieczny - Gamification & storytelling: how to turn boring technical ...Monika Konieczny - Gamification & storytelling: how to turn boring technical ...
Monika Konieczny - Gamification & storytelling: how to turn boring technical ...
 
Kevin Duncan - Speaking the visual language using images for effective commun...
Kevin Duncan - Speaking the visual language using images for effective commun...Kevin Duncan - Speaking the visual language using images for effective commun...
Kevin Duncan - Speaking the visual language using images for effective commun...
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Kürzlich hochgeladen (20)

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 

Eliot Kimber - Math ML with MathType, Word, DITA and FOP; soapconf 2014

  • 1. Word, MathType, DITA, MathML, and FOP Doing Math With DITA and Word 10/27/2014 Contrext, LLC 1
  • 2. About the Author • Independent consultant focusing on DITA analysis, design, and implementation • Doing SGML and XML for cough 30 years cough • Founding member of the DITA Technical Committee • Founding member of the XML Working Group • Co-editor of HyTime standard (ISO/IEC 10744) • Primary developer and founder of the DITA for Publishers project • Author of DITA for Practitioners, Vol 1 (XML Press) 10/27/2014 Contrext, LLC 2
  • 3. Agenda • The challenge: authoring math and publishing using DITA • Demonstration • The tools • The process 10/27/2014 Contrext, LLC 3
  • 4. Executive Summary • DITA 1.3 integrates MathML out of the box • FOP and Antenna House XSL Formatter both support MathML rendering • MathType for Word is a low-cost equation editor that supports MathML • The DITA for Publishers Word-to-DITA framework supports getting MathML from Word • All together it means low-cost, high-quality math publishing with DITA 10/27/2014 Contrext, LLC 4
  • 5. THE CHALLENGE: AUTHORING MATH AND PUBLISHING USING DITA 10/27/2014 Contrext, LLC 5
  • 6. Use Case: Authoring in Word, Generating DITA • Authors use Microsoft Word to author the content: – Academic papers – Technical standards – Technical documentation with math • Convert the Word to DITA using the DITA for Publishers Word-to-DITA framework • Need to get MathML equations from authors 10/27/2014 Contrext, LLC 6
  • 7. Math is Hard • Many ways to author math • MathML is the only XML standard for math • MathML not practical to author directly: requires an authoring tool of some sort • For XML use need a way to get MathML from authors along with the XML 10/27/2014 Contrext, LLC 7
  • 8. Rendering MathML • Requires MathML-aware tools • May be production quality issues depending on the renderer chosen • May need to generate images for some delivery targets (e.g., EPUB and Kindle) 10/27/2014 Contrext, LLC 8
  • 9. Word and MathType • Word has a built-in equation editor – It does not produce MathML directly • The Design Science MathType Word plugin can produce MathML directly – Can cut and paste from MathType as MathML – Can convert MathType binary equations to “inline MathML” • No other automated way to get MathML from the MathType equations 10/27/2014 Contrext, LLC 9
  • 10. DITA and MathML • Before DITA 1.3, no out-of-the-box MathML integration in DITA • Integration doable, e.g. DITA for Publishers, Design Science solutions • Need to implement MathML handling in the DITA Open Toolkit or other DITA tools 10/27/2014 Contrext, LLC 10
  • 12. Demonstration • Real process for scientific journal articles 10/27/2014 Contrext, LLC 12
  • 14. PDF Rendering: FOP and jEuclid • Apache FOP is open-source XSL-FO engine • When integrated with Apache jEuclid engine renders MathML directly • Rendition quality is good but may not be good enough for some publications or equations • To configure, just add jEuclid jar files to the FOP installation 10/27/2014 Contrext, LLC 14
  • 15. MathML in DITA • DITA 1.3 integrates MathML out of the box • This integration can be used with existing DITA 1.2 or 1.1 systems (not dependent on any other 1.3 features) • Provides the <mathml> container for containing inline MathML markup. • Provides the <mathmlref> element for using MathML markup by reference • Provides elements for representing semantic equations separate from the data format of the equations • oXygenXML supports MathML in the editor and can integrate with MathType 10/27/2014 Contrext, LLC 15
  • 16. MS Word and MathType • MathType is a low-cost Word plugin – Visual editing of equations – Can generate MathML – Can convert binary MathType equations to “inline MathML” in the word – This conversion is not reversible • Could also use Word’s proprietary equation markup – Not currently supported by the D4P Word-to-DITA process – But technically possible to convert to MathML 10/27/2014 Contrext, LLC 16
  • 17. Word-to-DITA Framework • Part of DITA for Publishers project • Uses a style-to-tag-map to map Word styles to DITA markup • Supports translation of “inline” MathML to MathML in the DITA • Requires using MathType to convert binary equations to inline MathML 10/27/2014 Contrext, LLC 17
  • 19. 0. Set Up • Set up the style-to-tag mapping for converting Word to DITA – D4P provides an out-of-the-box mapping for built- in Word styles – See DITA for Publishers User Guide for details • Add the jEuclid libraries to your FOP installation • Add the DITA 1.3 MathML and equation domains to your local topic type shells 10/27/2014 Contrext, LLC 19
  • 20. 1. Author in Word • Use Word with styles • Use MathType to create equations 10/27/2014 Contrext, LLC 20
  • 21. 2. Generate Inline MathML • Use MathType to convert MathType equations to inline MathML • May want to save DOCX file to a new location – Conversion is not reversable – Makes the Word largely unusable 10/27/2014 Contrext, LLC 21
  • 22. 3. Generate DITA from The Word Doc • Run the D4P Word-to-DITA process • Results in DITA XML with MathML inline 10/27/2014 Contrext, LLC 22
  • 23. 4. Produce PDF with FOP • Use normal DITA Open Toolkit process to generate PDF using FOP • Equations should be rendered 10/27/2014 Contrext, LLC 23
  • 24. Bonus: MathML In HTML • The open-source MathJax package renders inline MathML in any Javascript-capable browser • The DITA 1.3 MathML support includes generation of HTML with MathJax references • Some browsers render MathML directly: – Firefox – IE 10+ (I think) • Google has dropped MathML support from Chrome 10/27/2014 Contrext, LLC 24
  • 25. Resources • DITA 1.3 spec: ??? • DITA for Publishers: http://www.dita4publishers.org • MathType: http://www.dessci.com • FOP: http://xmlgraphics.apache.org/fop/ • Me: ekimber@contrext.com, http://contrext.com 10/27/2014 Contrext, LLC 25