SlideShare ist ein Scribd-Unternehmen logo
1 von 18
ITS 2.0 and Ontology Localization
Felix Sasaki
DFKI / W3C Fellow
March 2014
1
Disclaimer

This is a personal presentation –
nothing said on behalf of the
ITS IG 

2
“WHY?”

3
Recent email exchange
1. Mail from A to public list: “I would like to
translate ontology xyz into language ... –
what parts have to be translated”
2. Answer from ontology editor
3. Clarification question - mail from A
4. Answer from ontology editor
5. Acknowledgement from A: “Thank
you, everything clear to me!”
4
Recent email exchange
• Two persons time lost
– Mail writing & reading, ontology file “reading”, ...

• In total up to ½ a day
• How could this be avoided?
• Provide machine readable information on
what parts of the ontology should be
translated

5
“HOW?”

6
ITS 2.0
• Machine readable metadata to foster
automated content creation
• Input content: XML or HTML5
• Metadata: so called ITS “data categories”
• Example just uses ITS “Translate”

7
ITS 2.0 data categories – the list
•
•
•
•
•
•
•
•
•

Translate
Localization Note
Terminology
Directionality
Ruby
Language Information
Elements Within Text
Domain
Text Analysis /
Disambiguation
• Locale Filter

•
•
•
•
•
•
•
•
•
•

Provenance
External Resource
Target Pointer
Id Value
Preserve Space
Localization Quality Issue
Localization Quality Rating
MT Confidence
Allowed Characters
Storage Size

8
XML example – “local” ITS
<messages ...>

<msg num="123">Click Resume Button on Status

Display or
<panelmsg its:translate="no">CONTINUE</panelmsg>

Button on printer panel</msg>
</messages>

9
XML example – “global” ITS rules
<its:rules version="2.0" ...>
<its:translateRule translate="no"
selector="//code"/>
</its:rules>
• Relies on XPath or CSS selectors
• Global and local ITS work like CSS
• Local ITS overrides global ITS
• Global ITS rules: the last one wins
10
Saving time in ontology localization
• RDF (ontologies and instances) can be stored
as RDF/XML
• ITS 2.0 global rules provide a machine
readable approach to specify what is
translatable and what not
• The following slide shows how the mail
conversation could have been avoided

11
Global rule example for RDF/XML
<its:rules ...>
<its:translateRule selector="//*" translate="no"/>
<its:translateRule
selector="//rdfs:label[lang('en')] |
//dct:title[lang('en')] | rdfs:comment[lang('en')]"
translate="yes"/></its:rules>

• General global rule: all elements are not
translated
• Exception: various elements with human
readable text in English (= source language) are
to be translated
12
Benefit
• Problem solved 
• A lot of ITS 2.0 tool support for XML & HTML5
content
http://www.w3.org/International/its/wiki/ITS_Implementations

• Example: Okapi framework
– Open source modules and WYSIWYG tools for various
localization related tasks
– Screenshot of ITS 2.0 aware workflow: 1) segmentation
2) translation memory lookup 3) machine translation 4)
quality check 5) XLIFF generation
– RDF content can make use of all above steps right away!
13
1)
2)
3)
4)
5)

14
Drawback and Solutions
• Problem solved only for RDF/XML serialization
• What about Turtle, RDFa, N3, ...?
• Option 1: Recommend RDF/XML conversion for
localization
– Benefit: existing tools can be used
– Drawback: formulation of ITS 2.0 rules relies on XPath

• Option 2: Define ITS 2.0 metadata syntax for
other serializations
– Benefit: no need for XPath in non XML syntaxes
– Drawback: tool support not likely to happen soon
15
Potential synergies BPMLOD – ITS IG
• BPMLOD can get input on how to use real life
localization workflows and engage localization
& language technology tool producers (cf.
Okapi)
• ITS IG can reach out to new content types and
contribute to scenarios like “ontology
localization”
– In ITS 2.0, RDF so far is used in NIF
conversion, assuming non RDF content as actual
source content
16
Further information
• BPMLOD CG
http://www.w3.org/community/bpmlod/
• ITS IG
http://www.w3.org/International/its/ig/
• ITS 2.0 spec http://www.w3.org/TR/its20/

17
ITS 2.0 and Ontology Localization
Felix Sasaki
DFKI / W3C Fellow
March 2014
18

Weitere ähnliche Inhalte

Was ist angesagt?

Setting up multilingual websites with Content Management Systems
Setting up multilingual websites with  Content Management SystemsSetting up multilingual websites with  Content Management Systems
Setting up multilingual websites with Content Management SystemsQabiria
 
EPUB3 First Look
EPUB3 First LookEPUB3 First Look
EPUB3 First Lookwhmccoy
 
Humans should not write XML.
Humans should not write XML.Humans should not write XML.
Humans should not write XML.Peter Tröger
 
Charleston Conference 2017 - What's Past is Still Messing With Our Workflows
Charleston Conference 2017 - What's Past is Still Messing With Our Workflows Charleston Conference 2017 - What's Past is Still Messing With Our Workflows
Charleston Conference 2017 - What's Past is Still Messing With Our Workflows Jacqueline Whyte Appleby
 
Introduction to OmegaT
Introduction to OmegaTIntroduction to OmegaT
Introduction to OmegaTQabiria
 
P03- MANDIAC: A Web-based Annotation System For Manual Arabic Diacritization
P03- MANDIAC: A Web-based Annotation System For Manual Arabic Diacritization P03- MANDIAC: A Web-based Annotation System For Manual Arabic Diacritization
P03- MANDIAC: A Web-based Annotation System For Manual Arabic Diacritization iwan_rg
 

Was ist angesagt? (9)

Setting up multilingual websites with Content Management Systems
Setting up multilingual websites with  Content Management SystemsSetting up multilingual websites with  Content Management Systems
Setting up multilingual websites with Content Management Systems
 
Mock test
Mock testMock test
Mock test
 
2015 CIC: #EdTech Forum - LRMI
2015 CIC: #EdTech Forum - LRMI2015 CIC: #EdTech Forum - LRMI
2015 CIC: #EdTech Forum - LRMI
 
EPUB3 First Look
EPUB3 First LookEPUB3 First Look
EPUB3 First Look
 
Swt
SwtSwt
Swt
 
Humans should not write XML.
Humans should not write XML.Humans should not write XML.
Humans should not write XML.
 
Charleston Conference 2017 - What's Past is Still Messing With Our Workflows
Charleston Conference 2017 - What's Past is Still Messing With Our Workflows Charleston Conference 2017 - What's Past is Still Messing With Our Workflows
Charleston Conference 2017 - What's Past is Still Messing With Our Workflows
 
Introduction to OmegaT
Introduction to OmegaTIntroduction to OmegaT
Introduction to OmegaT
 
P03- MANDIAC: A Web-based Annotation System For Manual Arabic Diacritization
P03- MANDIAC: A Web-based Annotation System For Manual Arabic Diacritization P03- MANDIAC: A Web-based Annotation System For Manual Arabic Diacritization
P03- MANDIAC: A Web-based Annotation System For Manual Arabic Diacritization
 

Andere mochten auch

Terminologie für Alle - Praktischer Nutzen und unternehmensweite Wertschöpfung
Terminologie für Alle - Praktischer Nutzen und unternehmensweite Wertschöpfung Terminologie für Alle - Praktischer Nutzen und unternehmensweite Wertschöpfung
Terminologie für Alle - Praktischer Nutzen und unternehmensweite Wertschöpfung SDL Language Technologies
 
Einführung in die räumliche Statistik (2015)
Einführung in die räumliche Statistik (2015)Einführung in die räumliche Statistik (2015)
Einführung in die räumliche Statistik (2015)Jakob Nikolas Kather
 
Fehler im Unterricht konstruktiv nutzen
Fehler im Unterricht konstruktiv nutzenFehler im Unterricht konstruktiv nutzen
Fehler im Unterricht konstruktiv nutzenKarsten D. Wolf
 
Kommunikation und Gesprächsführung
Kommunikation und GesprächsführungKommunikation und Gesprächsführung
Kommunikation und Gesprächsführungfoobar2605
 
tekom/tcworld 2013 – T2: Einheitliche Terminologie in Technischer Dokumentation
tekom/tcworld 2013 – T2: Einheitliche Terminologie in Technischer Dokumentationtekom/tcworld 2013 – T2: Einheitliche Terminologie in Technischer Dokumentation
tekom/tcworld 2013 – T2: Einheitliche Terminologie in Technischer DokumentationGeorg Eck
 
20120114 Innovationsmanagement
20120114 Innovationsmanagement20120114 Innovationsmanagement
20120114 InnovationsmanagementICV
 
DITA: Editieren, Terminologie prüfen und Publizieren
DITA: Editieren, Terminologie prüfen und PublizierenDITA: Editieren, Terminologie prüfen und Publizieren
DITA: Editieren, Terminologie prüfen und PublizierenGeorg Eck
 
Mit Stil ans Ziel
Mit Stil ans ZielMit Stil ans Ziel
Mit Stil ans Zielmspgermany
 
Logo! Prinzipien und Gestaltgesetze
Logo! Prinzipien und GestaltgesetzeLogo! Prinzipien und Gestaltgesetze
Logo! Prinzipien und GestaltgesetzeKarl Piaty
 
Gestaltgesetze
GestaltgesetzeGestaltgesetze
GestaltgesetzeMMeellii
 
Kreativitätstechniken
KreativitätstechnikenKreativitätstechniken
Kreativitätstechnikenchristophc
 
Management-Techniken für mehr Partizipation
Management-Techniken für mehr PartizipationManagement-Techniken für mehr Partizipation
Management-Techniken für mehr PartizipationStefan ROOCK
 
Fragebogen erstellen - So kommen Sie ans Ziel!
Fragebogen erstellen - So kommen Sie ans Ziel!Fragebogen erstellen - So kommen Sie ans Ziel!
Fragebogen erstellen - So kommen Sie ans Ziel!INFONAUTICS GmbH
 

Andere mochten auch (20)

Terminologie für Alle - Praktischer Nutzen und unternehmensweite Wertschöpfung
Terminologie für Alle - Praktischer Nutzen und unternehmensweite Wertschöpfung Terminologie für Alle - Praktischer Nutzen und unternehmensweite Wertschöpfung
Terminologie für Alle - Praktischer Nutzen und unternehmensweite Wertschöpfung
 
Intgrationsmöglichkeiten von SDL BeGlobal
Intgrationsmöglichkeiten von SDL BeGlobalIntgrationsmöglichkeiten von SDL BeGlobal
Intgrationsmöglichkeiten von SDL BeGlobal
 
Einführung in die räumliche Statistik (2015)
Einführung in die räumliche Statistik (2015)Einführung in die räumliche Statistik (2015)
Einführung in die räumliche Statistik (2015)
 
Fehler im Unterricht konstruktiv nutzen
Fehler im Unterricht konstruktiv nutzenFehler im Unterricht konstruktiv nutzen
Fehler im Unterricht konstruktiv nutzen
 
Grammatik
GrammatikGrammatik
Grammatik
 
Kommunikation und Gesprächsführung
Kommunikation und GesprächsführungKommunikation und Gesprächsführung
Kommunikation und Gesprächsführung
 
tekom/tcworld 2013 – T2: Einheitliche Terminologie in Technischer Dokumentation
tekom/tcworld 2013 – T2: Einheitliche Terminologie in Technischer Dokumentationtekom/tcworld 2013 – T2: Einheitliche Terminologie in Technischer Dokumentation
tekom/tcworld 2013 – T2: Einheitliche Terminologie in Technischer Dokumentation
 
20120114 Innovationsmanagement
20120114 Innovationsmanagement20120114 Innovationsmanagement
20120114 Innovationsmanagement
 
DITA: Editieren, Terminologie prüfen und Publizieren
DITA: Editieren, Terminologie prüfen und PublizierenDITA: Editieren, Terminologie prüfen und Publizieren
DITA: Editieren, Terminologie prüfen und Publizieren
 
Visual Analytics
Visual AnalyticsVisual Analytics
Visual Analytics
 
Kollektive Ideenbewertung mit idea rating sheets
Kollektive Ideenbewertung mit idea rating sheetsKollektive Ideenbewertung mit idea rating sheets
Kollektive Ideenbewertung mit idea rating sheets
 
Mit Stil ans Ziel
Mit Stil ans ZielMit Stil ans Ziel
Mit Stil ans Ziel
 
Logo! Prinzipien und Gestaltgesetze
Logo! Prinzipien und GestaltgesetzeLogo! Prinzipien und Gestaltgesetze
Logo! Prinzipien und Gestaltgesetze
 
Gestaltgesetze
GestaltgesetzeGestaltgesetze
Gestaltgesetze
 
Pragmatik
PragmatikPragmatik
Pragmatik
 
Langenscheidt grammatik intensivtrainerr_a2
Langenscheidt grammatik intensivtrainerr_a2Langenscheidt grammatik intensivtrainerr_a2
Langenscheidt grammatik intensivtrainerr_a2
 
Kreativitätstechniken
KreativitätstechnikenKreativitätstechniken
Kreativitätstechniken
 
Management-Techniken für mehr Partizipation
Management-Techniken für mehr PartizipationManagement-Techniken für mehr Partizipation
Management-Techniken für mehr Partizipation
 
Fragebogen erstellen - So kommen Sie ans Ziel!
Fragebogen erstellen - So kommen Sie ans Ziel!Fragebogen erstellen - So kommen Sie ans Ziel!
Fragebogen erstellen - So kommen Sie ans Ziel!
 
Sprechakt und Sprechakttheorie
Sprechakt und SprechakttheorieSprechakt und Sprechakttheorie
Sprechakt und Sprechakttheorie
 

Ähnlich wie Its2 ontology-localization

Introduction to libre « fulltext » technology
Introduction to libre « fulltext » technologyIntroduction to libre « fulltext » technology
Introduction to libre « fulltext » technologyRobert Viseur
 
Presentation of OpenNLP
Presentation of OpenNLPPresentation of OpenNLP
Presentation of OpenNLPRobert Viseur
 
epicenter2010 Open Xml
epicenter2010   Open Xmlepicenter2010   Open Xml
epicenter2010 Open XmlCraig Murphy
 
Decoding and developing the online finding aid
Decoding and developing the online finding aidDecoding and developing the online finding aid
Decoding and developing the online finding aidkgerber
 
Logging presentation
Logging presentationLogging presentation
Logging presentationJatan Malde
 
Integrating a Domain Ontology Development Environment and an Ontology Search ...
Integrating a Domain Ontology Development Environment and an Ontology Search ...Integrating a Domain Ontology Development Environment and an Ontology Search ...
Integrating a Domain Ontology Development Environment and an Ontology Search ...Takeshi Morita
 
Fedora Overview
Fedora OverviewFedora Overview
Fedora Overvieweposthumus
 
Intro to Perfect - LA presentation
Intro to Perfect - LA presentationIntro to Perfect - LA presentation
Intro to Perfect - LA presentationTim Taplin
 
Supporting developers with natural language queries
Supporting developers with natural language queriesSupporting developers with natural language queries
Supporting developers with natural language queriesMasud Rahman
 
Approaches to document/report generation
Approaches to document/report generation Approaches to document/report generation
Approaches to document/report generation plutext
 
Drupal and Apache Stanbol
Drupal and Apache StanbolDrupal and Apache Stanbol
Drupal and Apache StanbolAlkuvoima
 
Digital Libraries
Digital LibrariesDigital Libraries
Digital LibrariesJack Eapen
 
Digital Libraries
Digital LibrariesDigital Libraries
Digital LibrariesJack Eapen
 
Rclex: A Library for Robotics meet Elixir
Rclex: A Library for Robotics meet ElixirRclex: A Library for Robotics meet Elixir
Rclex: A Library for Robotics meet ElixirHideki Takase
 
The Standards Mosaic Opening the Way to New Technologies
The Standards Mosaic Opening the Way to New TechnologiesThe Standards Mosaic Opening the Way to New Technologies
The Standards Mosaic Opening the Way to New TechnologiesDave Lewis
 
NIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportNIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportSebastian Hellmann
 
Evolution Of Object Oriented Technology
Evolution Of Object Oriented TechnologyEvolution Of Object Oriented Technology
Evolution Of Object Oriented TechnologySharon Roberts
 

Ähnlich wie Its2 ontology-localization (20)

Introduction to libre « fulltext » technology
Introduction to libre « fulltext » technologyIntroduction to libre « fulltext » technology
Introduction to libre « fulltext » technology
 
Presentation of OpenNLP
Presentation of OpenNLPPresentation of OpenNLP
Presentation of OpenNLP
 
Metamorphic Domain-Specific Languages
Metamorphic Domain-Specific LanguagesMetamorphic Domain-Specific Languages
Metamorphic Domain-Specific Languages
 
Ld4 l triannon
Ld4 l triannonLd4 l triannon
Ld4 l triannon
 
epicenter2010 Open Xml
epicenter2010   Open Xmlepicenter2010   Open Xml
epicenter2010 Open Xml
 
Decoding and developing the online finding aid
Decoding and developing the online finding aidDecoding and developing the online finding aid
Decoding and developing the online finding aid
 
01 html-introduction
01 html-introduction01 html-introduction
01 html-introduction
 
Logging presentation
Logging presentationLogging presentation
Logging presentation
 
Integrating a Domain Ontology Development Environment and an Ontology Search ...
Integrating a Domain Ontology Development Environment and an Ontology Search ...Integrating a Domain Ontology Development Environment and an Ontology Search ...
Integrating a Domain Ontology Development Environment and an Ontology Search ...
 
Fedora Overview
Fedora OverviewFedora Overview
Fedora Overview
 
Intro to Perfect - LA presentation
Intro to Perfect - LA presentationIntro to Perfect - LA presentation
Intro to Perfect - LA presentation
 
Supporting developers with natural language queries
Supporting developers with natural language queriesSupporting developers with natural language queries
Supporting developers with natural language queries
 
Approaches to document/report generation
Approaches to document/report generation Approaches to document/report generation
Approaches to document/report generation
 
Drupal and Apache Stanbol
Drupal and Apache StanbolDrupal and Apache Stanbol
Drupal and Apache Stanbol
 
Digital Libraries
Digital LibrariesDigital Libraries
Digital Libraries
 
Digital Libraries
Digital LibrariesDigital Libraries
Digital Libraries
 
Rclex: A Library for Robotics meet Elixir
Rclex: A Library for Robotics meet ElixirRclex: A Library for Robotics meet Elixir
Rclex: A Library for Robotics meet Elixir
 
The Standards Mosaic Opening the Way to New Technologies
The Standards Mosaic Opening the Way to New TechnologiesThe Standards Mosaic Opening the Way to New Technologies
The Standards Mosaic Opening the Way to New Technologies
 
NIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate reportNIF 2.0 Phd thesis intermediate report
NIF 2.0 Phd thesis intermediate report
 
Evolution Of Object Oriented Technology
Evolution Of Object Oriented TechnologyEvolution Of Object Oriented Technology
Evolution Of Object Oriented Technology
 

Mehr von Felix Sasaki

Thb tag-des-offenen-fensters-2021-sasaki-graphdatenbanken
Thb tag-des-offenen-fensters-2021-sasaki-graphdatenbankenThb tag-des-offenen-fensters-2021-sasaki-graphdatenbanken
Thb tag-des-offenen-fensters-2021-sasaki-graphdatenbankenFelix Sasaki
 
Sasaki Presentation at EVA 2016
Sasaki Presentation at EVA 2016Sasaki Presentation at EVA 2016
Sasaki Presentation at EVA 2016Felix Sasaki
 
Linked data-tooling-xml
Linked data-tooling-xmlLinked data-tooling-xml
Linked data-tooling-xmlFelix Sasaki
 
Sasaki datathon-madrid-2015
Sasaki datathon-madrid-2015Sasaki datathon-madrid-2015
Sasaki datathon-madrid-2015Felix Sasaki
 
Freme at feisgiltt 2015 freme & linked data & localisers
Freme at feisgiltt 2015   freme & linked data & localisersFreme at feisgiltt 2015   freme & linked data & localisers
Freme at feisgiltt 2015 freme & linked data & localisersFelix Sasaki
 
Freme at feisgiltt 2015 freme use cases
Freme at feisgiltt 2015   freme use casesFreme at feisgiltt 2015   freme use cases
Freme at feisgiltt 2015 freme use casesFelix Sasaki
 
Sasaki practical-linked-data
Sasaki practical-linked-dataSasaki practical-linked-data
Sasaki practical-linked-dataFelix Sasaki
 
1114 sasaki-metadata
1114 sasaki-metadata1114 sasaki-metadata
1114 sasaki-metadataFelix Sasaki
 
Sasaki ins-netz-gegangen-20111117
Sasaki ins-netz-gegangen-20111117Sasaki ins-netz-gegangen-20111117
Sasaki ins-netz-gegangen-20111117Felix Sasaki
 
"Warum Metadaten? Ein Plädoyer und mehr …" - webtechcon 2011 Präsentation
"Warum Metadaten? Ein Plädoyer und mehr …" - webtechcon 2011 Präsentation"Warum Metadaten? Ein Plädoyer und mehr …" - webtechcon 2011 Präsentation
"Warum Metadaten? Ein Plädoyer und mehr …" - webtechcon 2011 PräsentationFelix Sasaki
 
Sasaki markupforum2011
Sasaki markupforum2011Sasaki markupforum2011
Sasaki markupforum2011Felix Sasaki
 
Sasaki webtechcon2010
Sasaki webtechcon2010Sasaki webtechcon2010
Sasaki webtechcon2010Felix Sasaki
 
Mlw sasaki-20101027
Mlw sasaki-20101027Mlw sasaki-20101027
Mlw sasaki-20101027Felix Sasaki
 
HTML5 - presentation at W3C-Tag 2009
HTML5 - presentation at W3C-Tag 2009HTML5 - presentation at W3C-Tag 2009
HTML5 - presentation at W3C-Tag 2009Felix Sasaki
 

Mehr von Felix Sasaki (15)

Thb tag-des-offenen-fensters-2021-sasaki-graphdatenbanken
Thb tag-des-offenen-fensters-2021-sasaki-graphdatenbankenThb tag-des-offenen-fensters-2021-sasaki-graphdatenbanken
Thb tag-des-offenen-fensters-2021-sasaki-graphdatenbanken
 
XML Seminar
XML SeminarXML Seminar
XML Seminar
 
Sasaki Presentation at EVA 2016
Sasaki Presentation at EVA 2016Sasaki Presentation at EVA 2016
Sasaki Presentation at EVA 2016
 
Linked data-tooling-xml
Linked data-tooling-xmlLinked data-tooling-xml
Linked data-tooling-xml
 
Sasaki datathon-madrid-2015
Sasaki datathon-madrid-2015Sasaki datathon-madrid-2015
Sasaki datathon-madrid-2015
 
Freme at feisgiltt 2015 freme & linked data & localisers
Freme at feisgiltt 2015   freme & linked data & localisersFreme at feisgiltt 2015   freme & linked data & localisers
Freme at feisgiltt 2015 freme & linked data & localisers
 
Freme at feisgiltt 2015 freme use cases
Freme at feisgiltt 2015   freme use casesFreme at feisgiltt 2015   freme use cases
Freme at feisgiltt 2015 freme use cases
 
Sasaki practical-linked-data
Sasaki practical-linked-dataSasaki practical-linked-data
Sasaki practical-linked-data
 
1114 sasaki-metadata
1114 sasaki-metadata1114 sasaki-metadata
1114 sasaki-metadata
 
Sasaki ins-netz-gegangen-20111117
Sasaki ins-netz-gegangen-20111117Sasaki ins-netz-gegangen-20111117
Sasaki ins-netz-gegangen-20111117
 
"Warum Metadaten? Ein Plädoyer und mehr …" - webtechcon 2011 Präsentation
"Warum Metadaten? Ein Plädoyer und mehr …" - webtechcon 2011 Präsentation"Warum Metadaten? Ein Plädoyer und mehr …" - webtechcon 2011 Präsentation
"Warum Metadaten? Ein Plädoyer und mehr …" - webtechcon 2011 Präsentation
 
Sasaki markupforum2011
Sasaki markupforum2011Sasaki markupforum2011
Sasaki markupforum2011
 
Sasaki webtechcon2010
Sasaki webtechcon2010Sasaki webtechcon2010
Sasaki webtechcon2010
 
Mlw sasaki-20101027
Mlw sasaki-20101027Mlw sasaki-20101027
Mlw sasaki-20101027
 
HTML5 - presentation at W3C-Tag 2009
HTML5 - presentation at W3C-Tag 2009HTML5 - presentation at W3C-Tag 2009
HTML5 - presentation at W3C-Tag 2009
 

Its2 ontology-localization

  • 1. ITS 2.0 and Ontology Localization Felix Sasaki DFKI / W3C Fellow March 2014 1
  • 2. Disclaimer This is a personal presentation – nothing said on behalf of the ITS IG  2
  • 4. Recent email exchange 1. Mail from A to public list: “I would like to translate ontology xyz into language ... – what parts have to be translated” 2. Answer from ontology editor 3. Clarification question - mail from A 4. Answer from ontology editor 5. Acknowledgement from A: “Thank you, everything clear to me!” 4
  • 5. Recent email exchange • Two persons time lost – Mail writing & reading, ontology file “reading”, ... • In total up to ½ a day • How could this be avoided? • Provide machine readable information on what parts of the ontology should be translated 5
  • 7. ITS 2.0 • Machine readable metadata to foster automated content creation • Input content: XML or HTML5 • Metadata: so called ITS “data categories” • Example just uses ITS “Translate” 7
  • 8. ITS 2.0 data categories – the list • • • • • • • • • Translate Localization Note Terminology Directionality Ruby Language Information Elements Within Text Domain Text Analysis / Disambiguation • Locale Filter • • • • • • • • • • Provenance External Resource Target Pointer Id Value Preserve Space Localization Quality Issue Localization Quality Rating MT Confidence Allowed Characters Storage Size 8
  • 9. XML example – “local” ITS <messages ...> <msg num="123">Click Resume Button on Status Display or <panelmsg its:translate="no">CONTINUE</panelmsg> Button on printer panel</msg> </messages> 9
  • 10. XML example – “global” ITS rules <its:rules version="2.0" ...> <its:translateRule translate="no" selector="//code"/> </its:rules> • Relies on XPath or CSS selectors • Global and local ITS work like CSS • Local ITS overrides global ITS • Global ITS rules: the last one wins 10
  • 11. Saving time in ontology localization • RDF (ontologies and instances) can be stored as RDF/XML • ITS 2.0 global rules provide a machine readable approach to specify what is translatable and what not • The following slide shows how the mail conversation could have been avoided 11
  • 12. Global rule example for RDF/XML <its:rules ...> <its:translateRule selector="//*" translate="no"/> <its:translateRule selector="//rdfs:label[lang('en')] | //dct:title[lang('en')] | rdfs:comment[lang('en')]" translate="yes"/></its:rules> • General global rule: all elements are not translated • Exception: various elements with human readable text in English (= source language) are to be translated 12
  • 13. Benefit • Problem solved  • A lot of ITS 2.0 tool support for XML & HTML5 content http://www.w3.org/International/its/wiki/ITS_Implementations • Example: Okapi framework – Open source modules and WYSIWYG tools for various localization related tasks – Screenshot of ITS 2.0 aware workflow: 1) segmentation 2) translation memory lookup 3) machine translation 4) quality check 5) XLIFF generation – RDF content can make use of all above steps right away! 13
  • 15. Drawback and Solutions • Problem solved only for RDF/XML serialization • What about Turtle, RDFa, N3, ...? • Option 1: Recommend RDF/XML conversion for localization – Benefit: existing tools can be used – Drawback: formulation of ITS 2.0 rules relies on XPath • Option 2: Define ITS 2.0 metadata syntax for other serializations – Benefit: no need for XPath in non XML syntaxes – Drawback: tool support not likely to happen soon 15
  • 16. Potential synergies BPMLOD – ITS IG • BPMLOD can get input on how to use real life localization workflows and engage localization & language technology tool producers (cf. Okapi) • ITS IG can reach out to new content types and contribute to scenarios like “ontology localization” – In ITS 2.0, RDF so far is used in NIF conversion, assuming non RDF content as actual source content 16
  • 17. Further information • BPMLOD CG http://www.w3.org/community/bpmlod/ • ITS IG http://www.w3.org/International/its/ig/ • ITS 2.0 spec http://www.w3.org/TR/its20/ 17
  • 18. ITS 2.0 and Ontology Localization Felix Sasaki DFKI / W3C Fellow March 2014 18