MT Quality – LSP Perspective
Olga Beregovaya
VP, Technology Solutions
• What do translators appreciate?
• What do translators struggle most with?
• Engineering – impact on quality?
• Final output quality?
In Translators’ Own Words
THE POST-EDITOR PRODUCES:
Publishable quality
The post-editor is responsible for ensuring that client quality requirements and style guide are met
The post-editor is expected to adhere to client style guide preferences with regard to:
Infinitive / Imperative
Passive / Active
Formal / Informal
Different Styles for Headers, Lists, Tables
Special Handling of UI Options (Bilingual, English, Target?)
Converting All the Measurements Based On the Local Conventions
+ Disambiguate Terminology
+ Correct all the grammatical errors
But does the machine produce sufficient output?
THE POST-EDITOR RECEIVES:
ERROR CATEGORY                    GERMAN    FRENCH  JAPANESE   RUSSIAN   CHINESE   SPANISH   ITALIAN BRAZILIAN
WRONG TERMINOLOGY                   6.46      4.93     13.63      5.00      6.20      9.63      3.78      1.13
WRONG SPELLING                      2.00      0.86      0.88      0.13      0.30      1.13      0.56      1.27
SOURCE NOT TRANSLATED               6.38      5.36      3.88      5.13      3.60      2.50      1.22      1.73
COMPLIANCE WITH CLIENT SPECS        2.46      0.86      3.00      2.13      0.70      0.63      0.44      2.60
LITERAL TRANSLATION                 7.85      8.64      5.00      4.00      9.40      5.38      7.67      7.93
TEXT/INFO ADDED                     2.69      1.36      2.13      1.25      0.80      1.88      0.44      0.80
CAPITALIZATION                      2.69      3.43      0.00      2.63      0.50      1.75      3.33      2.60
WRONG WORD FORM                     6.77      7.79      0.13      9.88      0.60      6.75      3.67      6.75
WRONG PART OF SPEECH                2.62      3.21      2.00      1.88      0.60      2.13      3.67      1.33
PUNCTUATION                         4.46      3.00      0.75      3.38      4.10      2.13      1.22      3.53
SENTENCE STRUCTURE                 12.54     10.00     14.25      8.00     13.00      5.38      6.11      3.67
TAGS + MARK-UP                      1.23      0.14      0.13      0.50      0.20      0.38      0.44      0.20
LOCALE ADAPTATION                   0.46      0.29      0.75      0.63      0.20      0.75      0.44      0.13
SPACING                             0.92      0.36      2.25      1.25      4.00      0.50      0.33      0.40
OTHER                               1.92      1.50      1.88      0.13      0.50      0.13      1.44      0.27
TOTAL ERRORS                       61.46     51.71     50.63     45.88     44.70     41.00     34.78     32.53
The most time-consuming issues that translators need to fix are:
• Sentence structure (word order)
• MT output too literal
• Wrong terminology
• Word form disagreements
• Source term left untranslated
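One way to see how the error table relates to this list is to rank the categories by their score summed over the eight languages. The sketch below is only an illustration: the figures are copied from the table, the unit of the scores is not stated on the slide, and summing across languages is an assumption made here, not necessarily how the ranking was produced.

```python
# Error scores per category, copied from the table on the slide.
# Column order: German, French, Japanese, Russian, Chinese, Spanish, Italian, Brazilian.
SCORES = {
    "WRONG TERMINOLOGY": [6.46, 4.93, 13.63, 5.00, 6.20, 9.63, 3.78, 1.13],
    "WRONG SPELLING": [2.00, 0.86, 0.88, 0.13, 0.30, 1.13, 0.56, 1.27],
    "SOURCE NOT TRANSLATED": [6.38, 5.36, 3.88, 5.13, 3.60, 2.50, 1.22, 1.73],
    "COMPLIANCE WITH CLIENT SPECS": [2.46, 0.86, 3.00, 2.13, 0.70, 0.63, 0.44, 2.60],
    "LITERAL TRANSLATION": [7.85, 8.64, 5.00, 4.00, 9.40, 5.38, 7.67, 7.93],
    "TEXT/INFO ADDED": [2.69, 1.36, 2.13, 1.25, 0.80, 1.88, 0.44, 0.80],
    "CAPITALIZATION": [2.69, 3.43, 0.00, 2.63, 0.50, 1.75, 3.33, 2.60],
    "WRONG WORD FORM": [6.77, 7.79, 0.13, 9.88, 0.60, 6.75, 3.67, 6.75],
    "WRONG PART OF SPEECH": [2.62, 3.21, 2.00, 1.88, 0.60, 2.13, 3.67, 1.33],
    "PUNCTUATION": [4.46, 3.00, 0.75, 3.38, 4.10, 2.13, 1.22, 3.53],
    "SENTENCE STRUCTURE": [12.54, 10.00, 14.25, 8.00, 13.00, 5.38, 6.11, 3.67],
    "TAGS + MARK-UP": [1.23, 0.14, 0.13, 0.50, 0.20, 0.38, 0.44, 0.20],
    "LOCALE ADAPTATION": [0.46, 0.29, 0.75, 0.63, 0.20, 0.75, 0.44, 0.13],
    "SPACING": [0.92, 0.36, 2.25, 1.25, 4.00, 0.50, 0.33, 0.40],
    "OTHER": [1.92, 1.50, 1.88, 0.13, 0.50, 0.13, 1.44, 0.27],
}


def rank_categories(scores):
    """Return category names sorted by score summed across languages, highest first."""
    return sorted(scores, key=lambda name: sum(scores[name]), reverse=True)


top_five = rank_categories(SCORES)[:5]
print(top_five)
```

With these figures the top five come out as sentence structure, literal translation, wrong terminology, wrong word form and source not translated, in that order, which matches the list.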
OR, IN A NUTSHELL…
The Translator Gains…
Productivity gains range from 56% to negative, depending on:
- Content type
- Engine output quality
- How fast human translation (HT) already is (and how much MT helps)
- Correlation?
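The slide does not say how the gain is measured. A common way to express it, assumed here purely for illustration, is the relative change in throughput between human translation (HT) and post-editing (PE). The function name and the words-per-hour figures below are hypothetical.

```python
def productivity_gain(ht_words_per_hour: float, pe_words_per_hour: float) -> float:
    """Relative throughput change of post-editing (PE) versus human translation (HT)."""
    if ht_words_per_hour <= 0:
        raise ValueError("HT throughput must be positive")
    return (pe_words_per_hour - ht_words_per_hour) / ht_words_per_hour


# Hypothetical throughputs, chosen only to illustrate both ends of the range.
print(f"{productivity_gain(500, 780):.0%}")  # post-editing faster than HT
print(f"{productivity_gain(500, 450):.0%}")  # post-editing slower than HT
```

Under this definition a fast translator working with weak MT output can easily land below zero, which is one reading of the "to negative" end of the range.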
TOP 6 ON THE TRANSLATORS’ LOVE IT-LIST
1. Source of inspiration: reduces thinking and translation choice time
2. Provides reference - very useful to translators new to a specific domain
3. Reduces typing & lookup time by handling repetitive terminology and structures well
4. …thereby takes away the more monotonous efforts of translation
5. Post-editors over time notice improvements; appreciate it more if they ‘co-own’ the engine
6. MT output can be funny
LOL!
TOP 3 ON THE TRANSLATORS’ S*#!T-LIST
1. Wrong sentence structure
• Major impact on the post-editing effort (Spanish and Portuguese produce fewest errors)
• Japanese has the highest error rate and the lowest productivity gains (supported by the cognitive effort error ranking research)
2. Wrong and inconsistent terminology
• Very time-consuming to check and fix terminology, on top of the issues that fuzzy matches already bring
• A major problem for new products where the terminology is not settled yet
• Inconsistent output for UI references
3. Correct MT to an agreed standard (=quality expectations)
• A challenging concept in the beginning for post-editors – they think they should edit less if the quality is bad
S*#!T
FEEDBACK LOOP – Essential!
SOURCE TEXT / MT OUTPUT / POST-EDITED OUTPUT / SPECIFIC ERRORS/CHANGES MADE

Example 1
Source text: Single-phase options range from 1.4kW to 7.7kW while three-phase PDUs, packed with output receptacles, range from 8.6kW to 21.6kW.
MT output: Single-fase 7.7kW Opties variëren van 1.4kW om en driefasige PDU's, boordevol Output-aansluitingen, variëren van 8,6 kW tot 21.6kW.
Post-edited output: 1,4 kW ... 7,7 kW ... 21,6 kW
Specific errors/changes made: Numbers and measurement units are not converted properly and no spaces inserted by MT engine (3 out of 4 occurrences, 1 is correct however, strange...)

Example 2
Source text: Single-phase options range from 1.4kW to 7.7kW while three-phase PDUs, packed with output receptacles, range from 8.6kW to 21.6kW.
MT output: • Biedt maximaal 24 TB <fmt id="1" tooltip="SUPERSCRIPT" endtooltip="SUPERSCRIPT"> 2 </fmt> maximale capaciteit per-uitbreidingsbehuizing toe te voegen.
Post-edited output: • Biedt een maximale capaciteit van 24 TB<fmt id="1" tooltip="SUPERSCRIPT" endtooltip="SUPERSCRIPT">2</fmt> per uitbreidingsbehuizing.
Specific errors/changes made: No space should be inserted in front of and behind a number in superscript (in this case a "2"). ...>2<... and not: > 2 <

Example 3
Source text: <fmt id="1" tooltip="b" endtooltip="b">Interface Speed:</fmt> 6 Gb/s SAS
MT output: <fmt id="1" tooltip="b" endtooltip="b"> Interfacesnelheid: 6 </fmt> Gb/s SAS
Post-edited output: • Biedt een maximale capaciteit van 24 TB<fmt id="1" tooltip="SUPERSCRIPT" endtooltip="SUPERSCRIPT">2</fmt> per uitbreidingsbehuizing.
Specific errors/changes made: The number is inserted before the tag and should be after the tag

Example 4
Source text: <fmt id="1" tooltip="b" endtooltip="b">Intermixed Drive Capacities:</fmt> Yes
MT output: <fmt id="1" tooltip="b" endtooltip="b"> Intermixed Capaciteit van de schijven: Ja </fmt>
Post-edited output: ...</fmt> Ja
Specific errors/changes made: The string is inserted before the tag and should be after the tag (and again spacing before and after tags inserted)

Example 5
Source text: A new feature — DR Rapid Data Access — adds tighter integration with backup software applications, starting with Symantec OpenStorage-enabled backup applications.
MT output: Een nieuwe functie - DR-Rapid Data Access - voegt strakkere integratie met back-uptoepassingen, beginnend met Symantec OpenStorage geschikte back-uptoepassingen.
Post-edited output: ... — DR Rapid Data Access — ...
Specific errors/changes made: Please ensure any special characters like — (ChrW(151)) are preserved when inserting a TM proposal, and not replaced by a normal hyphen (ChrW(45)).
Can these errors be learned and corrected automatically? Can we simplify or omit the “feedback loop”?
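As a partial answer to the first question, the purely mechanical errors in the examples (decimal separators, unit spacing, spaces just inside tags, lost dashes) could plausibly be caught by simple rules before a post-editor sees them. The sketch below is a rule-based illustration, not the approach used in the program described here. The unit list, function names and regular expressions are all assumptions, and the number rule treats any dot or comma between digits as a decimal separator.

```python
import re

UNITS = ("kW", "TB")  # assumed unit list, for illustration only
MEASUREMENT = re.compile(
    r"(\d+)[.,](\d+)\s*(" + "|".join(map(re.escape, UNITS)) + r")\b"
)
FMT_PAIR = re.compile(r"(<fmt\b[^>]*>)\s*(.*?)\s*(</fmt>)")
EM_DASH = "\u2014"


def localize_measurements(text: str) -> str:
    """Rewrite '21.6kW' as '21,6 kW': decimal comma plus a space before the unit."""
    return MEASUREMENT.sub(r"\1,\2 \3", text)


def tighten_tag_spacing(text: str) -> str:
    """Drop whitespace just inside a <fmt>...</fmt> pair: '> 2 </fmt>' becomes '>2</fmt>'."""
    return FMT_PAIR.sub(r"\1\2\3", text)


def dashes_preserved(source: str, target: str) -> bool:
    """Check that the target keeps as many em dashes as the source."""
    return source.count(EM_DASH) == target.count(EM_DASH)


print(localize_measurements("variëren van 8,6 kW tot 21.6kW."))
print(tighten_tag_spacing('24 TB <fmt id="1" tooltip="SUPERSCRIPT" endtooltip="SUPERSCRIPT"> 2 </fmt>'))
```

Rules like these only cover formatting. They do not move text that landed on the wrong side of a tag, and they do nothing for terminology or sentence structure, so they shorten the feedback loop rather than replace it.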
POST-EDITING QUALITY RESULTS
No fails on one of our 28-language PE programs, thanks to correct terminology choices and few, consistent errors.
LOCALIZATION TAG PLACEMENT
This is what a plain-text engine will do:
To become verified and lift your sending limit, please confirm your email address, then add a credit or prepaid card to your account and {30} {31} {32} {33} {34} {35}confirm{36} {37} {38} it.{39}.
{30}Para hacerse verificado y levantar su límite de envío, por favor confirme su dirección de correo electrónico, luego añada un crédito o tarjeta de prepago a su cuenta de y confírmelo.{31}{32}{33}{34}{35}{36}{37}{38}{39}
This is a<ph id="1" x="&lt;b&gt;">{1}</ph>test<ph id="1" x="&lt;/b&gt;">{2}</ph>
Dies ist ein <ph id="1" x="&lt;b&gt;">{1}</ph>Test<ph id="1" x="&lt;/b&gt;">{2}</ph>.
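The first pair of segments above shows placeholders that wrapped text in the source ending up bunched together at the end of the target. A check along the following lines could flag such segments automatically. It is a minimal sketch: the `{n}` placeholder syntax is taken from the example, while the function names and the comparison rule (does any text sit between two neighbouring placeholders?) are assumptions made for illustration.

```python
import re

PLACEHOLDER = re.compile(r"\{\d+\}")


def gaps_with_text(segment: str):
    """For each adjacent placeholder pair, record whether real text sits between them."""
    matches = list(PLACEHOLDER.finditer(segment))
    return [
        (a.group(), b.group(), bool(segment[a.end():b.start()].strip()))
        for a, b in zip(matches, matches[1:])
    ]


def misplaced_pairs(source: str, target: str):
    """List placeholder pairs whose 'text in between' status differs in the target."""
    src = {(a, b): has_text for a, b, has_text in gaps_with_text(source)}
    tgt = {(a, b): has_text for a, b, has_text in gaps_with_text(target)}
    return [pair for pair, has_text in src.items() if tgt.get(pair) != has_text]


source = "to your account and {30} {31} {32} {33} {34} {35}confirm{36} {37} {38} it.{39}."
target = "{30}Para hacerse verificado y confírmelo.{31}{32}{33}{34}{35}{36}{37}{38}{39}"
print(misplaced_pairs(source, target))
```

For the shortened example strings, the check reports the pairs {30}/{31}, {35}/{36} and {38}/{39}: the first gained text it should not wrap, the other two lost the words they enclosed in the source.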
AND THIS IS WHAT’S NEEDED
• More transparency in the workings of the engine and training
• Faster systems, shorter turnaround on large systems
• More “wizards” for training and deployment
• Easier testing methodologies without full deployments
• More standardized scoring and comparison metrics
• Predictive analysis of quality – confidence and utility scores
• Normalization integrated into workflow and standardized
• Industry-wide proper name and title library
• Better transliteration standards
• Morphologically aware terminology choices
• More research on post-editing environments
1. How to display source/target
2. How to display multiple suggestions
3. Autocomplete
4. Better ways to calculate the productivity improvements with post-editing
• More interoperability, so translators can stay in the CAT tool they prefer
• Simplified workflows connecting MT engines and other tools
Translator Wishlist

Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyCall Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyPooja Nehwal
 
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort ServiceDelhi Call girls
 
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Delhi Call girls
 
If this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaIf this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaKayode Fayemi
 
Microsoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AIMicrosoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AITatiana Gurgel
 
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesVVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesPooja Nehwal
 
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, YardstickSaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, Yardsticksaastr
 

Kürzlich hochgeladen (20)

No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
No Advance 8868886958 Chandigarh Call Girls , Indian Call Girls For Full Nigh...
 
Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)Introduction to Prompt Engineering (Focusing on ChatGPT)
Introduction to Prompt Engineering (Focusing on ChatGPT)
 
Report Writing Webinar Training
Report Writing Webinar TrainingReport Writing Webinar Training
Report Writing Webinar Training
 
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdfThe workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
The workplace ecosystem of the future 24.4.2024 Fabritius_share ii.pdf
 
Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510Thirunelveli call girls Tamil escorts 7877702510
Thirunelveli call girls Tamil escorts 7877702510
 
Air breathing and respiratory adaptations in diver animals
Air breathing and respiratory adaptations in diver animalsAir breathing and respiratory adaptations in diver animals
Air breathing and respiratory adaptations in diver animals
 
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docxANCHORING SCRIPT FOR A CULTURAL EVENT.docx
ANCHORING SCRIPT FOR A CULTURAL EVENT.docx
 
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptxChiulli_Aurora_Oman_Raffaele_Beowulf.pptx
Chiulli_Aurora_Oman_Raffaele_Beowulf.pptx
 
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
Andrés Ramírez Gossler, Facundo Schinnea - eCommerce Day Chile 2024
 
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
Re-membering the Bard: Revisiting The Compleat Wrks of Wllm Shkspr (Abridged)...
 
Mathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptxMathematics of Finance Presentation.pptx
Mathematics of Finance Presentation.pptx
 
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptxMohammad_Alnahdi_Oral_Presentation_Assignment.pptx
Mohammad_Alnahdi_Oral_Presentation_Assignment.pptx
 
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night EnjoyCall Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
Call Girl Number in Khar Mumbai📲 9892124323 💞 Full Night Enjoy
 
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 97 Noida Escorts >༒8448380779 Escort Service
 
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort ServiceBDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
BDSM⚡Call Girls in Sector 93 Noida Escorts >༒8448380779 Escort Service
 
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
Night 7k Call Girls Noida Sector 128 Call Me: 8448380779
 
If this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New NigeriaIf this Giant Must Walk: A Manifesto for a New Nigeria
If this Giant Must Walk: A Manifesto for a New Nigeria
 
Microsoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AIMicrosoft Copilot AI for Everyone - created by AI
Microsoft Copilot AI for Everyone - created by AI
 
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara ServicesVVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
VVIP Call Girls Nalasopara : 9892124323, Call Girls in Nalasopara Services
 
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, YardstickSaaStr Workshop Wednesday w/ Lucas Price, Yardstick
SaaStr Workshop Wednesday w/ Lucas Price, Yardstick
 

Machine Translation Quality - Are We There Yet? - Olga Beregovaya (Welocalize)

  • 1. MT Quality – LSP Perspective Olga Beregovaya VP, Technology Solutions
  • 2. In Translators’ Own Words
    • What do translators appreciate?
    • What do translators struggle most with?
    • Engineering – impact on quality?
    • Final output quality?
  • 3. THE POST-EDITOR PRODUCES: Publishable quality
    The post-editor is responsible for ensuring that the client's quality requirements and style guide are met.
    The post-editor is expected to adhere to the client's style guide preferences with regard to:
    • Infinitive / Imperative
    • Passive / Active
    • Formal / Informal
    • Different Styles for Headers, Lists, Tables
    • Special Handling of UI Options (Bilingual, English, Target?)
    • Converting All the Measurements Based On the Local Conventions
    + Disambiguate Terminology
    + Correct all the grammatical errors
    But does the machine produce sufficient output?
  • 4. THE POST-EDITOR RECEIVES:
    CATEGORY                        GERMAN  FRENCH  JAPANESE  RUSSIAN  CHINESE  SPANISH  ITALIAN  BRAZILIAN
    WRONG TERMINOLOGY                 6.46    4.93     13.63     5.00     6.20     9.63     3.78       1.13
    WRONG SPELLING                    2.00    0.86      0.88     0.13     0.30     1.13     0.56       1.27
    SOURCE NOT TRANSLATED             6.38    5.36      3.88     5.13     3.60     2.50     1.22       1.73
    COMPLIANCE WITH CLIENT SPECS      2.46    0.86      3.00     2.13     0.70     0.63     0.44       2.60
    LITERAL TRANSLATION               7.85    8.64      5.00     4.00     9.40     5.38     7.67       7.93
    TEXT/INFO ADDED                   2.69    1.36      2.13     1.25     0.80     1.88     0.44       0.80
    CAPITALIZATION                    2.69    3.43      0.00     2.63     0.50     1.75     3.33       2.60
    WRONG WORD FORM                   6.77    7.79      0.13     9.88     0.60     6.75     3.67       6.75
    WRONG PART OF SPEECH              2.62    3.21      2.00     1.88     0.60     2.13     3.67       1.33
    PUNCTUATION                       4.46    3.00      0.75     3.38     4.10     2.13     1.22       3.53
    SENTENCE STRUCTURE               12.54   10.00     14.25     8.00    13.00     5.38     6.11       3.67
    TAGS + MARK-UP                    1.23    0.14      0.13     0.50     0.20     0.38     0.44       0.20
    LOCALE ADAPTATION                 0.46    0.29      0.75     0.63     0.20     0.75     0.44       0.13
    SPACING                           0.92    0.36      2.25     1.25     4.00     0.50     0.33       0.40
    OTHER                             1.92    1.50      1.88     0.13     0.50     0.13     1.44       0.27
    TOTAL ERRORS                     61.46   51.71     50.63    45.88    44.70    41.00    34.78      32.53
  • 5. Most time-consuming issues that translators need to fix are:
    • Sentence structure (word order)
    • MT output too literal
    • Wrong terminology
    • Word form disagreements
    • Source term left untranslated
    OR, IN A NUTSHELL…
  • 6. The Translator Gains…
    Productivity gains range from 56% to negative, depending on:
    - Content type
    - Engine output quality
    - How fast human translation (HT) is (and how much MT helps)
    - Correlation?
  • 7. TOP 6 ON THE TRANSLATORS’ LOVE IT-LIST
    1. Source of inspiration: reduces thinking and translation choice time
    2. Provides reference: very useful to translators new to a specific domain
    3. Reduces typing and lookup time by handling repetitive terminology and structures well
    4. …thereby takes away the more monotonous efforts of translation
    5. Post-editors notice improvements over time; they appreciate it more if they ‘co-own’ the engine
    6. MT output can be funny. LOL!
  • 8. TOP 3 ON THE TRANSLATORS’ S*#!T-LIST
    1. Wrong sentence structure
       • Major impact on the post-editing effort (Spanish and Portuguese produce fewest errors)
       • Japanese has the highest error rate and the lowest productivity gains (supported by the cognitive effort error ranking research)
    2. Wrong and inconsistent terminology
       • Very time-consuming to check and fix terminology; + enough issues from Fuzzy Matches already
       • A major problem for new products where the terminology is not settled yet
       • Inconsistent output for UI references
    3. Correct MT to an agreed standard (=quality expectations)
       • A challenging concept in the beginning for post-editors – they think they should edit less if the quality is bad
  • 9. FEEDBACK LOOP – Essential!
    Columns: SOURCE TEXT / MT OUTPUT / POST-EDITED OUTPUT / SPECIFIC ERRORS/CHANGES MADE

    Source text: Single-phase options range from 1.4kW to 7.7kW while three-phase PDUs, packed with output receptacles, range from 8.6kW to 21.6kW.
    MT output: Single-fase 7.7kW Opties variëren van 1.4kW om en driefasige PDU's, boordevol Output-aansluitingen, variëren van 8,6 kW tot 21.6kW.
    Post-edited output: 1,4 kW ... 7,7 kW ... 21,6 kW
    Specific errors/changes made: Numbers and measurement units are not converted properly and no spaces inserted by MT engine (3 out of 4 occurrences, 1 is correct however, strange...

    Source text: Single-phase options range from 1.4kW to 7.7kW while three-phase PDUs, packed with output receptacles, range from 8.6kW to 21.6kW.
    MT output: • Biedt maximaal 24 TB <fmt id="1" tooltip="SUPERSCRIPT" endtooltip="SUPERSCRIPT"> 2 </fmt> maximale capaciteit per- uitbreidingsbehuizing toe te voegen.
    Post-edited output: • Biedt een maximale capaciteit van 24 TB<fmt id="1" tooltip="SUPERSCRIPT" endtooltip="SUPERSCRIPT">2</fmt> per uitbreidingsbehuizing.
    Specific errors/changes made: No space should be inserted in front of and behind a number in superscript (in this case a "2"). ...>2<... and not: > 2 <

    Source text: <fmt id="1" tooltip="b" endtooltip="b">Interface Speed:</fmt> 6 Gb/s SAS
    MT output: <fmt id="1" tooltip="b" endtooltip="b"> Interfacesnelheid: 6 </fmt> Gb/s SAS
    Post-edited output: • Biedt een maximale capaciteit van 24 TB<fmt id="1" tooltip="SUPERSCRIPT" endtooltip="SUPERSCRIPT">2</fmt> per uitbreidingsbehuizing.
    Specific errors/changes made: The number is inserted before the tag and should be after the tag

    Source text: <fmt id="1" tooltip="b" endtooltip="b">Intermixed Drive Capacities:</fmt> Yes
    MT output: <fmt id="1" tooltip="b" endtooltip="b"> Intermixed Capaciteit van de schijven: Ja </fmt>
    Post-edited output: ...</fmt> Ja
    Specific errors/changes made: The string is inserted before the tag and should be after the tag (and again spacing before and after tags inserted)

    Source text: A new feature — DR Rapid Data Access — adds tighter integration with backup software applications, starting with Symantec OpenStorage-enabled backup applications.
    MT output: Een nieuwe functie - DR-Rapid Data Access - voegt strakkere integratie met back-uptoepassingen, beginnend met Symantec OpenStorage geschikte back-uptoepassingen.
    Post-edited output: ... — DR Rapid Data Access — ...
    Specific errors/changes made: Please ensure any special characters like — (ChrW(151)) are preserved when inserting a TM proposal, and not replaced by a normal hyphen (ChrW(45)).

    Can these errors be learned and corrected automatically? Can we simplify or omit the “feedback loop”?
  • 10. POST-EDITING QUALITY RESULTS
    No fails on one of our 28-language PE programs, thanks to correct terminology choices and few, consistent errors.
  • 11. LOCALIZATION TAG PLACEMENT
    This is what a plain-text engine will do:
    Source: To become verified and lift your sending limit, please confirm your email address, then add a credit or prepaid card to your account and {30} {31} {32} {33} {34} {35}confirm{36} {37} {38} it.{39}.
    MT output: {30}Para hacerse verificado y levantar su límite de envío, por favor confirme su dirección de correo electrónico, luego añada un crédito o tarjeta de prepago a su cuenta de y confírmelo.{31}{32}{33}{34}{35}{36}{37}{38}{39}
  • 12. AND THIS IS WHAT’S NEEDED
    Source: This is a<ph id="1" x="&lt;b&gt;">{1}</ph>test<ph id="1" x="&lt;/b&gt;">{2}</ph>
    Target: Dies ist ein <ph id="1" x="&lt;b&gt;">{1}</ph>Test<ph id="1" x="&lt;/b&gt;">{2}</ph>.
  • 13. Translator Wishlist
    • More transparency in workings of engine and training
    • Faster systems, shorter turnaround on large systems
    • More “wizards” for training and deployment
    • Easier testing methodologies without full deployments
    • More standardized scoring and comparison metrics
    • Predictive analysis of quality – confidence and utility scores
    • Normalization integrated into workflow and standardized
    • Industry-wide proper name and title library
    • Better transliteration standards
    • Morphologically aware terminology choices
    • More research on post-editing environments
      1. How to display source/target
      2. How to display multiple suggestions
      3. Autocomplete
      4. Better ways to calculate the productivity improvements with post-editing
    • More interoperability, so translators can stay in CAT tool they prefer
    • Simplified workflows connecting MT engines and other tools
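
The feedback-loop slide asks whether errors such as unconverted measurements, replaced special characters and misplaced tags can be corrected automatically. As a rough illustration only, some of them could be caught by simple rule-based checks run on the MT output before it reaches the post-editor. The sketch below is not from the presentation: the function names, the unit list (kW, TB) and the placeholder pattern ({30}, {31}, ...) are assumptions chosen to match the slide examples, and the em dash is assumed to be U+2014, the Unicode equivalent of the ChrW(151) mentioned on the slide.

```python
import re

# Illustrative QA checks only. All names and rules here are assumptions,
# not part of the original presentation or of any MT product.

# A decimal number followed (with or without a space) by a unit, e.g. "1.4kW".
NUM_UNIT = re.compile(r"(\d+)[.,](\d+)\s*(kW|TB)")
# A numbered placeholder such as {30}.
PLACEHOLDER = re.compile(r"\{\d+\}")
# A placeholder followed only by whitespace and then another placeholder.
ADJACENT = re.compile(r"(\{\d+\})\s*(?=(\{\d+\}))")
EM_DASH = "\u2014"


def normalize_dutch_measurements(text: str) -> str:
    """Rewrite e.g. '1.4kW' as '1,4 kW': decimal comma, space before the unit."""
    return NUM_UNIT.sub(r"\1,\2 \3", text)


def dashes_preserved(source: str, target: str) -> bool:
    """True if the target contains as many em dashes as the source."""
    return source.count(EM_DASH) == target.count(EM_DASH)


def same_placeholders(source: str, target: str) -> bool:
    """True if no placeholder was lost or duplicated (position is ignored)."""
    return sorted(PLACEHOLDER.findall(source)) == sorted(PLACEHOLDER.findall(target))


def adjacent_pairs(text: str) -> set:
    """Pairs of placeholders that have nothing but whitespace between them."""
    return {(m.group(1), m.group(2)) for m in ADJACENT.finditer(text)}


def bunched_placeholders(source: str, target: str) -> set:
    """Pairs that wrap or separate text in the source but sit together in the
    target, which is what happens when tags are dumped at the segment end."""
    return adjacent_pairs(target) - adjacent_pairs(source)


if __name__ == "__main__":
    print(normalize_dutch_measurements("variëren van 1.4kW tot 21.6kW"))
    src = "Please {35}confirm{36} it.{39}"
    mt = "Por favor confírmelo.{35}{36}{39}"
    print(same_placeholders(src, mt))
    print(sorted(bunched_placeholders(src, mt)))
```

A count-only check like `same_placeholders` would pass the plain-text engine output on the tag placement slide, since every tag is still present; the adjacency comparison is one cheap heuristic for flagging that the tags no longer surround the text they belong to. Rules of this kind only cover mechanical errors, so they would complement the feedback loop, not replace it.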