4. The many uses of machine translation technology
Let Google and Microsoft run with it, or invest in your own translation technology? Why build your own machine translation system when Google and Microsoft offer such capable machine translation services?
Machine translation technology is a force multiplier and a catalyst for innovation. Learn more about how effective use of MT opens many new services and markets.
Panelists: Stéphane Domisse (John Deere), Olga Beregovaya (Welocalize), Diego Bartolome (tauyou), Dragos Munteanu (SDL), Tony O’Dowd (KantanMT), Sanna Piha (Moravia), Irene O'Riordan (Microsoft)
Irene O'Riordan (Microsoft) at the Industry Leaders Forum 2015
2. Raw-MT goals (Support.Office.com)
Raw-MT article published within 24 hours
MT first: use MT on all articles; human-translate high-value and high-traffic content within 5-10 days
24 languages production approved
90%+ of support articles (10,000 per language, 100+ new/updated per month)
MT only for long tail, bottom 30% of traffic
Automatic translation (recycling + MT) for both text and art
Target ‘acceptable’ quality: >=50% Customer Sat (CSAT), or within 10% of Human translation CSAT
Articles below the threshold are not published
4. Quality model
• Raw-MT translation/content quality bar: ‘acceptable’
• MT needs to reach a minimum bar to be usable
• Thresholds are 2.5/4 for human survey assessment, and 50% CSAT
• Recycling helps quality and is factored into the quality metric – if MT quality alone is not sufficient, a recycling threshold will be used
• Onboarding
• 36 main SKU languages have been evaluated for MT quality
• Format: surveys with LSPs, in country and volunteer native speakers
• Special onboarding project in iCMS for limited live publishing, for eval and BI/ratings
• Continuous assessment
• BI monitoring of ratings to ensure MT quality bar continues to be met
• Upgrading to HT
• High traffic MT assets will be upgraded to HT, to ensure optimal customer experience
• Functional/non-language specific quality bar
• Similar to HT topics + disclaimer check & translation coverage
The top business goals for raw-MT usage are turnaround and long-tail coverage.
Scope:
The goal is to use MT for as many languages as possible: publishing in 39 international languages, 36 languages evaluated (2 more currently in evaluation), and 24 currently production-ready.
Expectation is 90%+ of all support articles can be machine translated
Only articles are currently in scope; other content types such as training, video, and templates will be considered in future. Marketing content is also excluded.
Automatic translation of art: working on a solution for screenshots leveraging OCR, recycling against software, and MT fallback; results so far have been promising.
Quality
Measured by user feedback ratings
Recycling is used to improve quality through reuse, and also to gate quality: for conditionally approved languages, the recycling level determines whether articles should be published.
Metrics
1. Speed: 24 h turnaround
Business goal 1: Reduce time to market: Use raw-MT to publish content quicker than human translation
Metric 1: Turnaround time: 24 hours from when an English article goes live to when a translated article goes live, for support.com
2. Scope: Traffic & volume – 70% HT page-views
Business goal 2: Focus human translation on high value and high traffic content, use MT for long tail and to deliver more volume and added value.
Metric 2: Traffic: The top 70% of page views for each language where MT is enabled should hit human-translated content; the bottom 30% is MTd (assumes an 80/20 traffic distribution, where a small amount of content generates most traffic)
Metric 3: Volume: MT should be used as much as possible, subject to availability, quality, and acceptability constraints.
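The 70/30 traffic split above amounts to a simple partitioning rule: rank articles by page views and human-translate from the top until 70% of traffic is covered. A minimal sketch of that rule in Python (function name, input shape, and tie-breaking are assumptions, not Microsoft's actual selection logic):

```python
def split_ht_mt(page_views, ht_share=0.70):
    """Partition articles so the top ~70% of page views hit human translation.

    page_views: dict mapping article_id -> page-view count (hypothetical shape).
    Returns (ht_articles, mt_articles): the head of the traffic distribution
    goes to human translation, the long tail to machine translation.
    """
    total = sum(page_views.values())
    ht, mt = [], []
    covered = 0
    # Walk articles from most- to least-viewed, filling the HT bucket
    # until the covered traffic reaches the target share.
    for article, views in sorted(page_views.items(), key=lambda kv: -kv[1]):
        if covered < ht_share * total:
            ht.append(article)
            covered += views
        else:
            mt.append(article)
    return ht, mt
```

With the assumed 80/20-style distribution, a single very popular article can already cover the 70% target, leaving everything else to MT.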
3. Quality: acceptable
Business goal 3: Quality & Customer experience: published MT content, needs to be understandable, of acceptable quality, and not detract from overall customer satisfaction
Metric 4: BI ratings: Customer Satisfaction rating should be >=50%, or within 10% of HT or English
Metric 5: Human evaluation: average score 2.5 or above.
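The two quality metrics combine into a single publish gate: CSAT of at least 50% (or close to the human-translation baseline) plus a human-evaluation average of at least 2.5 on the 4-point scale. A sketch of that gate, assuming "within 10%" means 10 percentage points and with hypothetical names throughout:

```python
def passes_quality_bar(mt_csat, ht_csat, human_eval_avg):
    """Decide whether a raw-MT article meets the stated publish thresholds.

    mt_csat / ht_csat: customer-satisfaction ratios in [0, 1]; ht_csat may be
    None when no human-translation baseline exists.
    human_eval_avg: average survey score on the 4-point scale.
    Assumption: the "within 10% of HT" rule is read as 10 percentage points.
    """
    csat_ok = mt_csat >= 0.50 or (
        ht_csat is not None and mt_csat >= ht_csat - 0.10
    )
    eval_ok = human_eval_avg >= 2.5
    return csat_ok and eval_ok
```

Articles failing the gate would stay unpublished, matching the rule that content below the threshold is not released.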
Disclaimer at top and bottom.
Top disclaimer contains link to US topic
No bilingual view, planned for future
MT quality evaluation for Office.support.com
Human evaluation on 4 point scale, standard survey format.
Per language: 10 articles, 5 reviewers from LSP + additional in market /native speakers for tier 1 and 3.
5 articles with no recycling, 5 articles with 50% recycling.
Result based on the main question: "Was this topic helpful?"
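The onboarding verdict described above reduces to averaging the per-article, per-reviewer ratings and comparing against the 2.5 bar on the 4-point scale. A minimal sketch (function and variable names are hypothetical):

```python
def language_onboarding_result(scores, threshold=2.5):
    """Average survey ratings and compare against the onboarding bar.

    scores: flat list of ratings on the 4-point scale, one per
    (article, reviewer) pair - e.g. 10 articles x 5 reviewers = 50 ratings.
    Returns (average, passed). A sketch of the described evaluation,
    not the actual survey-scoring pipeline.
    """
    avg = sum(scores) / len(scores)
    return avg, avg >= threshold
```

Separating the 5 no-recycling articles from the 5 with 50% recycling would then show how much recycling lifts the average for a given language.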