SlideShare ist ein Scribd-Unternehmen logo
1 von 14
Beyond MT?
A few premature reflections on the
use of AI in translation
TAUS Global Content Summit Amsterdam, 6 March 2019
Dieter Rummel, EC, Directorate General for Translation
2
Directorate General
for Translation
Main document types
2015
38
16%
14%
6%
1% 11%
2%
2%
5%
2% 3%
1 EU law, including the legislative process
2 Guardian of the Treaties/Implementation of EU law
3 Correspondence
4 Political documents
5 Relations with other EU institutions
6 Communication, web, media, publications
7 Budget, budgetary procedure
8 Documents linked to international organisations and non-EU countries
9 Notices for publication in OJ
10 Commission working or internal documents
11 Other3
Evolution 2012-2018 : Number of translated pages and number of DGT staff
2200
2250
2300
2350
2400
2450
2500
2550
2600
0
500,000
1,000,000
1,500,000
2,000,000
2,500,000
2012 2013 2014 2015 2016 2017 2018
Pages
Staff
Context
Long-standing use of language technology + CAT tools
"More (better) with less"
More complexity, new formats, new ways of working
Stronger recourse to outsourcing
Shift from documents to content
Machine Translation as integral part of the resource mix
EC
Systran/ECMT
Rule-based MT
Ca. 1976 to 2010
MT@EC
Statistical MT
Moses Decoder
2013 - 2018
eTranslation
Neural MT
Connecting Europe
Facility (CEF)
From 2018
Machine translation at DGT
eTranslation use in DGT (up to Q3/2018)
Origin of translated segments
Buzz kill – or why I hate “AI”
• Beware of the images
• Neural MT vs. Recursive hetero-associative memories for translation
• Artificial intelligence is not about intelligence
• Neural networks have little to do with actual neurons
• Big data + neurons + deep learning + magic = Amazing stuff
happens!
• Do we really have big(-ish) data?
• Believe the hype - but in moderation
• Technology is not a solution
• Poor processes don’t get better through AI
• Doing the same and expecting different results = insanity
So, this had to be said.
But it’s pretty cool anyway.
• The technology has become accessible.
• “Big data” discussions have shown the possibilities of correlating
data from different sources.
• New ways of transforming data into usable information?
Describe
What is
happening?
Diagnose
Why did it
happen?
Predict
What will
happen?
Decide
What
should I
do?
Big data? - Big Questions!
What we translate
• What is the
document/content about?
• Is the document difficult, i.e.
demanding or complex?
• Are we working on
something similar?
• Do we have reliable
resources for this
document?
• How well will MT work for
this document?
Organising work
• How should this content be
best translated?
• Who is most suitable to
translate/revise the
document?
• How should the content be
split between several
translators (=meaningful
clustering)?
• What is our capacity to
translate?
• Are there meaningful
alternatives to the existing
forecasting model?
External service
providers
• How good is the contractor’s
work?
• How confident are we that
they will deliver good
quality?
• How reliable are they?
• Can we correlate
freelancer/agency, history of
evaluations, domain,
document type, document
complexity to calculate a
“reliability indicator” that
could support outsourcing
decisions?
More Big Questions!
Quality
• How good is a given translation?
• How good are our language
resources?
• Can we automatically detect
technically and linguistically poor
or suspect?
• How can we learn from mistakes?
Customers
• What are the common issues in
source documents?
• What do they have in common?
• Do we have the linguistic
resources to handle their
documents?
• What are their request patterns?
What next?
•Multi-disciplinary
•Explore use cases and
questions
•Break silos
•Validate or reject ideas
and assumptions in a
cost-effective way
•Training (also for
managers!)
•Learn what we do not
know
•Develop skills
•Translation memories
•Terminology
•XLIFF
•“Bad data”
•Missing data
Think about
Data
Create
understanding
and capacity
Incubate!Experiment
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflections on the use of AI in translation. By Dieter Rummel (Head of Informatics, DGT European Commission)

Weitere ähnliche Inhalte

Ähnlich wie TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflections on the use of AI in translation. By Dieter Rummel (Head of Informatics, DGT European Commission)

TAUS 2.0 and the Game Changers in Localization (Jaap van der Meer, director o...
TAUS 2.0 and the Game Changers in Localization (Jaap van der Meer, director o...TAUS 2.0 and the Game Changers in Localization (Jaap van der Meer, director o...
TAUS 2.0 and the Game Changers in Localization (Jaap van der Meer, director o...TAUS - The Language Data Network
 
The TAUS Translation Data Landscape Report, by Jaap van der Meer, TAUS
The TAUS Translation Data Landscape Report, by Jaap van der Meer, TAUSThe TAUS Translation Data Landscape Report, by Jaap van der Meer, TAUS
The TAUS Translation Data Landscape Report, by Jaap van der Meer, TAUSTAUS - The Language Data Network
 
The web of data: how are we doing so far?
The web of data: how are we doing so far?The web of data: how are we doing so far?
The web of data: how are we doing so far?Elena Simperl
 
Data Modeling for communication
Data Modeling for communicationData Modeling for communication
Data Modeling for communicationRichard Freggi
 
XML Drafting Discussion - PCC IT Conference 2013
XML Drafting Discussion - PCC IT Conference 2013XML Drafting Discussion - PCC IT Conference 2013
XML Drafting Discussion - PCC IT Conference 2013Gareth Oakes
 
Translation_integration_into_the_documentation_process_en
Translation_integration_into_the_documentation_process_enTranslation_integration_into_the_documentation_process_en
Translation_integration_into_the_documentation_process_enVyacheslav Guzovsky
 
elgendy2014.pdf
elgendy2014.pdfelgendy2014.pdf
elgendy2014.pdfAkuhuruf
 
Language First Protocol from QSi
Language First Protocol from QSiLanguage First Protocol from QSi
Language First Protocol from QSiJohn O'Gorman
 
Mapping the content ecosystem
Mapping the content ecosystemMapping the content ecosystem
Mapping the content ecosystemRob Hanna, ECMs
 
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...IMPACT Centre of Competence
 
GATE: a text analysis tool for social media
GATE: a text analysis tool for social mediaGATE: a text analysis tool for social media
GATE: a text analysis tool for social mediaDiana Maynard
 
VisibleThread Users Conference 2018 - Welcome
VisibleThread Users Conference 2018 - WelcomeVisibleThread Users Conference 2018 - Welcome
VisibleThread Users Conference 2018 - WelcomeVisibleThread
 
Open Source & Open Data Session report from imaGIne 2014 Conference
Open Source & Open Data Session report from imaGIne 2014 ConferenceOpen Source & Open Data Session report from imaGIne 2014 Conference
Open Source & Open Data Session report from imaGIne 2014 ConferenceGSDI Association
 
Making Inter-operability Visible
Making Inter-operability VisibleMaking Inter-operability Visible
Making Inter-operability Visibleliddy
 

Ähnlich wie TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflections on the use of AI in translation. By Dieter Rummel (Head of Informatics, DGT European Commission) (20)

TAUS 2.0 and the Game Changers in Localization (Jaap van der Meer, director o...
TAUS 2.0 and the Game Changers in Localization (Jaap van der Meer, director o...TAUS 2.0 and the Game Changers in Localization (Jaap van der Meer, director o...
TAUS 2.0 and the Game Changers in Localization (Jaap van der Meer, director o...
 
The TAUS Translation Data Landscape Report, by Jaap van der Meer, TAUS
The TAUS Translation Data Landscape Report, by Jaap van der Meer, TAUSThe TAUS Translation Data Landscape Report, by Jaap van der Meer, TAUS
The TAUS Translation Data Landscape Report, by Jaap van der Meer, TAUS
 
Monetize Big Data
Monetize Big DataMonetize Big Data
Monetize Big Data
 
TAUS New Year's Reception 2014
TAUS New Year's Reception 2014TAUS New Year's Reception 2014
TAUS New Year's Reception 2014
 
Sample
Sample Sample
Sample
 
The web of data: how are we doing so far?
The web of data: how are we doing so far?The web of data: how are we doing so far?
The web of data: how are we doing so far?
 
Ima g ine2014_8c1report
Ima g ine2014_8c1reportIma g ine2014_8c1report
Ima g ine2014_8c1report
 
Data Modeling for communication
Data Modeling for communicationData Modeling for communication
Data Modeling for communication
 
Gift presentation
Gift presentationGift presentation
Gift presentation
 
XML Drafting Discussion - PCC IT Conference 2013
XML Drafting Discussion - PCC IT Conference 2013XML Drafting Discussion - PCC IT Conference 2013
XML Drafting Discussion - PCC IT Conference 2013
 
Translation_integration_into_the_documentation_process_en
Translation_integration_into_the_documentation_process_enTranslation_integration_into_the_documentation_process_en
Translation_integration_into_the_documentation_process_en
 
elgendy2014.pdf
elgendy2014.pdfelgendy2014.pdf
elgendy2014.pdf
 
Language First Protocol from QSi
Language First Protocol from QSiLanguage First Protocol from QSi
Language First Protocol from QSi
 
Mapping the content ecosystem
Mapping the content ecosystemMapping the content ecosystem
Mapping the content ecosystem
 
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
1. 'Interoperability. A quick chat, a few war stories'. Carl Wilson, Open Pla...
 
Martinez treasury 4 11
Martinez treasury 4 11Martinez treasury 4 11
Martinez treasury 4 11
 
GATE: a text analysis tool for social media
GATE: a text analysis tool for social mediaGATE: a text analysis tool for social media
GATE: a text analysis tool for social media
 
VisibleThread Users Conference 2018 - Welcome
VisibleThread Users Conference 2018 - WelcomeVisibleThread Users Conference 2018 - Welcome
VisibleThread Users Conference 2018 - Welcome
 
Open Source & Open Data Session report from imaGIne 2014 Conference
Open Source & Open Data Session report from imaGIne 2014 ConferenceOpen Source & Open Data Session report from imaGIne 2014 Conference
Open Source & Open Data Session report from imaGIne 2014 Conference
 
Making Inter-operability Visible
Making Inter-operability VisibleMaking Inter-operability Visible
Making Inter-operability Visible
 

Mehr von TAUS - The Language Data Network

TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS - The Language Data Network
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...TAUS - The Language Data Network
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)TAUS - The Language Data Network
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...TAUS - The Language Data Network
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...TAUS - The Language Data Network
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...TAUS - The Language Data Network
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...TAUS - The Language Data Network
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...TAUS - The Language Data Network
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...TAUS - The Language Data Network
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)TAUS - The Language Data Network
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...TAUS - The Language Data Network
 
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)TAUS - The Language Data Network
 
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...Traditional Models of Translation Outsourcing Seem Well-Established and Sound...
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...TAUS - The Language Data Network
 

Mehr von TAUS - The Language Data Network (20)

TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
 
Farmer Lv (TrueTran)
Farmer Lv (TrueTran)Farmer Lv (TrueTran)
Farmer Lv (TrueTran)
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 
Translation Technology Showcase in Shenzhen
Translation Technology Showcase in ShenzhenTranslation Technology Showcase in Shenzhen
Translation Technology Showcase in Shenzhen
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
 
SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)
 
How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 
QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)
 
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
 
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...Traditional Models of Translation Outsourcing Seem Well-Established and Sound...
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...
 

Kürzlich hochgeladen

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...apidays
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...apidays
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityWSO2
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Zilliz
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfsudhanshuwaghmare1
 

Kürzlich hochgeladen (20)

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflections on the use of AI in translation. By Dieter Rummel (Head of Informatics, DGT European Commission)

  • 1. Beyond MT? A few premature reflections on the use of AI in translation TAUS Global Content Summit Amsterdam, 6 March 2019 Dieter Rummel, EC, Directorate General for Translation
  • 3. Main document types 2015 38 16% 14% 6% 1% 11% 2% 2% 5% 2% 3% 1 EU law, including the legislative process 2 Guardian of the Treaties/Implementation of EU law 3 Correspondence 4 Political documents 5 Relations with other EU institutions 6 Communication, web, media, publications 7 Budget, budgetary procedure 8 Documents linked to international organisations and non-EU countries 9 Notices for publication in OJ 10 Commission working or internal documents 11 Other3
  • 4. Evolution 2012-2018 : Number of translated pages and number of DGT staff 2200 2250 2300 2350 2400 2450 2500 2550 2600 0 500,000 1,000,000 1,500,000 2,000,000 2,500,000 2012 2013 2014 2015 2016 2017 2018 Pages Staff
  • 5. Context Long-standing use of language technology + CAT tools "More (better) with less" More complexity, new formats, new ways of working Stronger recourse to outsourcing Shift from documents to content Machine Translation as integral part of the resource mix
  • 6. EC Systran/ECMT Rule-based MT Ca. 1976 to 2010 MT@EC Statistical MT Moses Decoder 2013 - 2018 eTranslation Neural MT Connecting Europe Facility (CEF) From 2018 Machine translation at DGT
  • 7. eTranslation use in DGT (up to Q3/2018)
  • 9. Buzz kill – or why I hate “AI” • Beware of the images • Neural MT vs. Recursive hetero-associative memories for translation • Artificial intelligence is not about intelligence • Neural networks have little to do with actual neurons • Big data + neurons + deep learning + magic = Amazing stuff happens! • Do we really have big(-ish) data? • Believe the hype - but in moderation • Technology is not a solution • Poor processes don’t get better through AI • Doing the same and expecting different results = insanity
  • 10. So, this had to be said. But it’s pretty cool anyway. • The technology has become accessible. • “Big data” discussions have shown the possibilities of correlating data from different sources. • New ways of transforming data into usable information? Describe What is happening? Diagnose Why did it happen? Predict What will happen? Decide What should I do?
  • 11. Big data? - Big Questions! What we translate • What is the document/content about? • Is the document difficult, i.e. demanding or complex? • Are we working on something similar? • Do we have reliable resources for this document? • How well will MT work for this document? Organising work • How should this content be best translated? • Who is most suitable to translate/revise the document? • How should the content be split between several translators (=meaningful clustering)? • What is our capacity to translate? • Are there meaningful alternatives to the existing forecasting model? External service providers • How good is the contractor’s work? • How confident are we that they will deliver good quality? • How reliable are they? • Can we correlate freelancer/agency, history of evaluations, domain, document type, document complexity to calculate a “reliability indicator” that could support outsourcing decisions?
  • 12. More Big Questions! Quality • How good is a given translation? • How good are our language resources? • Can we automatically detect technically and linguistically poor or suspect? • How can we learn from mistakes? Customers • What are the common issues in source documents? • What do they have in common? • Do we have the linguistic resources to handle their documents? • What are their request patterns?
  • 13. What next? •Multi-disciplinary •Explore use cases and questions •Break silos •Validate or reject ideas and assumptions in a cost-effective way •Training (also for managers!) •Learn what we do not know •Develop skills •Translation memories •Terminology •XLIFF •“Bad data” •Missing data Think about Data Create understanding and capacity Incubate!Experiment