SlideShare ist ein Scribd-Unternehmen logo
1 von 74
TAUS Moses Roundtable
Welcome and Aims
Rahzeb Choudhury
TAUS
11-Sep-2013
Prague, Czech Republic
This slide may not be used or copied without permission from TAUS
Moses Users – Finding Common Ground
Are there areas where Moses users (from industry) can
cooperate? (beyond what is already done as part of
MosesCore)
AREA COOPERATION
Knowledge Sharing Yes
Sharing Investment ?
Sharing Code ?
This slide may not be used or copied without permission from TAUS
Open/Proprietary
This slide may not be used or copied without permission from TAUS
Agenda
14:00/ Welcome
14:10/ Introductions
14:30/ Results Moses Survey
15:00/ Moses Roadmap
15:30 / Discussion on Areas for Cooperation
16:00 / BREAK
16:30/ Review/Prioritize Areas for Cooperation
17:15/ Wrap Up and Adjourn
TAUS Moses Roundtable
Introductions
11-Sep-2013
Prague, Czech Republic
This slide may not be used or copied without permission from TAUS
Introductions
o Konstantinos Chatzitheodorou, Alpha CRC
o Natalia Kljueva, Charles University
o Shadi Salen, Charles University
o Milan Condak, Condak.net s.r.o.
o Bonnie Dorr, DARPA
o Zdena Závůrková, IBM
o Anabela Barreiro, INESC-ID
o Adam LopezJohns Hopkins University
o Christian Buck, Lantis
o Michal Kašpar, Lingea s.r.o.
o Jacek Skarbek, LocStar
o Tomas Fulatak, Moravia
o Niko Papula, Multilizer
This slide may not be used or copied without permission from TAUS
Introductions
o Daniel Rosàs, Pactera
o Francis Tyers, Prompsit
o WonYoung Seo, Samsung Electronics
o SeungWook Lee, Samsung Electronics
o Falko Schaefer, SAP AG
o Alexander Semerenko, Seznam.cz, a.s.
o Jie Jiang, Capita T&I
o Martin Baumgärtner, STAR Langauge Technology & Solutions GmbH
o Ronald Horsselenberg, TransIT BV
o Ulrich Germann, University of Edinburgh
o Varvara Logacheva, USFD
o Alex Yanishevsky, Welocalize
o Andrzej Zydroo, XTM-INTL
This slide may not be used or copied without permission from TAUS
Introductions
Organisers
o Ondrej Bojar, Charles University
o Philipp Koehn, University of Edinburgh
o Barry Haddow, University of Edinburgh
o Hieu Hoang, University of Edinburgh
o Achim Ruopp, TAUS
o Rahzeb Choudhury, TAUS
TAUS Moses Roundtable
MT @ Alpha CRC
CHATZITHEODOROU Konstantinos
Alpha CRC
11-Sep-2013
Prague, Czech Republic
This slide may not be used or copied without permission from TAUS
MT @ Alpha CRC
o Working with MT since 2006
o Hybrid phrase-based MT system
o Post-editing cost evaluation
 Have developed Reverse Analysis, a methodology to
evaluate the post editing effort on the basis of how much
MT output was edited
This slide may not be used or copied without permission from TAUS
Alpha MT flow
Selection of
training data
Training
Moses, SRILM, MGIZA,
MERT …
Translation Post-editing
Re-training
Rules Insertion
POS, Syntactic, Morpholo
gy
Terminology
optional
mandatory
TAUS Moses Roundtable
<My MTs and my CATs>
<Milan Čondák>
<Condak.net s.r.o. Petřvald>
11-Sep-2013
Prague, Czech Republic
This slide may not be used or copied without permission from TAUS
< My MTs and my CATs >
<PC Translator>
o My first MT was Czech program PC Translator. This
SW run in Windows, one language is foreign language
and second language is Czech. PC Translator has the
bidirectoral indexes and can translate in both
directions.
o There was 3 main modules: a Dictionary, an Editor
and a Dictionary Manager.
o PC Translator worked in two modes: translating of an
entire file or translating of text in Editor.
o By text translating was visible a terminology of
opened sentence.
This slide may not be used or copied without permission from TAUS
Wordfast Classic in MS Word
o Wordfast Classic (WFC) have been offering integration
MT which works in MS Word
o So I asked a developer of PC Translator to create API
for MS Word. He created three APIs: for MS Word, for
MS Outlook and MS IE. Later he added APIs for more
email clients and web brousers.
o WFC begun to use new feature, a Companion. In new
window is displayed terminology of opened segment
which is found in Wordfast glossary.
This slide may not be used or copied without permission from TAUS
Wordfast Classic in MS Word
This slide may not be used or copied without permission from TAUS
Web translation services in my MT and CATs
o PC Translator can show
offers from Google and
Bing:
o http://www.condak.net/m
achine_t/cs/comprendo/c
s/07.html
o MetaTexis for Word 2007
+ Web MT Servers:
o http://www.condak.net/c
at_other/virtaal/2013082
1/cs/02.html
o Free Translation via
Internet works without
registration.
o Virtaal Plugins - models
for TM an MT:
o http://www.condak.net/c
at_other/virtaal/2013082
1/cs/03.html
TAUS Moses Roundtable
MT in localization company
Jacek Skarbek
LocStar
11-Sep-2013
Prague, Czech Republic
This slide may not be used or copied without permission from TAUS
Our experience with MT
o We are a software localization provider which has been
using CAT tools for 17 years – we have large TMs
o Some of our clients provide us work with MTranslated
content (both for no matches and as alternative to TM
fuzzies) using their own MT solutions – our job is to work
on MT like with fuzzy matches (no typical post-editing)
o For about 2 years we have been used MT for one of our
main customer. We buy MT content from third party that
use their own solution based on Moses and TMs provided
by customer.
o We have large TMs collected and we test our Moses
based internal solution to use it in production
environment
This slide may not be used or copied without permission from TAUS
Moses related problems/areas to improve
o Tag handling/inline markup – only partially resolved by
M4Loc solutions
o Lack of API to better/easier integrate Moses into
production workflow
o We need better terminology/software items handling. I.e
something like <zone>, but phrase in zone treated
separately from rest of sentence at the level of TM and
as a part of sentence at the level of LM
o Inflections in Slavic languages
This slide may not be used or copied without permission from TAUS
Other problems
o Weird approach to MT rate – customers tend to decrease
translation rate in the same percent as measured (or
estimated) acceleration of translation work itself, while
translation rates covers also project management and all
other linguistic and technical tasks that are not
accelerated by MT
o We are not allowed to use MT for some customers – it is
restricted by work agreement . Although we treat
MTranslated content as fuzzy matches, they afraid that it
would impact on final quality of translation
TAUS Moses Roundtable
Samsung Electronics
Cooperation and SMT
Seung-Wook Lee, Wonyoung Seo
Samsung Electronics Corporation
11-Sep-2013
Prague, Czech Republic
This slide may not be used or copied without permission from TAUS
Samsung Electronics Cooperation and Machine Translation
o Our team provide translation services for various internal
groupware applications (e.g., instance messenger) for the
department
o One of the main concerns of ours is to expand language pairs
 There are very little of bilingual corpus available for the
most of languages, such as Asian languages
 Is indirect translation the solution? how do we deal with the
error propagation?
 Working groups and developer meetings for those
languages may necessary
TAUS Moses Roundtable
Statistical Machine Translation
at SAP
Dr. Falko Schaefer
SAP Language Services
11-Sep-2013
Prague, Czech Republic
This slide may not be used or copied without permission from TAUS
SMT Project at SLS
o SAP Language Services (SLS) has successfully worked
with rule-based MT for over 20 years
o However, the growing demand for a new breed of MT
meant that SLS began to embark on Moses-based
SMT in early 2013
o The SLS MT project aims to establish a new MT
service to reduce translation throughput time and
cost
o To that end SLS works with an external partner to
support implementation and knowledge transfer
TAUS Moses Roundtable
Alex Yanishevsky
Welocalize
11-Sep-2013
Prague, Czech Republic
This slide may not be used or copied without permission from TAUS
Company Intro
o One of top 10 LSPs (language service providers)
o Department dedicated to MT and Language Tools
(evaluation of MT, productivity workbench, corpus
preparation, vendor selection, vendor training and
certification)
o MT agnostic
o MT integrated into TMS/GMS
This slide may not be used or copied without permission from TAUS
Areas of Interest
o Productization of Moses
-lower barrier of entry
-interoperability
o Integration into TMS/GMS
o Tag handling
o Predictive modeling
TAUS Moses Roundtable
Results: Moses Survey
Achim Ruopp
TAUS
11-Sep-2013
Prague, Czech Republic
This slide may not be used or copied without permission from TAUS
Demographic Composition
0% 5% 10% 15% 20% 25% 30% 35%
Consultant
Language Service Provider/Agency
Other
Research Institute
Translation Buyer
Translation Technology Provider
Translator
2013 2012 2011
This slide may not be used or copied without permission from TAUS
Ranking of Requested Moses Improvements
2013
Rank
2012
Rank
2011
Rank
1 1 3 Training and translation speed
2 4 1 Integrating Moses into existing workflow/system
(e.g. TM integration)
3 3 2 Installing and using Moses
4 Terminology Management
5 2 4 Evaluation results (e.g. evaluating productivity)
6 5 5 Language-specific issues
7 Advanced features (e.g. tree-based translation)
8 7 7 Customer support
6 6 Easier to get the right human resources
This slide may not be used or copied without permission from TAUS
#1 Requested Moses Improvement
Training and Translation Speed
o Users are aware that SMT requires a considerable
amount of computing resources
o Request driven by management and user demands
 Fast-turn-around/online translation
 Frequent re-training of systems with new/updated data
o Recommendation
 Integrate recent training speed improvements into the
training tool chain
 Document recommendations how to best use the training
speed improvements
 Further optimize performance for multi-threaded decoding
This slide may not be used or copied without permission from TAUS
#2 Requested Moses Improvement
Integrating Moses into Existing Workflows/Systems
o Integration into growing number of diverse systems
 TMS/CaT/TenT
 Content Management Systems
 Automated Speech Recognition
 Dialog Systems
 …
o Recommendation
 Comprehensive, stable and well documented APIs to the
decoder and data produced by it
 RESTful HTTP API (Google/Bing compatible?)
 Finish Okapi/M4Loc file format support
This slide may not be used or copied without permission from TAUS
#3 Requested Moses Improvement
Installing and Using Moses
o Still a range of installation experiences from “No
problem” to “very complex to understand and to
implement”
o Should Moses team provide installable packages?
o Windows support?
o UI?
o Recommendation
 Occasional stable releases of Moses as installable packages
across different platforms – consistency is key
 Take on the maintenance and release of required
components abandoned by their original developers
This slide may not be used or copied without permission from TAUS
#4 Requested Moses Improvement
Terminology Management
o Terminology injection from terminological resources
 Term bases
 Named entity recognizers
o Recommendation:
 Better documentation of the XML input feature
 Ensuring that the XML input feature minimally impacts
translation quality of the overall sentence
 Handling input with named entities marked up by named
entity recognizers
 Ensure XML input can be handled in the complete tool chain
(e.g. tokenizer)
This slide may not be used or copied without permission from TAUS
#5 Requested Moses Improvement
Evaluation
o Expansion of the metrics that can be used to tune
Moses MT systems
 Specifically for MT+post-editing scenario
o Evaluation and productivity testing systems
 Can be external
o Recommendation
 Integrate tuning metrics into Moses that allow optimizing
systems for the MT+post-editing usage scenario
 Ensure interoperability with external evaluation/productivity
testing systems, e.g. TAUS DQF, QT Launchpad
This slide may not be used or copied without permission from TAUS
#6 Requested Moses Improvement
Language-Specific Issues
o Moses is focused on a relatively small set of European
languages
o Survey participants would like to see tools for more
languages included
o Full Unicode support
o Recommendation
 Test and improve Unicode support in the language-
independent core
 Recommend and document use of additional language tools
 Encourage users to report Unicode issues and provide
language-specific data
This slide may not be used or copied without permission from TAUS
#7 Requested Moses Improvement
Advanced Features for Moses Commercialization
o Received a broad cross-section of requests
o Researchers develop cutting-edge technologies that
could benefit industry
o Too often conversations still happen in distinct
academic and industry silos
o Recommendation
 Start a conversation explaining how newly developed
methods and technologies can help the industry to address
critical MT issues, e.g.
o Tree-based/syntax-based models
o Morphologically rich languages
This slide may not be used or copied without permission from TAUS
#8 Requested Moses Improvement
Customer Support
o Moses support mailing list considered excellent
o Few requests for professional support or faster
support response times
o Recommendation
 Continue excellent support on mailing list
 Improve documentation for some industry-relevant features
to allow easier adoption
This slide may not be used or copied without permission from TAUS
Moses Open Source Project
Strengths Weaknesses
o Additions/updates to
“core” Moses:
decoder, training, LM
 Latest methods
 Benefiting all users
o Documentation and
support
o MosesCore funded
releases and tutorial
o Few contributions by
long-time industry
Moses users
 Adobe Moses Tools
 DoMY CE
o Complexity of
installation/use for
entry-level users
This slide may not be used or copied without permission from TAUS
Moses Future
Academic Project Broad Adoption
o Sharing platform for
research progress
o Unstable code base
o Complex use
o Integration by few
sophisticated technology
providers
o Similar OSS project
 HTK speech recognition
toolkit
o Ease of installation
o Ease of use for diverse
scenarios
o Pre-trained engines
o Similar OSS projects
 PostgreSQL
 NLTK (Natural Language
Toolkit)
 CMU Sphinx
Not mutually exclusive!
This slide may not be used or copied without permission from TAUS
PostgreSQL
Object-relational database management system
o Started in 1986 by Michael Stonebraker at UC
Berkeley
o Evolved from research project into universal RDBMS
o Used by Apple, BASF, Skype, Redhat, governments,
universities …
o Broad contributor base
 Often industry funded
o PostgreSQL license (similar to MIT license)
o Commercial support through EnterpriseDB
o Consulting/training available
This slide may not be used or copied without permission from TAUS
Discussion To Follow
o Discuss industry needs
o Identify areas of industry cooperation
o Discuss line between open source project and
proprietary add-ons
This slide may not be used or copied without permission from TAUS
Open/Proprietary
TAUS Moses Roundtable
Moses Roadmap
Philipp Koehn
11-Sep-2013
Prague, Czech Republic
Moses Roadmap
Philipp Koehn
11 September 2013
Philipp Koehn Roadmap 11 September 2013
1Development in Moses
• Moses is mainly developed in academia
• Academic research progress is somewhat un-predictable
• Biases
1. quality
2. scalability
3. usability
• Sometimes research use cases do not match industry use cases
(e.g., translation of news vs. technical documentation)
Philipp Koehn Roadmap 11 September 2013
2Modular Design
Cube
Pruning
Chart
Parse-
Decoding
LM
Driven
Chart
Decoding
Stack
Beam
Decoding
irstLM randLM kenLMSRILM
Decoding
Algorithm
Language
Model
Forced
Decoding
LM
Server
Text
In
Memory
XML
On
Disk
Confusion
Network
Suffix
Array
over
Corpus
Lattice
Compact
Text N-Best
Search
Graph
Input
Translation
Model
Output
CSLM
Philipp Koehn Roadmap 11 September 2013
3Progress in Models
1990 2000
word-based models
2010
phrase-based models
formal grammar-based models
linguistic grammar-based models
semantics
Philipp Koehn Roadmap 11 September 2013
4Progress in Methods
1990 2000 2010
probabilistic models
parameter tuning
large-scale
discriminative training
Philipp Koehn Roadmap 11 September 2013
5
Quality
Some examples from UEDIN systems in WMT 2013
• Better machine learning methods
• Linguistically motivated models
• More data
Philipp Koehn Roadmap 11 September 2013
6
Quality
Some examples from UEDIN systems in WMT 2013
• Better machine learning methods
operation sequence model
• Linguistically motivated models
syntax-based machine translation model
• More data
training a language model on 130 billion words
Philipp Koehn Roadmap 11 September 2013
o
1
Generate(Ich,I) Ich↓
I
o
2
GenerateTargetOnly(do) Ich↓
Ido
o
3
o
4
InsertGap
Generate(nicht,not)
Ichnicht↓
Idonot
o
5
o
6
JumpBack(1)
Generate(gehe,go)
Ichgehe↓nicht
Idonotgo
o
7
GenerateSourceOnly(ja) Ichgeheja↓nicht
Idonotgo
o
8
JumpForward Ichgehejanicht↓
Idonotgo
o
9
Generate(zum,tothe) ...gehejanichtzum↓
...notgotothe
o
10
Generate(haus,house) ...janichtzumhaus↓
...gotothehouse
7
Operation Sequence Model
5-gram sequence model over operations
(minimal phrase translations, reordering)
p(o1) p(o2|o1) p(o3|o1,o2) ... p(o10|o6,o7,o8,o9)
Feature function in Moses [Durrani et al., ACL 2013]
de-en
fr-en
es-en
cs-en
ru-en
en-de
en-fr
en-es
en-cs
en-ru
2012
+.26
+.19
+.49
+.33
+.46
+.07
+.60
+.57
+.35
+.30
2013
+.29
+.37
+.90
+.09
+.33
+.20
+.36
+.44
+.27
+.40
Philipp Koehn Roadmap 11 September 2013
8
Sie
PPER
will
VAFIN
eine
ART
Tasse
NN
Kaffee
NN
trinken
VVINF
NP
VP
S
PRO
she
VB
drink
NN
|
cup
IN
|
of
NP
PP
NN
NP
DET
|
a
VBZ
|
wants
VB
VP
VP
NPTO
|
to
NN
coffee
PRO VP
Syntax-Based Machine Translation
[Nadejde et. al, WMT2013]
➏
S
➊ ➋ ➌
➍
➎
German–English English–German
manual score
0.608
0.586
system
UEDIN-SYNTAX
UEDIN-PHRASE
manual score
0.614
0.571
system
UEDIN-SYNTAX
UEDIN-PHRASE
Philipp Koehn Roadmap 11 September 2013
9
Huge, I Say Huge!, Language Model
• Unpruned 5-gram language model trained on 130 billion words
• Training straightforward [Heafield et al., ACL 2013]
• Decoding requires 1TB RAM machine
• Best performance at WMT2013 (manual judgment)
Spanish–English French–English Czech–English
score
0.624
0.595
0.570
system
UEDIN-HEAFIELD
”ONLINE-B”
UEDIN
score
0.638
0.604
0.591
system
UEDIN-HEAFIELD
UEDIN
”ONLINE-B”
score
0.607
0.582
0.562
system
UEDIN-HEAFIELD
”ONLINE-B”
UEDIN
Philipp Koehn Roadmap 11 September 2013
10Usability
• Main uses of Moses
– productivity tool for professional translators
– gisting for information discovery
• Research driven by real-world use
–
–
–
–
incremental training
handling of tags
terminology management
quality estimation
Philipp Koehn Roadmap 11 September 2013
11
• Integration of statistical MT and collaborative translation memories
• Novel technology
– Self-tuning machine translation
– User adaptive machine translation
– Informative machine translation
• Open source workbench
• Extensive testing by translation agency
Philipp Koehn Roadmap 11 September 2013
12
• Cognitive studies of translator behaviour based on key logging and eye tracking
• Novel types of assistance to human translators
– interactive translation prediction
– interactive editing
– adaptive translation models
• Open source workbench
• Field tests by translation agency and online volunteer translation platforms
Philipp Koehn Roadmap 11 September 2013
13
• Use of machine translation for community content
• Novel technology
– Pre-editing of content
– Monolingual and bilingual post-editing
– Development of feedback loops
• Use in
– commercial product forum relating to Symantec network security products
– content in community of volunteer translators Traducteurs sans Fronti`eres
Philipp Koehn Roadmap 11 September 2013
14
The Future: Better Models
• Syntax-based and semantic statistical models
– improvements to basic tools of natural language processing
– requires annotated data resources, annotation standards
– new models, training methods, inference algorithms
• Exploitation of data — machine learning
– different types: parallel, comparable, monolingual, interactive
– scaling up of existing machine learning methods
– adaptation to user needs
• Integration with other technologies
–
–
–
–
–
–
human translation and localization workflows
speech recognition
dialog systems
information retrieval
data mining
communication systems
Philipp Koehn Roadmap 11 September 2013
15
The Future: Better Usability
• Installation
– MOSESCORE installer, pre-built binaries
– pre-installed virtual machines for Amazon EC et al.
• Resources
– ongoing efforts to make data publicly available
– memory and time efficient training and decoding
• Integration into workflows
–
–
–
–
addressing requirements of professional translators
industry-led projects on handling tags, untranslated terms, terminology
MOSESCORE ”arrows” workflow management
various server process implementation, e.g., based on Google API
Philipp Koehn Roadmap 11 September 2013
16Thank You
questions?
Philipp Koehn Roadmap 11 September 2013
TAUS Moses Roundtable
Review and Discussion of
Sharing Optons in the
Industry
Rahzeb Choudhury, Achim Ruopp
TAUS
11-Sep-2013
Prague, Czech Republic
This slide may not be used or copied without permission from TAUS
Sharing Knowledge
o TAUS Machine Translation Showcases
 Co-located with Localization World Conferences
 Familiarizing the industry with Moses/SMT
 Users share experiences
 Panel discussions
o TAUS Machine Translation and Moses Tutorial
 Online tutorial teaching theory and practice
 300+ registered users
 Developed in collaboration with UEdin
o This TAUS Moses Roundtable
This slide may not be used or copied without permission from TAUS
Sharing Code
o DoMY CE
 Prepare training corpora
 Train & tune SMT models
 Manage SMT resources
 Translate documents
o M4Loc – Moses for Localization
 Integration with popular open source Okapi localization
framework
 Adobe Moses Tools
o In Moses /contrib folder
 Moses for Mere Mortals
 Several web APIs
o Language-specific non-breaking prefix files
This slide may not be used or copied without permission from TAUS
Industry Sharing
Knowledge
Investment
Code
This slide may not be used or copied without permission from TAUS
Discussion: Ideas for Sharing
o What are common use scenarios?
 (among participants)
o MT as a productivity enhancer
 Beginner, Pilot, Implementation, Production, Ongoing rollout
o MT to gist – no participants involved in the scenario
o How do we make them easier to achieve?
This slide may not be used or copied without permission from TAUS
Beginners -Installing and Using Moses
o Still a range of installation experiences from “No
problem” to “very complex to understand and to
implement”
o Should Moses team provide installable packages?
o Windows support?
o UI?
o Recommendation
 Occasional stable releases of Moses as installable packages
across different platforms – consistency is key
 Take on the maintenance and release of required
components abandoned by their original developers
This slide may not be used or copied without permission from TAUS
Beginners - Installing and Using Moses
o Conclusions during meeting:
 The resources available (Moses site, support list, MT and
Moses Tutorial) are sufficient
 The v1 release is very welcome and look forward to future
releases
 Try to ensure these resources are more easily discoverable,
ensure documentation stays up to date, and easy to use
This slide may not be used or copied without permission from TAUS
Implementation
Integrating Moses into Existing Workflows/Systems
o Integration into growing number of diverse systems
 TMS/CaT/TenT
 Content Management Systems
 Automated Speech Recognition
 Dialog Systems
 …
o Recommendation
 Comprehensive, stable and well documented APIs to the
decoder and data produced by it
 RESTful HTTP API (Google/Bing compatible?)
 Finish Okapi/M4Loc file format support
This slide may not be used or copied without permission from TAUS
Implementation
Integrating Moses into Existing Workflows/Systems
o Conclusions during meeting:
 Main areas of cooperation (APIs and formatting) covered by
current activity
 TAUS to help with next steps for Moses4Loc (Formatting) to
help ensure there is thorough testing
This slide may not be used or copied without permission from TAUS
Production
Training and Translation Speed
o Users are aware that SMT requires a considerable
amount of computing resources
o Request driven by management and user demands
 Fast-turn-around/online translation
 Frequent re-training of systems with new/updated data
o Recommendation
 Integrate recent training speed improvements into the
training tool chain
 Document recommendations how to best use the training
speed improvements
 Further optimize performance for multi-threaded decoding
This slide may not be used or copied without permission from TAUS
Production
Training and Translation Speed
o Conclusions during meeting:
 Participants did not have any specific ideas beyond what the
MosesCore consortium members are already doing
This slide may not be used or copied without permission from TAUS
Other Issues/Ideas Raised
o Lack of Data
 The TAUS Data repository was shown as a potential source of
training data
o Interoperability
 Going forward it would be good to be able to share translation
and language models
 Participants briefly discussed the complexity of the challenge
o Shared engines
 It was suggested that baseline language/industry/domain engines
be made available
 Making the engines built as part of the TAUS Developing Talent
project available may be a good start. TAUS will look into this.
TAUS Moses Roundtable
Thank you!
Achim Ruopp (achim@taus.net)
Rahzeb Choudhury (rahzeb@taus.net )

Weitere ähnliche Inhalte

Andere mochten auch

TAUS USER CONFERENCE 2010, What’s on the horizon? The research agenda
TAUS USER CONFERENCE 2010, What’s on the horizon? The research agendaTAUS USER CONFERENCE 2010, What’s on the horizon? The research agenda
TAUS USER CONFERENCE 2010, What’s on the horizon? The research agendaTAUS - The Language Data Network
 
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...TAUS - The Language Data Network
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...TAUS - The Language Data Network
 
Antzinaroa eta erdi aroa nora taus
Antzinaroa eta erdi aroa nora tausAntzinaroa eta erdi aroa nora taus
Antzinaroa eta erdi aroa nora tausLourdes Macicior
 
TAUS webinar The Big Picture View On The Translation Industry, March 2013
TAUS webinar The Big Picture View On The Translation Industry, March 2013TAUS webinar The Big Picture View On The Translation Industry, March 2013
TAUS webinar The Big Picture View On The Translation Industry, March 2013TAUS - The Language Data Network
 
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engineTAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engineTAUS - The Language Data Network
 
The cognitive era and the future of content
The cognitive era and the future of contentThe cognitive era and the future of content
The cognitive era and the future of contentScott Abel
 

Andere mochten auch (7)

TAUS USER CONFERENCE 2010, What’s on the horizon? The research agenda
TAUS USER CONFERENCE 2010, What’s on the horizon? The research agendaTAUS USER CONFERENCE 2010, What’s on the horizon? The research agenda
TAUS USER CONFERENCE 2010, What’s on the horizon? The research agenda
 
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
 
Antzinaroa eta erdi aroa nora taus
Antzinaroa eta erdi aroa nora tausAntzinaroa eta erdi aroa nora taus
Antzinaroa eta erdi aroa nora taus
 
TAUS webinar The Big Picture View On The Translation Industry, March 2013
TAUS webinar The Big Picture View On The Translation Industry, March 2013TAUS webinar The Big Picture View On The Translation Industry, March 2013
TAUS webinar The Big Picture View On The Translation Industry, March 2013
 
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engineTAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine
TAUS USER CONFERENCE 2010, The Deep Hybrid machine translation engine
 
The cognitive era and the future of content
The cognitive era and the future of contentThe cognitive era and the future of content
The cognitive era and the future of content
 

Ähnlich wie TAUS Moses Roundtable, Prague, 11 September 2013

TAUS MT SHOWCASE, The Open Source MT System Moses and Its Use in the Industry...
TAUS MT SHOWCASE, The Open Source MT System Moses and Its Use in the Industry...TAUS MT SHOWCASE, The Open Source MT System Moses and Its Use in the Industry...
TAUS MT SHOWCASE, The Open Source MT System Moses and Its Use in the Industry...TAUS - The Language Data Network
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, The Landscape, Rahzeb...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, The Landscape, Rahzeb...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, The Landscape, Rahzeb...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, The Landscape, Rahzeb...TAUS - The Language Data Network
 
TAUS Machine Translation Showcase, TAUS Introduction and MT Market Overview, ...
TAUS Machine Translation Showcase, TAUS Introduction and MT Market Overview, ...TAUS Machine Translation Showcase, TAUS Introduction and MT Market Overview, ...
TAUS Machine Translation Showcase, TAUS Introduction and MT Market Overview, ...TAUS - The Language Data Network
 
Presentation at CEF-EU-Luxembourg
Presentation at CEF-EU-LuxembourgPresentation at CEF-EU-Luxembourg
Presentation at CEF-EU-LuxembourgManuel Herranz
 
TAUS Data Association | Consultation: Strategic Roadmap - Webinar (May 12, 2010)
TAUS Data Association | Consultation: Strategic Roadmap - Webinar (May 12, 2010)TAUS Data Association | Consultation: Strategic Roadmap - Webinar (May 12, 2010)
TAUS Data Association | Consultation: Strategic Roadmap - Webinar (May 12, 2010)TAUS - The Language Data Network
 
Integrating MT with TM for Higher Quality Translation (STAR MT and Transit NX...
Integrating MT with TM for Higher Quality Translation (STAR MT and Transit NX...Integrating MT with TM for Higher Quality Translation (STAR MT and Transit NX...
Integrating MT with TM for Higher Quality Translation (STAR MT and Transit NX...TAUS - The Language Data Network
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, Moses Tutorial and Ot...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, Moses Tutorial and Ot...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, Moses Tutorial and Ot...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, Moses Tutorial and Ot...TAUS - The Language Data Network
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Gustavo Lucardi, Trust...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Gustavo Lucardi, Trust...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Gustavo Lucardi, Trust...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Gustavo Lucardi, Trust...TAUS - The Language Data Network
 
User Empowered Machine Translation. Dion Wiggins, Asia Online
User Empowered Machine Translation. Dion Wiggins, Asia OnlineUser Empowered Machine Translation. Dion Wiggins, Asia Online
User Empowered Machine Translation. Dion Wiggins, Asia OnlineABBYY Language Serivces
 
Mostafa Mohamed Abdelmoniem
Mostafa Mohamed AbdelmoniemMostafa Mohamed Abdelmoniem
Mostafa Mohamed Abdelmoniembadom
 
20210428 - Sustainable Engineering practices & API Communities: Adoption Best...
20210428 - Sustainable Engineering practices & API Communities: Adoption Best...20210428 - Sustainable Engineering practices & API Communities: Adoption Best...
20210428 - Sustainable Engineering practices & API Communities: Adoption Best...Angel Alberici
 
TAUS MT SHOWCASE, Introduction and Overview, Rahzeb Choudhury, TAUS, 10 April...
TAUS MT SHOWCASE, Introduction and Overview, Rahzeb Choudhury, TAUS, 10 April...TAUS MT SHOWCASE, Introduction and Overview, Rahzeb Choudhury, TAUS, 10 April...
TAUS MT SHOWCASE, Introduction and Overview, Rahzeb Choudhury, TAUS, 10 April...TAUS - The Language Data Network
 

Ähnlich wie TAUS Moses Roundtable, Prague, 11 September 2013 (20)

Insights in the MT Market, by Jaap van der Meer, TAUS
Insights in the MT Market, by Jaap van der Meer, TAUSInsights in the MT Market, by Jaap van der Meer, TAUS
Insights in the MT Market, by Jaap van der Meer, TAUS
 
TAUS MT SHOWCASE, The Open Source MT System Moses and Its Use in the Industry...
TAUS MT SHOWCASE, The Open Source MT System Moses and Its Use in the Industry...TAUS MT SHOWCASE, The Open Source MT System Moses and Its Use in the Industry...
TAUS MT SHOWCASE, The Open Source MT System Moses and Its Use in the Industry...
 
WEBINAR: TAUS Outlook 2013
WEBINAR: TAUS Outlook 2013WEBINAR: TAUS Outlook 2013
WEBINAR: TAUS Outlook 2013
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, The Landscape, Rahzeb...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, The Landscape, Rahzeb...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, The Landscape, Rahzeb...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, The Landscape, Rahzeb...
 
TAUS Machine Translation Showcase, TAUS Introduction and MT Market Overview, ...
TAUS Machine Translation Showcase, TAUS Introduction and MT Market Overview, ...TAUS Machine Translation Showcase, TAUS Introduction and MT Market Overview, ...
TAUS Machine Translation Showcase, TAUS Introduction and MT Market Overview, ...
 
The state of post editing
The state of post editingThe state of post editing
The state of post editing
 
Presentation at CEF-EU-Luxembourg
Presentation at CEF-EU-LuxembourgPresentation at CEF-EU-Luxembourg
Presentation at CEF-EU-Luxembourg
 
TAUS Webinar March 2012
TAUS Webinar March 2012TAUS Webinar March 2012
TAUS Webinar March 2012
 
TAUS Data Association | Consultation: Strategic Roadmap - Webinar (May 12, 2010)
TAUS Data Association | Consultation: Strategic Roadmap - Webinar (May 12, 2010)TAUS Data Association | Consultation: Strategic Roadmap - Webinar (May 12, 2010)
TAUS Data Association | Consultation: Strategic Roadmap - Webinar (May 12, 2010)
 
OBASHI® - Foundation
OBASHI® - FoundationOBASHI® - Foundation
OBASHI® - Foundation
 
Integrating MT with TM for Higher Quality Translation (STAR MT and Transit NX...
Integrating MT with TM for Higher Quality Translation (STAR MT and Transit NX...Integrating MT with TM for Higher Quality Translation (STAR MT and Transit NX...
Integrating MT with TM for Higher Quality Translation (STAR MT and Transit NX...
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, Moses Tutorial and Ot...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, Moses Tutorial and Ot...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, Moses Tutorial and Ot...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Seattle, Moses Tutorial and Ot...
 
TAUS Webinar January 2012
TAUS Webinar January 2012TAUS Webinar January 2012
TAUS Webinar January 2012
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Gustavo Lucardi, Trust...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Gustavo Lucardi, Trust...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Gustavo Lucardi, Trust...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Gustavo Lucardi, Trust...
 
Translation Technology Showcase in Shenzhen
Translation Technology Showcase in ShenzhenTranslation Technology Showcase in Shenzhen
Translation Technology Showcase in Shenzhen
 
KHALIL HIDMI CV-8
KHALIL HIDMI CV-8KHALIL HIDMI CV-8
KHALIL HIDMI CV-8
 
User Empowered Machine Translation. Dion Wiggins, Asia Online
User Empowered Machine Translation. Dion Wiggins, Asia OnlineUser Empowered Machine Translation. Dion Wiggins, Asia Online
User Empowered Machine Translation. Dion Wiggins, Asia Online
 
Mostafa Mohamed Abdelmoniem
Mostafa Mohamed AbdelmoniemMostafa Mohamed Abdelmoniem
Mostafa Mohamed Abdelmoniem
 
20210428 - Sustainable Engineering practices & API Communities: Adoption Best...
20210428 - Sustainable Engineering practices & API Communities: Adoption Best...20210428 - Sustainable Engineering practices & API Communities: Adoption Best...
20210428 - Sustainable Engineering practices & API Communities: Adoption Best...
 
TAUS MT SHOWCASE, Introduction and Overview, Rahzeb Choudhury, TAUS, 10 April...
TAUS MT SHOWCASE, Introduction and Overview, Rahzeb Choudhury, TAUS, 10 April...TAUS MT SHOWCASE, Introduction and Overview, Rahzeb Choudhury, TAUS, 10 April...
TAUS MT SHOWCASE, Introduction and Overview, Rahzeb Choudhury, TAUS, 10 April...
 

Mehr von TAUS - The Language Data Network

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS - The Language Data Network
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)TAUS - The Language Data Network
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...TAUS - The Language Data Network
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...TAUS - The Language Data Network
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...TAUS - The Language Data Network
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...TAUS - The Language Data Network
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...TAUS - The Language Data Network
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...TAUS - The Language Data Network
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)TAUS - The Language Data Network
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...TAUS - The Language Data Network
 
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)TAUS - The Language Data Network
 
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...Traditional Models of Translation Outsourcing Seem Well-Established and Sound...
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...TAUS - The Language Data Network
 

Mehr von TAUS - The Language Data Network (20)

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
 
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
 
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
 
Farmer Lv (TrueTran)
Farmer Lv (TrueTran)Farmer Lv (TrueTran)
Farmer Lv (TrueTran)
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
 
SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)
 
How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 
QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)
 
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
 
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...Traditional Models of Translation Outsourcing Seem Well-Established and Sound...
Traditional Models of Translation Outsourcing Seem Well-Established and Sound...
 

Kürzlich hochgeladen

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Angeliki Cooney
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsNanddeep Nachan
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKJago de Vreede
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 

Kürzlich hochgeladen (20)

Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 

TAUS Moses Roundtable, Prague, 11 September 2013

  • 1. TAUS Moses Roundtable Welcome and Aims Rahzeb Choudhury TAUS 11-Sep-2013 Prague, Czech Republic
  • 2. This slide may not be used or copied without permission from TAUS Moses Users – Finding Common Ground Are there areas where Moses users (from industry) can cooperate? (beyond what is already done as part of MosesCore) AREA COOPERATION Knowledge Sharing Yes Sharing Investment ? Sharing Code ?
  • 3. This slide may not be used or copied without permission from TAUS Open/Proprietary
  • 4. This slide may not be used or copied without permission from TAUS Agenda 14:00/ Welcome 14:10/ Introductions 14:30/ Results Moses Survey 15:00/ Moses Roadmap 15:30 / Discussion on Areas for Cooperation 16:00 / BREAK 16:30/ Review/Prioritize Areas for Cooperation 17:15/ Wrap Up and Adjourn
  • 6. This slide may not be used or copied without permission from TAUS Introductions o Konstantinos Chatzitheodorou, Alpha CRC o Natalia Kljueva, Charles University o Shadi Salen, Charles University o Milan Condak, Condak.net s.r.o. o Bonnie Dorr, DARPA o Zdena Závůrková, IBM o Anabela Barreiro, INESC-ID o Adam LopezJohns Hopkins University o Christian Buck, Lantis o Michal Kašpar, Lingea s.r.o. o Jacek Skarbek, LocStar o Tomas Fulatak, Moravia o Niko Papula, Multilizer
  • 7. This slide may not be used or copied without permission from TAUS Introductions o Daniel Rosàs, Pactera o Francis Tyers, Prompsit o WonYoung Seo, Samsung Electronics o SeungWook Lee, Samsung Electronics o Falko Schaefer, SAP AG o Alexander Semerenko, Seznam.cz, a.s. o Jie Jiang, Capita T&I o Martin Baumgärtner, STAR Langauge Technology & Solutions GmbH o Ronald Horsselenberg, TransIT BV o Ulrich Germann, University of Edinburgh o Varvara Logacheva, USFD o Alex Yanishevsky, Welocalize o Andrzej Zydroo, XTM-INTL
  • 8. This slide may not be used or copied without permission from TAUS Introductions Organisers o Ondrej Bojar, Charles University o Philipp Koehn, University of Edinburgh o Barry Haddow, University of Edinburgh o Hieu Hoang, University of Edinburgh o Achim Ruopp, TAUS o Rahzeb Choudhury, TAUS
  • 9. TAUS Moses Roundtable MT @ Alpha CRC CHATZITHEODOROU Konstantinos Alpha CRC 11-Sep-2013 Prague, Czech Republic
  • 10. This slide may not be used or copied without permission from TAUS MT @ Alpha CRC o Working with MT since 2006 o Hybrid phrase-based MT system o Post-editing cost evaluation  Have developed Reverse Analysis, a methodology to evaluate the post editing effort on the basis of how much MT output was edited
  • 11. This slide may not be used or copied without permission from TAUS Alpha MT flow Selection of training data Training Moses, SRILM, MGIZA, MERT … Translation Post-editing Re-training Rules Insertion POS, Syntactic, Morpholo gy Terminology optional mandatory
  • 12. TAUS Moses Roundtable <My MTs and my CATs> <Milan Čondák> <Condak.net s.r.o. Petřvald> 11-Sep-2013 Prague, Czech Republic
  • 13. This slide may not be used or copied without permission from TAUS < My MTs and my CATs > <PC Translator> o My first MT was Czech program PC Translator. This SW run in Windows, one language is foreign language and second language is Czech. PC Translator has the bidirectoral indexes and can translate in both directions. o There was 3 main modules: a Dictionary, an Editor and a Dictionary Manager. o PC Translator worked in two modes: translating of an entire file or translating of text in Editor. o By text translating was visible a terminology of opened sentence.
  • 14. This slide may not be used or copied without permission from TAUS Wordfast Classic in MS Word o Wordfast Classic (WFC) have been offering integration MT which works in MS Word o So I asked a developer of PC Translator to create API for MS Word. He created three APIs: for MS Word, for MS Outlook and MS IE. Later he added APIs for more email clients and web brousers. o WFC begun to use new feature, a Companion. In new window is displayed terminology of opened segment which is found in Wordfast glossary.
  • 15. This slide may not be used or copied without permission from TAUS Wordfast Classic in MS Word
  • 16. This slide may not be used or copied without permission from TAUS Web translation services in my MT and CATs o PC Translator can show offers from Google and Bing: o http://www.condak.net/m achine_t/cs/comprendo/c s/07.html o MetaTexis for Word 2007 + Web MT Servers: o http://www.condak.net/c at_other/virtaal/2013082 1/cs/02.html o Free Translation via Internet works without registration. o Virtaal Plugins - models for TM an MT: o http://www.condak.net/c at_other/virtaal/2013082 1/cs/03.html
  • 17. TAUS Moses Roundtable MT in localization company Jacek Skarbek LocStar 11-Sep-2013 Prague, Czech Republic
  • 18. This slide may not be used or copied without permission from TAUS Our experience with MT o We are a software localization provider which has been using CAT tools for 17 years – we have large TMs o Some of our clients provide us work with MTranslated content (both for no matches and as alternative to TM fuzzies) using their own MT solutions – our job is to work on MT like with fuzzy matches (no typical post-editing) o For about 2 years we have been used MT for one of our main customer. We buy MT content from third party that use their own solution based on Moses and TMs provided by customer. o We have large TMs collected and we test our Moses based internal solution to use it in production environment
  • 19. This slide may not be used or copied without permission from TAUS Moses related problems/areas to improve o Tag handling/inline markup – only partially resolved by M4Loc solutions o Lack of API to better/easier integrate Moses into production workflow o We need better terminology/software items handling. I.e something like <zone>, but phrase in zone treated separately from rest of sentence at the level of TM and as a part of sentence at the level of LM o Inflections in Slavic languages
  • 20. This slide may not be used or copied without permission from TAUS Other problems o Weird approach to MT rate – customers tend to decrease translation rate in the same percent as measured (or estimated) acceleration of translation work itself, while translation rates covers also project management and all other linguistic and technical tasks that are not accelerated by MT o We are not allowed to use MT for some customers – it is restricted by work agreement . Although we treat MTranslated content as fuzzy matches, they afraid that it would impact on final quality of translation
  • 21. TAUS Moses Roundtable Samsung Electronics Cooperation and SMT Seung-Wook Lee, Wonyoung Seo Samsung Electronics Corporation 11-Sep-2013 Prague, Czech Republic
  • 22. This slide may not be used or copied without permission from TAUS Samsung Electronics Cooperation and Machine Translation o Our team provide translation services for various internal groupware applications (e.g., instance messenger) for the department o One of the main concerns of ours is to expand language pairs  There are very little of bilingual corpus available for the most of languages, such as Asian languages  Is indirect translation the solution? how do we deal with the error propagation?  Working groups and developer meetings for those languages may necessary
  • 23. TAUS Moses Roundtable Statistical Machine Translation at SAP Dr. Falko Schaefer SAP Language Services 11-Sep-2013 Prague, Czech Republic
  • 24. This slide may not be used or copied without permission from TAUS SMT Project at SLS o SAP Language Services (SLS) has successfully worked with rule-based MT for over 20 years o However, the growing demand for a new breed of MT meant that SLS began to embark on Moses-based SMT in early 2013 o The SLS MT project aims to establish a new MT service to reduce translation throughput time and cost o To that end SLS works with an external partner to support implementation and knowledge transfer
  • 25. TAUS Moses Roundtable Alex Yanishevsky Welocalize 11-Sep-2013 Prague, Czech Republic
  • 26. This slide may not be used or copied without permission from TAUS Company Intro o One of top 10 LSPs (language service providers) o Department dedicated to MT and Language Tools (evaluation of MT, productivity workbench, corpus preparation, vendor selection, vendor training and certification) o MT agnostic o MT integrated into TMS/GMS
  • 27. This slide may not be used or copied without permission from TAUS Areas of Interest o Productization of Moses -lower barrier of entry -interoperability o Integration into TMS/GMS o Tag handling o Predictive modeling
  • 28. TAUS Moses Roundtable Results: Moses Survey Achim Ruopp TAUS 11-Sep-2013 Prague, Czech Republic
  • 29. This slide may not be used or copied without permission from TAUS Demographic Composition 0% 5% 10% 15% 20% 25% 30% 35% Consultant Language Service Provider/Agency Other Research Institute Translation Buyer Translation Technology Provider Translator 2013 2012 2011
  • 30. This slide may not be used or copied without permission from TAUS Ranking of Requested Moses Improvements 2013 Rank 2012 Rank 2011 Rank 1 1 3 Training and translation speed 2 4 1 Integrating Moses into existing workflow/system (e.g. TM integration) 3 3 2 Installing and using Moses 4 Terminology Management 5 2 4 Evaluation results (e.g. evaluating productivity) 6 5 5 Language-specific issues 7 Advanced features (e.g. tree-based translation) 8 7 7 Customer support 6 6 Easier to get the right human resources
  • 31. This slide may not be used or copied without permission from TAUS #1 Requested Moses Improvement Training and Translation Speed o Users are aware that SMT requires a considerable amount of computing resources o Request driven by management and user demands  Fast-turn-around/online translation  Frequent re-training of systems with new/updated data o Recommendation  Integrate recent training speed improvements into the training tool chain  Document recommendations how to best use the training speed improvements  Further optimize performance for multi-threaded decoding
  • 32. This slide may not be used or copied without permission from TAUS #2 Requested Moses Improvement Integrating Moses into Existing Workflows/Systems o Integration into growing number of diverse systems  TMS/CaT/TenT  Content Management Systems  Automated Speech Recognition  Dialog Systems  … o Recommendation  Comprehensive, stable and well documented APIs to the decoder and data produced by it  RESTful HTTP API (Google/Bing compatible?)  Finish Okapi/M4Loc file format support
  • 33. This slide may not be used or copied without permission from TAUS #3 Requested Moses Improvement Installing and Using Moses o Still a range of installation experiences from “No problem” to “very complex to understand and to implement” o Should Moses team provide installable packages? o Windows support? o UI? o Recommendation  Occasional stable releases of Moses as installable packages across different platforms – consistency is key  Take on the maintenance and release of required components abandoned by their original developers
  • 34. This slide may not be used or copied without permission from TAUS #4 Requested Moses Improvement Terminology Management o Terminology injection from terminological resources  Term bases  Named entity recognizers o Recommendation:  Better documentation of the XML input feature  Ensuring that the XML input feature minimally impacts translation quality of the overall sentence  Handling input with named entities marked up by named entity recognizers  Ensure XML input can be handled in the complete tool chain (e.g. tokenizer)
  • 35. This slide may not be used or copied without permission from TAUS #5 Requested Moses Improvement Evaluation o Expansion of the metrics that can be used to tune Moses MT systems  Specifically for MT+post-editing scenario o Evaluation and productivity testing systems  Can be external o Recommendation  Integrate tuning metrics into Moses that allow optimizing systems for the MT+post-editing usage scenario  Ensure interoperability with external evaluation/productivity testing systems, e.g. TAUS DQF, QT Launchpad
  • 36. This slide may not be used or copied without permission from TAUS #6 Requested Moses Improvement Language-Specific Issues o Moses is focused on a relatively small set of European languages o Survey participants would like to see tools for more languages included o Full Unicode support o Recommendation  Test and improve Unicode support in the language- independent core  Recommend and document use of additional language tools  Encourage users to report Unicode issues and provide language-specific data
  • 37. This slide may not be used or copied without permission from TAUS #7 Requested Moses Improvement Advanced Features for Moses Commercialization o Received a broad cross-section of requests o Researchers develop cutting-edge technologies that could benefit industry o Too often conversations still happen in distinct academic and industry silos o Recommendation  Start a conversation explaining how newly developed methods and technologies can help the industry to address critical MT issues, e.g. o Tree-based/syntax-based models o Morphologically rich languages
  • 38. This slide may not be used or copied without permission from TAUS #8 Requested Moses Improvement Customer Support o Moses support mailing list considered excellent o Few requests for professional support or faster support response times o Recommendation  Continue excellent support on mailing list  Improve documentation for some industry-relevant features to allow easier adoption
  • 39. This slide may not be used or copied without permission from TAUS Moses Open Source Project Strengths Weaknesses o Additions/updates to “core” Moses: decoder, training, LM  Latest methods  Benefiting all users o Documentation and support o MosesCore funded releases and tutorial o Few contributions by long-time industry Moses users  Adobe Moses Tools  DoMY CE o Complexity of installation/use for entry-level users
  • 40. This slide may not be used or copied without permission from TAUS Moses Future Academic Project Broad Adoption o Sharing platform for research progress o Unstable code base o Complex use o Integration by few sophisticated technology providers o Similar OSS project  HTK speech recognition toolkit o Ease of installation o Ease of use for diverse scenarios o Pre-trained engines o Similar OSS projects  PostgreSQL  NLTK (Natural Language Toolkit)  CMU Sphinx Not mutually exclusive!
  • 41. This slide may not be used or copied without permission from TAUS PostgreSQL Object-relational database management system o Started in 1986 by Michael Stonebraker at UC Berkeley o Evolved from research project into universal RDBMS o Used by Apple, BASF, Skype, Redhat, governments, universities … o Broad contributor base  Often industry funded o PostgreSQL license (similar to MIT license) o Commercial support through EnterpriseDB o Consulting/training available
  • 42. This slide may not be used or copied without permission from TAUS Discussion To Follow o Discuss industry needs o Identify areas of industry cooperation o Discuss line between open source project and proprietary add-ons
  • 43. This slide may not be used or copied without permission from TAUS Open/Proprietary
  • 44. TAUS Moses Roundtable Moses Roadmap Philipp Koehn 11-Sep-2013 Prague, Czech Republic
  • 45. Moses Roadmap Philipp Koehn 11 September 2013 Philipp Koehn Roadmap 11 September 2013
  • 46. 1Development in Moses • Moses is mainly developed in academia • Academic research progress is somewhat un-predictable • Biases 1. quality 2. scalability 3. usability • Sometimes research use cases do not match industry use cases (e.g., translation of news vs. technical documentation) Philipp Koehn Roadmap 11 September 2013
  • 47. 2Modular Design Cube Pruning Chart Parse- Decoding LM Driven Chart Decoding Stack Beam Decoding irstLM randLM kenLMSRILM Decoding Algorithm Language Model Forced Decoding LM Server Text In Memory XML On Disk Confusion Network Suffix Array over Corpus Lattice Compact Text N-Best Search Graph Input Translation Model Output CSLM Philipp Koehn Roadmap 11 September 2013
  • 48. 3Progress in Models 1990 2000 word-based models 2010 phrase-based models formal grammar-based models linguistic grammar-based models semantics Philipp Koehn Roadmap 11 September 2013
  • 49. 4Progress in Methods 1990 2000 2010 probabilistic models parameter tuning large-scale discriminative training Philipp Koehn Roadmap 11 September 2013
  • 50. 5 Quality Some examples from UEDIN systems in WMT 2013 • Better machine learning methods • Linguistically motivated models • More data Philipp Koehn Roadmap 11 September 2013
  • 51. 6 Quality Some examples from UEDIN systems in WMT 2013 • Better machine learning methods operation sequence model • Linguistically motivated models syntax-based machine translation model • More data training a language model on 130 billion words Philipp Koehn Roadmap 11 September 2013
  • 52. o 1 Generate(Ich,I) Ich↓ I o 2 GenerateTargetOnly(do) Ich↓ Ido o 3 o 4 InsertGap Generate(nicht,not) Ichnicht↓ Idonot o 5 o 6 JumpBack(1) Generate(gehe,go) Ichgehe↓nicht Idonotgo o 7 GenerateSourceOnly(ja) Ichgeheja↓nicht Idonotgo o 8 JumpForward Ichgehejanicht↓ Idonotgo o 9 Generate(zum,tothe) ...gehejanichtzum↓ ...notgotothe o 10 Generate(haus,house) ...janichtzumhaus↓ ...gotothehouse 7 Operation Sequence Model 5-gram sequence model over operations (minimal phrase translations, reordering) p(o1) p(o2|o1) p(o3|o1,o2) ... p(o10|o6,o7,o8,o9) Feature function in Moses [Durrani et al., ACL 2013] de-en fr-en es-en cs-en ru-en en-de en-fr en-es en-cs en-ru 2012 +.26 +.19 +.49 +.33 +.46 +.07 +.60 +.57 +.35 +.30 2013 +.29 +.37 +.90 +.09 +.33 +.20 +.36 +.44 +.27 +.40 Philipp Koehn Roadmap 11 September 2013
  • 53. 8 Sie PPER will VAFIN eine ART Tasse NN Kaffee NN trinken VVINF NP VP S PRO she VB drink NN | cup IN | of NP PP NN NP DET | a VBZ | wants VB VP VP NPTO | to NN coffee PRO VP Syntax-Based Machine Translation [Nadejde et. al, WMT2013] ➏ S ➊ ➋ ➌ ➍ ➎ German–English English–German manual score 0.608 0.586 system UEDIN-SYNTAX UEDIN-PHRASE manual score 0.614 0.571 system UEDIN-SYNTAX UEDIN-PHRASE Philipp Koehn Roadmap 11 September 2013
  • 54. 9 Huge, I Say Huge!, Language Model • Unpruned 5-gram language model trained on 130 billion words • Training straightforward [Heafield et al., ACL 2013] • Decoding requires 1TB RAM machine • Best performance at WMT2013 (manual judgment) Spanish–English French–English Czech–English score 0.624 0.595 0.570 system UEDIN-HEAFIELD ”ONLINE-B” UEDIN score 0.638 0.604 0.591 system UEDIN-HEAFIELD UEDIN ”ONLINE-B” score 0.607 0.582 0.562 system UEDIN-HEAFIELD ”ONLINE-B” UEDIN Philipp Koehn Roadmap 11 September 2013
  • 55. 10Usability • Main uses of Moses – productivity tool for professional translators – gisting for information discovery • Research driven by real-world use – – – – incremental training handling of tags terminology management quality estimation Philipp Koehn Roadmap 11 September 2013
  • 56. 11 • Integration of statistical MT and collaborative translation memories • Novel technology – Self-tuning machine translation – User adaptive machine translation – Informative machine translation • Open source workbench • Extensive testing by translation agency Philipp Koehn Roadmap 11 September 2013
  • 57. 12 • Cognitive studies of translator behaviour based on key logging and eye tracking • Novel types of assistance to human translators – interactive translation prediction – interactive editing – adaptive translation models • Open source workbench • Field tests by translation agency and online volunteer translation platforms Philipp Koehn Roadmap 11 September 2013
  • 58. 13 • Use of machine translation for community content • Novel technology – Pre-editing of content – Monolingual and bilingual post-editing – Development of feedback loops • Use in – commercial product forum relating to Symantec network security products – content in community of volunteer translators Traducteurs sans Fronti`eres Philipp Koehn Roadmap 11 September 2013
  • 59. 14 The Future: Better Models • Syntax-based and semantic statistical models – improvements to basic tools of natural language processing – requires annotated data resources, annotation standards – new models, training methods, inference algorithms • Exploitation of data — machine learning – different types: parallel, comparable, monolingual, interactive – scaling up of existing machine learning methods – adaptation to user needs • Integration with other technologies – – – – – – human translation and localization workflows speech recognition dialog systems information retrieval data mining communication systems Philipp Koehn Roadmap 11 September 2013
  • 60. 15 The Future: Better Usability • Installation – MOSESCORE installer, pre-built binaries – pre-installed virtual machines for Amazon EC et al. • Resources – ongoing efforts to make data publicly available – memory and time efficient training and decoding • Integration into workflows – – – – addressing requirements of professional translators industry-led projects on handling tags, untranslated terms, terminology MOSESCORE ”arrows” workflow management various server process implementation, e.g., based on Google API Philipp Koehn Roadmap 11 September 2013
  • 61. 16Thank You questions? Philipp Koehn Roadmap 11 September 2013
  • 62. TAUS Moses Roundtable Review and Discussion of Sharing Optons in the Industry Rahzeb Choudhury, Achim Ruopp TAUS 11-Sep-2013 Prague, Czech Republic
  • 63. This slide may not be used or copied without permission from TAUS Sharing Knowledge o TAUS Machine Translation Showcases  Co-located with Localization World Conferences  Familiarizing the industry with Moses/SMT  Users share experiences  Panel discussions o TAUS Machine Translation and Moses Tutorial  Online tutorial teaching theory and practice  300+ registered users  Developed in collaboration with UEdin o This TAUS Moses Roundtable
  • 64. This slide may not be used or copied without permission from TAUS Sharing Code o DoMY CE  Prepare training corpora  Train & tune SMT models  Manage SMT resources  Translate documents o M4Loc – Moses for Localization  Integration with popular open source Okapi localization framework  Adobe Moses Tools o In Moses /contrib folder  Moses for Mere Mortals  Several web APIs o Language-specific non-breaking prefix files
  • 65. This slide may not be used or copied without permission from TAUS Industry Sharing Knowledge Investment Code
  • 66. This slide may not be used or copied without permission from TAUS Discussion: Ideas for Sharing o What are common use scenarios?  (among participants) o MT as a productivity enhancer  Beginner, Pilot, Implementation, Production, Ongoing rollout o MT to gist – no participants involved in the scenario o How do we make them easier to achieve?
  • 67. This slide may not be used or copied without permission from TAUS Beginners -Installing and Using Moses o Still a range of installation experiences from “No problem” to “very complex to understand and to implement” o Should Moses team provide installable packages? o Windows support? o UI? o Recommendation  Occasional stable releases of Moses as installable packages across different platforms – consistency is key  Take on the maintenance and release of required components abandoned by their original developers
  • 68. This slide may not be used or copied without permission from TAUS Beginners - Installing and Using Moses o Conclusions during meeting:  The resources available (Moses site, support list, MT and Moses Tutorial) are sufficient  The v1 release is very welcome and look forward to future releases  Try to ensure these resources are more easily discoverable, ensure documentation stays up to date, and easy to use
  • 69. This slide may not be used or copied without permission from TAUS Implementation Integrating Moses into Existing Workflows/Systems o Integration into growing number of diverse systems  TMS/CaT/TenT  Content Management Systems  Automated Speech Recognition  Dialog Systems  … o Recommendation  Comprehensive, stable and well documented APIs to the decoder and data produced by it  RESTful HTTP API (Google/Bing compatible?)  Finish Okapi/M4Loc file format support
  • 70. This slide may not be used or copied without permission from TAUS Implementation Integrating Moses into Existing Workflows/Systems o Conclusions during meeting:  Main areas of cooperation (APIs and formatting) covered by current activity  TAUS to help with next steps for Moses4Loc (Formatting) to help ensure there is thorough testing
  • 71. This slide may not be used or copied without permission from TAUS Production Training and Translation Speed o Users are aware that SMT requires a considerable amount of computing resources o Request driven by management and user demands  Fast-turn-around/online translation  Frequent re-training of systems with new/updated data o Recommendation  Integrate recent training speed improvements into the training tool chain  Document recommendations how to best use the training speed improvements  Further optimize performance for multi-threaded decoding
  • 72. This slide may not be used or copied without permission from TAUS Production Training and Translation Speed o Conclusions during meeting:  Participants did not have any specific ideas beyond what the MosesCore consortium members are already doing
  • 73. This slide may not be used or copied without permission from TAUS Other Issues/Ideas Raised o Lack of Data  The TAUS Data repository was shown as a potential source of training data o Interoperability  Going forward it would be good to be able to share translation and language models  Participants briefly discussed the complexity of the challenge o Shared engines  It was suggested that baseline language/industry/domain engines be made available  Making the engines built as part of the TAUS Developing Talent project available may be a good start. TAUS will look into this.
  • 74. TAUS Moses Roundtable Thank you! Achim Ruopp (achim@taus.net) Rahzeb Choudhury (rahzeb@taus.net )

Hinweis der Redaktion

  1. Point out that industry users make up majority of respondents now; selection bias?: unlikely, as numbers shifted over the past 3 years
  2. Clustering? Distributed Models?Demands will grow, so requirement unlikely to go away
  3. Unicode support is not sexy, but needs to be stable, otherwise trained engines quickly become obsolete; see also XML input support
  4. Unicode support is not sexy, but needs to be stable, otherwise trained engines quickly become obsolete; see also XML input support
  5. Unicode support is not sexy, but needs to be stable, otherwise trained engines quickly become obsolete; see also XML input support
  6. “Complexity of installation/use for entry-level users”: in addition to complexity of SMT
  7. HTK: Example given by Daniel Marcu (SDL) – no active developmentCMU Sphinx: e.g. used by BBC R&amp;DFine line
  8. PostgreSQL as a prime example how an academic project can be transformed into a long-living successful open source project
  9. Clustering? Distributed Models?Demands will grow, so requirement unlikely to go away
  10. Clustering? Distributed Models?Demands will grow, so requirement unlikely to go away