SlideShare ist ein Scribd-Unternehmen logo
1 von 41
Downloaden Sie, um offline zu lesen
How We Use Moses to Develop
      Our Multi-lingual Machine
        Translation Systems?

                   Chengqing ZONG (宗成庆)
 Institute of Automation, Chinese Academy of Sciences
                       中国科学院自动化研究所
                       cqzong@nlpr.ia.ac.cn

100190 北京市海澱區中關村東路95號                     電郵:cqzong@nlpr.ia.ac.cn
http://www.nlpr.ia.ac.cn/cip/cqzong.htm   電話: +86-10-6255 4263
Outline

1.    Brief Introduction to Our Work
2.    Main Features of Moses
3.    How We Use Moses?
4.    Our Feeling
1.  Brief Introduction to Our Work
Our group is working with machine translation
(MT) research and system development in the
National Laboratory of Pattern Recognition
(NLPR), Institute of Automation, Chinese
Academy of Sciences (CASIA).
  u  6 staffs
  u  8 Ph.D candidates, 1 Master student
  u  5 visiting scholars
1.  Brief Introduction to Our Work
1.  Brief Introduction to Our Work
Multilingual text-to-text translation system



           Japanese



                       Chinese
1.  Brief Introduction to Our Work
n  In
    evaluation of spoken
language translation
(SLT) organized by
IWSLT’2007
The performance of CE
clean text translation of
our system was the best
one according to the
results of human
rankings.
1.  Brief Introduction to Our Work
n    In IWSLT’2008
                       CASIA
 CASIA
1.  Brief Introduction to Our Work
n    In IWSLT’2009




                      CASIA




                      CASIA
1.  Brief Introduction to Our Work


              CASIA




              CASIA
1.  Brief Introduction to Our Work


              NLPR
1.  Brief Introduction to Our Work
 ²  In MT evaluation organized by China Workshop on
     Machine Translation (CWMT) 2011 (Sept. 23~ 24th), our
     system participated in all tasks:
     1.  Chinese to English (News domain, progress)
     2.  English to Chinese (News domain, progress)
     3.  English to Chinese (News domain, current)
     4.  English to Chinese (Science domain)
     5.  Japanese to Chinese (News domain)
     6.  Tibetan to Chinese (Government documents)
     7.  Mongolian to Chinese (Daily)
     8.  Uigur to Chinese (News domain)
     9.  Kazakh to Chinese (News domain)
     10.  Kir Kyrgyz to Chinese (News domain)
19 Units and 165 Systems participated in this evaluation
1.  Brief Introduction to Our Work
According to BLEU scores, the performance of our
system was the top one in the following 5 tasks :
  ü    English to Chinese (News domain, progress)
  ü    Japanese-to-Chinese (News domain)
  ü    Tibetan to Chinese (Government documents)
  ü    Mongolian to Chinese (Daily)
  ü    Kir Kyrgyz to Chinese (News domain)
And it is ranked at the second position in the following 4
tasks: ü  Chinese to English (News domain, progress)
         ü  English to Chinese (News domain, current)
         ü  Uigur to Chinese (News domain)
         ü  Kazakh to Chinese (News domain)
Outline

1.    Brief Introduction to Our Work
2.    Main Features of Moses
3.    How We Use Moses?
4.    Our Feeling
2. Main Features of Moses
n    The basic ideas of statistical machine translation
      (SMT) can be formulated in principle as
          ebest =argmaxe p(f | e)×pLM(e)×wlength(e)

      Now it is usually implemented by a log-linear
      model:



                       weight        feature
2. Main Features of Moses
Some useful features include:
 ü    Phrase translation probability ;
 ü    Lexical phrase translation probability ;
 ü    Inversed phrase translation probability ;
 ü    Inversed lexical phrase translation probability ;
 ü    English language model based on n-gram ;
 ü    English sentence length penalty ;
 ü    Chinese phrase count penalty.
2. Main Features of Moses
A phrase-based example:
                 欧洲 部分 地区 遭受 洪水 袭击
(1)
        欧洲            部分 地区       遭受 洪水 袭击

(2)
        Europe         parts of   hit by floods

(3)
        parts of       Europe     hit by floods
2. Main Features of Moses
           Development
              data

Parallel      Moses
 data        training    Test data


           Translation     Moses
             model        decoder

                            Target       Moses
The Framework:           translation   evaluation

                                       Good or bad
2. Main Features of Moses
n    Offer two types of translation models:
      phase-based and tree-based
n    Support factored translation models
n    Allow the decoding of different kinds of
      inputs: sentences, confusion networks and
      word lattices
2. Main Features of Moses
n    Support n-best translation output besides the
      best one
       l    This is a good conference.
       l    This was a great conference.
       l    It is a good meeting.
       l    ……
n    Provide an experimental management system
n    Translate fast with a good translation quality
2. Main Features of Moses
n    Keep balance on Speed or Quality?
      n  If we want translation speed, Moses provides us
          many options to accelerate the translation
          process, such as beam size, the granularity of
          translation rules.
      n  If we pursue translation quality, Moses also

          allows us to enlarge the translation search space
          in order to have a bigger change to obtain a
          better translation.
2. Main Features of Moses
n    It now includes more and more even better
      translation models
      n  Hierarchical Phrase-based Translation Model
          (HPB)
      n  Tree-to-Tree/String-to-Tree Translation Models


n    It provides more new features, such as
      faster language modeling, multi-thread
      decoding, client-server translation etc.
      It keeps improving ……
2. Main Features of Moses
n    Moses provides good documentation and
      friendly interface
n    We can upgrade the components if we need
n    We can develop hybrid translation methods
      in the framework of Moses

      It allows extension ……
Outline

1.    Brief Introduction to Our Work
2.    Main Features of Moses
3.    How We Use Moses?
4.    Our Feeling
3. How We Use Moses?

n    Moses facilitates our research work
      l    For the beginners of SMT
      l    For the researchers familiar with SMT
      l    For the engineers to build an SMT system
3. How We Use Moses?
u For the beginners of SMT:
n    For most beginners of SMT, Moses is the most fresh
      and vivid tutorials to give the beginners an intuitive
      feeling of SMT;
n    Detailed guidance is very easy for beginners to use;
n    It can provide a preliminary understanding of the
      modules involved in the SMT system;
n    It can guide beginners to locate their interested
      research in SMT quickly.
3. How We Use Moses?

We use Moses as a tutorial tool.
3. How We Use Moses?
u For the researchers familiar with SMT
n    Moses provides the whole toolkit for
      building a translation system
      n    data preparation, word alignment, translation rule
            extraction, parameter tuning, decoding, and
            evaluation
n    We just need to study the sub-models that
      we are interested in and then propose new
      algorithms, and finally verify the
      effectiveness using Moses
3. How We Use Moses?
n    For example, we proposed a new algorithm of
      word alignment and translation rule
      extraction
n    Moses can help us to verify the effectiveness
      of the proposed methods in just few days. It
      accelerates our research work a lot
n    The most important for MT researchers,
      Moses has become a de facto standard
      baseline to test their own models
3. How We Use Moses?

We develop new models to compare
with Moses and propose new algorithms
to implement on Moses platform.
3. How We Use Moses?
                          Interlingua



              Semantic                  Semantic

                         Tree-to-tree
              Syntax                       Syntax
 String-to-tree                                 Tree-to-string
Formalism gram.          Hierarchical          Formalism gram.
                         phrase based
                         Phrase-based
  Phrases                                          Phrases

                       Word-based model
Source language                                    Target language
3. How We Use Moses?
u For the engineers to build an SMT system
n    They do not need to care about the principle about
      how Moses works
n    just need to provide training data, development data,
      and test data
n    do some pre-processing work to make data clean
n    do some post-processing work to convert the output
Source sentence
                      Pre-processing

      MT engine 1       Moses
                       MT engine 2 …     MT engine 6


    n-best list        n-best list   … n-best list


                  Merged n-best list                 MBR decoder

                     Word aligning               References for
                                                   alignment
                    Merging alignments
                                      Decoder based on
                  Confusion network         C.N        Translation
                          NLPR, CAS-IA 4/23/12         32
3. How We Use Moses?

We        also use Moses as a tool to
evaluate the quality of some collected
parallel corpus because we can build an MT
system in two or three days based on the
corpus and evaluate the quality of translation.
We know how well the translation quality
reflect the quality of corpus.
3. How We Use Moses?
For example,
1-1 merkezdiki dölet apparatliri bilen jaylardiki dölet
   apparatlirining xizmet hoquqi merkezning bir tutash
   rehberlikide jaylarning teshebbuskarliqi we aktipliqini toluq
   jari qildurush prinsipi boyiche ayrilidu.
1-2 中央和地方的国家机构职权的划分,遵循在中央的统一
   领导下,充分发挥地方的主动性、积极性的原则。
2-1 madda jungxua xelq jumhuriyitide hemme millet
   bapbarawer.
2-2 中华人民共和国各民族一律平等。
    ……
3. How We Use Moses?

Many participant systems in MT
evaluations in the world employ Moses,
such as in evaluations of NIST, WMT, IWSLT
and CWMT etc.
3. How We Use Moses?
Systems   Use Moses?
 DCU         √         7 among 11
 DFKI        √         systems
 FBK         √         employed Moses
 KIT         √
                       in SLT evaluation
 LIG         √
LIMSI
                       of IWSLT’2011!
 LIUM        √
 MIT
 MSR
 NICT        √
RWTH
3. How We Use Moses?
Systems      Use     Systems   Use Moses ?
           Moses ?
 DCU         √        HIT          √
                                             16 among 19
 NTT         √        IMNU         √
                                             systems
Systran      √        FRDC         √         employed
ICT-CAS      √        BUAA         √
                                             Moses in MT
                                             evaluation of
IA-CAS       √        XMU          √
                                             CWMT’2011!
IS-CAS       √        IIM          √
 NEU                  NJU
 XAUT        √        BJTU         √
 ISTIC                XJU          √
XJIPC        √
Outline

1.    Brief Introduction to Our Work
2.    Main Features of Moses
3.    How We Use Moses?
4.    Our Feeling
4. Our Feeling
n    Moses is our friend
      n  It is a good helper and saves us a lot of labor
      n  It is a good mirror to reflect the quality of our

          MT systems
      n  It is a roll booster of MT research




      We love our friend!
4. Our Feeling
n    Moses is our competitor
      n  We hope to develop new translation models to
          surpass Moses, as an MT researcher
      n  Competition makes us get progress



 We love our competitor!
 We love Moses!
Thanks
 谢谢!

Weitere ähnliche Inhalte

Ähnlich wie TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Chengqing Zong, Casia, 23 April 2012

Classification of Machine Translation Outputs Using NB Classifier and SVM for...
Classification of Machine Translation Outputs Using NB Classifier and SVM for...Classification of Machine Translation Outputs Using NB Classifier and SVM for...
Classification of Machine Translation Outputs Using NB Classifier and SVM for...mlaij
 
Integration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translationIntegration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translationChamani Shiranthika
 
Deep network notes.pdf
Deep network notes.pdfDeep network notes.pdf
Deep network notes.pdfRamya Nellutla
 
What is machine translation
What is machine translationWhat is machine translation
What is machine translationStephen Peacock
 
Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...RajkiranVeluri
 
Modeling of Speech Synthesis of Standard Arabic Using an Expert System
Modeling of Speech Synthesis of Standard Arabic Using an Expert SystemModeling of Speech Synthesis of Standard Arabic Using an Expert System
Modeling of Speech Synthesis of Standard Arabic Using an Expert Systemcsandit
 
A Novel Approach for Rule Based Translation of English to Marathi
A Novel Approach for Rule Based Translation of English to MarathiA Novel Approach for Rule Based Translation of English to Marathi
A Novel Approach for Rule Based Translation of English to Marathiaciijournal
 
A Novel Approach for Rule Based Translation of English to Marathi
A Novel Approach for Rule Based Translation of English to MarathiA Novel Approach for Rule Based Translation of English to Marathi
A Novel Approach for Rule Based Translation of English to Marathiaciijournal
 
A Novel Approach for Rule Based Translation of English to Marathi
A Novel Approach for Rule Based Translation of English to MarathiA Novel Approach for Rule Based Translation of English to Marathi
A Novel Approach for Rule Based Translation of English to Marathiaciijournal
 
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...IRJET Journal
 
Turkish language modeling using BERT
Turkish language modeling using BERTTurkish language modeling using BERT
Turkish language modeling using BERTAbdurrahimDerric
 
Session on machine translation batu 19 march2016
Session on machine translation batu 19 march2016Session on machine translation batu 19 march2016
Session on machine translation batu 19 march2016Nakul Sharma
 
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...kevig
 
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...ijnlc
 

Ähnlich wie TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Chengqing Zong, Casia, 23 April 2012 (20)

How to Translate from English to Khmer using Moses
How to Translate from English to Khmer using MosesHow to Translate from English to Khmer using Moses
How to Translate from English to Khmer using Moses
 
Translationusing moses1
Translationusing moses1Translationusing moses1
Translationusing moses1
 
Classification of Machine Translation Outputs Using NB Classifier and SVM for...
Classification of Machine Translation Outputs Using NB Classifier and SVM for...Classification of Machine Translation Outputs Using NB Classifier and SVM for...
Classification of Machine Translation Outputs Using NB Classifier and SVM for...
 
Integration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translationIntegration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translation
 
Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...
Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...
Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...
 
Deep network notes.pdf
Deep network notes.pdfDeep network notes.pdf
Deep network notes.pdf
 
What is machine translation
What is machine translationWhat is machine translation
What is machine translation
 
Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...
 
Modeling of Speech Synthesis of Standard Arabic Using an Expert System
Modeling of Speech Synthesis of Standard Arabic Using an Expert SystemModeling of Speech Synthesis of Standard Arabic Using an Expert System
Modeling of Speech Synthesis of Standard Arabic Using an Expert System
 
Moses
MosesMoses
Moses
 
A Novel Approach for Rule Based Translation of English to Marathi
A Novel Approach for Rule Based Translation of English to MarathiA Novel Approach for Rule Based Translation of English to Marathi
A Novel Approach for Rule Based Translation of English to Marathi
 
A Novel Approach for Rule Based Translation of English to Marathi
A Novel Approach for Rule Based Translation of English to MarathiA Novel Approach for Rule Based Translation of English to Marathi
A Novel Approach for Rule Based Translation of English to Marathi
 
A Novel Approach for Rule Based Translation of English to Marathi
A Novel Approach for Rule Based Translation of English to MarathiA Novel Approach for Rule Based Translation of English to Marathi
A Novel Approach for Rule Based Translation of English to Marathi
 
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
Rendering Of Voice By Using Convolutional Neural Network And With The Help Of...
 
[IJET-V2I1P13] Authors:Shilpa More, Gagandeep .S. Dhir , Deepak Daiwadney and...
[IJET-V2I1P13] Authors:Shilpa More, Gagandeep .S. Dhir , Deepak Daiwadney and...[IJET-V2I1P13] Authors:Shilpa More, Gagandeep .S. Dhir , Deepak Daiwadney and...
[IJET-V2I1P13] Authors:Shilpa More, Gagandeep .S. Dhir , Deepak Daiwadney and...
 
Turkish language modeling using BERT
Turkish language modeling using BERTTurkish language modeling using BERT
Turkish language modeling using BERT
 
Session on machine translation batu 19 march2016
Session on machine translation batu 19 march2016Session on machine translation batu 19 march2016
Session on machine translation batu 19 march2016
 
SMT3
SMT3SMT3
SMT3
 
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...
 
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...
ATTENTION-BASED SYLLABLE LEVEL NEURAL MACHINE TRANSLATION SYSTEM FOR MYANMAR ...
 

Mehr von TAUS - The Language Data Network

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS - The Language Data Network
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)TAUS - The Language Data Network
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...TAUS - The Language Data Network
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...TAUS - The Language Data Network
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...TAUS - The Language Data Network
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...TAUS - The Language Data Network
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...TAUS - The Language Data Network
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...TAUS - The Language Data Network
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)TAUS - The Language Data Network
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...TAUS - The Language Data Network
 
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)TAUS - The Language Data Network
 

Mehr von TAUS - The Language Data Network (20)

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
 
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
 
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
 
Farmer Lv (TrueTran)
Farmer Lv (TrueTran)Farmer Lv (TrueTran)
Farmer Lv (TrueTran)
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 
Translation Technology Showcase in Shenzhen
Translation Technology Showcase in ShenzhenTranslation Technology Showcase in Shenzhen
Translation Technology Showcase in Shenzhen
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
 
SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)
 
How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 
QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)
 
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
How Existing Quality Models Get Challenged, by Katka Gasova (Moravia)
 

Kürzlich hochgeladen

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 

Kürzlich hochgeladen (20)

08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 

TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Chengqing Zong, Casia, 23 April 2012

  • 1. How We Use Moses to Develop Our Multi-lingual Machine Translation Systems? Chengqing ZONG (宗成庆) Institute of Automation, Chinese Academy of Sciences 中国科学院自动化研究所 cqzong@nlpr.ia.ac.cn 100190 北京市海澱區中關村東路95號 電郵:cqzong@nlpr.ia.ac.cn http://www.nlpr.ia.ac.cn/cip/cqzong.htm 電話: +86-10-6255 4263
  • 2. Outline 1.  Brief Introduction to Our Work 2.  Main Features of Moses 3.  How We Use Moses? 4.  Our Feeling
  • 3. 1.  Brief Introduction to Our Work Our group is working with machine translation (MT) research and system development in the National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences (CASIA). u  6 staffs u  8 Ph.D candidates, 1 Master student u  5 visiting scholars
  • 5. 1.  Brief Introduction to Our Work Multilingual text-to-text translation system Japanese Chinese
  • 6. 1.  Brief Introduction to Our Work n  In evaluation of spoken language translation (SLT) organized by IWSLT’2007 The performance of CE clean text translation of our system was the best one according to the results of human rankings.
  • 7. 1.  Brief Introduction to Our Work n  In IWSLT’2008 CASIA CASIA
  • 8. 1.  Brief Introduction to Our Work n  In IWSLT’2009 CASIA CASIA
  • 9. 1.  Brief Introduction to Our Work CASIA CASIA
  • 10. 1.  Brief Introduction to Our Work NLPR
  • 11. 1.  Brief Introduction to Our Work ²  In MT evaluation organized by China Workshop on Machine Translation (CWMT) 2011 (Sept. 23~ 24th), our system participated in all tasks: 1.  Chinese to English (News domain, progress) 2.  English to Chinese (News domain, progress) 3.  English to Chinese (News domain, current) 4.  English to Chinese (Science domain) 5.  Japanese to Chinese (News domain) 6.  Tibetan to Chinese (Government documents) 7.  Mongolian to Chinese (Daily) 8.  Uigur to Chinese (News domain) 9.  Kazakh to Chinese (News domain) 10.  Kir Kyrgyz to Chinese (News domain) 19 Units and 165 Systems participated in this evaluation
  • 12. 1.  Brief Introduction to Our Work According to BLEU scores, the performance of our system was the top one in the following 5 tasks : ü  English to Chinese (News domain, progress) ü  Japanese-to-Chinese (News domain) ü  Tibetan to Chinese (Government documents) ü  Mongolian to Chinese (Daily) ü  Kir Kyrgyz to Chinese (News domain) And it is ranked at the second position in the following 4 tasks: ü  Chinese to English (News domain, progress) ü  English to Chinese (News domain, current) ü  Uigur to Chinese (News domain) ü  Kazakh to Chinese (News domain)
  • 13. Outline 1.  Brief Introduction to Our Work 2.  Main Features of Moses 3.  How We Use Moses? 4.  Our Feeling
  • 14. 2. Main Features of Moses n  The basic ideas of statistical machine translation (SMT) can be formulated in principle as ebest =argmaxe p(f | e)×pLM(e)×wlength(e) Now it is usually implemented by a log-linear model: weight feature
  • 15. 2. Main Features of Moses Some useful features include: ü  Phrase translation probability ; ü  Lexical phrase translation probability ; ü  Inversed phrase translation probability ; ü  Inversed lexical phrase translation probability ; ü  English language model based on n-gram ; ü  English sentence length penalty ; ü  Chinese phrase count penalty.
  • 16. 2. Main Features of Moses A phrase-based example: 欧洲 部分 地区 遭受 洪水 袭击 (1) 欧洲 部分 地区 遭受 洪水 袭击 (2) Europe parts of hit by floods (3) parts of Europe hit by floods
  • 17. 2. Main Features of Moses Development data Parallel Moses data training Test data Translation Moses model decoder Target Moses The Framework: translation evaluation Good or bad
  • 18. 2. Main Features of Moses n  Offer two types of translation models: phase-based and tree-based n  Support factored translation models n  Allow the decoding of different kinds of inputs: sentences, confusion networks and word lattices
  • 19. 2. Main Features of Moses n  Support n-best translation output besides the best one l  This is a good conference. l  This was a great conference. l  It is a good meeting. l  …… n  Provide an experimental management system n  Translate fast with a good translation quality
  • 20. 2. Main Features of Moses n  Keep balance on Speed or Quality? n  If we want translation speed, Moses provides us many options to accelerate the translation process, such as beam size, the granularity of translation rules. n  If we pursue translation quality, Moses also allows us to enlarge the translation search space in order to have a bigger change to obtain a better translation.
  • 21. 2. Main Features of Moses n  It now includes more and more even better translation models n  Hierarchical Phrase-based Translation Model (HPB) n  Tree-to-Tree/String-to-Tree Translation Models n  It provides more new features, such as faster language modeling, multi-thread decoding, client-server translation etc. It keeps improving ……
  • 22. 2. Main Features of Moses n  Moses provides good documentation and friendly interface n  We can upgrade the components if we need n  We can develop hybrid translation methods in the framework of Moses It allows extension ……
  • 23. Outline 1.  Brief Introduction to Our Work 2.  Main Features of Moses 3.  How We Use Moses? 4.  Our Feeling
  • 24. 3. How We Use Moses? n  Moses facilitates our research work l  For the beginners of SMT l  For the researchers familiar with SMT l  For the engineers to build an SMT system
  • 25. 3. How We Use Moses? u For the beginners of SMT: n  For most beginners of SMT, Moses is the most fresh and vivid tutorials to give the beginners an intuitive feeling of SMT; n  Detailed guidance is very easy for beginners to use; n  It can provide a preliminary understanding of the modules involved in the SMT system; n  It can guide beginners to locate their interested research in SMT quickly.
  • 26. 3. How We Use Moses? We use Moses as a tutorial tool.
  • 27. 3. How We Use Moses? u For the researchers familiar with SMT n  Moses provides the whole toolkit for building a translation system n  data preparation, word alignment, translation rule extraction, parameter tuning, decoding, and evaluation n  We just need to study the sub-models that we are interested in and then propose new algorithms, and finally verify the effectiveness using Moses
  • 28. 3. How We Use Moses? n  For example, we proposed a new algorithm of word alignment and translation rule extraction n  Moses can help us to verify the effectiveness of the proposed methods in just few days. It accelerates our research work a lot n  The most important for MT researchers, Moses has become a de facto standard baseline to test their own models
  • 29. 3. How We Use Moses? We develop new models to compare with Moses and propose new algorithms to implement on Moses platform.
  • 30. 3. How We Use Moses? Interlingua Semantic Semantic Tree-to-tree Syntax Syntax String-to-tree Tree-to-string Formalism gram. Hierarchical Formalism gram. phrase based Phrase-based Phrases Phrases Word-based model Source language Target language
  • 31. 3. How We Use Moses? u For the engineers to build an SMT system n  They do not need to care about the principle about how Moses works n  just need to provide training data, development data, and test data n  do some pre-processing work to make data clean n  do some post-processing work to convert the output
  • 32. Source sentence Pre-processing MT engine 1 Moses MT engine 2 … MT engine 6 n-best list n-best list … n-best list Merged n-best list MBR decoder Word aligning References for alignment Merging alignments Decoder based on Confusion network C.N Translation NLPR, CAS-IA 4/23/12 32
  • 33. 3. How We Use Moses? We also use Moses as a tool to evaluate the quality of some collected parallel corpus because we can build an MT system in two or three days based on the corpus and evaluate the quality of translation. We know how well the translation quality reflect the quality of corpus.
  • 34. 3. How We Use Moses? For example, 1-1 merkezdiki dölet apparatliri bilen jaylardiki dölet apparatlirining xizmet hoquqi merkezning bir tutash rehberlikide jaylarning teshebbuskarliqi we aktipliqini toluq jari qildurush prinsipi boyiche ayrilidu. 1-2 中央和地方的国家机构职权的划分,遵循在中央的统一 领导下,充分发挥地方的主动性、积极性的原则。 2-1 madda jungxua xelq jumhuriyitide hemme millet bapbarawer. 2-2 中华人民共和国各民族一律平等。 ……
  • 35. 3. How We Use Moses? Many participant systems in MT evaluations in the world employ Moses, such as in evaluations of NIST, WMT, IWSLT and CWMT etc.
  • 36. 3. How We Use Moses? Systems Use Moses? DCU √ 7 among 11 DFKI √ systems FBK √ employed Moses KIT √ in SLT evaluation LIG √ LIMSI of IWSLT’2011! LIUM √ MIT MSR NICT √ RWTH
  • 37. 3. How We Use Moses? Systems Use Systems Use Moses ? Moses ? DCU √ HIT √ 16 among 19 NTT √ IMNU √ systems Systran √ FRDC √ employed ICT-CAS √ BUAA √ Moses in MT evaluation of IA-CAS √ XMU √ CWMT’2011! IS-CAS √ IIM √ NEU NJU XAUT √ BJTU √ ISTIC XJU √ XJIPC √
  • 38. Outline 1.  Brief Introduction to Our Work 2.  Main Features of Moses 3.  How We Use Moses? 4.  Our Feeling
  • 39. 4. Our Feeling n  Moses is our friend n  It is a good helper and saves us a lot of labor n  It is a good mirror to reflect the quality of our MT systems n  It is a roll booster of MT research We love our friend!
  • 40. 4. Our Feeling n  Moses is our competitor n  We hope to develop new translation models to surpass Moses, as an MT researcher n  Competition makes us get progress We love our competitor! We love Moses!