SlideShare ist ein Scribd-Unternehmen logo
1 von 41
25th International Conference, GSCL 2013 
Aaron L.-F. Han, Derek F. Wong, Lidia S. Chao, Liangye He, Shuo Li, 
and Ling Zhu 
September 25th -27th, 2013, Darmstadt, Germany 
Natural Language Processing & Portuguese-Chinese Machine Translation 
Laboratory 
Department of Computer and Information Science 
University of Macau
 Background of language Treebank 
 Motivation 
 Designed phrase tagset mapping 
 Application in MT evaluation 
1. Manual evaluations 
2. Traditional automatic MT evaluation methods 
3. Designed unsupervised MT evaluation 
4. Evaluating the evaluation method 
5. Experiments 
6. Open source code 
 Discussion 
 Further information
• To promote the development of syntactic analysis 
• Many language treebanks are developed 
– English Penn Treebank (Marcus et al., 1993; Mitchell et al., 
1994) 
– German Negra Treebank (Skut et al., 1997) 
– French Treebank (Abeillé et al., 2003) 
– Chinese Sinica Treebank (Chen et al., 2003) 
– Etc.
• Problems 
– Different treebanks use their own syntactic tagsets 
– The number of tags ranging from tens (e.g. English Penn 
Treebank) to hundreds (e.g. Chinese Sinica Treebank) 
– Inconvenient when undertaking the multilingual or cross-lingual 
research
• To bridge the gap between these treebanks and 
facilitate future research 
– E.g. the unsupervised induction of syntactic structure 
• Petrov et al. (2012) develop a universal POS tagset 
• How about the phrase level tags? 
• The disaccord problem in the phrase level tags 
remains unsolved 
– Let’s try to solve it
• Tentative design of phrase tagset mapping 
– On English Penn Treebank I, II & French Treebank 
• 9 universal phrasal categories covering 
– 14 phrase tags in English Penn Treebank I 
– 26 phrase tags in English Penn Treebank II 
– 14 phrase tags in French Treebank
Table 1: phrase tagset mapping for French and English treebanks
• Universal phrasal categories: NP (noun phrase), 
VP (verb phrase), AJP (adjective phrase), AVP 
(adverbial phrase), PP (prepositional phrase), S (sub/- 
sentence), CONJP (conjunction phrase), COP 
(coordinated phrse), X (other phrases or unknown) 
• NP covering 
– French tags: NP 
– English tags: NP, NAC (the scope of certain prenominal 
modifiers within an NP), NX (within certain complex NPs to 
mark the head of NP), WHNP (wh-noun phrase), QP 
(quantifier phrase)
• VP covering 
– French tags: VN (verbal nucleus), VP (infinitives and 
nonfinite clauses) 
– English tags: VP (verb phrase) 
• AJP covering 
– French tags: AP (adjectival phrase) 
– English tags: ADJP (adjective phrase), WHADJP (wh-adjective 
phrase)
• AVP covering 
– French tags: AdP (adverbial phrases) 
– English tags: ADVP (adverb phrase), WHAVP (wh-adverb 
phrase), PRT (particle) 
• PP covering 
– French tags: PP 
– English tags: PP, WHPP (wh-propositional phrase phrase)
• S covering 
– French tags: SENT (sentence), S (finite clause) 
– English tags: S (simple declarative clause), SBAR (clause 
introduced by a subordinating conjunction), SBARQ (direct 
question introduced by a wh-phrase), SINV (declarative 
sentence with subject-aux inversion), SQ (sub-constituent 
of SBARQ), PRN (parenthetical), FRAG (fragment), RRC 
(reduced relative clause). 
• CONJP covering 
– French tags: N/A 
– English tags: CONJP
• COP covering 
– French tags: COORD (coordinated phrase) 
– English tags: UCP (coordinated phrases belonging to 
different categories) 
• X covering 
– French tags: unknown 
– English tags: X (unknown or uncertain), INTJ (interjection), 
LST (list marker)
4. Application in Machine Translation 
evaluation
• Rapid development of Machine Translations 
– MT began as early as in the 1950s (Weaver, 1955) 
– Big progress science the 1990s due to the development of 
computers (storage capacity and computational power) 
and the enlarged bilingual corpora (Marino et al. 2006) 
• Difficulties of MT evaluation 
– language variability results in no single correct translation 
– the natural languages are highly ambiguous and different 
languages do not always express the same content in the 
same way (Arnold, 2003)
• Traditional manual evaluation criteria: 
– intelligibility (measuring how understandable the 
sentence is) 
– fidelity (measuring how much information the translated 
sentence retains as compared to the original) by the 
Automatic Language Processing Advisory Committee 
(ALPAC) around 1966 (Carroll, 1966) 
– adequacy (similar as fidelity), fluency (whether the 
sentence is well-formed and fluent) and comprehension 
(improved intelligibility) by Defense Advanced Research 
Projects Agency (DARPA) of US (White et al., 1994)
• Problems of manual evaluations : 
– Time-consuming 
– Expensive 
– Unrepeatable 
– Low agreement (Callison-Burch, et al., 2011)
• Measuring the similarity of automatic translation and 
reference translation 
– Automatic translation (or hypothesis translation, target 
translation): by automatic MT system 
– Reference translation: by professional translators 
– Source language and source document: not used 
• Traditional automatic evaluation: 
– BLEU: n-gram precisions (Papineni et al., 2002) 
– TER: edit distances (Snover et al., 2006) 
– METEOR: precision and recall (Banerjee and Lavie, 2005)
• Problems in supervised MT evaluation 
– Reference translations are expensive 
– Reference translations are not available is some cases 
• Could we get rid of the reference translation? 
– Unsupervised MT evaluation method 
– Extract information from source and target language 
– How to use the designed universal phrase tagset?
• Assume that the translated sentence should have a 
similar set of phrase categories with the source 
sentence. 
– This design is inspired by the synonymous relation 
between source and target sentence. 
• Two sentences that have similar set of phrases may 
talk about different things. 
– However, this evaluation approach is not designed for 
general circumstance 
– Assume that the targeted sentences are indeed the 
translated sentences from the source document
• First, we parse the source and target languages 
respectively 
• Then we extract the phrase set from the source and 
target sentences 
• Third, we convert the phrases into the developed 
universal phrase categories 
• Last, we measure the similarity of source and target 
language on the universal phrase sequences
Figure 1: the parsed French and English sentence
The level of extracted phrase tags: just the upper level of POS tags, bottom-up 
Figure 2: convert the extracted phrase into universal phrase tags
• What is the similarity metric we employed? 
• Designed similarity metric: HPPR 
– N1 gram position order difference penalty 
– Weighted N2 gram precision 
– Weighted N3 gram recall 
– Weighted geometric mean in n-gram precision & recall 
– Weighted harmonic mean to combine sub-factors 
– The parameters are tunable according to different 
language pairs
• 퐻푃푃푅 = 퐻푎푟(푤푃푠푁1푃푠퐷푖푓, 푤푃푟푁2푃푟푒, 푤푅푐푁3푅푒푐) 
• 퐻푃푃푅 = 
푤푃푠+푤푃푟+푤푅푐 
푤푃푠 
푁1푃푠퐷푖푓 
푤푃푟 
푁2푃푟푒 
+ 
푤푅푐 
푁3푅푒푐 
+ 
• 푁1푃푠퐷푖푓, 푁2푃푟푒, and 푁3푅푒푐 are the corpus level 
scores of sub-factors position difference penalty, 
precision and recall.
• The sentence level 푁1푃푠퐷푖푓 score: 
• 푁1푃푠퐷푖푓 = exp(−푁1푃퐷) 
1 
• 푁1푃퐷 = 
퐿푒푛푔푡ℎℎ푦푝 
Σ|푃퐷푖 | 
• 푃퐷푖 = |푃푠푁ℎ푦푝 − 푀푎푡푐ℎ푃푠푁푠푟푐 | 
• 푃푠푁ℎ푦푝 and 푀푎푡푐ℎ푃푠푁푠푟푐 are the position number 
of matching tag in the hypothesis and source 
sentence respectively. When no match for the tag: 
푃퐷푖 = |푃푠푁ℎ푦푝 − 0|
Figure 3: N1 gram tag alignment algorithm
Figure 4: 푁1푃퐷 calculation example
• Corpus-level weighted n-gram precision & recall 
• 푁2푃푟푒 = exp(Σ푁2 푤푛푙표푔푃푛) 
푛=1 
푁3 푤푛푙표푔푅푛) 
• 푁3푅푒푐 = exp(Σ푛=1 
• 푃푛 = 
#푚푎푡푐ℎ푒푑 푛푔푟푎푚 푐ℎ푢푛푘푠 
#푛푔푟푎푚 푐ℎ푢푛푘푠 표푓 ℎ푦푝표푡ℎ푒푠푖푠 푐표푟푝푢푠 
• 푅푛 = 
#푚푎푡푐ℎ푒푑 푛푔푟푎푚 푐ℎ푢푛푘푠 
#푛푔푟푎푚 푐ℎ푢푛푘푠 표푓 푠표푢푟푐푒 푐표푟푝푢푠
Figure 5: bigram chunk matching example
• How reliable is the automatic metric? 
• Evaluation criteria for evaluation metrics: 
– Human judgments are the golden to approach, currently 
– Correlation with human judgments (Callison-Burch, et al., 
2011, 2012) 
• Spearman rank correlation coefficient rs: 
– 푟푠 푋푌 = 1 − 
푛 푑푖 
6 Σ푖=1 
2 
푛(푛2−1) 
– Two rank sequences 푋 = 푥1, … , 푥푛 , 푌 = {푦1, … , 푦푛}
• Corpus from WMT 
– Workshop of statistical machine translation 
– SIGMT, ACL’S special interest group of machine translation 
• Training data (WMT11), tune the parameters 
– 3, 003 sentences for each document 
– 18 automatic French-to-English MT systems 
• Testing data (WMT12) 
– 3, 003 sentences for each document 
– 15 automatic French-to-English MT systems
• Training, tune the parameters 
– N1, N2 and N3 are tuned as 2, 3 and 3 due to the fact that 
the 4-gram chunk match usually results in 0 score. 
– Tuned values of factor weights are shown in table 
Table 2: tuned parameter values
• Comparisons with: 
– BLEU, measure the closeness of the hypothesis and 
reference translations, n-gram precision 
– TER, measure the editing distance of hypothesis to 
reference translations
Table 3: training (development) scores on WMT11 corpus 
Table 4: testing scores on WMT12 corpus
Table 5: correlation score intro (Cohen, 1988) 
 The experiment results on the development and testing corpora show that 
HPPR without using reference translations has yielded promising 
correlation scores (0.63 and 0.59 respectively). 
 There is still potential to improve the performances of all the three 
metrics, even though that the correlation scores which are higher than 0.5 
are already considered as strong correlation as shown in Table 5.
• Phrase Tagset Mapping for French and English 
Treebanks and Its Application in Machine 
Translation Evaluation 
– Aaron L.-F. Han, Derek F. Wong, Lidia S. Chao, Liangye He, 
Shuo Li, and Ling Zhu. GSCL 2013, Darmstadt, Germany. 
LNCS Vol. 8105, pp. 119-131, Volume Editors: Iryna 
Gurevych, Chris Biemann and Torsten Zesch. 
• Open source tool for phrase tagset mapping 
and HPPR similarity measuring algorithms: 
https://github.com/aaronlifenghan/aaron-project-hppr
• Facilitate future research in multilingual or cross-lingual 
literature, this paper designs a phrase tags 
mapping between the French Treebank and the 
English Penn Treebank using 9 phrase categories. 
• One of the potential applications of the designed 
universal phrase tagset is shown in the unsupervised 
MT evaluation task in the experiment section.
• There are still some limitations in this work to be 
addressed in the future. 
– The designed universal phrase categories may not be 
able to cover all the phrase tags of other language 
treebanks, so this tagset could be expanded when 
necessary. 
– The designed HPPR formula contains the n-gram factors 
of position difference, precision and recall, which may not 
be sufficient or suitable for some of the other language 
pairs, so different measuring factors should be added or 
switched when facing new tasks.
• Actually speaking, the designed models are very 
related to the similarity measuring. Where we 
have employed them is in the MT evaluation. These 
works may be further developed into other 
literature: 
– information retrieval 
– question and answering 
– Searching 
– text analysis 
– etc.
• Ongoing and further works: 
– The combination of translation and evaluation, tuning the 
translation model using evaluation metrics 
– Evaluation models from the perspective of semantics 
– The further explorations of unsupervised evaluation 
models, extracting other features from source and target 
languages 
• Aaron open source tools: https://github.com/aaronlifenghan 
• Aaron network Home: http://www.linkedin.com/in/aaronhan
GSCL 2013, Darmstadt, Germany 
Aaron L.-F. Han 
email: hanlifengaaron AT gmail DOT com 
Natural Language Processing & Portuguese-Chinese Machine Translation 
Laboratory 
Department of Computer and Information Science 
University of Macau

Weitere ähnliche Inhalte

Was ist angesagt?

Language Models for Information Retrieval
Language Models for Information RetrievalLanguage Models for Information Retrieval
Language Models for Information RetrievalNik Spirin
 
A general method applicable to the search for anglicisms in russian social ne...
A general method applicable to the search for anglicisms in russian social ne...A general method applicable to the search for anglicisms in russian social ne...
A general method applicable to the search for anglicisms in russian social ne...Ilia Karpov
 
Semantics and Computational Semantics
Semantics and Computational SemanticsSemantics and Computational Semantics
Semantics and Computational SemanticsMarina Santini
 
Parallel Corpora in (Machine) Translation: goals, issues and methodologies
Parallel Corpora in (Machine) Translation: goals, issues and methodologiesParallel Corpora in (Machine) Translation: goals, issues and methodologies
Parallel Corpora in (Machine) Translation: goals, issues and methodologiesAntonio Toral
 
A statistical approach to machine translation
A statistical approach to machine translationA statistical approach to machine translation
A statistical approach to machine translationHiroshi Matsumoto
 
4. Josef Van Genabith (DCU) & Khalil Sima'an (UVA) Example Based Machine Tran...
4. Josef Van Genabith (DCU) & Khalil Sima'an (UVA) Example Based Machine Tran...4. Josef Van Genabith (DCU) & Khalil Sima'an (UVA) Example Based Machine Tran...
4. Josef Van Genabith (DCU) & Khalil Sima'an (UVA) Example Based Machine Tran...RIILP
 
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015RIILP
 
Lecture 2: Computational Semantics
Lecture 2: Computational SemanticsLecture 2: Computational Semantics
Lecture 2: Computational SemanticsMarina Santini
 
Word representation: SVD, LSA, Word2Vec
Word representation: SVD, LSA, Word2VecWord representation: SVD, LSA, Word2Vec
Word representation: SVD, LSA, Word2Vecananth
 
A Low Dimensionality Representation for Language Variety Identification (CICL...
A Low Dimensionality Representation for Language Variety Identification (CICL...A Low Dimensionality Representation for Language Variety Identification (CICL...
A Low Dimensionality Representation for Language Variety Identification (CICL...Francisco Manuel Rangel Pardo
 
Language Variety Identification using Distributed Representations of Words an...
Language Variety Identification using Distributed Representations of Words an...Language Variety Identification using Distributed Representations of Words an...
Language Variety Identification using Distributed Representations of Words an...Francisco Manuel Rangel Pardo
 
ACL-WMT2013.A Description of Tunable Machine Translation Evaluation Systems i...
ACL-WMT2013.A Description of Tunable Machine Translation Evaluation Systems i...ACL-WMT2013.A Description of Tunable Machine Translation Evaluation Systems i...
ACL-WMT2013.A Description of Tunable Machine Translation Evaluation Systems i...Lifeng (Aaron) Han
 
Argument extraction from news, blog and social media.
Argument extraction  from news, blog and social media.Argument extraction  from news, blog and social media.
Argument extraction from news, blog and social media.Sharath TS
 
Crash Course in Natural Language Processing (2016)
Crash Course in Natural Language Processing (2016)Crash Course in Natural Language Processing (2016)
Crash Course in Natural Language Processing (2016)Vsevolod Dyomkin
 

Was ist angesagt? (20)

Nlp
NlpNlp
Nlp
 
Language Models for Information Retrieval
Language Models for Information RetrievalLanguage Models for Information Retrieval
Language Models for Information Retrieval
 
A general method applicable to the search for anglicisms in russian social ne...
A general method applicable to the search for anglicisms in russian social ne...A general method applicable to the search for anglicisms in russian social ne...
A general method applicable to the search for anglicisms in russian social ne...
 
Semantics and Computational Semantics
Semantics and Computational SemanticsSemantics and Computational Semantics
Semantics and Computational Semantics
 
Parallel Corpora in (Machine) Translation: goals, issues and methodologies
Parallel Corpora in (Machine) Translation: goals, issues and methodologiesParallel Corpora in (Machine) Translation: goals, issues and methodologies
Parallel Corpora in (Machine) Translation: goals, issues and methodologies
 
A statistical approach to machine translation
A statistical approach to machine translationA statistical approach to machine translation
A statistical approach to machine translation
 
8 issues in pos tagging
8 issues in pos tagging8 issues in pos tagging
8 issues in pos tagging
 
4. Josef Van Genabith (DCU) & Khalil Sima'an (UVA) Example Based Machine Tran...
4. Josef Van Genabith (DCU) & Khalil Sima'an (UVA) Example Based Machine Tran...4. Josef Van Genabith (DCU) & Khalil Sima'an (UVA) Example Based Machine Tran...
4. Josef Van Genabith (DCU) & Khalil Sima'an (UVA) Example Based Machine Tran...
 
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015
ESR10 Joachim Daiber - EXPERT Summer School - Malaga 2015
 
Lecture 2: Computational Semantics
Lecture 2: Computational SemanticsLecture 2: Computational Semantics
Lecture 2: Computational Semantics
 
AINL 2016: Eyecioglu
AINL 2016: EyeciogluAINL 2016: Eyecioglu
AINL 2016: Eyecioglu
 
Word representation: SVD, LSA, Word2Vec
Word representation: SVD, LSA, Word2VecWord representation: SVD, LSA, Word2Vec
Word representation: SVD, LSA, Word2Vec
 
A Low Dimensionality Representation for Language Variety Identification (CICL...
A Low Dimensionality Representation for Language Variety Identification (CICL...A Low Dimensionality Representation for Language Variety Identification (CICL...
A Low Dimensionality Representation for Language Variety Identification (CICL...
 
Intro to NLP. Lecture 2
Intro to NLP.  Lecture 2Intro to NLP.  Lecture 2
Intro to NLP. Lecture 2
 
Language Variety Identification using Distributed Representations of Words an...
Language Variety Identification using Distributed Representations of Words an...Language Variety Identification using Distributed Representations of Words an...
Language Variety Identification using Distributed Representations of Words an...
 
ACL-WMT2013.A Description of Tunable Machine Translation Evaluation Systems i...
ACL-WMT2013.A Description of Tunable Machine Translation Evaluation Systems i...ACL-WMT2013.A Description of Tunable Machine Translation Evaluation Systems i...
ACL-WMT2013.A Description of Tunable Machine Translation Evaluation Systems i...
 
L3 v2
L3 v2L3 v2
L3 v2
 
Argument extraction from news, blog and social media.
Argument extraction  from news, blog and social media.Argument extraction  from news, blog and social media.
Argument extraction from news, blog and social media.
 
Zrm
ZrmZrm
Zrm
 
Crash Course in Natural Language Processing (2016)
Crash Course in Natural Language Processing (2016)Crash Course in Natural Language Processing (2016)
Crash Course in Natural Language Processing (2016)
 

Andere mochten auch

The CLEAR Dependency
The CLEAR DependencyThe CLEAR Dependency
The CLEAR DependencyJinho Choi
 
Google translate 1
Google translate 1Google translate 1
Google translate 1Debbie Lahav
 
Human vs-Machine-Translation
Human vs-Machine-TranslationHuman vs-Machine-Translation
Human vs-Machine-TranslationNordicTrans.com
 
Is Google Translate Effective At Sentence Changing
Is Google Translate Effective At Sentence ChangingIs Google Translate Effective At Sentence Changing
Is Google Translate Effective At Sentence ChangingSentenceChanger
 
Google Translate + TectoMT
Google Translate + TectoMTGoogle Translate + TectoMT
Google Translate + TectoMTMartin Majlis
 
Language Use And Preservation Online
Language Use And Preservation OnlineLanguage Use And Preservation Online
Language Use And Preservation OnlineTadej Gregorcic
 
Building Translate on Glass
Building Translate on GlassBuilding Translate on Glass
Building Translate on GlassTrish Whetzel
 
8 Google Translate
8 Google Translate8 Google Translate
8 Google Translateaptwano
 
Google translate (new russian)
Google translate (new russian)Google translate (new russian)
Google translate (new russian)Nurbek Matzhani
 
Google Translate in the Classroom
Google Translate in the ClassroomGoogle Translate in the Classroom
Google Translate in the Classroommarafaye
 
Bilingual Data Mining for the English-Amharic Statistical Machine Translation...
Bilingual Data Mining for the English-Amharic Statistical Machine Translation...Bilingual Data Mining for the English-Amharic Statistical Machine Translation...
Bilingual Data Mining for the English-Amharic Statistical Machine Translation...Guy De Pauw
 
Amharic document clustering
Amharic document clusteringAmharic document clustering
Amharic document clusteringGuy De Pauw
 
Google Translate Update
Google Translate UpdateGoogle Translate Update
Google Translate Updatemrsvogel
 
Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...Daniel Adenew
 
Machine Translation=Google Translator
Machine Translation=Google TranslatorMachine Translation=Google Translator
Machine Translation=Google TranslatorNerea
 
Types of machine translation
Types of machine translationTypes of machine translation
Types of machine translationRushdi Shams
 

Andere mochten auch (20)

The CLEAR Dependency
The CLEAR DependencyThe CLEAR Dependency
The CLEAR Dependency
 
Chapter14part2
Chapter14part2Chapter14part2
Chapter14part2
 
Google translate 1
Google translate 1Google translate 1
Google translate 1
 
Human vs-Machine-Translation
Human vs-Machine-TranslationHuman vs-Machine-Translation
Human vs-Machine-Translation
 
Is Google Translate Effective At Sentence Changing
Is Google Translate Effective At Sentence ChangingIs Google Translate Effective At Sentence Changing
Is Google Translate Effective At Sentence Changing
 
Google Translate + TectoMT
Google Translate + TectoMTGoogle Translate + TectoMT
Google Translate + TectoMT
 
Language Use And Preservation Online
Language Use And Preservation OnlineLanguage Use And Preservation Online
Language Use And Preservation Online
 
Building Translate on Glass
Building Translate on GlassBuilding Translate on Glass
Building Translate on Glass
 
8 Google Translate
8 Google Translate8 Google Translate
8 Google Translate
 
Google translate (new russian)
Google translate (new russian)Google translate (new russian)
Google translate (new russian)
 
Google Translate in the Classroom
Google Translate in the ClassroomGoogle Translate in the Classroom
Google Translate in the Classroom
 
Bilingual Data Mining for the English-Amharic Statistical Machine Translation...
Bilingual Data Mining for the English-Amharic Statistical Machine Translation...Bilingual Data Mining for the English-Amharic Statistical Machine Translation...
Bilingual Data Mining for the English-Amharic Statistical Machine Translation...
 
Linguistic Evaluation of Support Verb Construction Translations by OpenLogos ...
Linguistic Evaluation of Support Verb Construction Translations by OpenLogos ...Linguistic Evaluation of Support Verb Construction Translations by OpenLogos ...
Linguistic Evaluation of Support Verb Construction Translations by OpenLogos ...
 
Amharic document clustering
Amharic document clusteringAmharic document clustering
Amharic document clustering
 
Google Translate Update
Google Translate UpdateGoogle Translate Update
Google Translate Update
 
Google translate
Google translateGoogle translate
Google translate
 
Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...Natural language processing with python and amharic syntax parse tree by dani...
Natural language processing with python and amharic syntax parse tree by dani...
 
Machine Translation=Google Translator
Machine Translation=Google TranslatorMachine Translation=Google Translator
Machine Translation=Google Translator
 
Types of machine translation
Types of machine translationTypes of machine translation
Types of machine translation
 
Slideshare
SlideshareSlideshare
Slideshare
 

Ähnlich wie Pptphrase tagset mapping for french and english treebanks and its application in machine translation evaluation

MT SUMMIT PPT: Language-independent Model for Machine Translation Evaluation ...
MT SUMMIT PPT: Language-independent Model for Machine Translation Evaluation ...MT SUMMIT PPT: Language-independent Model for Machine Translation Evaluation ...
MT SUMMIT PPT: Language-independent Model for Machine Translation Evaluation ...Lifeng (Aaron) Han
 
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT Lifeng (Aaron) Han
 
Lepor: augmented automatic MT evaluation metric
Lepor: augmented automatic MT evaluation metricLepor: augmented automatic MT evaluation metric
Lepor: augmented automatic MT evaluation metricLifeng (Aaron) Han
 
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools Lifeng (Aaron) Han
 
TSD2013 PPT.AUTOMATIC MACHINE TRANSLATION EVALUATION WITH PART-OF-SPEECH INFO...
TSD2013 PPT.AUTOMATIC MACHINE TRANSLATION EVALUATION WITH PART-OF-SPEECH INFO...TSD2013 PPT.AUTOMATIC MACHINE TRANSLATION EVALUATION WITH PART-OF-SPEECH INFO...
TSD2013 PPT.AUTOMATIC MACHINE TRANSLATION EVALUATION WITH PART-OF-SPEECH INFO...Lifeng (Aaron) Han
 
PPT-CCL: A Universal Phrase Tagset for Multilingual Treebanks
PPT-CCL: A Universal Phrase Tagset for Multilingual TreebanksPPT-CCL: A Universal Phrase Tagset for Multilingual Treebanks
PPT-CCL: A Universal Phrase Tagset for Multilingual TreebanksLifeng (Aaron) Han
 
NLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language ModelNLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language ModelHemantha Kulathilake
 
The Effect of Translationese on Statistical Machine Translation
The Effect of Translationese on Statistical Machine TranslationThe Effect of Translationese on Statistical Machine Translation
The Effect of Translationese on Statistical Machine TranslationGennadi Lembersky
 
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmmUnit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmmDhruvKushwaha12
 
Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4DigiGurukul
 
2-Chapter Two-N-gram Language Models.ppt
2-Chapter Two-N-gram Language Models.ppt2-Chapter Two-N-gram Language Models.ppt
2-Chapter Two-N-gram Language Models.pptmilkesa13
 
Master defence 2020 - Anastasiia Khaburska - Statistical and Neural Language ...
Master defence 2020 - Anastasiia Khaburska - Statistical and Neural Language ...Master defence 2020 - Anastasiia Khaburska - Statistical and Neural Language ...
Master defence 2020 - Anastasiia Khaburska - Statistical and Neural Language ...Lviv Data Science Summer School
 
MT SUMMIT13.Language-independent Model for Machine Translation Evaluation wit...
MT SUMMIT13.Language-independent Model for Machine Translation Evaluation wit...MT SUMMIT13.Language-independent Model for Machine Translation Evaluation wit...
MT SUMMIT13.Language-independent Model for Machine Translation Evaluation wit...Lifeng (Aaron) Han
 
Meta-evaluation of machine translation evaluation methods
Meta-evaluation of machine translation evaluation methodsMeta-evaluation of machine translation evaluation methods
Meta-evaluation of machine translation evaluation methodsLifeng (Aaron) Han
 
Natural Language Processing Course in AI
Natural Language Processing Course in AINatural Language Processing Course in AI
Natural Language Processing Course in AISATHYANARAYANAKB
 
Thamme Gowda's PhD dissertation defense slides
Thamme Gowda's PhD dissertation defense slidesThamme Gowda's PhD dissertation defense slides
Thamme Gowda's PhD dissertation defense slidesThamme Gowda
 
Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...RajkiranVeluri
 
Integration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translationIntegration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translationChamani Shiranthika
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language ProcessingToine Bogers
 

Ähnlich wie Pptphrase tagset mapping for french and english treebanks and its application in machine translation evaluation (20)

MT SUMMIT PPT: Language-independent Model for Machine Translation Evaluation ...
MT SUMMIT PPT: Language-independent Model for Machine Translation Evaluation ...MT SUMMIT PPT: Language-independent Model for Machine Translation Evaluation ...
MT SUMMIT PPT: Language-independent Model for Machine Translation Evaluation ...
 
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
 
Lepor: augmented automatic MT evaluation metric
Lepor: augmented automatic MT evaluation metricLepor: augmented automatic MT evaluation metric
Lepor: augmented automatic MT evaluation metric
 
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
CUHK intern PPT. Machine Translation Evaluation: Methods and Tools
 
TSD2013 PPT.AUTOMATIC MACHINE TRANSLATION EVALUATION WITH PART-OF-SPEECH INFO...
TSD2013 PPT.AUTOMATIC MACHINE TRANSLATION EVALUATION WITH PART-OF-SPEECH INFO...TSD2013 PPT.AUTOMATIC MACHINE TRANSLATION EVALUATION WITH PART-OF-SPEECH INFO...
TSD2013 PPT.AUTOMATIC MACHINE TRANSLATION EVALUATION WITH PART-OF-SPEECH INFO...
 
PPT-CCL: A Universal Phrase Tagset for Multilingual Treebanks
PPT-CCL: A Universal Phrase Tagset for Multilingual TreebanksPPT-CCL: A Universal Phrase Tagset for Multilingual Treebanks
PPT-CCL: A Universal Phrase Tagset for Multilingual Treebanks
 
NLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language ModelNLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language Model
 
The Effect of Translationese on Statistical Machine Translation
The Effect of Translationese on Statistical Machine TranslationThe Effect of Translationese on Statistical Machine Translation
The Effect of Translationese on Statistical Machine Translation
 
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmmUnit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
Unit-1 PPL PPTvvhvmmmmmmmmmmmmmmmmmmmmmm
 
Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4Artificial Intelligence Notes Unit 4
Artificial Intelligence Notes Unit 4
 
2-Chapter Two-N-gram Language Models.ppt
2-Chapter Two-N-gram Language Models.ppt2-Chapter Two-N-gram Language Models.ppt
2-Chapter Two-N-gram Language Models.ppt
 
Master defence 2020 - Anastasiia Khaburska - Statistical and Neural Language ...
Master defence 2020 - Anastasiia Khaburska - Statistical and Neural Language ...Master defence 2020 - Anastasiia Khaburska - Statistical and Neural Language ...
Master defence 2020 - Anastasiia Khaburska - Statistical and Neural Language ...
 
MT SUMMIT13.Language-independent Model for Machine Translation Evaluation wit...
MT SUMMIT13.Language-independent Model for Machine Translation Evaluation wit...MT SUMMIT13.Language-independent Model for Machine Translation Evaluation wit...
MT SUMMIT13.Language-independent Model for Machine Translation Evaluation wit...
 
Meta-evaluation of machine translation evaluation methods
Meta-evaluation of machine translation evaluation methodsMeta-evaluation of machine translation evaluation methods
Meta-evaluation of machine translation evaluation methods
 
Natural Language Processing Course in AI
Natural Language Processing Course in AINatural Language Processing Course in AI
Natural Language Processing Course in AI
 
TransQuest
TransQuestTransQuest
TransQuest
 
Thamme Gowda's PhD dissertation defense slides
Thamme Gowda's PhD dissertation defense slidesThamme Gowda's PhD dissertation defense slides
Thamme Gowda's PhD dissertation defense slides
 
Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...
 
Integration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translationIntegration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translation
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 

Mehr von Lifeng (Aaron) Han

WMT2022 Biomedical MT PPT: Logrus Global and Uni Manchester
WMT2022 Biomedical MT PPT: Logrus Global and Uni ManchesterWMT2022 Biomedical MT PPT: Logrus Global and Uni Manchester
WMT2022 Biomedical MT PPT: Logrus Global and Uni ManchesterLifeng (Aaron) Han
 
Measuring Uncertainty in Translation Quality Evaluation (TQE)
Measuring Uncertainty in Translation Quality Evaluation (TQE)Measuring Uncertainty in Translation Quality Evaluation (TQE)
Measuring Uncertainty in Translation Quality Evaluation (TQE)Lifeng (Aaron) Han
 
Meta-Evaluation of Translation Evaluation Methods: a systematic up-to-date ov...
Meta-Evaluation of Translation Evaluation Methods: a systematic up-to-date ov...Meta-Evaluation of Translation Evaluation Methods: a systematic up-to-date ov...
Meta-Evaluation of Translation Evaluation Methods: a systematic up-to-date ov...Lifeng (Aaron) Han
 
HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Profession...
HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Profession...HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Profession...
HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Profession...Lifeng (Aaron) Han
 
HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Professio...
 HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Professio... HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Professio...
HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Professio...Lifeng (Aaron) Han
 
Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...
Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...
Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...Lifeng (Aaron) Han
 
Apply chinese radicals into neural machine translation: deeper than character...
Apply chinese radicals into neural machine translation: deeper than character...Apply chinese radicals into neural machine translation: deeper than character...
Apply chinese radicals into neural machine translation: deeper than character...Lifeng (Aaron) Han
 
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...Lifeng (Aaron) Han
 
Chinese Character Decomposition for Neural MT with Multi-Word Expressions
Chinese Character Decomposition for  Neural MT with Multi-Word ExpressionsChinese Character Decomposition for  Neural MT with Multi-Word Expressions
Chinese Character Decomposition for Neural MT with Multi-Word ExpressionsLifeng (Aaron) Han
 
Build moses on ubuntu (64 bit) system in virtubox recorded by aaron _v2longer
Build moses on ubuntu (64 bit) system in virtubox recorded by aaron _v2longerBuild moses on ubuntu (64 bit) system in virtubox recorded by aaron _v2longer
Build moses on ubuntu (64 bit) system in virtubox recorded by aaron _v2longerLifeng (Aaron) Han
 
Detection of Verbal Multi-Word Expressions via Conditional Random Fields with...
Detection of Verbal Multi-Word Expressions via Conditional Random Fields with...Detection of Verbal Multi-Word Expressions via Conditional Random Fields with...
Detection of Verbal Multi-Word Expressions via Conditional Random Fields with...Lifeng (Aaron) Han
 
AlphaMWE: Construction of Multilingual Parallel Corpora with MWE Annotations ...
AlphaMWE: Construction of Multilingual Parallel Corpora with MWE Annotations ...AlphaMWE: Construction of Multilingual Parallel Corpora with MWE Annotations ...
AlphaMWE: Construction of Multilingual Parallel Corpora with MWE Annotations ...Lifeng (Aaron) Han
 
MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel Corpora
MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel CorporaMultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel Corpora
MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel CorporaLifeng (Aaron) Han
 
ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.
ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.
ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.Lifeng (Aaron) Han
 
A deep analysis of Multi-word Expression and Machine Translation
A deep analysis of Multi-word Expression and Machine TranslationA deep analysis of Multi-word Expression and Machine Translation
A deep analysis of Multi-word Expression and Machine TranslationLifeng (Aaron) Han
 
machine translation evaluation resources and methods: a survey
machine translation evaluation resources and methods: a surveymachine translation evaluation resources and methods: a survey
machine translation evaluation resources and methods: a surveyLifeng (Aaron) Han
 
Incorporating Chinese Radicals Into Neural Machine Translation: Deeper Than C...
Incorporating Chinese Radicals Into Neural Machine Translation: Deeper Than C...Incorporating Chinese Radicals Into Neural Machine Translation: Deeper Than C...
Incorporating Chinese Radicals Into Neural Machine Translation: Deeper Than C...Lifeng (Aaron) Han
 
Chinese Named Entity Recognition with Graph-based Semi-supervised Learning Model
Chinese Named Entity Recognition with Graph-based Semi-supervised Learning ModelChinese Named Entity Recognition with Graph-based Semi-supervised Learning Model
Chinese Named Entity Recognition with Graph-based Semi-supervised Learning ModelLifeng (Aaron) Han
 
Quality Estimation for Machine Translation Using the Joint Method of Evaluati...
Quality Estimation for Machine Translation Using the Joint Method of Evaluati...Quality Estimation for Machine Translation Using the Joint Method of Evaluati...
Quality Estimation for Machine Translation Using the Joint Method of Evaluati...Lifeng (Aaron) Han
 
PubhD talk: MT serving the society
PubhD talk: MT serving the societyPubhD talk: MT serving the society
PubhD talk: MT serving the societyLifeng (Aaron) Han
 

Mehr von Lifeng (Aaron) Han (20)

WMT2022 Biomedical MT PPT: Logrus Global and Uni Manchester
WMT2022 Biomedical MT PPT: Logrus Global and Uni ManchesterWMT2022 Biomedical MT PPT: Logrus Global and Uni Manchester
WMT2022 Biomedical MT PPT: Logrus Global and Uni Manchester
 
Measuring Uncertainty in Translation Quality Evaluation (TQE)
Measuring Uncertainty in Translation Quality Evaluation (TQE)Measuring Uncertainty in Translation Quality Evaluation (TQE)
Measuring Uncertainty in Translation Quality Evaluation (TQE)
 
Meta-Evaluation of Translation Evaluation Methods: a systematic up-to-date ov...
Meta-Evaluation of Translation Evaluation Methods: a systematic up-to-date ov...Meta-Evaluation of Translation Evaluation Methods: a systematic up-to-date ov...
Meta-Evaluation of Translation Evaluation Methods: a systematic up-to-date ov...
 
HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Profession...
HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Profession...HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Profession...
HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Profession...
 
HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Professio...
 HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Professio... HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Professio...
HOPE: A Task-Oriented and Human-Centric Evaluation Framework Using Professio...
 
Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...
Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...
Monte Carlo Modelling of Confidence Intervals in Translation Quality Evaluati...
 
Apply chinese radicals into neural machine translation: deeper than character...
Apply chinese radicals into neural machine translation: deeper than character...Apply chinese radicals into neural machine translation: deeper than character...
Apply chinese radicals into neural machine translation: deeper than character...
 
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
 
Chinese Character Decomposition for Neural MT with Multi-Word Expressions
Chinese Character Decomposition for  Neural MT with Multi-Word ExpressionsChinese Character Decomposition for  Neural MT with Multi-Word Expressions
Chinese Character Decomposition for Neural MT with Multi-Word Expressions
 
Build moses on ubuntu (64 bit) system in virtubox recorded by aaron _v2longer
Build moses on ubuntu (64 bit) system in virtubox recorded by aaron _v2longerBuild moses on ubuntu (64 bit) system in virtubox recorded by aaron _v2longer
Build moses on ubuntu (64 bit) system in virtubox recorded by aaron _v2longer
 
Detection of Verbal Multi-Word Expressions via Conditional Random Fields with...
Detection of Verbal Multi-Word Expressions via Conditional Random Fields with...Detection of Verbal Multi-Word Expressions via Conditional Random Fields with...
Detection of Verbal Multi-Word Expressions via Conditional Random Fields with...
 
AlphaMWE: Construction of Multilingual Parallel Corpora with MWE Annotations ...
AlphaMWE: Construction of Multilingual Parallel Corpora with MWE Annotations ...AlphaMWE: Construction of Multilingual Parallel Corpora with MWE Annotations ...
AlphaMWE: Construction of Multilingual Parallel Corpora with MWE Annotations ...
 
MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel Corpora
MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel CorporaMultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel Corpora
MultiMWE: Building a Multi-lingual Multi-Word Expression (MWE) Parallel Corpora
 
ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.
ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.
ADAPT Centre and My NLP journey: MT, MTE, QE, MWE, NER, Treebanks, Parsing.
 
A deep analysis of Multi-word Expression and Machine Translation
A deep analysis of Multi-word Expression and Machine TranslationA deep analysis of Multi-word Expression and Machine Translation
A deep analysis of Multi-word Expression and Machine Translation
 
machine translation evaluation resources and methods: a survey
machine translation evaluation resources and methods: a surveymachine translation evaluation resources and methods: a survey
machine translation evaluation resources and methods: a survey
 
Incorporating Chinese Radicals Into Neural Machine Translation: Deeper Than C...
Incorporating Chinese Radicals Into Neural Machine Translation: Deeper Than C...Incorporating Chinese Radicals Into Neural Machine Translation: Deeper Than C...
Incorporating Chinese Radicals Into Neural Machine Translation: Deeper Than C...
 
Chinese Named Entity Recognition with Graph-based Semi-supervised Learning Model
Chinese Named Entity Recognition with Graph-based Semi-supervised Learning ModelChinese Named Entity Recognition with Graph-based Semi-supervised Learning Model
Chinese Named Entity Recognition with Graph-based Semi-supervised Learning Model
 
Quality Estimation for Machine Translation Using the Joint Method of Evaluati...
Quality Estimation for Machine Translation Using the Joint Method of Evaluati...Quality Estimation for Machine Translation Using the Joint Method of Evaluati...
Quality Estimation for Machine Translation Using the Joint Method of Evaluati...
 
PubhD talk: MT serving the society
PubhD talk: MT serving the societyPubhD talk: MT serving the society
PubhD talk: MT serving the society
 

Pptphrase tagset mapping for french and english treebanks and its application in machine translation evaluation

  • 1. 25th International Conference, GSCL 2013 Aaron L.-F. Han, Derek F. Wong, Lidia S. Chao, Liangye He, Shuo Li, and Ling Zhu September 25th -27th, 2013, Darmstadt, Germany Natural Language Processing & Portuguese-Chinese Machine Translation Laboratory Department of Computer and Information Science University of Macau
  • 2.  Background of language Treebank  Motivation  Designed phrase tagset mapping  Application in MT evaluation 1. Manual evaluations 2. Traditional automatic MT evaluation methods 3. Designed unsupervised MT evaluation 4. Evaluating the evaluation method 5. Experiments 6. Open source code  Discussion  Further information
  • 3. • To promote the development of syntactic analysis • Many language treebanks are developed – English Penn Treebank (Marcus et al., 1993; Mitchell et al., 1994) – German Negra Treebank (Skut et al., 1997) – French Treebank (Abeillé et al., 2003) – Chinese Sinica Treebank (Chen et al., 2003) – Etc.
  • 4. • Problems – Different treebanks use their own syntactic tagsets – The number of tags ranging from tens (e.g. English Penn Treebank) to hundreds (e.g. Chinese Sinica Treebank) – Inconvenient when undertaking the multilingual or cross-lingual research
  • 5. • To bridge the gap between these treebanks and facilitate future research – E.g. the unsupervised induction of syntactic structure • Petrov et al. (2012) develop a universal POS tagset • How about the phrase level tags? • The disaccord problem in the phrase level tags remains unsolved – Let’s try to solve it
  • 6. • Tentative design of phrase tagset mapping – On English Penn Treebank I, II & French Treebank • 9 universal phrasal categories covering – 14 phrase tags in English Penn Treebank I – 26 phrase tags in English Penn Treebank II – 14 phrase tags in French Treebank
  • 7. Table 1: phrase tagset mapping for French and English treebanks
  • 8. • Universal phrasal categories: NP (noun phrase), VP (verb phrase), AJP (adjective phrase), AVP (adverbial phrase), PP (prepositional phrase), S (sub/- sentence), CONJP (conjunction phrase), COP (coordinated phrse), X (other phrases or unknown) • NP covering – French tags: NP – English tags: NP, NAC (the scope of certain prenominal modifiers within an NP), NX (within certain complex NPs to mark the head of NP), WHNP (wh-noun phrase), QP (quantifier phrase)
  • 9. • VP covering – French tags: VN (verbal nucleus), VP (infinitives and nonfinite clauses) – English tags: VP (verb phrase) • AJP covering – French tags: AP (adjectival phrase) – English tags: ADJP (adjective phrase), WHADJP (wh-adjective phrase)
  • 10. • AVP covering – French tags: AdP (adverbial phrases) – English tags: ADVP (adverb phrase), WHAVP (wh-adverb phrase), PRT (particle) • PP covering – French tags: PP – English tags: PP, WHPP (wh-propositional phrase phrase)
  • 11. • S covering – French tags: SENT (sentence), S (finite clause) – English tags: S (simple declarative clause), SBAR (clause introduced by a subordinating conjunction), SBARQ (direct question introduced by a wh-phrase), SINV (declarative sentence with subject-aux inversion), SQ (sub-constituent of SBARQ), PRN (parenthetical), FRAG (fragment), RRC (reduced relative clause). • CONJP covering – French tags: N/A – English tags: CONJP
  • 12. • COP covering – French tags: COORD (coordinated phrase) – English tags: UCP (coordinated phrases belonging to different categories) • X covering – French tags: unknown – English tags: X (unknown or uncertain), INTJ (interjection), LST (list marker)
  • 13. 4. Application in Machine Translation evaluation
  • 14. • Rapid development of Machine Translations – MT began as early as in the 1950s (Weaver, 1955) – Big progress science the 1990s due to the development of computers (storage capacity and computational power) and the enlarged bilingual corpora (Marino et al. 2006) • Difficulties of MT evaluation – language variability results in no single correct translation – the natural languages are highly ambiguous and different languages do not always express the same content in the same way (Arnold, 2003)
  • 15. • Traditional manual evaluation criteria: – intelligibility (measuring how understandable the sentence is) – fidelity (measuring how much information the translated sentence retains as compared to the original) by the Automatic Language Processing Advisory Committee (ALPAC) around 1966 (Carroll, 1966) – adequacy (similar as fidelity), fluency (whether the sentence is well-formed and fluent) and comprehension (improved intelligibility) by Defense Advanced Research Projects Agency (DARPA) of US (White et al., 1994)
  • 16. • Problems of manual evaluations : – Time-consuming – Expensive – Unrepeatable – Low agreement (Callison-Burch, et al., 2011)
  • 17. • Measuring the similarity of automatic translation and reference translation – Automatic translation (or hypothesis translation, target translation): by automatic MT system – Reference translation: by professional translators – Source language and source document: not used • Traditional automatic evaluation: – BLEU: n-gram precisions (Papineni et al., 2002) – TER: edit distances (Snover et al., 2006) – METEOR: precision and recall (Banerjee and Lavie, 2005)
  • 18. • Problems in supervised MT evaluation – Reference translations are expensive – Reference translations are not available is some cases • Could we get rid of the reference translation? – Unsupervised MT evaluation method – Extract information from source and target language – How to use the designed universal phrase tagset?
  • 19. • Assume that the translated sentence should have a similar set of phrase categories with the source sentence. – This design is inspired by the synonymous relation between source and target sentence. • Two sentences that have similar set of phrases may talk about different things. – However, this evaluation approach is not designed for general circumstance – Assume that the targeted sentences are indeed the translated sentences from the source document
  • 20. • First, we parse the source and target languages respectively • Then we extract the phrase set from the source and target sentences • Third, we convert the phrases into the developed universal phrase categories • Last, we measure the similarity of source and target language on the universal phrase sequences
  • 21. Figure 1: the parsed French and English sentence
  • 22. The level of extracted phrase tags: just the upper level of POS tags, bottom-up Figure 2: convert the extracted phrase into universal phrase tags
  • 23. • What is the similarity metric we employed? • Designed similarity metric: HPPR – N1 gram position order difference penalty – Weighted N2 gram precision – Weighted N3 gram recall – Weighted geometric mean in n-gram precision & recall – Weighted harmonic mean to combine sub-factors – The parameters are tunable according to different language pairs
  • 24. • 퐻푃푃푅 = 퐻푎푟(푤푃푠푁1푃푠퐷푖푓, 푤푃푟푁2푃푟푒, 푤푅푐푁3푅푒푐) • 퐻푃푃푅 = 푤푃푠+푤푃푟+푤푅푐 푤푃푠 푁1푃푠퐷푖푓 푤푃푟 푁2푃푟푒 + 푤푅푐 푁3푅푒푐 + • 푁1푃푠퐷푖푓, 푁2푃푟푒, and 푁3푅푒푐 are the corpus level scores of sub-factors position difference penalty, precision and recall.
  • 25. • The sentence level 푁1푃푠퐷푖푓 score: • 푁1푃푠퐷푖푓 = exp(−푁1푃퐷) 1 • 푁1푃퐷 = 퐿푒푛푔푡ℎℎ푦푝 Σ|푃퐷푖 | • 푃퐷푖 = |푃푠푁ℎ푦푝 − 푀푎푡푐ℎ푃푠푁푠푟푐 | • 푃푠푁ℎ푦푝 and 푀푎푡푐ℎ푃푠푁푠푟푐 are the position number of matching tag in the hypothesis and source sentence respectively. When no match for the tag: 푃퐷푖 = |푃푠푁ℎ푦푝 − 0|
  • 26. Figure 3: N1 gram tag alignment algorithm
  • 27. Figure 4: 푁1푃퐷 calculation example
  • 28. • Corpus-level weighted n-gram precision & recall • 푁2푃푟푒 = exp(Σ푁2 푤푛푙표푔푃푛) 푛=1 푁3 푤푛푙표푔푅푛) • 푁3푅푒푐 = exp(Σ푛=1 • 푃푛 = #푚푎푡푐ℎ푒푑 푛푔푟푎푚 푐ℎ푢푛푘푠 #푛푔푟푎푚 푐ℎ푢푛푘푠 표푓 ℎ푦푝표푡ℎ푒푠푖푠 푐표푟푝푢푠 • 푅푛 = #푚푎푡푐ℎ푒푑 푛푔푟푎푚 푐ℎ푢푛푘푠 #푛푔푟푎푚 푐ℎ푢푛푘푠 표푓 푠표푢푟푐푒 푐표푟푝푢푠
  • 29. Figure 5: bigram chunk matching example
  • 30. • How reliable is the automatic metric? • Evaluation criteria for evaluation metrics: – Human judgments are the golden to approach, currently – Correlation with human judgments (Callison-Burch, et al., 2011, 2012) • Spearman rank correlation coefficient rs: – 푟푠 푋푌 = 1 − 푛 푑푖 6 Σ푖=1 2 푛(푛2−1) – Two rank sequences 푋 = 푥1, … , 푥푛 , 푌 = {푦1, … , 푦푛}
  • 31. • Corpus from WMT – Workshop of statistical machine translation – SIGMT, ACL’S special interest group of machine translation • Training data (WMT11), tune the parameters – 3, 003 sentences for each document – 18 automatic French-to-English MT systems • Testing data (WMT12) – 3, 003 sentences for each document – 15 automatic French-to-English MT systems
  • 32. • Training, tune the parameters – N1, N2 and N3 are tuned as 2, 3 and 3 due to the fact that the 4-gram chunk match usually results in 0 score. – Tuned values of factor weights are shown in table Table 2: tuned parameter values
  • 33. • Comparisons with: – BLEU, measure the closeness of the hypothesis and reference translations, n-gram precision – TER, measure the editing distance of hypothesis to reference translations
  • 34. Table 3: training (development) scores on WMT11 corpus Table 4: testing scores on WMT12 corpus
  • 35. Table 5: correlation score intro (Cohen, 1988)  The experiment results on the development and testing corpora show that HPPR without using reference translations has yielded promising correlation scores (0.63 and 0.59 respectively).  There is still potential to improve the performances of all the three metrics, even though that the correlation scores which are higher than 0.5 are already considered as strong correlation as shown in Table 5.
  • 36. • Phrase Tagset Mapping for French and English Treebanks and Its Application in Machine Translation Evaluation – Aaron L.-F. Han, Derek F. Wong, Lidia S. Chao, Liangye He, Shuo Li, and Ling Zhu. GSCL 2013, Darmstadt, Germany. LNCS Vol. 8105, pp. 119-131, Volume Editors: Iryna Gurevych, Chris Biemann and Torsten Zesch. • Open source tool for phrase tagset mapping and HPPR similarity measuring algorithms: https://github.com/aaronlifenghan/aaron-project-hppr
  • 37. • Facilitate future research in multilingual or cross-lingual literature, this paper designs a phrase tags mapping between the French Treebank and the English Penn Treebank using 9 phrase categories. • One of the potential applications of the designed universal phrase tagset is shown in the unsupervised MT evaluation task in the experiment section.
  • 38. • There are still some limitations in this work to be addressed in the future. – The designed universal phrase categories may not be able to cover all the phrase tags of other language treebanks, so this tagset could be expanded when necessary. – The designed HPPR formula contains the n-gram factors of position difference, precision and recall, which may not be sufficient or suitable for some of the other language pairs, so different measuring factors should be added or switched when facing new tasks.
  • 39. • Actually speaking, the designed models are very related to the similarity measuring. Where we have employed them is in the MT evaluation. These works may be further developed into other literature: – information retrieval – question and answering – Searching – text analysis – etc.
  • 40. • Ongoing and further works: – The combination of translation and evaluation, tuning the translation model using evaluation metrics – Evaluation models from the perspective of semantics – The further explorations of unsupervised evaluation models, extracting other features from source and target languages • Aaron open source tools: https://github.com/aaronlifenghan • Aaron network Home: http://www.linkedin.com/in/aaronhan
  • 41. GSCL 2013, Darmstadt, Germany Aaron L.-F. Han email: hanlifengaaron AT gmail DOT com Natural Language Processing & Portuguese-Chinese Machine Translation Laboratory Department of Computer and Information Science University of Macau