SlideShare ist ein Scribd-Unternehmen logo
1 von 66
Downloaden Sie, um offline zu lesen
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models

Statistical Machine Translation
Part I: Khalil Sima’an
Data and Models
Universiteit van Amsterdam

Syntax

Part II: Trevor Cohn
Decoding and efficiency
University of Sheffield
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization

Statistical Machine Translation: PART I
Dr. Khalil Sima’an
Statistical Language Processing and Learning
Institute for Logic, Language and Computation
Universiteit van Amsterdam

Phrase-Based
Models
Limitations of
PB Models
Syntax

Some slides use figures from Philipp Koehn, Barry Haddow and Sophie Arnoult
Data and Models: Structure of lecture
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

General statistical framework
Word-based models: word alignments
Phrase-based models: phrase-alignments
Tree-based models: tree-alignments
Statistical Approach: Parallel Corpora
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an

Task: Translate a source sentence f to a target sentence e.
Data: Parallel corpus (source-target sentence pairs).
‫ﻧﺤﻦ ﺍﻟﺸﻌﻮﺏ‬

‫ﻭﺳﺎﺋﻂ ﺍﻹﻋﻼﻡ ﺍﻟﻤﺘﻌﺪﺩﺓ ﻭﺛﺎﺋﻖ ﻭﺧﺮﺍﺋﻂ ﻣﻨﺸﻮﺭﺍﺕ ﻭﻃﻮﺍﺑﻊ ﻭﻗﻮﺍﻋﺪ ﺍﻟﺒﻴﺎﻧﺎﺕ ﺍﻷﻣﻢ ﺍﻟﻤﺘﺤﺪﺓ ﺗﻌﻤﻞ ﺑﺤﺚ‬

Word-Based
Models

‫ﺍﻟﺴﻠﻢ ﻭﺍﻷﻣﻦ ﺍﻟﺘﻨﻤﻴﺔ ﺍﻻﻗﺘﺼﺎﺩﻳﺔ ﻭﺍﻻﺟﺘﻤﺎﻋﻴﺔ ﺣﻘﻮﻕ ﺍﻹﻧﺴﺎﻥ ﺍﻟﺸﺆﻭﻥ ﺍﻹﻧﺴﺎﻧﻴﺔ ﺍﻟﻘﺎﻧﻮﻥ ﺍﻟﺪﻭﻟﻲ‬
Daily Briefing│Press Releases│Radio, TV, Photo│Documents, Maps│Publications, Stamps, Databases│UN Works
Peace & Security│ Economic & Social Development │ Human Rights │ Humanitarian Affairs │ International Law

Welcome to the United Nations

Alignment
Symmetrization

Situation in the Middle East

UN News Centre

Situation in Iraq

About the United Nations

Renewing the United Nations
UN Action against Terrorism

Main Bodies

Issues on the UN Agenda

Conferences & Events

Phrase-Based
Models
Limitations of
PB Models
Syntax

‫ﺍﻷﻣﻴﻦ ﺍﻟﻌﺎﻡ‬

‫ﺍﻷﻫﺪﺍﻑ ﺍﻹﻧﻤﺎﺋﻴﺔ ﻟﻸﻣﻢ‬

‫ﺍﻟﺤﺎﻟﺔ ﻓﻲ ﺍﻟﺸﺮﻕ ﺍﻷﻭﺳﻂ‬

Secretary-General

UN Millennium
Development Goals

‫ﺍﻟﻤﺘﺤﺪﺓ‬

‫ﺍﻟﺤﺎﻟﺔ ﻓﻲ ﺍﻟﻌﺮﺍﻕ‬
‫ﻣﺮﻛﺰ ﺃﻧﺒﺎء ﺍﻷﻣﻢ ﺍﻟﻤﺘﺤﺪﺓ‬

‫ﺗﺠﺪﻳﺪ ﺍﻷﻣﻢ ﺍﻟﻤﺘﺤﺪﺓ‬
‫ﺍﻷﻣﻢ ﺍﻟﻤﺘﺤﺪﺓ ﻓﻲ ﻣﻮﺍﺟﻬﺔ ﺍﻹﺭﻫﺎﺏ‬
‫ﻗﻀﺎﻳﺎ ﻋﻠﻰ ﺟﺪﻭﻝ ﺃﻋﻤﺎﻝ ﺍﻷﻣﻢ ﺍﻟﻤﺘﺤﺪﺓ‬
‫ﺍﻟﻤﺠﺘﻤﻊ ﺍﻟﻤﺪﻧﻲ / ﺍﻟﻤﺆﺳﺴﺎﺕ‬

‫ﻣﻌﻠﻮﻣﺎﺕ ﻋﻦ ﺍﻷﻣﻢ ﺍﻟﻤﺘﺤﺪﺓ‬
‫ﺃﻫﻼ ﺑﻜﻢ ﻓﻲ ﻣﻮﻗﻊ ﺍﻷﻣﻢ‬

‫ﺍﻷﺟﻬﺰﺓ ﺍﻟﺮﺋﻴﺴﻴﺔ‬

‫ﺍﻟﻤﺘﺤﺪﺓ‬
‫ﻣﺆﺗﻤﺮﺍﺕ ﻭﻣﻨﺎﺳﺒﺎﺕ‬

‫ﺍﻟﺘﺠﺎﺭﻳﺔ‬

Civil Society & Business

Member States

UN Webcast
General Assembly President

CyberSchoolBus
International Conference on Financing for
Development, Doha, Qatar
29 November - 2 December 2008

Home │ Recent Additions │ Employment │ Procurement │ Comments │ Q&A │ UN System Sites │ Index │ Search

‫ | ﻋﺮﺑﻲ‬中文 | English | Français | Русский | Español
Copyright, United Nations, 2000-2008 | Terms of Use | Privacy Notice | Help

‫ﺍﻟﺒﺚ ﺍﻟﺸﺒﻜﻲ‬

‫ﺍﻟﺪﻭﻝ ﺍﻷﻋﻀﺎء‬

‫ﺍﻟﺤﺎﻓﻠﺔ ﺍﻟﻤﺪﺭﺳﻴﺔ ﺍﻹﻟﻜﺘﺮﻭﻧﻴﺔ‬

‫ﺭﺋﻴﺲ ﺍﻟﺠﻤﻌﻴﺔ ﺍﻟﻌﺎﻣﺔ‬
‫ﻣﺆﺗﻤﺮ ﺍﻟﻤﺘﺎﺑﻌﺔ ﺍﻟﺪﻭﻟﻲ ﻟﺘﻤﻮﻳﻞ ﺍﻟﺘﻨﻤﻴﺔ ﺍﻟﻤﻌﻨﻲ‬
‫ﺑﺎﺳﺘﻌﺮﺍﺽ ﺗﻨﻔﻴﺬ ﺗﻮﺍﻓﻖ ﺁﺭﺍء ﻣﻮﻧﺘﻴﺮﻱ‬
‫ﺍﻟﺪﻭﺣﺔ-ﻗﻄﺮ‬
‫92 ﺗﺸﺮﻳﻦ ﺍﻟﺜﺎﻧﻲ/ﻧﻮﻓﻤﺒﺮ - 2 ﻛﺎﻧﻮﻥ ﺍﻷﻭﻝ/ﺩﻳﺴﻤﺒﺮ‬
2008

‫ﺻﻔﺤﺔ ﺍﻻﺳﺘﻘﺒﺎﻝ ﺇﺿﺎﻓﺎﺕ ﺟﺪﻳﺪﺓ ﻓﺮﺹ ﻋﻤﻞ ﺷﻌﺒﺔ ﺍﻟﻤﺸﺘﺮﻳﺎﺕ ﻣﻼﺣﻈﺎﺕ ﺃﺳﺌﻠﺔ ﻭﺃﺟﻮﺑﺔ ﻣﻮﺍﻗﻊ ﻣﻨﻈﻮﻣﺔ ﺍﻷﻣﻢ ﺍﻟﻤﺘﺤﺪﺓ ﻓﻬﺮﺱ ﺍﻟﻤﻮﻗﻊ‬

‫ ﻋﺮﺑﻲ‬中文 English Français Русский Español
‫ﺻﻔﺤﺔ ﺍﻟﻤﺴﺎﻋﺪﺓ ﺷﺮﻭﻁ ﺍﺳﺘﺨﺪﺍﻡ ﺍﻟﻤﻮﻗﻊ ﺑﻴﺎﻥ ﺍﻟﺨﺼﻮﺻﻴﺎﺕ‬
2008-2000 ‫ﺣﻘﻮﻕ ﺍﻟﻄﺒﻊ ﺍﻷﻣﻢ ﺍﻟﻤﺘﺤﺪﺓ‬

Source-Channel Approach: IBM Models (1990’s)
Parallel Corpus Example
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an

Parallel corpus C = a collection of text-chunks and their
translations.
Parallel corpora are the by-product of human translation.
Every source chunk is paired with a target chunk.

Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models

Dutch
De prijs van het huis is gestegen.
Het huis kan worden verkocht.
Als het de marktprijs daalt zullen sommige
gezinnen een zware tijd doormaken.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.

English
The price of the house has risen.
The house can be sold.
If the market price goes down, some families
will go through difficult times.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.

Syntax

Hansards Canadian Parliament Proc. (English-French).
European Parliament Proc. (23 languages).
United Nations documents.
Newspapers: Chinese-English; Arabic-English; Urdu-English.
TAUS corpora.
Generative Source-Channel Framework
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models

Given source sentence f, select target sentence e
L.M.

T .M.

arg maxe∈E(f) { P(e | f) } = arg maxe∈E(f) { P(e) × P(f | e) }
Set E(f) is the set of hypothesized translations of f.
P(f | e): accounts for divergence in . . .
word order

Limitations of
PB Models

morphology

Syntax

syntactic relations
idiomatic ways of expression
.
.
.
How to estimate P(e | f)? Sparse-data problem!
Inducing The Structure of Translation Data
Tutorial to
Statistical
Machine
Translation

e = Mary did not slap the green witch .

????

Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

f = Maria no dio una bofetada a la bruja verde .
The latent structure of translation equivalence

Graphical representations ∆f and ∆e for f and e.
Relation a between ∆f and ∆e
arg maxe∈E(f) { P(e | f) } =
arg maxe∈E(f) {

∆f ,a,∆e

P(e, ∆f , ∆e , a | f) }

The difficult question: Which ∆f/e and a fit data best?
Structure in current models
Tutorial to
Statistical
Machine
Translation

a

∆f → ∆e

Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

In most current models structure of reordering:
∆f/e are structures over word positions.
a is an alignment between groups of word positions in
∆f and ∆e .
Challenge: Number of permutations of n words is n!
Structure shows translation units composing together
What are the atomic translation units?
How these compose together efficiently?
How to put probs. on these structures?
Structure helps combat sparsity and complexity
Structure in Existing Models: Sketch
l

Tutorial to
Statistical
Machine
Translation

uage

Lang

aj

Alignment a

Word-based

Dr Khalil
Sima’an

el on

Mod

e

el

e aj

e1

1

"translation dictionary"
m

j

fm
fj

1

f1

Word-Based
Models
Alignment
Symmetrization

Phrase-based

Phrase-Based
Models
Limitations of
PB Models

l
uage

aj

Syntax

Lang

1

Tree-based

Mod

el on

e

el

e aj

e1
m
j

fm
fj

1

f1

Problem: No sufficient stats to estimate P(e | f) from data
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Word-Based Models: Word Alignments
Some History and References
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Statistical models with word-alignments:
Brown, Cocke, Della Pietra, Della Pietra, Jelinek,
Lafferty, Mercer and Roossin. A statistical approach to
machine translation. Computational Linguistics, 1990.
Brown, Della Pietra, Della Pietra and Mercer. The
mathematics of statistical machine translation:
parameter estimation., Computational Linguistics,
1993.
Och and Ney: A Systematic Comparison of Various
Statistical Alignment Models. Computational
Linguistics, 2003.
Word-Based Models and Word-Alignment
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

a is a mapping between word positions.

a
e
f

∆f and ∆e are sequences of word positions.
m
l
e = e1 = e1 . . . el and f = f1 = f1 . . . fm
A hidden word-alignment a:
P(f | e) =

P(a, f | e)
a

Assume that a target word-position ei translates into
zero or more source word-positions
a : {posf } → ({pose } ∪ {0})
ai or a(i), i.e., word position in e with which fi is aligned.
Word Alignment Example
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax
Word Alignment Example
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

(Le reste appartenait aux autochtones |
Word Alignment Example: Not covered in this
setting
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax
Word Alignment Matrix Example
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax
Translation model with word alignment
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

arg maxe P(e | f) = arg maxe P(e)×P(f | e)
P(f | e) =

a P(a, f

| e) =

a P(a|

e) × P(f | a, e)

Questions
How to parametrize the model?
How are e, f and a composed from basic units?
How to train the model?
How to acquire word alignment?
How to translate with this model?
Decoding and computational issues (for second part)
Word-Alignment As Hidden Structure
Tutorial to
Statistical
Machine
Translation

l

ge M
ngua

La

aj

Dr Khalil
Sima’an

Alignment a

Alignment
Symmetrization

e aj

"translation dictionary"
m

j

Phrase-Based
Models
Limitations of
PB Models

el

e1

1

Word-Based
Models

on e
odel

fm
fj

1

Syntax

f1

We need to decompose
The alignment a and the length m: P(a | e)
“Translation dictionary” P(f | e, a)
Word Alignment Models: General Scheme
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models

Alignment of positions in f with positions in e:
m
a = a1 = a1 . . . am
Markov process over a
m m
l
P(a1 , f1 | e1 ) = P(m | e) ×
m

Alignment
Symmetrization

j=1

Phrase-Based
Models
Limitations of
PB Models
Syntax

j−1 j−1
j
j−1
P(aj | a1 , f1 , m, e) × P(fj | a1 , f1 , m, e)

In words: to generate alignment a and foreign sentence f
1
2

3

Choose a length m for f
Generate alignment aj given the preceding alignments,
words in f, m, and e
Generate word fj conditioned on structure so far and e.

IBM models are obtained by simplifications of this formula.
IBM Model I
Tutorial to
Statistical
Machine
Translation

m m
P(a1 , f1 | e1 . . . el ) = P(m | e) ×
m

Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models

j=1

IBM Model I:
Length: P(m | e) =≈ P(m | l) ≈ =

A fixed probability .

Align with uniform probability j with any aj in el1 or
j−1 j−1
NULL: P(aj | a1 , f1 , m, e) ≈ (l + 1)−1

Limitations of
PB Models
Syntax

j−1 j−1
j
j−1
P(aj | a1 , f1 , m, e) × P(fj | a1 , f1 , m, e)

Note that aj can be linked with l positions in e or with NULL.

Lexicon: lexicon parameters πt (f | e)
j
j−1
P(fj | a1 , f1 , m, e) ≈ P(fj | eaj ) = πt (fj | eaj )

Parameters:

and {πt (f | e) | f , e ∈ C}.
Sketch IBM Model I
Tutorial to
Statistical
Machine
Translation

l

Dr Khalil
Sima’an

L

age
angu

aj

Word-Based
Models

e

el

eaj

1

Alignment
Symmetrization

el on

Mod

e1

label every f position with an e position

choose length m given l
"translation dictionary"

Phrase-Based
Models

m

Limitations of
PB Models

j

fm

Syntax

fj

1

f1
IBM Model I Parameters and Data Likelhood
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an

Data Likelihood:

m
a1

Word-Based
Models

l

=

Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models

m m
P(a1 , f1 | e1 . . . el )

P(f |e) =

Parameters:

(l + 1)m

×

l

m

πt (fj | eaj )

...
a1 =0

am =0 j=1

and {πt (f | e) | f , e ∈ C}.

Fix , i.e., in practice put a uniform probability over a range [1..m], for some natural number m.

Syntax

Dilemma
To estimate these parameters we need word-alignment
To get word-alignment we need these parameters.
IBM Model II
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Extends IBM Model I at alignment probs:
m m
P(a1 , f1

m

| e1 . . . el ) ≈ ×

j−1 j−1
P(aj | a1 , f1 , m, e)×πt (fj | eaj )

j=1

IBM Model II: changes only one element in IBM Model I:
IBM Model I does not take into account the position of
words in both strings
j−1 j−1
P(aj | a1 , f1 , m, e) = P(aj | j, l, m) := πA (aj | j, l, m)

Where πA (.|.) are parameters to be learned from data.
IBM Models III, IV and V concentrate on more complex
alignments allowing, e.g., 1 − to − n (fertility)
IBM Model II Parameters
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

m m
P(a1 , f1 | e1 . . . el ) ≈ ×

m

πA (aj | j, l, m) × πt (fj | eaj )
j=1

Parameters: {πA (aj | j, l, m)} and {πt (fj | eaj )}
Dilemma
To estimate these parameters we need word-alignment
To get word-alignment we need these parameters.
Estimating Model Parameters
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Maximum-Likelihood Estimation of model M on parallel
corpus C
arg max P(C | m) = arg max
m∈M

m∈M

Pm (e | f)
e,f in C

Example IBM Model I:
Model M is defined by model parameters.
Data is incomplete: no closed form solution.
Expectation-Maximization (EM) sketch
Init: Set the parameters at some m0 and let i = 0
Repeat until convergence (in perplexity)
EMi (C) = C completed using estimate mi
EMi (C) contains mi -expectations over e, f, a : P(a | f, e)

mi+1 = Relative Frequency Estimates from EMi (C).
EM for Lexicon and Word Alignment Probs
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization

... la maison ... la maison blue ... la fleur ...

Phrase-Based
Models
 

... the house ... the blue house ... the flower ...

 

Limitations of
PB Models
Syntax
EM for Lexicon and Word Alignment Probs
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization

... la maison ... la maison blue ... la fleur ...

Phrase-Based
Models
Limitations of
PB Models
Syntax

 

 

... the house ... the blue house ... the flower ...
EM for Lexicon and Word Alignment Probs
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization

... la maison ... la maison bleu ... la fleur ...

Phrase-Based
Models
Limitations of
PB Models
Syntax

 

 

... the house ... the blue house ... the flower ...
EM for Lexicon and Word Alignment Probs
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization

... la maison ... la maison bleu ... la fleur ...

Phrase-Based
Models
Limitations of
PB Models
Syntax

 

 

... the house ... the blue house ... the flower ...
Translation Using EM Estimates
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Lexicon probability estimates: {πt (fj | eaj )}
ˆ
Alignment probabilities: {πA (aj | j, m, l)}
ˆ
Translation Model + Language Model + Decoder
arg maxe P(e | f) = arg maxe P(e) ×

P(a, f | e)
a
Viterbi Word-Alignment using EM estimates
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization

After EM has stabilized on estimates
{πt (fj | eaj )}
ˆ

For every f, e in C apply the following
m m
arg maxam P(a1 , f1 | e1 . . . el ) ≈

Phrase-Based
Models
Limitations of
PB Models
Syntax

and {πA (aj | j, m, l)}
ˆ

1

m

arg maxam
1

πA (aj | j, m, l) × πt (fj | eaj )
ˆ
ˆ

×
j=1
HMM Alignment Model: General Form
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

m m
P(a1 , f1 | e1 . . . el ) ≈ ×

m

j−1 j−1
P(aj | a1 , f1 , m, e)×πt (fj | eaj )

j=1

Words do not move independently of each other:
condition word movement on previous word movement
j−1 j−1
P(aj | a1 , f1 , m, e) ≈ P(aj | aj−1 , m)
IBM Model III (and IV): Example
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an

A hidden word-alignment a: P(f | e) =

a P(a, f

Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Estimate alignment + lexicon + reordering + fertility
parameters.

| e)
Word-based Models (Och  Ney 2003)
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax
Word-Alignment As Hidden Structure:
Sufficient?
Tutorial to
Statistical
Machine
Translation

l

ge M
ngua

La

aj

Dr Khalil
Sima’an

Alignment a

Alignment
Symmetrization

e aj

translation dictionary
m

j

Phrase-Based
Models
Limitations of
PB Models

el

e1

1

Word-Based
Models

on e
odel

fm
fj

1

Syntax

f1

We assumed alignment between words and dictionary:
Alignment a and the length m: P(a | e)
Dictionary P(f | e, a)
Limitations of Word-based Models
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Limitations of word-based translation:
Many-to-one and many-to-many is common:
“Makes more difficult”/bemoeilijkt “Dat richtte (hen)
ten gronde”/”That destroyed (them)”
Reordering takes place (often) by whole blocks.
Reordering individual words increases ambiguity.
“The (big heavy) cow/la vaca (pesada grande)”
Translation works by “fixed expressions” (idiomatic).
Concatenating word-translations increases ambiguity.
Estimates of P(f | e) by word-based models are inaccurate.
Instead of words as basic events: multi-word events in
corpus.
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Obtaining Symmetrized Word Alignments
Asymetric Alignments: Limitations
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization

Word-based models presented so far are based on
asymmetric word alignment.
Each position i in f is aligned with at most one position
in e: ai
What about such word alignments?

Phrase-Based
Models
Limitations of
PB Models
Syntax

Or when a word in f translated into two or more in e?
Symmetrization Heursitics
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Obtain Af→e and Ae→f
From Intersection Af→e ∩ Ae→f to Union Af→e ∪ Ae→f
Symmetrization Heursitics Algoritm
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Obtain Af→e and Ae→f
From Intersection Af→e ∩ Ae→f to Union Af→e ∪ Ae→f
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Phrase-based Models: Alignment between Phrases
Statistical “Memory-based” Translation
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Store arbitrary length source-target translation units
from training parallel corpus.
Translate new input by “covering” it with translation
units replayed from memory.
Idiomatic = Tiling: Phrase-Based SMT
Assume word-alignment a is given in parallel corpus.
Phrase-pair = contiguous source-target n, m -grams
that are translational equivalents under a.
Estimate phrase-pair probabilities Θ(f i | ei )
Translate f by “tiling it with phrases with order
permutation”
PBSMT some references
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Alignment-template Approach to Stat. Machine
Translation (RWTH Aachen 1999)
Phrase-based statistical machine translation (Zens,
Och and Ney 2002)
Phrase based SMT (Koehn, Och and Marcu 2003)
Joint Phrase-based SMT (Wang and Marcu 2005)
Statistical Machine Translation. (Ph. Koehn, Cambridge
University Press 2010)
Relation to EBMT 1984; Data-Oriented Translation (2000).
Phrase-Based Models: Conceptual
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models

T he president meets

Segment foreign sentence f
I
into I phrases f 1

economic officials

arg maxe P(e | f) = arg maxe P(e) × P(f | e)
P(f | e) =

I
f 1 ,eI
1

I

P(f 1 ,eI | e)
1

Limitations of
PB Models
Syntax

Saudi

I
i=1

P(f i | ei )×d(starti −endi−1 −1)

ph.table

arg maxe P(f | e) ≈ arg max

I
f 1 ,eI
1

Dist.reord.

I
Θ(f i | ei )×d(starti −endi−1 −1)
i=1

starti /endi are positions of first/last words of f i (translateing to ei ).
d(x) = αx exponentially decaying in words skipped (α ∈ (0, 1]).
Phrase-Based Models: Linear-interpolation
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

T he president meets

Segment foreign sentence f
I
into I phrases f 1

Saudi

economic officials

Log-linear interpolation of factors:
score(e|f) =

λf × log Hf (e, f)
f∈F

Where set F consists of:
Bag of phrases translation =

I
i=1 Θ(f i

| ei )

d() Phrases reordered with reordering model d(.)
lm Language model (5-grams or even 7-grams).
other Smoothing + length penalty terms.
Topics to discuss
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Phrase table extraction
Estimating {Θ(f i | ei )} and {Θ(ei | f i )}
Lexicalized and hierarchical phrase reordering models
Other: phrase, length penalty . . .
Log-linear interpolation and minimum error-rate training
Decoding and optimization
Extracting phrase pairs
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

A phrase pair f , e is consistent with alignment a iff
Non-empty: at least one alignment pair from a is in
f,e
No foreign positions inside f , e aligned to positions
outside it
No english positions inside f , e aligned to positions
outside it
Extracting phrase pairs
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax
Extracting phrase pairs
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax
Extracting phrase pairs
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax
Extracting phrases from permutations
Tutorial to
Statistical
Machine
Translation

1

3

4

2

5

1

3

4

2

5

1

2

3

4

5

1

2

3

4

5

1

3

4

2

5

1

3

4

2

5

1

2

3

4

5

1

2

3

4

5

1

3

4

2

5

1

3

4

2

5

1

2

3

4

5

1

2

3

4

5

Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax
Phrase pair weights
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an

Extract phrase pairs from corpus into multiset
Tab = { f , e , freq(f , e)}

Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Weights for f , e
Θ(f | e) =

freq(f ,e)
freq(f ,e)

f ,e ∈Tab

Θ(e | f ) =

freq(e,f )
freq(f ,e )

f ,e ∈Tab

Smoothing with lexical word alignment estimates from
IBM models
Distance-Based Reordering Sketch
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax
Lexicalized Reordering Sketch (Tillmann 2004)
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax
Limited generalization over parallel data (1)
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Non-productive Phrase Table: Phrase Variants?
Morphological e.g., changing inflection, agreement
Al$arekat Alhindiyya
the-companies the-Indian
the Indian companies

$areka hindiyya
company Indian
(an) Indian company

Syntactic e.g. adding adjective/proposition/adverbials
the fish in the deep sea swims

the fish swims

Reordering minor reordering of same words not allowed
In Arabic V-S-O and S-V-O are allowed.
Semantic e.g. synonyms, paraphrases
Non-productive Phrase Table = Data Sparseness
Limited generalization over parallel data (2)
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models

Reordering
Local, monotone, almost non-lexicalized reordering.
What about long range reordering?

Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Five phrases need to be reversed: see Chiang 2007 (J. CL).

Reordering target phrases with a coarse “source road
map”?
Limitations: Data-Sparseness
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Non-productive phrase table + Local, Uncharted reordering
⇓
Data-sparseness: Shorter phrases will apply down to word
level.
⇓
Shorter phrases combined assuming independence.
⇓
Target phrase selection hard due to large hypotheses lattice
Target Language Model = Only “GLUE” over target phrases.
The Shorter the Phrases, the Greater the Risk
Idiomatic Approach: GOOD, BAD and UGLY
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Phrases as atomic units
Good: Less ambiguity in lexical choice and reordering.
Match-Retrieve exactly is largely safe.
Bad: Weak generalization over data.
No phrase variants, weak reordering
Ugly: Fall-back on shorter phrases downto
word-to-word
LM as “glue” is insufficient.
Idiomatic approach does not alleviate data-sparseness

How Should We Translate Novel Phrases?
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Towards the land of bi-trees
Alignments between Tree Pairs, ITG, Hierarchical
Models and Syntax
Hidden Structure of Translation: Tree Pairs
Tutorial to
Statistical
Machine
Translation

l

Dr Khalil
Sima’an

on e

el

uag
Lang e aj

aj

Word-Based
Models

del
e Mo

1

e1

Alignment
Symmetrization
Phrase-Based
Models

m

Limitations of
PB Models

j

Syntax

fm
fj

1

f1
Reordering n words
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models

1

Permutations of n words: n!

3

4

1

2

3

5

4

5

Surely not all permutations are needed! (Wu 1995)
Use trees and allow permutations on the nodes?
There is an exponential number of trees in n
ITG hypothesis (Wu 1995)

Limitations of
PB Models
Syntax

2

[]
[]

Assume binary trees with two operations


[]
1 3

4 2

5

Phrase-based forms of ITG (Chiang 2005; 2007): Hiero
Syntax-Driven Phrase Translation
Tutorial to
Statistical
Machine
Translation

Syntax-driven Re-Ordering

Hierarchical (ITG)

Dr Khalil
Sima’an

Linguistic Syntax

XP

XP

Word-Based
Models

XP
XP

Alignment
Symmetrization
Phrase-Based
Models

1

3

4

1

2

3

2

4

5

5

Limitations of
PB Models
Is translation syntactically cohesive?
Syntax

Reordering == Moving children in parse tree?
Binary: monotone or inverted order at every node.
Lexical elements can be phrase pairs.
Covers word-alignments in parallel corpora?
Word order difference and syntax: Impression
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax
Phrase-based Hierarchical Model: Hiero
(Chiang 2005; 2007)
Tutorial to
Statistical
Machine
Translation

Extracting phrase-pairs with gaps (hierarchical trees):

Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

X → X1 dar una bufetada a X2 / X1 slap X2
X → Maria no X la bruja verde / Mary did not X the green witch
ITG with syntactic labels
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax
Tutorial to
Statistical
Machine
Translation
Dr Khalil
Sima’an
Word-Based
Models
Alignment
Symmetrization
Phrase-Based
Models
Limitations of
PB Models
Syntax

Part II: Trevor Cohn
Decoding algorithms and efficiency

Weitere ähnliche Inhalte

Was ist angesagt?

Statistical machine translation
Statistical machine translationStatistical machine translation
Statistical machine translationHrishikesh Nair
 
2013 ALC Boston: Your Trained Moses SMT System doesn't work. What can you do?
2013 ALC Boston: Your Trained Moses SMT System doesn't work. What can you do?2013 ALC Boston: Your Trained Moses SMT System doesn't work. What can you do?
2013 ALC Boston: Your Trained Moses SMT System doesn't work. What can you do?tauyou
 
9. Manuel Harranz (pangeanic) Hybrid Solutions for Translation
9. Manuel Harranz (pangeanic) Hybrid Solutions for Translation9. Manuel Harranz (pangeanic) Hybrid Solutions for Translation
9. Manuel Harranz (pangeanic) Hybrid Solutions for TranslationRIILP
 
Experiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine TranslationExperiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine Translationkhyati gupta
 
Integration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translationIntegration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translationChamani Shiranthika
 
Parallel Corpora in (Machine) Translation: goals, issues and methodologies
Parallel Corpora in (Machine) Translation: goals, issues and methodologiesParallel Corpora in (Machine) Translation: goals, issues and methodologies
Parallel Corpora in (Machine) Translation: goals, issues and methodologiesAntonio Toral
 
AN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATION
AN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATIONAN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATION
AN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATIONcscpconf
 
Moore_slides.ppt
Moore_slides.pptMoore_slides.ppt
Moore_slides.pptbutest
 
S URVEY O N M ACHINE T RANSLITERATION A ND M ACHINE L EARNING M ODELS
S URVEY  O N M ACHINE  T RANSLITERATION A ND  M ACHINE L EARNING M ODELSS URVEY  O N M ACHINE  T RANSLITERATION A ND  M ACHINE L EARNING M ODELS
S URVEY O N M ACHINE T RANSLITERATION A ND M ACHINE L EARNING M ODELSijnlc
 
Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...Rajnish Raj
 
MACHINE-DRIVEN TEXT ANALYSIS
MACHINE-DRIVEN TEXT ANALYSISMACHINE-DRIVEN TEXT ANALYSIS
MACHINE-DRIVEN TEXT ANALYSISMassimo Schenone
 
Parts of Speect Tagging
Parts of Speect TaggingParts of Speect Tagging
Parts of Speect Taggingtheyaseen51
 
NLP pipeline in machine translation
NLP pipeline in machine translationNLP pipeline in machine translation
NLP pipeline in machine translationMarcis Pinnis
 
Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...John Tinsley
 
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...Marcin Junczys-Dowmunt
 
Statistical machine translation for indian language copy
Statistical machine translation for indian language   copyStatistical machine translation for indian language   copy
Statistical machine translation for indian language copyNakul Sharma
 

Was ist angesagt? (20)

SMT3
SMT3SMT3
SMT3
 
Statistical machine translation
Statistical machine translationStatistical machine translation
Statistical machine translation
 
2013 ALC Boston: Your Trained Moses SMT System doesn't work. What can you do?
2013 ALC Boston: Your Trained Moses SMT System doesn't work. What can you do?2013 ALC Boston: Your Trained Moses SMT System doesn't work. What can you do?
2013 ALC Boston: Your Trained Moses SMT System doesn't work. What can you do?
 
9. Manuel Harranz (pangeanic) Hybrid Solutions for Translation
9. Manuel Harranz (pangeanic) Hybrid Solutions for Translation9. Manuel Harranz (pangeanic) Hybrid Solutions for Translation
9. Manuel Harranz (pangeanic) Hybrid Solutions for Translation
 
Experiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine TranslationExperiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine Translation
 
Integration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translationIntegration of speech recognition with computer assisted translation
Integration of speech recognition with computer assisted translation
 
Parallel Corpora in (Machine) Translation: goals, issues and methodologies
Parallel Corpora in (Machine) Translation: goals, issues and methodologiesParallel Corpora in (Machine) Translation: goals, issues and methodologies
Parallel Corpora in (Machine) Translation: goals, issues and methodologies
 
AN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATION
AN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATIONAN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATION
AN ADVANCED APPROACH FOR RULE BASED ENGLISH TO BENGALI MACHINE TRANSLATION
 
Moore_slides.ppt
Moore_slides.pptMoore_slides.ppt
Moore_slides.ppt
 
S URVEY O N M ACHINE T RANSLITERATION A ND M ACHINE L EARNING M ODELS
S URVEY  O N M ACHINE  T RANSLITERATION A ND  M ACHINE L EARNING M ODELSS URVEY  O N M ACHINE  T RANSLITERATION A ND  M ACHINE L EARNING M ODELS
S URVEY O N M ACHINE T RANSLITERATION A ND M ACHINE L EARNING M ODELS
 
Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...
 
NLP
NLPNLP
NLP
 
MACHINE-DRIVEN TEXT ANALYSIS
MACHINE-DRIVEN TEXT ANALYSISMACHINE-DRIVEN TEXT ANALYSIS
MACHINE-DRIVEN TEXT ANALYSIS
 
Parts of Speect Tagging
Parts of Speect TaggingParts of Speect Tagging
Parts of Speect Tagging
 
NLP pipeline in machine translation
NLP pipeline in machine translationNLP pipeline in machine translation
NLP pipeline in machine translation
 
Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...
 
BERT
BERTBERT
BERT
 
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...
Automatic Grammatical Error Correction for ESL-Learners by SMT - Getting it r...
 
Statistical machine translation for indian language copy
Statistical machine translation for indian language   copyStatistical machine translation for indian language   copy
Statistical machine translation for indian language copy
 
Arabic MT Project
Arabic MT ProjectArabic MT Project
Arabic MT Project
 

Andere mochten auch

Error Analysis of Rule-based Machine Translation Outputs
Error Analysis of Rule-based Machine Translation OutputsError Analysis of Rule-based Machine Translation Outputs
Error Analysis of Rule-based Machine Translation OutputsParisa Niksefat
 
Multi lingual corpus for machine aided translation
Multi lingual corpus for machine aided translationMulti lingual corpus for machine aided translation
Multi lingual corpus for machine aided translationAashna Phanda
 
Traductores de Nicaragua (505)2289-4596
Traductores de Nicaragua  (505)2289-4596Traductores de Nicaragua  (505)2289-4596
Traductores de Nicaragua (505)2289-4596Rolando Tellez
 
A review on Google Translator
A review on Google TranslatorA review on Google Translator
A review on Google TranslatorSylvia
 
Google translator
Google translatorGoogle translator
Google translatorLaura P
 
Human vs machine translation
Human  vs machine translationHuman  vs machine translation
Human vs machine translationJeff Hernandez
 
Mixing Computer-Assisted Translation and Machine Translation
Mixing Computer-Assisted Translation and Machine TranslationMixing Computer-Assisted Translation and Machine Translation
Mixing Computer-Assisted Translation and Machine Translationallinportuguese
 
Machine translation vs human translation
Machine translation vs human translationMachine translation vs human translation
Machine translation vs human translationLanguages Pro
 
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...Kotaro Hara
 
Machine Translation And Computer Assisted Translation
Machine Translation And Computer Assisted TranslationMachine Translation And Computer Assisted Translation
Machine Translation And Computer Assisted TranslationTeritaa
 
Google Translate Update
Google Translate UpdateGoogle Translate Update
Google Translate Updatemrsvogel
 
Machine Translation=Google Translator
Machine Translation=Google TranslatorMachine Translation=Google Translator
Machine Translation=Google TranslatorNerea
 
Types of machine translation
Types of machine translationTypes of machine translation
Types of machine translationRushdi Shams
 
Machine Translation: What it is?
Machine Translation: What it is?Machine Translation: What it is?
Machine Translation: What it is?Multilizer
 

Andere mochten auch (17)

Error Analysis of Rule-based Machine Translation Outputs
Error Analysis of Rule-based Machine Translation OutputsError Analysis of Rule-based Machine Translation Outputs
Error Analysis of Rule-based Machine Translation Outputs
 
Multi lingual corpus for machine aided translation
Multi lingual corpus for machine aided translationMulti lingual corpus for machine aided translation
Multi lingual corpus for machine aided translation
 
Traductores de Nicaragua (505)2289-4596
Traductores de Nicaragua  (505)2289-4596Traductores de Nicaragua  (505)2289-4596
Traductores de Nicaragua (505)2289-4596
 
A review on Google Translator
A review on Google TranslatorA review on Google Translator
A review on Google Translator
 
Google translator
Google translatorGoogle translator
Google translator
 
Human vs machine translation
Human  vs machine translationHuman  vs machine translation
Human vs machine translation
 
Mixing Computer-Assisted Translation and Machine Translation
Mixing Computer-Assisted Translation and Machine TranslationMixing Computer-Assisted Translation and Machine Translation
Mixing Computer-Assisted Translation and Machine Translation
 
Machine translation vs human translation
Machine translation vs human translationMachine translation vs human translation
Machine translation vs human translation
 
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
Effect of Machine Translation in Interlingual Conversation: Lessons from a Fo...
 
Machine Translation And Computer Assisted Translation
Machine Translation And Computer Assisted TranslationMachine Translation And Computer Assisted Translation
Machine Translation And Computer Assisted Translation
 
Google Translate Update
Google Translate UpdateGoogle Translate Update
Google Translate Update
 
Google translate
Google translateGoogle translate
Google translate
 
Machine Translation=Google Translator
Machine Translation=Google TranslatorMachine Translation=Google Translator
Machine Translation=Google Translator
 
Machine translation
Machine translationMachine translation
Machine translation
 
Machine Translation
Machine TranslationMachine Translation
Machine Translation
 
Types of machine translation
Types of machine translationTypes of machine translation
Types of machine translation
 
Machine Translation: What it is?
Machine Translation: What it is?Machine Translation: What it is?
Machine Translation: What it is?
 

Ähnlich wie 6. Khalil Sima'an (UVA) Statistical Machine Translation

Moses Statistical Machine Translation tool
Moses Statistical Machine Translation toolMoses Statistical Machine Translation tool
Moses Statistical Machine Translation toolyashothara shanmugarajah
 
How to expand your nlp solution to new languages using transfer learning
How to expand your nlp solution to new languages using transfer learningHow to expand your nlp solution to new languages using transfer learning
How to expand your nlp solution to new languages using transfer learningLena Shakurova
 
Pbsmt presenation waleed_oransa_29_april2010
Pbsmt presenation waleed_oransa_29_april2010Pbsmt presenation waleed_oransa_29_april2010
Pbsmt presenation waleed_oransa_29_april2010woransa
 
Machine translation from English to Hindi
Machine translation from English to HindiMachine translation from English to Hindi
Machine translation from English to HindiRajat Jain
 
NEURAL SYMBOLIC ARABIC PARAPHRASING WITH AUTOMATIC EVALUATION
NEURAL SYMBOLIC ARABIC PARAPHRASING WITH AUTOMATIC EVALUATIONNEURAL SYMBOLIC ARABIC PARAPHRASING WITH AUTOMATIC EVALUATION
NEURAL SYMBOLIC ARABIC PARAPHRASING WITH AUTOMATIC EVALUATIONcscpconf
 
Using Embeddings to Understand the Variance and Evolution of Data Science... ...
Using Embeddings to Understand the Variance and Evolution of Data Science... ...Using Embeddings to Understand the Variance and Evolution of Data Science... ...
Using Embeddings to Understand the Variance and Evolution of Data Science... ...PyData
 
Embracing diversity searching over multiple languages
Embracing diversity  searching over multiple languagesEmbracing diversity  searching over multiple languages
Embracing diversity searching over multiple languagesSuneel Marthi
 
Jérémy Ferrero - 2017 - Using Word Embedding for Cross-Language Plagiarism ...
Jérémy Ferrero - 2017 - Using Word Embedding for Cross-Language Plagiarism ...Jérémy Ferrero - 2017 - Using Word Embedding for Cross-Language Plagiarism ...
Jérémy Ferrero - 2017 - Using Word Embedding for Cross-Language Plagiarism ...Association for Computational Linguistics
 
Mt summit2015 jdu_v2
Mt summit2015 jdu_v2Mt summit2015 jdu_v2
Mt summit2015 jdu_v2Jinhua Du
 
A deep analysis of Multi-word Expression and Machine Translation
A deep analysis of Multi-word Expression and Machine TranslationA deep analysis of Multi-word Expression and Machine Translation
A deep analysis of Multi-word Expression and Machine TranslationLifeng (Aaron) Han
 
ESR2 Santanu Pal - EXPERT Summer School - Malaga 2015
ESR2 Santanu Pal - EXPERT Summer School - Malaga 2015ESR2 Santanu Pal - EXPERT Summer School - Malaga 2015
ESR2 Santanu Pal - EXPERT Summer School - Malaga 2015RIILP
 
07-Effect-Of-Machine-Translation-In-Interlingual-Conversation.pdf
07-Effect-Of-Machine-Translation-In-Interlingual-Conversation.pdf07-Effect-Of-Machine-Translation-In-Interlingual-Conversation.pdf
07-Effect-Of-Machine-Translation-In-Interlingual-Conversation.pdfsimonp16
 
Extraction of domain-specific bilingual lexicon from comparable corpora: comp...
Extraction of domain-specific bilingual lexicon from comparable corpora: comp...Extraction of domain-specific bilingual lexicon from comparable corpora: comp...
Extraction of domain-specific bilingual lexicon from comparable corpora: comp...Estelle Delpech
 
Tamil-English Document Translation Using Statistical Machine Translation Appr...
Tamil-English Document Translation Using Statistical Machine Translation Appr...Tamil-English Document Translation Using Statistical Machine Translation Appr...
Tamil-English Document Translation Using Statistical Machine Translation Appr...baskaran_md
 
Real-time DirectTranslation System for Sinhala and Tamil Languages.
Real-time DirectTranslation System for Sinhala and Tamil Languages.Real-time DirectTranslation System for Sinhala and Tamil Languages.
Real-time DirectTranslation System for Sinhala and Tamil Languages.Sheeyam Shellvacumar
 

Ähnlich wie 6. Khalil Sima'an (UVA) Statistical Machine Translation (20)

Miguel Rios - 2015 - Obtaining SMT dictionaries for related languages
Miguel Rios - 2015 - Obtaining SMT dictionaries for related languagesMiguel Rios - 2015 - Obtaining SMT dictionaries for related languages
Miguel Rios - 2015 - Obtaining SMT dictionaries for related languages
 
Moses Statistical Machine Translation tool
Moses Statistical Machine Translation toolMoses Statistical Machine Translation tool
Moses Statistical Machine Translation tool
 
How to expand your nlp solution to new languages using transfer learning
How to expand your nlp solution to new languages using transfer learningHow to expand your nlp solution to new languages using transfer learning
How to expand your nlp solution to new languages using transfer learning
 
Pbsmt presenation waleed_oransa_29_april2010
Pbsmt presenation waleed_oransa_29_april2010Pbsmt presenation waleed_oransa_29_april2010
Pbsmt presenation waleed_oransa_29_april2010
 
Machine translation from English to Hindi
Machine translation from English to HindiMachine translation from English to Hindi
Machine translation from English to Hindi
 
NEURAL SYMBOLIC ARABIC PARAPHRASING WITH AUTOMATIC EVALUATION
NEURAL SYMBOLIC ARABIC PARAPHRASING WITH AUTOMATIC EVALUATIONNEURAL SYMBOLIC ARABIC PARAPHRASING WITH AUTOMATIC EVALUATION
NEURAL SYMBOLIC ARABIC PARAPHRASING WITH AUTOMATIC EVALUATION
 
Using Embeddings to Understand the Variance and Evolution of Data Science... ...
Using Embeddings to Understand the Variance and Evolution of Data Science... ...Using Embeddings to Understand the Variance and Evolution of Data Science... ...
Using Embeddings to Understand the Variance and Evolution of Data Science... ...
 
Embracing diversity searching over multiple languages
Embracing diversity  searching over multiple languagesEmbracing diversity  searching over multiple languages
Embracing diversity searching over multiple languages
 
Anabela Barreiro - Alinhamentos
Anabela Barreiro - AlinhamentosAnabela Barreiro - Alinhamentos
Anabela Barreiro - Alinhamentos
 
Cross language alignments - challenges guidelines and gold sets
Cross language alignments - challenges guidelines and gold setsCross language alignments - challenges guidelines and gold sets
Cross language alignments - challenges guidelines and gold sets
 
Jérémy Ferrero - 2017 - Using Word Embedding for Cross-Language Plagiarism ...
Jérémy Ferrero - 2017 - Using Word Embedding for Cross-Language Plagiarism ...Jérémy Ferrero - 2017 - Using Word Embedding for Cross-Language Plagiarism ...
Jérémy Ferrero - 2017 - Using Word Embedding for Cross-Language Plagiarism ...
 
Mt summit2015 jdu_v2
Mt summit2015 jdu_v2Mt summit2015 jdu_v2
Mt summit2015 jdu_v2
 
A deep analysis of Multi-word Expression and Machine Translation
A deep analysis of Multi-word Expression and Machine TranslationA deep analysis of Multi-word Expression and Machine Translation
A deep analysis of Multi-word Expression and Machine Translation
 
ESR2 Santanu Pal - EXPERT Summer School - Malaga 2015
ESR2 Santanu Pal - EXPERT Summer School - Malaga 2015ESR2 Santanu Pal - EXPERT Summer School - Malaga 2015
ESR2 Santanu Pal - EXPERT Summer School - Malaga 2015
 
07-Effect-Of-Machine-Translation-In-Interlingual-Conversation.pdf
07-Effect-Of-Machine-Translation-In-Interlingual-Conversation.pdf07-Effect-Of-Machine-Translation-In-Interlingual-Conversation.pdf
07-Effect-Of-Machine-Translation-In-Interlingual-Conversation.pdf
 
Extraction of domain-specific bilingual lexicon from comparable corpora: comp...
Extraction of domain-specific bilingual lexicon from comparable corpora: comp...Extraction of domain-specific bilingual lexicon from comparable corpora: comp...
Extraction of domain-specific bilingual lexicon from comparable corpora: comp...
 
Parafraseo-Chenggang.pdf
Parafraseo-Chenggang.pdfParafraseo-Chenggang.pdf
Parafraseo-Chenggang.pdf
 
Tamil-English Document Translation Using Statistical Machine Translation Appr...
Tamil-English Document Translation Using Statistical Machine Translation Appr...Tamil-English Document Translation Using Statistical Machine Translation Appr...
Tamil-English Document Translation Using Statistical Machine Translation Appr...
 
evonlp_slides.pdf
evonlp_slides.pdfevonlp_slides.pdf
evonlp_slides.pdf
 
Real-time DirectTranslation System for Sinhala and Tamil Languages.
Real-time DirectTranslation System for Sinhala and Tamil Languages.Real-time DirectTranslation System for Sinhala and Tamil Languages.
Real-time DirectTranslation System for Sinhala and Tamil Languages.
 

Mehr von RIILP

Gabriella Gonzalez - eTRAD
Gabriella Gonzalez - eTRAD Gabriella Gonzalez - eTRAD
Gabriella Gonzalez - eTRAD RIILP
 
Manuel Herranz - Pangeanic
Manuel Herranz - Pangeanic Manuel Herranz - Pangeanic
Manuel Herranz - Pangeanic RIILP
 
Carla Parra Escartin - ER2 Hermes Traducciones
Carla Parra Escartin - ER2 Hermes Traducciones Carla Parra Escartin - ER2 Hermes Traducciones
Carla Parra Escartin - ER2 Hermes Traducciones RIILP
 
Juanjo Arevelillo - Hermes Traducciones
Juanjo Arevelillo - Hermes Traducciones Juanjo Arevelillo - Hermes Traducciones
Juanjo Arevelillo - Hermes Traducciones RIILP
 
Gianluca Giulinin - FAO
Gianluca Giulinin - FAO Gianluca Giulinin - FAO
Gianluca Giulinin - FAO RIILP
 
Lianet Sepulveda & Alexander Raginsky - ER 3a & ER 3b Pangeanic
Lianet Sepulveda & Alexander Raginsky - ER 3a & ER 3b Pangeanic Lianet Sepulveda & Alexander Raginsky - ER 3a & ER 3b Pangeanic
Lianet Sepulveda & Alexander Raginsky - ER 3a & ER 3b Pangeanic RIILP
 
Tony O'Dowd - KantanMT
Tony O'Dowd -  KantanMT Tony O'Dowd -  KantanMT
Tony O'Dowd - KantanMT RIILP
 
Santanu Pal - ESR 2 USAAR
Santanu Pal - ESR 2 USAARSantanu Pal - ESR 2 USAAR
Santanu Pal - ESR 2 USAARRIILP
 
Chris Hokamp - ESR 9 DCU
Chris Hokamp - ESR 9 DCU Chris Hokamp - ESR 9 DCU
Chris Hokamp - ESR 9 DCU RIILP
 
Anna Zaretskaya - ESR 1 UMA
Anna Zaretskaya - ESR 1 UMAAnna Zaretskaya - ESR 1 UMA
Anna Zaretskaya - ESR 1 UMARIILP
 
Carolina Scarton - ESR 7 - USFD
Carolina Scarton - ESR 7 - USFD  Carolina Scarton - ESR 7 - USFD
Carolina Scarton - ESR 7 - USFD RIILP
 
Rohit Gupta - ESR 4 - UoW
Rohit Gupta - ESR 4 - UoW Rohit Gupta - ESR 4 - UoW
Rohit Gupta - ESR 4 - UoW RIILP
 
Hernani Costa - ESR 3 - UMA
Hernani Costa - ESR 3 - UMA Hernani Costa - ESR 3 - UMA
Hernani Costa - ESR 3 - UMA RIILP
 
Liangyou Li - ESR 8 - DCU
Liangyou Li - ESR 8 - DCU Liangyou Li - ESR 8 - DCU
Liangyou Li - ESR 8 - DCU RIILP
 
Liling Tan - ESR 5 USAAR
Liling Tan - ESR 5 USAARLiling Tan - ESR 5 USAAR
Liling Tan - ESR 5 USAARRIILP
 
Sandra de luca - Acclaro
Sandra de luca - AcclaroSandra de luca - Acclaro
Sandra de luca - AcclaroRIILP
 
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015RIILP
 
ESR1 Anna Zaretskaya - EXPERT Summer School - Malaga 2015
ESR1 Anna Zaretskaya - EXPERT Summer School - Malaga 2015ESR1 Anna Zaretskaya - EXPERT Summer School - Malaga 2015
ESR1 Anna Zaretskaya - EXPERT Summer School - Malaga 2015RIILP
 
ESR3 Hernani Costa - EXPERT Summer School - Malaga 2015
ESR3 Hernani Costa - EXPERT Summer School - Malaga 2015ESR3 Hernani Costa - EXPERT Summer School - Malaga 2015
ESR3 Hernani Costa - EXPERT Summer School - Malaga 2015RIILP
 
ESR4 Rohit Gupta - EXPERT Summer School - Malaga 2015
ESR4 Rohit Gupta - EXPERT Summer School - Malaga 2015ESR4 Rohit Gupta - EXPERT Summer School - Malaga 2015
ESR4 Rohit Gupta - EXPERT Summer School - Malaga 2015RIILP
 

Mehr von RIILP (20)

Gabriella Gonzalez - eTRAD
Gabriella Gonzalez - eTRAD Gabriella Gonzalez - eTRAD
Gabriella Gonzalez - eTRAD
 
Manuel Herranz - Pangeanic
Manuel Herranz - Pangeanic Manuel Herranz - Pangeanic
Manuel Herranz - Pangeanic
 
Carla Parra Escartin - ER2 Hermes Traducciones
Carla Parra Escartin - ER2 Hermes Traducciones Carla Parra Escartin - ER2 Hermes Traducciones
Carla Parra Escartin - ER2 Hermes Traducciones
 
Juanjo Arevelillo - Hermes Traducciones
Juanjo Arevelillo - Hermes Traducciones Juanjo Arevelillo - Hermes Traducciones
Juanjo Arevelillo - Hermes Traducciones
 
Gianluca Giulinin - FAO
Gianluca Giulinin - FAO Gianluca Giulinin - FAO
Gianluca Giulinin - FAO
 
Lianet Sepulveda & Alexander Raginsky - ER 3a & ER 3b Pangeanic
Lianet Sepulveda & Alexander Raginsky - ER 3a & ER 3b Pangeanic Lianet Sepulveda & Alexander Raginsky - ER 3a & ER 3b Pangeanic
Lianet Sepulveda & Alexander Raginsky - ER 3a & ER 3b Pangeanic
 
Tony O'Dowd - KantanMT
Tony O'Dowd -  KantanMT Tony O'Dowd -  KantanMT
Tony O'Dowd - KantanMT
 
Santanu Pal - ESR 2 USAAR
Santanu Pal - ESR 2 USAARSantanu Pal - ESR 2 USAAR
Santanu Pal - ESR 2 USAAR
 
Chris Hokamp - ESR 9 DCU
Chris Hokamp - ESR 9 DCU Chris Hokamp - ESR 9 DCU
Chris Hokamp - ESR 9 DCU
 
Anna Zaretskaya - ESR 1 UMA
Anna Zaretskaya - ESR 1 UMAAnna Zaretskaya - ESR 1 UMA
Anna Zaretskaya - ESR 1 UMA
 
Carolina Scarton - ESR 7 - USFD
Carolina Scarton - ESR 7 - USFD  Carolina Scarton - ESR 7 - USFD
Carolina Scarton - ESR 7 - USFD
 
Rohit Gupta - ESR 4 - UoW
Rohit Gupta - ESR 4 - UoW Rohit Gupta - ESR 4 - UoW
Rohit Gupta - ESR 4 - UoW
 
Hernani Costa - ESR 3 - UMA
Hernani Costa - ESR 3 - UMA Hernani Costa - ESR 3 - UMA
Hernani Costa - ESR 3 - UMA
 
Liangyou Li - ESR 8 - DCU
Liangyou Li - ESR 8 - DCU Liangyou Li - ESR 8 - DCU
Liangyou Li - ESR 8 - DCU
 
Liling Tan - ESR 5 USAAR
Liling Tan - ESR 5 USAARLiling Tan - ESR 5 USAAR
Liling Tan - ESR 5 USAAR
 
Sandra de luca - Acclaro
Sandra de luca - AcclaroSandra de luca - Acclaro
Sandra de luca - Acclaro
 
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015
 
ESR1 Anna Zaretskaya - EXPERT Summer School - Malaga 2015
ESR1 Anna Zaretskaya - EXPERT Summer School - Malaga 2015ESR1 Anna Zaretskaya - EXPERT Summer School - Malaga 2015
ESR1 Anna Zaretskaya - EXPERT Summer School - Malaga 2015
 
ESR3 Hernani Costa - EXPERT Summer School - Malaga 2015
ESR3 Hernani Costa - EXPERT Summer School - Malaga 2015ESR3 Hernani Costa - EXPERT Summer School - Malaga 2015
ESR3 Hernani Costa - EXPERT Summer School - Malaga 2015
 
ESR4 Rohit Gupta - EXPERT Summer School - Malaga 2015
ESR4 Rohit Gupta - EXPERT Summer School - Malaga 2015ESR4 Rohit Gupta - EXPERT Summer School - Malaga 2015
ESR4 Rohit Gupta - EXPERT Summer School - Malaga 2015
 

Kürzlich hochgeladen

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 

Kürzlich hochgeladen (20)

Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 

6. Khalil Sima'an (UVA) Statistical Machine Translation

  • 1. Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Statistical Machine Translation Part I: Khalil Sima’an Data and Models Universiteit van Amsterdam Syntax Part II: Trevor Cohn Decoding and efficiency University of Sheffield
  • 2. Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Statistical Machine Translation: PART I Dr. Khalil Sima’an Statistical Language Processing and Learning Institute for Logic, Language and Computation Universiteit van Amsterdam Phrase-Based Models Limitations of PB Models Syntax Some slides use figures from Philipp Koehn, Barry Haddow and Sophie Arnoult
  • 3. Data and Models: Structure of lecture Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax General statistical framework Word-based models: word alignments Phrase-based models: phrase-alignments Tree-based models: tree-alignments
  • 4. Statistical Approach: Parallel Corpora Tutorial to Statistical Machine Translation Dr Khalil Sima’an Task: Translate a source sentence f to a target sentence e. Data: Parallel corpus (source-target sentence pairs). ‫ﻧﺤﻦ ﺍﻟﺸﻌﻮﺏ‬ ‫ﻭﺳﺎﺋﻂ ﺍﻹﻋﻼﻡ ﺍﻟﻤﺘﻌﺪﺩﺓ ﻭﺛﺎﺋﻖ ﻭﺧﺮﺍﺋﻂ ﻣﻨﺸﻮﺭﺍﺕ ﻭﻃﻮﺍﺑﻊ ﻭﻗﻮﺍﻋﺪ ﺍﻟﺒﻴﺎﻧﺎﺕ ﺍﻷﻣﻢ ﺍﻟﻤﺘﺤﺪﺓ ﺗﻌﻤﻞ ﺑﺤﺚ‬ Word-Based Models ‫ﺍﻟﺴﻠﻢ ﻭﺍﻷﻣﻦ ﺍﻟﺘﻨﻤﻴﺔ ﺍﻻﻗﺘﺼﺎﺩﻳﺔ ﻭﺍﻻﺟﺘﻤﺎﻋﻴﺔ ﺣﻘﻮﻕ ﺍﻹﻧﺴﺎﻥ ﺍﻟﺸﺆﻭﻥ ﺍﻹﻧﺴﺎﻧﻴﺔ ﺍﻟﻘﺎﻧﻮﻥ ﺍﻟﺪﻭﻟﻲ‬ Daily Briefing│Press Releases│Radio, TV, Photo│Documents, Maps│Publications, Stamps, Databases│UN Works Peace & Security│ Economic & Social Development │ Human Rights │ Humanitarian Affairs │ International Law Welcome to the United Nations Alignment Symmetrization Situation in the Middle East UN News Centre Situation in Iraq About the United Nations Renewing the United Nations UN Action against Terrorism Main Bodies Issues on the UN Agenda Conferences & Events Phrase-Based Models Limitations of PB Models Syntax ‫ﺍﻷﻣﻴﻦ ﺍﻟﻌﺎﻡ‬ ‫ﺍﻷﻫﺪﺍﻑ ﺍﻹﻧﻤﺎﺋﻴﺔ ﻟﻸﻣﻢ‬ ‫ﺍﻟﺤﺎﻟﺔ ﻓﻲ ﺍﻟﺸﺮﻕ ﺍﻷﻭﺳﻂ‬ Secretary-General UN Millennium Development Goals ‫ﺍﻟﻤﺘﺤﺪﺓ‬ ‫ﺍﻟﺤﺎﻟﺔ ﻓﻲ ﺍﻟﻌﺮﺍﻕ‬ ‫ﻣﺮﻛﺰ ﺃﻧﺒﺎء ﺍﻷﻣﻢ ﺍﻟﻤﺘﺤﺪﺓ‬ ‫ﺗﺠﺪﻳﺪ ﺍﻷﻣﻢ ﺍﻟﻤﺘﺤﺪﺓ‬ ‫ﺍﻷﻣﻢ ﺍﻟﻤﺘﺤﺪﺓ ﻓﻲ ﻣﻮﺍﺟﻬﺔ ﺍﻹﺭﻫﺎﺏ‬ ‫ﻗﻀﺎﻳﺎ ﻋﻠﻰ ﺟﺪﻭﻝ ﺃﻋﻤﺎﻝ ﺍﻷﻣﻢ ﺍﻟﻤﺘﺤﺪﺓ‬ ‫ﺍﻟﻤﺠﺘﻤﻊ ﺍﻟﻤﺪﻧﻲ / ﺍﻟﻤﺆﺳﺴﺎﺕ‬ ‫ﻣﻌﻠﻮﻣﺎﺕ ﻋﻦ ﺍﻷﻣﻢ ﺍﻟﻤﺘﺤﺪﺓ‬ ‫ﺃﻫﻼ ﺑﻜﻢ ﻓﻲ ﻣﻮﻗﻊ ﺍﻷﻣﻢ‬ ‫ﺍﻷﺟﻬﺰﺓ ﺍﻟﺮﺋﻴﺴﻴﺔ‬ ‫ﺍﻟﻤﺘﺤﺪﺓ‬ ‫ﻣﺆﺗﻤﺮﺍﺕ ﻭﻣﻨﺎﺳﺒﺎﺕ‬ ‫ﺍﻟﺘﺠﺎﺭﻳﺔ‬ Civil Society & Business Member States UN Webcast General Assembly President CyberSchoolBus International Conference on Financing for Development, Doha, Qatar 29 November - 2 December 2008 Home │ Recent Additions │ Employment │ Procurement │ Comments │ Q&A │ UN System Sites │ Index │ Search ‫ | ﻋﺮﺑﻲ‬中文 | English | Français | Русский | Español Copyright, United Nations, 2000-2008 | Terms of Use | Privacy Notice | Help ‫ﺍﻟﺒﺚ ﺍﻟﺸﺒﻜﻲ‬ ‫ﺍﻟﺪﻭﻝ ﺍﻷﻋﻀﺎء‬ ‫ﺍﻟﺤﺎﻓﻠﺔ ﺍﻟﻤﺪﺭﺳﻴﺔ ﺍﻹﻟﻜﺘﺮﻭﻧﻴﺔ‬ ‫ﺭﺋﻴﺲ ﺍﻟﺠﻤﻌﻴﺔ ﺍﻟﻌﺎﻣﺔ‬ ‫ﻣﺆﺗﻤﺮ ﺍﻟﻤﺘﺎﺑﻌﺔ ﺍﻟﺪﻭﻟﻲ ﻟﺘﻤﻮﻳﻞ ﺍﻟﺘﻨﻤﻴﺔ ﺍﻟﻤﻌﻨﻲ‬ ‫ﺑﺎﺳﺘﻌﺮﺍﺽ ﺗﻨﻔﻴﺬ ﺗﻮﺍﻓﻖ ﺁﺭﺍء ﻣﻮﻧﺘﻴﺮﻱ‬ ‫ﺍﻟﺪﻭﺣﺔ-ﻗﻄﺮ‬ ‫92 ﺗﺸﺮﻳﻦ ﺍﻟﺜﺎﻧﻲ/ﻧﻮﻓﻤﺒﺮ - 2 ﻛﺎﻧﻮﻥ ﺍﻷﻭﻝ/ﺩﻳﺴﻤﺒﺮ‬ 2008 ‫ﺻﻔﺤﺔ ﺍﻻﺳﺘﻘﺒﺎﻝ ﺇﺿﺎﻓﺎﺕ ﺟﺪﻳﺪﺓ ﻓﺮﺹ ﻋﻤﻞ ﺷﻌﺒﺔ ﺍﻟﻤﺸﺘﺮﻳﺎﺕ ﻣﻼﺣﻈﺎﺕ ﺃﺳﺌﻠﺔ ﻭﺃﺟﻮﺑﺔ ﻣﻮﺍﻗﻊ ﻣﻨﻈﻮﻣﺔ ﺍﻷﻣﻢ ﺍﻟﻤﺘﺤﺪﺓ ﻓﻬﺮﺱ ﺍﻟﻤﻮﻗﻊ‬ ‫ ﻋﺮﺑﻲ‬中文 English Français Русский Español ‫ﺻﻔﺤﺔ ﺍﻟﻤﺴﺎﻋﺪﺓ ﺷﺮﻭﻁ ﺍﺳﺘﺨﺪﺍﻡ ﺍﻟﻤﻮﻗﻊ ﺑﻴﺎﻥ ﺍﻟﺨﺼﻮﺻﻴﺎﺕ‬ 2008-2000 ‫ﺣﻘﻮﻕ ﺍﻟﻄﺒﻊ ﺍﻷﻣﻢ ﺍﻟﻤﺘﺤﺪﺓ‬ Source-Channel Approach: IBM Models (1990’s)
  • 5. Parallel Corpus Example Tutorial to Statistical Machine Translation Dr Khalil Sima’an Parallel corpus C = a collection of text-chunks and their translations. Parallel corpora are the by-product of human translation. Every source chunk is paired with a target chunk. Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Dutch De prijs van het huis is gestegen. Het huis kan worden verkocht. Als het de marktprijs daalt zullen sommige gezinnen een zware tijd doormaken. . . . . . . . . . . . . . . . . . . English The price of the house has risen. The house can be sold. If the market price goes down, some families will go through difficult times. . . . . . . . . . . . . . . . . . . Syntax Hansards Canadian Parliament Proc. (English-French). European Parliament Proc. (23 languages). United Nations documents. Newspapers: Chinese-English; Arabic-English; Urdu-English. TAUS corpora.
  • 6. Generative Source-Channel Framework Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Given source sentence f, select target sentence e L.M. T .M. arg maxe∈E(f) { P(e | f) } = arg maxe∈E(f) { P(e) × P(f | e) } Set E(f) is the set of hypothesized translations of f. P(f | e): accounts for divergence in . . . word order Limitations of PB Models morphology Syntax syntactic relations idiomatic ways of expression . . . How to estimate P(e | f)? Sparse-data problem!
  • 7. Inducing The Structure of Translation Data Tutorial to Statistical Machine Translation e = Mary did not slap the green witch . ???? Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax f = Maria no dio una bofetada a la bruja verde . The latent structure of translation equivalence Graphical representations ∆f and ∆e for f and e. Relation a between ∆f and ∆e arg maxe∈E(f) { P(e | f) } = arg maxe∈E(f) { ∆f ,a,∆e P(e, ∆f , ∆e , a | f) } The difficult question: Which ∆f/e and a fit data best?
  • 8. Structure in current models Tutorial to Statistical Machine Translation a ∆f → ∆e Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax In most current models structure of reordering: ∆f/e are structures over word positions. a is an alignment between groups of word positions in ∆f and ∆e . Challenge: Number of permutations of n words is n! Structure shows translation units composing together What are the atomic translation units? How these compose together efficiently? How to put probs. on these structures? Structure helps combat sparsity and complexity
  • 9. Structure in Existing Models: Sketch l Tutorial to Statistical Machine Translation uage Lang aj Alignment a Word-based Dr Khalil Sima’an el on Mod e el e aj e1 1 "translation dictionary" m j fm fj 1 f1 Word-Based Models Alignment Symmetrization Phrase-based Phrase-Based Models Limitations of PB Models l uage aj Syntax Lang 1 Tree-based Mod el on e el e aj e1 m j fm fj 1 f1 Problem: No sufficient stats to estimate P(e | f) from data
  • 11. Some History and References Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax Statistical models with word-alignments: Brown, Cocke, Della Pietra, Della Pietra, Jelinek, Lafferty, Mercer and Roossin. A statistical approach to machine translation. Computational Linguistics, 1990. Brown, Della Pietra, Della Pietra and Mercer. The mathematics of statistical machine translation: parameter estimation., Computational Linguistics, 1993. Och and Ney: A Systematic Comparison of Various Statistical Alignment Models. Computational Linguistics, 2003.
  • 12. Word-Based Models and Word-Alignment Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax a is a mapping between word positions. a e f ∆f and ∆e are sequences of word positions. m l e = e1 = e1 . . . el and f = f1 = f1 . . . fm A hidden word-alignment a: P(f | e) = P(a, f | e) a Assume that a target word-position ei translates into zero or more source word-positions a : {posf } → ({pose } ∪ {0}) ai or a(i), i.e., word position in e with which fi is aligned.
  • 13. Word Alignment Example Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax
  • 14. Word Alignment Example Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax (Le reste appartenait aux autochtones |
  • 15. Word Alignment Example: Not covered in this setting Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax
  • 16. Word Alignment Matrix Example Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax
  • 17. Translation model with word alignment Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax arg maxe P(e | f) = arg maxe P(e)×P(f | e) P(f | e) = a P(a, f | e) = a P(a| e) × P(f | a, e) Questions How to parametrize the model? How are e, f and a composed from basic units? How to train the model? How to acquire word alignment? How to translate with this model? Decoding and computational issues (for second part)
  • 18. Word-Alignment As Hidden Structure Tutorial to Statistical Machine Translation l ge M ngua La aj Dr Khalil Sima’an Alignment a Alignment Symmetrization e aj "translation dictionary" m j Phrase-Based Models Limitations of PB Models el e1 1 Word-Based Models on e odel fm fj 1 Syntax f1 We need to decompose The alignment a and the length m: P(a | e) “Translation dictionary” P(f | e, a)
  • 19. Word Alignment Models: General Scheme Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment of positions in f with positions in e: m a = a1 = a1 . . . am Markov process over a m m l P(a1 , f1 | e1 ) = P(m | e) × m Alignment Symmetrization j=1 Phrase-Based Models Limitations of PB Models Syntax j−1 j−1 j j−1 P(aj | a1 , f1 , m, e) × P(fj | a1 , f1 , m, e) In words: to generate alignment a and foreign sentence f 1 2 3 Choose a length m for f Generate alignment aj given the preceding alignments, words in f, m, and e Generate word fj conditioned on structure so far and e. IBM models are obtained by simplifications of this formula.
  • 20. IBM Model I Tutorial to Statistical Machine Translation m m P(a1 , f1 | e1 . . . el ) = P(m | e) × m Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models j=1 IBM Model I: Length: P(m | e) =≈ P(m | l) ≈ = A fixed probability . Align with uniform probability j with any aj in el1 or j−1 j−1 NULL: P(aj | a1 , f1 , m, e) ≈ (l + 1)−1 Limitations of PB Models Syntax j−1 j−1 j j−1 P(aj | a1 , f1 , m, e) × P(fj | a1 , f1 , m, e) Note that aj can be linked with l positions in e or with NULL. Lexicon: lexicon parameters πt (f | e) j j−1 P(fj | a1 , f1 , m, e) ≈ P(fj | eaj ) = πt (fj | eaj ) Parameters: and {πt (f | e) | f , e ∈ C}.
  • 21. Sketch IBM Model I Tutorial to Statistical Machine Translation l Dr Khalil Sima’an L age angu aj Word-Based Models e el eaj 1 Alignment Symmetrization el on Mod e1 label every f position with an e position choose length m given l "translation dictionary" Phrase-Based Models m Limitations of PB Models j fm Syntax fj 1 f1
  • 22. IBM Model I Parameters and Data Likelhood Tutorial to Statistical Machine Translation Dr Khalil Sima’an Data Likelihood: m a1 Word-Based Models l = Alignment Symmetrization Phrase-Based Models Limitations of PB Models m m P(a1 , f1 | e1 . . . el ) P(f |e) = Parameters: (l + 1)m × l m πt (fj | eaj ) ... a1 =0 am =0 j=1 and {πt (f | e) | f , e ∈ C}. Fix , i.e., in practice put a uniform probability over a range [1..m], for some natural number m. Syntax Dilemma To estimate these parameters we need word-alignment To get word-alignment we need these parameters.
  • 23. IBM Model II Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax Extends IBM Model I at alignment probs: m m P(a1 , f1 m | e1 . . . el ) ≈ × j−1 j−1 P(aj | a1 , f1 , m, e)×πt (fj | eaj ) j=1 IBM Model II: changes only one element in IBM Model I: IBM Model I does not take into account the position of words in both strings j−1 j−1 P(aj | a1 , f1 , m, e) = P(aj | j, l, m) := πA (aj | j, l, m) Where πA (.|.) are parameters to be learned from data. IBM Models III, IV and V concentrate on more complex alignments allowing, e.g., 1 − to − n (fertility)
  • 24. IBM Model II Parameters Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax m m P(a1 , f1 | e1 . . . el ) ≈ × m πA (aj | j, l, m) × πt (fj | eaj ) j=1 Parameters: {πA (aj | j, l, m)} and {πt (fj | eaj )} Dilemma To estimate these parameters we need word-alignment To get word-alignment we need these parameters.
  • 25. Estimating Model Parameters Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax Maximum-Likelihood Estimation of model M on parallel corpus C arg max P(C | m) = arg max m∈M m∈M Pm (e | f) e,f in C Example IBM Model I: Model M is defined by model parameters. Data is incomplete: no closed form solution. Expectation-Maximization (EM) sketch Init: Set the parameters at some m0 and let i = 0 Repeat until convergence (in perplexity) EMi (C) = C completed using estimate mi EMi (C) contains mi -expectations over e, f, a : P(a | f, e) mi+1 = Relative Frequency Estimates from EMi (C).
  • 26. EM for Lexicon and Word Alignment Probs Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization ... la maison ... la maison blue ... la fleur ... Phrase-Based Models   ... the house ... the blue house ... the flower ...   Limitations of PB Models Syntax
  • 27. EM for Lexicon and Word Alignment Probs Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization ... la maison ... la maison blue ... la fleur ... Phrase-Based Models Limitations of PB Models Syntax     ... the house ... the blue house ... the flower ...
  • 28. EM for Lexicon and Word Alignment Probs Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization ... la maison ... la maison bleu ... la fleur ... Phrase-Based Models Limitations of PB Models Syntax     ... the house ... the blue house ... the flower ...
  • 29. EM for Lexicon and Word Alignment Probs Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization ... la maison ... la maison bleu ... la fleur ... Phrase-Based Models Limitations of PB Models Syntax     ... the house ... the blue house ... the flower ...
  • 30. Translation Using EM Estimates Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax Lexicon probability estimates: {πt (fj | eaj )} ˆ Alignment probabilities: {πA (aj | j, m, l)} ˆ Translation Model + Language Model + Decoder arg maxe P(e | f) = arg maxe P(e) × P(a, f | e) a
  • 31. Viterbi Word-Alignment using EM estimates Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization After EM has stabilized on estimates {πt (fj | eaj )} ˆ For every f, e in C apply the following m m arg maxam P(a1 , f1 | e1 . . . el ) ≈ Phrase-Based Models Limitations of PB Models Syntax and {πA (aj | j, m, l)} ˆ 1 m arg maxam 1 πA (aj | j, m, l) × πt (fj | eaj ) ˆ ˆ × j=1
  • 32. HMM Alignment Model: General Form Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax m m P(a1 , f1 | e1 . . . el ) ≈ × m j−1 j−1 P(aj | a1 , f1 , m, e)×πt (fj | eaj ) j=1 Words do not move independently of each other: condition word movement on previous word movement j−1 j−1 P(aj | a1 , f1 , m, e) ≈ P(aj | aj−1 , m)
  • 33. IBM Model III (and IV): Example Tutorial to Statistical Machine Translation Dr Khalil Sima’an A hidden word-alignment a: P(f | e) = a P(a, f Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax Estimate alignment + lexicon + reordering + fertility parameters. | e)
  • 34. Word-based Models (Och Ney 2003) Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax
  • 35. Word-Alignment As Hidden Structure: Sufficient? Tutorial to Statistical Machine Translation l ge M ngua La aj Dr Khalil Sima’an Alignment a Alignment Symmetrization e aj translation dictionary m j Phrase-Based Models Limitations of PB Models el e1 1 Word-Based Models on e odel fm fj 1 Syntax f1 We assumed alignment between words and dictionary: Alignment a and the length m: P(a | e) Dictionary P(f | e, a)
  • 36. Limitations of Word-based Models Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax Limitations of word-based translation: Many-to-one and many-to-many is common: “Makes more difficult”/bemoeilijkt “Dat richtte (hen) ten gronde”/”That destroyed (them)” Reordering takes place (often) by whole blocks. Reordering individual words increases ambiguity. “The (big heavy) cow/la vaca (pesada grande)” Translation works by “fixed expressions” (idiomatic). Concatenating word-translations increases ambiguity. Estimates of P(f | e) by word-based models are inaccurate. Instead of words as basic events: multi-word events in corpus.
  • 38. Asymetric Alignments: Limitations Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Word-based models presented so far are based on asymmetric word alignment. Each position i in f is aligned with at most one position in e: ai What about such word alignments? Phrase-Based Models Limitations of PB Models Syntax Or when a word in f translated into two or more in e?
  • 39. Symmetrization Heursitics Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax Obtain Af→e and Ae→f From Intersection Af→e ∩ Ae→f to Union Af→e ∪ Ae→f
  • 40. Symmetrization Heursitics Algoritm Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax Obtain Af→e and Ae→f From Intersection Af→e ∩ Ae→f to Union Af→e ∪ Ae→f
  • 42. Statistical “Memory-based” Translation Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax Store arbitrary length source-target translation units from training parallel corpus. Translate new input by “covering” it with translation units replayed from memory. Idiomatic = Tiling: Phrase-Based SMT Assume word-alignment a is given in parallel corpus. Phrase-pair = contiguous source-target n, m -grams that are translational equivalents under a. Estimate phrase-pair probabilities Θ(f i | ei ) Translate f by “tiling it with phrases with order permutation”
  • 43. PBSMT some references Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax Alignment-template Approach to Stat. Machine Translation (RWTH Aachen 1999) Phrase-based statistical machine translation (Zens, Och and Ney 2002) Phrase based SMT (Koehn, Och and Marcu 2003) Joint Phrase-based SMT (Wang and Marcu 2005) Statistical Machine Translation. (Ph. Koehn, Cambridge University Press 2010) Relation to EBMT 1984; Data-Oriented Translation (2000).
  • 44. Phrase-Based Models: Conceptual Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models T he president meets Segment foreign sentence f I into I phrases f 1 economic officials arg maxe P(e | f) = arg maxe P(e) × P(f | e) P(f | e) = I f 1 ,eI 1 I P(f 1 ,eI | e) 1 Limitations of PB Models Syntax Saudi I i=1 P(f i | ei )×d(starti −endi−1 −1) ph.table arg maxe P(f | e) ≈ arg max I f 1 ,eI 1 Dist.reord. I Θ(f i | ei )×d(starti −endi−1 −1) i=1 starti /endi are positions of first/last words of f i (translateing to ei ). d(x) = αx exponentially decaying in words skipped (α ∈ (0, 1]).
  • 45. Phrase-Based Models: Linear-interpolation Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax T he president meets Segment foreign sentence f I into I phrases f 1 Saudi economic officials Log-linear interpolation of factors: score(e|f) = λf × log Hf (e, f) f∈F Where set F consists of: Bag of phrases translation = I i=1 Θ(f i | ei ) d() Phrases reordered with reordering model d(.) lm Language model (5-grams or even 7-grams). other Smoothing + length penalty terms.
  • 46. Topics to discuss Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax Phrase table extraction Estimating {Θ(f i | ei )} and {Θ(ei | f i )} Lexicalized and hierarchical phrase reordering models Other: phrase, length penalty . . . Log-linear interpolation and minimum error-rate training Decoding and optimization
  • 47. Extracting phrase pairs Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax A phrase pair f , e is consistent with alignment a iff Non-empty: at least one alignment pair from a is in f,e No foreign positions inside f , e aligned to positions outside it No english positions inside f , e aligned to positions outside it
  • 48. Extracting phrase pairs Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax
  • 49. Extracting phrase pairs Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax
  • 50. Extracting phrase pairs Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax
  • 51. Extracting phrases from permutations Tutorial to Statistical Machine Translation 1 3 4 2 5 1 3 4 2 5 1 2 3 4 5 1 2 3 4 5 1 3 4 2 5 1 3 4 2 5 1 2 3 4 5 1 2 3 4 5 1 3 4 2 5 1 3 4 2 5 1 2 3 4 5 1 2 3 4 5 Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax
  • 52. Phrase pair weights Tutorial to Statistical Machine Translation Dr Khalil Sima’an Extract phrase pairs from corpus into multiset Tab = { f , e , freq(f , e)} Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax Weights for f , e Θ(f | e) = freq(f ,e) freq(f ,e) f ,e ∈Tab Θ(e | f ) = freq(e,f ) freq(f ,e ) f ,e ∈Tab Smoothing with lexical word alignment estimates from IBM models
  • 53. Distance-Based Reordering Sketch Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax
  • 54. Lexicalized Reordering Sketch (Tillmann 2004) Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax
  • 55. Limited generalization over parallel data (1) Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax Non-productive Phrase Table: Phrase Variants? Morphological e.g., changing inflection, agreement Al$arekat Alhindiyya the-companies the-Indian the Indian companies $areka hindiyya company Indian (an) Indian company Syntactic e.g. adding adjective/proposition/adverbials the fish in the deep sea swims the fish swims Reordering minor reordering of same words not allowed In Arabic V-S-O and S-V-O are allowed. Semantic e.g. synonyms, paraphrases Non-productive Phrase Table = Data Sparseness
  • 56. Limited generalization over parallel data (2) Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Reordering Local, monotone, almost non-lexicalized reordering. What about long range reordering? Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax Five phrases need to be reversed: see Chiang 2007 (J. CL). Reordering target phrases with a coarse “source road map”?
  • 57. Limitations: Data-Sparseness Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax Non-productive phrase table + Local, Uncharted reordering ⇓ Data-sparseness: Shorter phrases will apply down to word level. ⇓ Shorter phrases combined assuming independence. ⇓ Target phrase selection hard due to large hypotheses lattice Target Language Model = Only “GLUE” over target phrases. The Shorter the Phrases, the Greater the Risk
  • 58. Idiomatic Approach: GOOD, BAD and UGLY Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax Phrases as atomic units Good: Less ambiguity in lexical choice and reordering. Match-Retrieve exactly is largely safe. Bad: Weak generalization over data. No phrase variants, weak reordering Ugly: Fall-back on shorter phrases downto word-to-word LM as “glue” is insufficient. Idiomatic approach does not alleviate data-sparseness How Should We Translate Novel Phrases?
  • 59. Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax Towards the land of bi-trees Alignments between Tree Pairs, ITG, Hierarchical Models and Syntax
  • 60. Hidden Structure of Translation: Tree Pairs Tutorial to Statistical Machine Translation l Dr Khalil Sima’an on e el uag Lang e aj aj Word-Based Models del e Mo 1 e1 Alignment Symmetrization Phrase-Based Models m Limitations of PB Models j Syntax fm fj 1 f1
  • 61. Reordering n words Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models 1 Permutations of n words: n! 3 4 1 2 3 5 4 5 Surely not all permutations are needed! (Wu 1995) Use trees and allow permutations on the nodes? There is an exponential number of trees in n ITG hypothesis (Wu 1995) Limitations of PB Models Syntax 2 [] [] Assume binary trees with two operations [] 1 3 4 2 5 Phrase-based forms of ITG (Chiang 2005; 2007): Hiero
  • 62. Syntax-Driven Phrase Translation Tutorial to Statistical Machine Translation Syntax-driven Re-Ordering Hierarchical (ITG) Dr Khalil Sima’an Linguistic Syntax XP XP Word-Based Models XP XP Alignment Symmetrization Phrase-Based Models 1 3 4 1 2 3 2 4 5 5 Limitations of PB Models Is translation syntactically cohesive? Syntax Reordering == Moving children in parse tree? Binary: monotone or inverted order at every node. Lexical elements can be phrase pairs. Covers word-alignments in parallel corpora?
  • 63. Word order difference and syntax: Impression Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax
  • 64. Phrase-based Hierarchical Model: Hiero (Chiang 2005; 2007) Tutorial to Statistical Machine Translation Extracting phrase-pairs with gaps (hierarchical trees): Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax X → X1 dar una bufetada a X2 / X1 slap X2 X → Maria no X la bruja verde / Mary did not X the green witch
  • 65. ITG with syntactic labels Tutorial to Statistical Machine Translation Dr Khalil Sima’an Word-Based Models Alignment Symmetrization Phrase-Based Models Limitations of PB Models Syntax