SlideShare ist ein Scribd-Unternehmen logo
1 von 33
Statistical Machine Translation  Waleed Oransa, M.Sc. College of Computing and Information Technology Arab Academy for Science and Technology Cairo, Egypt [email_address]
Agenda ,[object Object],[object Object],[object Object],[object Object],[object Object]
Agenda ,[object Object],[object Object],[object Object],[object Object],[object Object]
Why is Machine Translation so Hard? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
MT Approaches ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Interlingua Semantic Syntactic  Structure Word s Semantic  Syntactic  Structure Word s Direct Syntactic Transfer Semantic Transfer Source Language Text Target Language Text Conceptual  Generation Semantic   Generation Syntactic  Generation Morphological  Generation Conceptual  Analysis Semantic Analysis Parsing  Morphological Analysis Better Quality & More difficulty
Why Statistical Machine Translation (SMT)? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Example of a parallel corpus ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Statistical Machine Translation (SMT) ,[object Object],[object Object],[object Object],[object Object],[object Object]
How to build SMT System ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object]
SMT Training Phase English Sentences PBSMT System Training Language  Model (Arabic) Input:  Training Corpus Arabic/English Bi-Text Output:  Language Model  and Translation Model Arabic Sentences Translation Model (English/Arabic) Language   Modeling Training (Tool: SRILM toolkit) Translation   Modeling Training (Tool: Giza++ & Moses toolkit) What is the Language Model?
Language Model (LM) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
LM Role in SMT مشرف  يجتمعوا مشرف  تجتمع مشرف  يجتمع مشرف  يتقابل  مشرف  يقابل  مشرف  يجتمعا مشرف  تجتمعا  مشرف  يجتمعن مشرف  يجتمعون مشرف  يتقابلان Language Model  مشرف   xxxx مع  كبار المسؤولين المدنيين والعسكريين Language Model  يجتمع يجتمعوا تجتمع يجتمع يتقابل يقابل يجتمعا تجتمعا يجتمعن يجتمعون يتقابلان
[object Object]
SMT Training Phase English Sentences PBSMT System Training Language  Model (Arabic) Input:  Training Corpus Arabic/English Bi-Text Output:  Language Model  and Translation Model Arabic Sentences Translation Model (English/Arabic) Language   Modeling Training (Tool: SRILM toolkit) Translation   Modeling Training (Tool: Giza++ & Moses toolkit) What is the Translation Model?
Translation Model (TM) ,[object Object],[object Object],[object Object]
Translation Model (TM) ,[object Object],[object Object],[object Object],[object Object]
Translation Model (TM) ,[object Object],[object Object]
SMT Translation Phase SMT System (Decoder) Source Text Target Text Language  Model (Arabic) Translation Model (English/Arabic) Musharraf Meets with Senior Civilian  ,[object Object],[object Object],[object Object],[object Object],[object Object],Initial N-best hypotheses p=0.13 p=0.21 p=0.23 p=0.12 p=0.18 ,[object Object],[object Object],[object Object],[object Object],[object Object],p=0.53 p=0.42 p=0.37 p=0.22 p=0.48 Final N-best hypotheses ,[object Object]
How TM & LM work together? Musharraf Meets with Senior Civilian and Military Officials مشرف  *****  مع  كبار المسؤولين المدنيين والعسكريين يجتمع Language Model  Translation Model  يجتمعوا تجتمع يجتمع يتقابل يقابل يجتمعا تجتمعا يجتمعن يجتمعون يتقابلان
Agenda ,[object Object],[object Object],[object Object],[object Object]
PBSMT Approach ,[object Object],[object Object],[object Object],[object Object],ولد The prophet Mohamed was born في سنة  570  ميلادية in 570 A.D الرسول محمد
PBSMT Training Phase English Sentences PBSMT System Training Language  Model (Arabic) Input:  Training Corpus Arabic/English Bi-Text Output:  Language Model  and Translation Model PBSMT Normal Training Phase Arabic Sentences Translation Model  Phrase Table (English/Arabic) Language   Modeling Training (Tool: SRILM toolkit) Translation   Modeling Training (Tool: Giza++ & Moses toolkit)
Phrase based alignment (The prophet,  الرسول ) (The prophet Mohamed,  الرسول محمد ) (great man,  رجل عظيم ) (Mohamed is a great man,  محمد رجل عظيم ) (The prophet Mohamed is a great man,  الرسول محمد رجل عظيم ) etc. Extract all phrase: English to Arabic word alignment  Arabic to English word alignment  Intersection of both alignments
PBSMT  drawbacks ,[object Object],[object Object],[object Object]
Agenda ,[object Object],[object Object],[object Object],[object Object]
MT Evaluation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Human Evaluation 5 Adequacy (1-5) 4 Fluency (1-5) مشرف  يتقابل مع  كبار المسؤولين المدنيين والعسكريين  بخصوص  ملف الحد من الانتشار النووي MT Musharraf Meets with Senior Civilian and Military Officials to Examine Nuclear Anti-Proliferation Dossier Source
Automatic Evaluation ,[object Object],[object Object],[object Object],[object Object],Higher BLEU Score مشرف يجتمع بكبار  المسؤولين المدنيين والعسكريين  لبحث  ملف الحد من الانتشار النووي Ref3 مشرف  يجتمع  مع  كبار المسؤولين المدنيين   والعسكريين  لدرس  ملف الحد من الانتشار النووي Ref2 مشرف يجتمع  بكبار  المسؤولين المدنيين والعسكريين  لدرس  ملف الحد من الانتشار النووي Ref1 مشرف   يجتمع   بالقادة   ال مدنيين و   الع سكريين   رفيعي المستوى  ل دراسة   ملف  وقف  الانتشار النووي MT2 مشرف   يتقابل   مع  كبار المسؤولين المدنيين والعسكريين   بخصوص   ملف الحد من الانتشار النووي MT1 Musharraf Meets with Senior Civilian and Military Officials to Examine Nuclear Anti-Proliferation Dossier Source
Agenda ,[object Object],[object Object],[object Object],[object Object]
Online MT Services Review البنتان قالتا أنّنا جيّدين SK 8 قال الاثنان بنات  " نحن جيّد " SY 7 ان فتاتين  " نحن جيدة " MS 6 الفتاتين وقال  " نحن جاهزون " GO 5 قالت الفتاتان  " نحن جيدات " The two girls said "we are good" B خمسة عشر بنتًا Sakhr Trjem (SK) 4 خمسة عشر بنات Systran translator  (SY) 3 خمسة عشر الفتيات MS-Bing Translator (MS) 2 خمسة عشر فتيات Google Translate (GO) 1 خمس عشرة فتاة Fifteen girls A Arabic Translation  Sentence/Translation Service
Thank you  شكراً

Weitere ähnliche Inhalte

Was ist angesagt?

Experiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine TranslationExperiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine Translationkhyati gupta
 
CBAS: CONTEXT BASED ARABIC STEMMER
CBAS: CONTEXT BASED ARABIC STEMMERCBAS: CONTEXT BASED ARABIC STEMMER
CBAS: CONTEXT BASED ARABIC STEMMERijnlc
 
Assamese to English Statistical Machine Translation
Assamese to English Statistical Machine TranslationAssamese to English Statistical Machine Translation
Assamese to English Statistical Machine TranslationKalyanee Baruah
 
Parallel Corpora in (Machine) Translation: goals, issues and methodologies
Parallel Corpora in (Machine) Translation: goals, issues and methodologiesParallel Corpora in (Machine) Translation: goals, issues and methodologies
Parallel Corpora in (Machine) Translation: goals, issues and methodologiesAntonio Toral
 
Moore_slides.ppt
Moore_slides.pptMoore_slides.ppt
Moore_slides.pptbutest
 
NLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language ModelNLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language ModelHemantha Kulathilake
 
7 probability and statistics an introduction
7 probability and statistics an introduction7 probability and statistics an introduction
7 probability and statistics an introductionThennarasuSakkan
 
Natural Language Processing: Parsing
Natural Language Processing: ParsingNatural Language Processing: Parsing
Natural Language Processing: ParsingRushdi Shams
 
Evaluation of hindi english mt systems, challenges and solutions
Evaluation of hindi english mt systems, challenges and solutionsEvaluation of hindi english mt systems, challenges and solutions
Evaluation of hindi english mt systems, challenges and solutionsSajeed Mahaboob
 
Lec 15,16,17 NLP.machine translation
Lec 15,16,17  NLP.machine translationLec 15,16,17  NLP.machine translation
Lec 15,16,17 NLP.machine translationguest873a50
 
HMM BASED POS TAGGER FOR HINDI
HMM BASED POS TAGGER FOR HINDIHMM BASED POS TAGGER FOR HINDI
HMM BASED POS TAGGER FOR HINDIcscpconf
 
5a use of annotated corpus
5a use of annotated corpus5a use of annotated corpus
5a use of annotated corpusThennarasuSakkan
 
RULE BASED TRANSLITERATION SCHEME FOR ENGLISH TO PUNJABI
RULE BASED TRANSLITERATION SCHEME FOR ENGLISH TO PUNJABIRULE BASED TRANSLITERATION SCHEME FOR ENGLISH TO PUNJABI
RULE BASED TRANSLITERATION SCHEME FOR ENGLISH TO PUNJABIijnlc
 

Was ist angesagt? (20)

SMT3
SMT3SMT3
SMT3
 
NLP_KASHK:Text Normalization
NLP_KASHK:Text NormalizationNLP_KASHK:Text Normalization
NLP_KASHK:Text Normalization
 
NLP_KASHK:POS Tagging
NLP_KASHK:POS TaggingNLP_KASHK:POS Tagging
NLP_KASHK:POS Tagging
 
Experiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine TranslationExperiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine Translation
 
CBAS: CONTEXT BASED ARABIC STEMMER
CBAS: CONTEXT BASED ARABIC STEMMERCBAS: CONTEXT BASED ARABIC STEMMER
CBAS: CONTEXT BASED ARABIC STEMMER
 
Ijetcas14 575
Ijetcas14 575Ijetcas14 575
Ijetcas14 575
 
Assamese to English Statistical Machine Translation
Assamese to English Statistical Machine TranslationAssamese to English Statistical Machine Translation
Assamese to English Statistical Machine Translation
 
Parallel Corpora in (Machine) Translation: goals, issues and methodologies
Parallel Corpora in (Machine) Translation: goals, issues and methodologiesParallel Corpora in (Machine) Translation: goals, issues and methodologies
Parallel Corpora in (Machine) Translation: goals, issues and methodologies
 
Moore_slides.ppt
Moore_slides.pptMoore_slides.ppt
Moore_slides.ppt
 
NLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language ModelNLP_KASHK:Evaluating Language Model
NLP_KASHK:Evaluating Language Model
 
7 probability and statistics an introduction
7 probability and statistics an introduction7 probability and statistics an introduction
7 probability and statistics an introduction
 
Natural Language Processing: Parsing
Natural Language Processing: ParsingNatural Language Processing: Parsing
Natural Language Processing: Parsing
 
Intro to NLP. Lecture 2
Intro to NLP.  Lecture 2Intro to NLP.  Lecture 2
Intro to NLP. Lecture 2
 
Evaluation of hindi english mt systems, challenges and solutions
Evaluation of hindi english mt systems, challenges and solutionsEvaluation of hindi english mt systems, challenges and solutions
Evaluation of hindi english mt systems, challenges and solutions
 
Lec 15,16,17 NLP.machine translation
Lec 15,16,17  NLP.machine translationLec 15,16,17  NLP.machine translation
Lec 15,16,17 NLP.machine translation
 
HMM BASED POS TAGGER FOR HINDI
HMM BASED POS TAGGER FOR HINDIHMM BASED POS TAGGER FOR HINDI
HMM BASED POS TAGGER FOR HINDI
 
Barreiro-Batista-LR4NLP@Coling2018-presentation
Barreiro-Batista-LR4NLP@Coling2018-presentationBarreiro-Batista-LR4NLP@Coling2018-presentation
Barreiro-Batista-LR4NLP@Coling2018-presentation
 
5a use of annotated corpus
5a use of annotated corpus5a use of annotated corpus
5a use of annotated corpus
 
RULE BASED TRANSLITERATION SCHEME FOR ENGLISH TO PUNJABI
RULE BASED TRANSLITERATION SCHEME FOR ENGLISH TO PUNJABIRULE BASED TRANSLITERATION SCHEME FOR ENGLISH TO PUNJABI
RULE BASED TRANSLITERATION SCHEME FOR ENGLISH TO PUNJABI
 
Arabic MT Project
Arabic MT ProjectArabic MT Project
Arabic MT Project
 

Ähnlich wie Pbsmt presenation waleed_oransa_29_april2010

Real-time DirectTranslation System for Sinhala and Tamil Languages.
Real-time DirectTranslation System for Sinhala and Tamil Languages.Real-time DirectTranslation System for Sinhala and Tamil Languages.
Real-time DirectTranslation System for Sinhala and Tamil Languages.Sheeyam Shellvacumar
 
A deep analysis of Multi-word Expression and Machine Translation
A deep analysis of Multi-word Expression and Machine TranslationA deep analysis of Multi-word Expression and Machine Translation
A deep analysis of Multi-word Expression and Machine TranslationLifeng (Aaron) Han
 
Shallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShashank Shisodia
 
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...Association for Computational Linguistics
 
Personalising speech to-speech translation
Personalising speech to-speech translationPersonalising speech to-speech translation
Personalising speech to-speech translationbehzad66
 
Hps a hierarchical persian stemming method
Hps a hierarchical persian stemming methodHps a hierarchical persian stemming method
Hps a hierarchical persian stemming methodijnlc
 
S URVEY O N M ACHINE T RANSLITERATION A ND M ACHINE L EARNING M ODELS
S URVEY  O N M ACHINE  T RANSLITERATION A ND  M ACHINE L EARNING M ODELSS URVEY  O N M ACHINE  T RANSLITERATION A ND  M ACHINE L EARNING M ODELS
S URVEY O N M ACHINE T RANSLITERATION A ND M ACHINE L EARNING M ODELSijnlc
 
"Machine Translation 101" and the Challenge of Patents
"Machine Translation 101" and the Challenge of Patents"Machine Translation 101" and the Challenge of Patents
"Machine Translation 101" and the Challenge of PatentsIconic Translation Machines
 
Machine Translation Approaches and Design Aspects
Machine Translation Approaches and Design AspectsMachine Translation Approaches and Design Aspects
Machine Translation Approaches and Design AspectsIOSR Journals
 
Effectof morphologicalsegmentation&de segmentationonmachinetranslation
Effectof morphologicalsegmentation&de segmentationonmachinetranslationEffectof morphologicalsegmentation&de segmentationonmachinetranslation
Effectof morphologicalsegmentation&de segmentationonmachinetranslationSunayana Gawde
 
Svetlin Nakov - Improved Word Alignments Using the Web as a Corpus
Svetlin Nakov - Improved Word Alignments Using the Web as a CorpusSvetlin Nakov - Improved Word Alignments Using the Web as a Corpus
Svetlin Nakov - Improved Word Alignments Using the Web as a CorpusSvetlin Nakov
 
English to Bangla Translation
English to Bangla TranslationEnglish to Bangla Translation
English to Bangla TranslationSaugata Bose
 
Machine translation ppt by shantanu arora
Machine translation ppt by shantanu aroraMachine translation ppt by shantanu arora
Machine translation ppt by shantanu aroraVaishnaviKhandelwal6
 

Ähnlich wie Pbsmt presenation waleed_oransa_29_april2010 (20)

Translationusing moses1
Translationusing moses1Translationusing moses1
Translationusing moses1
 
Real-time DirectTranslation System for Sinhala and Tamil Languages.
Real-time DirectTranslation System for Sinhala and Tamil Languages.Real-time DirectTranslation System for Sinhala and Tamil Languages.
Real-time DirectTranslation System for Sinhala and Tamil Languages.
 
A deep analysis of Multi-word Expression and Machine Translation
A deep analysis of Multi-word Expression and Machine TranslationA deep analysis of Multi-word Expression and Machine Translation
A deep analysis of Multi-word Expression and Machine Translation
 
Shallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliteratorShallow parser for hindi language with an input from a transliterator
Shallow parser for hindi language with an input from a transliterator
 
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...
Junki Matsuo - 2015 - Source Phrase Segmentation and Translation for Japanese...
 
Personalising speech to-speech translation
Personalising speech to-speech translationPersonalising speech to-speech translation
Personalising speech to-speech translation
 
Hps a hierarchical persian stemming method
Hps a hierarchical persian stemming methodHps a hierarchical persian stemming method
Hps a hierarchical persian stemming method
 
S URVEY O N M ACHINE T RANSLITERATION A ND M ACHINE L EARNING M ODELS
S URVEY  O N M ACHINE  T RANSLITERATION A ND  M ACHINE L EARNING M ODELSS URVEY  O N M ACHINE  T RANSLITERATION A ND  M ACHINE L EARNING M ODELS
S URVEY O N M ACHINE T RANSLITERATION A ND M ACHINE L EARNING M ODELS
 
"Machine Translation 101" and the Challenge of Patents
"Machine Translation 101" and the Challenge of Patents"Machine Translation 101" and the Challenge of Patents
"Machine Translation 101" and the Challenge of Patents
 
Machine Translation Approaches and Design Aspects
Machine Translation Approaches and Design AspectsMachine Translation Approaches and Design Aspects
Machine Translation Approaches and Design Aspects
 
NLP
NLPNLP
NLP
 
NLP
NLPNLP
NLP
 
D2 anandkumar
D2 anandkumarD2 anandkumar
D2 anandkumar
 
**JUNK** (no subject)
**JUNK** (no subject)**JUNK** (no subject)
**JUNK** (no subject)
 
Machine translation
Machine translationMachine translation
Machine translation
 
Effectof morphologicalsegmentation&de segmentationonmachinetranslation
Effectof morphologicalsegmentation&de segmentationonmachinetranslationEffectof morphologicalsegmentation&de segmentationonmachinetranslation
Effectof morphologicalsegmentation&de segmentationonmachinetranslation
 
Svetlin Nakov - Improved Word Alignments Using the Web as a Corpus
Svetlin Nakov - Improved Word Alignments Using the Web as a CorpusSvetlin Nakov - Improved Word Alignments Using the Web as a Corpus
Svetlin Nakov - Improved Word Alignments Using the Web as a Corpus
 
English to Bangla Translation
English to Bangla TranslationEnglish to Bangla Translation
English to Bangla Translation
 
Machine translation ppt by shantanu arora
Machine translation ppt by shantanu aroraMachine translation ppt by shantanu arora
Machine translation ppt by shantanu arora
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 

Kürzlich hochgeladen

2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and ModificationsMJDuyan
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxAmita Gupta
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...pradhanghanshyam7136
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdfQucHHunhnh
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docxPoojaSen20
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Association for Project Management
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701bronxfugly43
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxAmanpreet Kaur
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docxPoojaSen20
 

Kürzlich hochgeladen (20)

2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Understanding Accommodations and Modifications
Understanding  Accommodations and ModificationsUnderstanding  Accommodations and Modifications
Understanding Accommodations and Modifications
 
Third Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptxThird Battle of Panipat detailed notes.pptx
Third Battle of Panipat detailed notes.pptx
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
1029 - Danh muc Sach Giao Khoa 10 . pdf
1029 -  Danh muc Sach Giao Khoa 10 . pdf1029 -  Danh muc Sach Giao Khoa 10 . pdf
1029 - Danh muc Sach Giao Khoa 10 . pdf
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
PROCESS RECORDING FORMAT.docx
PROCESS      RECORDING        FORMAT.docxPROCESS      RECORDING        FORMAT.docx
PROCESS RECORDING FORMAT.docx
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...Making communications land - Are they received and understood as intended? we...
Making communications land - Are they received and understood as intended? we...
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
psychiatric nursing HISTORY COLLECTION .docx
psychiatric  nursing HISTORY  COLLECTION  .docxpsychiatric  nursing HISTORY  COLLECTION  .docx
psychiatric nursing HISTORY COLLECTION .docx
 

Pbsmt presenation waleed_oransa_29_april2010

  • 1. Statistical Machine Translation Waleed Oransa, M.Sc. College of Computing and Information Technology Arab Academy for Science and Technology Cairo, Egypt [email_address]
  • 2.
  • 3.
  • 4.
  • 5.
  • 6. Interlingua Semantic Syntactic Structure Word s Semantic Syntactic Structure Word s Direct Syntactic Transfer Semantic Transfer Source Language Text Target Language Text Conceptual Generation Semantic Generation Syntactic Generation Morphological Generation Conceptual Analysis Semantic Analysis Parsing Morphological Analysis Better Quality & More difficulty
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12. SMT Training Phase English Sentences PBSMT System Training Language Model (Arabic) Input: Training Corpus Arabic/English Bi-Text Output: Language Model and Translation Model Arabic Sentences Translation Model (English/Arabic) Language Modeling Training (Tool: SRILM toolkit) Translation Modeling Training (Tool: Giza++ & Moses toolkit) What is the Language Model?
  • 13.
  • 14. LM Role in SMT مشرف يجتمعوا مشرف تجتمع مشرف يجتمع مشرف يتقابل مشرف يقابل مشرف يجتمعا مشرف تجتمعا مشرف يجتمعن مشرف يجتمعون مشرف يتقابلان Language Model مشرف xxxx مع كبار المسؤولين المدنيين والعسكريين Language Model يجتمع يجتمعوا تجتمع يجتمع يتقابل يقابل يجتمعا تجتمعا يجتمعن يجتمعون يتقابلان
  • 15.
  • 16. SMT Training Phase English Sentences PBSMT System Training Language Model (Arabic) Input: Training Corpus Arabic/English Bi-Text Output: Language Model and Translation Model Arabic Sentences Translation Model (English/Arabic) Language Modeling Training (Tool: SRILM toolkit) Translation Modeling Training (Tool: Giza++ & Moses toolkit) What is the Translation Model?
  • 17.
  • 18.
  • 19.
  • 20.
  • 21. How TM & LM work together? Musharraf Meets with Senior Civilian and Military Officials مشرف ***** مع كبار المسؤولين المدنيين والعسكريين يجتمع Language Model Translation Model يجتمعوا تجتمع يجتمع يتقابل يقابل يجتمعا تجتمعا يجتمعن يجتمعون يتقابلان
  • 22.
  • 23.
  • 24. PBSMT Training Phase English Sentences PBSMT System Training Language Model (Arabic) Input: Training Corpus Arabic/English Bi-Text Output: Language Model and Translation Model PBSMT Normal Training Phase Arabic Sentences Translation Model Phrase Table (English/Arabic) Language Modeling Training (Tool: SRILM toolkit) Translation Modeling Training (Tool: Giza++ & Moses toolkit)
  • 25. Phrase based alignment (The prophet, الرسول ) (The prophet Mohamed, الرسول محمد ) (great man, رجل عظيم ) (Mohamed is a great man, محمد رجل عظيم ) (The prophet Mohamed is a great man, الرسول محمد رجل عظيم ) etc. Extract all phrase: English to Arabic word alignment Arabic to English word alignment Intersection of both alignments
  • 26.
  • 27.
  • 28.
  • 29. Human Evaluation 5 Adequacy (1-5) 4 Fluency (1-5) مشرف يتقابل مع كبار المسؤولين المدنيين والعسكريين بخصوص ملف الحد من الانتشار النووي MT Musharraf Meets with Senior Civilian and Military Officials to Examine Nuclear Anti-Proliferation Dossier Source
  • 30.
  • 31.
  • 32. Online MT Services Review البنتان قالتا أنّنا جيّدين SK 8 قال الاثنان بنات " نحن جيّد " SY 7 ان فتاتين " نحن جيدة " MS 6 الفتاتين وقال " نحن جاهزون " GO 5 قالت الفتاتان " نحن جيدات " The two girls said "we are good" B خمسة عشر بنتًا Sakhr Trjem (SK) 4 خمسة عشر بنات Systran translator (SY) 3 خمسة عشر الفتيات MS-Bing Translator (MS) 2 خمسة عشر فتيات Google Translate (GO) 1 خمس عشرة فتاة Fifteen girls A Arabic Translation Sentence/Translation Service
  • 33. Thank you شكراً