SlideShare ist ein Scribd-Unternehmen logo
1 von 17
Machine Translation
Mohamed Hassan
 Introduction
 Main challenges
 Techniques
 Features
Introduction
 Machine translation:-
 Machine Translation has been defined as the process that
utilizes computer software to translate text from one
natural language(such as English) to another (such as
Arabic).
 The idea of machine translation may be traced back to
the 17th century
 MT on the web starts with Systran offering free translation
of small texts (1996)
Technique
 Example-based MT
 Dictionary-based
 Rule-based
 Hybrid MT
 Neural MT
 Statistical
 Interlingual
 Transfer-based
Example-based MT
 characterized by its use of a bilingual corpus with parallel
texts as its main knowledge base.
 It is essentially a translation by analogy
 Ex
 English How much is that umbrella
 Arabic ‫المظله‬ ‫هذه‬ ‫سعر‬ ‫كم‬
 English How much is that doggie
 Arabic ‫الكلب‬ ‫هذا‬ ‫سعر‬ ‫كم‬
Dictionary-based
 The words will be translated as a dictionary does — word
by word, usually without much correlation of meaning
between them
Rule-based
 RBMT involves more information about the linguistics of the
source and target languages ,using the syntactic rules and
semantic analysis of both languages
This type of translation is used mostly in the creation
of dictionaries and grammar programs
Interlingual
 instance of rule-based machine-translation
 Itis necessary to have an intermediate representation(interlingua)
that captures the "meaning" of the original sentence in order to
generate the correct translation
 "language neutral" representation that is independent of any language
 Advantage: one of the major advantages of this system is that the
interlingua becomes more valuable as the number of target languages
it can be turned into increases
 the only interlingual machine translation system that has been made
operational at the commercial level is the KANT system
Transfer-based
 Itis necessary to have an intermediate representation that
captures the "meaning" of the original sentence in order
to generate the correct translation
 it depends partially on the language pair involved in the
translation
Statistical
 using statistical methods based on bilingual text corpora,
such as the Canadian Hansard corpus
 The idea behind statistical machine translation comes
from information theory
Hybrid MT
combination of statistical and rule-
based translation methodologies
Neural MT
 neural network is trained by deep
learning techniques
Challenges in MT
 Ambiguity
Ex1:
Book the flight -> verb
Read the book -> noun
Ex2:
Kill a man (‫)قتل‬
Kill a process (‫انهاء‬)
Ex3:
she couldn’t bear children
‫تستطيع‬ ‫ال‬‫تحمل‬‫االطفال‬
‫تستطيع‬ ‫ال‬‫انجاب‬‫اطفال‬
Challenges in MT
Different word orders
English word order : subject –verb –object
Mohamed is at home
Arabic word order:
‫المنزل‬ ‫في‬ ‫يتواجد‬ ‫محمد‬(‫اسميه‬ ‫جمله‬)
‫المنزل‬ ‫في‬ ‫محمد‬ ‫يتواجد‬(‫فعليه‬ ‫جمله‬)
Japanese: subject –object- verb
Challenges in MT
 Compound Words
Arabic ‫ا‬َ‫ه‬‫و‬ُ‫م‬ُ‫ك‬ُ‫م‬ِ‫ز‬ْ‫ل‬ُ‫ن‬َ‫أ‬
English Shall we compel you to accept it
 Missing Names
A language may not have a word for a certain
action or object that exists in another language
ksnona (Greek)
guest room(english)
Application
Google translator
Bing Translator
SYSTRAN
Asia Online
Machine translation

Weitere ähnliche Inhalte

Was ist angesagt?

Machine translation with statistical approach
Machine translation with statistical approachMachine translation with statistical approach
Machine translation with statistical approach
vini89
 
Types of machine translation
Types of machine translationTypes of machine translation
Types of machine translation
Rushdi Shams
 
Parts of Speect Tagging
Parts of Speect TaggingParts of Speect Tagging
Parts of Speect Tagging
theyaseen51
 

Was ist angesagt? (20)

Machine translation with statistical approach
Machine translation with statistical approachMachine translation with statistical approach
Machine translation with statistical approach
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
NLP
NLPNLP
NLP
 
Introduction to Natural Language Processing (NLP)
Introduction to Natural Language Processing (NLP)Introduction to Natural Language Processing (NLP)
Introduction to Natural Language Processing (NLP)
 
natural language processing help at myassignmenthelp.net
natural language processing  help at myassignmenthelp.netnatural language processing  help at myassignmenthelp.net
natural language processing help at myassignmenthelp.net
 
Types of machine translation
Types of machine translationTypes of machine translation
Types of machine translation
 
Spell checker using Natural language processing
Spell checker using Natural language processing Spell checker using Natural language processing
Spell checker using Natural language processing
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Introduction to Natural Language Processing
Introduction to Natural Language ProcessingIntroduction to Natural Language Processing
Introduction to Natural Language Processing
 
Lecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language TechnologyLecture 1: Semantic Analysis in Language Technology
Lecture 1: Semantic Analysis in Language Technology
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Statistical machine translation
Statistical machine translationStatistical machine translation
Statistical machine translation
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Parts of Speect Tagging
Parts of Speect TaggingParts of Speect Tagging
Parts of Speect Tagging
 
Morphological Analysis
Morphological AnalysisMorphological Analysis
Morphological Analysis
 
Natural Language Processing
Natural Language ProcessingNatural Language Processing
Natural Language Processing
 
Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...Natural Language processing Parts of speech tagging, its classes, and how to ...
Natural Language processing Parts of speech tagging, its classes, and how to ...
 
Nlp ambiguity presentation
Nlp ambiguity presentationNlp ambiguity presentation
Nlp ambiguity presentation
 
Challenges in nlp
Challenges in nlpChallenges in nlp
Challenges in nlp
 
NLP
NLPNLP
NLP
 

Ähnlich wie Machine translation

Error Analysis of Rule-based Machine Translation Outputs
Error Analysis of Rule-based Machine Translation OutputsError Analysis of Rule-based Machine Translation Outputs
Error Analysis of Rule-based Machine Translation Outputs
Parisa Niksefat
 

Ähnlich wie Machine translation (20)

Machine translation ppt by shantanu arora
Machine translation ppt by shantanu aroraMachine translation ppt by shantanu arora
Machine translation ppt by shantanu arora
 
machinetranslation-161223011433.pptx
machinetranslation-161223011433.pptxmachinetranslation-161223011433.pptx
machinetranslation-161223011433.pptx
 
Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...
 
Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...Past, Present, and Future: Machine Translation & Natural Language Processing ...
Past, Present, and Future: Machine Translation & Natural Language Processing ...
 
E-Translation
E-TranslationE-Translation
E-Translation
 
Punjabi to Hindi Transliteration System for Proper Nouns Using Hybrid Approach
Punjabi to Hindi Transliteration System for Proper Nouns Using Hybrid ApproachPunjabi to Hindi Transliteration System for Proper Nouns Using Hybrid Approach
Punjabi to Hindi Transliteration System for Proper Nouns Using Hybrid Approach
 
Arabic MT Project
Arabic MT ProjectArabic MT Project
Arabic MT Project
 
Moses
MosesMoses
Moses
 
Translationusing moses1
Translationusing moses1Translationusing moses1
Translationusing moses1
 
machine transaltion
machine transaltionmachine transaltion
machine transaltion
 
Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...
Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...
Applying Rule-Based Maximum Matching Approach for Verb Phrase Identification ...
 
Error Analysis of Rule-based Machine Translation Outputs
Error Analysis of Rule-based Machine Translation OutputsError Analysis of Rule-based Machine Translation Outputs
Error Analysis of Rule-based Machine Translation Outputs
 
Multi lingual corpus for machine aided translation
Multi lingual corpus for machine aided translationMulti lingual corpus for machine aided translation
Multi lingual corpus for machine aided translation
 
**JUNK** (no subject)
**JUNK** (no subject)**JUNK** (no subject)
**JUNK** (no subject)
 
SMT3
SMT3SMT3
SMT3
 
How to Translate from English to Khmer using Moses
How to Translate from English to Khmer using MosesHow to Translate from English to Khmer using Moses
How to Translate from English to Khmer using Moses
 
EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...
EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...
EMPLOYING PIVOT LANGUAGE TECHNIQUE THROUGH STATISTICAL AND NEURAL MACHINE TRA...
 
Machine Translation Approaches and Design Aspects
Machine Translation Approaches and Design AspectsMachine Translation Approaches and Design Aspects
Machine Translation Approaches and Design Aspects
 
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...
Approach To Build A Marathi Text-To-Speech System Using Concatenative Synthes...
 
Improving the role of language model in statistical machine translation (Indo...
Improving the role of language model in statistical machine translation (Indo...Improving the role of language model in statistical machine translation (Indo...
Improving the role of language model in statistical machine translation (Indo...
 

Kürzlich hochgeladen

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Kürzlich hochgeladen (20)

EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 

Machine translation

  • 2.  Introduction  Main challenges  Techniques  Features
  • 3. Introduction  Machine translation:-  Machine Translation has been defined as the process that utilizes computer software to translate text from one natural language(such as English) to another (such as Arabic).  The idea of machine translation may be traced back to the 17th century  MT on the web starts with Systran offering free translation of small texts (1996)
  • 4. Technique  Example-based MT  Dictionary-based  Rule-based  Hybrid MT  Neural MT  Statistical  Interlingual  Transfer-based
  • 5. Example-based MT  characterized by its use of a bilingual corpus with parallel texts as its main knowledge base.  It is essentially a translation by analogy  Ex  English How much is that umbrella  Arabic ‫المظله‬ ‫هذه‬ ‫سعر‬ ‫كم‬  English How much is that doggie  Arabic ‫الكلب‬ ‫هذا‬ ‫سعر‬ ‫كم‬
  • 6. Dictionary-based  The words will be translated as a dictionary does — word by word, usually without much correlation of meaning between them
  • 7. Rule-based  RBMT involves more information about the linguistics of the source and target languages ,using the syntactic rules and semantic analysis of both languages This type of translation is used mostly in the creation of dictionaries and grammar programs
  • 8. Interlingual  instance of rule-based machine-translation  Itis necessary to have an intermediate representation(interlingua) that captures the "meaning" of the original sentence in order to generate the correct translation  "language neutral" representation that is independent of any language  Advantage: one of the major advantages of this system is that the interlingua becomes more valuable as the number of target languages it can be turned into increases  the only interlingual machine translation system that has been made operational at the commercial level is the KANT system
  • 9. Transfer-based  Itis necessary to have an intermediate representation that captures the "meaning" of the original sentence in order to generate the correct translation  it depends partially on the language pair involved in the translation
  • 10. Statistical  using statistical methods based on bilingual text corpora, such as the Canadian Hansard corpus  The idea behind statistical machine translation comes from information theory
  • 11. Hybrid MT combination of statistical and rule- based translation methodologies
  • 12. Neural MT  neural network is trained by deep learning techniques
  • 13. Challenges in MT  Ambiguity Ex1: Book the flight -> verb Read the book -> noun Ex2: Kill a man (‫)قتل‬ Kill a process (‫انهاء‬) Ex3: she couldn’t bear children ‫تستطيع‬ ‫ال‬‫تحمل‬‫االطفال‬ ‫تستطيع‬ ‫ال‬‫انجاب‬‫اطفال‬
  • 14. Challenges in MT Different word orders English word order : subject –verb –object Mohamed is at home Arabic word order: ‫المنزل‬ ‫في‬ ‫يتواجد‬ ‫محمد‬(‫اسميه‬ ‫جمله‬) ‫المنزل‬ ‫في‬ ‫محمد‬ ‫يتواجد‬(‫فعليه‬ ‫جمله‬) Japanese: subject –object- verb
  • 15. Challenges in MT  Compound Words Arabic ‫ا‬َ‫ه‬‫و‬ُ‫م‬ُ‫ك‬ُ‫م‬ِ‫ز‬ْ‫ل‬ُ‫ن‬َ‫أ‬ English Shall we compel you to accept it  Missing Names A language may not have a word for a certain action or object that exists in another language ksnona (Greek) guest room(english)