SlideShare a Scribd company logo
1 of 42
Download to read offline
Changes in Moses 
Hieu Hoang 
TAUS 
October 2014
MosesCore 
Year 1 (2012) 
• Easier installation 
– Binary releases 
– Pre-built models 
• Testing and Releases 
– Linux, Mac OSX, Windows 
– 32 and 64-bit 
• Faster training 
– Parallelism at all stages
MosesCore 
Year 2 (2013) 
• Even Easier installation 
– Binary releases 
– Pre-built models 
– Virtual Machines 
– Amazon EC2 
• Refactored Decoder
MosesCore 
Year 2 (2013) 
• Even Easier installation 
– Binary releases 
– Pre-built models 
– Virtual Machines 
– Amazon EC2 
• Refactored Decoder
Why did you Refactor? 
• Feature Function Framework 
– easier to implement new features 
– use sparse features 
• Simplify class structure 
– easier to develop with Moses 
• Delete functionality 
– easier to refactor code 
– very little deletion
Why did you Refactor? 
• Feature Function Framework 
– easier to implement new features 
– use sparse features 
• Simplify class structure 
– easier to develop with Moses 
• Delete functionality 
– easier to refactor code 
– very little deletion
Why did you Refactor? 
• Feature Function Framework 
– easier to implement new features 
– use sparse features 
• Simplify class structure 
– easier to develop with Moses 
• Delete functionality 
– easier to refactor code 
– very little deletion
Specify a Feature Function 
Then…. 
[lmodel-file] 
8 0 3 europarl.en.srilm.gz 
[weight-l] 
0.142 
ini file: 
• New Feature Function 
– New sections 
● [feature-function-file] 
● [weight-?] 
• Custom code 
– Parse ini file 
– Initialize feature function
Adding new Feature Function 
Now…. 
[feature] 
KENLM file=path order=0 
[weight] 
KENLM0= 0.142 
ini file: 
• New Feature Function 
– No new section 
● Line in [feature] section 
● Line in [weight] section 
– Framework 
● parse ini file 
● initialize feature
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
● Dynamic suffix array 
● Stores training data 
– Extract translation rule on-the-fly 
– Neural network language model 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
● Continuous space LM 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
● Replicate Devlin et al, 2014 
● Large quality gains 
– Transliteration 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
– Transliteration 
● Character level translation 
● Learns from parallel data 
● Integrate into decoder 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
– Extra information for each rule 
● Context, syntax, domain etc 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
• Syntax decoding 
– Faster, memory efficient decoding 
– More syntactic models
Technical Breakout 
• Organization and Releases 
– Academic and commercial needs 
– Prevent forks 
– Development/Stable versions 
– Forwards/Backward compatibility 
– Upgradability 
• Features 
• Deployment 
• Future development
Technical Breakout 
• Organization and Releases 
• Features 
• Deployment 
• Future development
Technical Breakout 
• Organization and Releases 
• Features 
• Deployment 
– Platform/Clouds 
– Docker containers 
– Priorities 
– Interaction and data formats 
• Future development
Technical Breakout 
• Organization and Releases 
• Features 
• Deployment 
• Future development 
– User-friendliness 
– End-to-end solution 
– Users
Changes in Moses 
Hieu Hoang 
TAUS 
October 2014 
Thanks for inviting me to come 
Here to tell you a little about the things I’ve 
been doing to Moses 
- over the past 2 years 
- mainly concentrate of the past year 
- but will quickly tell you about things I did 
prior to that 
1
MosesCore 
Year 1 (2012) 
• Easier installation 
– Binary releases 
– Pre-built models 
• Testing and Releases 
– Linux, Mac OSX, Windows 
– 32 and 64-bit 
• Faster training 
– Parallelism at all stages 
In the 1st year 
- picked off the low hanging fruit 
- fixed many of the easy issues that required 
- time & effort 
Made installation easier 
Run a lot of experiments anyway 
- gave some of them away 
- with all the scripts + configuration 
- used to run them 
- students can see how to replicate our 
results 
Lots of testing 
- all major platforms 
Made obvious speed improvements 
2
MosesCore 
Year 2 (2013) 
• Even Easier installation 
– Binary releases 
– Pre-built models 
– Virtual Machines 
– Amazon EC2 
• Refactored Decoder 
In year 2 
- made it even easier to install 
- if you can’t be bother to compile or even 
download the binaries 
- download a virtual machine with moses + 
friends installed 
OR 
rent an amazon server with moses + friends 
installed 
3
MosesCore 
Year 2 (2013) 
• Even Easier installation 
– Binary releases 
– Pre-built models 
– Virtual Machines 
– Amazon EC2 
• Refactored Decoder 
However, the main reason I came here today 
- talk about the major changes I made 
- in decoder 
- and else where 
Makes is easier for us coders 
- add and change things in Moses 
4
Why did you Refactor? 
• Feature Function Framework 
– easier to implement new features 
– use sparse features 
• Simplify class structure 
– easier to develop with Moses 
• Delete functionality 
– easier to refactor code 
– very little deletion 
What is a feature function? 
- something that gives a translation a score 
over the last few years 
- gotten bored with existing features like 
language models and reordering models 
the trend in MT 
- create novel features which give a score to 
a translation 
- like any feature, tries to give bigger scores 
to better models 
New feature function framework 
- designed to make it easy to add new 
features 
5
Why did you Refactor? 
• Feature Function Framework 
– easier to implement new features 
– use sparse features 
• Simplify class structure 
– easier to develop with Moses 
• Delete functionality 
– easier to refactor code 
– very little deletion 
Simplify class structure 
- to make it easier for us to develop with 
Moses 
- Moses has been around for 8 years now 
- everyone has the freedom to add what 
they want 
- no-one is in overall control 
- this way of organising an open-source 
project is great 
- gotten lots of contribution, lots of 
features 
- downside 
- grown organically 
- things are not as well structured as 
they can be 
6
Why did you Refactor? 
• Feature Function Framework 
– easier to implement new features 
– use sparse features 
• Simplify class structure 
– easier to develop with Moses 
• Delete functionality 
– easier to refactor code 
– very little deletion 
Why did I delete things 
- delete very little 
- I’m not the gatekeeper of moses, I don’t 
control it 
- if a functionality was deleted, it’s not a 
comment on usefulness of it 
- purely ‘cos it got in the way of the 
refactoring 
Quickly go thru the last 2 
- before telling you about feature functions 
7
Specify a Feature Function 
Then…. 
[lmodel-file] 
8 0 3 europarl.en.srilm.gz 
[weight-l] 
0.142 
ini file: 
• New Feature Function 
– New sections 
● [feature-function-file] 
● [weight-?] 
• Custom code 
– Parse ini file 
– Initialize feature function 
completely bestoked 
- no framework to help you 
- if you don’t do it right, wont’ work 
8
Adding new Feature Function 
Now…. 
[feature] 
KENLM file=path order=0 
[weight] 
KENLM0= 0.142 
ini file: 
• New Feature Function 
– No new section 
● Line in [feature] section 
● Line in [weight] section 
– Framework 
● parse ini file 
● initialize feature 
Write a class that implements the feature 
function 
The framework does the rest 
- no need to create a custom section in the ini 
file 
or 
- change StaticData class 
or 
- change Paramater class 
9
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
● Dynamic suffix array 
● Stores training data 
– Extract translation rule on-the-fly 
– Neural network language model 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
● Continuous space LM 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
● Replicate Devlin et al, 2014 
● Large quality gains 
– Transliteration 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
– Transliteration 
● Character level translation 
● Learns from parallel data 
● Integrate into decoder 
• Translation rule properties 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
– Extra information for each rule 
● Context, syntax, domain etc 
• Syntax decoding
MosesCore 
Year 3 (2014) 
• Exploit new framework 
– Updatable phrase-table 
– Neural network language model 
– Bilingual language models 
– Transliteration 
• Translation rule properties 
• Syntax decoding 
– Faster, memory efficient decoding 
– More syntactic models
Technical Breakout 
• Organization and Releases 
– Academic and commercial needs 
– Prevent forks 
– Development/Stable versions 
– Forwards/Backward compatibility 
– Upgradability 
• Features 
• Deployment 
• Future development
Technical Breakout 
• Organization and Releases 
• Features 
• Deployment 
• Future development
Technical Breakout 
• Organization and Releases 
• Features 
• Deployment 
– Platform/Clouds 
– Docker containers 
– Priorities 
– Interaction and data formats 
• Future development
Technical Breakout 
• Organization and Releases 
• Features 
• Deployment 
• Future development 
– User-friendliness 
– End-to-end solution 
– Users
TAUS Moses Industry Roundtable 2014, Changes in Moses, Hieu Hoang, University of Edinburgh

More Related Content

What's hot

OSGi Community Event 2010 - OSGi Technical Update
OSGi Community Event 2010 - OSGi Technical UpdateOSGi Community Event 2010 - OSGi Technical Update
OSGi Community Event 2010 - OSGi Technical Updatemfrancis
 
TokuDB - What You Need to Know
TokuDB - What You Need to KnowTokuDB - What You Need to Know
TokuDB - What You Need to KnowJervin Real
 
WEPA - Webdriver Enhanced Platform for Automation - WEPATest
WEPA - Webdriver Enhanced Platform for Automation - WEPATestWEPA - Webdriver Enhanced Platform for Automation - WEPATest
WEPA - Webdriver Enhanced Platform for Automation - WEPATestFreddy Vega
 
My S Q L Introduction for 1 day training
My S Q L  Introduction for 1 day trainingMy S Q L  Introduction for 1 day training
My S Q L Introduction for 1 day trainingIvan Tu
 
Releases - CFEngine presentation - Configuration Management Camp 2015
Releases - CFEngine presentation - Configuration Management Camp 2015Releases - CFEngine presentation - Configuration Management Camp 2015
Releases - CFEngine presentation - Configuration Management Camp 2015kacfengine
 
A Lap Around PowerShell 3.0
A Lap Around PowerShell 3.0A Lap Around PowerShell 3.0
A Lap Around PowerShell 3.0Sarah Dutkiewicz
 
Version control with subversion
Version control with subversionVersion control with subversion
Version control with subversionxprayc
 
(ATS4-PLAT01) Core Architecture Changes in AEP 9.0 and their Impact on Admini...
(ATS4-PLAT01) Core Architecture Changes in AEP 9.0 and their Impact on Admini...(ATS4-PLAT01) Core Architecture Changes in AEP 9.0 and their Impact on Admini...
(ATS4-PLAT01) Core Architecture Changes in AEP 9.0 and their Impact on Admini...BIOVIA
 
Unleashing the Power: A Lap Around PowerShell 3.0
Unleashing the Power: A Lap Around PowerShell 3.0Unleashing the Power: A Lap Around PowerShell 3.0
Unleashing the Power: A Lap Around PowerShell 3.0Sarah Dutkiewicz
 
Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest
Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest
Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest Lenz Grimmer
 
Ups and downs of enterprise Java app in a research setting
Ups and downs of enterprise Java app in a research settingUps and downs of enterprise Java app in a research setting
Ups and downs of enterprise Java app in a research settingCsaba Toth
 
Scaling with Symfony - PHP UK
Scaling with Symfony - PHP UKScaling with Symfony - PHP UK
Scaling with Symfony - PHP UKRicard Clau
 
Embedded Systems: Lecture 1: Course Overview
Embedded Systems: Lecture 1: Course OverviewEmbedded Systems: Lecture 1: Course Overview
Embedded Systems: Lecture 1: Course OverviewAhmed El-Arabawy
 
High Concurrency Architecture and Laravel Performance Tuning
High Concurrency Architecture and Laravel Performance TuningHigh Concurrency Architecture and Laravel Performance Tuning
High Concurrency Architecture and Laravel Performance TuningAlbert Chen
 

What's hot (18)

OSGi Community Event 2010 - OSGi Technical Update
OSGi Community Event 2010 - OSGi Technical UpdateOSGi Community Event 2010 - OSGi Technical Update
OSGi Community Event 2010 - OSGi Technical Update
 
TokuDB - What You Need to Know
TokuDB - What You Need to KnowTokuDB - What You Need to Know
TokuDB - What You Need to Know
 
WEPA - Webdriver Enhanced Platform for Automation - WEPATest
WEPA - Webdriver Enhanced Platform for Automation - WEPATestWEPA - Webdriver Enhanced Platform for Automation - WEPATest
WEPA - Webdriver Enhanced Platform for Automation - WEPATest
 
My S Q L Introduction for 1 day training
My S Q L  Introduction for 1 day trainingMy S Q L  Introduction for 1 day training
My S Q L Introduction for 1 day training
 
Releases - CFEngine presentation - Configuration Management Camp 2015
Releases - CFEngine presentation - Configuration Management Camp 2015Releases - CFEngine presentation - Configuration Management Camp 2015
Releases - CFEngine presentation - Configuration Management Camp 2015
 
A Lap Around PowerShell 3.0
A Lap Around PowerShell 3.0A Lap Around PowerShell 3.0
A Lap Around PowerShell 3.0
 
Version control with subversion
Version control with subversionVersion control with subversion
Version control with subversion
 
(ATS4-PLAT01) Core Architecture Changes in AEP 9.0 and their Impact on Admini...
(ATS4-PLAT01) Core Architecture Changes in AEP 9.0 and their Impact on Admini...(ATS4-PLAT01) Core Architecture Changes in AEP 9.0 and their Impact on Admini...
(ATS4-PLAT01) Core Architecture Changes in AEP 9.0 and their Impact on Admini...
 
Unleashing the Power: A Lap Around PowerShell 3.0
Unleashing the Power: A Lap Around PowerShell 3.0Unleashing the Power: A Lap Around PowerShell 3.0
Unleashing the Power: A Lap Around PowerShell 3.0
 
69-kauri
69-kauri69-kauri
69-kauri
 
69-kauri
69-kauri69-kauri
69-kauri
 
69-kauri
69-kauri69-kauri
69-kauri
 
69-kauri
69-kauri69-kauri
69-kauri
 
Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest
Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest
Making MySQL Administration a Breeze - A look into a MySQL DBA's toolchest
 
Ups and downs of enterprise Java app in a research setting
Ups and downs of enterprise Java app in a research settingUps and downs of enterprise Java app in a research setting
Ups and downs of enterprise Java app in a research setting
 
Scaling with Symfony - PHP UK
Scaling with Symfony - PHP UKScaling with Symfony - PHP UK
Scaling with Symfony - PHP UK
 
Embedded Systems: Lecture 1: Course Overview
Embedded Systems: Lecture 1: Course OverviewEmbedded Systems: Lecture 1: Course Overview
Embedded Systems: Lecture 1: Course Overview
 
High Concurrency Architecture and Laravel Performance Tuning
High Concurrency Architecture and Laravel Performance TuningHigh Concurrency Architecture and Laravel Performance Tuning
High Concurrency Architecture and Laravel Performance Tuning
 

Similar to TAUS Moses Industry Roundtable 2014, Changes in Moses, Hieu Hoang, University of Edinburgh

Java for XPages Development
Java for XPages DevelopmentJava for XPages Development
Java for XPages DevelopmentTeamstudio
 
Automated product categorization
Automated product categorizationAutomated product categorization
Automated product categorizationAndreas Loupasakis
 
Automated product categorization
Automated product categorization   Automated product categorization
Automated product categorization Warply
 
Finite State Queries In Lucene
Finite State Queries In LuceneFinite State Queries In Lucene
Finite State Queries In Luceneotisg
 
Infrastructure as Data with Ansible for easier Continuous Delivery
Infrastructure as Data with Ansible for easier Continuous DeliveryInfrastructure as Data with Ansible for easier Continuous Delivery
Infrastructure as Data with Ansible for easier Continuous DeliveryCarlo Bonamico
 
Reading Notes : the practice of programming
Reading Notes : the practice of programmingReading Notes : the practice of programming
Reading Notes : the practice of programmingJuggernaut Liu
 
Hia 1693-effective application-development_in_iib
Hia 1693-effective application-development_in_iibHia 1693-effective application-development_in_iib
Hia 1693-effective application-development_in_iibAndrew Coleman
 
Keynote Yonik Seeley & Steve Rowe lucene solr roadmap
Keynote   Yonik Seeley & Steve Rowe lucene solr roadmapKeynote   Yonik Seeley & Steve Rowe lucene solr roadmap
Keynote Yonik Seeley & Steve Rowe lucene solr roadmaplucenerevolution
 
KEYNOTE: Lucene / Solr road map
KEYNOTE: Lucene / Solr road mapKEYNOTE: Lucene / Solr road map
KEYNOTE: Lucene / Solr road maplucenerevolution
 
Connect 2014 SHOW102: XPages Still No Experience Necessary
Connect 2014 SHOW102: XPages Still No Experience NecessaryConnect 2014 SHOW102: XPages Still No Experience Necessary
Connect 2014 SHOW102: XPages Still No Experience Necessarypanagenda
 
Impact of cloud services on the work of oracle technology experts
Impact of cloud services on the work of oracle technology expertsImpact of cloud services on the work of oracle technology experts
Impact of cloud services on the work of oracle technology expertsAndreas Chatziantoniou
 
Impact of cloud services on the work of oracle technology experts
Impact of cloud services on the work of oracle technology expertsImpact of cloud services on the work of oracle technology experts
Impact of cloud services on the work of oracle technology expertsAndreas Chatziantoniou
 
Mixing d ps building architecture on the cross cutting example
Mixing d ps building architecture on the cross cutting exampleMixing d ps building architecture on the cross cutting example
Mixing d ps building architecture on the cross cutting examplecorehard_by
 
A Day in the Life: Developer Enhancements with Visual Studio 2012
A Day in the Life: Developer Enhancements with Visual Studio 2012A Day in the Life: Developer Enhancements with Visual Studio 2012
A Day in the Life: Developer Enhancements with Visual Studio 2012Imaginet
 
Search Architecture at Evernote: Presented by Christian Kohlschütter, Evernote
Search Architecture at Evernote: Presented by Christian Kohlschütter, EvernoteSearch Architecture at Evernote: Presented by Christian Kohlschütter, Evernote
Search Architecture at Evernote: Presented by Christian Kohlschütter, EvernoteLucidworks
 
An Introduction to MongoDB Ops Manager
An Introduction to MongoDB Ops ManagerAn Introduction to MongoDB Ops Manager
An Introduction to MongoDB Ops ManagerMongoDB
 
CFWheels - Pragmatic, Beautiful Code
CFWheels - Pragmatic, Beautiful CodeCFWheels - Pragmatic, Beautiful Code
CFWheels - Pragmatic, Beautiful Codeindiver
 
Big datatraining.in devops-part1
Big datatraining.in devops-part1Big datatraining.in devops-part1
Big datatraining.in devops-part1MvkZ
 

Similar to TAUS Moses Industry Roundtable 2014, Changes in Moses, Hieu Hoang, University of Edinburgh (20)

Java for XPages Development
Java for XPages DevelopmentJava for XPages Development
Java for XPages Development
 
Automated product categorization
Automated product categorizationAutomated product categorization
Automated product categorization
 
Automated product categorization
Automated product categorization   Automated product categorization
Automated product categorization
 
Finite State Queries In Lucene
Finite State Queries In LuceneFinite State Queries In Lucene
Finite State Queries In Lucene
 
Infrastructure as Data with Ansible for easier Continuous Delivery
Infrastructure as Data with Ansible for easier Continuous DeliveryInfrastructure as Data with Ansible for easier Continuous Delivery
Infrastructure as Data with Ansible for easier Continuous Delivery
 
Reading Notes : the practice of programming
Reading Notes : the practice of programmingReading Notes : the practice of programming
Reading Notes : the practice of programming
 
Hia 1693-effective application-development_in_iib
Hia 1693-effective application-development_in_iibHia 1693-effective application-development_in_iib
Hia 1693-effective application-development_in_iib
 
Keynote Yonik Seeley & Steve Rowe lucene solr roadmap
Keynote   Yonik Seeley & Steve Rowe lucene solr roadmapKeynote   Yonik Seeley & Steve Rowe lucene solr roadmap
Keynote Yonik Seeley & Steve Rowe lucene solr roadmap
 
KEYNOTE: Lucene / Solr road map
KEYNOTE: Lucene / Solr road mapKEYNOTE: Lucene / Solr road map
KEYNOTE: Lucene / Solr road map
 
Connect 2014 SHOW102: XPages Still No Experience Necessary
Connect 2014 SHOW102: XPages Still No Experience NecessaryConnect 2014 SHOW102: XPages Still No Experience Necessary
Connect 2014 SHOW102: XPages Still No Experience Necessary
 
Impact of cloud services on the work of oracle technology experts
Impact of cloud services on the work of oracle technology expertsImpact of cloud services on the work of oracle technology experts
Impact of cloud services on the work of oracle technology experts
 
Impact of cloud services on the work of oracle technology experts
Impact of cloud services on the work of oracle technology expertsImpact of cloud services on the work of oracle technology experts
Impact of cloud services on the work of oracle technology experts
 
Mixing d ps building architecture on the cross cutting example
Mixing d ps building architecture on the cross cutting exampleMixing d ps building architecture on the cross cutting example
Mixing d ps building architecture on the cross cutting example
 
Short-Training asp.net vNext
Short-Training asp.net vNextShort-Training asp.net vNext
Short-Training asp.net vNext
 
A Day in the Life: Developer Enhancements with Visual Studio 2012
A Day in the Life: Developer Enhancements with Visual Studio 2012A Day in the Life: Developer Enhancements with Visual Studio 2012
A Day in the Life: Developer Enhancements with Visual Studio 2012
 
Search Architecture at Evernote: Presented by Christian Kohlschütter, Evernote
Search Architecture at Evernote: Presented by Christian Kohlschütter, EvernoteSearch Architecture at Evernote: Presented by Christian Kohlschütter, Evernote
Search Architecture at Evernote: Presented by Christian Kohlschütter, Evernote
 
PostgreSQL and MySQL
PostgreSQL and MySQLPostgreSQL and MySQL
PostgreSQL and MySQL
 
An Introduction to MongoDB Ops Manager
An Introduction to MongoDB Ops ManagerAn Introduction to MongoDB Ops Manager
An Introduction to MongoDB Ops Manager
 
CFWheels - Pragmatic, Beautiful Code
CFWheels - Pragmatic, Beautiful CodeCFWheels - Pragmatic, Beautiful Code
CFWheels - Pragmatic, Beautiful Code
 
Big datatraining.in devops-part1
Big datatraining.in devops-part1Big datatraining.in devops-part1
Big datatraining.in devops-part1
 

More from TAUS - The Language Data Network

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS - The Language Data Network
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...TAUS - The Language Data Network
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)TAUS - The Language Data Network
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...TAUS - The Language Data Network
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...TAUS - The Language Data Network
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...TAUS - The Language Data Network
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...TAUS - The Language Data Network
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...TAUS - The Language Data Network
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...TAUS - The Language Data Network
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)TAUS - The Language Data Network
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...TAUS - The Language Data Network
 

More from TAUS - The Language Data Network (20)

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
 
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
 
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
 
Farmer Lv (TrueTran)
Farmer Lv (TrueTran)Farmer Lv (TrueTran)
Farmer Lv (TrueTran)
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 
Translation Technology Showcase in Shenzhen
Translation Technology Showcase in ShenzhenTranslation Technology Showcase in Shenzhen
Translation Technology Showcase in Shenzhen
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
 
SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)
 
How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 
QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)
 

Recently uploaded

Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024The Digital Insurer
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilV3cube
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfhans926745
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 

Recently uploaded (20)

Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 

TAUS Moses Industry Roundtable 2014, Changes in Moses, Hieu Hoang, University of Edinburgh

  • 1. Changes in Moses Hieu Hoang TAUS October 2014
  • 2. MosesCore Year 1 (2012) • Easier installation – Binary releases – Pre-built models • Testing and Releases – Linux, Mac OSX, Windows – 32 and 64-bit • Faster training – Parallelism at all stages
  • 3. MosesCore Year 2 (2013) • Even Easier installation – Binary releases – Pre-built models – Virtual Machines – Amazon EC2 • Refactored Decoder
  • 4. MosesCore Year 2 (2013) • Even Easier installation – Binary releases – Pre-built models – Virtual Machines – Amazon EC2 • Refactored Decoder
  • 5. Why did you Refactor? • Feature Function Framework – easier to implement new features – use sparse features • Simplify class structure – easier to develop with Moses • Delete functionality – easier to refactor code – very little deletion
  • 6. Why did you Refactor? • Feature Function Framework – easier to implement new features – use sparse features • Simplify class structure – easier to develop with Moses • Delete functionality – easier to refactor code – very little deletion
  • 7. Why did you Refactor? • Feature Function Framework – easier to implement new features – use sparse features • Simplify class structure – easier to develop with Moses • Delete functionality – easier to refactor code – very little deletion
  • 8. Specify a Feature Function Then…. [lmodel-file] 8 0 3 europarl.en.srilm.gz [weight-l] 0.142 ini file: • New Feature Function – New sections ● [feature-function-file] ● [weight-?] • Custom code – Parse ini file – Initialize feature function
  • 9. Adding new Feature Function Now…. [feature] KENLM file=path order=0 [weight] KENLM0= 0.142 ini file: • New Feature Function – No new section ● Line in [feature] section ● Line in [weight] section – Framework ● parse ini file ● initialize feature
  • 10. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models – Transliteration • Translation rule properties • Syntax decoding
  • 11. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table ● Dynamic suffix array ● Stores training data – Extract translation rule on-the-fly – Neural network language model – Bilingual language models – Transliteration • Translation rule properties • Syntax decoding
  • 12. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model ● Continuous space LM – Bilingual language models – Transliteration • Translation rule properties • Syntax decoding
  • 13. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models ● Replicate Devlin et al, 2014 ● Large quality gains – Transliteration • Translation rule properties • Syntax decoding
  • 14. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models – Transliteration ● Character level translation ● Learns from parallel data ● Integrate into decoder • Translation rule properties • Syntax decoding
  • 15. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models – Transliteration • Translation rule properties – Extra information for each rule ● Context, syntax, domain etc • Syntax decoding
  • 16. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models – Transliteration • Translation rule properties • Syntax decoding – Faster, memory efficient decoding – More syntactic models
  • 17. Technical Breakout • Organization and Releases – Academic and commercial needs – Prevent forks – Development/Stable versions – Forwards/Backward compatibility – Upgradability • Features • Deployment • Future development
  • 18. Technical Breakout • Organization and Releases • Features • Deployment • Future development
  • 19. Technical Breakout • Organization and Releases • Features • Deployment – Platform/Clouds – Docker containers – Priorities – Interaction and data formats • Future development
  • 20. Technical Breakout • Organization and Releases • Features • Deployment • Future development – User-friendliness – End-to-end solution – Users
  • 21.
  • 22. Changes in Moses Hieu Hoang TAUS October 2014 Thanks for inviting me to come Here to tell you a little about the things I’ve been doing to Moses - over the past 2 years - mainly concentrate of the past year - but will quickly tell you about things I did prior to that 1
  • 23. MosesCore Year 1 (2012) • Easier installation – Binary releases – Pre-built models • Testing and Releases – Linux, Mac OSX, Windows – 32 and 64-bit • Faster training – Parallelism at all stages In the 1st year - picked off the low hanging fruit - fixed many of the easy issues that required - time & effort Made installation easier Run a lot of experiments anyway - gave some of them away - with all the scripts + configuration - used to run them - students can see how to replicate our results Lots of testing - all major platforms Made obvious speed improvements 2
  • 24. MosesCore Year 2 (2013) • Even Easier installation – Binary releases – Pre-built models – Virtual Machines – Amazon EC2 • Refactored Decoder In year 2 - made it even easier to install - if you can’t be bother to compile or even download the binaries - download a virtual machine with moses + friends installed OR rent an amazon server with moses + friends installed 3
  • 25. MosesCore Year 2 (2013) • Even Easier installation – Binary releases – Pre-built models – Virtual Machines – Amazon EC2 • Refactored Decoder However, the main reason I came here today - talk about the major changes I made - in decoder - and else where Makes is easier for us coders - add and change things in Moses 4
  • 26. Why did you Refactor? • Feature Function Framework – easier to implement new features – use sparse features • Simplify class structure – easier to develop with Moses • Delete functionality – easier to refactor code – very little deletion What is a feature function? - something that gives a translation a score over the last few years - gotten bored with existing features like language models and reordering models the trend in MT - create novel features which give a score to a translation - like any feature, tries to give bigger scores to better models New feature function framework - designed to make it easy to add new features 5
  • 27. Why did you Refactor? • Feature Function Framework – easier to implement new features – use sparse features • Simplify class structure – easier to develop with Moses • Delete functionality – easier to refactor code – very little deletion Simplify class structure - to make it easier for us to develop with Moses - Moses has been around for 8 years now - everyone has the freedom to add what they want - no-one is in overall control - this way of organising an open-source project is great - gotten lots of contribution, lots of features - downside - grown organically - things are not as well structured as they can be 6
  • 28. Why did you Refactor? • Feature Function Framework – easier to implement new features – use sparse features • Simplify class structure – easier to develop with Moses • Delete functionality – easier to refactor code – very little deletion Why did I delete things - delete very little - I’m not the gatekeeper of moses, I don’t control it - if a functionality was deleted, it’s not a comment on usefulness of it - purely ‘cos it got in the way of the refactoring Quickly go thru the last 2 - before telling you about feature functions 7
  • 29. Specify a Feature Function Then…. [lmodel-file] 8 0 3 europarl.en.srilm.gz [weight-l] 0.142 ini file: • New Feature Function – New sections ● [feature-function-file] ● [weight-?] • Custom code – Parse ini file – Initialize feature function completely bestoked - no framework to help you - if you don’t do it right, wont’ work 8
  • 30. Adding new Feature Function Now…. [feature] KENLM file=path order=0 [weight] KENLM0= 0.142 ini file: • New Feature Function – No new section ● Line in [feature] section ● Line in [weight] section – Framework ● parse ini file ● initialize feature Write a class that implements the feature function The framework does the rest - no need to create a custom section in the ini file or - change StaticData class or - change Paramater class 9
  • 31. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models – Transliteration • Translation rule properties • Syntax decoding
  • 32. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table ● Dynamic suffix array ● Stores training data – Extract translation rule on-the-fly – Neural network language model – Bilingual language models – Transliteration • Translation rule properties • Syntax decoding
  • 33. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model ● Continuous space LM – Bilingual language models – Transliteration • Translation rule properties • Syntax decoding
  • 34. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models ● Replicate Devlin et al, 2014 ● Large quality gains – Transliteration • Translation rule properties • Syntax decoding
  • 35. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models – Transliteration ● Character level translation ● Learns from parallel data ● Integrate into decoder • Translation rule properties • Syntax decoding
  • 36. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models – Transliteration • Translation rule properties – Extra information for each rule ● Context, syntax, domain etc • Syntax decoding
  • 37. MosesCore Year 3 (2014) • Exploit new framework – Updatable phrase-table – Neural network language model – Bilingual language models – Transliteration • Translation rule properties • Syntax decoding – Faster, memory efficient decoding – More syntactic models
  • 38. Technical Breakout • Organization and Releases – Academic and commercial needs – Prevent forks – Development/Stable versions – Forwards/Backward compatibility – Upgradability • Features • Deployment • Future development
  • 39. Technical Breakout • Organization and Releases • Features • Deployment • Future development
  • 40. Technical Breakout • Organization and Releases • Features • Deployment – Platform/Clouds – Docker containers – Priorities – Interaction and data formats • Future development
  • 41. Technical Breakout • Organization and Releases • Features • Deployment • Future development – User-friendliness – End-to-end solution – Users