SlideShare ist ein Scribd-Unternehmen logo
1 von 24
Can
Big Data from the Cloud
RevolutionizeTranslation
Metrics?
Quick intro
• 2010: Memsource founded
• 2015: 50,000 users & 100+ million words translated monthly
• Some of the world’s largest translation providers and buyers are
customers
SEGA FUJIFILM
Cloud tools lead to Big Data
Server tools – private data silos Cloud tools – centralized data
And the clouds are getting bigger…
In May alone, users processed 0.8 billion words in Memsource
…Which opens opportunities for
benchmarking and trendwatching
Impact
•Find market pain
points
•Usage stats
•Universal
performance
metrics
•Eliminate free
tests
•ROI tracking
•Identify synergies
•Higher margins
•Real-time
benchmarking
•Notifications that
help manage
operations
Translation
companies
Buyers
Technology
providers
Freelancers
and Project
managers
Example problem - quality
• Free testing
• Since the end of LISA everyone has a unique quality metric
• Can we embed a certain standard into the tool itself?
Freelancer profile on Upwork.com
Our analytics building blocks
SQL technology
Visualization: 400 filters
Legacy solution
Visualization: about 20 filters
Kibana console look
So what can we track there?
• In theory, anything:
• Translation data
• Productivity
• Business analytics
• Notifications
• In practice (challenges):
• Data clean-up
• Relevance
• Interpretation
Translation memory used for 85% of jobs
Users save 10 to 40% with TM
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50
USERS
OVERALL TM LEVERAGE BY TOP 50 VOLUME USERS
repetitions tm.match101 tm.match100 tm.match95 tm.match85 tm.match75 tm.match50 tm.match0
Data for jobs where post-editing analysis has been performed, December 2015 - May 2016
Sample
9 bn
words
Savings
approx.
$300
million
MT is currently used on 31% of projects
Top MT Engines
ENGINE %
Microsoft with Feedback 15.8%
Microsoft Translator Hub 9.9%
Google Translate 2.6%
Microsoft Translator 2.5%
SDL BeGlobal 0.4%
Other 0.6%
MT not used 68.2%
Up to 80% content pasted from MT then
edited
Sample size 20 million words, December 2015 - May 2016
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
en:es pt:en en:pt es:en en:ru ru:en en:de pt:es en:fr es:pt
%OFWORDSINSEGEMENTSFROMMT
SAMPLE LANGUAGE PAIRS
EDIT DISTANCE FOR MAJOR LANGUAGE PAIRS
mt.match100 mt.match95 mt.match85 mt.match75 mt.match50 mt.match0
MT not used
Raw MT
Moderate
edits
Heavily
edited
Many linguists translate
more than 10 pages a day consistently
0
200
400
600
800
1000
1200
1400
1600
1
21
41
61
81
101
121
141
161
181
201
221
241
261
281
301
321
341
361
381
401
421
441
461
481
501
521
541
561
581
601
621
641
661
681
701
721
741
761
781
801
821
841
861
881
901
921
941
961
981
PagesCompletedinApril
Users
TOP 1000 LINGUIST ROLE PRODUCTIVITY, PAGES IN APRIL 2016
Norm:
8 pages a day x 20 days
20 pages a day
Probably not
human translation
10 pages a day
Project manager productivity
408
325
313
263
159
143
122
74 68 63
31
10 5
0
50
100
150
200
250
300
350
400
450
Renato Joana Kris John Bill Robert Alex Sandor Dave Millingan Mihiko Olga Barbora
Job Created by PMs and Completed by Linguists in the last 30
days
– test organization
Benchmarking possibilities
674
440 428
94
37
13 9 5 12 10 7
0
100
200
300
400
500
600
700
800
1 or less from 1 to 10 from 11 to 100 from 101 to
200
from 200 to
300
from 300 to
400
from 400 to
500
from 500 to
600
from 501 to
1000
from 1001 to
2000
more than
2000
PM Productivity, Completed Jobs Per Month
Number of jobs completed
Numberofusers
December – May 2016
Top 10%
Project manager productivity
408
325
313
263
159
143
122
74 68 63
31
10 5
0
50
100
150
200
250
300
350
400
450
Renato Joana Kris John Bill Robert Alex Sandor Dave Millingan Mihiko Olga Barbora
Job Created by PMs and Completed by Linguists in the last 30
days
Top 10% of Global PM
User Population
“In fact, Big Data applications are bound only
by the human imagination”.
Peter Pham
What you can do now
• What to track?
• How can organizations benefit from each other’s data?
• Which data should not be shared?
Thank you!
konstantin@memsource.com

Weitere ähnliche Inhalte

Ähnlich wie Can Big Data from the Cloud Revolutionize Translation Metrics

Open Source Operations Analytics With Elastic
Open Source Operations Analytics With ElasticOpen Source Operations Analytics With Elastic
Open Source Operations Analytics With ElasticArthur Gimpel
 
MiTiN 2013 Keynote in Detroit Michigan
MiTiN 2013 Keynote in Detroit MichiganMiTiN 2013 Keynote in Detroit Michigan
MiTiN 2013 Keynote in Detroit MichiganKirti Vashee
 
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (2)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (2)Topic 4: The Magician's Hat: Turning Data into Business Intelligence (2)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (2)TAUS - The Language Data Network
 
Good Applications of Bad Machine Translation
Good Applications of Bad Machine TranslationGood Applications of Bad Machine Translation
Good Applications of Bad Machine Translationbdonaldson
 
Seeing the Wood for the Trees in MT Evaluation: an LSP success story from RWS
Seeing the Wood for the Trees in MT Evaluation: an LSP success story from RWSSeeing the Wood for the Trees in MT Evaluation: an LSP success story from RWS
Seeing the Wood for the Trees in MT Evaluation: an LSP success story from RWSIconic Translation Machines
 
Mli 2017 business success migrations on m2
Mli 2017 business success migrations on m2Mli 2017 business success migrations on m2
Mli 2017 business success migrations on m2Hanoi MagentoMeetup
 
Machine Translation: Latest Innovations and their Impact on Commercial Transl...
Machine Translation: Latest Innovations and their Impact on Commercial Transl...Machine Translation: Latest Innovations and their Impact on Commercial Transl...
Machine Translation: Latest Innovations and their Impact on Commercial Transl...SDL
 
Is Orchestration the Next Big Thing in DevOps
Is Orchestration the Next Big Thing in DevOpsIs Orchestration the Next Big Thing in DevOps
Is Orchestration the Next Big Thing in DevOpsNati Shalom
 
Software Development Trends 2010-2011
Software Development Trends 2010-2011Software Development Trends 2010-2011
Software Development Trends 2010-2011Charalampos Arapidis
 
Slides: The Automated Business Glossary
Slides: The Automated Business GlossarySlides: The Automated Business Glossary
Slides: The Automated Business GlossaryDATAVERSITY
 
Quantifying Business Advantage: The Value of Database Selection
Quantifying Business Advantage: The Value of Database SelectionQuantifying Business Advantage: The Value of Database Selection
Quantifying Business Advantage: The Value of Database SelectionMongoDB
 
MLOps Virtual Event: Automating ML at Scale
MLOps Virtual Event: Automating ML at ScaleMLOps Virtual Event: Automating ML at Scale
MLOps Virtual Event: Automating ML at ScaleDatabricks
 
15th December 2016 - Microsoft Paddington Vuzion Awareness Event
15th December 2016 - Microsoft Paddington Vuzion Awareness Event15th December 2016 - Microsoft Paddington Vuzion Awareness Event
15th December 2016 - Microsoft Paddington Vuzion Awareness EventVuzion
 
Lunch and Learn and Sneakers
Lunch and Learn and SneakersLunch and Learn and Sneakers
Lunch and Learn and SneakersBill Zajac
 
Lexcelera MT Breaking Compromises
Lexcelera MT Breaking CompromisesLexcelera MT Breaking Compromises
Lexcelera MT Breaking CompromisesLoriThicke
 
Adoption of Robotic Automation Process
Adoption of Robotic Automation ProcessAdoption of Robotic Automation Process
Adoption of Robotic Automation ProcessMukund Wangikar
 

Ähnlich wie Can Big Data from the Cloud Revolutionize Translation Metrics (20)

Microsoft - SEO - TAUS Tokyo Forum 2015
Microsoft - SEO - TAUS Tokyo Forum 2015Microsoft - SEO - TAUS Tokyo Forum 2015
Microsoft - SEO - TAUS Tokyo Forum 2015
 
Open Source Operations Analytics With Elastic
Open Source Operations Analytics With ElasticOpen Source Operations Analytics With Elastic
Open Source Operations Analytics With Elastic
 
MiTiN 2013 Keynote in Detroit Michigan
MiTiN 2013 Keynote in Detroit MichiganMiTiN 2013 Keynote in Detroit Michigan
MiTiN 2013 Keynote in Detroit Michigan
 
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (2)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (2)Topic 4: The Magician's Hat: Turning Data into Business Intelligence (2)
Topic 4: The Magician's Hat: Turning Data into Business Intelligence (2)
 
Good Applications of Bad Machine Translation
Good Applications of Bad Machine TranslationGood Applications of Bad Machine Translation
Good Applications of Bad Machine Translation
 
Seeing the Wood for the Trees in MT Evaluation: an LSP success story from RWS
Seeing the Wood for the Trees in MT Evaluation: an LSP success story from RWSSeeing the Wood for the Trees in MT Evaluation: an LSP success story from RWS
Seeing the Wood for the Trees in MT Evaluation: an LSP success story from RWS
 
Mli 2017 business success migrations on m2
Mli 2017 business success migrations on m2Mli 2017 business success migrations on m2
Mli 2017 business success migrations on m2
 
Machine Translation: Latest Innovations and their Impact on Commercial Transl...
Machine Translation: Latest Innovations and their Impact on Commercial Transl...Machine Translation: Latest Innovations and their Impact on Commercial Transl...
Machine Translation: Latest Innovations and their Impact on Commercial Transl...
 
Is Orchestration the Next Big Thing in DevOps
Is Orchestration the Next Big Thing in DevOpsIs Orchestration the Next Big Thing in DevOps
Is Orchestration the Next Big Thing in DevOps
 
Microstrategy Overview
Microstrategy OverviewMicrostrategy Overview
Microstrategy Overview
 
Software Development Trends 2010-2011
Software Development Trends 2010-2011Software Development Trends 2010-2011
Software Development Trends 2010-2011
 
Slides: The Automated Business Glossary
Slides: The Automated Business GlossarySlides: The Automated Business Glossary
Slides: The Automated Business Glossary
 
Quantifying Business Advantage: The Value of Database Selection
Quantifying Business Advantage: The Value of Database SelectionQuantifying Business Advantage: The Value of Database Selection
Quantifying Business Advantage: The Value of Database Selection
 
MLOps Virtual Event: Automating ML at Scale
MLOps Virtual Event: Automating ML at ScaleMLOps Virtual Event: Automating ML at Scale
MLOps Virtual Event: Automating ML at Scale
 
15th December 2016 - Microsoft Paddington Vuzion Awareness Event
15th December 2016 - Microsoft Paddington Vuzion Awareness Event15th December 2016 - Microsoft Paddington Vuzion Awareness Event
15th December 2016 - Microsoft Paddington Vuzion Awareness Event
 
Lunch and Learn and Sneakers
Lunch and Learn and SneakersLunch and Learn and Sneakers
Lunch and Learn and Sneakers
 
MyMemory
MyMemoryMyMemory
MyMemory
 
Lexcelera MT Breaking Compromises
Lexcelera MT Breaking CompromisesLexcelera MT Breaking Compromises
Lexcelera MT Breaking Compromises
 
Cloud Economics
Cloud EconomicsCloud Economics
Cloud Economics
 
Adoption of Robotic Automation Process
Adoption of Robotic Automation ProcessAdoption of Robotic Automation Process
Adoption of Robotic Automation Process
 

Mehr von Konstantin Dranch

Language Services Market in Germany 2020 (updated)
Language Services Market in Germany 2020 (updated)Language Services Market in Germany 2020 (updated)
Language Services Market in Germany 2020 (updated)Konstantin Dranch
 
События и тенденции российского рынка переводов 2017
События и тенденции российского рынка переводов 2017События и тенденции российского рынка переводов 2017
События и тенденции российского рынка переводов 2017Konstantin Dranch
 
Обзор переводческих тендеров 2016. (Андрей Балаев, Прима Виста)
Обзор переводческих тендеров 2016. (Андрей Балаев, Прима Виста)Обзор переводческих тендеров 2016. (Андрей Балаев, Прима Виста)
Обзор переводческих тендеров 2016. (Андрей Балаев, Прима Виста)Konstantin Dranch
 
French translation companies 2016
French translation companies 2016French translation companies 2016
French translation companies 2016Konstantin Dranch
 
Ukrainian Translation Market - UTIC conference presentation
Ukrainian Translation Market - UTIC conference presentationUkrainian Translation Market - UTIC conference presentation
Ukrainian Translation Market - UTIC conference presentationKonstantin Dranch
 
Elena Tarasova: Translation agencies: the good, the bad and the okay ones
Elena Tarasova: Translation agencies: the good, the bad and the okay ones Elena Tarasova: Translation agencies: the good, the bad and the okay ones
Elena Tarasova: Translation agencies: the good, the bad and the okay ones Konstantin Dranch
 
Top 10 events of 2013 in the Russian translation industry
Top 10 events of 2013 in the Russian translation industryTop 10 events of 2013 in the Russian translation industry
Top 10 events of 2013 in the Russian translation industryKonstantin Dranch
 
10 mind-wriggling figures from Translation Forum Russia 2013
10 mind-wriggling figures from Translation Forum Russia 201310 mind-wriggling figures from Translation Forum Russia 2013
10 mind-wriggling figures from Translation Forum Russia 2013Konstantin Dranch
 
Translation Industry in Russia
Translation Industry in RussiaTranslation Industry in Russia
Translation Industry in RussiaKonstantin Dranch
 

Mehr von Konstantin Dranch (9)

Language Services Market in Germany 2020 (updated)
Language Services Market in Germany 2020 (updated)Language Services Market in Germany 2020 (updated)
Language Services Market in Germany 2020 (updated)
 
События и тенденции российского рынка переводов 2017
События и тенденции российского рынка переводов 2017События и тенденции российского рынка переводов 2017
События и тенденции российского рынка переводов 2017
 
Обзор переводческих тендеров 2016. (Андрей Балаев, Прима Виста)
Обзор переводческих тендеров 2016. (Андрей Балаев, Прима Виста)Обзор переводческих тендеров 2016. (Андрей Балаев, Прима Виста)
Обзор переводческих тендеров 2016. (Андрей Балаев, Прима Виста)
 
French translation companies 2016
French translation companies 2016French translation companies 2016
French translation companies 2016
 
Ukrainian Translation Market - UTIC conference presentation
Ukrainian Translation Market - UTIC conference presentationUkrainian Translation Market - UTIC conference presentation
Ukrainian Translation Market - UTIC conference presentation
 
Elena Tarasova: Translation agencies: the good, the bad and the okay ones
Elena Tarasova: Translation agencies: the good, the bad and the okay ones Elena Tarasova: Translation agencies: the good, the bad and the okay ones
Elena Tarasova: Translation agencies: the good, the bad and the okay ones
 
Top 10 events of 2013 in the Russian translation industry
Top 10 events of 2013 in the Russian translation industryTop 10 events of 2013 in the Russian translation industry
Top 10 events of 2013 in the Russian translation industry
 
10 mind-wriggling figures from Translation Forum Russia 2013
10 mind-wriggling figures from Translation Forum Russia 201310 mind-wriggling figures from Translation Forum Russia 2013
10 mind-wriggling figures from Translation Forum Russia 2013
 
Translation Industry in Russia
Translation Industry in RussiaTranslation Industry in Russia
Translation Industry in Russia
 

Kürzlich hochgeladen

Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Kirill Klimov
 
Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Seta Wicaksana
 
Islamabad Escorts | Call 03070433345 | Escort Service in Islamabad
Islamabad Escorts | Call 03070433345 | Escort Service in IslamabadIslamabad Escorts | Call 03070433345 | Escort Service in Islamabad
Islamabad Escorts | Call 03070433345 | Escort Service in IslamabadAyesha Khan
 
8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCR8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCRashishs7044
 
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCRashishs7044
 
Intro to BCG's Carbon Emissions Benchmark_vF.pdf
Intro to BCG's Carbon Emissions Benchmark_vF.pdfIntro to BCG's Carbon Emissions Benchmark_vF.pdf
Intro to BCG's Carbon Emissions Benchmark_vF.pdfpollardmorgan
 
Cybersecurity Awareness Training Presentation v2024.03
Cybersecurity Awareness Training Presentation v2024.03Cybersecurity Awareness Training Presentation v2024.03
Cybersecurity Awareness Training Presentation v2024.03DallasHaselhorst
 
IoT Insurance Observatory: summary 2024
IoT Insurance Observatory:  summary 2024IoT Insurance Observatory:  summary 2024
IoT Insurance Observatory: summary 2024Matteo Carbone
 
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu Menza
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu MenzaYouth Involvement in an Innovative Coconut Value Chain by Mwalimu Menza
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu Menzaictsugar
 
International Business Environments and Operations 16th Global Edition test b...
International Business Environments and Operations 16th Global Edition test b...International Business Environments and Operations 16th Global Edition test b...
International Business Environments and Operations 16th Global Edition test b...ssuserf63bd7
 
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncr
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / NcrCall Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncr
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncrdollysharma2066
 
Contemporary Economic Issues Facing the Filipino Entrepreneur (1).pptx
Contemporary Economic Issues Facing the Filipino Entrepreneur (1).pptxContemporary Economic Issues Facing the Filipino Entrepreneur (1).pptx
Contemporary Economic Issues Facing the Filipino Entrepreneur (1).pptxMarkAnthonyAurellano
 
Kenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby AfricaKenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby Africaictsugar
 
Digital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfDigital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfJos Voskuil
 
Investment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy CheruiyotInvestment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy Cheruiyotictsugar
 
Case study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detailCase study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detailAriel592675
 

Kürzlich hochgeladen (20)

Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024Flow Your Strategy at Flight Levels Day 2024
Flow Your Strategy at Flight Levels Day 2024
 
Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...
 
Islamabad Escorts | Call 03070433345 | Escort Service in Islamabad
Islamabad Escorts | Call 03070433345 | Escort Service in IslamabadIslamabad Escorts | Call 03070433345 | Escort Service in Islamabad
Islamabad Escorts | Call 03070433345 | Escort Service in Islamabad
 
8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCR8447779800, Low rate Call girls in Rohini Delhi NCR
8447779800, Low rate Call girls in Rohini Delhi NCR
 
No-1 Call Girls In Goa 93193 VIP 73153 Escort service In North Goa Panaji, Ca...
No-1 Call Girls In Goa 93193 VIP 73153 Escort service In North Goa Panaji, Ca...No-1 Call Girls In Goa 93193 VIP 73153 Escort service In North Goa Panaji, Ca...
No-1 Call Girls In Goa 93193 VIP 73153 Escort service In North Goa Panaji, Ca...
 
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
 
Intro to BCG's Carbon Emissions Benchmark_vF.pdf
Intro to BCG's Carbon Emissions Benchmark_vF.pdfIntro to BCG's Carbon Emissions Benchmark_vF.pdf
Intro to BCG's Carbon Emissions Benchmark_vF.pdf
 
Cybersecurity Awareness Training Presentation v2024.03
Cybersecurity Awareness Training Presentation v2024.03Cybersecurity Awareness Training Presentation v2024.03
Cybersecurity Awareness Training Presentation v2024.03
 
IoT Insurance Observatory: summary 2024
IoT Insurance Observatory:  summary 2024IoT Insurance Observatory:  summary 2024
IoT Insurance Observatory: summary 2024
 
Corporate Profile 47Billion Information Technology
Corporate Profile 47Billion Information TechnologyCorporate Profile 47Billion Information Technology
Corporate Profile 47Billion Information Technology
 
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu Menza
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu MenzaYouth Involvement in an Innovative Coconut Value Chain by Mwalimu Menza
Youth Involvement in an Innovative Coconut Value Chain by Mwalimu Menza
 
International Business Environments and Operations 16th Global Edition test b...
International Business Environments and Operations 16th Global Edition test b...International Business Environments and Operations 16th Global Edition test b...
International Business Environments and Operations 16th Global Edition test b...
 
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncr
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / NcrCall Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncr
Call Girls in DELHI Cantt, ( Call Me )-8377877756-Female Escort- In Delhi / Ncr
 
Contemporary Economic Issues Facing the Filipino Entrepreneur (1).pptx
Contemporary Economic Issues Facing the Filipino Entrepreneur (1).pptxContemporary Economic Issues Facing the Filipino Entrepreneur (1).pptx
Contemporary Economic Issues Facing the Filipino Entrepreneur (1).pptx
 
Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)
 
Kenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby AfricaKenya’s Coconut Value Chain by Gatsby Africa
Kenya’s Coconut Value Chain by Gatsby Africa
 
Digital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdfDigital Transformation in the PLM domain - distrib.pdf
Digital Transformation in the PLM domain - distrib.pdf
 
Investment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy CheruiyotInvestment in The Coconut Industry by Nancy Cheruiyot
Investment in The Coconut Industry by Nancy Cheruiyot
 
Case study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detailCase study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detail
 
Enjoy ➥8448380779▻ Call Girls In Sector 18 Noida Escorts Delhi NCR
Enjoy ➥8448380779▻ Call Girls In Sector 18 Noida Escorts Delhi NCREnjoy ➥8448380779▻ Call Girls In Sector 18 Noida Escorts Delhi NCR
Enjoy ➥8448380779▻ Call Girls In Sector 18 Noida Escorts Delhi NCR
 

Can Big Data from the Cloud Revolutionize Translation Metrics

  • 1. Can Big Data from the Cloud RevolutionizeTranslation Metrics?
  • 2. Quick intro • 2010: Memsource founded • 2015: 50,000 users & 100+ million words translated monthly • Some of the world’s largest translation providers and buyers are customers SEGA FUJIFILM
  • 3. Cloud tools lead to Big Data Server tools – private data silos Cloud tools – centralized data
  • 4. And the clouds are getting bigger… In May alone, users processed 0.8 billion words in Memsource
  • 5. …Which opens opportunities for benchmarking and trendwatching
  • 6. Impact •Find market pain points •Usage stats •Universal performance metrics •Eliminate free tests •ROI tracking •Identify synergies •Higher margins •Real-time benchmarking •Notifications that help manage operations Translation companies Buyers Technology providers Freelancers and Project managers
  • 7. Example problem - quality • Free testing • Since the end of LISA everyone has a unique quality metric • Can we embed a certain standard into the tool itself?
  • 8.
  • 10. Our analytics building blocks SQL technology Visualization: 400 filters Legacy solution Visualization: about 20 filters
  • 12. So what can we track there? • In theory, anything: • Translation data • Productivity • Business analytics • Notifications • In practice (challenges): • Data clean-up • Relevance • Interpretation
  • 13. Translation memory used for 85% of jobs
  • 14. Users save 10 to 40% with TM 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 USERS OVERALL TM LEVERAGE BY TOP 50 VOLUME USERS repetitions tm.match101 tm.match100 tm.match95 tm.match85 tm.match75 tm.match50 tm.match0 Data for jobs where post-editing analysis has been performed, December 2015 - May 2016 Sample 9 bn words Savings approx. $300 million
  • 15. MT is currently used on 31% of projects Top MT Engines ENGINE % Microsoft with Feedback 15.8% Microsoft Translator Hub 9.9% Google Translate 2.6% Microsoft Translator 2.5% SDL BeGlobal 0.4% Other 0.6% MT not used 68.2%
  • 16. Up to 80% content pasted from MT then edited Sample size 20 million words, December 2015 - May 2016 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% en:es pt:en en:pt es:en en:ru ru:en en:de pt:es en:fr es:pt %OFWORDSINSEGEMENTSFROMMT SAMPLE LANGUAGE PAIRS EDIT DISTANCE FOR MAJOR LANGUAGE PAIRS mt.match100 mt.match95 mt.match85 mt.match75 mt.match50 mt.match0 MT not used Raw MT Moderate edits Heavily edited
  • 17. Many linguists translate more than 10 pages a day consistently 0 200 400 600 800 1000 1200 1400 1600 1 21 41 61 81 101 121 141 161 181 201 221 241 261 281 301 321 341 361 381 401 421 441 461 481 501 521 541 561 581 601 621 641 661 681 701 721 741 761 781 801 821 841 861 881 901 921 941 961 981 PagesCompletedinApril Users TOP 1000 LINGUIST ROLE PRODUCTIVITY, PAGES IN APRIL 2016 Norm: 8 pages a day x 20 days 20 pages a day Probably not human translation 10 pages a day
  • 18. Project manager productivity 408 325 313 263 159 143 122 74 68 63 31 10 5 0 50 100 150 200 250 300 350 400 450 Renato Joana Kris John Bill Robert Alex Sandor Dave Millingan Mihiko Olga Barbora Job Created by PMs and Completed by Linguists in the last 30 days – test organization
  • 19. Benchmarking possibilities 674 440 428 94 37 13 9 5 12 10 7 0 100 200 300 400 500 600 700 800 1 or less from 1 to 10 from 11 to 100 from 101 to 200 from 200 to 300 from 300 to 400 from 400 to 500 from 500 to 600 from 501 to 1000 from 1001 to 2000 more than 2000 PM Productivity, Completed Jobs Per Month Number of jobs completed Numberofusers December – May 2016 Top 10%
  • 20. Project manager productivity 408 325 313 263 159 143 122 74 68 63 31 10 5 0 50 100 150 200 250 300 350 400 450 Renato Joana Kris John Bill Robert Alex Sandor Dave Millingan Mihiko Olga Barbora Job Created by PMs and Completed by Linguists in the last 30 days Top 10% of Global PM User Population
  • 21.
  • 22. “In fact, Big Data applications are bound only by the human imagination”. Peter Pham
  • 23. What you can do now • What to track? • How can organizations benefit from each other’s data? • Which data should not be shared?

Hinweis der Redaktion

  1. Some background here