SlideShare ist ein Scribd-Unternehmen logo
1 von 21
Downloaden Sie, um offline zu lesen
Multilingual Term Extraction
as a Service from Acrolinx
Ben Gottesman
Michael Klemme
Acrolinx
CHAT2013
Definitions
term extraction: automatically identifying potential terms in a
document (corpus)
multilingual term extraction: automatically identifying potential terms
and their translations in a document and its translation (parallel
corpus / translation memory)

The wizard begins creating the bootable image.

Der Assistent beginnt mit der Erstellung des bootfähigen Image.
(… or, if the source-language terminology already exists, just identify translations)
Synonyms
Identify same-language synonyms via translations in common
German

English

Die Spannungsversorgung für
die Elektronik wird vom
Speisegerät G526 sichergestellt.

The voltage supply for the
electronics is maintained by the
power supply unit G526.

Spannungsversorgung für
interne Speisung (X3e)

Power supply for internal supply
(X3e)

Unterspannung in der
Stromversorgung

Undervoltage in the power
supply

Spannungsversorgung
Stromversorgung

voltage supply
power supply
Outline
• What is multilingual term extraction?
• What is the workflow from customer perspective?
– customer use case examples
– show extraction results, demonstrate human validation

• How does the extraction work?
– how we identify candidates
•

source-language candidates

•

translation candidates

– how we filter translation candidates
– how we identify source-language synonyms

• What is Acrolinx and how does MTE fit in?
Outline
• What is multilingual term extraction?
• What is the workflow from customer perspective?
– customer use case examples
– show extraction results, demonstrate human validation

• How does the extraction work?
– how we identify candidates
•

source-language candidates

•

translation candidates

– how we filter translation candidates
– how we identify source-language synonyms

• What is Acrolinx and how does MTE fit in?
Workflow: Customer perspective
1. Customer provides translated documents
2. Acrolinx provides extracted multilingual term
candidates to customer

3. Customer validates candidates
4. Validated results become (or are added to)
customer’s term bank
Customer use cases, past examples
Use case 1
– de-<en,fr,es,it,pt> (mostly de-en)
– ~142,000 bilingual segments; ~2,685,000 tokens (total)

Use case 2
– de-<en,fr> (all data trilingual)
– ~132,000 bilingual segments; ~1,259,000 tokens
– data document-aligned, not segment-aligned, so extra step required

Use case 3
–
–
–
–

en-de
~942,000 bilingual segments; ~25,000,000 tokens
extract translations of a given list of keywords
determine which keywords don’t occur in data
Results

• human validation in Excel

“Baugruppe” has been translated
inconsistently into English in the past
Mark respective translations as
preferred/deprecated to guide translators
in the future.
Results

“Stromversorgung” and “Einspeisung” have translations in common.
→ automatically identified as possible synonyms, so same Cluster ID
To validate synonym link, edit Subcluster IDs to be the same.
Mark respective variants as preferred/deprecated to guide authors.
Outline
• What is multilingual term extraction?
• What is the workflow from customer perspective?
– customer use case examples
– show extraction results, demonstrate human validation

• How does the extraction work?
– how we identify candidates
•

source-language candidates

•

translation candidates

– how we filter translation candidates
– how we identify source-language synonyms

• What is Acrolinx and how does MTE fit in?
How does the extraction work?
• Extract source-language term candidates from
source-language text (unless source-language
terminology exists)
The wizard begins creating the bootable image.

– linguistics-based
• especially part-of-speech patterns

– same functionality built into the core Acrolinx
product
How does the extraction work?
• Extract translation candidates of each sourcelanguage term candidate from target-language
text
The wizard begins creating the bootable image.

Der Assistent beginnt mit der Erstellung des bootfähigen Image.

– use statistical phrase-alignment technology
– same used in statistical machine translation
How does the extraction work?
• Filter translation candidates

translation candidates for “Eingangsspannung” (pink = filtered out)

… based on:
– confidence score calculated from translation probabilities
•

can adjust threshold to favour precision or recall

– surface characteristics (closed-class words, punctuation)
– term-candidacy of translation (if possible for language)
How does the extraction work?
• Identify synonyms (‘cluster’ candidates)

cluster around “Stromwandler” (minimum link confidence threshold = 0.01)

– link confidence based on the degree to which translations are shared
– can adjust threshold to favour precision or recall of links
How does the extraction work?
• Identify synonyms (‘cluster’ candidates)

cluster around “Stromwandler” (minimum link confidence threshold = 0.03)

– link confidence based on the degree to which translations are shared
– can adjust threshold to favour precision or recall of links
Outline
• What is multilingual term extraction?
• What is the workflow from customer perspective?
– customer use case examples
– show extraction results, demonstrate human validation

• How does the extraction work?
– how we identify candidates
•

source-language candidates

•

translation candidates

– how we filter translation candidates
– how we identify source-language synonyms

• What is Acrolinx and how does MTE fit in?
What is Acrolinx?
Acrolinx is Content Optimization Software. It helps
authors make there text
– more correct,
– more consistent,
– and more readable.
What is Acrolinx?
Acrolinx is Content Optimization Software. It helps
authors make their text
– more correct,
– more consistent,
– and more readable.

Consistent use of terminology is an important factor in
the readability of text. Acrolinx provides:
– term extraction (monolingual, aka term harvesting)
– terminology management
– term checking

Multilingual Term Extraction as a Service is a natural
complement to the prior terminology functions.
Acrolinx @ tekom

Visit Acrolinx at tekom!
→ Hall 3, Stand 310
Outline
• What is multilingual term extraction?
• What is the workflow from customer perspective?
– customer use case examples
– show extraction results, demonstrate human validation

• How does the extraction work?
– how we identify candidates
•

source-language candidates

•

translation candidates

– how we filter translation candidates
– how we identify source-language synonyms

• What is Acrolinx and how does MTE fit in?
Questions?

Weitere ähnliche Inhalte

Was ist angesagt?

Extraction Based automatic summarization
Extraction Based automatic summarizationExtraction Based automatic summarization
Extraction Based automatic summarizationAbdelaziz Al-Rihawi
 
Minor Project Presentation 1
Minor Project Presentation 1Minor Project Presentation 1
Minor Project Presentation 1Pratishtha Ram
 
Document Summarization
Document SummarizationDocument Summarization
Document SummarizationPratik Kumar
 
PRONOUN DISAMBIGUATION: WITH APPLICATION TO THE WINOGRAD SCHEMA CHALLENGE
PRONOUN DISAMBIGUATION: WITH APPLICATION TO THE WINOGRAD SCHEMA CHALLENGEPRONOUN DISAMBIGUATION: WITH APPLICATION TO THE WINOGRAD SCHEMA CHALLENGE
PRONOUN DISAMBIGUATION: WITH APPLICATION TO THE WINOGRAD SCHEMA CHALLENGEkevig
 
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...Lifeng (Aaron) Han
 
Fusing Modeling and Programming into Language-Oriented Programming
Fusing Modeling and Programming into Language-Oriented ProgrammingFusing Modeling and Programming into Language-Oriented Programming
Fusing Modeling and Programming into Language-Oriented ProgrammingMarkus Voelter
 
Open vocabulary problem
Open vocabulary problemOpen vocabulary problem
Open vocabulary problemJaeHo Jang
 
Experiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine TranslationExperiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine Translationkhyati gupta
 
Frequently asked tcs technical interview questions and answers
Frequently asked tcs technical interview questions and answersFrequently asked tcs technical interview questions and answers
Frequently asked tcs technical interview questions and answersnishajj
 
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT Lifeng (Aaron) Han
 
Dissertation defense slides on "Semantic Analysis for Improved Multi-document...
Dissertation defense slides on "Semantic Analysis for Improved Multi-document...Dissertation defense slides on "Semantic Analysis for Improved Multi-document...
Dissertation defense slides on "Semantic Analysis for Improved Multi-document...Quinsulon Israel
 
Machine Translation: What it is?
Machine Translation: What it is?Machine Translation: What it is?
Machine Translation: What it is?Multilizer
 
JetBrains MPS: Structure Aspect
JetBrains MPS: Structure AspectJetBrains MPS: Structure Aspect
JetBrains MPS: Structure AspectMikhail Barash
 

Was ist angesagt? (20)

Extraction Based automatic summarization
Extraction Based automatic summarizationExtraction Based automatic summarization
Extraction Based automatic summarization
 
Text summarization
Text summarization Text summarization
Text summarization
 
Minor Project Presentation 1
Minor Project Presentation 1Minor Project Presentation 1
Minor Project Presentation 1
 
1 compiler outline
1 compiler outline1 compiler outline
1 compiler outline
 
Document Summarization
Document SummarizationDocument Summarization
Document Summarization
 
PRONOUN DISAMBIGUATION: WITH APPLICATION TO THE WINOGRAD SCHEMA CHALLENGE
PRONOUN DISAMBIGUATION: WITH APPLICATION TO THE WINOGRAD SCHEMA CHALLENGEPRONOUN DISAMBIGUATION: WITH APPLICATION TO THE WINOGRAD SCHEMA CHALLENGE
PRONOUN DISAMBIGUATION: WITH APPLICATION TO THE WINOGRAD SCHEMA CHALLENGE
 
Nlp
NlpNlp
Nlp
 
SMT3
SMT3SMT3
SMT3
 
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
cushLEPOR uses LABSE distilled knowledge to improve correlation with human tr...
 
Lexical1
Lexical1Lexical1
Lexical1
 
Transfer learning in nlp
Transfer learning in nlpTransfer learning in nlp
Transfer learning in nlp
 
Fusing Modeling and Programming into Language-Oriented Programming
Fusing Modeling and Programming into Language-Oriented ProgrammingFusing Modeling and Programming into Language-Oriented Programming
Fusing Modeling and Programming into Language-Oriented Programming
 
Open vocabulary problem
Open vocabulary problemOpen vocabulary problem
Open vocabulary problem
 
Experiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine TranslationExperiments with Different Models of Statistcial Machine Translation
Experiments with Different Models of Statistcial Machine Translation
 
Frequently asked tcs technical interview questions and answers
Frequently asked tcs technical interview questions and answersFrequently asked tcs technical interview questions and answers
Frequently asked tcs technical interview questions and answers
 
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT LEPOR: an augmented machine translation evaluation metric - Thesis PPT
LEPOR: an augmented machine translation evaluation metric - Thesis PPT
 
Dissertation defense slides on "Semantic Analysis for Improved Multi-document...
Dissertation defense slides on "Semantic Analysis for Improved Multi-document...Dissertation defense slides on "Semantic Analysis for Improved Multi-document...
Dissertation defense slides on "Semantic Analysis for Improved Multi-document...
 
Machine Translation: What it is?
Machine Translation: What it is?Machine Translation: What it is?
Machine Translation: What it is?
 
JetBrains MPS: Structure Aspect
JetBrains MPS: Structure AspectJetBrains MPS: Structure Aspect
JetBrains MPS: Structure Aspect
 
Ire major project
Ire major projectIre major project
Ire major project
 

Andere mochten auch

TAUS MT SHOWCASE, Is the Translation Industry Ready, Jaap van der Meer, TAUS...
TAUS MT SHOWCASE,  Is the Translation Industry Ready, Jaap van der Meer, TAUS...TAUS MT SHOWCASE,  Is the Translation Industry Ready, Jaap van der Meer, TAUS...
TAUS MT SHOWCASE, Is the Translation Industry Ready, Jaap van der Meer, TAUS...TAUS - The Language Data Network
 
TAUS MT SHOWCASE, Creating Competitive Advantage with Rapid Customization & D...
TAUS MT SHOWCASE, Creating Competitive Advantage with Rapid Customization & D...TAUS MT SHOWCASE, Creating Competitive Advantage with Rapid Customization & D...
TAUS MT SHOWCASE, Creating Competitive Advantage with Rapid Customization & D...TAUS - The Language Data Network
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Serge Gladhoff, Logrus...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Serge Gladhoff, Logrus...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Serge Gladhoff, Logrus...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Serge Gladhoff, Logrus...TAUS - The Language Data Network
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Gustavo Lucardi, Trust...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Gustavo Lucardi, Trust...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Gustavo Lucardi, Trust...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Gustavo Lucardi, Trust...TAUS - The Language Data Network
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Jie Jiang, Applied lan...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Jie Jiang, Applied lan...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Jie Jiang, Applied lan...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Jie Jiang, Applied lan...TAUS - The Language Data Network
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Andrejs Vasiljevs, Til...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Andrejs Vasiljevs, Til...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Andrejs Vasiljevs, Til...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Andrejs Vasiljevs, Til...TAUS - The Language Data Network
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Diego Bartolome, tauyo...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Diego Bartolome, tauyo...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Diego Bartolome, tauyo...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Diego Bartolome, tauyo...TAUS - The Language Data Network
 
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...TAUS - The Language Data Network
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Jie Jiang, Applied la...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Jie Jiang, Applied la...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Jie Jiang, Applied la...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Jie Jiang, Applied la...TAUS - The Language Data Network
 
TAUS MT SHOWCASE, Moses in the Mix. A Technology Agnostic Approach to a Winni...
TAUS MT SHOWCASE, Moses in the Mix. A Technology Agnostic Approach to a Winni...TAUS MT SHOWCASE, Moses in the Mix. A Technology Agnostic Approach to a Winni...
TAUS MT SHOWCASE, Moses in the Mix. A Technology Agnostic Approach to a Winni...TAUS - The Language Data Network
 
TAUS MT SHOWCASE, The Open Source MT System Moses and Its Use in the Industry...
TAUS MT SHOWCASE, The Open Source MT System Moses and Its Use in the Industry...TAUS MT SHOWCASE, The Open Source MT System Moses and Its Use in the Industry...
TAUS MT SHOWCASE, The Open Source MT System Moses and Its Use in the Industry...TAUS - The Language Data Network
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...TAUS - The Language Data Network
 
TAUS MT SHOWCASE, Moses and Other Resources, Rahzeb Choudhury, TAUS, 10 April...
TAUS MT SHOWCASE, Moses and Other Resources, Rahzeb Choudhury, TAUS, 10 April...TAUS MT SHOWCASE, Moses and Other Resources, Rahzeb Choudhury, TAUS, 10 April...
TAUS MT SHOWCASE, Moses and Other Resources, Rahzeb Choudhury, TAUS, 10 April...TAUS - The Language Data Network
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Chengqing Zong, Casia...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Chengqing Zong, Casia...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Chengqing Zong, Casia...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Chengqing Zong, Casia...TAUS - The Language Data Network
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Joel Sigling, AVB, 25 ...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Joel Sigling, AVB, 25 ...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Joel Sigling, AVB, 25 ...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Joel Sigling, AVB, 25 ...TAUS - The Language Data Network
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Kerstin Bier, Sybase, 4...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Kerstin Bier, Sybase, 4...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Kerstin Bier, Sybase, 4...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Kerstin Bier, Sybase, 4...TAUS - The Language Data Network
 
TAUS MT SHOWCASE, Microsoft Translator, Chris Wendt, Microsoft, 10 October 2013
TAUS MT SHOWCASE,  Microsoft Translator, Chris Wendt, Microsoft, 10 October 2013TAUS MT SHOWCASE,  Microsoft Translator, Chris Wendt, Microsoft, 10 October 2013
TAUS MT SHOWCASE, Microsoft Translator, Chris Wendt, Microsoft, 10 October 2013TAUS - The Language Data Network
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Yu Gong, Adobe, 23 Ap...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Yu Gong, Adobe, 23 Ap...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Yu Gong, Adobe, 23 Ap...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Yu Gong, Adobe, 23 Ap...TAUS - The Language Data Network
 
TAUS MT SHOWCASE, Google Translator Toolkit, Patcharin Areewong, Google, 10 A...
TAUS MT SHOWCASE, Google Translator Toolkit, Patcharin Areewong, Google, 10 A...TAUS MT SHOWCASE, Google Translator Toolkit, Patcharin Areewong, Google, 10 A...
TAUS MT SHOWCASE, Google Translator Toolkit, Patcharin Areewong, Google, 10 A...TAUS - The Language Data Network
 

Andere mochten auch (19)

TAUS MT SHOWCASE, Is the Translation Industry Ready, Jaap van der Meer, TAUS...
TAUS MT SHOWCASE,  Is the Translation Industry Ready, Jaap van der Meer, TAUS...TAUS MT SHOWCASE,  Is the Translation Industry Ready, Jaap van der Meer, TAUS...
TAUS MT SHOWCASE, Is the Translation Industry Ready, Jaap van der Meer, TAUS...
 
TAUS MT SHOWCASE, Creating Competitive Advantage with Rapid Customization & D...
TAUS MT SHOWCASE, Creating Competitive Advantage with Rapid Customization & D...TAUS MT SHOWCASE, Creating Competitive Advantage with Rapid Customization & D...
TAUS MT SHOWCASE, Creating Competitive Advantage with Rapid Customization & D...
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Serge Gladhoff, Logrus...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Serge Gladhoff, Logrus...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Serge Gladhoff, Logrus...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Serge Gladhoff, Logrus...
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Gustavo Lucardi, Trust...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Gustavo Lucardi, Trust...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Gustavo Lucardi, Trust...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Gustavo Lucardi, Trust...
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Jie Jiang, Applied lan...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Jie Jiang, Applied lan...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Jie Jiang, Applied lan...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Jie Jiang, Applied lan...
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Andrejs Vasiljevs, Til...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Andrejs Vasiljevs, Til...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Andrejs Vasiljevs, Til...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Andrejs Vasiljevs, Til...
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Diego Bartolome, tauyo...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Diego Bartolome, tauyo...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Diego Bartolome, tauyo...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Diego Bartolome, tauyo...
 
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
TAUS MT SHOWCASE, The WeMT Program, Olga Beregovaya, Welocalize, 10 October 2...
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Jie Jiang, Applied la...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Jie Jiang, Applied la...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Jie Jiang, Applied la...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Jie Jiang, Applied la...
 
TAUS MT SHOWCASE, Moses in the Mix. A Technology Agnostic Approach to a Winni...
TAUS MT SHOWCASE, Moses in the Mix. A Technology Agnostic Approach to a Winni...TAUS MT SHOWCASE, Moses in the Mix. A Technology Agnostic Approach to a Winni...
TAUS MT SHOWCASE, Moses in the Mix. A Technology Agnostic Approach to a Winni...
 
TAUS MT SHOWCASE, The Open Source MT System Moses and Its Use in the Industry...
TAUS MT SHOWCASE, The Open Source MT System Moses and Its Use in the Industry...TAUS MT SHOWCASE, The Open Source MT System Moses and Its Use in the Industry...
TAUS MT SHOWCASE, The Open Source MT System Moses and Its Use in the Industry...
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Manuel Herranz, Pangean...
 
TAUS MT SHOWCASE, Moses and Other Resources, Rahzeb Choudhury, TAUS, 10 April...
TAUS MT SHOWCASE, Moses and Other Resources, Rahzeb Choudhury, TAUS, 10 April...TAUS MT SHOWCASE, Moses and Other Resources, Rahzeb Choudhury, TAUS, 10 April...
TAUS MT SHOWCASE, Moses and Other Resources, Rahzeb Choudhury, TAUS, 10 April...
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Chengqing Zong, Casia...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Chengqing Zong, Casia...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Chengqing Zong, Casia...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Chengqing Zong, Casia...
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Joel Sigling, AVB, 25 ...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Joel Sigling, AVB, 25 ...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Joel Sigling, AVB, 25 ...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Monaco, Joel Sigling, AVB, 25 ...
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Kerstin Bier, Sybase, 4...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Kerstin Bier, Sybase, 4...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Kerstin Bier, Sybase, 4...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Paris, Kerstin Bier, Sybase, 4...
 
TAUS MT SHOWCASE, Microsoft Translator, Chris Wendt, Microsoft, 10 October 2013
TAUS MT SHOWCASE,  Microsoft Translator, Chris Wendt, Microsoft, 10 October 2013TAUS MT SHOWCASE,  Microsoft Translator, Chris Wendt, Microsoft, 10 October 2013
TAUS MT SHOWCASE, Microsoft Translator, Chris Wendt, Microsoft, 10 October 2013
 
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Yu Gong, Adobe, 23 Ap...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Yu Gong, Adobe, 23 Ap...TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Yu Gong, Adobe, 23 Ap...
TAUS OPEN SOURCE MACHINE TRANSLATION SHOWCASE, Beijing, Yu Gong, Adobe, 23 Ap...
 
TAUS MT SHOWCASE, Google Translator Toolkit, Patcharin Areewong, Google, 10 A...
TAUS MT SHOWCASE, Google Translator Toolkit, Patcharin Areewong, Google, 10 A...TAUS MT SHOWCASE, Google Translator Toolkit, Patcharin Areewong, Google, 10 A...
TAUS MT SHOWCASE, Google Translator Toolkit, Patcharin Areewong, Google, 10 A...
 

Ähnlich wie Multilingual Term Extraction as a Service from Acrolinx, CHAT2013

Search explained T3DD15
Search explained T3DD15Search explained T3DD15
Search explained T3DD15Hans Höchtl
 
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...QuantInsti
 
Towards a Quality Assessment of Web Corpora for Language Technology Applications
Towards a Quality Assessment of Web Corpora for Language Technology ApplicationsTowards a Quality Assessment of Web Corpora for Language Technology Applications
Towards a Quality Assessment of Web Corpora for Language Technology ApplicationsMarina Santini
 
Deciphering voice of customer through speech analytics
Deciphering voice of customer through speech analyticsDeciphering voice of customer through speech analytics
Deciphering voice of customer through speech analyticsR Systems International
 
Beginning text analysis
Beginning text analysisBeginning text analysis
Beginning text analysisBarry DeCicco
 
CASE tools and their effects on software quality
CASE tools and their effects on software qualityCASE tools and their effects on software quality
CASE tools and their effects on software qualityUtkarsh Agarwal
 
Switch case and looping statement
Switch case and looping statementSwitch case and looping statement
Switch case and looping statement_jenica
 
RDBMS
RDBMSRDBMS
RDBMSsowfi
 
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015RIILP
 
Classification of Machine Translation Outputs Using NB Classifier and SVM for...
Classification of Machine Translation Outputs Using NB Classifier and SVM for...Classification of Machine Translation Outputs Using NB Classifier and SVM for...
Classification of Machine Translation Outputs Using NB Classifier and SVM for...mlaij
 
Natural Language Processing Advancements By Deep Learning - A Survey
Natural Language Processing Advancements By Deep Learning - A SurveyNatural Language Processing Advancements By Deep Learning - A Survey
Natural Language Processing Advancements By Deep Learning - A SurveyAkshayaNagarajan10
 
Intelligent query converter a domain independent interfacefor conversion
Intelligent query converter a domain independent interfacefor conversionIntelligent query converter a domain independent interfacefor conversion
Intelligent query converter a domain independent interfacefor conversionIAEME Publication
 
Project Presentation
Project PresentationProject Presentation
Project Presentationbutest
 
Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...RajkiranVeluri
 

Ähnlich wie Multilingual Term Extraction as a Service from Acrolinx, CHAT2013 (20)

Natural Language Processing using Java
Natural Language Processing using JavaNatural Language Processing using Java
Natural Language Processing using Java
 
Search explained T3DD15
Search explained T3DD15Search explained T3DD15
Search explained T3DD15
 
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...
Masterclass: Natural Language Processing in Trading with Terry Benzschawel & ...
 
Towards a Quality Assessment of Web Corpora for Language Technology Applications
Towards a Quality Assessment of Web Corpora for Language Technology ApplicationsTowards a Quality Assessment of Web Corpora for Language Technology Applications
Towards a Quality Assessment of Web Corpora for Language Technology Applications
 
1 cc
1 cc1 cc
1 cc
 
Deciphering voice of customer through speech analytics
Deciphering voice of customer through speech analyticsDeciphering voice of customer through speech analytics
Deciphering voice of customer through speech analytics
 
SEppt
SEpptSEppt
SEppt
 
Beginning text analysis
Beginning text analysisBeginning text analysis
Beginning text analysis
 
CASE tools and their effects on software quality
CASE tools and their effects on software qualityCASE tools and their effects on software quality
CASE tools and their effects on software quality
 
Switch case and looping statement
Switch case and looping statementSwitch case and looping statement
Switch case and looping statement
 
Bi-lingual Word Sense Induction
Bi-lingual Word Sense InductionBi-lingual Word Sense Induction
Bi-lingual Word Sense Induction
 
Translationusing moses1
Translationusing moses1Translationusing moses1
Translationusing moses1
 
Moses
MosesMoses
Moses
 
RDBMS
RDBMSRDBMS
RDBMS
 
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015
ER1 Eduard Barbu - EXPERT Summer School - Malaga 2015
 
Classification of Machine Translation Outputs Using NB Classifier and SVM for...
Classification of Machine Translation Outputs Using NB Classifier and SVM for...Classification of Machine Translation Outputs Using NB Classifier and SVM for...
Classification of Machine Translation Outputs Using NB Classifier and SVM for...
 
Natural Language Processing Advancements By Deep Learning - A Survey
Natural Language Processing Advancements By Deep Learning - A SurveyNatural Language Processing Advancements By Deep Learning - A Survey
Natural Language Processing Advancements By Deep Learning - A Survey
 
Intelligent query converter a domain independent interfacefor conversion
Intelligent query converter a domain independent interfacefor conversionIntelligent query converter a domain independent interfacefor conversion
Intelligent query converter a domain independent interfacefor conversion
 
Project Presentation
Project PresentationProject Presentation
Project Presentation
 
Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...Natural Language Processing, Techniques, Current Trends and Applications in I...
Natural Language Processing, Techniques, Current Trends and Applications in I...
 

Mehr von TAUS - The Language Data Network

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS - The Language Data Network
 
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS - The Language Data Network
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...TAUS - The Language Data Network
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)TAUS - The Language Data Network
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...TAUS - The Language Data Network
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...TAUS - The Language Data Network
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...TAUS - The Language Data Network
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...TAUS - The Language Data Network
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...TAUS - The Language Data Network
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...TAUS - The Language Data Network
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)TAUS - The Language Data Network
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...TAUS - The Language Data Network
 

Mehr von TAUS - The Language Data Network (20)

TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
TAUS Global Content Summit Amsterdam 2019 / Beyond MT. A few premature reflec...
 
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
TAUS Global Content Summit Amsterdam 2019 / Measure with DQF, Dace Dzeguze (T...
 
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
TAUS Global Content Summit Amsterdam 2019 / Automatic for the People by Domin...
 
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
TAUS Global Content Summit Amsterdam 2019 / The Quantum Leap: Human Parity, C...
 
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
TAUS Global Content Summit Amsterdam 2019 / Growing Business by Connecting Co...
 
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
Achieving Translation Efficiency and Accuracy for Video Content, Xiao Yuan (P...
 
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
Introduction Innovation Contest Shenzhen by Henri Broekmate (Lionbridge)
 
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann... Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
Game Changer for Linguistic Review: Shifting the Paradigm, Klaus Fleischmann...
 
A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...A translation memory P2P trading platform - to make global translation memory...
A translation memory P2P trading platform - to make global translation memory...
 
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
Shiyibao — The Most Efficient Translation Feedback System Ever, Guanqing Hao ...
 
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
Stepes – Instant Human Translation Services for the Digital World, Carl Yao (...
 
Farmer Lv (TrueTran)
Farmer Lv (TrueTran)Farmer Lv (TrueTran)
Farmer Lv (TrueTran)
 
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
Smart Translation Resource Management: Semantic Matching, Kirk Zhang (Wiitran...
 
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 The Theory and Practice of Computer Aided Translation Training System, Liu Q... The Theory and Practice of Computer Aided Translation Training System, Liu Q...
The Theory and Practice of Computer Aided Translation Training System, Liu Q...
 
Translation Technology Showcase in Shenzhen
Translation Technology Showcase in ShenzhenTranslation Technology Showcase in Shenzhen
Translation Technology Showcase in Shenzhen
 
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
How to efficiently use large-scale TMs in translation, Jing Zhang (Tmxmall)
 
SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)SDL Trados Studio 2017, Jocelyn He (SDL)
SDL Trados Studio 2017, Jocelyn He (SDL)
 
How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)How we train post-editors - Yongpeng Wei (Lingosail)
How we train post-editors - Yongpeng Wei (Lingosail)
 
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 A use-case for getting MT into your company, Kerstin Berns (berns language c... A use-case for getting MT into your company, Kerstin Berns (berns language c...
A use-case for getting MT into your company, Kerstin Berns (berns language c...
 
QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)QE integrated in XTM, by Bob Willans (XTM)
QE integrated in XTM, by Bob Willans (XTM)
 

Kürzlich hochgeladen

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsJoaquim Jorge
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)Gabriella Davis
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfEnterprise Knowledge
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEarley Information Science
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Igalia
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 

Kürzlich hochgeladen (20)

Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdfThe Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptxEIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
EIS-Webinar-Prompt-Knowledge-Eng-2024-04-08.pptx
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 

Multilingual Term Extraction as a Service from Acrolinx, CHAT2013

  • 1. Multilingual Term Extraction as a Service from Acrolinx Ben Gottesman Michael Klemme Acrolinx CHAT2013
  • 2. Definitions term extraction: automatically identifying potential terms in a document (corpus) multilingual term extraction: automatically identifying potential terms and their translations in a document and its translation (parallel corpus / translation memory) The wizard begins creating the bootable image. Der Assistent beginnt mit der Erstellung des bootfähigen Image. (… or, if the source-language terminology already exists, just identify translations)
  • 3. Synonyms Identify same-language synonyms via translations in common German English Die Spannungsversorgung für die Elektronik wird vom Speisegerät G526 sichergestellt. The voltage supply for the electronics is maintained by the power supply unit G526. Spannungsversorgung für interne Speisung (X3e) Power supply for internal supply (X3e) Unterspannung in der Stromversorgung Undervoltage in the power supply Spannungsversorgung Stromversorgung voltage supply power supply
  • 4. Outline • What is multilingual term extraction? • What is the workflow from customer perspective? – customer use case examples – show extraction results, demonstrate human validation • How does the extraction work? – how we identify candidates • source-language candidates • translation candidates – how we filter translation candidates – how we identify source-language synonyms • What is Acrolinx and how does MTE fit in?
  • 5. Outline • What is multilingual term extraction? • What is the workflow from customer perspective? – customer use case examples – show extraction results, demonstrate human validation • How does the extraction work? – how we identify candidates • source-language candidates • translation candidates – how we filter translation candidates – how we identify source-language synonyms • What is Acrolinx and how does MTE fit in?
  • 6. Workflow: Customer perspective 1. Customer provides translated documents 2. Acrolinx provides extracted multilingual term candidates to customer 3. Customer validates candidates 4. Validated results become (or are added to) customer’s term bank
  • 7. Customer use cases, past examples Use case 1 – de-<en,fr,es,it,pt> (mostly de-en) – ~142,000 bilingual segments; ~2,685,000 tokens (total) Use case 2 – de-<en,fr> (all data trilingual) – ~132,000 bilingual segments; ~1,259,000 tokens – data document-aligned, not segment-aligned, so extra step required Use case 3 – – – – en-de ~942,000 bilingual segments; ~25,000,000 tokens extract translations of a given list of keywords determine which keywords don’t occur in data
  • 8. Results • human validation in Excel “Baugruppe” has been translated inconsistently into English in the past Mark respective translations as preferred/deprecated to guide translators in the future.
  • 9. Results “Stromversorgung” and “Einspeisung” have translations in common. → automatically identified as possible synonyms, so same Cluster ID To validate synonym link, edit Subcluster IDs to be the same. Mark respective variants as preferred/deprecated to guide authors.
  • 10. Outline • What is multilingual term extraction? • What is the workflow from customer perspective? – customer use case examples – show extraction results, demonstrate human validation • How does the extraction work? – how we identify candidates • source-language candidates • translation candidates – how we filter translation candidates – how we identify source-language synonyms • What is Acrolinx and how does MTE fit in?
  • 11. How does the extraction work? • Extract source-language term candidates from source-language text (unless source-language terminology exists) The wizard begins creating the bootable image. – linguistics-based • especially part-of-speech patterns – same functionality built into the core Acrolinx product
  • 12. How does the extraction work? • Extract translation candidates of each sourcelanguage term candidate from target-language text The wizard begins creating the bootable image. Der Assistent beginnt mit der Erstellung des bootfähigen Image. – use statistical phrase-alignment technology – same used in statistical machine translation
  • 13. How does the extraction work? • Filter translation candidates translation candidates for “Eingangsspannung” (pink = filtered out) … based on: – confidence score calculated from translation probabilities • can adjust threshold to favour precision or recall – surface characteristics (closed-class words, punctuation) – term-candidacy of translation (if possible for language)
  • 14. How does the extraction work? • Identify synonyms (‘cluster’ candidates) cluster around “Stromwandler” (minimum link confidence threshold = 0.01) – link confidence based on the degree to which translations are shared – can adjust threshold to favour precision or recall of links
  • 15. How does the extraction work? • Identify synonyms (‘cluster’ candidates) cluster around “Stromwandler” (minimum link confidence threshold = 0.03) – link confidence based on the degree to which translations are shared – can adjust threshold to favour precision or recall of links
  • 16. Outline • What is multilingual term extraction? • What is the workflow from customer perspective? – customer use case examples – show extraction results, demonstrate human validation • How does the extraction work? – how we identify candidates • source-language candidates • translation candidates – how we filter translation candidates – how we identify source-language synonyms • What is Acrolinx and how does MTE fit in?
  • 17. What is Acrolinx? Acrolinx is Content Optimization Software. It helps authors make there text – more correct, – more consistent, – and more readable.
  • 18. What is Acrolinx? Acrolinx is Content Optimization Software. It helps authors make their text – more correct, – more consistent, – and more readable. Consistent use of terminology is an important factor in the readability of text. Acrolinx provides: – term extraction (monolingual, aka term harvesting) – terminology management – term checking Multilingual Term Extraction as a Service is a natural complement to the prior terminology functions.
  • 19. Acrolinx @ tekom Visit Acrolinx at tekom! → Hall 3, Stand 310
  • 20. Outline • What is multilingual term extraction? • What is the workflow from customer perspective? – customer use case examples – show extraction results, demonstrate human validation • How does the extraction work? – how we identify candidates • source-language candidates • translation candidates – how we filter translation candidates – how we identify source-language synonyms • What is Acrolinx and how does MTE fit in?