SlideShare a Scribd company logo
1 of 39
Download to read offline
1
2
Automatic speech
recognition for mobile
applications in Yandex
Automatic speech
recognition for mobile
applications in Yandex
Fran CampilloFran Campillo
3
OutlineOutline
●
Motivation.
●
Road map.
●
Automatic speech recognition.
●
Data collection.
●
Experiments.
●
Results.
●
Motivation.
●
Road map.
●
Automatic speech recognition.
●
Data collection.
●
Experiments.
●
Results.
4
MotivationMotivation
5
MotivationMotivation
6
Road mapRoad map
7
Road mapRoad map
●
Sep-2011: study of open source tools and data
collection.
– HTK, Sphinx, Rasr, Kaldi,...
– Service provided by 3rd
party.
●
Jan-2012: development of in-house technology.
●
Jan-2013: launching of own services.
●
Sep-2011: study of open source tools and data
collection.
– HTK, Sphinx, Rasr, Kaldi,...
– Service provided by 3rd
party.
●
Jan-2012: development of in-house technology.
●
Jan-2013: launching of own services.
8
Automatic speech
recognition
Automatic speech
recognition
9
ASR: complexityASR: complexity
Style Planned Spontaneous
Audio quality CD Telephone
Vocabulary size Hundreds Hundreds of thousands
Number of speakers One Many
Recognition rate WorseWorseBetterBetter
Complexity BiggerBiggerSmallerSmaller
10
Word pronunciationsWord pronunciations
●
ASR: sounds => words.
●
How is a word pronounced?
– Line => /'laɪn/.
– Linear => /'lɪnɪɘʳ/
●
Need a mapping from writing to
phonemes: G2P.
11
Word pronunciations: dictionaryWord pronunciations: dictionary
а a
аб a tc p
абад a dc b a tc t
абаза a dc b a z a
абакан a dc b a tc k ax n
абакана a dc b a tc k a n a
абакане a dc b a tc k a nj e
абаканская a dc b a tc k a n s tc k ax j a
абаканский a dc b a tc k a n s tc kj I j
абакумова a dc b a tc k u m ax v a
абанский a dc b a n s tc kj I j
абганеровская a dc b dc g ax nj I r ax f s tc k ax j a
абдулино a dc b dc dK& u lj i n a
абельмановская a dc bj e lj m ax n ax f s tc k ax j a
абзаково a dc b z a tc k o v a
абзелиловский a dc b zj i lj i l ax f s tc kj I j
12
Speech parametrizationSpeech parametrization
Phone /a/ Phone /i/
13
ASR: the problemASR: the problem
●
We have a sequence of observations:
– O = {o1
, o2
, …, oT
}
– oi
is a feature vector representing a speech frame.
● Goal: finding the likeliest sequence of words wi
for O:
argmax
i
P(wi/O)argmax
i
P(wi/O)
14
ASR: the problem (II)ASR: the problem (II)
● We cannot compute directly P(wi
/O).
●
Bayes: P(wi/O)=
P(O/wi)P(wi)
P(O)
argmax
i
P(wi/O)=argmax
i
{P(O/wi)P(wi)}
Acoustic model Language model
15
Language modelLanguage model
●
Probability of sequences of words:
– “We will rock you” => P1
.
– “Will will rock you” => P2
.
●
Trained on large corpora.
●
The closer to the application domain, the
better.
16
Acoustic model: Hidden Markov ModelsAcoustic model: Hidden Markov Models
●
HMM of first order: sequence of states that depend only on the state
before, and are associated to events we can observe
●
Typical layout for ASR:
Q1
Q2
Q3
a11
a12
a22
a23
a33
b1
(o) b2
(o) b3
(o)
● aij
: transition probabilities.
● bj
(o): probability of observation o in state j.
17
Acoustic model: HMM and speechAcoustic model: HMM and speech
●
Each state models a part of the phoneme:
– 1st
: beginning of the phoneme.
– 2nd
: stationary part.
– 3rd
: end of the phoneme.
● aij
: duration of each part.
● bj
(o): probability of producing a vector of features o in
state j.
18
Modeling probability of observationModeling probability of observation
●
Gaussian mixtures:
– cjm
= weight of mth
Gaussian of state j.
– μjm
=> average (vector) of mth
Gaussian of state j.
– ∑jm
=> covariance matrix of mth
Gaussian of state j.
●
Neural networks.
bj(x)=∑m c jm N (x ,μ jm,Σjm)
19
Waveform, phonemes, frames, and statesWaveform, phonemes, frames, and states
/o//o/
to1
o2
o3
o4
o5
o6
o7
o8
o9
o10
/o//o/
Q1
Q2
Q3
Q1
=> o1
, o2
Q2
=> o3
, o4
, o5
, o6
, o7
Q3
= > o8
, o9
, o10
μ3m,
∑3m,
c3m
μ2m,
∑2m,
c2m
μ1m,
∑1m,
c1m
20
Block diagram for trainingBlock diagram for training
Initialization
Baum-Welch
HMM Parameters update
Convergence
Prototype HMM
No
Trained models
Yes
Initial μ0m,
∑j0m,
com
for the GMMs
Alignments of the
training sentences
(observations to states)
New estimations
for μjm
, ∑jm
, cjm
Training sentences
21
DecodingDecoding
●
Lexicon: words that can be recognized.
●
Decoder: dynamic programming, with the constraints imposed
by the lexicon, the acoustic models, and the language model.
Parametrize
Lexicon
Acoustic
models
Language
model
DecoderSpeech signal Words
22
Our decoderOur decoder
●
Based on Weighted Finite State transducers.
●
The lexicon, the language model, and the
acoustic model are composed into a single
structure.
–Same information, but more efficient.
Lexicon
Acoustic
models
Language
model
HCLG
23
Composition of WFST: exampleComposition of WFST: example
Lexicon
Language
model
0 1
B:Bob
2
ah:
3
b:
4
l: likes
5
ay: k:
6
s:
24
Data collectionData collection
25
Data collectionData collection
●
Speech samples taken from the field.
●
Manual transcriptions:
– Speaker features: gender, native,...
– Anomalies in the pronunciation.
– Noises in the recording.
26
Manual transcriptionsManual transcriptions
●
600k recordings.
●
Uncompressed format: 8KHz and 16KHz.
●
286020 different speakers.
Percentage (%)
Native 87.7
Male 83.3
Female 8.5
Child 8.2
27
Manual transcriptionsManual transcriptions
●
Percentage of records without anomalies: 7.4%
Anomalies Percentage (%)
side_speech 14.4
speech-in-noise 71.5
Indistinguishable 3.7
mouth_noise 3.6
breath_noise 6.3
Irregular pronunciations 5.3
Hesitations 0.5
Fragments 5.5
Transient noise 14.0
Foreign words 0.1
28
Manual transcriptions: examplesManual transcriptions: examples
●
марциальные воды male, native
●
*трёx#пруд#ньій* male, native, speech-in-noise
●
[side_speech] чкалова male, native, speech-in-noise,
bad-audio тр
29
VisualQAVisualQA
30
ExperimentsExperiments
31
Grapheme-2-phonemeGrapheme-2-phoneme
●
Sequitur:
– Based on joined sequence models.
– Accuracy => 2.09% phoneme error rate.
●
Phonetisaurus:
– WFST.
– Accuracy => 1.04% phoneme error rate.
●
Special treatment for Latin words:
– G2P trained on transliterated version of Russian pronunciation (for
example: whatsapp => уотсап).
32
Noise modelsNoise models
33
Experiments: acoustic model vs. language modelExperiments: acoustic model vs. language model
34
Experiments: number of GaussiansExperiments: number of Gaussians
35
ResultsResults
36
Users: NavigatorUsers: Navigator
37
●
Results relative to our WER in each experiment (in red, experiments in which our system is
outperformed):
Results: relative word error rateResults: relative word error rate
Maps Navigation General search
Yandex-GMM 1 1 1
3rd Party 44.6% 31.8% 37.3%
Competitor 1.9% -9.7% -23.4%
General search
Yandex-DNN 1
Competitor 6.6%
38
Thanks for your
attention!
Thanks for your
attention!
39
Fran CampilloFran Campillo
Senior Software EngineerSenior Software Engineer
Yandex Speech GroupYandex Speech Group
francampillo@yandex-team.rufrancampillo@yandex-team.ru
PhDPhD

More Related Content

What's hot

Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognitionfathitarek
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognitionboddu syamprasad
 
Hindi digits recognition system on speech data collected in different natural...
Hindi digits recognition system on speech data collected in different natural...Hindi digits recognition system on speech data collected in different natural...
Hindi digits recognition system on speech data collected in different natural...csandit
 
Natural Language Processing in Alternative and Augmentative Communication
Natural Language Processing in Alternative and Augmentative CommunicationNatural Language Processing in Alternative and Augmentative Communication
Natural Language Processing in Alternative and Augmentative CommunicationDivya Sugumar
 
Introduction to Transformers for NLP - Olga Petrova
Introduction to Transformers for NLP - Olga PetrovaIntroduction to Transformers for NLP - Olga Petrova
Introduction to Transformers for NLP - Olga PetrovaAlexey Grigorev
 
Machine translation from English to Hindi
Machine translation from English to HindiMachine translation from English to Hindi
Machine translation from English to HindiRajat Jain
 
Natural language processing
Natural language processingNatural language processing
Natural language processingBasha Chand
 
Speech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speechSpeech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speechSubmissionResearchpa
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)Yuriy Guts
 
NLP using transformers
NLP using transformers NLP using transformers
NLP using transformers Arvind Devaraj
 
Big Data and Natural Language Processing
Big Data and Natural Language ProcessingBig Data and Natural Language Processing
Big Data and Natural Language ProcessingMichel Bruley
 
Speaker recognition on matlab
Speaker recognition on matlabSpeaker recognition on matlab
Speaker recognition on matlabArcanjo Salazaku
 
Natural Language Processing for Games Research
Natural Language Processing for Games ResearchNatural Language Processing for Games Research
Natural Language Processing for Games ResearchJose Zagal
 
Bachelors project summary
Bachelors project summaryBachelors project summary
Bachelors project summaryAditya Deshmukh
 
Minor Project Presentation 1
Minor Project Presentation 1Minor Project Presentation 1
Minor Project Presentation 1Pratishtha Ram
 

What's hot (20)

Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
NLP
NLPNLP
NLP
 
Sphinx4
Sphinx4Sphinx4
Sphinx4
 
Hindi digits recognition system on speech data collected in different natural...
Hindi digits recognition system on speech data collected in different natural...Hindi digits recognition system on speech data collected in different natural...
Hindi digits recognition system on speech data collected in different natural...
 
Introduction to Transformer Model
Introduction to Transformer ModelIntroduction to Transformer Model
Introduction to Transformer Model
 
Natural Language Processing in Alternative and Augmentative Communication
Natural Language Processing in Alternative and Augmentative CommunicationNatural Language Processing in Alternative and Augmentative Communication
Natural Language Processing in Alternative and Augmentative Communication
 
Introduction to Transformers for NLP - Olga Petrova
Introduction to Transformers for NLP - Olga PetrovaIntroduction to Transformers for NLP - Olga Petrova
Introduction to Transformers for NLP - Olga Petrova
 
Nltk
NltkNltk
Nltk
 
Machine translation from English to Hindi
Machine translation from English to HindiMachine translation from English to Hindi
Machine translation from English to Hindi
 
Natural language processing
Natural language processingNatural language processing
Natural language processing
 
ppt
pptppt
ppt
 
Speech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speechSpeech Recognition: Transcription and transformation of human speech
Speech Recognition: Transcription and transformation of human speech
 
Natural Language Processing (NLP)
Natural Language Processing (NLP)Natural Language Processing (NLP)
Natural Language Processing (NLP)
 
NLP using transformers
NLP using transformers NLP using transformers
NLP using transformers
 
Big Data and Natural Language Processing
Big Data and Natural Language ProcessingBig Data and Natural Language Processing
Big Data and Natural Language Processing
 
Speaker recognition on matlab
Speaker recognition on matlabSpeaker recognition on matlab
Speaker recognition on matlab
 
Natural Language Processing for Games Research
Natural Language Processing for Games ResearchNatural Language Processing for Games Research
Natural Language Processing for Games Research
 
Bachelors project summary
Bachelors project summaryBachelors project summary
Bachelors project summary
 
Minor Project Presentation 1
Minor Project Presentation 1Minor Project Presentation 1
Minor Project Presentation 1
 

Viewers also liked

Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition systemAlok Tiwari
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognitionCharu Joshi
 
Principal characteristics of speech
Principal characteristics of speechPrincipal characteristics of speech
Principal characteristics of speechNikolay Karpov
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech RecognitionYogesh Vijay
 
NFC Patent Analysis_Report-II
NFC Patent Analysis_Report-IINFC Patent Analysis_Report-II
NFC Patent Analysis_Report-IIHarshad Karmarkar
 
Voice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesVoice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesHarshad Karmarkar
 
Noise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech RecognitionNoise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech Recognitionأحلام انصارى
 
Speech recognition system
Speech recognition systemSpeech recognition system
Speech recognition systemRipal Ranpara
 
Advances in Voice & Speech Recognition
Advances in Voice & Speech RecognitionAdvances in Voice & Speech Recognition
Advances in Voice & Speech RecognitionRising Media, Inc.
 
PatSeer Patent Database Overview
PatSeer Patent Database OverviewPatSeer Patent Database Overview
PatSeer Patent Database OverviewHarshad Karmarkar
 
A study of EMG based Speech Recognition
A study of EMG  based Speech Recognition A study of EMG  based Speech Recognition
A study of EMG based Speech Recognition vetrivel D
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognitionManthan Gandhi
 
Developing with Speech and Voice Recognition in Mobile Apps
Developing with Speech and Voice Recognition in Mobile AppsDeveloping with Speech and Voice Recognition in Mobile Apps
Developing with Speech and Voice Recognition in Mobile AppsNick Landry
 
Speech recognition: ready to take off?
Speech recognition: ready to take off?Speech recognition: ready to take off?
Speech recognition: ready to take off?Jeffrey Funk
 
Where's Jarvis? The future of Voice Recognition and Natural Language User Int...
Where's Jarvis? The future of Voice Recognition and Natural Language User Int...Where's Jarvis? The future of Voice Recognition and Natural Language User Int...
Where's Jarvis? The future of Voice Recognition and Natural Language User Int...UXPA International
 
Kaldi-voice: Your personal speech recognition server using open source code
Kaldi-voice: Your personal speech recognition server using open source codeKaldi-voice: Your personal speech recognition server using open source code
Kaldi-voice: Your personal speech recognition server using open source codeXavier Anguera
 
Doubly fed-induction-generator
Doubly fed-induction-generatorDoubly fed-induction-generator
Doubly fed-induction-generatorHarshad Karmarkar
 
Voice input and speech recognition system in tourism/social media
Voice input and speech recognition system in tourism/social mediaVoice input and speech recognition system in tourism/social media
Voice input and speech recognition system in tourism/social mediacidroypaes
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition TechnologySeminar Links
 

Viewers also liked (20)

Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition system
 
Speech recognition
Speech recognitionSpeech recognition
Speech recognition
 
Principal characteristics of speech
Principal characteristics of speechPrincipal characteristics of speech
Principal characteristics of speech
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech Recognition
 
NFC Patent Analysis_Report-II
NFC Patent Analysis_Report-IINFC Patent Analysis_Report-II
NFC Patent Analysis_Report-II
 
Voice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesVoice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devices
 
Thin Film Batteries
Thin Film BatteriesThin Film Batteries
Thin Film Batteries
 
Noise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech RecognitionNoise Adaptive Training for Robust Automatic Speech Recognition
Noise Adaptive Training for Robust Automatic Speech Recognition
 
Speech recognition system
Speech recognition systemSpeech recognition system
Speech recognition system
 
Advances in Voice & Speech Recognition
Advances in Voice & Speech RecognitionAdvances in Voice & Speech Recognition
Advances in Voice & Speech Recognition
 
PatSeer Patent Database Overview
PatSeer Patent Database OverviewPatSeer Patent Database Overview
PatSeer Patent Database Overview
 
A study of EMG based Speech Recognition
A study of EMG  based Speech Recognition A study of EMG  based Speech Recognition
A study of EMG based Speech Recognition
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
Developing with Speech and Voice Recognition in Mobile Apps
Developing with Speech and Voice Recognition in Mobile AppsDeveloping with Speech and Voice Recognition in Mobile Apps
Developing with Speech and Voice Recognition in Mobile Apps
 
Speech recognition: ready to take off?
Speech recognition: ready to take off?Speech recognition: ready to take off?
Speech recognition: ready to take off?
 
Where's Jarvis? The future of Voice Recognition and Natural Language User Int...
Where's Jarvis? The future of Voice Recognition and Natural Language User Int...Where's Jarvis? The future of Voice Recognition and Natural Language User Int...
Where's Jarvis? The future of Voice Recognition and Natural Language User Int...
 
Kaldi-voice: Your personal speech recognition server using open source code
Kaldi-voice: Your personal speech recognition server using open source codeKaldi-voice: Your personal speech recognition server using open source code
Kaldi-voice: Your personal speech recognition server using open source code
 
Doubly fed-induction-generator
Doubly fed-induction-generatorDoubly fed-induction-generator
Doubly fed-induction-generator
 
Voice input and speech recognition system in tourism/social media
Voice input and speech recognition system in tourism/social mediaVoice input and speech recognition system in tourism/social media
Voice input and speech recognition system in tourism/social media
 
Speech Recognition Technology
Speech Recognition TechnologySpeech Recognition Technology
Speech Recognition Technology
 

Similar to "Automatic speech recognition for mobile applications in Yandex" — Fran Campillo, Яндекс

Vladyslav Hamolia "How to choose ASR (automatic speech recognition) system"
Vladyslav Hamolia "How to choose ASR (automatic speech recognition) system"Vladyslav Hamolia "How to choose ASR (automatic speech recognition) system"
Vladyslav Hamolia "How to choose ASR (automatic speech recognition) system"Lviv Startup Club
 
Understanding Names with Neural Networks - May 2020
Understanding Names with Neural Networks - May 2020Understanding Names with Neural Networks - May 2020
Understanding Names with Neural Networks - May 2020Basis Technology
 
lec03-LanguageModels_230214_161016.pdf
lec03-LanguageModels_230214_161016.pdflec03-LanguageModels_230214_161016.pdf
lec03-LanguageModels_230214_161016.pdfykyog
 
Speech To Sign Language Interpreter System
Speech To Sign Language Interpreter SystemSpeech To Sign Language Interpreter System
Speech To Sign Language Interpreter Systemkkkseld
 
Contemporary Models of Natural Language Processing
Contemporary Models of Natural Language ProcessingContemporary Models of Natural Language Processing
Contemporary Models of Natural Language ProcessingKaterina Vylomova
 
Recent Advances in Natural Language Processing
Recent Advances in Natural Language ProcessingRecent Advances in Natural Language Processing
Recent Advances in Natural Language ProcessingSeth Grimes
 
Learning to Generate Pseudo-code from Source Code using Statistical Machine T...
Learning to Generate Pseudo-code from Source Code using Statistical Machine T...Learning to Generate Pseudo-code from Source Code using Statistical Machine T...
Learning to Generate Pseudo-code from Source Code using Statistical Machine T...Yusuke Oda
 
Conditional generative model for audio
Conditional generative model for audioConditional generative model for audio
Conditional generative model for audioKeunwoo Choi
 
Natural language processing (Python)
Natural language processing (Python)Natural language processing (Python)
Natural language processing (Python)Sumit Raj
 
Merghani-SACNAS Poster
Merghani-SACNAS PosterMerghani-SACNAS Poster
Merghani-SACNAS PosterTaha Merghani
 
Big Data Spain 2017 - Deriving Actionable Insights from High Volume Media St...
Big Data Spain 2017  - Deriving Actionable Insights from High Volume Media St...Big Data Spain 2017  - Deriving Actionable Insights from High Volume Media St...
Big Data Spain 2017 - Deriving Actionable Insights from High Volume Media St...Apache OpenNLP
 
Deep Learning for Machine Translation: a paradigm shift - Alberto Massidda - ...
Deep Learning for Machine Translation: a paradigm shift - Alberto Massidda - ...Deep Learning for Machine Translation: a paradigm shift - Alberto Massidda - ...
Deep Learning for Machine Translation: a paradigm shift - Alberto Massidda - ...Codemotion
 
Can Deep Learning Techniques Improve Entity Linking?
Can Deep Learning Techniques Improve Entity Linking?Can Deep Learning Techniques Improve Entity Linking?
Can Deep Learning Techniques Improve Entity Linking?Julien PLU
 

Similar to "Automatic speech recognition for mobile applications in Yandex" — Fran Campillo, Яндекс (20)

Vladyslav Hamolia "How to choose ASR (automatic speech recognition) system"
Vladyslav Hamolia "How to choose ASR (automatic speech recognition) system"Vladyslav Hamolia "How to choose ASR (automatic speech recognition) system"
Vladyslav Hamolia "How to choose ASR (automatic speech recognition) system"
 
Khmer ASR
Khmer ASRKhmer ASR
Khmer ASR
 
Understanding Names with Neural Networks - May 2020
Understanding Names with Neural Networks - May 2020Understanding Names with Neural Networks - May 2020
Understanding Names with Neural Networks - May 2020
 
Asr
AsrAsr
Asr
 
lec03-LanguageModels_230214_161016.pdf
lec03-LanguageModels_230214_161016.pdflec03-LanguageModels_230214_161016.pdf
lec03-LanguageModels_230214_161016.pdf
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
Build your own ASR engine
Build your own ASR engineBuild your own ASR engine
Build your own ASR engine
 
Logic Programming and ILP
Logic Programming and ILPLogic Programming and ILP
Logic Programming and ILP
 
Speech To Sign Language Interpreter System
Speech To Sign Language Interpreter SystemSpeech To Sign Language Interpreter System
Speech To Sign Language Interpreter System
 
Asr
AsrAsr
Asr
 
Contemporary Models of Natural Language Processing
Contemporary Models of Natural Language ProcessingContemporary Models of Natural Language Processing
Contemporary Models of Natural Language Processing
 
Recent Advances in Natural Language Processing
Recent Advances in Natural Language ProcessingRecent Advances in Natural Language Processing
Recent Advances in Natural Language Processing
 
Learning to Generate Pseudo-code from Source Code using Statistical Machine T...
Learning to Generate Pseudo-code from Source Code using Statistical Machine T...Learning to Generate Pseudo-code from Source Code using Statistical Machine T...
Learning to Generate Pseudo-code from Source Code using Statistical Machine T...
 
Conditional generative model for audio
Conditional generative model for audioConditional generative model for audio
Conditional generative model for audio
 
Natural language processing (Python)
Natural language processing (Python)Natural language processing (Python)
Natural language processing (Python)
 
Merghani-SACNAS Poster
Merghani-SACNAS PosterMerghani-SACNAS Poster
Merghani-SACNAS Poster
 
Big Data Spain 2017 - Deriving Actionable Insights from High Volume Media St...
Big Data Spain 2017  - Deriving Actionable Insights from High Volume Media St...Big Data Spain 2017  - Deriving Actionable Insights from High Volume Media St...
Big Data Spain 2017 - Deriving Actionable Insights from High Volume Media St...
 
Deep Learning for Machine Translation: a paradigm shift - Alberto Massidda - ...
Deep Learning for Machine Translation: a paradigm shift - Alberto Massidda - ...Deep Learning for Machine Translation: a paradigm shift - Alberto Massidda - ...
Deep Learning for Machine Translation: a paradigm shift - Alberto Massidda - ...
 
Can Deep Learning Techniques Improve Entity Linking?
Can Deep Learning Techniques Improve Entity Linking?Can Deep Learning Techniques Improve Entity Linking?
Can Deep Learning Techniques Improve Entity Linking?
 
NLP@DATEV: Setting up a domain specific language model, Dr. Jonas Rende & Tho...
NLP@DATEV: Setting up a domain specific language model, Dr. Jonas Rende & Tho...NLP@DATEV: Setting up a domain specific language model, Dr. Jonas Rende & Tho...
NLP@DATEV: Setting up a domain specific language model, Dr. Jonas Rende & Tho...
 

More from Yandex

Предсказание оттока игроков из World of Tanks
Предсказание оттока игроков из World of TanksПредсказание оттока игроков из World of Tanks
Предсказание оттока игроков из World of TanksYandex
 
Как принять/организовать работу по поисковой оптимизации сайта, Сергей Царик,...
Как принять/организовать работу по поисковой оптимизации сайта, Сергей Царик,...Как принять/организовать работу по поисковой оптимизации сайта, Сергей Царик,...
Как принять/организовать работу по поисковой оптимизации сайта, Сергей Царик,...Yandex
 
Структурированные данные, Юлия Тихоход, лекция в Школе вебмастеров Яндекса
Структурированные данные, Юлия Тихоход, лекция в Школе вебмастеров ЯндексаСтруктурированные данные, Юлия Тихоход, лекция в Школе вебмастеров Яндекса
Структурированные данные, Юлия Тихоход, лекция в Школе вебмастеров ЯндексаYandex
 
Представление сайта в поиске, Сергей Лысенко, лекция в Школе вебмастеров Яндекса
Представление сайта в поиске, Сергей Лысенко, лекция в Школе вебмастеров ЯндексаПредставление сайта в поиске, Сергей Лысенко, лекция в Школе вебмастеров Яндекса
Представление сайта в поиске, Сергей Лысенко, лекция в Школе вебмастеров ЯндексаYandex
 
Плохие методы продвижения сайта, Екатерины Гладких, лекция в Школе вебмастеро...
Плохие методы продвижения сайта, Екатерины Гладких, лекция в Школе вебмастеро...Плохие методы продвижения сайта, Екатерины Гладких, лекция в Школе вебмастеро...
Плохие методы продвижения сайта, Екатерины Гладких, лекция в Школе вебмастеро...Yandex
 
Основные принципы ранжирования, Сергей Царик и Антон Роменский, лекция в Школ...
Основные принципы ранжирования, Сергей Царик и Антон Роменский, лекция в Школ...Основные принципы ранжирования, Сергей Царик и Антон Роменский, лекция в Школ...
Основные принципы ранжирования, Сергей Царик и Антон Роменский, лекция в Школ...Yandex
 
Основные принципы индексирования сайта, Александр Смирнов, лекция в Школе веб...
Основные принципы индексирования сайта, Александр Смирнов, лекция в Школе веб...Основные принципы индексирования сайта, Александр Смирнов, лекция в Школе веб...
Основные принципы индексирования сайта, Александр Смирнов, лекция в Школе веб...Yandex
 
Мобильное приложение: как и зачем, Александр Лукин, лекция в Школе вебмастеро...
Мобильное приложение: как и зачем, Александр Лукин, лекция в Школе вебмастеро...Мобильное приложение: как и зачем, Александр Лукин, лекция в Школе вебмастеро...
Мобильное приложение: как и зачем, Александр Лукин, лекция в Школе вебмастеро...Yandex
 
Сайты на мобильных устройствах, Олег Ножичкин, лекция в Школе вебмастеров Янд...
Сайты на мобильных устройствах, Олег Ножичкин, лекция в Школе вебмастеров Янд...Сайты на мобильных устройствах, Олег Ножичкин, лекция в Школе вебмастеров Янд...
Сайты на мобильных устройствах, Олег Ножичкин, лекция в Школе вебмастеров Янд...Yandex
 
Качественная аналитика сайта, Юрий Батиевский, лекция в Школе вебмастеров Янд...
Качественная аналитика сайта, Юрий Батиевский, лекция в Школе вебмастеров Янд...Качественная аналитика сайта, Юрий Батиевский, лекция в Школе вебмастеров Янд...
Качественная аналитика сайта, Юрий Батиевский, лекция в Школе вебмастеров Янд...Yandex
 
Что можно и что нужно измерять на сайте, Петр Аброськин, лекция в Школе вебма...
Что можно и что нужно измерять на сайте, Петр Аброськин, лекция в Школе вебма...Что можно и что нужно измерять на сайте, Петр Аброськин, лекция в Школе вебма...
Что можно и что нужно измерять на сайте, Петр Аброськин, лекция в Школе вебма...Yandex
 
Как правильно поставить ТЗ на создание сайта, Алексей Бородкин, лекция в Школ...
Как правильно поставить ТЗ на создание сайта, Алексей Бородкин, лекция в Школ...Как правильно поставить ТЗ на создание сайта, Алексей Бородкин, лекция в Школ...
Как правильно поставить ТЗ на создание сайта, Алексей Бородкин, лекция в Школ...Yandex
 
Как защитить свой сайт, Пётр Волков, лекция в Школе вебмастеров
Как защитить свой сайт, Пётр Волков, лекция в Школе вебмастеровКак защитить свой сайт, Пётр Волков, лекция в Школе вебмастеров
Как защитить свой сайт, Пётр Волков, лекция в Школе вебмастеровYandex
 
Как правильно составить структуру сайта, Дмитрий Сатин, лекция в Школе вебмас...
Как правильно составить структуру сайта, Дмитрий Сатин, лекция в Школе вебмас...Как правильно составить структуру сайта, Дмитрий Сатин, лекция в Школе вебмас...
Как правильно составить структуру сайта, Дмитрий Сатин, лекция в Школе вебмас...Yandex
 
Технические особенности создания сайта, Дмитрий Васильева, лекция в Школе веб...
Технические особенности создания сайта, Дмитрий Васильева, лекция в Школе веб...Технические особенности создания сайта, Дмитрий Васильева, лекция в Школе веб...
Технические особенности создания сайта, Дмитрий Васильева, лекция в Школе веб...Yandex
 
Конструкторы для отдельных элементов сайта, Елена Першина, лекция в Школе веб...
Конструкторы для отдельных элементов сайта, Елена Першина, лекция в Школе веб...Конструкторы для отдельных элементов сайта, Елена Першина, лекция в Школе веб...
Конструкторы для отдельных элементов сайта, Елена Першина, лекция в Школе веб...Yandex
 
Контент для интернет-магазинов, Катерина Ерошина, лекция в Школе вебмастеров ...
Контент для интернет-магазинов, Катерина Ерошина, лекция в Школе вебмастеров ...Контент для интернет-магазинов, Катерина Ерошина, лекция в Школе вебмастеров ...
Контент для интернет-магазинов, Катерина Ерошина, лекция в Школе вебмастеров ...Yandex
 
Как написать хороший текст для сайта, Катерина Ерошина, лекция в Школе вебмас...
Как написать хороший текст для сайта, Катерина Ерошина, лекция в Школе вебмас...Как написать хороший текст для сайта, Катерина Ерошина, лекция в Школе вебмас...
Как написать хороший текст для сайта, Катерина Ерошина, лекция в Школе вебмас...Yandex
 
Usability и дизайн - как не помешать пользователю, Алексей Иванов, лекция в Ш...
Usability и дизайн - как не помешать пользователю, Алексей Иванов, лекция в Ш...Usability и дизайн - как не помешать пользователю, Алексей Иванов, лекция в Ш...
Usability и дизайн - как не помешать пользователю, Алексей Иванов, лекция в Ш...Yandex
 
Cайт. Зачем он и каким должен быть, Алексей Иванов, лекция в Школе вебмастеро...
Cайт. Зачем он и каким должен быть, Алексей Иванов, лекция в Школе вебмастеро...Cайт. Зачем он и каким должен быть, Алексей Иванов, лекция в Школе вебмастеро...
Cайт. Зачем он и каким должен быть, Алексей Иванов, лекция в Школе вебмастеро...Yandex
 

More from Yandex (20)

Предсказание оттока игроков из World of Tanks
Предсказание оттока игроков из World of TanksПредсказание оттока игроков из World of Tanks
Предсказание оттока игроков из World of Tanks
 
Как принять/организовать работу по поисковой оптимизации сайта, Сергей Царик,...
Как принять/организовать работу по поисковой оптимизации сайта, Сергей Царик,...Как принять/организовать работу по поисковой оптимизации сайта, Сергей Царик,...
Как принять/организовать работу по поисковой оптимизации сайта, Сергей Царик,...
 
Структурированные данные, Юлия Тихоход, лекция в Школе вебмастеров Яндекса
Структурированные данные, Юлия Тихоход, лекция в Школе вебмастеров ЯндексаСтруктурированные данные, Юлия Тихоход, лекция в Школе вебмастеров Яндекса
Структурированные данные, Юлия Тихоход, лекция в Школе вебмастеров Яндекса
 
Представление сайта в поиске, Сергей Лысенко, лекция в Школе вебмастеров Яндекса
Представление сайта в поиске, Сергей Лысенко, лекция в Школе вебмастеров ЯндексаПредставление сайта в поиске, Сергей Лысенко, лекция в Школе вебмастеров Яндекса
Представление сайта в поиске, Сергей Лысенко, лекция в Школе вебмастеров Яндекса
 
Плохие методы продвижения сайта, Екатерины Гладких, лекция в Школе вебмастеро...
Плохие методы продвижения сайта, Екатерины Гладких, лекция в Школе вебмастеро...Плохие методы продвижения сайта, Екатерины Гладких, лекция в Школе вебмастеро...
Плохие методы продвижения сайта, Екатерины Гладких, лекция в Школе вебмастеро...
 
Основные принципы ранжирования, Сергей Царик и Антон Роменский, лекция в Школ...
Основные принципы ранжирования, Сергей Царик и Антон Роменский, лекция в Школ...Основные принципы ранжирования, Сергей Царик и Антон Роменский, лекция в Школ...
Основные принципы ранжирования, Сергей Царик и Антон Роменский, лекция в Школ...
 
Основные принципы индексирования сайта, Александр Смирнов, лекция в Школе веб...
Основные принципы индексирования сайта, Александр Смирнов, лекция в Школе веб...Основные принципы индексирования сайта, Александр Смирнов, лекция в Школе веб...
Основные принципы индексирования сайта, Александр Смирнов, лекция в Школе веб...
 
Мобильное приложение: как и зачем, Александр Лукин, лекция в Школе вебмастеро...
Мобильное приложение: как и зачем, Александр Лукин, лекция в Школе вебмастеро...Мобильное приложение: как и зачем, Александр Лукин, лекция в Школе вебмастеро...
Мобильное приложение: как и зачем, Александр Лукин, лекция в Школе вебмастеро...
 
Сайты на мобильных устройствах, Олег Ножичкин, лекция в Школе вебмастеров Янд...
Сайты на мобильных устройствах, Олег Ножичкин, лекция в Школе вебмастеров Янд...Сайты на мобильных устройствах, Олег Ножичкин, лекция в Школе вебмастеров Янд...
Сайты на мобильных устройствах, Олег Ножичкин, лекция в Школе вебмастеров Янд...
 
Качественная аналитика сайта, Юрий Батиевский, лекция в Школе вебмастеров Янд...
Качественная аналитика сайта, Юрий Батиевский, лекция в Школе вебмастеров Янд...Качественная аналитика сайта, Юрий Батиевский, лекция в Школе вебмастеров Янд...
Качественная аналитика сайта, Юрий Батиевский, лекция в Школе вебмастеров Янд...
 
Что можно и что нужно измерять на сайте, Петр Аброськин, лекция в Школе вебма...
Что можно и что нужно измерять на сайте, Петр Аброськин, лекция в Школе вебма...Что можно и что нужно измерять на сайте, Петр Аброськин, лекция в Школе вебма...
Что можно и что нужно измерять на сайте, Петр Аброськин, лекция в Школе вебма...
 
Как правильно поставить ТЗ на создание сайта, Алексей Бородкин, лекция в Школ...
Как правильно поставить ТЗ на создание сайта, Алексей Бородкин, лекция в Школ...Как правильно поставить ТЗ на создание сайта, Алексей Бородкин, лекция в Школ...
Как правильно поставить ТЗ на создание сайта, Алексей Бородкин, лекция в Школ...
 
Как защитить свой сайт, Пётр Волков, лекция в Школе вебмастеров
Как защитить свой сайт, Пётр Волков, лекция в Школе вебмастеровКак защитить свой сайт, Пётр Волков, лекция в Школе вебмастеров
Как защитить свой сайт, Пётр Волков, лекция в Школе вебмастеров
 
Как правильно составить структуру сайта, Дмитрий Сатин, лекция в Школе вебмас...
Как правильно составить структуру сайта, Дмитрий Сатин, лекция в Школе вебмас...Как правильно составить структуру сайта, Дмитрий Сатин, лекция в Школе вебмас...
Как правильно составить структуру сайта, Дмитрий Сатин, лекция в Школе вебмас...
 
Технические особенности создания сайта, Дмитрий Васильева, лекция в Школе веб...
Технические особенности создания сайта, Дмитрий Васильева, лекция в Школе веб...Технические особенности создания сайта, Дмитрий Васильева, лекция в Школе веб...
Технические особенности создания сайта, Дмитрий Васильева, лекция в Школе веб...
 
Конструкторы для отдельных элементов сайта, Елена Першина, лекция в Школе веб...
Конструкторы для отдельных элементов сайта, Елена Першина, лекция в Школе веб...Конструкторы для отдельных элементов сайта, Елена Першина, лекция в Школе веб...
Конструкторы для отдельных элементов сайта, Елена Першина, лекция в Школе веб...
 
Контент для интернет-магазинов, Катерина Ерошина, лекция в Школе вебмастеров ...
Контент для интернет-магазинов, Катерина Ерошина, лекция в Школе вебмастеров ...Контент для интернет-магазинов, Катерина Ерошина, лекция в Школе вебмастеров ...
Контент для интернет-магазинов, Катерина Ерошина, лекция в Школе вебмастеров ...
 
Как написать хороший текст для сайта, Катерина Ерошина, лекция в Школе вебмас...
Как написать хороший текст для сайта, Катерина Ерошина, лекция в Школе вебмас...Как написать хороший текст для сайта, Катерина Ерошина, лекция в Школе вебмас...
Как написать хороший текст для сайта, Катерина Ерошина, лекция в Школе вебмас...
 
Usability и дизайн - как не помешать пользователю, Алексей Иванов, лекция в Ш...
Usability и дизайн - как не помешать пользователю, Алексей Иванов, лекция в Ш...Usability и дизайн - как не помешать пользователю, Алексей Иванов, лекция в Ш...
Usability и дизайн - как не помешать пользователю, Алексей Иванов, лекция в Ш...
 
Cайт. Зачем он и каким должен быть, Алексей Иванов, лекция в Школе вебмастеро...
Cайт. Зачем он и каким должен быть, Алексей Иванов, лекция в Школе вебмастеро...Cайт. Зачем он и каким должен быть, Алексей Иванов, лекция в Школе вебмастеро...
Cайт. Зачем он и каким должен быть, Алексей Иванов, лекция в Школе вебмастеро...
 

Recently uploaded

Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfOverkill Security
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdflior mazor
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024The Digital Insurer
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Jeffrey Haguewood
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 

Recently uploaded (20)

Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Ransomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdfRansomware_Q4_2023. The report. [EN].pdf
Ransomware_Q4_2023. The report. [EN].pdf
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 

"Automatic speech recognition for mobile applications in Yandex" — Fran Campillo, Яндекс

  • 1. 1
  • 2. 2 Automatic speech recognition for mobile applications in Yandex Automatic speech recognition for mobile applications in Yandex Fran CampilloFran Campillo
  • 3. 3 OutlineOutline ● Motivation. ● Road map. ● Automatic speech recognition. ● Data collection. ● Experiments. ● Results. ● Motivation. ● Road map. ● Automatic speech recognition. ● Data collection. ● Experiments. ● Results.
  • 7. 7 Road mapRoad map ● Sep-2011: study of open source tools and data collection. – HTK, Sphinx, Rasr, Kaldi,... – Service provided by 3rd party. ● Jan-2012: development of in-house technology. ● Jan-2013: launching of own services. ● Sep-2011: study of open source tools and data collection. – HTK, Sphinx, Rasr, Kaldi,... – Service provided by 3rd party. ● Jan-2012: development of in-house technology. ● Jan-2013: launching of own services.
  • 9. 9 ASR: complexityASR: complexity Style Planned Spontaneous Audio quality CD Telephone Vocabulary size Hundreds Hundreds of thousands Number of speakers One Many Recognition rate WorseWorseBetterBetter Complexity BiggerBiggerSmallerSmaller
  • 10. 10 Word pronunciationsWord pronunciations ● ASR: sounds => words. ● How is a word pronounced? – Line => /'laɪn/. – Linear => /'lɪnɪɘʳ/ ● Need a mapping from writing to phonemes: G2P.
  • 11. 11 Word pronunciations: dictionaryWord pronunciations: dictionary а a аб a tc p абад a dc b a tc t абаза a dc b a z a абакан a dc b a tc k ax n абакана a dc b a tc k a n a абакане a dc b a tc k a nj e абаканская a dc b a tc k a n s tc k ax j a абаканский a dc b a tc k a n s tc kj I j абакумова a dc b a tc k u m ax v a абанский a dc b a n s tc kj I j абганеровская a dc b dc g ax nj I r ax f s tc k ax j a абдулино a dc b dc dK& u lj i n a абельмановская a dc bj e lj m ax n ax f s tc k ax j a абзаково a dc b z a tc k o v a абзелиловский a dc b zj i lj i l ax f s tc kj I j
  • 13. 13 ASR: the problemASR: the problem ● We have a sequence of observations: – O = {o1 , o2 , …, oT } – oi is a feature vector representing a speech frame. ● Goal: finding the likeliest sequence of words wi for O: argmax i P(wi/O)argmax i P(wi/O)
  • 14. 14 ASR: the problem (II)ASR: the problem (II) ● We cannot compute directly P(wi /O). ● Bayes: P(wi/O)= P(O/wi)P(wi) P(O) argmax i P(wi/O)=argmax i {P(O/wi)P(wi)} Acoustic model Language model
  • 15. 15 Language modelLanguage model ● Probability of sequences of words: – “We will rock you” => P1 . – “Will will rock you” => P2 . ● Trained on large corpora. ● The closer to the application domain, the better.
  • 16. 16 Acoustic model: Hidden Markov ModelsAcoustic model: Hidden Markov Models ● HMM of first order: sequence of states that depend only on the state before, and are associated to events we can observe ● Typical layout for ASR: Q1 Q2 Q3 a11 a12 a22 a23 a33 b1 (o) b2 (o) b3 (o) ● aij : transition probabilities. ● bj (o): probability of observation o in state j.
  • 17. 17 Acoustic model: HMM and speechAcoustic model: HMM and speech ● Each state models a part of the phoneme: – 1st : beginning of the phoneme. – 2nd : stationary part. – 3rd : end of the phoneme. ● aij : duration of each part. ● bj (o): probability of producing a vector of features o in state j.
  • 18. 18 Modeling probability of observationModeling probability of observation ● Gaussian mixtures: – cjm = weight of mth Gaussian of state j. – μjm => average (vector) of mth Gaussian of state j. – ∑jm => covariance matrix of mth Gaussian of state j. ● Neural networks. bj(x)=∑m c jm N (x ,μ jm,Σjm)
  • 19. 19 Waveform, phonemes, frames, and statesWaveform, phonemes, frames, and states /o//o/ to1 o2 o3 o4 o5 o6 o7 o8 o9 o10 /o//o/ Q1 Q2 Q3 Q1 => o1 , o2 Q2 => o3 , o4 , o5 , o6 , o7 Q3 = > o8 , o9 , o10 μ3m, ∑3m, c3m μ2m, ∑2m, c2m μ1m, ∑1m, c1m
  • 20. 20 Block diagram for trainingBlock diagram for training Initialization Baum-Welch HMM Parameters update Convergence Prototype HMM No Trained models Yes Initial μ0m, ∑j0m, com for the GMMs Alignments of the training sentences (observations to states) New estimations for μjm , ∑jm , cjm Training sentences
  • 21. 21 DecodingDecoding ● Lexicon: words that can be recognized. ● Decoder: dynamic programming, with the constraints imposed by the lexicon, the acoustic models, and the language model. Parametrize Lexicon Acoustic models Language model DecoderSpeech signal Words
  • 22. 22 Our decoderOur decoder ● Based on Weighted Finite State transducers. ● The lexicon, the language model, and the acoustic model are composed into a single structure. –Same information, but more efficient. Lexicon Acoustic models Language model HCLG
  • 23. 23 Composition of WFST: exampleComposition of WFST: example Lexicon Language model 0 1 B:Bob 2 ah: 3 b: 4 l: likes 5 ay: k: 6 s:
  • 25. 25 Data collectionData collection ● Speech samples taken from the field. ● Manual transcriptions: – Speaker features: gender, native,... – Anomalies in the pronunciation. – Noises in the recording.
  • 26. 26 Manual transcriptionsManual transcriptions ● 600k recordings. ● Uncompressed format: 8KHz and 16KHz. ● 286020 different speakers. Percentage (%) Native 87.7 Male 83.3 Female 8.5 Child 8.2
  • 27. 27 Manual transcriptionsManual transcriptions ● Percentage of records without anomalies: 7.4% Anomalies Percentage (%) side_speech 14.4 speech-in-noise 71.5 Indistinguishable 3.7 mouth_noise 3.6 breath_noise 6.3 Irregular pronunciations 5.3 Hesitations 0.5 Fragments 5.5 Transient noise 14.0 Foreign words 0.1
  • 28. 28 Manual transcriptions: examplesManual transcriptions: examples ● марциальные воды male, native ● *трёx#пруд#ньій* male, native, speech-in-noise ● [side_speech] чкалова male, native, speech-in-noise, bad-audio тр
  • 31. 31 Grapheme-2-phonemeGrapheme-2-phoneme ● Sequitur: – Based on joined sequence models. – Accuracy => 2.09% phoneme error rate. ● Phonetisaurus: – WFST. – Accuracy => 1.04% phoneme error rate. ● Special treatment for Latin words: – G2P trained on transliterated version of Russian pronunciation (for example: whatsapp => уотсап).
  • 33. 33 Experiments: acoustic model vs. language modelExperiments: acoustic model vs. language model
  • 34. 34 Experiments: number of GaussiansExperiments: number of Gaussians
  • 37. 37 ● Results relative to our WER in each experiment (in red, experiments in which our system is outperformed): Results: relative word error rateResults: relative word error rate Maps Navigation General search Yandex-GMM 1 1 1 3rd Party 44.6% 31.8% 37.3% Competitor 1.9% -9.7% -23.4% General search Yandex-DNN 1 Competitor 6.6%
  • 39. 39 Fran CampilloFran Campillo Senior Software EngineerSenior Software Engineer Yandex Speech GroupYandex Speech Group francampillo@yandex-team.rufrancampillo@yandex-team.ru PhDPhD