AUTOMATIC SPEECH RECOGNITION SYSTEM USING KALDI

SUBMITTED BY -
PRIYANSHU PAL DUTTA (CSM21015)
SUBHROJIT SAIKIA (CSM21003)

Automatic Speech Recognition (ASR) is the process of converting an unknown
speech waveform into the corresponding orthographic transcription.

1. Download Kaldi from http://github.com/kaldi-asr/kaldi.
2. Install all the dependencies that Kaldi needs to work properly.

Precondition
We have collected audio data that contains only spoken digits from 9 different
speakers. Each audio file is an entire spoken sentence in Assamese (e.g. 'এক', 'দুই', 'তিনি', 'চাৰি', etc.).
Purpose
We have to divide our data into train and test sets, set up an ASR system, train it, test it and get
some decoding results.
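
One way to divide the data is by speaker, so that no speaker appears in both sets. The sketch below is only an illustration under assumptions: the speaker IDs (spk01 to spk09), the helper name split_speakers and the choice of 2 test speakers are hypothetical and not taken from our actual setup.

```python
import random


def split_speakers(speakers, n_test=1, seed=0):
    """Return (train, test) speaker lists with no speaker in both sets."""
    if not 0 < n_test < len(speakers):
        raise ValueError("n_test must leave at least one speaker in each set")
    shuffled = sorted(speakers)
    # A fixed seed keeps the split reproducible between runs.
    random.Random(seed).shuffle(shuffled)
    test = sorted(shuffled[:n_test])
    train = sorted(shuffled[n_test:])
    return train, test


# Hypothetical IDs for the 9 speakers (assumption, not the real names).
speakers = [f"spk{i:02d}" for i in range(1, 10)]
train, test = split_speakers(speakers, n_test=2)
print("train speakers:", train)
print("test speakers:", test)
```

Splitting by speaker rather than by file means the test results reflect how the system handles voices it has not heard during training.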
First task
Now we have to create the project in the kaldi/egs/ directory. This is the place where we will put
everything related to the project.
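
A possible project skeleton can be sketched as follows. The project name "digits" and the subdirectory names are assumptions chosen for illustration; the demo writes into a temporary directory so that it does not touch a real Kaldi checkout.

```python
from pathlib import Path
import tempfile

# Hypothetical layout (assumption): audio, Kaldi data dirs, configs, scripts.
SUBDIRS = [
    "audio/train",
    "audio/test",
    "data/train",
    "data/test",
    "data/local/dict",
    "conf",
    "local",
]


def create_project(egs_dir, name="digits"):
    """Create the project directory tree under egs_dir and return its path."""
    project = Path(egs_dir) / name
    for sub in SUBDIRS:
        (project / sub).mkdir(parents=True, exist_ok=True)
    return project


# Demo root stands in for the real kaldi/egs directory.
demo_root = Path(tempfile.mkdtemp()) / "kaldi" / "egs"
project = create_project(demo_root)
created = sorted(
    p.relative_to(project).as_posix() for p in project.rglob("*") if p.is_dir()
)
print(project)
print(created)
```

To use it on a real installation, pass the path of the actual kaldi/egs directory instead of demo_root.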
Our Approach To Build the system

Data preparation

In order to decode the text we have to create two files (for some
configuration changes in the decoding and MFCC feature extraction
processes):
a.) decode.config
b.) mfcc.conf
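
As a rough sketch, the two files could be generated like this. The option names are ones that Kaldi's decoding script and MFCC extraction tool accept, but the values shown are illustrative assumptions, not our tuned settings; in particular the sample frequency must match the actual recordings. The helper name write_configs and the temporary output directory are also hypothetical.

```python
from pathlib import Path
import tempfile

# Decoding beams (assumed example values, not tuned settings).
DECODE_CONFIG = "first_beam=10.0\nbeam=13.0\nlattice_beam=6.0\n"

# MFCC options (assumed example values; 16000 Hz must match the audio).
MFCC_CONF = "--use-energy=false\n--sample-frequency=16000\n"


def write_configs(conf_dir):
    """Write decode.config and mfcc.conf into conf_dir and return their paths."""
    conf_dir = Path(conf_dir)
    conf_dir.mkdir(parents=True, exist_ok=True)
    contents = {"decode.config": DECODE_CONFIG, "mfcc.conf": MFCC_CONF}
    paths = {}
    for name, text in contents.items():
        path = conf_dir / name
        path.write_text(text, encoding="utf-8")
        paths[name] = path
    return paths


# Demo: write into a temporary conf directory instead of a real project.
paths = write_configs(Path(tempfile.mkdtemp()) / "conf")
for name, path in paths.items():
    print(name, "->", path)
```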
