2. Introduction
Objective To develop a speaker identification
system and control the system using a person’s
voice.
Platform Matlab
Implementation of Artificial Neural Networks
(ANN) for pattern classification
Feature extraction MFCC
2
4. Signal Processing
Built - in MATLAB function
‘wavrecord.m’
The recorded samples serve as input to
the next stage, which is the Mel –
Frequency Cepstral Analysis.
4
5. Feature Extraction
Mel – Frequency Cepstral Coefficients (MFCC)
MFCCs are based on the known variation of the
human ear’s critical bandwidths with frequency
Linear at low frequencies and logarithmic at high
frequencies
5
8. Artificial Neural Networks (ANN)
General models of how human brain processes
information.
Layered architecture Consists of nodes
corresponding to neurons and of weights
corresponding to connections between neurons
“Learning” rule Weights are adjusted on the
basis of a series of training patterns
8
9. Probabilistic Neural Network (PNN)
Feed – forward neural network
Provides a general technique to solve pattern classification
problems
Develops distribution function to estimate the likelihood of
an input pattern being within several given categories.
Created in MATLAB using ‘newpnn’
net = newpnn(p,t)
9
11. Conclusion
Implementation difficult due to variability
in speech signal
Possible improvement using noise
cancellation techniques Weiner Filter,
Adaptive Filters
11
12. References
L.Rabiner, B. H. Juang – Fundamentals of Speech
Recognition
C. P. Lim, S.C. Woo – Speech Recognition using
Neural Networks. IEEE Trans. on Acoustics,
Speech and Signal Processing - 2000.
Khalid Saeed and Mohammed Kheir Nammous –
A Speech and Speaker Identification System.
IEEE Trans. on Industrial Electronics - 2007.
12