SlideShare ist ein Scribd-Unternehmen logo
1 von 13
SPEAKER IDENTIFICATION
  SYSTEM WITH VOICE –
     CONTROLLED
    FUNCTIONALITY
Introduction
Objective  To develop a speaker identification
system and control the system using a person’s
voice.

Platform  Matlab

Implementation of Artificial Neural Networks
(ANN) for pattern classification

Feature extraction  MFCC
                                   2
Experimental Setup

         Sound Recorder       Feature Extraction

Speech       Wav File                MFCC




          Artificial Neural Network Subsystem


               Test                   Train




                                                   3
Signal Processing

Built - in MATLAB function
‘wavrecord.m’

 The recorded samples serve as input to
the next stage, which is the Mel –
Frequency Cepstral Analysis.


                               4
Feature Extraction
Mel – Frequency Cepstral Coefficients (MFCC)

 MFCCs are based on the known variation of the
human ear’s critical bandwidths with frequency

Linear at low frequencies and logarithmic at high
frequencies


                                      5
MFCC Block Diagram


Speech    Frame     Frame Windowing               FFT
         Blocking




  Mel       Cepstrum     Mel      Mel–Freq.   Spectrum
 Cepstrum
                                  Wrapping
                       Spectrum




                                              6
Steps of MFCC
1.   Frame Blocking
2.   Windowing
3.   Fast Fourier Transform (FFT )
4.   Mel–Frequency Wrapping
5.   Cepstrum


   Auditory Toolbox - mfcc.m
ceps=mfcc(input, sampling rate, [frame rate])

                                     7
Artificial Neural Networks (ANN)
General models of how human brain processes
information.

Layered architecture  Consists of nodes
corresponding to neurons and of weights
corresponding to connections between neurons

“Learning” rule  Weights are adjusted on the
basis of a series of training patterns

                                    8
Probabilistic Neural Network (PNN)
 Feed – forward neural network

 Provides a general technique to solve pattern classification
 problems

 Develops distribution function to estimate the likelihood of
 an input pattern being within several given categories.

 Created in MATLAB using ‘newpnn’
                 net = newpnn(p,t)

                                               9
Schematic Diagram




                10
Conclusion
Implementation difficult due to variability
in speech signal

Possible improvement using noise
cancellation techniques  Weiner Filter,
Adaptive Filters


                                  11
References
L.Rabiner, B. H. Juang – Fundamentals of Speech
Recognition
C. P. Lim, S.C. Woo – Speech Recognition using
Neural Networks. IEEE Trans. on Acoustics,
Speech and Signal Processing - 2000.
Khalid Saeed and Mohammed Kheir Nammous –
A Speech and Speaker Identification System.
IEEE Trans. on Industrial Electronics - 2007.

                                    12
13

Weitere ähnliche Inhalte

Was ist angesagt?

Ai based character recognition and speech synthesis
Ai based character recognition and speech  synthesisAi based character recognition and speech  synthesis
Ai based character recognition and speech synthesisAnkita Jadhao
 
Speaker recognition in android
Speaker recognition in androidSpeaker recognition in android
Speaker recognition in androidAnshuli Mittal
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentationhimanshubhatti
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition systemAlok Tiwari
 
AUTOMATIC SPEECH RECOGNITION- A SURVEY
AUTOMATIC SPEECH RECOGNITION- A SURVEYAUTOMATIC SPEECH RECOGNITION- A SURVEY
AUTOMATIC SPEECH RECOGNITION- A SURVEYIJCERT
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognitionRichie
 
speech processing and recognition basic in data mining
speech processing and recognition basic in  data miningspeech processing and recognition basic in  data mining
speech processing and recognition basic in data miningJimit Rupani
 
Speech recognition-using-wavelet-transform
Speech recognition-using-wavelet-transformSpeech recognition-using-wavelet-transform
Speech recognition-using-wavelet-transformvidhateswapnil
 
Deep Learning for Speech Recognition - Vikrant Singh Tomar
Deep Learning for Speech Recognition - Vikrant Singh TomarDeep Learning for Speech Recognition - Vikrant Singh Tomar
Deep Learning for Speech Recognition - Vikrant Singh TomarWithTheBest
 
Voice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesVoice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesHarshad Karmarkar
 
Digital speech processing lecture1
Digital speech processing lecture1Digital speech processing lecture1
Digital speech processing lecture1Samiul Parag
 

Was ist angesagt? (20)

Ai based character recognition and speech synthesis
Ai based character recognition and speech  synthesisAi based character recognition and speech  synthesis
Ai based character recognition and speech synthesis
 
Voice recognition
Voice recognitionVoice recognition
Voice recognition
 
An Introduction To Speech Recognition
An Introduction To Speech RecognitionAn Introduction To Speech Recognition
An Introduction To Speech Recognition
 
Mini Project- Audio Enhancement
Mini Project- Audio EnhancementMini Project- Audio Enhancement
Mini Project- Audio Enhancement
 
Speaker recognition in android
Speaker recognition in androidSpeaker recognition in android
Speaker recognition in android
 
Speech recognition final presentation
Speech recognition final presentationSpeech recognition final presentation
Speech recognition final presentation
 
Automatic speech recognition system
Automatic speech recognition systemAutomatic speech recognition system
Automatic speech recognition system
 
Mini Project- Audio Enhancement
Mini Project-  Audio EnhancementMini Project-  Audio Enhancement
Mini Project- Audio Enhancement
 
AUTOMATIC SPEECH RECOGNITION- A SURVEY
AUTOMATIC SPEECH RECOGNITION- A SURVEYAUTOMATIC SPEECH RECOGNITION- A SURVEY
AUTOMATIC SPEECH RECOGNITION- A SURVEY
 
Automatic speech recognition
Automatic speech recognitionAutomatic speech recognition
Automatic speech recognition
 
SPEAKER VERIFICATION
SPEAKER VERIFICATIONSPEAKER VERIFICATION
SPEAKER VERIFICATION
 
Speech Signal Analysis
Speech Signal AnalysisSpeech Signal Analysis
Speech Signal Analysis
 
Speech Signal Processing
Speech Signal ProcessingSpeech Signal Processing
Speech Signal Processing
 
Speech Recognition
Speech RecognitionSpeech Recognition
Speech Recognition
 
speech processing and recognition basic in data mining
speech processing and recognition basic in  data miningspeech processing and recognition basic in  data mining
speech processing and recognition basic in data mining
 
Speech recognition-using-wavelet-transform
Speech recognition-using-wavelet-transformSpeech recognition-using-wavelet-transform
Speech recognition-using-wavelet-transform
 
Automatic Speech Recognition
Automatic Speech RecognitionAutomatic Speech Recognition
Automatic Speech Recognition
 
Deep Learning for Speech Recognition - Vikrant Singh Tomar
Deep Learning for Speech Recognition - Vikrant Singh TomarDeep Learning for Speech Recognition - Vikrant Singh Tomar
Deep Learning for Speech Recognition - Vikrant Singh Tomar
 
Voice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devicesVoice/Speech recognition in mobile devices
Voice/Speech recognition in mobile devices
 
Digital speech processing lecture1
Digital speech processing lecture1Digital speech processing lecture1
Digital speech processing lecture1
 

Andere mochten auch

Workshop geweld tegen meisjes
Workshop geweld tegen meisjesWorkshop geweld tegen meisjes
Workshop geweld tegen meisjesnobnob
 
The PEACE Model of Investigative Interviewing
The PEACE Model of Investigative InterviewingThe PEACE Model of Investigative Interviewing
The PEACE Model of Investigative InterviewingDaren Jay
 
85 business analyst interview questions and answers
85 business analyst interview questions and answers85 business analyst interview questions and answers
85 business analyst interview questions and answersBusinessAnalyst247
 
8 Free Types of Marketing Strategies
8 Free Types of Marketing Strategies8 Free Types of Marketing Strategies
8 Free Types of Marketing StrategiesBrian Downard
 

Andere mochten auch (6)

Workshop geweld tegen meisjes
Workshop geweld tegen meisjesWorkshop geweld tegen meisjes
Workshop geweld tegen meisjes
 
The PEACE Model of Investigative Interviewing
The PEACE Model of Investigative InterviewingThe PEACE Model of Investigative Interviewing
The PEACE Model of Investigative Interviewing
 
EDI 2009- Admissibility of Electronic/Digital Evidence
EDI 2009- Admissibility of Electronic/Digital EvidenceEDI 2009- Admissibility of Electronic/Digital Evidence
EDI 2009- Admissibility of Electronic/Digital Evidence
 
Types of questions
Types of questionsTypes of questions
Types of questions
 
85 business analyst interview questions and answers
85 business analyst interview questions and answers85 business analyst interview questions and answers
85 business analyst interview questions and answers
 
8 Free Types of Marketing Strategies
8 Free Types of Marketing Strategies8 Free Types of Marketing Strategies
8 Free Types of Marketing Strategies
 

Ähnlich wie Speaker identification system with voice controlled functionality

A comparison of different support vector machine kernels for artificial speec...
A comparison of different support vector machine kernels for artificial speec...A comparison of different support vector machine kernels for artificial speec...
A comparison of different support vector machine kernels for artificial speec...TELKOMNIKA JOURNAL
 
Voice biometric recognition
Voice biometric recognitionVoice biometric recognition
Voice biometric recognitionphyuhsan
 
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueA Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueCSCJournals
 
Dynamic Audio-Visual Client Recognition modelling
Dynamic Audio-Visual Client Recognition modellingDynamic Audio-Visual Client Recognition modelling
Dynamic Audio-Visual Client Recognition modellingCSCJournals
 
Speaker identification using mel frequency
Speaker identification using mel frequency Speaker identification using mel frequency
Speaker identification using mel frequency Phan Duy
 
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITIONQUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITIONijma
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)inventionjournals
 
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITIONQUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITIONijma
 
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITIONQUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITIONijma
 
Wavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionWavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionCSCJournals
 
A Novel Method for Speaker Independent Recognition Based on Hidden Markov Model
A Novel Method for Speaker Independent Recognition Based on Hidden Markov ModelA Novel Method for Speaker Independent Recognition Based on Hidden Markov Model
A Novel Method for Speaker Independent Recognition Based on Hidden Markov ModelIDES Editor
 
Adaptive equalization
Adaptive equalizationAdaptive equalization
Adaptive equalizationKamal Bhatt
 
Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...
Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...
Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...Ahmed Ayman
 
Real Time Speaker Identification System – Design, Implementation and Validation
Real Time Speaker Identification System – Design, Implementation and ValidationReal Time Speaker Identification System – Design, Implementation and Validation
Real Time Speaker Identification System – Design, Implementation and ValidationIDES Editor
 
Performance Evaluation of Conventional and Hybrid Feature Extractions Using M...
Performance Evaluation of Conventional and Hybrid Feature Extractions Using M...Performance Evaluation of Conventional and Hybrid Feature Extractions Using M...
Performance Evaluation of Conventional and Hybrid Feature Extractions Using M...IJERA Editor
 
Looking into the Black Box - A Theoretical Insight into Deep Learning Networks
Looking into the Black Box - A Theoretical Insight into Deep Learning NetworksLooking into the Black Box - A Theoretical Insight into Deep Learning Networks
Looking into the Black Box - A Theoretical Insight into Deep Learning NetworksDinesh V
 

Ähnlich wie Speaker identification system with voice controlled functionality (20)

Ijetcas14 426
Ijetcas14 426Ijetcas14 426
Ijetcas14 426
 
Et25897899
Et25897899Et25897899
Et25897899
 
A comparison of different support vector machine kernels for artificial speec...
A comparison of different support vector machine kernels for artificial speec...A comparison of different support vector machine kernels for artificial speec...
A comparison of different support vector machine kernels for artificial speec...
 
Voice biometric recognition
Voice biometric recognitionVoice biometric recognition
Voice biometric recognition
 
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition TechniqueA Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
A Novel, Robust, Hierarchical, Text-Independent Speaker Recognition Technique
 
Dynamic Audio-Visual Client Recognition modelling
Dynamic Audio-Visual Client Recognition modellingDynamic Audio-Visual Client Recognition modelling
Dynamic Audio-Visual Client Recognition modelling
 
P141omfccu
P141omfccuP141omfccu
P141omfccu
 
Speaker identification using mel frequency
Speaker identification using mel frequency Speaker identification using mel frequency
Speaker identification using mel frequency
 
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITIONQUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
 
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITIONQUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
 
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITIONQUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
QUALITATIVE ANALYSIS OF PLP IN LSTM FOR BANGLA SPEECH RECOGNITION
 
Wavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionWavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker Recognition
 
E44082429
E44082429E44082429
E44082429
 
A Novel Method for Speaker Independent Recognition Based on Hidden Markov Model
A Novel Method for Speaker Independent Recognition Based on Hidden Markov ModelA Novel Method for Speaker Independent Recognition Based on Hidden Markov Model
A Novel Method for Speaker Independent Recognition Based on Hidden Markov Model
 
Adaptive equalization
Adaptive equalizationAdaptive equalization
Adaptive equalization
 
Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...
Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...
Joint MFCC-and-Vector Quantization based Text-Independent Speaker Recognition...
 
Real Time Speaker Identification System – Design, Implementation and Validation
Real Time Speaker Identification System – Design, Implementation and ValidationReal Time Speaker Identification System – Design, Implementation and Validation
Real Time Speaker Identification System – Design, Implementation and Validation
 
Performance Evaluation of Conventional and Hybrid Feature Extractions Using M...
Performance Evaluation of Conventional and Hybrid Feature Extractions Using M...Performance Evaluation of Conventional and Hybrid Feature Extractions Using M...
Performance Evaluation of Conventional and Hybrid Feature Extractions Using M...
 
Looking into the Black Box - A Theoretical Insight into Deep Learning Networks
Looking into the Black Box - A Theoretical Insight into Deep Learning NetworksLooking into the Black Box - A Theoretical Insight into Deep Learning Networks
Looking into the Black Box - A Theoretical Insight into Deep Learning Networks
 

Speaker identification system with voice controlled functionality

  • 1. SPEAKER IDENTIFICATION SYSTEM WITH VOICE – CONTROLLED FUNCTIONALITY
  • 2. Introduction Objective  To develop a speaker identification system and control the system using a person’s voice. Platform  Matlab Implementation of Artificial Neural Networks (ANN) for pattern classification Feature extraction  MFCC 2
  • 3. Experimental Setup Sound Recorder Feature Extraction Speech Wav File MFCC Artificial Neural Network Subsystem Test Train 3
  • 4. Signal Processing Built - in MATLAB function ‘wavrecord.m’ The recorded samples serve as input to the next stage, which is the Mel – Frequency Cepstral Analysis. 4
  • 5. Feature Extraction Mel – Frequency Cepstral Coefficients (MFCC) MFCCs are based on the known variation of the human ear’s critical bandwidths with frequency Linear at low frequencies and logarithmic at high frequencies 5
  • 6. MFCC Block Diagram Speech Frame Frame Windowing FFT Blocking Mel Cepstrum Mel Mel–Freq. Spectrum Cepstrum Wrapping Spectrum 6
  • 7. Steps of MFCC 1. Frame Blocking 2. Windowing 3. Fast Fourier Transform (FFT ) 4. Mel–Frequency Wrapping 5. Cepstrum Auditory Toolbox - mfcc.m ceps=mfcc(input, sampling rate, [frame rate]) 7
  • 8. Artificial Neural Networks (ANN) General models of how human brain processes information. Layered architecture  Consists of nodes corresponding to neurons and of weights corresponding to connections between neurons “Learning” rule  Weights are adjusted on the basis of a series of training patterns 8
  • 9. Probabilistic Neural Network (PNN) Feed – forward neural network Provides a general technique to solve pattern classification problems Develops distribution function to estimate the likelihood of an input pattern being within several given categories. Created in MATLAB using ‘newpnn’ net = newpnn(p,t) 9
  • 11. Conclusion Implementation difficult due to variability in speech signal Possible improvement using noise cancellation techniques  Weiner Filter, Adaptive Filters 11
  • 12. References L.Rabiner, B. H. Juang – Fundamentals of Speech Recognition C. P. Lim, S.C. Woo – Speech Recognition using Neural Networks. IEEE Trans. on Acoustics, Speech and Signal Processing - 2000. Khalid Saeed and Mohammed Kheir Nammous – A Speech and Speaker Identification System. IEEE Trans. on Industrial Electronics - 2007. 12
  • 13. 13