Speaker identification system with voice controlled functionality


arizhamid786

SPEAKER IDENTIFICATION SYSTEM WITH VOICE-CONTROLLED FUNCTIONALITY

Introduction
Objective: To develop a speaker identification
system and to control that system using a person’s
voice.

Platform: MATLAB

Classification: Artificial Neural Networks (ANN)
implemented for pattern classification

Feature extraction: Mel-Frequency Cepstral Coefficients (MFCC)

Experimental Setup

[Block diagram: Speech -> Sound Recorder -> Wav File -> Feature Extraction -> MFCC -> Artificial Neural Network Subsystem (Train / Test)]

Signal Processing

Recording uses the built-in MATLAB function
‘wavrecord’ (wavrecord.m).

The recorded samples serve as input to
the next stage, which is Mel-frequency
cepstral analysis.
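
As a rough illustration of this hand-off from recording to analysis, the sketch below reads samples back from a WAV file and scales them to floating point. It is written in Python with the standard `wave` module, not the MATLAB `wavrecord` call the slides use, and the helper name `read_wav_mono16` and the 16-bit mono format are assumptions made for the example.

```python
import struct
import wave


def read_wav_mono16(path):
    """Return (samples, sample_rate) for a 16-bit mono PCM WAV file.

    Samples are floats scaled to the range [-1.0, 1.0).
    Hypothetical helper: the slides record audio in MATLAB instead.
    """
    with wave.open(path, "rb") as wav:
        if wav.getnchannels() != 1 or wav.getsampwidth() != 2:
            raise ValueError("expected 16-bit mono PCM")
        sample_rate = wav.getframerate()
        raw = wav.readframes(wav.getnframes())
    count = len(raw) // 2
    ints = struct.unpack("<%dh" % count, raw)  # little-endian int16 samples
    return [s / 32768.0 for s in ints], sample_rate
```

The returned list of samples and the sample rate are the two inputs a feature-extraction stage would typically need.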


Feature Extraction
Mel-Frequency Cepstral Coefficients (MFCC)

MFCCs are based on the known variation of the
human ear’s critical bandwidths with frequency.

The mel filters are spaced linearly at low
frequencies and logarithmically at high frequencies.
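
The near-linear then logarithmic behaviour can be seen with one common closed-form approximation of the mel scale, mel = 2595 * log10(1 + f / 700). The slides do not say which mapping they use, so this formula and the function names below are assumptions chosen for illustration, written in Python and not in MATLAB.

```python
import math


def hz_to_mel(freq_hz):
    """Map a frequency in Hz to mels (common 2595/700 approximation)."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


if __name__ == "__main__":
    # Equal steps in Hz shrink on the mel axis as frequency rises.
    for f in (100, 500, 1000, 2000, 4000, 8000):
        print(f, "Hz ->", round(hz_to_mel(f), 1), "mel")
```

Under this mapping, a 1000 Hz step near the bottom of the range covers far more mels than the same step near 8000 Hz, which is the compression the slide describes.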


MFCC Block Diagram

[Block diagram: Speech -> Frame Blocking -> (frame) -> Windowing -> FFT -> (spectrum) -> Mel-Frequency Warping -> (mel spectrum) -> Cepstrum -> (mel cepstrum)]

Steps of MFCC
1. Frame Blocking
2. Windowing
3. Fast Fourier Transform (FFT)
4. Mel-Frequency Warping
5. Cepstrum

Auditory Toolbox: mfcc.m
ceps = mfcc(input, samplingRate, [frameRate])
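
The five steps above can be sketched end to end in plain Python. This is a minimal illustration and not the Auditory Toolbox's mfcc.m, which may differ in filter layout and parameters. The function names, the naive DFT used in place of an FFT, the 2595/700 mel formula, and the defaults (256-sample frames, 100-sample hop, 20 filters, 13 coefficients) are all assumptions.

```python
import cmath
import math


def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def frame_blocking(signal, frame_len, hop):
    """Step 1: split the signal into overlapping frames."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, hop)]


def hamming(frame):
    """Step 2: taper the frame edges with a Hamming window."""
    n = len(frame)
    return [x * (0.54 - 0.46 * math.cos(2 * math.pi * i / (n - 1)))
            for i, x in enumerate(frame)]


def power_spectrum(frame):
    """Step 3: squared DFT magnitude for bins 0..N/2 (naive O(N^2) DFT)."""
    n = len(frame)
    spec = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * i / n)
                  for i, x in enumerate(frame))
        spec.append(abs(acc) ** 2)
    return spec


def mel_filterbank(n_filters, n_fft, sample_rate):
    """Step 4: triangular filters whose edges are equally spaced in mels."""
    top = hz_to_mel(sample_rate / 2.0)
    edges_hz = [mel_to_hz(top * i / (n_filters + 1))
                for i in range(n_filters + 2)]
    bins = [hz * n_fft / sample_rate for hz in edges_hz]  # fractional bins
    bank = []
    for m in range(1, n_filters + 1):
        lo, mid, hi = bins[m - 1], bins[m], bins[m + 1]
        row = []
        for k in range(n_fft // 2 + 1):
            if lo < k <= mid:
                row.append((k - lo) / (mid - lo))   # rising edge
            elif mid < k < hi:
                row.append((hi - k) / (hi - mid))   # falling edge
            else:
                row.append(0.0)
        bank.append(row)
    return bank


def cepstrum(log_energies, n_ceps):
    """Step 5: DCT-II of the log mel energies."""
    n = len(log_energies)
    return [sum(e * math.cos(math.pi * c * (j + 0.5) / n)
                for j, e in enumerate(log_energies))
            for c in range(n_ceps)]


def mfcc_sketch(signal, sample_rate, frame_len=256, hop=100,
                n_filters=20, n_ceps=13):
    """Return one list of n_ceps coefficients per frame."""
    bank = mel_filterbank(n_filters, frame_len, sample_rate)
    out = []
    for frame in frame_blocking(signal, frame_len, hop):
        spec = power_spectrum(hamming(frame))
        energies = [sum(w * p for w, p in zip(row, spec)) for row in bank]
        # The small offset avoids log(0) for empty or silent bands.
        out.append(cepstrum([math.log(e + 1e-10) for e in energies], n_ceps))
    return out
```

Calling `mfcc_sketch(samples, 8000)` on a list of float samples yields one coefficient vector per frame; in practice an FFT routine would replace the naive DFT for speed.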


Artificial Neural Networks (ANN)
General models of how the human brain processes
information.

Layered architecture: consists of nodes
corresponding to neurons and of weights
corresponding to the connections between neurons.

“Learning” rule: weights are adjusted on the
basis of a series of training patterns.
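
One of the simplest instances of such a learning rule is the single-node perceptron update, sketched below in Python. It is offered only to make "weights adjusted from training patterns" concrete; the slides do not state which rule their network uses, and the function names, the step activation, and the default learning rate are assumptions.

```python
def predict(weights, bias, pattern):
    """Step activation: fire (1) when the weighted sum exceeds zero."""
    total = sum(w * x for w, x in zip(weights, pattern)) + bias
    return 1 if total > 0 else 0


def train_perceptron(patterns, targets, epochs=20, rate=1.0):
    """Adjust weights after each training pattern (perceptron rule)."""
    weights = [0.0] * len(patterns[0])
    bias = 0.0
    for _ in range(epochs):
        for pattern, target in zip(patterns, targets):
            error = target - predict(weights, bias, pattern)
            # Move each weight in proportion to its input and the error.
            weights = [w + rate * error * x
                       for w, x in zip(weights, pattern)]
            bias += rate * error
    return weights, bias


if __name__ == "__main__":
    # Logical AND as a tiny, linearly separable training set.
    data = [(0, 0), (0, 1), (1, 0), (1, 1)]
    labels = [0, 0, 0, 1]
    w, b = train_perceptron(data, labels)
    print([predict(w, b, p) for p in data])
```

Each misclassified pattern nudges the weights toward the correct output, and correctly classified patterns leave them unchanged.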


Probabilistic Neural Network (PNN)
A feed-forward neural network.

Provides a general technique for solving pattern classification
problems.

Develops distribution functions that estimate the likelihood of
an input pattern belonging to each of several given categories.

Created in MATLAB using ‘newpnn’:
net = newpnn(p, t)  % p: training input vectors (one per column); t: target class vectors
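
The idea behind a probabilistic neural network can be sketched in a few lines of Python: one Gaussian unit per training vector, a per-class sum, and a pick-the-largest output. This is an illustrative sketch and not MATLAB's `newpnn` implementation; the function name `pnn_classify`, the `spread` default, and the toy data are assumptions.

```python
import math


def pnn_classify(train_vectors, train_labels, x, spread=1.0):
    """Return the label whose Gaussian units respond most strongly to x."""
    scores = {}
    for vec, label in zip(train_vectors, train_labels):
        dist_sq = sum((a - b) ** 2 for a, b in zip(vec, x))
        # Pattern layer: one Gaussian unit centred on each training vector.
        activation = math.exp(-dist_sq / (2.0 * spread ** 2))
        # Summation layer: accumulate the activations of each class.
        scores[label] = scores.get(label, 0.0) + activation
    # Output layer: choose the class with the largest summed activation.
    return max(scores, key=scores.get)


if __name__ == "__main__":
    # Toy 2-D "feature vectors" for two hypothetical speakers.
    vectors = [(0.0, 0.0), (0.1, 0.2), (5.0, 5.0), (5.2, 4.9)]
    labels = ["alice", "alice", "bob", "bob"]
    print(pnn_classify(vectors, labels, (0.2, 0.1)))
```

Because the sketch sums activations instead of averaging them, a class with many more training vectors gets an advantage; dividing each class score by its vector count would give a density estimate per class. A small `spread` makes each unit respond only to very close inputs, while a large one smooths across neighbours.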


Conclusion
Implementation is difficult because of variability
in the speech signal.

Possible improvement using noise
cancellation techniques: Wiener filter,
adaptive filters.


