SlideShare ist ein Scribd-Unternehmen logo
1 von 5
Downloaden Sie, um offline zu lesen
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 06 | June 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 913
Pitch Detection Algorithms in Time Domain
Dr. Chavan Madhukar S1, Sutar Akshay A2
1Associate Professor & PG Co-ordinator, Dept. of Electronics and Telecommunication Engineering, P.V.P. Institute
of Technology, Budhgaon-416304, Maharashtra, India
2PG Student, Dept. of Dept. of Electronics and Telecommunication Engineering, P.V.P. Institute of Technology,
Budhgaon-416304, Maharashtra, India
---------------------------------------------------------------------***---------------------------------------------------------------------
Abstract - Speech signal can be classified into voiced,
unvoiced and silence regions. The near periodic vibration of
vocal folds is excitation for the production of voiced speech.
Pitch is an important parameter in speech processing. It plays
a major role in many speech processing applications. Speech
signal is affected by background noise and it degrades the
performance. Estimation of pitch for noisy speech signal is
important task in many applications. The paper proposes a
pitch detection algorithm based on the short-time average
magnitude difference function (AMDF) and the short-term
autocorrelation function (ACF). One defining characteristicof
speech is its pitch. Detecting this Pitch or equivalently,
fundamental frequency detection of a speech signal is
important in many speech applications. Pitch detectors are
used in vocoders, speaker identification and verification
systems and also as aids to the handicapped. Because of its
importance many solutions to detect pitch has been proposed
both in time and frequency domains. One such solutionispitch
detection is by using Autocorrelation method and Average
Magnitude Difference Function (AMDF), method which are
analyses done in the time domain. This paper gives the
implementation results of the pitch period estimated in the
time for samples of speech sounds, for different voice samples.
Key Words: Pitch, Autocorrelation function,LPF,Center-
clipping, Center peak-clipping, Energy of ClippedSignal.
1. INTRODUCTION
Speech can be classified into two general categories,
voiced and unvoiced speech. A voicedsoundisoneinwhich
the vocal cords of the speaker vibrate as the sound is made,
and unvoiced sound is one where the vocal cords do not
vibrate. Therefore in voiced sound as the vocal chords
vibrate, “Pitch” refers to the percept of the fundamentally
frequency of such vibrations or the resulting periodicity in
the speech signal.
The pitch estimation plays a very important role in
speech compression,speechcoding,speechrecognitionand
synthesis, as well as in voice translation. A good estimation
of the pitch period is crucial to improving the performance
of speech analysis and synthesis systems.
Most low rate voice coders requires accurate pitch
estimation for good reconstructed speech, and some
medium rate coders use pitch to reduce transmission rate
while preserving high quality speech. Pitch patterns are
useful in speaker recognition and synthesis.
Various pitch detection algorithms (PDAs) have been
developed in the past. Most of them have very high
accuracy for voiced pitch estimation, but the error rate
considering voicing decision is still quite high. Moreover,
the PDAs performance degrades significantly as the signal
conditions deteriorate. Pitch detection algorithms can be
classified into the following basic categories: time domain
based tracking, frequency domain based tracking or joint
time-frequency domain based tracking. This paper
discusses time domain based pitch period estimation.
2. PITCH DETECTION ALGORITHMS
Speech signal varies with time and so for extracting
features properly, we need to identify a smaller portion or
window segment of speech. Then the recorded speech signal
is downsampled and read using 'wavread' function in
MATLAB. This function reads a small window of the
downsampled signal and returns the sampled values as well
as the sampling frequency. For pitch detection, time domain
analyses are done in this work. The pitch can be determined
either from periodicity in time domain. Time domain pitch
analysis includes the Autocorrelation method and AMDF
techniques
3. TIME DOMAIN PITCH DETECTION ALGORITHM
3.1 Autocorrelation Method
The autocorrelation approach is the most widely used
time domain method for determining pitch period of a
speech signal. This method isbased on detectingthehighest
value of the autocorrelation function in the region of
interest. For given discrete signal x(n), the autocorrelation
function is generally defined as in (1)
(1)
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 06 | June 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 914
Where, N is the length of analyzed sequence and n is
index in frameof autocorrelationpointstobecomputed.For
pitch detection, if we assume that x (n) is periodic sequence
i.e. x(n)= x(n+P) forall n, it is shownthattheautocorrelation
function is also periodic with the same period, R(m)=R
(m+P). Conversely, the periodicity in the autocorrelation
function indicates periodicity in the signal. For a non-
stationary signal, such as speech, the concept of a long-time
autocorrelation measurement given by (1) is not really
meaningful. In practice, we operate with short speech
segments, consisting of finite number of samples.
(2)
Most low rate voice coders requires accurate pitch
estimation forgood reconstructedspeech,andsomemedium
rate coders use pitch to reduce transmission rate while
preserving.
3.2 Average Magnitude Difference Function
The average magnitude difference function (AMDF) is
another type of autocorrelation analysis. Instead of
correlating the input speech at various delays (where
multiplications and summations are formed at each value), a
difference signal is formed between the delayed speech and
original, and at each delay value the absolute magnitude is
taken. For the frame of N samples, the short-term difference
function AMDF is defined as in
(3)
Where x (n) are samples of analyzed speech frame, x (n+m)
are samples and N is frame length. The differencefunctionis
expected to have a stronglocal minimumifthelagmisequal
to or very close to the fundamental frequency.
PDA based on average magnitude difference function
has advantage in relatively low computational cost and
simple implementation.Unliketheautocorrelationfunction,
the AMDF calculations require no multiplications. This is a
desirable property for real-timeapplications.Foreachvalue
of delay, computationismadeoveranintegratingwindowof
N samples. The average magnitude difference function is
computed on speech segmentatlagsrunningfrom16to160
samples. The pitch period is identified as the value ofthelag
at which the minimum AMDF occurs.
4. IMPLEMENTATION RESULTS AND DISCUSSION
4.1 Autocorrelation Method
Fig.1 shows the block diagram of pitch detection using
autocorrelation method. At the beginning of processing, the
speech signal must be segmented into frames. Speech
recordings with 8 kHz sampling frequency were used for
experiments. Therefore, input speech signal must be
segmented into frames of 240 samples (30ms).
 FRAMING AND WINDOWING FOR ACF
Perform frame blocking such that a stream of audio
signals is converted to a set of frames. The time duration of
each frame is about 20~30 ms if the frame duration is too
big, the time-varying characteristics of the audio signal
cannot extracted. On the other hand, if the framedurationis
too small,cannot extractvalidacousticfeatures.Ingeneral,a
frameshould be containsseveralfundamentalperiodsofthe
given audio signals. Usually the frame size in terms of
sample points is equal to the powers of 2 such as 256, 512,
1024, etc. such that it is suitable for fast Fourier transform.
To reduce the difference between neighboring frames,
overlap between them is done. Usually the overlap is 1/2 to
2/3 of the original frame. The more overlap, the more
computation is needed.
There are several terminologies that are used often:
 Frame size: The sampling points within each frame
 Frame overlap: The sampling points of the overlap
between consecutive frames.
 Frame step (or hop size): This is equal to the frame size
minus the overlap.
 Frame rate: The number of frames per second, which is
equal to the sample frequency divided by the frame step
For instance, if a stream of audio signals with sample
frequency fs=16000, and a frameduration of 25 ms,overlap
of 15 ms, then
 Frame size = fs*25/1000 = 400 (sample points)
 Frame overlap = fs*15/1000 = 240 (sample points)
 Frame step (or hop size) = 400-240 = 160 (sample
points)
 Frame rate = fs/160 = 100 frames/sec
fs = 16,000 samples/second
Frame rate (overlap percentage) = 10 ms
Window length (Frame length) = 25 ms
=> (25 ms * 16,000 = 4000 sample/frame)
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 06 | June 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 915
Fig 1. Process of frame blocking for ACF method
Speech is a time varying signal, and some variations are
random. Usually during slow speech, the vocal tract shape
and excitation type do not change in 200 ms. But phonemes
have an average duration of 80 ms. most changes occur
more frequently than the 200 ms time interval. Signal
analysis assumes that the properties of a signal usually
change relatively slowly with time. This allows for short-
term analysis of a signal. The signal is divided into
successive segments, analysis is done on these segments,
and somedynamic parametersareextracted.Thesignals(n)
is multiplied by a fixed length analysis window w(n) to
extract a particular segment at a time. This is called
windowing. Choosing the right shape of window is very
important, because it allows different samples to be
weighted differently. The simplest analysis window is a
rectangular window of length Nw:
(4)
Fig. 2 Flowchart of pitch detection using ACF method
Fig. 3 Autocorrelation of voiced frame
After windowing and computing the autocorrelationover
the range of lags, the peaks are searched from the
autocorrelation function. The positions (index) of the peaks
are obtained. The distance between successive peaks is
measured. If these distances are within a threshold valuethen
the frame is classified as voiced. Otherwise, the section is
classified as unvoiced. Then the value of fundamental
frequency can be computed from the pitch period.
Fig. 3 showstheexperimentalresultsoftheautocorrelation
method obtained for a voiced frame. The waveform in the
figure is the autocorrelation function of the signal. The
periodicity of the signal waveform and also the uniform time
lag difference between the peaks of the autocorrelation
function explains the fact that the input speech signal is
voiced.
4.2 Average Magnitude Difference Function
Method
Procedure of processing operations for AMDF based
pitch detector is quite similar to the Autocorrelation
method.
 FRAMING AND WINDOWING FOR AMDF
Perform frame blocking such that a stream of audio
signals is converted to a set of frames. The time duration of
each frame is about 20~30 ms if the frame duration is too
big, the time-varying characteristics of the audio signal
cannot extracted. On the other hand, if the framedurationis
too small,cannot extractvalidacousticfeatures.Ingeneral,a
frameshould be containsseveralfundamentalperiodsofthe
given audio signals. Usually the frame size in terms of
sample points is equal to the powers of 2 such as 256, 512,
1024, etc. such that it is suitable for fast Fourier transform.
To reduce the difference between neighboring frames,
overlap between them is done. Usually the overlap is 1/2 to
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 06 | June 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 916
2/3 of the original frame. The more overlap, the more
computation is needed.
There are several terminologies that are used often:
 Frame size: The sampling points within each frame
 Frame overlap: The sampling points of the overlap
between consecutive frames
 Frame step (or hop size): This is equal to the frame size
minus the overlap.
 Frame rate: The number of frames per second, which is
equal to the sample frequency divided by the frame step
For instance, if a stream of audio signals with sample
frequency fs=16000, and a frame duration of 25 ms,overlap
of 15 ms, then
 Frame size = fs*25/1000 = 400 (sample points)
 Frame overlap = fs*15/1000 = 240 (sample points)
 Frame step (or hop size) = 400-240 = 160 (sample
points)
 Frame rate = fs/160 = 100 frames/sec
fs = 16,000 samples/second
Frame rate (overlap percentage) = 10 ms
Window length (Frame length) = 25 ms
=> (25 ms * 16,000 = 4000 sample/frame)
Fig 4 Process of frame blocking for AMDF method
Speech is a timevarying signal, and somevariationsare
random. Usually during slow speech, the vocal tract shape
and excitation type do not change in 200 ms. But phonemes
have an average duration of 80 ms. most changes occur
more frequently than the 200 ms time interval. Signal
analysis assumes that the properties of a signal usually
change relatively slowly with time. This allows for short-
term analysis of a signal. The signal is divided into
successive segments, analysis is done on these segments,
and somedynamic parametersareextracted.Thesignals(n)
is multiplied by a fixed length analysis window w(n) to
extract a particular segment at a time. This is called
windowing. Choosing the right shape of window is very
important, because it allows different samples to be
weighted differently. The simplest analysis window is a
rectangular window of length Nw:
(6)
Fig. 5 Flowchart of pitch detection using AMDF method
After windowing, the average magnitude difference
function is computed on the speech segment as defined in
equation 3. The pitch period is identified as the value of the
lag at which the minimum AMDF occurs. This is the pitch
period. Fig. 5 shows the flowchart of AMDF method.
Fig 6. AMDF Function of voiced frame
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 06 Issue: 06 | June 2019 www.irjet.net p-ISSN: 2395-0072
© 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 917
Fig. 6 shows the experimental results of the AMDF
method obtained for a voiced frame. The waveform in the
figure is the AMDF function of an signal. The average
magnitude difference function (AMDF) is another type of
autocorrelation analysis. Instead of correlating the input
speech at various delays (where multiplications and
summations are formed at each value), a difference signalis
formed between the delayed speech and original, and at
each delay value the absolute magnitude is taken.
5. CONCLUSIONS
This paper discusses the different pitch detection
algorithms for speech signals. PDAs based on the
autocorrelation function – Autocorrelation method and
Autocorrelation. Each of the described algorithms has their
advantages and drawbacks. The AMDF method has great
advantage in very low computational complexity. This
makes it possible to implement it in real-time applications.
REFERENCES
[1] Chandrashekar H. M1 and Pratibha K2, “Estimation
and Tracking of Pitch for Noisy Speech Signals using
EMD based Autocorrelation FunctionAlgorithm”2017
2nd IEEE International Conference On Recent Trends
in Electronics Information & Communication
Technology (RTEICT), May 19-20, 2017, India
[2] K. Inbanila1 and E. Krishnakumar2 ‘‘Enhancement of
Substitution Voices Using F1 Formant Deviation
Analysis and DTW Based Template Matching”
International Conference on Wireless
Communications, Signal Processing and Networking
(WiSPNET) 2017.
[3] L. Rabiner, M. J. Cheng, A. E. Rosenberg, and C. A.
McGonegal, "A comparative performance study of
several pitch detection algorithms," IEEETransactions
on ASSP, vol. 24, pp. 399-417, 1976.
[4] L. R.Rabiner and R. W.Schafer ,Digital Processing of
Speech Signals, Prentice-Hall, Englewood Cliffs, NJ,
1978.
[5] W. J. Hess, Pitch Determination of Speech Signals,New
York: Springer, 1993
[6] H. Boƙil, P. Pollák, "Direct Time Domain Fundamental
Frequency Estimation of Speech in Noisy Conditions".
Proc. EUSIPCO2004, Wien, Austria, vol. 1, pp. 1003-
1006, 2004.
[7] B. Kotnik, H. Höge, and Z. Kacic, “Evaluation of Pitch
Detection Algorithms in Adverse Conditions”. Proc.
3rd International Conference on Speech Prosody,
Dresden, Germany, pp. 149 -152, 2006
ACKNOWLEDGEMENT
Gratitude is the hardest emotion to express and often
one doesn’t find adequate words to convey that entire one
feels. I would like to express the deepest appreciation to my
Guide and PG Coordinator Dr. Madhukar S. Chavan who has
the attitude and the substance of a geniusthathecontinually
and convincingly conveyed a spirit of adventure in regard to
research. Without his guidance and persistent help this
dissertation would not have been possible.
I would like to thank my Head of department Dr. D. B.
Kadam for his valuable suggestions and constant
encouragement all through the project work.
I offer my humble and sincere thanks to Dr. D. V.
Ghewade Principal, P.V.P.I.T. Budhgaon for his all possible
cooperation. I express my sincere thanks to all faculty and
staff members of Electronics and Telecommunication Engg.
Department of P.V.P.I.T. Budhgaon for their kind co-
operation and encouragement.
It gives me immense pleasuretoacknowledgeandthank
many individuals who contributed in various ways for the
successful completion of this work. This project consumed
huge amount of work, research and dedication. I would like
to extend my sincere gratitude to all of them.
BIOGRAPHIES
“Dr. Chavan Madhukar S.
Associate Professor & PG Co-
ordinator.
Department of Electronics and
Telecommunication Engineering,
P.V.P. Institute of Technology,
Budhgaon.”
“Mr. Sutar Akshay A.
PG Student ”Description
Department of Electronics and
Telecommunication Engineering,
P.V.P. Institute of Technology,
Budhgaon”

Weitere Àhnliche Inhalte

Was ist angesagt?

44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognitionsunnysyed
 
Voice biometric recognition
Voice biometric recognitionVoice biometric recognition
Voice biometric recognitionphyuhsan
 
Voice morphing document
Voice morphing documentVoice morphing document
Voice morphing documenthimadrigupta
 
Lr3620092012
Lr3620092012Lr3620092012
Lr3620092012IJERA Editor
 
SPEKER RECOGNITION UNDER LIMITED DATA CODITION
SPEKER RECOGNITION UNDER LIMITED DATA CODITIONSPEKER RECOGNITION UNDER LIMITED DATA CODITION
SPEKER RECOGNITION UNDER LIMITED DATA CODITIONniranjan kumar
 
Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...
Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...
Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...IJERA Editor
 
Speaker recognition systems
Speaker recognition systemsSpeaker recognition systems
Speaker recognition systemsNamratha Dcruz
 
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...ijceronline
 
Speaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization ApproachSpeaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization Approachijsrd.com
 
Speech Signal Processing
Speech Signal ProcessingSpeech Signal Processing
Speech Signal ProcessingMurtadha Alsabbagh
 
1 AUDIO SIGNAL PROCESSING
1 AUDIO SIGNAL PROCESSING1 AUDIO SIGNAL PROCESSING
1 AUDIO SIGNAL PROCESSINGmukesh bhardwaj
 
Homomorphic speech processing
Homomorphic speech processingHomomorphic speech processing
Homomorphic speech processingsivakumar m
 
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...CSCJournals
 
Speech technology basics
Speech technology   basicsSpeech technology   basics
Speech technology basicsHemaraja Nayaka S
 
Environmental Sound detection Using MFCC technique
Environmental Sound detection Using MFCC techniqueEnvironmental Sound detection Using MFCC technique
Environmental Sound detection Using MFCC techniquePankaj Kumar
 
A REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALS
A REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALSA REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALS
A REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALSijiert bestjournal
 
Isolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural networkIsolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural networkeSAT Journals
 
Broad phoneme classification using signal based features
Broad phoneme classification using signal based featuresBroad phoneme classification using signal based features
Broad phoneme classification using signal based featuresijsc
 

Was ist angesagt? (20)

D011132635
D011132635D011132635
D011132635
 
Voice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from LaryngectomyVoice Morphing System for People Suffering from Laryngectomy
Voice Morphing System for People Suffering from Laryngectomy
 
44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition44 i9 advanced-speaker-recognition
44 i9 advanced-speaker-recognition
 
Voice biometric recognition
Voice biometric recognitionVoice biometric recognition
Voice biometric recognition
 
Voice morphing document
Voice morphing documentVoice morphing document
Voice morphing document
 
Lr3620092012
Lr3620092012Lr3620092012
Lr3620092012
 
SPEKER RECOGNITION UNDER LIMITED DATA CODITION
SPEKER RECOGNITION UNDER LIMITED DATA CODITIONSPEKER RECOGNITION UNDER LIMITED DATA CODITION
SPEKER RECOGNITION UNDER LIMITED DATA CODITION
 
Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...
Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...
Identification of Sex of the Speaker With Reference To Bodo Vowels: A Compara...
 
Speaker recognition systems
Speaker recognition systemsSpeaker recognition systems
Speaker recognition systems
 
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
Emotion Recognition based on audio signal using GFCC Extraction and BPNN Clas...
 
Speaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization ApproachSpeaker Recognition System using MFCC and Vector Quantization Approach
Speaker Recognition System using MFCC and Vector Quantization Approach
 
Speech Signal Processing
Speech Signal ProcessingSpeech Signal Processing
Speech Signal Processing
 
1 AUDIO SIGNAL PROCESSING
1 AUDIO SIGNAL PROCESSING1 AUDIO SIGNAL PROCESSING
1 AUDIO SIGNAL PROCESSING
 
Homomorphic speech processing
Homomorphic speech processingHomomorphic speech processing
Homomorphic speech processing
 
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
Teager Energy Operation on Wavelet Packet Coefficients for Enhancing Noisy Sp...
 
Speech technology basics
Speech technology   basicsSpeech technology   basics
Speech technology basics
 
Environmental Sound detection Using MFCC technique
Environmental Sound detection Using MFCC techniqueEnvironmental Sound detection Using MFCC technique
Environmental Sound detection Using MFCC technique
 
A REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALS
A REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALSA REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALS
A REVIEW OF LPC METHODS FOR ENHANCEMENT OF SPEECH SIGNALS
 
Isolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural networkIsolated words recognition using mfcc, lpc and neural network
Isolated words recognition using mfcc, lpc and neural network
 
Broad phoneme classification using signal based features
Broad phoneme classification using signal based featuresBroad phoneme classification using signal based features
Broad phoneme classification using signal based features
 

Ähnlich wie IRJET- Pitch Detection Algorithms in Time Domain

Enhanced modulation spectral subtraction for IOVT speech recognition application
Enhanced modulation spectral subtraction for IOVT speech recognition applicationEnhanced modulation spectral subtraction for IOVT speech recognition application
Enhanced modulation spectral subtraction for IOVT speech recognition applicationIRJET Journal
 
IRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech EnhancementIRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech EnhancementIRJET Journal
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderIJTET Journal
 
Enhanced modulation spectral subtraction incorporating various real time nois...
Enhanced modulation spectral subtraction incorporating various real time nois...Enhanced modulation spectral subtraction incorporating various real time nois...
Enhanced modulation spectral subtraction incorporating various real time nois...IRJET Journal
 
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...IJMER
 
Automatic Speech Recognition Incorporating Modulation Domain Enhancement
Automatic Speech Recognition Incorporating Modulation Domain EnhancementAutomatic Speech Recognition Incorporating Modulation Domain Enhancement
Automatic Speech Recognition Incorporating Modulation Domain EnhancementIRJET Journal
 
Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features  Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features ijsc
 
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...sipij
 
Multichannel Speech Signal Separation by Beam forming Techniques
Multichannel Speech Signal Separation by Beam forming TechniquesMultichannel Speech Signal Separation by Beam forming Techniques
Multichannel Speech Signal Separation by Beam forming TechniquesIRJET Journal
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...ijceronline
 
Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...eSAT Journals
 
Speaker recognition on matlab
Speaker recognition on matlabSpeaker recognition on matlab
Speaker recognition on matlabArcanjo Salazaku
 
Analysis of Speech Enhancement Incorporating Speech Recognition
Analysis of Speech Enhancement Incorporating Speech RecognitionAnalysis of Speech Enhancement Incorporating Speech Recognition
Analysis of Speech Enhancement Incorporating Speech RecognitionIRJET Journal
 
Speech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise EnvironmentsSpeech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise Environmentssipij
 
IRJET - Essential Features Extraction from Aaroh and Avroh of Indian Clas...
IRJET -  	  Essential Features Extraction from Aaroh and Avroh of Indian Clas...IRJET -  	  Essential Features Extraction from Aaroh and Avroh of Indian Clas...
IRJET - Essential Features Extraction from Aaroh and Avroh of Indian Clas...IRJET Journal
 

Ähnlich wie IRJET- Pitch Detection Algorithms in Time Domain (20)

Enhanced modulation spectral subtraction for IOVT speech recognition application
Enhanced modulation spectral subtraction for IOVT speech recognition applicationEnhanced modulation spectral subtraction for IOVT speech recognition application
Enhanced modulation spectral subtraction for IOVT speech recognition application
 
IRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech EnhancementIRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
IRJET- Survey on Efficient Signal Processing Techniques for Speech Enhancement
 
H0814247
H0814247H0814247
H0814247
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using Vocoder
 
Enhanced modulation spectral subtraction incorporating various real time nois...
Enhanced modulation spectral subtraction incorporating various real time nois...Enhanced modulation spectral subtraction incorporating various real time nois...
Enhanced modulation spectral subtraction incorporating various real time nois...
 
40120140504002
4012014050400240120140504002
40120140504002
 
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
Development of Algorithm for Voice Operated Switch for Digital Audio Control ...
 
Automatic Speech Recognition Incorporating Modulation Domain Enhancement
Automatic Speech Recognition Incorporating Modulation Domain EnhancementAutomatic Speech Recognition Incorporating Modulation Domain Enhancement
Automatic Speech Recognition Incorporating Modulation Domain Enhancement
 
Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features  Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features
 
H010234144
H010234144H010234144
H010234144
 
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
ROBUST FEATURE EXTRACTION USING AUTOCORRELATION DOMAIN FOR NOISY SPEECH RECOG...
 
Multichannel Speech Signal Separation by Beam forming Techniques
Multichannel Speech Signal Separation by Beam forming TechniquesMultichannel Speech Signal Separation by Beam forming Techniques
Multichannel Speech Signal Separation by Beam forming Techniques
 
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...IJCER (www.ijceronline.com) International Journal of computational Engineerin...
IJCER (www.ijceronline.com) International Journal of computational Engineerin...
 
Pitch and time scale modifications
Pitch and time scale modificationsPitch and time scale modifications
Pitch and time scale modifications
 
Ijecet 06 10_006
Ijecet 06 10_006Ijecet 06 10_006
Ijecet 06 10_006
 
Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...Design and implementation of different audio restoration techniques for audio...
Design and implementation of different audio restoration techniques for audio...
 
Speaker recognition on matlab
Speaker recognition on matlabSpeaker recognition on matlab
Speaker recognition on matlab
 
Analysis of Speech Enhancement Incorporating Speech Recognition
Analysis of Speech Enhancement Incorporating Speech RecognitionAnalysis of Speech Enhancement Incorporating Speech Recognition
Analysis of Speech Enhancement Incorporating Speech Recognition
 
Speech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise EnvironmentsSpeech Enhancement for Nonstationary Noise Environments
Speech Enhancement for Nonstationary Noise Environments
 
IRJET - Essential Features Extraction from Aaroh and Avroh of Indian Clas...
IRJET -  	  Essential Features Extraction from Aaroh and Avroh of Indian Clas...IRJET -  	  Essential Features Extraction from Aaroh and Avroh of Indian Clas...
IRJET - Essential Features Extraction from Aaroh and Avroh of Indian Clas...
 

Mehr von IRJET Journal

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...IRJET Journal
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTUREIRJET Journal
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...IRJET Journal
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsIRJET Journal
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...IRJET Journal
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...IRJET Journal
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...IRJET Journal
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...IRJET Journal
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASIRJET Journal
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...IRJET Journal
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProIRJET Journal
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...IRJET Journal
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemIRJET Journal
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesIRJET Journal
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web applicationIRJET Journal
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...IRJET Journal
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.IRJET Journal
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...IRJET Journal
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignIRJET Journal
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...IRJET Journal
 

Mehr von IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

KĂŒrzlich hochgeladen

Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleAlluxio, Inc.
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEroselinkalist12
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfAsst.prof M.Gokilavani
 
8251 universal synchronous asynchronous receiver transmitter
8251 universal synchronous asynchronous receiver transmitter8251 universal synchronous asynchronous receiver transmitter
8251 universal synchronous asynchronous receiver transmitterShivangiSharma879191
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvLewisJB
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 
An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...Chandu841456
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .Satyam Kumar
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxwendy cai
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)Dr SOUNDIRARAJ N
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfme23b1001
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxKartikeyaDwivedi3
 
computer application and construction management
computer application and construction managementcomputer application and construction management
computer application and construction managementMariconPadriquez1
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 

KĂŒrzlich hochgeladen (20)

Correctly Loading Incremental Data at Scale
Correctly Loading Incremental Data at ScaleCorrectly Loading Incremental Data at Scale
Correctly Loading Incremental Data at Scale
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdfCCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
CCS355 Neural Networks & Deep Learning Unit 1 PDF notes with Question bank .pdf
 
8251 universal synchronous asynchronous receiver transmitter
8251 universal synchronous asynchronous receiver transmitter8251 universal synchronous asynchronous receiver transmitter
8251 universal synchronous asynchronous receiver transmitter
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvv
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 
An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .
 
What are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptxWhat are the advantages and disadvantages of membrane structures.pptx
What are the advantages and disadvantages of membrane structures.pptx
 
young call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Serviceyoung call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Service
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdf
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Serviceyoung call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
young call girls in Rajiv Chowk🔝 9953056974 🔝 Delhi escort Service
 
Concrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptxConcrete Mix Design - IS 10262-2019 - .pptx
Concrete Mix Design - IS 10262-2019 - .pptx
 
computer application and construction management
computer application and construction managementcomputer application and construction management
computer application and construction management
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 

IRJET- Pitch Detection Algorithms in Time Domain

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 06 | June 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 913 Pitch Detection Algorithms in Time Domain Dr. Chavan Madhukar S1, Sutar Akshay A2 1Associate Professor & PG Co-ordinator, Dept. of Electronics and Telecommunication Engineering, P.V.P. Institute of Technology, Budhgaon-416304, Maharashtra, India 2PG Student, Dept. of Dept. of Electronics and Telecommunication Engineering, P.V.P. Institute of Technology, Budhgaon-416304, Maharashtra, India ---------------------------------------------------------------------***--------------------------------------------------------------------- Abstract - Speech signal can be classified into voiced, unvoiced and silence regions. The near periodic vibration of vocal folds is excitation for the production of voiced speech. Pitch is an important parameter in speech processing. It plays a major role in many speech processing applications. Speech signal is affected by background noise and it degrades the performance. Estimation of pitch for noisy speech signal is important task in many applications. The paper proposes a pitch detection algorithm based on the short-time average magnitude difference function (AMDF) and the short-term autocorrelation function (ACF). One defining characteristicof speech is its pitch. Detecting this Pitch or equivalently, fundamental frequency detection of a speech signal is important in many speech applications. Pitch detectors are used in vocoders, speaker identification and verification systems and also as aids to the handicapped. Because of its importance many solutions to detect pitch has been proposed both in time and frequency domains. One such solutionispitch detection is by using Autocorrelation method and Average Magnitude Difference Function (AMDF), method which are analyses done in the time domain. This paper gives the implementation results of the pitch period estimated in the time for samples of speech sounds, for different voice samples. Key Words: Pitch, Autocorrelation function,LPF,Center- clipping, Center peak-clipping, Energy of ClippedSignal. 1. INTRODUCTION Speech can be classified into two general categories, voiced and unvoiced speech. A voicedsoundisoneinwhich the vocal cords of the speaker vibrate as the sound is made, and unvoiced sound is one where the vocal cords do not vibrate. Therefore in voiced sound as the vocal chords vibrate, “Pitch” refers to the percept of the fundamentally frequency of such vibrations or the resulting periodicity in the speech signal. The pitch estimation plays a very important role in speech compression,speechcoding,speechrecognitionand synthesis, as well as in voice translation. A good estimation of the pitch period is crucial to improving the performance of speech analysis and synthesis systems. Most low rate voice coders requires accurate pitch estimation for good reconstructed speech, and some medium rate coders use pitch to reduce transmission rate while preserving high quality speech. Pitch patterns are useful in speaker recognition and synthesis. Various pitch detection algorithms (PDAs) have been developed in the past. Most of them have very high accuracy for voiced pitch estimation, but the error rate considering voicing decision is still quite high. Moreover, the PDAs performance degrades significantly as the signal conditions deteriorate. Pitch detection algorithms can be classified into the following basic categories: time domain based tracking, frequency domain based tracking or joint time-frequency domain based tracking. This paper discusses time domain based pitch period estimation. 2. PITCH DETECTION ALGORITHMS Speech signal varies with time and so for extracting features properly, we need to identify a smaller portion or window segment of speech. Then the recorded speech signal is downsampled and read using 'wavread' function in MATLAB. This function reads a small window of the downsampled signal and returns the sampled values as well as the sampling frequency. For pitch detection, time domain analyses are done in this work. The pitch can be determined either from periodicity in time domain. Time domain pitch analysis includes the Autocorrelation method and AMDF techniques 3. TIME DOMAIN PITCH DETECTION ALGORITHM 3.1 Autocorrelation Method The autocorrelation approach is the most widely used time domain method for determining pitch period of a speech signal. This method isbased on detectingthehighest value of the autocorrelation function in the region of interest. For given discrete signal x(n), the autocorrelation function is generally defined as in (1) (1)
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 06 | June 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 914 Where, N is the length of analyzed sequence and n is index in frameof autocorrelationpointstobecomputed.For pitch detection, if we assume that x (n) is periodic sequence i.e. x(n)= x(n+P) forall n, it is shownthattheautocorrelation function is also periodic with the same period, R(m)=R (m+P). Conversely, the periodicity in the autocorrelation function indicates periodicity in the signal. For a non- stationary signal, such as speech, the concept of a long-time autocorrelation measurement given by (1) is not really meaningful. In practice, we operate with short speech segments, consisting of finite number of samples. (2) Most low rate voice coders requires accurate pitch estimation forgood reconstructedspeech,andsomemedium rate coders use pitch to reduce transmission rate while preserving. 3.2 Average Magnitude Difference Function The average magnitude difference function (AMDF) is another type of autocorrelation analysis. Instead of correlating the input speech at various delays (where multiplications and summations are formed at each value), a difference signal is formed between the delayed speech and original, and at each delay value the absolute magnitude is taken. For the frame of N samples, the short-term difference function AMDF is defined as in (3) Where x (n) are samples of analyzed speech frame, x (n+m) are samples and N is frame length. The differencefunctionis expected to have a stronglocal minimumifthelagmisequal to or very close to the fundamental frequency. PDA based on average magnitude difference function has advantage in relatively low computational cost and simple implementation.Unliketheautocorrelationfunction, the AMDF calculations require no multiplications. This is a desirable property for real-timeapplications.Foreachvalue of delay, computationismadeoveranintegratingwindowof N samples. The average magnitude difference function is computed on speech segmentatlagsrunningfrom16to160 samples. The pitch period is identified as the value ofthelag at which the minimum AMDF occurs. 4. IMPLEMENTATION RESULTS AND DISCUSSION 4.1 Autocorrelation Method Fig.1 shows the block diagram of pitch detection using autocorrelation method. At the beginning of processing, the speech signal must be segmented into frames. Speech recordings with 8 kHz sampling frequency were used for experiments. Therefore, input speech signal must be segmented into frames of 240 samples (30ms).  FRAMING AND WINDOWING FOR ACF Perform frame blocking such that a stream of audio signals is converted to a set of frames. The time duration of each frame is about 20~30 ms if the frame duration is too big, the time-varying characteristics of the audio signal cannot extracted. On the other hand, if the framedurationis too small,cannot extractvalidacousticfeatures.Ingeneral,a frameshould be containsseveralfundamentalperiodsofthe given audio signals. Usually the frame size in terms of sample points is equal to the powers of 2 such as 256, 512, 1024, etc. such that it is suitable for fast Fourier transform. To reduce the difference between neighboring frames, overlap between them is done. Usually the overlap is 1/2 to 2/3 of the original frame. The more overlap, the more computation is needed. There are several terminologies that are used often:  Frame size: The sampling points within each frame  Frame overlap: The sampling points of the overlap between consecutive frames.  Frame step (or hop size): This is equal to the frame size minus the overlap.  Frame rate: The number of frames per second, which is equal to the sample frequency divided by the frame step For instance, if a stream of audio signals with sample frequency fs=16000, and a frameduration of 25 ms,overlap of 15 ms, then  Frame size = fs*25/1000 = 400 (sample points)  Frame overlap = fs*15/1000 = 240 (sample points)  Frame step (or hop size) = 400-240 = 160 (sample points)  Frame rate = fs/160 = 100 frames/sec fs = 16,000 samples/second Frame rate (overlap percentage) = 10 ms Window length (Frame length) = 25 ms => (25 ms * 16,000 = 4000 sample/frame)
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 06 | June 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 915 Fig 1. Process of frame blocking for ACF method Speech is a time varying signal, and some variations are random. Usually during slow speech, the vocal tract shape and excitation type do not change in 200 ms. But phonemes have an average duration of 80 ms. most changes occur more frequently than the 200 ms time interval. Signal analysis assumes that the properties of a signal usually change relatively slowly with time. This allows for short- term analysis of a signal. The signal is divided into successive segments, analysis is done on these segments, and somedynamic parametersareextracted.Thesignals(n) is multiplied by a fixed length analysis window w(n) to extract a particular segment at a time. This is called windowing. Choosing the right shape of window is very important, because it allows different samples to be weighted differently. The simplest analysis window is a rectangular window of length Nw: (4) Fig. 2 Flowchart of pitch detection using ACF method Fig. 3 Autocorrelation of voiced frame After windowing and computing the autocorrelationover the range of lags, the peaks are searched from the autocorrelation function. The positions (index) of the peaks are obtained. The distance between successive peaks is measured. If these distances are within a threshold valuethen the frame is classified as voiced. Otherwise, the section is classified as unvoiced. Then the value of fundamental frequency can be computed from the pitch period. Fig. 3 showstheexperimentalresultsoftheautocorrelation method obtained for a voiced frame. The waveform in the figure is the autocorrelation function of the signal. The periodicity of the signal waveform and also the uniform time lag difference between the peaks of the autocorrelation function explains the fact that the input speech signal is voiced. 4.2 Average Magnitude Difference Function Method Procedure of processing operations for AMDF based pitch detector is quite similar to the Autocorrelation method.  FRAMING AND WINDOWING FOR AMDF Perform frame blocking such that a stream of audio signals is converted to a set of frames. The time duration of each frame is about 20~30 ms if the frame duration is too big, the time-varying characteristics of the audio signal cannot extracted. On the other hand, if the framedurationis too small,cannot extractvalidacousticfeatures.Ingeneral,a frameshould be containsseveralfundamentalperiodsofthe given audio signals. Usually the frame size in terms of sample points is equal to the powers of 2 such as 256, 512, 1024, etc. such that it is suitable for fast Fourier transform. To reduce the difference between neighboring frames, overlap between them is done. Usually the overlap is 1/2 to
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 06 | June 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 916 2/3 of the original frame. The more overlap, the more computation is needed. There are several terminologies that are used often:  Frame size: The sampling points within each frame  Frame overlap: The sampling points of the overlap between consecutive frames  Frame step (or hop size): This is equal to the frame size minus the overlap.  Frame rate: The number of frames per second, which is equal to the sample frequency divided by the frame step For instance, if a stream of audio signals with sample frequency fs=16000, and a frame duration of 25 ms,overlap of 15 ms, then  Frame size = fs*25/1000 = 400 (sample points)  Frame overlap = fs*15/1000 = 240 (sample points)  Frame step (or hop size) = 400-240 = 160 (sample points)  Frame rate = fs/160 = 100 frames/sec fs = 16,000 samples/second Frame rate (overlap percentage) = 10 ms Window length (Frame length) = 25 ms => (25 ms * 16,000 = 4000 sample/frame) Fig 4 Process of frame blocking for AMDF method Speech is a timevarying signal, and somevariationsare random. Usually during slow speech, the vocal tract shape and excitation type do not change in 200 ms. But phonemes have an average duration of 80 ms. most changes occur more frequently than the 200 ms time interval. Signal analysis assumes that the properties of a signal usually change relatively slowly with time. This allows for short- term analysis of a signal. The signal is divided into successive segments, analysis is done on these segments, and somedynamic parametersareextracted.Thesignals(n) is multiplied by a fixed length analysis window w(n) to extract a particular segment at a time. This is called windowing. Choosing the right shape of window is very important, because it allows different samples to be weighted differently. The simplest analysis window is a rectangular window of length Nw: (6) Fig. 5 Flowchart of pitch detection using AMDF method After windowing, the average magnitude difference function is computed on the speech segment as defined in equation 3. The pitch period is identified as the value of the lag at which the minimum AMDF occurs. This is the pitch period. Fig. 5 shows the flowchart of AMDF method. Fig 6. AMDF Function of voiced frame
  • 5. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 06 Issue: 06 | June 2019 www.irjet.net p-ISSN: 2395-0072 © 2019, IRJET | Impact Factor value: 7.211 | ISO 9001:2008 Certified Journal | Page 917 Fig. 6 shows the experimental results of the AMDF method obtained for a voiced frame. The waveform in the figure is the AMDF function of an signal. The average magnitude difference function (AMDF) is another type of autocorrelation analysis. Instead of correlating the input speech at various delays (where multiplications and summations are formed at each value), a difference signalis formed between the delayed speech and original, and at each delay value the absolute magnitude is taken. 5. CONCLUSIONS This paper discusses the different pitch detection algorithms for speech signals. PDAs based on the autocorrelation function – Autocorrelation method and Autocorrelation. Each of the described algorithms has their advantages and drawbacks. The AMDF method has great advantage in very low computational complexity. This makes it possible to implement it in real-time applications. REFERENCES [1] Chandrashekar H. M1 and Pratibha K2, “Estimation and Tracking of Pitch for Noisy Speech Signals using EMD based Autocorrelation FunctionAlgorithm”2017 2nd IEEE International Conference On Recent Trends in Electronics Information & Communication Technology (RTEICT), May 19-20, 2017, India [2] K. Inbanila1 and E. Krishnakumar2 ‘‘Enhancement of Substitution Voices Using F1 Formant Deviation Analysis and DTW Based Template Matching” International Conference on Wireless Communications, Signal Processing and Networking (WiSPNET) 2017. [3] L. Rabiner, M. J. Cheng, A. E. Rosenberg, and C. A. McGonegal, "A comparative performance study of several pitch detection algorithms," IEEETransactions on ASSP, vol. 24, pp. 399-417, 1976. [4] L. R.Rabiner and R. W.Schafer ,Digital Processing of Speech Signals, Prentice-Hall, Englewood Cliffs, NJ, 1978. [5] W. J. Hess, Pitch Determination of Speech Signals,New York: Springer, 1993 [6] H. Boƙil, P. PollĂĄk, "Direct Time Domain Fundamental Frequency Estimation of Speech in Noisy Conditions". Proc. EUSIPCO2004, Wien, Austria, vol. 1, pp. 1003- 1006, 2004. [7] B. Kotnik, H. Höge, and Z. Kacic, “Evaluation of Pitch Detection Algorithms in Adverse Conditions”. Proc. 3rd International Conference on Speech Prosody, Dresden, Germany, pp. 149 -152, 2006 ACKNOWLEDGEMENT Gratitude is the hardest emotion to express and often one doesn’t find adequate words to convey that entire one feels. I would like to express the deepest appreciation to my Guide and PG Coordinator Dr. Madhukar S. Chavan who has the attitude and the substance of a geniusthathecontinually and convincingly conveyed a spirit of adventure in regard to research. Without his guidance and persistent help this dissertation would not have been possible. I would like to thank my Head of department Dr. D. B. Kadam for his valuable suggestions and constant encouragement all through the project work. I offer my humble and sincere thanks to Dr. D. V. Ghewade Principal, P.V.P.I.T. Budhgaon for his all possible cooperation. I express my sincere thanks to all faculty and staff members of Electronics and Telecommunication Engg. Department of P.V.P.I.T. Budhgaon for their kind co- operation and encouragement. It gives me immense pleasuretoacknowledgeandthank many individuals who contributed in various ways for the successful completion of this work. This project consumed huge amount of work, research and dedication. I would like to extend my sincere gratitude to all of them. BIOGRAPHIES “Dr. Chavan Madhukar S. Associate Professor & PG Co- ordinator. Department of Electronics and Telecommunication Engineering, P.V.P. Institute of Technology, Budhgaon.” “Mr. Sutar Akshay A. PG Student ”Description Department of Electronics and Telecommunication Engineering, P.V.P. Institute of Technology, Budhgaon”