SlideShare a Scribd company logo
1 of 8
Download to read offline
A. Afzalian, M. R. Karami mollaei & J. Ghasemi
Signal Processing: An International Journal (SPIJ), Volume(4): Issue(1) 54
A Combined Voice Activity Detector Based On Singular Value
Decomposition and Fourier Transform
Amard Afzalian a.afzalian@gmail.com
Islamic Azad University, Science
and Research,Tehran, IRAN.
Mohammad Reze Karami Mollaei mkarami@nit.ac.ir
Faculty of Electrical and Computer Engineering,
Babol Noshirvani University of Technology
Babol, P.O. Box 47135-484, IRAN
Jamal Ghasemi jghasemi@stu.nit.ac.ir
Faculty of Electrical and Computer Engineering,
Signal Processing Laboratory
Babol Noshirvani University of Technology,
Babol, P.O. Box 47135-484, IRAN
Abstract
Voice activity detector (VAD) is used to separate the speech data included parts
from silence parts of the signal. In this paper a new VAD algorithm is represented
on the basis of singular value decomposition. There are two sections to perform
the feature vector extraction. In first section voiced frames are separated from
unvoiced and silence frames. In second section unvoiced frames are silence
frames. To perform the above sections, first, windowing the noisy signal then
Hankel’s matrix is formed for each frame. The basis of statistical feature
extraction of purposed system is slope of singular value curve related to each
frame by using linear regression. It is shown that the slope of singular values
curve per different SNRs in voiced frames is more than the other types and this
property can be to achieve the goal the first part can be used. High similarity
between feature vector of unvoiced and silence frame caused to approach for
separation of the two categories above cannot be used. So in the second part,
the frequency characteristics for identification of unvoiced frames from silent
frames have been used. Simulation results show that high speed and accuracy
are the advantages of the proposed system.
Keywords: Speech, Voice Activity Detector, Singular Value.
1. INTRODUCTION
Voice activity detection is an important step in some speech processing systems, such as speech
recognition, speech enhancement, noise estimation, speech compression ... etc. In speech
recognition when a word or utterance begins or ends (the end points) must be specified [1]. Also
VAD is used to disable speech recognition for silence segments. Some speech transition systems
transmits active segments in high rate of bits and transmits silence in low rate of bits, by this
method they improve the band-width [2]. In some speech enhancement algorithm for example
A. Afzalian, M. R. Karami mollaei & J. Ghasemi
Signal Processing: An International Journal (SPIJ), Volume(4): Issue(1) 55
spectral subtraction method, a VAD method is required to know when to update the noise
reference [3,20,21]. Conversational speech is a sequence of consecutive segments of silence
and speech. In noisy signal silence regions are noisy. Voice sound contain more energy than
unvoiced sound, while unvoiced sounds are more noise-like, so in noisy condition activity
detection is harder In unvoiced regions. Feature extraction is the most important section in VAD
system that elicits required parameters from desired frame. To achieve an accurate algorithm, the
system parameters must be selected until by them can be able to separate from each other
areas. After proper feature election, threshold is applied to the extracted parameters and the
decisions are made. To achieving good detection level threshold can be adapted to the change of
the noise conditions. Many of the algorithms assume that the first frames are silence [5, 6, and 7],
so we can initialize noise reference from these frames. Common features used in VAD's are short
term Fourier transform and zero crossing rate [4, 6, 11, and 12]. Another important and widely
used parameter in this regard is Cepstral Coefficient [7, 9]. In this method the Cepstral
coefficients are calculated within frames and then by calculating the difference between this
vector and the value assigned to the noise signal and then comparing the result with the basic
threshold value, the frame identity could be determined. LPC method is also another major
applicable method for VAD implementation [13]. Generally in LPC based algorithms a series of
mean coefficients are experimentally considered for voice, unvoiced and silent modes. In the next
step the LPC coefficients of suspicious frame and their relative difference with mean indices are
calculated and the frame identity is recognized based on these values. The other parameters for
implementing VAD in combined algorithms are LTSE (long term Spectral Estimation)[5], wavelet
coefficient [8,10], the ratio of signal to noise in sub-band [14], LSPE(Least Square Periodicity
Estimator)[11] and AMDF(Average Magnitude Difference Function)[15]. One of most important
cases in VAD system is speed of system performance beside proper accuracy. In this paper is to
present a new algorithm of VAD based on single value decomposition and frequency features,
specifications accuracy and speed simultaneously be fulfilled. Based on this, paper organization
as follows that in Section 2 single values decomposition (SVD) will be explained. In Section 3 the
proposed method with the system block diagram is given. In Section 4 simulation results in terms
of quantitative and qualitative criteria is evaluated. Finally, the article concludes with Section 5
ends.
2. Singular Value Decomposition
The singular value composition is one of the main tools in digital signal processing and statistical
data. By doing SVD on a matrix with dimensions of M×N, we have:
(1)T
X U V= Σ
On above relation U and V matrixes are singular vectors matrix with dimensions of M×M and
N×N, respectively. Also, ∑ with r order of a diagonal matrix M×N is included singular values so
that components on the main dial gauge are not zero and other components are zero. The
elements on main dial gauge areas 11 22 ... 0rrσ σ σ> > > > And are the values of X matrix. For
exercising SVD to one dimensional matrix, the vector of signal samples must map to subspace
with more dimensions, on the other hand must be changed to a matrix in certain way. Different
ways have indicated for one dimensional signal transformation to a matrix that in this article
(here) have used Hankel’s way [16,19].
3. Purposed algorithm
The main question on sound discovery is the classification of listening signal characteristic to
diagnosis sound parts. Thus, listening signal are classified to sound classes: silence, voiced, and
unvoiced. For classifying, suitable characteristics must elicit from the speech signal parts (frame).
Before studying the details of purposed system, general block diagram are showed in figure 1.
A. Afzalian, M. R. Karami mollaei & J. Ghasemi
Signal Processing: An International Journal (SPIJ), Volume(4): Issue(1) 56
One Frame of Noisy signal
Windowing
SVD
VAD Output
VAD Decision
FFT & Gaussian
Approximation
Feature Extraction BlockInput Signal
Figure (1) block diagram of propositional method for indicator system of voice activity.
As shown on figure 1, feature vector extraction done in two parts. In first part, voiced frames are
separated from unvoiced and silence frame as for statistical characteristics of singular values
matrix. In second part, unvoiced frames are separated from silence frames that this separation is
based on its frequency spectrum and Gaussian rate in each frame. At the end, one value accrues
to voiced and unvoiced frames that including voice information, and zeros one to silence frames.
3.1. Voiced frames separation from the other parts
In suggested system to separate the voiced parts from two parts of unvoiced and silence ones, it
is used on slop of singular value curve in related part. For doing atop stages, first noisy signal
divide to 16ms frames. According to relation (2), Hankel matrix makes for every frame.
(2)
0 1 1
1 2
0 1 1
1 1
...
...
[ , ,..., ]
...
M
M
k n k
L L N
x x x
x x x
X x x x H
x x x
−
−
− −
 
 
 = → =
 
 
 
M M M M
Where kX is the vector of exist samples in K frame in input signal and kH is the isomorphic
Hankel matrix of L×M dimensions with kX . The percent of sample overlapping in matrix and the
conditions of kH dimensions are brought on (3) and (4) relations.
(3)1
% 100
L
overlapping
L
−
= ×
(4)1,M L N L M+ = + ≥
For gaining full primitive pattern of kX frame, the number of added zeros must lessen in M
column of kH matrix that its results have brought on (5) relation.
(5)- mod( , ) [ ] 1, -[ ]
2 2
N N
ZeroPadding L N L L M N= ⇒ = + =
Gained values of L, M in (5) relation get kH semi rectangular matrix with maximum sample
overlapping. The singular values of each frame are got by Hankel matrix of existed frame and
using SVD map on related singular values.
(6)T
k k k kH U V= ∑
In (6) relation kU and kV the singular vectors of diagonal matrix and also k∑ are isomorphic
singular values matrix with kH (part 2). Figure (2), singular values vectors show the every voiced
frames in 16 millisecond length and SNR=10db with white Gaussian noise.
A. Afzalian, M. R. Karami mollaei & J. Ghasemi
Signal Processing: An International Journal (SPIJ), Volume(4): Issue(1) 57
0 20 40 60 80 100 120 140
0
2
4
6
8
10
12
Singular Values of Voice Frame
0 20 40 60 80 100 120 140
0
0.2
0.4
0.6
0.8
1
1.2
1.4
Singular Values of Unvoice Frame
0 20 40 60 80 100 120 140
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
Singular Values of Silence Frame
Figure (2) singular values vectors of voiced, unvoiced
and silence frame in SNR=10db (voice signal from
housewives_16.wav on TIMIT database)
According as seeing on figure (2) the slop of curve in singular values between voiced frames are
different from the other parts. In fact, singular values of voiced frames have more slop than
singular values of unvoiced and silence frames. The base of statistical feature extraction for
separating voiced frames is the slop of singular values curve related to each frame by using linear
regression. Table 1 shows the values of this slop in different SNRs on three certain frames that
have chosen from voiced, unvoiced and silence parts.
Table (1):
slope of singular values curve in linear regression related to species of frame on different SNRs
(voice signal from housewives_16.wav on TIMIT database)
Mean amount of singular values curve slope on 10 times repeat for each SNR
Silence frameUnvoiced
frame
Voiced
frame
SNR
0.09490.09870.14980db
0.05280.05660.12325db
0.03050.03380.117010db
0.01760.02320.114315db
0.00980.01750.113920db
Results of table (1) are support on this thesis that the slop of singular values curve on different
SNRs in voiced frames are more than the others and by using this trait we can achieve the goal
of first section that was the feature vector extraction in voiced frames from related singular values
matrix.
A. Afzalian, M. R. Karami mollaei & J. Ghasemi
Signal Processing: An International Journal (SPIJ), Volume(4): Issue(1) 58
2.3. Separating the voiced and silence frames
By studying figure (2) and table (1) are deduced that according to approximation through slop of
singular values curves related to voiced and silence frames, it can't divide these two type frames
from each other. Because of this in purposed system have used the other trait to separate these
two parts. In this part, frequency trait has used for recognition unvoiced frames from silence ones.
The base of comparison is the curve of Gaussian function, (7) relation.
(7)
2
2
( )
2
( ; , )
x c
f x c e γ
γ
− −
=
Atop relation C is mean and γ is variance of curve. By doing study, we see the discrete Fourier
transform of unvoiced frames in meddle frequency are similar to the curve of Gaussian functions
to some extent. Figure (3) and (4) show the smooth frequency spectrum related to unvoiced and
silence frames and their comparison with the curve of Gaussian function. (Voice signal from
TIMIT database with SNR=15db)
0 20 40 60 80 100 120
0
1
2
3
4
5
6
x 10
-3 Unvoice Frame in FFT Domain & Gaussian function
Figure (3) frequency spectrum of Unvoice frame in
SNR=15db and the curve of Gaussian function ( 0.6γ = )
0 20 40 60 80 100 120 140
0
1
2
3
4
5
6
x 10
-3 Silence Frame in FFT Domain & Gaussian function
Figure (4) frequency spectrum of Silence frame in
SNR=15db and the curve of Gaussian function ( 0.6γ = )
According as atop figures are seen, frequency spectrum of voiced frame is more similar to
Gaussian curve than silence frame noise in meddle frequency. Also by different examinations, the
optimal values of γ parameter in varying SNRs are as 0.2 with a view of minimizing the
difference between Gaussian pattern and frequency spectrum of silence frame pattern.
4. SIMULATION RESULTS
In this section, the operation of purposed system study and compare considered as accuracy and
speed. Sx114.wav voice signal from TIMIT database including 166 of 16 millisecond frames with
sampling rate is 16 kHz that each frame has 256 samples. Figure (5) shows the noisy signal with
SNR=10db and the indicator systems output of voice activity.
A. Afzalian, M. R. Karami mollaei & J. Ghasemi
Signal Processing: An International Journal (SPIJ), Volume(4): Issue(1) 59
(a)
(b)
(c)
0 0.5 1 1.5 2 2.5 3 3.5 4
x 10
4
-0.1
0
0.1
time
Amplitude
original signal
0 20 40 60 80 100 120 140 160
-1
0
1
2
Output Of Ideal VAD
0 20 40 60 80 100 120 140 160
-1
0
1
2
Output Of VAD Based on Wavelet
0 20 40 60 80 100 120 140 160
-1
0
1
2
Output Of Perposed VAD
(d)
Figure(5).Simulation Results: (a)TIMIT Signal, (b) Output of ideal VAD, (c) Output of VAD
based on Wavelet, (d) Output of Purposed VAD
According as seeing, suggested method has more efficiency to keep the parts of signals that the
activity signal is weak. In figure (5) and dashed line-drawn area, one unvoiced letter has omitted
in VAD system as wavelet (figure 5-c ) that it has kept on VAD output propositional system (figure
5-d ). In table (2) the accuracy rate of VAD propositional system is shown as wavelet transform
[17] about voice signal sx114.wav for different SNRs.
Table (2): percent of VAD system error comparison by using purposed algorithm and wavelet based
algorithm in different SNRs
Wavelet based algorithm
Purposed
algorithm
SNR
36%25%0db
20%18%5db
14%15%10db
12%13%15db
10%8%20db
One of the strength points of the purposed algorithm is its speed. In table (3), speed of two
algorithms has compared with each other in processing the sx114.wav sound file (CPU Intel
Core2Duo 2.5 GHz, 4 M Cache 733 MB RAM)
Table (3):
Speed comparison of VAD system by using purposed algorithm and wavelet based algorithm
Consumed time in wavelet based algorithmConsumed time in purposed algorithm
12 second4 second
In this part by using two VAD systems as preprocessing block has brought the results of hearing
test for a speech signal rich-making system that accuracy of VAD operation system be proved in
keeping unvoiced areas. In table 4 the standards has come that is used in evaluation of speech
with hearing factor.
A. Afzalian, M. R. Karami mollaei & J. Ghasemi
Signal Processing: An International Journal (SPIJ), Volume(4): Issue(1) 60
Table (4)
Five-Point adjectival scales for quality and impairment, and associated scores
ImpairmentScore
Imperceptible5 (Excellent)
(Just) Perceptible but not Annoying4 (Good)
(Perceptible and) Slightly Annoying3 (Fair)
Annoying (but not Objectionable)2 (Poor)
Very Annoying (Objectionable)1 (Bad)
In table 5, results of using two said VAD algorithms have shown as preprocessing block for
enhancement method of multi band spectral subtraction. Specifications of this test are in [3].
Table (5)
Results of MOS test;
17 clean speech signal from TIMIT database; Noise Type: White Gaussian Noise.
Input SNR
10db5db0db
Used Algorithm
2.82.31.6Wavelet Based
3.32.71.8Purposed
Studying the results of table (5) are shown the reform efficiency of speech enhancement by using
of propositional VAD algorithm to wavelet transform way.
5. CONCLUSION
In this paper a new method for Voice activity detector based on Singular Value Decomposition
and discrete Fourier transform was proposed. The proposed method is evaluated by using
various criteria. By using the mentioned criteria, it is presented that this method can compete with
other methods. Also, the aim of indicator systems of voice activity is control of destruction
unvoiced signal sites in rich-making operation that observantly to results of hearing test, the
propositional algorithm have proper power in compare with wavelet transform way. The
propositional system has manifold speed than usual method that is one of the obvious characters
in practical usage and hardware implementation.
6. REFERENCES
[1] J. Ramirez, J. C. Segura, C. Benitez, A. de la Torre, A. J. Rubio, “A new Kullback-Leibler
VAD for speech recognition in noise”, IEEE Signal Processing Letters, 11(2) :266– 269.
2004.
[2] A. Kondoz, “Digital speech: coding for low bit rate communication systems”, J. Wiley, New
York, 1994.
[3] Y. Ghanbari, M. R. Karami Mollaei, “A new approach for speech enhancement based on the
adaptive thresholding of the wavelet packets”, Speech Communication 48 (2006) 927–940.
[4] B. V. Harsha, “A Noise Robust Activity Detection Algorithm”, proc. Of int. symposium of
intelligent multimedia, video and speech processing, pp. 322-325, Oct. 2004, Hong Kon.
[5] J. Ramırez, J. C. Segura, C. Benıtez, A. de la Torre, A. Rubio, “A New Adaptive Long-Term
Spectral Estimation Voice Activity Detector,” EUROSPEECH, pp. 3041-3044, 2003,
Geneva.
[6] J. Faneuff, “ Spatial, Spectral, and Perceptual Nonlinear Noise Reduction for Hands-free
Microphones in a Car,” Master Thesis Electrical and Computer Engineering July 2002.
A. Afzalian, M. R. Karami mollaei & J. Ghasemi
Signal Processing: An International Journal (SPIJ), Volume(4): Issue(1) 61
[7] S. Skorik, F. Berthommier, “On a Cepstrum-Based Speech Detector Robust To White Noise,”
Accepted for Specom 2000, St. Petersbourg.
[8] J. Stegmann, G. Schroeder, “Robust Voice Activity Detection Based on the Wavelet
Transform", Proc. IEEE Workshop on Speech Coding, Sep. 1997, pp. 99-100, Pocono
Manor, Pennsylvania, USA.
[9] J.A. Haigh and J.S. Mason, “Robust Voice Activity Detection Using Cepstral Features,” In
Proc. of IEEE TENCON'93, vol. 3, pp. 321-324, 1993, Beijng.
[10] J. Shaojun, G. Hitato, Y. Fuliang, “A New Algorithm For Voice Activity Detection Based On
Wavelet Transform,” proc. Of int. symposium of intelligent multimedia, video and speech
processing, pp. 222-225, Oct. 2004, Hong Kong.
[11] Tanyer S G, Ozer H, “ Voice activity detection in nonstationary guassian noise,” proceeding
of ICSP’98 pp.1620-1623.
[12] Sangwan, A., Chiranth, M. C., Jamadagni, H. S., Sah, R.,Prasad, R. V., Gaurav, V., “VAD
Techniques for Real-Time Speech Transmission on the Internet”, 5th IEEE International
Conference on High-Speed Networks and Multimedia Communications, pp. 46–50, 2002.
[13] L. R. Rabiner, M. R. Sambur, “ Application of an LPC Distance Measure to the Voiced-
Unvoiced-Silence Detection Problem,” IEEE Trans. on Acoustics, Speech, and Signal
Processing, Vol. ASSP-25, No. 4, pp. 338-343, August 1977.
[14] A. Vahatalo, I. Johansson, “Voice Activity Detection For GSM Adaptive Multi-Rate Codec,”
IEEE 1999, pp. 55-57.
[15] M.Orlandi, a. santarelli, D. Falavigna, “Maximum Likelihood endpoint Detection with time-
domain features,”eurospeech 2003, Geneva, pp.1757-1760.
[16] S. H. Jensen, P. C. Hansen, S. D. Hansen, “Reduction of Broad-Band Noise in Speech by
Truncated QSVD,” IEEE, Trans on speech & Audio Processing, Vol.3, No.6, November
1995.
[17] Y. Ghanbari, M. Karami, “Spectral subtraction in the wavelet domain for speech
enhancement,” International Conference on Information Knowledge Technology (IkT2004),
CD ROM, 2004.
[18] H. Sameti, H. Sheikhzadeh, Li Deng, R. L.Brennan, “HMM-Based Strategies for
Enhancement of Speech Signals Embedded in Nonstationary Noise”, IEEE Transactions
on Speech and Audio Processing, Vol. 6, No. 5, September 1998.
[19] S. Sayeed, N. S. Kamel, R. Besar, “A Sensor-Based Approach for Dynamic Signature
Verification using Data Glove”. Signal Processing: An International Journal (SPIJ),
2(1)1:10, 2008.
[20] J.Ghasemi, M. R. Karami Mollaei, "A New Approach for Speech Enhancement Based On
Eigenvalue Spectral Subtraction" Signal Processing: An International Journal, (SPIJ) 3(4),
34:41, 2009.
[21] A. Afzalian, M.R. Karami Mollaei, J. Ghasemi, " A New Approach for Speech
Enhancement Based On Singular Value Decomposition and Wavelet Transform", AJBAS
In Press(2010).

More Related Content

What's hot

IRJET- Reconstruction of Sparse Signals(Speech) Using Compressive Sensing
IRJET- Reconstruction of Sparse Signals(Speech) Using Compressive SensingIRJET- Reconstruction of Sparse Signals(Speech) Using Compressive Sensing
IRJET- Reconstruction of Sparse Signals(Speech) Using Compressive SensingIRJET Journal
 
Noisy Speech Enhancement Using Soft Thresholding on Selected Intrinsic Mode F...
Noisy Speech Enhancement Using Soft Thresholding on Selected Intrinsic Mode F...Noisy Speech Enhancement Using Soft Thresholding on Selected Intrinsic Mode F...
Noisy Speech Enhancement Using Soft Thresholding on Selected Intrinsic Mode F...CSCJournals
 
Performance Comparison of Modified Variable Step Size Leaky LMS Algorithm for...
Performance Comparison of Modified Variable Step Size Leaky LMS Algorithm for...Performance Comparison of Modified Variable Step Size Leaky LMS Algorithm for...
Performance Comparison of Modified Variable Step Size Leaky LMS Algorithm for...ijcnac
 
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...IDES Editor
 
Improved nonlocal means based on pre classification and invariant block matching
Improved nonlocal means based on pre classification and invariant block matchingImproved nonlocal means based on pre classification and invariant block matching
Improved nonlocal means based on pre classification and invariant block matchingIAEME Publication
 
Hybrid hmmdtw based speech recognition with kernel adaptive filtering method
Hybrid hmmdtw based speech recognition with kernel adaptive filtering methodHybrid hmmdtw based speech recognition with kernel adaptive filtering method
Hybrid hmmdtw based speech recognition with kernel adaptive filtering methodijcsa
 
Performance Analysis of PAPR Reduction in MIMO-OFDM
Performance Analysis of PAPR Reduction in MIMO-OFDMPerformance Analysis of PAPR Reduction in MIMO-OFDM
Performance Analysis of PAPR Reduction in MIMO-OFDMIJARBEST JOURNAL
 
Performance of MMSE Denoise Signal Using LS-MMSE Technique
Performance of MMSE Denoise Signal Using LS-MMSE  TechniquePerformance of MMSE Denoise Signal Using LS-MMSE  Technique
Performance of MMSE Denoise Signal Using LS-MMSE TechniqueIJMER
 
Design of ternary sequence using msaa
Design of ternary sequence using msaaDesign of ternary sequence using msaa
Design of ternary sequence using msaaEditor Jacotech
 
Method for Converter Synchronization with RF Injection
Method for Converter Synchronization with RF InjectionMethod for Converter Synchronization with RF Injection
Method for Converter Synchronization with RF InjectionCSCJournals
 
Project_report_BSS
Project_report_BSSProject_report_BSS
Project_report_BSSKamal Bhagat
 
14 a blind edit ms word (edit tyas)2
14 a blind edit ms word (edit tyas)214 a blind edit ms word (edit tyas)2
14 a blind edit ms word (edit tyas)2IAESIJEECS
 
hankel_norm approximation_fir_ ijc
hankel_norm approximation_fir_ ijchankel_norm approximation_fir_ ijc
hankel_norm approximation_fir_ ijcVasilis Tsoulkas
 
On Channel Estimation of OFDM-BPSK and -QPSK over Nakagami-m Fading Channels
On Channel Estimation of OFDM-BPSK and -QPSK over Nakagami-m Fading ChannelsOn Channel Estimation of OFDM-BPSK and -QPSK over Nakagami-m Fading Channels
On Channel Estimation of OFDM-BPSK and -QPSK over Nakagami-m Fading ChannelsCSCJournals
 
Ch7 noise variation of different modulation scheme pg 63
Ch7 noise variation of different modulation scheme pg 63Ch7 noise variation of different modulation scheme pg 63
Ch7 noise variation of different modulation scheme pg 63Prateek Omer
 

What's hot (20)

IRJET- Reconstruction of Sparse Signals(Speech) Using Compressive Sensing
IRJET- Reconstruction of Sparse Signals(Speech) Using Compressive SensingIRJET- Reconstruction of Sparse Signals(Speech) Using Compressive Sensing
IRJET- Reconstruction of Sparse Signals(Speech) Using Compressive Sensing
 
Final document
Final documentFinal document
Final document
 
Noisy Speech Enhancement Using Soft Thresholding on Selected Intrinsic Mode F...
Noisy Speech Enhancement Using Soft Thresholding on Selected Intrinsic Mode F...Noisy Speech Enhancement Using Soft Thresholding on Selected Intrinsic Mode F...
Noisy Speech Enhancement Using Soft Thresholding on Selected Intrinsic Mode F...
 
F43063841
F43063841F43063841
F43063841
 
Performance Comparison of Modified Variable Step Size Leaky LMS Algorithm for...
Performance Comparison of Modified Variable Step Size Leaky LMS Algorithm for...Performance Comparison of Modified Variable Step Size Leaky LMS Algorithm for...
Performance Comparison of Modified Variable Step Size Leaky LMS Algorithm for...
 
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...
 
Improved nonlocal means based on pre classification and invariant block matching
Improved nonlocal means based on pre classification and invariant block matchingImproved nonlocal means based on pre classification and invariant block matching
Improved nonlocal means based on pre classification and invariant block matching
 
Hybrid hmmdtw based speech recognition with kernel adaptive filtering method
Hybrid hmmdtw based speech recognition with kernel adaptive filtering methodHybrid hmmdtw based speech recognition with kernel adaptive filtering method
Hybrid hmmdtw based speech recognition with kernel adaptive filtering method
 
Id135
Id135Id135
Id135
 
Performance Analysis of PAPR Reduction in MIMO-OFDM
Performance Analysis of PAPR Reduction in MIMO-OFDMPerformance Analysis of PAPR Reduction in MIMO-OFDM
Performance Analysis of PAPR Reduction in MIMO-OFDM
 
Performance of MMSE Denoise Signal Using LS-MMSE Technique
Performance of MMSE Denoise Signal Using LS-MMSE  TechniquePerformance of MMSE Denoise Signal Using LS-MMSE  Technique
Performance of MMSE Denoise Signal Using LS-MMSE Technique
 
D111823
D111823D111823
D111823
 
Design of ternary sequence using msaa
Design of ternary sequence using msaaDesign of ternary sequence using msaa
Design of ternary sequence using msaa
 
Iy2617051711
Iy2617051711Iy2617051711
Iy2617051711
 
Method for Converter Synchronization with RF Injection
Method for Converter Synchronization with RF InjectionMethod for Converter Synchronization with RF Injection
Method for Converter Synchronization with RF Injection
 
Project_report_BSS
Project_report_BSSProject_report_BSS
Project_report_BSS
 
14 a blind edit ms word (edit tyas)2
14 a blind edit ms word (edit tyas)214 a blind edit ms word (edit tyas)2
14 a blind edit ms word (edit tyas)2
 
hankel_norm approximation_fir_ ijc
hankel_norm approximation_fir_ ijchankel_norm approximation_fir_ ijc
hankel_norm approximation_fir_ ijc
 
On Channel Estimation of OFDM-BPSK and -QPSK over Nakagami-m Fading Channels
On Channel Estimation of OFDM-BPSK and -QPSK over Nakagami-m Fading ChannelsOn Channel Estimation of OFDM-BPSK and -QPSK over Nakagami-m Fading Channels
On Channel Estimation of OFDM-BPSK and -QPSK over Nakagami-m Fading Channels
 
Ch7 noise variation of different modulation scheme pg 63
Ch7 noise variation of different modulation scheme pg 63Ch7 noise variation of different modulation scheme pg 63
Ch7 noise variation of different modulation scheme pg 63
 

Viewers also liked

Speaker identification system with voice controlled functionality
Speaker identification system with voice controlled functionalitySpeaker identification system with voice controlled functionality
Speaker identification system with voice controlled functionalityarizhamid786
 
Speaker recognition in android
Speaker recognition in androidSpeaker recognition in android
Speaker recognition in androidAnshuli Mittal
 
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...ijsrd.com
 
Voice Activity Detection using Single Frequency Filtering
Voice Activity Detection using Single Frequency FilteringVoice Activity Detection using Single Frequency Filtering
Voice Activity Detection using Single Frequency FilteringTejus Adiga M
 
Framework of ADP TRAINING - AAI
Framework of ADP TRAINING - AAIFramework of ADP TRAINING - AAI
Framework of ADP TRAINING - AAISANJIV SONI
 
Adaptive noise estimation algorithm for speech enhancement
Adaptive noise estimation algorithm for speech enhancementAdaptive noise estimation algorithm for speech enhancement
Adaptive noise estimation algorithm for speech enhancementHarshal Ladhe
 
intelligent transportation system
intelligent transportation system intelligent transportation system
intelligent transportation system Mohammed Faazil
 
SENSOR BASED AUTOMATIC DRIP IRRIGATION SYSTEM
SENSOR BASED AUTOMATIC DRIP IRRIGATION SYSTEMSENSOR BASED AUTOMATIC DRIP IRRIGATION SYSTEM
SENSOR BASED AUTOMATIC DRIP IRRIGATION SYSTEMJournal For Research
 
Nlms algorithm for adaptive filter
Nlms algorithm for adaptive filterNlms algorithm for adaptive filter
Nlms algorithm for adaptive filterchintanajoshi
 
pavement analysis
pavement analysis pavement analysis
pavement analysis Saeed Badeli
 
" Design of Flexible Pavement "
" Design of Flexible Pavement "" Design of Flexible Pavement "
" Design of Flexible Pavement "Nishant Pandey
 
Solar roads seminar
Solar roads seminarSolar roads seminar
Solar roads seminarDaanish Zama
 
Airtificial Intelligence in Power System
Airtificial Intelligence in Power SystemAirtificial Intelligence in Power System
Airtificial Intelligence in Power SystemPratik Doshi
 
Intelligent transport system (ITS)
Intelligent transport system (ITS)Intelligent transport system (ITS)
Intelligent transport system (ITS)Aravind Samala
 
ppt on pavement design
ppt on pavement designppt on pavement design
ppt on pavement designRohit Ranjan
 
Noice canclellation using adaptive filters with adpative algorithms(LMS,NLMS,...
Noice canclellation using adaptive filters with adpative algorithms(LMS,NLMS,...Noice canclellation using adaptive filters with adpative algorithms(LMS,NLMS,...
Noice canclellation using adaptive filters with adpative algorithms(LMS,NLMS,...Brati Sundar Nanda
 
artificial intelligence in power plants.
artificial intelligence in power plants.artificial intelligence in power plants.
artificial intelligence in power plants.8105268008
 
Performance analysis of adaptive noise canceller for an ecg signal
Performance analysis of adaptive noise canceller for an ecg signalPerformance analysis of adaptive noise canceller for an ecg signal
Performance analysis of adaptive noise canceller for an ecg signalRaj Kumar Thenua
 
Artificial intelligence report
Artificial intelligence reportArtificial intelligence report
Artificial intelligence reportSourabh Sharma
 

Viewers also liked (20)

Speaker identification system with voice controlled functionality
Speaker identification system with voice controlled functionalitySpeaker identification system with voice controlled functionality
Speaker identification system with voice controlled functionality
 
Speaker recognition in android
Speaker recognition in androidSpeaker recognition in android
Speaker recognition in android
 
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
Cancellation of Noise from Speech Signal using Voice Activity Detection Metho...
 
Voice Activity Detection using Single Frequency Filtering
Voice Activity Detection using Single Frequency FilteringVoice Activity Detection using Single Frequency Filtering
Voice Activity Detection using Single Frequency Filtering
 
Framework of ADP TRAINING - AAI
Framework of ADP TRAINING - AAIFramework of ADP TRAINING - AAI
Framework of ADP TRAINING - AAI
 
Adaptive noise estimation algorithm for speech enhancement
Adaptive noise estimation algorithm for speech enhancementAdaptive noise estimation algorithm for speech enhancement
Adaptive noise estimation algorithm for speech enhancement
 
intelligent transportation system
intelligent transportation system intelligent transportation system
intelligent transportation system
 
SENSOR BASED AUTOMATIC DRIP IRRIGATION SYSTEM
SENSOR BASED AUTOMATIC DRIP IRRIGATION SYSTEMSENSOR BASED AUTOMATIC DRIP IRRIGATION SYSTEM
SENSOR BASED AUTOMATIC DRIP IRRIGATION SYSTEM
 
Nlms algorithm for adaptive filter
Nlms algorithm for adaptive filterNlms algorithm for adaptive filter
Nlms algorithm for adaptive filter
 
pavement analysis
pavement analysis pavement analysis
pavement analysis
 
Speaker recognition.
Speaker recognition.Speaker recognition.
Speaker recognition.
 
" Design of Flexible Pavement "
" Design of Flexible Pavement "" Design of Flexible Pavement "
" Design of Flexible Pavement "
 
Solar roads seminar
Solar roads seminarSolar roads seminar
Solar roads seminar
 
Airtificial Intelligence in Power System
Airtificial Intelligence in Power SystemAirtificial Intelligence in Power System
Airtificial Intelligence in Power System
 
Intelligent transport system (ITS)
Intelligent transport system (ITS)Intelligent transport system (ITS)
Intelligent transport system (ITS)
 
ppt on pavement design
ppt on pavement designppt on pavement design
ppt on pavement design
 
Noice canclellation using adaptive filters with adpative algorithms(LMS,NLMS,...
Noice canclellation using adaptive filters with adpative algorithms(LMS,NLMS,...Noice canclellation using adaptive filters with adpative algorithms(LMS,NLMS,...
Noice canclellation using adaptive filters with adpative algorithms(LMS,NLMS,...
 
artificial intelligence in power plants.
artificial intelligence in power plants.artificial intelligence in power plants.
artificial intelligence in power plants.
 
Performance analysis of adaptive noise canceller for an ecg signal
Performance analysis of adaptive noise canceller for an ecg signalPerformance analysis of adaptive noise canceller for an ecg signal
Performance analysis of adaptive noise canceller for an ecg signal
 
Artificial intelligence report
Artificial intelligence reportArtificial intelligence report
Artificial intelligence report
 

Similar to A Combined Voice Activity Detector Based On Singular Value Decomposition and Fourier Transform

A Text-Independent Speaker Identification System based on The Zak Transform
A Text-Independent Speaker Identification System based on The Zak TransformA Text-Independent Speaker Identification System based on The Zak Transform
A Text-Independent Speaker Identification System based on The Zak TransformCSCJournals
 
An Effective Approach for Chinese Speech Recognition on Small Size of Vocabulary
An Effective Approach for Chinese Speech Recognition on Small Size of VocabularyAn Effective Approach for Chinese Speech Recognition on Small Size of Vocabulary
An Effective Approach for Chinese Speech Recognition on Small Size of Vocabularysipij
 
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...Venkata Sudhir Vedurla
 
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...sipij
 
Comparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniquesComparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniqueseSAT Journals
 
Trend Removal from Raman Spectra With Local Variance Estimation and Cubic Spl...
Trend Removal from Raman Spectra With Local Variance Estimation and Cubic Spl...Trend Removal from Raman Spectra With Local Variance Estimation and Cubic Spl...
Trend Removal from Raman Spectra With Local Variance Estimation and Cubic Spl...csijjournal
 
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...csijjournal
 
Trend removal from raman spectra with local variance estimation and cubic spl...
Trend removal from raman spectra with local variance estimation and cubic spl...Trend removal from raman spectra with local variance estimation and cubic spl...
Trend removal from raman spectra with local variance estimation and cubic spl...csijjournal
 
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...csijjournal
 
Frequency based criterion for distinguishing tonal and noisy spectral components
Frequency based criterion for distinguishing tonal and noisy spectral componentsFrequency based criterion for distinguishing tonal and noisy spectral components
Frequency based criterion for distinguishing tonal and noisy spectral componentsCSCJournals
 
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...CSCJournals
 
Emotion Recognition Based On Audio Speech
Emotion Recognition Based On Audio SpeechEmotion Recognition Based On Audio Speech
Emotion Recognition Based On Audio SpeechIOSR Journals
 
Recovery of low frequency Signals from noisy data using Ensembled Empirical M...
Recovery of low frequency Signals from noisy data using Ensembled Empirical M...Recovery of low frequency Signals from noisy data using Ensembled Empirical M...
Recovery of low frequency Signals from noisy data using Ensembled Empirical M...inventionjournals
 
Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features  Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features ijsc
 
Performance Analysis of Adaptive DOA Estimation Algorithms For Mobile Applica...
Performance Analysis of Adaptive DOA Estimation Algorithms For Mobile Applica...Performance Analysis of Adaptive DOA Estimation Algorithms For Mobile Applica...
Performance Analysis of Adaptive DOA Estimation Algorithms For Mobile Applica...IJERA Editor
 
Hardware efficient singular value decomposition in mimo ofdm system
Hardware efficient singular value decomposition in mimo ofdm systemHardware efficient singular value decomposition in mimo ofdm system
Hardware efficient singular value decomposition in mimo ofdm systemIAEME Publication
 
Speech Processing in Stressing Co-Channel Interference Using the Wigner Distr...
Speech Processing in Stressing Co-Channel Interference Using the Wigner Distr...Speech Processing in Stressing Co-Channel Interference Using the Wigner Distr...
Speech Processing in Stressing Co-Channel Interference Using the Wigner Distr...CSCJournals
 
20575-38936-1-PB.pdf
20575-38936-1-PB.pdf20575-38936-1-PB.pdf
20575-38936-1-PB.pdfIjictTeam
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)IJERD Editor
 

Similar to A Combined Voice Activity Detector Based On Singular Value Decomposition and Fourier Transform (20)

A Text-Independent Speaker Identification System based on The Zak Transform
A Text-Independent Speaker Identification System based on The Zak TransformA Text-Independent Speaker Identification System based on The Zak Transform
A Text-Independent Speaker Identification System based on The Zak Transform
 
An Effective Approach for Chinese Speech Recognition on Small Size of Vocabulary
An Effective Approach for Chinese Speech Recognition on Small Size of VocabularyAn Effective Approach for Chinese Speech Recognition on Small Size of Vocabulary
An Effective Approach for Chinese Speech Recognition on Small Size of Vocabulary
 
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...
Analysis the results_of_acoustic_echo_cancellation_for_speech_processing_usin...
 
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
A Novel Uncertainty Parameter SR ( Signal to Residual Spectrum Ratio ) Evalua...
 
Comparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniquesComparative performance analysis of channel normalization techniques
Comparative performance analysis of channel normalization techniques
 
Trend Removal from Raman Spectra With Local Variance Estimation and Cubic Spl...
Trend Removal from Raman Spectra With Local Variance Estimation and Cubic Spl...Trend Removal from Raman Spectra With Local Variance Estimation and Cubic Spl...
Trend Removal from Raman Spectra With Local Variance Estimation and Cubic Spl...
 
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
 
Trend removal from raman spectra with local variance estimation and cubic spl...
Trend removal from raman spectra with local variance estimation and cubic spl...Trend removal from raman spectra with local variance estimation and cubic spl...
Trend removal from raman spectra with local variance estimation and cubic spl...
 
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
TREND REMOVAL FROM RAMAN SPECTRA WITH LOCAL VARIANCE ESTIMATION AND CUBIC SPL...
 
B046040611
B046040611B046040611
B046040611
 
Frequency based criterion for distinguishing tonal and noisy spectral components
Frequency based criterion for distinguishing tonal and noisy spectral componentsFrequency based criterion for distinguishing tonal and noisy spectral components
Frequency based criterion for distinguishing tonal and noisy spectral components
 
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
A Gaussian Clustering Based Voice Activity Detector for Noisy Environments Us...
 
Emotion Recognition Based On Audio Speech
Emotion Recognition Based On Audio SpeechEmotion Recognition Based On Audio Speech
Emotion Recognition Based On Audio Speech
 
Recovery of low frequency Signals from noisy data using Ensembled Empirical M...
Recovery of low frequency Signals from noisy data using Ensembled Empirical M...Recovery of low frequency Signals from noisy data using Ensembled Empirical M...
Recovery of low frequency Signals from noisy data using Ensembled Empirical M...
 
Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features  Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features
 
Performance Analysis of Adaptive DOA Estimation Algorithms For Mobile Applica...
Performance Analysis of Adaptive DOA Estimation Algorithms For Mobile Applica...Performance Analysis of Adaptive DOA Estimation Algorithms For Mobile Applica...
Performance Analysis of Adaptive DOA Estimation Algorithms For Mobile Applica...
 
Hardware efficient singular value decomposition in mimo ofdm system
Hardware efficient singular value decomposition in mimo ofdm systemHardware efficient singular value decomposition in mimo ofdm system
Hardware efficient singular value decomposition in mimo ofdm system
 
Speech Processing in Stressing Co-Channel Interference Using the Wigner Distr...
Speech Processing in Stressing Co-Channel Interference Using the Wigner Distr...Speech Processing in Stressing Co-Channel Interference Using the Wigner Distr...
Speech Processing in Stressing Co-Channel Interference Using the Wigner Distr...
 
20575-38936-1-PB.pdf
20575-38936-1-PB.pdf20575-38936-1-PB.pdf
20575-38936-1-PB.pdf
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 

Recently uploaded

Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Celine George
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxCarlos105
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptshraddhaparab530
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfPatidar M
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfTechSoup
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4JOYLYNSAMANIEGO
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)cama23
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYKayeClaireEstoconing
 
Food processing presentation for bsc agriculture hons
Food processing presentation for bsc agriculture honsFood processing presentation for bsc agriculture hons
Food processing presentation for bsc agriculture honsManeerUddin
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4MiaBumagat1
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Mark Reed
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 

Recently uploaded (20)

Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17Field Attribute Index Feature in Odoo 17
Field Attribute Index Feature in Odoo 17
 
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptxBarangay Council for the Protection of Children (BCPC) Orientation.pptx
Barangay Council for the Protection of Children (BCPC) Orientation.pptx
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Integumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.pptIntegumentary System SMP B. Pharm Sem I.ppt
Integumentary System SMP B. Pharm Sem I.ppt
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
Active Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdfActive Learning Strategies (in short ALS).pdf
Active Learning Strategies (in short ALS).pdf
 
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptxYOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
YOUVE_GOT_EMAIL_PRELIMS_EL_DORADO_2024.pptx
 
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdfInclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
Inclusivity Essentials_ Creating Accessible Websites for Nonprofits .pdf
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4
 
Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)Global Lehigh Strategic Initiatives (without descriptions)
Global Lehigh Strategic Initiatives (without descriptions)
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITYISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
ISYU TUNGKOL SA SEKSWLADIDA (ISSUE ABOUT SEXUALITY
 
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptxLEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
LEFT_ON_C'N_ PRELIMS_EL_DORADO_2024.pptx
 
Food processing presentation for bsc agriculture hons
Food processing presentation for bsc agriculture honsFood processing presentation for bsc agriculture hons
Food processing presentation for bsc agriculture hons
 
ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4ANG SEKTOR NG agrikultura.pptx QUARTER 4
ANG SEKTOR NG agrikultura.pptx QUARTER 4
 
Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)Influencing policy (training slides from Fast Track Impact)
Influencing policy (training slides from Fast Track Impact)
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 

A Combined Voice Activity Detector Based On Singular Value Decomposition and Fourier Transform

  • 1. A. Afzalian, M. R. Karami mollaei & J. Ghasemi Signal Processing: An International Journal (SPIJ), Volume(4): Issue(1) 54 A Combined Voice Activity Detector Based On Singular Value Decomposition and Fourier Transform Amard Afzalian a.afzalian@gmail.com Islamic Azad University, Science and Research,Tehran, IRAN. Mohammad Reze Karami Mollaei mkarami@nit.ac.ir Faculty of Electrical and Computer Engineering, Babol Noshirvani University of Technology Babol, P.O. Box 47135-484, IRAN Jamal Ghasemi jghasemi@stu.nit.ac.ir Faculty of Electrical and Computer Engineering, Signal Processing Laboratory Babol Noshirvani University of Technology, Babol, P.O. Box 47135-484, IRAN Abstract Voice activity detector (VAD) is used to separate the speech data included parts from silence parts of the signal. In this paper a new VAD algorithm is represented on the basis of singular value decomposition. There are two sections to perform the feature vector extraction. In first section voiced frames are separated from unvoiced and silence frames. In second section unvoiced frames are silence frames. To perform the above sections, first, windowing the noisy signal then Hankel’s matrix is formed for each frame. The basis of statistical feature extraction of purposed system is slope of singular value curve related to each frame by using linear regression. It is shown that the slope of singular values curve per different SNRs in voiced frames is more than the other types and this property can be to achieve the goal the first part can be used. High similarity between feature vector of unvoiced and silence frame caused to approach for separation of the two categories above cannot be used. So in the second part, the frequency characteristics for identification of unvoiced frames from silent frames have been used. Simulation results show that high speed and accuracy are the advantages of the proposed system. Keywords: Speech, Voice Activity Detector, Singular Value. 1. INTRODUCTION Voice activity detection is an important step in some speech processing systems, such as speech recognition, speech enhancement, noise estimation, speech compression ... etc. In speech recognition when a word or utterance begins or ends (the end points) must be specified [1]. Also VAD is used to disable speech recognition for silence segments. Some speech transition systems transmits active segments in high rate of bits and transmits silence in low rate of bits, by this method they improve the band-width [2]. In some speech enhancement algorithm for example
  • 2. A. Afzalian, M. R. Karami mollaei & J. Ghasemi Signal Processing: An International Journal (SPIJ), Volume(4): Issue(1) 55 spectral subtraction method, a VAD method is required to know when to update the noise reference [3,20,21]. Conversational speech is a sequence of consecutive segments of silence and speech. In noisy signal silence regions are noisy. Voice sound contain more energy than unvoiced sound, while unvoiced sounds are more noise-like, so in noisy condition activity detection is harder In unvoiced regions. Feature extraction is the most important section in VAD system that elicits required parameters from desired frame. To achieve an accurate algorithm, the system parameters must be selected until by them can be able to separate from each other areas. After proper feature election, threshold is applied to the extracted parameters and the decisions are made. To achieving good detection level threshold can be adapted to the change of the noise conditions. Many of the algorithms assume that the first frames are silence [5, 6, and 7], so we can initialize noise reference from these frames. Common features used in VAD's are short term Fourier transform and zero crossing rate [4, 6, 11, and 12]. Another important and widely used parameter in this regard is Cepstral Coefficient [7, 9]. In this method the Cepstral coefficients are calculated within frames and then by calculating the difference between this vector and the value assigned to the noise signal and then comparing the result with the basic threshold value, the frame identity could be determined. LPC method is also another major applicable method for VAD implementation [13]. Generally in LPC based algorithms a series of mean coefficients are experimentally considered for voice, unvoiced and silent modes. In the next step the LPC coefficients of suspicious frame and their relative difference with mean indices are calculated and the frame identity is recognized based on these values. The other parameters for implementing VAD in combined algorithms are LTSE (long term Spectral Estimation)[5], wavelet coefficient [8,10], the ratio of signal to noise in sub-band [14], LSPE(Least Square Periodicity Estimator)[11] and AMDF(Average Magnitude Difference Function)[15]. One of most important cases in VAD system is speed of system performance beside proper accuracy. In this paper is to present a new algorithm of VAD based on single value decomposition and frequency features, specifications accuracy and speed simultaneously be fulfilled. Based on this, paper organization as follows that in Section 2 single values decomposition (SVD) will be explained. In Section 3 the proposed method with the system block diagram is given. In Section 4 simulation results in terms of quantitative and qualitative criteria is evaluated. Finally, the article concludes with Section 5 ends. 2. Singular Value Decomposition The singular value composition is one of the main tools in digital signal processing and statistical data. By doing SVD on a matrix with dimensions of M×N, we have: (1)T X U V= Σ On above relation U and V matrixes are singular vectors matrix with dimensions of M×M and N×N, respectively. Also, ∑ with r order of a diagonal matrix M×N is included singular values so that components on the main dial gauge are not zero and other components are zero. The elements on main dial gauge areas 11 22 ... 0rrσ σ σ> > > > And are the values of X matrix. For exercising SVD to one dimensional matrix, the vector of signal samples must map to subspace with more dimensions, on the other hand must be changed to a matrix in certain way. Different ways have indicated for one dimensional signal transformation to a matrix that in this article (here) have used Hankel’s way [16,19]. 3. Purposed algorithm The main question on sound discovery is the classification of listening signal characteristic to diagnosis sound parts. Thus, listening signal are classified to sound classes: silence, voiced, and unvoiced. For classifying, suitable characteristics must elicit from the speech signal parts (frame). Before studying the details of purposed system, general block diagram are showed in figure 1.
  • 3. A. Afzalian, M. R. Karami mollaei & J. Ghasemi Signal Processing: An International Journal (SPIJ), Volume(4): Issue(1) 56 One Frame of Noisy signal Windowing SVD VAD Output VAD Decision FFT & Gaussian Approximation Feature Extraction BlockInput Signal Figure (1) block diagram of propositional method for indicator system of voice activity. As shown on figure 1, feature vector extraction done in two parts. In first part, voiced frames are separated from unvoiced and silence frame as for statistical characteristics of singular values matrix. In second part, unvoiced frames are separated from silence frames that this separation is based on its frequency spectrum and Gaussian rate in each frame. At the end, one value accrues to voiced and unvoiced frames that including voice information, and zeros one to silence frames. 3.1. Voiced frames separation from the other parts In suggested system to separate the voiced parts from two parts of unvoiced and silence ones, it is used on slop of singular value curve in related part. For doing atop stages, first noisy signal divide to 16ms frames. According to relation (2), Hankel matrix makes for every frame. (2) 0 1 1 1 2 0 1 1 1 1 ... ... [ , ,..., ] ... M M k n k L L N x x x x x x X x x x H x x x − − − −      = → =       M M M M Where kX is the vector of exist samples in K frame in input signal and kH is the isomorphic Hankel matrix of L×M dimensions with kX . The percent of sample overlapping in matrix and the conditions of kH dimensions are brought on (3) and (4) relations. (3)1 % 100 L overlapping L − = × (4)1,M L N L M+ = + ≥ For gaining full primitive pattern of kX frame, the number of added zeros must lessen in M column of kH matrix that its results have brought on (5) relation. (5)- mod( , ) [ ] 1, -[ ] 2 2 N N ZeroPadding L N L L M N= ⇒ = + = Gained values of L, M in (5) relation get kH semi rectangular matrix with maximum sample overlapping. The singular values of each frame are got by Hankel matrix of existed frame and using SVD map on related singular values. (6)T k k k kH U V= ∑ In (6) relation kU and kV the singular vectors of diagonal matrix and also k∑ are isomorphic singular values matrix with kH (part 2). Figure (2), singular values vectors show the every voiced frames in 16 millisecond length and SNR=10db with white Gaussian noise.
  • 4. A. Afzalian, M. R. Karami mollaei & J. Ghasemi Signal Processing: An International Journal (SPIJ), Volume(4): Issue(1) 57 0 20 40 60 80 100 120 140 0 2 4 6 8 10 12 Singular Values of Voice Frame 0 20 40 60 80 100 120 140 0 0.2 0.4 0.6 0.8 1 1.2 1.4 Singular Values of Unvoice Frame 0 20 40 60 80 100 120 140 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 Singular Values of Silence Frame Figure (2) singular values vectors of voiced, unvoiced and silence frame in SNR=10db (voice signal from housewives_16.wav on TIMIT database) According as seeing on figure (2) the slop of curve in singular values between voiced frames are different from the other parts. In fact, singular values of voiced frames have more slop than singular values of unvoiced and silence frames. The base of statistical feature extraction for separating voiced frames is the slop of singular values curve related to each frame by using linear regression. Table 1 shows the values of this slop in different SNRs on three certain frames that have chosen from voiced, unvoiced and silence parts. Table (1): slope of singular values curve in linear regression related to species of frame on different SNRs (voice signal from housewives_16.wav on TIMIT database) Mean amount of singular values curve slope on 10 times repeat for each SNR Silence frameUnvoiced frame Voiced frame SNR 0.09490.09870.14980db 0.05280.05660.12325db 0.03050.03380.117010db 0.01760.02320.114315db 0.00980.01750.113920db Results of table (1) are support on this thesis that the slop of singular values curve on different SNRs in voiced frames are more than the others and by using this trait we can achieve the goal of first section that was the feature vector extraction in voiced frames from related singular values matrix.
  • 5. A. Afzalian, M. R. Karami mollaei & J. Ghasemi Signal Processing: An International Journal (SPIJ), Volume(4): Issue(1) 58 2.3. Separating the voiced and silence frames By studying figure (2) and table (1) are deduced that according to approximation through slop of singular values curves related to voiced and silence frames, it can't divide these two type frames from each other. Because of this in purposed system have used the other trait to separate these two parts. In this part, frequency trait has used for recognition unvoiced frames from silence ones. The base of comparison is the curve of Gaussian function, (7) relation. (7) 2 2 ( ) 2 ( ; , ) x c f x c e γ γ − − = Atop relation C is mean and γ is variance of curve. By doing study, we see the discrete Fourier transform of unvoiced frames in meddle frequency are similar to the curve of Gaussian functions to some extent. Figure (3) and (4) show the smooth frequency spectrum related to unvoiced and silence frames and their comparison with the curve of Gaussian function. (Voice signal from TIMIT database with SNR=15db) 0 20 40 60 80 100 120 0 1 2 3 4 5 6 x 10 -3 Unvoice Frame in FFT Domain & Gaussian function Figure (3) frequency spectrum of Unvoice frame in SNR=15db and the curve of Gaussian function ( 0.6γ = ) 0 20 40 60 80 100 120 140 0 1 2 3 4 5 6 x 10 -3 Silence Frame in FFT Domain & Gaussian function Figure (4) frequency spectrum of Silence frame in SNR=15db and the curve of Gaussian function ( 0.6γ = ) According as atop figures are seen, frequency spectrum of voiced frame is more similar to Gaussian curve than silence frame noise in meddle frequency. Also by different examinations, the optimal values of γ parameter in varying SNRs are as 0.2 with a view of minimizing the difference between Gaussian pattern and frequency spectrum of silence frame pattern. 4. SIMULATION RESULTS In this section, the operation of purposed system study and compare considered as accuracy and speed. Sx114.wav voice signal from TIMIT database including 166 of 16 millisecond frames with sampling rate is 16 kHz that each frame has 256 samples. Figure (5) shows the noisy signal with SNR=10db and the indicator systems output of voice activity.
  • 6. A. Afzalian, M. R. Karami mollaei & J. Ghasemi Signal Processing: An International Journal (SPIJ), Volume(4): Issue(1) 59 (a) (b) (c) 0 0.5 1 1.5 2 2.5 3 3.5 4 x 10 4 -0.1 0 0.1 time Amplitude original signal 0 20 40 60 80 100 120 140 160 -1 0 1 2 Output Of Ideal VAD 0 20 40 60 80 100 120 140 160 -1 0 1 2 Output Of VAD Based on Wavelet 0 20 40 60 80 100 120 140 160 -1 0 1 2 Output Of Perposed VAD (d) Figure(5).Simulation Results: (a)TIMIT Signal, (b) Output of ideal VAD, (c) Output of VAD based on Wavelet, (d) Output of Purposed VAD According as seeing, suggested method has more efficiency to keep the parts of signals that the activity signal is weak. In figure (5) and dashed line-drawn area, one unvoiced letter has omitted in VAD system as wavelet (figure 5-c ) that it has kept on VAD output propositional system (figure 5-d ). In table (2) the accuracy rate of VAD propositional system is shown as wavelet transform [17] about voice signal sx114.wav for different SNRs. Table (2): percent of VAD system error comparison by using purposed algorithm and wavelet based algorithm in different SNRs Wavelet based algorithm Purposed algorithm SNR 36%25%0db 20%18%5db 14%15%10db 12%13%15db 10%8%20db One of the strength points of the purposed algorithm is its speed. In table (3), speed of two algorithms has compared with each other in processing the sx114.wav sound file (CPU Intel Core2Duo 2.5 GHz, 4 M Cache 733 MB RAM) Table (3): Speed comparison of VAD system by using purposed algorithm and wavelet based algorithm Consumed time in wavelet based algorithmConsumed time in purposed algorithm 12 second4 second In this part by using two VAD systems as preprocessing block has brought the results of hearing test for a speech signal rich-making system that accuracy of VAD operation system be proved in keeping unvoiced areas. In table 4 the standards has come that is used in evaluation of speech with hearing factor.
  • 7. A. Afzalian, M. R. Karami mollaei & J. Ghasemi Signal Processing: An International Journal (SPIJ), Volume(4): Issue(1) 60 Table (4) Five-Point adjectival scales for quality and impairment, and associated scores ImpairmentScore Imperceptible5 (Excellent) (Just) Perceptible but not Annoying4 (Good) (Perceptible and) Slightly Annoying3 (Fair) Annoying (but not Objectionable)2 (Poor) Very Annoying (Objectionable)1 (Bad) In table 5, results of using two said VAD algorithms have shown as preprocessing block for enhancement method of multi band spectral subtraction. Specifications of this test are in [3]. Table (5) Results of MOS test; 17 clean speech signal from TIMIT database; Noise Type: White Gaussian Noise. Input SNR 10db5db0db Used Algorithm 2.82.31.6Wavelet Based 3.32.71.8Purposed Studying the results of table (5) are shown the reform efficiency of speech enhancement by using of propositional VAD algorithm to wavelet transform way. 5. CONCLUSION In this paper a new method for Voice activity detector based on Singular Value Decomposition and discrete Fourier transform was proposed. The proposed method is evaluated by using various criteria. By using the mentioned criteria, it is presented that this method can compete with other methods. Also, the aim of indicator systems of voice activity is control of destruction unvoiced signal sites in rich-making operation that observantly to results of hearing test, the propositional algorithm have proper power in compare with wavelet transform way. The propositional system has manifold speed than usual method that is one of the obvious characters in practical usage and hardware implementation. 6. REFERENCES [1] J. Ramirez, J. C. Segura, C. Benitez, A. de la Torre, A. J. Rubio, “A new Kullback-Leibler VAD for speech recognition in noise”, IEEE Signal Processing Letters, 11(2) :266– 269. 2004. [2] A. Kondoz, “Digital speech: coding for low bit rate communication systems”, J. Wiley, New York, 1994. [3] Y. Ghanbari, M. R. Karami Mollaei, “A new approach for speech enhancement based on the adaptive thresholding of the wavelet packets”, Speech Communication 48 (2006) 927–940. [4] B. V. Harsha, “A Noise Robust Activity Detection Algorithm”, proc. Of int. symposium of intelligent multimedia, video and speech processing, pp. 322-325, Oct. 2004, Hong Kon. [5] J. Ramırez, J. C. Segura, C. Benıtez, A. de la Torre, A. Rubio, “A New Adaptive Long-Term Spectral Estimation Voice Activity Detector,” EUROSPEECH, pp. 3041-3044, 2003, Geneva. [6] J. Faneuff, “ Spatial, Spectral, and Perceptual Nonlinear Noise Reduction for Hands-free Microphones in a Car,” Master Thesis Electrical and Computer Engineering July 2002.
  • 8. A. Afzalian, M. R. Karami mollaei & J. Ghasemi Signal Processing: An International Journal (SPIJ), Volume(4): Issue(1) 61 [7] S. Skorik, F. Berthommier, “On a Cepstrum-Based Speech Detector Robust To White Noise,” Accepted for Specom 2000, St. Petersbourg. [8] J. Stegmann, G. Schroeder, “Robust Voice Activity Detection Based on the Wavelet Transform", Proc. IEEE Workshop on Speech Coding, Sep. 1997, pp. 99-100, Pocono Manor, Pennsylvania, USA. [9] J.A. Haigh and J.S. Mason, “Robust Voice Activity Detection Using Cepstral Features,” In Proc. of IEEE TENCON'93, vol. 3, pp. 321-324, 1993, Beijng. [10] J. Shaojun, G. Hitato, Y. Fuliang, “A New Algorithm For Voice Activity Detection Based On Wavelet Transform,” proc. Of int. symposium of intelligent multimedia, video and speech processing, pp. 222-225, Oct. 2004, Hong Kong. [11] Tanyer S G, Ozer H, “ Voice activity detection in nonstationary guassian noise,” proceeding of ICSP’98 pp.1620-1623. [12] Sangwan, A., Chiranth, M. C., Jamadagni, H. S., Sah, R.,Prasad, R. V., Gaurav, V., “VAD Techniques for Real-Time Speech Transmission on the Internet”, 5th IEEE International Conference on High-Speed Networks and Multimedia Communications, pp. 46–50, 2002. [13] L. R. Rabiner, M. R. Sambur, “ Application of an LPC Distance Measure to the Voiced- Unvoiced-Silence Detection Problem,” IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol. ASSP-25, No. 4, pp. 338-343, August 1977. [14] A. Vahatalo, I. Johansson, “Voice Activity Detection For GSM Adaptive Multi-Rate Codec,” IEEE 1999, pp. 55-57. [15] M.Orlandi, a. santarelli, D. Falavigna, “Maximum Likelihood endpoint Detection with time- domain features,”eurospeech 2003, Geneva, pp.1757-1760. [16] S. H. Jensen, P. C. Hansen, S. D. Hansen, “Reduction of Broad-Band Noise in Speech by Truncated QSVD,” IEEE, Trans on speech & Audio Processing, Vol.3, No.6, November 1995. [17] Y. Ghanbari, M. Karami, “Spectral subtraction in the wavelet domain for speech enhancement,” International Conference on Information Knowledge Technology (IkT2004), CD ROM, 2004. [18] H. Sameti, H. Sheikhzadeh, Li Deng, R. L.Brennan, “HMM-Based Strategies for Enhancement of Speech Signals Embedded in Nonstationary Noise”, IEEE Transactions on Speech and Audio Processing, Vol. 6, No. 5, September 1998. [19] S. Sayeed, N. S. Kamel, R. Besar, “A Sensor-Based Approach for Dynamic Signature Verification using Data Glove”. Signal Processing: An International Journal (SPIJ), 2(1)1:10, 2008. [20] J.Ghasemi, M. R. Karami Mollaei, "A New Approach for Speech Enhancement Based On Eigenvalue Spectral Subtraction" Signal Processing: An International Journal, (SPIJ) 3(4), 34:41, 2009. [21] A. Afzalian, M.R. Karami Mollaei, J. Ghasemi, " A New Approach for Speech Enhancement Based On Singular Value Decomposition and Wavelet Transform", AJBAS In Press(2010).