SlideShare a Scribd company logo
1 of 18
Download to read offline
Flexible Microphone Array Based on
Multichannel Nonnegative Matrix Factorization
and Statistical Signal Estimation
Hiroshi Saruwatari, Kazuma Takata
(The Unoversity of Tokyo, JAPAN)
Nobutaka Ono (NII, JAPAN),
Shoji Makino (University of Tsukuba, JAPAN)
Acoustic Array Systems: Paper ICA2016-312
Outline
 Introduction of rescue robot audition
 Conventional approaches (ICA, IVA, Rank-1
MNMF)
 Informed source separation and its problem
 Ego-noise basis mismatch problem solution
 Speech ambiguity problem solution
 Experimental evaluation
 Conclusion
2
Introduction: Rescue Robot Audition
 Aimed to detect victims’ speech in a disaster area.
 Flexible body twists and moves driven by vibration motors.
 It wears multiple microphones around the body.
• Thus, microphones’ position is always unknown.
• Self-Vibration generates harmful noise.
(so-called Ego-Noise)
One of the Distributed Microphone Array Problem
3
MicrophoneVibrator
What is hose-shaped rescue robot?
4
Source Observation Separated
Mixing Separation
Conventional: ICA or Independent vector analysis (IVA), which
separates the sources based on their independence nature.
We assume
linear time-
invariance in A.
This is a simultaneous
estimation problem for
W and source
statistical models.
x=As y=Wx
Unknown Known
W
Demixing
matrix
How to solve? Use Blind Source Separation
Source model (p.d.f.s)
S1
S2
Speech
Ego-noise
Speech
Ego-
noise
+
Low-rank source spectrogram
5
Rank-1 MNMF (Independent Low-Rank Matrix Analysis)
that separates the sources by estimating demixing matrix W
and low-rank source spectrogram model via Nonnegative
Matrix Factorization (NMF) [Lee, 2001].
Rank-1 MNMF [Kitamura, Saruwatari et al., IEEE Trans. ASLP 2016]
W
Demixing
matrix
Simultaneous
estimation for W
and TV
+
In this study, we focus our attention to…
6
Rank-1 MNMF (Independent Low-Rank Matrix Analysis)
Pros & Cons:
• All parameters can be updated via Auxiliary-Function method
(EM-like algorithm), keeping nonnegative feature of T & V.
• The cost function always decreases in each iteration. Thus,
this is convergence-guaranteed algorithm unlike ICA!
• Still affected by initial state of parameters. go to “Informed”
Rank-1 MNMF’s cost function to be minimized
: Independence measure between sources (for W)
: Low-rank approximation of sources (for T and V)
(Note: both are based on Itakura-Saito (IS) divergence.)
Typical ego-noise
basis trained by
NMF in advance
Activation
Source model in Rank-1 MNMF
7
Basis
Toward Informed Source Separation
Typical ego-noise
basis trained by
NMF in advance
Activation
Source model in Rank-1 MNMF
Fixing a part of bases, estimate
remaining parameters and W.
8
Basis
Speech
basis
Ego-
noise
basis
Toward Informed Source Separation
(unknown)
(unknown)
Typical ego-noise
basis trained by
NMF in advance
Activation
Source model in Rank-1 MNMF
Fixing a part of bases, estimate
remaining parameters and W.
[Problem 1] Ego-noise time-variance (ego-noise mismatch problem)
[Problem 2] Unknown speech (speech model ambiguity problem)
9
Basis
Speech
basis
Ego-
noise
basis
Toward Informed Source Separation
(unknown)
(unknown)
Supervised Rank-1 MNMF
Rough separation
Statistical Postfilter [Breithaupt, 2010]
 Chi distribution (sparse p.d.f.)
is used as target signal prior.
 Its sparseness can be estimted
from data empirically via
higher-order statistics
[Murota, Saruwatari, ICASSP2014].
Observed
signal
Thanks to sparse prior, we can
obtain more accurate separation
and its Certainty.
Statistical Signal Estimation
Certainty
Estimated ego-noise
Sparse p.d.f.
6
Estimated target signal
Statistical Signal Estimation
6
Certainty I ={1; if G(f,t)>0.8, otherwise 0}: binary mask that
extracts seldom overlapping components with the target signal
from the estimated interference signal.
12
Problem 1: Ego-Noise Mismatch Solution
 We sample convincing ego-noise spectrogram by certainty I.
 Next, obtain smoothed “time-frequency deformation function”
between sampled spectrogram and original supervised ego-
noise basis.
 Time-invariant all-pole model is used as deformation function.
Diagonal matrix with entries
Supervised ego-noise basis
Ego-noise activation
KL divergence
Order of all-pole model
This can be solved as extended NMF optimization.
Frequency
Powerspectrum
13
Problem 1: Ego-Noise Mismatch Solution
: each element of
Update of activation
Update of all-pole-model weight
By noting the KL-cost function as J, its auxiliary function is given by
 Statistical postfilter’s output is sparse estimation of S.
 We can re-estimate sparse-aware speech basis using .
 We use it as an initial value of speech basis in Rank-1 MNMF.
14
Problem 2: Speech Model Ambiguity Solution
Speech basis Speech activation
IS-divergence
Time
Frequency
Time
Frequency
Sparse low-rank speech spectrogramOutput of Rank-1 MNMF
Sparse
Low-rank
approximation
Sparse speech spectrogram
実験条件
 # of mic. : 8 channel microphones on 3-m-long hose-shape robot
 Speech : male & female speech with real-recorded impulse responses
 Ego-noise: real-recorded in moving hose-shaped robot (2 patterns)
 Training : matched with mixed ego-noise (2 patterns) &
mismatched (3 patterns)
 Evaluation: SDR improvement (both SNR and distortion are considered)
 Input SDR: 0 dB, -5 dB, -10 dB
 Comparison: IVA, PSNMF (single-channel supervised NMF),
Rank-1 MNMF (no supervision)
15
Simulation Experiment
True target Interference Artificial distortionEstimated
Higher SDR
indicates better
separation
16
Example of Typical SDR Improvement
Supervised
Rank-1
MNMF
Statistical
postfilter
(1)
(2)
(3)
(4)
Combination of each processing is effective.
SDRImprovement[dB]
Step(1) Step(2) Step(3) Step(4)
SDR increases through
each processing step
Before basis defom.
and initialization
Basis
deform.
and
Initialization
After basis defom.
and initialization
17
Comparison with Competitors
 Proposed methods of both matched and mismatched cases
outperform other conventional methods, whereas the
mismatched case is inferior to matched.
Conventional
Proposed
 We proposed a new informed source separation
method for the flexible microphone array system based
on supervised Rank-1 MNMF and statistical speech
enhancement.
 To reduce the mismatch problem, we proposed the
algorithm that an all-pole model is estimated to deform
the bases using the reliable spectral components
sampled by the statistical signal enhancement method.
 We revealed that the proposed method outperforms the
conventional methods via experiments with actual
sounds in the rescue robot.
18
Conclusion
Thank you for your attention!

More Related Content

What's hot

Robust music signal separation based on supervised nonnegative matrix factori...
Robust music signal separation based on supervised nonnegative matrix factori...Robust music signal separation based on supervised nonnegative matrix factori...
Robust music signal separation based on supervised nonnegative matrix factori...Daichi Kitamura
 
Blind audio source separation based on time-frequency structure models
Blind audio source separation based on time-frequency structure modelsBlind audio source separation based on time-frequency structure models
Blind audio source separation based on time-frequency structure modelsKitamura Laboratory
 
Prior distribution design for music bleeding-sound reduction based on nonnega...
Prior distribution design for music bleeding-sound reduction based on nonnega...Prior distribution design for music bleeding-sound reduction based on nonnega...
Prior distribution design for music bleeding-sound reduction based on nonnega...Kitamura Laboratory
 
Blind source separation based on independent low-rank matrix analysis and its...
Blind source separation based on independent low-rank matrix analysis and its...Blind source separation based on independent low-rank matrix analysis and its...
Blind source separation based on independent low-rank matrix analysis and its...Daichi Kitamura
 
DNN-based frequency component prediction for frequency-domain audio source se...
DNN-based frequency component prediction for frequency-domain audio source se...DNN-based frequency component prediction for frequency-domain audio source se...
DNN-based frequency component prediction for frequency-domain audio source se...Kitamura Laboratory
 
Linear multichannel blind source separation based on time-frequency mask obta...
Linear multichannel blind source separation based on time-frequency mask obta...Linear multichannel blind source separation based on time-frequency mask obta...
Linear multichannel blind source separation based on time-frequency mask obta...Kitamura Laboratory
 
Hybrid multichannel signal separation using supervised nonnegative matrix fac...
Hybrid multichannel signal separation using supervised nonnegative matrix fac...Hybrid multichannel signal separation using supervised nonnegative matrix fac...
Hybrid multichannel signal separation using supervised nonnegative matrix fac...Daichi Kitamura
 
Online divergence switching for superresolution-based nonnegative matrix fact...
Online divergence switching for superresolution-based nonnegative matrix fact...Online divergence switching for superresolution-based nonnegative matrix fact...
Online divergence switching for superresolution-based nonnegative matrix fact...Daichi Kitamura
 
Online Divergence Switching for Superresolution-Based Nonnegative Matrix Fa...
Online Divergence Switching for  Superresolution-Based  Nonnegative Matrix Fa...Online Divergence Switching for  Superresolution-Based  Nonnegative Matrix Fa...
Online Divergence Switching for Superresolution-Based Nonnegative Matrix Fa...奈良先端大 情報科学研究科
 
Depth Estimation of Sound Images Using Directional Clustering and Activation...
Depth Estimation of Sound Images Using  Directional Clustering and Activation...Depth Estimation of Sound Images Using  Directional Clustering and Activation...
Depth Estimation of Sound Images Using Directional Clustering and Activation...奈良先端大 情報科学研究科
 
Depth estimation of sound images using directional clustering and activation-...
Depth estimation of sound images using directional clustering and activation-...Depth estimation of sound images using directional clustering and activation-...
Depth estimation of sound images using directional clustering and activation-...Daichi Kitamura
 
Blind source separation based on independent low-rank matrix analysis and its...
Blind source separation based on independent low-rank matrix analysis and its...Blind source separation based on independent low-rank matrix analysis and its...
Blind source separation based on independent low-rank matrix analysis and its...Daichi Kitamura
 
DNN-based permutation solver for frequency-domain independent component analy...
DNN-based permutation solver for frequency-domain independent component analy...DNN-based permutation solver for frequency-domain independent component analy...
DNN-based permutation solver for frequency-domain independent component analy...Kitamura Laboratory
 
コサイン類似度罰則条件付き半教師あり非負値行列因子分解と音源分離への応用
コサイン類似度罰則条件付き半教師あり非負値行列因子分解と音源分離への応用コサイン類似度罰則条件付き半教師あり非負値行列因子分解と音源分離への応用
コサイン類似度罰則条件付き半教師あり非負値行列因子分解と音源分離への応用Kitamura Laboratory
 
Audio Source Separation Based on Low-Rank Structure and Statistical Independence
Audio Source Separation Based on Low-Rank Structure and Statistical IndependenceAudio Source Separation Based on Low-Rank Structure and Statistical Independence
Audio Source Separation Based on Low-Rank Structure and Statistical IndependenceDaichi Kitamura
 
Experimental analysis of optimal window length for independent low-rank matri...
Experimental analysis of optimal window length for independent low-rank matri...Experimental analysis of optimal window length for independent low-rank matri...
Experimental analysis of optimal window length for independent low-rank matri...Daichi Kitamura
 
Robust Sound Field Reproduction against Listener’s Movement Utilizing Image ...
Robust Sound Field Reproduction against  Listener’s Movement Utilizing Image ...Robust Sound Field Reproduction against  Listener’s Movement Utilizing Image ...
Robust Sound Field Reproduction against Listener’s Movement Utilizing Image ...奈良先端大 情報科学研究科
 
Learning the Statistical Model of the NMF Using the Deep Multiplicative Updat...
Learning the Statistical Model of the NMF Using the Deep Multiplicative Updat...Learning the Statistical Model of the NMF Using the Deep Multiplicative Updat...
Learning the Statistical Model of the NMF Using the Deep Multiplicative Updat...Hiroki_Tanji
 
International Journal of Biometrics and Bioinformatics(IJBB) Volume (1) Issue...
International Journal of Biometrics and Bioinformatics(IJBB) Volume (1) Issue...International Journal of Biometrics and Bioinformatics(IJBB) Volume (1) Issue...
International Journal of Biometrics and Bioinformatics(IJBB) Volume (1) Issue...CSCJournals
 
Embedded Signal Approach to Image Texture Reproduction Analysis
Embedded Signal Approach to Image Texture Reproduction AnalysisEmbedded Signal Approach to Image Texture Reproduction Analysis
Embedded Signal Approach to Image Texture Reproduction AnalysisBurns Digital Imaging LLC
 

What's hot (20)

Robust music signal separation based on supervised nonnegative matrix factori...
Robust music signal separation based on supervised nonnegative matrix factori...Robust music signal separation based on supervised nonnegative matrix factori...
Robust music signal separation based on supervised nonnegative matrix factori...
 
Blind audio source separation based on time-frequency structure models
Blind audio source separation based on time-frequency structure modelsBlind audio source separation based on time-frequency structure models
Blind audio source separation based on time-frequency structure models
 
Prior distribution design for music bleeding-sound reduction based on nonnega...
Prior distribution design for music bleeding-sound reduction based on nonnega...Prior distribution design for music bleeding-sound reduction based on nonnega...
Prior distribution design for music bleeding-sound reduction based on nonnega...
 
Blind source separation based on independent low-rank matrix analysis and its...
Blind source separation based on independent low-rank matrix analysis and its...Blind source separation based on independent low-rank matrix analysis and its...
Blind source separation based on independent low-rank matrix analysis and its...
 
DNN-based frequency component prediction for frequency-domain audio source se...
DNN-based frequency component prediction for frequency-domain audio source se...DNN-based frequency component prediction for frequency-domain audio source se...
DNN-based frequency component prediction for frequency-domain audio source se...
 
Linear multichannel blind source separation based on time-frequency mask obta...
Linear multichannel blind source separation based on time-frequency mask obta...Linear multichannel blind source separation based on time-frequency mask obta...
Linear multichannel blind source separation based on time-frequency mask obta...
 
Hybrid multichannel signal separation using supervised nonnegative matrix fac...
Hybrid multichannel signal separation using supervised nonnegative matrix fac...Hybrid multichannel signal separation using supervised nonnegative matrix fac...
Hybrid multichannel signal separation using supervised nonnegative matrix fac...
 
Online divergence switching for superresolution-based nonnegative matrix fact...
Online divergence switching for superresolution-based nonnegative matrix fact...Online divergence switching for superresolution-based nonnegative matrix fact...
Online divergence switching for superresolution-based nonnegative matrix fact...
 
Online Divergence Switching for Superresolution-Based Nonnegative Matrix Fa...
Online Divergence Switching for  Superresolution-Based  Nonnegative Matrix Fa...Online Divergence Switching for  Superresolution-Based  Nonnegative Matrix Fa...
Online Divergence Switching for Superresolution-Based Nonnegative Matrix Fa...
 
Depth Estimation of Sound Images Using Directional Clustering and Activation...
Depth Estimation of Sound Images Using  Directional Clustering and Activation...Depth Estimation of Sound Images Using  Directional Clustering and Activation...
Depth Estimation of Sound Images Using Directional Clustering and Activation...
 
Depth estimation of sound images using directional clustering and activation-...
Depth estimation of sound images using directional clustering and activation-...Depth estimation of sound images using directional clustering and activation-...
Depth estimation of sound images using directional clustering and activation-...
 
Blind source separation based on independent low-rank matrix analysis and its...
Blind source separation based on independent low-rank matrix analysis and its...Blind source separation based on independent low-rank matrix analysis and its...
Blind source separation based on independent low-rank matrix analysis and its...
 
DNN-based permutation solver for frequency-domain independent component analy...
DNN-based permutation solver for frequency-domain independent component analy...DNN-based permutation solver for frequency-domain independent component analy...
DNN-based permutation solver for frequency-domain independent component analy...
 
コサイン類似度罰則条件付き半教師あり非負値行列因子分解と音源分離への応用
コサイン類似度罰則条件付き半教師あり非負値行列因子分解と音源分離への応用コサイン類似度罰則条件付き半教師あり非負値行列因子分解と音源分離への応用
コサイン類似度罰則条件付き半教師あり非負値行列因子分解と音源分離への応用
 
Audio Source Separation Based on Low-Rank Structure and Statistical Independence
Audio Source Separation Based on Low-Rank Structure and Statistical IndependenceAudio Source Separation Based on Low-Rank Structure and Statistical Independence
Audio Source Separation Based on Low-Rank Structure and Statistical Independence
 
Experimental analysis of optimal window length for independent low-rank matri...
Experimental analysis of optimal window length for independent low-rank matri...Experimental analysis of optimal window length for independent low-rank matri...
Experimental analysis of optimal window length for independent low-rank matri...
 
Robust Sound Field Reproduction against Listener’s Movement Utilizing Image ...
Robust Sound Field Reproduction against  Listener’s Movement Utilizing Image ...Robust Sound Field Reproduction against  Listener’s Movement Utilizing Image ...
Robust Sound Field Reproduction against Listener’s Movement Utilizing Image ...
 
Learning the Statistical Model of the NMF Using the Deep Multiplicative Updat...
Learning the Statistical Model of the NMF Using the Deep Multiplicative Updat...Learning the Statistical Model of the NMF Using the Deep Multiplicative Updat...
Learning the Statistical Model of the NMF Using the Deep Multiplicative Updat...
 
International Journal of Biometrics and Bioinformatics(IJBB) Volume (1) Issue...
International Journal of Biometrics and Bioinformatics(IJBB) Volume (1) Issue...International Journal of Biometrics and Bioinformatics(IJBB) Volume (1) Issue...
International Journal of Biometrics and Bioinformatics(IJBB) Volume (1) Issue...
 
Embedded Signal Approach to Image Texture Reproduction Analysis
Embedded Signal Approach to Image Texture Reproduction AnalysisEmbedded Signal Approach to Image Texture Reproduction Analysis
Embedded Signal Approach to Image Texture Reproduction Analysis
 

Viewers also liked

HMMに基づく日本人英語音声合成における中学生徒の英語音声を用いた評価
HMMに基づく日本人英語音声合成における中学生徒の英語音声を用いた評価HMMに基づく日本人英語音声合成における中学生徒の英語音声を用いた評価
HMMに基づく日本人英語音声合成における中学生徒の英語音声を用いた評価Shinnosuke Takamichi
 
Moment matching networkを用いた音声パラメータのランダム生成の検討
Moment matching networkを用いた音声パラメータのランダム生成の検討Moment matching networkを用いた音声パラメータのランダム生成の検討
Moment matching networkを用いた音声パラメータのランダム生成の検討Shinnosuke Takamichi
 
独立性に基づくブラインド音源分離の発展と独立低ランク行列分析 History of independence-based blind source sep...
独立性に基づくブラインド音源分離の発展と独立低ランク行列分析 History of independence-based blind source sep...独立性に基づくブラインド音源分離の発展と独立低ランク行列分析 History of independence-based blind source sep...
独立性に基づくブラインド音源分離の発展と独立低ランク行列分析 History of independence-based blind source sep...Daichi Kitamura
 
数値解析と物理学
数値解析と物理学数値解析と物理学
数値解析と物理学すずしめ
 

Viewers also liked (10)

HMMに基づく日本人英語音声合成における中学生徒の英語音声を用いた評価
HMMに基づく日本人英語音声合成における中学生徒の英語音声を用いた評価HMMに基づく日本人英語音声合成における中学生徒の英語音声を用いた評価
HMMに基づく日本人英語音声合成における中学生徒の英語音声を用いた評価
 
Asj2017 3invited
Asj2017 3invitedAsj2017 3invited
Asj2017 3invited
 
Moment matching networkを用いた音声パラメータのランダム生成の検討
Moment matching networkを用いた音声パラメータのランダム生成の検討Moment matching networkを用いた音声パラメータのランダム生成の検討
Moment matching networkを用いた音声パラメータのランダム生成の検討
 
ILRMA 20170227 danwakai
ILRMA 20170227 danwakaiILRMA 20170227 danwakai
ILRMA 20170227 danwakai
 
Slp201702
Slp201702Slp201702
Slp201702
 
Ea2015 7for ss
Ea2015 7for ssEa2015 7for ss
Ea2015 7for ss
 
Asj2017 3 bileveloptnmf
Asj2017 3 bileveloptnmfAsj2017 3 bileveloptnmf
Asj2017 3 bileveloptnmf
 
Discriminative SNMF EA201603
Discriminative SNMF EA201603Discriminative SNMF EA201603
Discriminative SNMF EA201603
 
独立性に基づくブラインド音源分離の発展と独立低ランク行列分析 History of independence-based blind source sep...
独立性に基づくブラインド音源分離の発展と独立低ランク行列分析 History of independence-based blind source sep...独立性に基づくブラインド音源分離の発展と独立低ランク行列分析 History of independence-based blind source sep...
独立性に基づくブラインド音源分離の発展と独立低ランク行列分析 History of independence-based blind source sep...
 
数値解析と物理学
数値解析と物理学数値解析と物理学
数値解析と物理学
 

Similar to Ica2016 312 saruwatari

Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderIJTET Journal
 
Voice biometric recognition
Voice biometric recognitionVoice biometric recognition
Voice biometric recognitionphyuhsan
 
Emotion Recognition Based On Audio Speech
Emotion Recognition Based On Audio SpeechEmotion Recognition Based On Audio Speech
Emotion Recognition Based On Audio SpeechIOSR Journals
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)inventionjournals
 
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...A Combined Voice Activity Detector Based On Singular Value Decomposition and ...
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...CSCJournals
 
Broad phoneme classification using signal based features
Broad phoneme classification using signal based featuresBroad phoneme classification using signal based features
Broad phoneme classification using signal based featuresijsc
 
Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features  Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features ijsc
 
Wavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionWavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionCSCJournals
 
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLABA GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLABsipij
 
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...cscpconf
 
VOICED SPEECH CHARACTERISATION BASED ON EMPIRICAL MODE DECOMPOSITION
VOICED SPEECH CHARACTERISATION BASED ON EMPIRICAL MODE DECOMPOSITION VOICED SPEECH CHARACTERISATION BASED ON EMPIRICAL MODE DECOMPOSITION
VOICED SPEECH CHARACTERISATION BASED ON EMPIRICAL MODE DECOMPOSITION optljjournal
 
07-03-03-ACA-Tonal-Monof0.pdf
07-03-03-ACA-Tonal-Monof0.pdf07-03-03-ACA-Tonal-Monof0.pdf
07-03-03-ACA-Tonal-Monof0.pdfAlexanderLerch4
 
BIOMASS_E2ES_IGARSS2011.ppt
BIOMASS_E2ES_IGARSS2011.pptBIOMASS_E2ES_IGARSS2011.ppt
BIOMASS_E2ES_IGARSS2011.pptgrssieee
 
IEEE_Paper_PID2966731 (1).pdf
IEEE_Paper_PID2966731 (1).pdfIEEE_Paper_PID2966731 (1).pdf
IEEE_Paper_PID2966731 (1).pdfChirag Dalal
 
Acoustic fMRI noise reduction: a perceived loudness approach
Acoustic fMRI noise reduction: a perceived loudness approachAcoustic fMRI noise reduction: a perceived loudness approach
Acoustic fMRI noise reduction: a perceived loudness approachDimitri Vrehen
 
Speech Processing in Stressing Co-Channel Interference Using the Wigner Distr...
Speech Processing in Stressing Co-Channel Interference Using the Wigner Distr...Speech Processing in Stressing Co-Channel Interference Using the Wigner Distr...
Speech Processing in Stressing Co-Channel Interference Using the Wigner Distr...CSCJournals
 

Similar to Ica2016 312 saruwatari (20)

Speaker recognition.
Speaker recognition.Speaker recognition.
Speaker recognition.
 
Animal Voice Morphing System
Animal Voice Morphing SystemAnimal Voice Morphing System
Animal Voice Morphing System
 
Speech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using VocoderSpeech Analysis and synthesis using Vocoder
Speech Analysis and synthesis using Vocoder
 
Voice biometric recognition
Voice biometric recognitionVoice biometric recognition
Voice biometric recognition
 
Emotion Recognition Based On Audio Speech
Emotion Recognition Based On Audio SpeechEmotion Recognition Based On Audio Speech
Emotion Recognition Based On Audio Speech
 
International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)International Journal of Engineering and Science Invention (IJESI)
International Journal of Engineering and Science Invention (IJESI)
 
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...A Combined Voice Activity Detector Based On Singular Value Decomposition and ...
A Combined Voice Activity Detector Based On Singular Value Decomposition and ...
 
Broad phoneme classification using signal based features
Broad phoneme classification using signal based featuresBroad phoneme classification using signal based features
Broad phoneme classification using signal based features
 
Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features  Broad Phoneme Classification Using Signal Based Features
Broad Phoneme Classification Using Signal Based Features
 
Wavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker RecognitionWavelet Based Noise Robust Features for Speaker Recognition
Wavelet Based Noise Robust Features for Speaker Recognition
 
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLABA GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
A GAUSSIAN MIXTURE MODEL BASED SPEECH RECOGNITION SYSTEM USING MATLAB
 
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...
ANALYSIS OF SPEECH UNDER STRESS USING LINEAR TECHNIQUES AND NON-LINEAR TECHNI...
 
VOICED SPEECH CHARACTERISATION BASED ON EMPIRICAL MODE DECOMPOSITION
VOICED SPEECH CHARACTERISATION BASED ON EMPIRICAL MODE DECOMPOSITION VOICED SPEECH CHARACTERISATION BASED ON EMPIRICAL MODE DECOMPOSITION
VOICED SPEECH CHARACTERISATION BASED ON EMPIRICAL MODE DECOMPOSITION
 
07-03-03-ACA-Tonal-Monof0.pdf
07-03-03-ACA-Tonal-Monof0.pdf07-03-03-ACA-Tonal-Monof0.pdf
07-03-03-ACA-Tonal-Monof0.pdf
 
H0814247
H0814247H0814247
H0814247
 
V041203124126
V041203124126V041203124126
V041203124126
 
BIOMASS_E2ES_IGARSS2011.ppt
BIOMASS_E2ES_IGARSS2011.pptBIOMASS_E2ES_IGARSS2011.ppt
BIOMASS_E2ES_IGARSS2011.ppt
 
IEEE_Paper_PID2966731 (1).pdf
IEEE_Paper_PID2966731 (1).pdfIEEE_Paper_PID2966731 (1).pdf
IEEE_Paper_PID2966731 (1).pdf
 
Acoustic fMRI noise reduction: a perceived loudness approach
Acoustic fMRI noise reduction: a perceived loudness approachAcoustic fMRI noise reduction: a perceived loudness approach
Acoustic fMRI noise reduction: a perceived loudness approach
 
Speech Processing in Stressing Co-Channel Interference Using the Wigner Distr...
Speech Processing in Stressing Co-Channel Interference Using the Wigner Distr...Speech Processing in Stressing Co-Channel Interference Using the Wigner Distr...
Speech Processing in Stressing Co-Channel Interference Using the Wigner Distr...
 

Recently uploaded

UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Serviceranjana rawat
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )Tsuyoshi Horigome
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSISrknatarajan
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Call Girls in Nagpur High Profile
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...Call Girls in Nagpur High Profile
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performancesivaprakash250
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Christo Ananth
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordAsst.prof M.Gokilavani
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxupamatechverse
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escortsranjana rawat
 

Recently uploaded (20)

UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
(RIA) Call Girls Bhosari ( 7001035870 ) HI-Fi Pune Escorts Service
 
SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )SPICE PARK APR2024 ( 6,793 SPICE Models )
SPICE PARK APR2024 ( 6,793 SPICE Models )
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
UNIT-III FMM. DIMENSIONAL ANALYSIS
UNIT-III FMM.        DIMENSIONAL ANALYSISUNIT-III FMM.        DIMENSIONAL ANALYSIS
UNIT-III FMM. DIMENSIONAL ANALYSIS
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...Top Rated  Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
Top Rated Pune Call Girls Budhwar Peth ⟟ 6297143586 ⟟ Call Me For Genuine Se...
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...Booking open Available Pune Call Girls Koregaon Park  6297143586 Call Hot Ind...
Booking open Available Pune Call Girls Koregaon Park 6297143586 Call Hot Ind...
 
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur EscortsHigh Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
High Profile Call Girls Nagpur Meera Call 7001035870 Meet With Nagpur Escorts
 
UNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its PerformanceUNIT - IV - Air Compressors and its Performance
UNIT - IV - Air Compressors and its Performance
 
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
Call for Papers - African Journal of Biological Sciences, E-ISSN: 2663-2187, ...
 
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete RecordCCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
CCS335 _ Neural Networks and Deep Learning Laboratory_Lab Complete Record
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
Introduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptxIntroduction to Multiple Access Protocol.pptx
Introduction to Multiple Access Protocol.pptx
 
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
(MEERA) Dapodi Call Girls Just Call 7001035870 [ Cash on Delivery ] Pune Escorts
 

Ica2016 312 saruwatari

  • 1. Flexible Microphone Array Based on Multichannel Nonnegative Matrix Factorization and Statistical Signal Estimation Hiroshi Saruwatari, Kazuma Takata (The Unoversity of Tokyo, JAPAN) Nobutaka Ono (NII, JAPAN), Shoji Makino (University of Tsukuba, JAPAN) Acoustic Array Systems: Paper ICA2016-312
  • 2. Outline  Introduction of rescue robot audition  Conventional approaches (ICA, IVA, Rank-1 MNMF)  Informed source separation and its problem  Ego-noise basis mismatch problem solution  Speech ambiguity problem solution  Experimental evaluation  Conclusion 2
  • 3. Introduction: Rescue Robot Audition  Aimed to detect victims’ speech in a disaster area.  Flexible body twists and moves driven by vibration motors.  It wears multiple microphones around the body. • Thus, microphones’ position is always unknown. • Self-Vibration generates harmful noise. (so-called Ego-Noise) One of the Distributed Microphone Array Problem 3 MicrophoneVibrator What is hose-shaped rescue robot?
  • 4. 4 Source Observation Separated Mixing Separation Conventional: ICA or Independent vector analysis (IVA), which separates the sources based on their independence nature. We assume linear time- invariance in A. This is a simultaneous estimation problem for W and source statistical models. x=As y=Wx Unknown Known W Demixing matrix How to solve? Use Blind Source Separation Source model (p.d.f.s) S1 S2 Speech Ego-noise Speech Ego- noise +
  • 5. Low-rank source spectrogram 5 Rank-1 MNMF (Independent Low-Rank Matrix Analysis) that separates the sources by estimating demixing matrix W and low-rank source spectrogram model via Nonnegative Matrix Factorization (NMF) [Lee, 2001]. Rank-1 MNMF [Kitamura, Saruwatari et al., IEEE Trans. ASLP 2016] W Demixing matrix Simultaneous estimation for W and TV + In this study, we focus our attention to…
  • 6. 6 Rank-1 MNMF (Independent Low-Rank Matrix Analysis) Pros & Cons: • All parameters can be updated via Auxiliary-Function method (EM-like algorithm), keeping nonnegative feature of T & V. • The cost function always decreases in each iteration. Thus, this is convergence-guaranteed algorithm unlike ICA! • Still affected by initial state of parameters. go to “Informed” Rank-1 MNMF’s cost function to be minimized : Independence measure between sources (for W) : Low-rank approximation of sources (for T and V) (Note: both are based on Itakura-Saito (IS) divergence.)
  • 7. Typical ego-noise basis trained by NMF in advance Activation Source model in Rank-1 MNMF 7 Basis Toward Informed Source Separation
  • 8. Typical ego-noise basis trained by NMF in advance Activation Source model in Rank-1 MNMF Fixing a part of bases, estimate remaining parameters and W. 8 Basis Speech basis Ego- noise basis Toward Informed Source Separation (unknown) (unknown)
  • 9. Typical ego-noise basis trained by NMF in advance Activation Source model in Rank-1 MNMF Fixing a part of bases, estimate remaining parameters and W. [Problem 1] Ego-noise time-variance (ego-noise mismatch problem) [Problem 2] Unknown speech (speech model ambiguity problem) 9 Basis Speech basis Ego- noise basis Toward Informed Source Separation (unknown) (unknown)
  • 10. Supervised Rank-1 MNMF Rough separation Statistical Postfilter [Breithaupt, 2010]  Chi distribution (sparse p.d.f.) is used as target signal prior.  Its sparseness can be estimted from data empirically via higher-order statistics [Murota, Saruwatari, ICASSP2014]. Observed signal Thanks to sparse prior, we can obtain more accurate separation and its Certainty. Statistical Signal Estimation Certainty Estimated ego-noise Sparse p.d.f. 6 Estimated target signal
  • 11. Statistical Signal Estimation 6 Certainty I ={1; if G(f,t)>0.8, otherwise 0}: binary mask that extracts seldom overlapping components with the target signal from the estimated interference signal.
  • 12. 12 Problem 1: Ego-Noise Mismatch Solution  We sample convincing ego-noise spectrogram by certainty I.  Next, obtain smoothed “time-frequency deformation function” between sampled spectrogram and original supervised ego- noise basis.  Time-invariant all-pole model is used as deformation function. Diagonal matrix with entries Supervised ego-noise basis Ego-noise activation KL divergence Order of all-pole model This can be solved as extended NMF optimization. Frequency Powerspectrum
  • 13. 13 Problem 1: Ego-Noise Mismatch Solution : each element of Update of activation Update of all-pole-model weight By noting the KL-cost function as J, its auxiliary function is given by
  • 14.  Statistical postfilter’s output is sparse estimation of S.  We can re-estimate sparse-aware speech basis using .  We use it as an initial value of speech basis in Rank-1 MNMF. 14 Problem 2: Speech Model Ambiguity Solution Speech basis Speech activation IS-divergence Time Frequency Time Frequency Sparse low-rank speech spectrogramOutput of Rank-1 MNMF Sparse Low-rank approximation Sparse speech spectrogram
  • 15. 実験条件  # of mic. : 8 channel microphones on 3-m-long hose-shape robot  Speech : male & female speech with real-recorded impulse responses  Ego-noise: real-recorded in moving hose-shaped robot (2 patterns)  Training : matched with mixed ego-noise (2 patterns) & mismatched (3 patterns)  Evaluation: SDR improvement (both SNR and distortion are considered)  Input SDR: 0 dB, -5 dB, -10 dB  Comparison: IVA, PSNMF (single-channel supervised NMF), Rank-1 MNMF (no supervision) 15 Simulation Experiment True target Interference Artificial distortionEstimated Higher SDR indicates better separation
  • 16. 16 Example of Typical SDR Improvement Supervised Rank-1 MNMF Statistical postfilter (1) (2) (3) (4) Combination of each processing is effective. SDRImprovement[dB] Step(1) Step(2) Step(3) Step(4) SDR increases through each processing step Before basis defom. and initialization Basis deform. and Initialization After basis defom. and initialization
  • 17. 17 Comparison with Competitors  Proposed methods of both matched and mismatched cases outperform other conventional methods, whereas the mismatched case is inferior to matched. Conventional Proposed
  • 18.  We proposed a new informed source separation method for the flexible microphone array system based on supervised Rank-1 MNMF and statistical speech enhancement.  To reduce the mismatch problem, we proposed the algorithm that an all-pole model is estimated to deform the bases using the reliable spectral components sampled by the statistical signal enhancement method.  We revealed that the proposed method outperforms the conventional methods via experiments with actual sounds in the rescue robot. 18 Conclusion Thank you for your attention!