Speech Enhancement by Profile Fitting Method

Osamu ICHIKAWA  Tetsuya TAKIGUCHI  Masafumi NISHIMURA  

Publication
IEICE TRANSACTIONS on Information and Systems   Vol.E86-D   No.3   pp.514-521
Publication Date: 2003/03/01
Online ISSN: 
Print ISSN: 0916-8532
Type of Manuscript: Special Section PAPER (Special Issue on Speech Information Processing)
Category: Robust Speech Recognition and Enhancement
Keyword: 
speech enhancement,  microphone array,  beamformer,  noise reduction,  spectral subtraction,  speech recognition,  

Full Text: PDF(840.5KB)
>>Buy this Article


Summary: 
It is believed that distant-talking speech recognition in a noisy environment requires a large-scale microphone array. However, this cannot fit into small consumer devices. Our objective is to improve the performance with a limited number of microphones (preferably only left and right). In this paper, we focused on a profile that is the shape of the power distribution according to the beamforming direction. An observed profile can be decomposed into known profiles for directional sound sources and a non-directional background sound source. Evaluations confirmed this method reduced the CER (Character Error Ratio) for the dictation task by more than 20% compared to a conventional 2-channel Adaptive Spectral Subtraction beamformer in a non-reverberant environment.