Noise Reduction in Time Domain Using Referential Reconstruction

Takehiro IHARA  Takayuki NAGAI  Kazuhiko OZEKI  Akira KUREMATSU  

IEICE TRANSACTIONS on Information and Systems   Vol.E89-D   No.3   pp.1203-1213
Publication Date: 2006/03/01
Online ISSN: 1745-1361
DOI: 10.1093/ietisy/e89-d.3.1203
Print ISSN: 0916-8532
Type of Manuscript: PAPER
Category: Speech and Hearing
noise reduction,  time domain,  referential reconstruction,  pattern matching,  automatic speech recognition,  

Full Text: PDF>>
Buy this Article

We present a novel approach for single-channel noise reduction of speech signals contaminated by additive noise. In this approach, the system requires speech samples to be uttered in advance by the same speaker as that of the input signal. Speech samples used in this method must have enough phonetic variety to reconstruct the input signal. In the proposed method, which we refer to as referential reconstruction, we have used a small database created from examples of speech, which will be called reference signals. Referential reconstruction uses an example-based approach, in which the objective is to find the candidate speech frame which is the most similar to the clean input frame without noise, although the input frame is contaminated with noise. When candidate frames are found, they become final outputs without any special processing. In order to find the candidate frames, a correlation coefficient is used as a similarity measure. Through automatic speech recognition experiments, the proposed method was shown to be effective, particularly for low-SNR speech signals corrupted with white noise or noise in high-frequency bands. Since the direct implementation of this method requires infeasible computational cost for searching through reference signals, a coarse-to-fine strategy is introduced in this paper.