Blind Source Separation of Convolutive Mixtures of Speech in Frequency Domain
Shoji MAKINO Hiroshi SAWADA Ryo MUKAI Shoko ARAKI
IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences
Publication Date: 2005/07/01
Print ISSN: 0916-8508
Type of Manuscript: INVITED PAPER (Special Section on Multi-channel Acoustic Signal Processing)
blind source separation, convolutive mixtures, independent component analysis, frequency-domain BSS, microphone array, adaptive beamformer
This paper overviews a total solution for frequency-domain blind source separation (BSS) of convolutive mixtures of audio signals, especially speech. Frequency-domain BSS performs independent component analysis (ICA) in each frequency bin, and this is more efficient than time-domain BSS. We describe a sophisticated total solution for frequency-domain BSS, including permutation, scaling, circularity, and complex activation function solutions. Experimental results for 2×2, 3×3, 4×4, 6×8, and 2×2 (moving sources) configurations (#sources × #microphones) in a room are promising.
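As a rough illustration of what "ICA in each frequency bin" means in practice, the sketch below transforms each microphone signal with a short-time Fourier transform, runs a small complex-valued ICA independently in every bin, and then fixes the per-bin scaling. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names (`stft`, `whiten`, `ica_one_bin`, `fix_scaling`, `separate`), the natural-gradient update, the tanh-on-magnitude activation, the step size, and the frame sizes are all illustrative choices. The permutation and circularity problems named in the abstract are not addressed here, so outputs in different bins may be ordered differently.

```python
import numpy as np

def stft(x, n_fft=256, hop=64):
    """Hann-windowed STFT of a 1-D signal; returns shape (bins, frames)."""
    win = np.hanning(n_fft)
    n_frames = 1 + (len(x) - n_fft) // hop
    frames = np.stack([x[i * hop:i * hop + n_fft] * win for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1).T

def whiten(X, eps=1e-12):
    """Return a matrix V such that V @ X has (approximately) identity covariance."""
    C = X @ X.conj().T / X.shape[1]
    d, E = np.linalg.eigh(C)              # C is Hermitian
    d = np.maximum(d, eps)                # guard against tiny or negative eigenvalues
    return E @ np.diag(1.0 / np.sqrt(d)) @ E.conj().T

def ica_one_bin(X, n_iter=200, mu=0.1):
    """Complex ICA for one bin. X: (channels, frames). Returns the unmixing matrix.

    Assumed update rule: natural gradient with an activation that applies
    tanh to the magnitude and keeps the phase (a polar-coordinate form).
    """
    n, T = X.shape
    V = whiten(X)
    Z = V @ X
    W = np.eye(n, dtype=complex)
    for _ in range(n_iter):
        Y = W @ Z
        phi = np.tanh(np.abs(Y)) * np.exp(1j * np.angle(Y))
        W = W + mu * (np.eye(n) - phi @ Y.conj().T / T) @ W
    return W @ V                          # acts on the unwhitened observations

def fix_scaling(W):
    """Resolve the per-bin scaling ambiguity by rescaling each row of W with the
    matching diagonal entry of inv(W), so outputs are referred to the sensors."""
    return np.diag(np.diag(np.linalg.inv(W))) @ W

def separate(mixtures, n_fft=256, hop=64):
    """mixtures: sequence of equal-length 1-D arrays, one per microphone.
    Returns separated spectrograms of shape (bins, channels, frames)."""
    X = np.stack([stft(m, n_fft, hop) for m in mixtures], axis=1)
    Y = np.empty_like(X)
    for f in range(X.shape[0]):           # each bin is an independent ICA problem
        W = fix_scaling(ica_one_bin(X[f]))
        Y[f] = W @ X[f]
    return Y
```

Calling `separate([mic1, mic2])` on two equal-length recordings returns one complex spectrogram per estimated source. Because every bin is solved on its own, the rows of `W` can come out in a different order from bin to bin, which is why a permutation solution is needed before the outputs can be transformed back to the time domain.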