Bi-Direction Interaural Matching Filter and Decision Weighting Fusion for Sound Source Localization in Noisy Environments

Hong LIU  Mengdi YUE  Jie ZHANG  

IEICE TRANSACTIONS on Information and Systems   Vol.E99-D   No.12   pp.3192-3196
Publication Date: 2016/12/01
Publicized: 2016/09/12
Online ISSN: 1745-1361
DOI: 10.1587/transinf.2016EDL8125
Type of Manuscript: LETTER
Category: Speech and Hearing
binaural auditory,  sound source localization,  hybrid interaural matching filter,  decision weighting fusion,  

Full Text: PDF>>
Buy this Article

Sound source localization is an essential technique in many applications, e.g., speech enhancement, speech capturing and human-robot interaction. However, the performance of traditional methods degrades in noisy or reverberant environments, and it is sensitive to the spatial location of sound source. To solve these problems, we propose a sound source localization framework based on bi-direction interaural matching filter (IMF) and decision weighting fusion. Firstly, bi-directional IMF is put forward to describe the difference between binaural signals in forward and backward directions, respectively. Then, a hybrid interaural matching filter (HIMF), which is obtained by the bi-direction IMF through decision weighting fusion, is used to alleviate the affection of sound locations on sound source localization. Finally, the cosine similarity between the HIMFs computed from the binaural audio and transfer functions is employed to measure the probability of the source location. Constructing the similarity for all the spatial directions as a matrix, we can determine the source location by Maximum A Posteriori (MAP) estimation. Compared with several state-of-the-art methods, experimental results indicate that HIMF is more robust in noisy environments.