Sound Source Localization Using a Profile Fitting Method with Sound Reflectors

Osamu ICHIKAWA  Tetsuya TAKIGUCHI  Masafumi NISHIMURA  

Publication
IEICE TRANSACTIONS on Information and Systems   Vol.E87-D   No.5   pp.1138-1145
Publication Date: 2004/05/01
Online ISSN: 
Print ISSN: 0916-8532
Type of Manuscript: Special Section PAPER (Special Section on Speech Dynamics by Ear, Eye, Mouth and Machine)
Category: 
Keyword: 
sound source localization,  microphone array,  sound reflector,  ICTD,  ICLD,  

Full Text: PDF(1.2MB)
>>Buy this Article


Summary: 
In a two-microphone approach, interchannel differences in time (ICTD) and interchannel differences in sound level (ICLD) have generally been used for sound source localization. But those cues are not effective for vertical localization in the median plane (direct front). For that purpose, spectral cues based on features of head-related transfer functions (HRTF) have been investigated, but they are not robust enough against signal variations and environmental noise. In this paper, we use a "profile" as a cue while using a combination of reflectors specially designed for vertical localization. The observed sound is converted into a profile containing information about reflections as well as ICTD and ICLD data. The observed profile is decomposed into signal and noise by using template profiles associated with sound source locations. The template minimizing the residual of the decomposition gives the estimated sound source location. Experiments show this method can correctly provide a rough estimate of the vertical location even in a noisy environment.