A Study on Mouth Shape Features Suitable for HMM Speech Recognition Using Fusion of Visual and Auditory Information

Naoshi DOI, Akira SHINTANI, Yasuhisa HAYASHI, Akio OGIHARA, Shinobu TAKAMATSU

IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences, Vol.E78-A, No.11, pp.1548-1552
Publication Date: 1995/11/25
Print ISSN: 0916-8508
Type of Manuscript: Special Section LETTER (Special Section of Letters Selected from the 1995 IEICE General Conference)
Keywords: speech recognition, fusion of visual and auditory information, feature of mouth shape, image processing

Speech recognition methods that fuse visual and auditory information have recently been studied. This paper examines which mouth shape features are suitable for such fusion. Mouth shape features extracted from gray-level images and from binary images are adopted, and speech recognition is performed using a linear combination method. Based on the recognition results, the mouth shape features that are effective for fusing visual and auditory information are investigated. The effectiveness of using both kinds of mouth shape features is also confirmed.
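
The abstract does not define the linear combination method, so the following is only a minimal sketch of one common reading: each candidate word receives a log-likelihood from an auditory HMM and from a visual (mouth shape) HMM, and the two are combined by a weighted sum before the best word is chosen. The function names, the `audio_weight` parameter, and its default value are assumptions made for illustration and are not taken from the paper.

```python
from typing import Dict


def fuse_scores(
    audio_ll: Dict[str, float],
    visual_ll: Dict[str, float],
    audio_weight: float,
) -> Dict[str, float]:
    """Linearly combine per-word log-likelihoods from two recognizers.

    audio_weight is a hypothetical mixing weight in [0, 1]; the visual
    score receives the remaining weight (1 - audio_weight).
    """
    if not 0.0 <= audio_weight <= 1.0:
        raise ValueError("audio_weight must lie in [0, 1]")
    if audio_ll.keys() != visual_ll.keys():
        raise ValueError("both recognizers must score the same vocabulary")
    return {
        word: audio_weight * audio_ll[word]
        + (1.0 - audio_weight) * visual_ll[word]
        for word in audio_ll
    }


def recognize(
    audio_ll: Dict[str, float],
    visual_ll: Dict[str, float],
    audio_weight: float = 0.7,  # assumed value, not from the paper
) -> str:
    """Return the word with the highest fused score."""
    fused = fuse_scores(audio_ll, visual_ll, audio_weight)
    return max(fused, key=fused.get)
```

Under this reading, setting `audio_weight` to 1.0 reduces to audio-only recognition and 0.0 to visual-only recognition, so the weight controls how much the mouth shape features influence the decision.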