Speech Recognition Using Finger Tapping Timings

Hiromitsu BAN  Chiyomi MIYAJIMA  Katsunobu ITOU  Kazuya TAKEDA  Fumitada ITAKURA  

IEICE TRANSACTIONS on Information and Systems   Vol.E88-D   No.3   pp.667-670
Publication Date: 2005/03/01
DOI: 10.1093/ietisy/e88-d.3.667
Print ISSN: 0916-8532
Type of Manuscript: LETTER
Category: Speech and Hearing
Keywords: multi-modal speech recognition, human behavioral synchronization

Behavioral synchronization between speech and finger tapping offers a novel way to improve speech recognition accuracy. We incorporate a sequence of finger tapping timings, recorded alongside an utterance, into recognition using two distinct methods. In the first, HMM state transition probabilities at word boundaries are controlled by the timing of the finger tapping. In the second, the probability (relative frequency) of finger tapping is used as a 'feature' and combined with MFCC in an HMM recognition system. We evaluate both methods on connected digit recognition under different noise conditions (AURORA-2J). Exploiting the synchrony between speech and finger tapping yields a 46% relative improvement in these connected digit recognition experiments.
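
The two ways of using tap timings described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The abstract does not say how the tapping probability is estimated, so the Gaussian bump around each tap, the 10 ms frame shift, and the names tap_feature, append_tap_feature and boundary_log_prob (with their weight and floor parameters) are all assumptions made for illustration only.

```python
import math


def tap_feature(tap_times_s, n_frames, frame_shift_s=0.01, sigma_s=0.05):
    """Per-frame tap score in [0, 1].

    Assumption: each tap is modelled as a Gaussian bump of width sigma_s
    centred on the tap time; the abstract only says a tapping probability
    (relative frequency) is used, not how it is computed.
    """
    feature = []
    for i in range(n_frames):
        t = i * frame_shift_s  # time of frame i in seconds
        score = max(
            (math.exp(-0.5 * ((t - tap) / sigma_s) ** 2) for tap in tap_times_s),
            default=0.0,  # no taps recorded: score is zero everywhere
        )
        feature.append(score)
    return feature


def append_tap_feature(mfcc_frames, tap_times_s, frame_shift_s=0.01, sigma_s=0.05):
    """Second method (sketch): append the tap score to every MFCC frame."""
    taps = tap_feature(tap_times_s, len(mfcc_frames), frame_shift_s, sigma_s)
    return [list(frame) + [tap] for frame, tap in zip(mfcc_frames, taps)]


def boundary_log_prob(base_log_prob, tap_score, weight=2.0, floor=1e-3):
    """First method (sketch): bias a word-boundary transition by the tap score.

    Hypothetical rule: add a weighted log of the tap score, so word-boundary
    transitions far from any tap are penalised. The floor avoids log(0).
    """
    return base_log_prob + weight * math.log(max(tap_score, floor))


if __name__ == "__main__":
    # Two dummy 2-dimensional "MFCC" frames per 10 ms; one tap at 0.10 s.
    frames = [[0.0, 0.0] for _ in range(21)]
    augmented = append_tap_feature(frames, [0.10])
    print(len(augmented[0]))
    print(boundary_log_prob(-1.0, augmented[10][-1]))
```

In this sketch the augmented frames would be passed to an HMM recognizer in place of the plain MFCC frames, while boundary_log_prob would be applied only to transitions that leave the final state of a word model.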