Duration Modeling Using Cumulative Duration Probability

Tae-Young YANG  Chungyong LEE  Dae-Hee YOUN  

Publication
IEICE TRANSACTIONS on Information and Systems   Vol.E85-D   No.9   pp.1452-1454
Publication Date: 2002/09/01
Online ISSN: 
DOI: 
Print ISSN: 0916-8532
Type of Manuscript: LETTER
Category: Speech and Hearing
Keyword: 
speech recognition,  connected digit recognition,  duration modeling,  cumulative duration probability,  

Full Text: PDF(108.9KB)>>
Buy this Article




Summary: 
A duration modeling technique is proposed for the HMM based connected digit recognizer. The proposed duration modeling technique uses a cumulative duration probability. The cumulative duration probability is defined as the partial sum of the duration probabilities which can be estimated from the training speech data. Two approaches of using it are presented. First, the cumulative duration probability is used as a weighting factor to the state transition probability of HMM. Second, it replaces the conventional state transition probability. In both approaches, the cumulative duration probability is combined directly to the Viterbi decoding procedure. A modified Viterbi decoding procedure is also presented. One of the advantages of the proposed duration modeling technique is that the cumulative duration probability rules the transitions of states and words at each frame. Therefore, an additional post-procedure is not required. The proposed technique was examined by recognition experiments on Korean connected digit. Experimental results showed that two approach achieved almost same performances and that the average recognition accuracy was enhanced from 83.60% to 93.12%.