Context-Dependent Phoneme Duration Modeling with Tree-Based State Tying

Sung-Joon PARK  Myoung-Wan KOO  Chu-Shik JHON  

Publication
IEICE TRANSACTIONS on Information and Systems   Vol.E88-D   No.3   pp.662-666
Publication Date: 2005/03/01
Print ISSN: 0916-8532
Type of Manuscript: LETTER
Category: Speech and Hearing
Keyword: duration model, gamma distribution, tree-based state tying

Summary: 
This letter presents two methods of modeling phoneme durations. The first is context-independent phoneme duration modeling, in which duration parameters are stored for each phoneme. The second is context-dependent duration modeling, in which duration parameters are stored for each state shared by context-dependent phonemes. The phoneme duration models are compared with a model that uses no duration information and with a state duration model. Experiments are performed on a database collected over the telephone network. The results show that duration information is effective at rejecting out-of-task (OOT) words and that the context-dependent duration model performs best among the tested models.
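
The difference between the two ways of storing duration parameters can be illustrated with a minimal Python sketch. It is not the authors' implementation. The keywords name the gamma distribution, but the method-of-moments fit, the triphone notation "left-center+right", the TIED_STATE table (standing in for the output of tree-based state tying), the sample durations, and all function names are assumptions made for illustration only.

```python
import math
from collections import defaultdict
from statistics import mean, pvariance


def fit_gamma(durations):
    """Method-of-moments gamma fit (an assumed estimator, not from the letter).

    Returns (shape, scale) with shape = mean^2 / var and scale = var / mean.
    """
    m = mean(durations)
    v = pvariance(durations)
    if v <= 0:
        raise ValueError("need at least two distinct durations to fit a gamma")
    return m * m / v, v / m


def gamma_log_pdf(d, shape, scale):
    """Log density of a gamma distribution at duration d > 0."""
    return ((shape - 1) * math.log(d) - d / scale
            - math.lgamma(shape) - shape * math.log(scale))


def base_phoneme(triphone):
    """Context-independent key: 'k-a+t' -> 'a'."""
    return triphone.split("-")[-1].split("+")[0]


# Hypothetical tying table: context-dependent phonemes that share a state.
TIED_STATE = {"k-a+t": "a_s1", "g-a+t": "a_s1", "m-a+n": "a_s2"}


def tied_state(triphone):
    """Context-dependent key: the shared state the triphone is tied to."""
    return TIED_STATE[triphone]


def train(samples, key_fn):
    """Pool durations by key_fn(triphone) and fit one gamma per key."""
    pooled = defaultdict(list)
    for triphone, duration in samples:
        pooled[key_fn(triphone)].append(duration)
    return {key: fit_gamma(ds) for key, ds in pooled.items()}


def duration_score(model, key_fn, segments):
    """Average duration log-likelihood over the segments of one hypothesis."""
    total = sum(gamma_log_pdf(d, *model[key_fn(t)]) for t, d in segments)
    return total / len(segments)


# Made-up training data: (triphone, duration in frames).
SAMPLES = [("k-a+t", 6), ("k-a+t", 8), ("g-a+t", 7), ("g-a+t", 9),
           ("m-a+n", 14), ("m-a+n", 18), ("m-a+n", 16)]

ci_model = train(SAMPLES, base_phoneme)  # one parameter set per phoneme
cd_model = train(SAMPLES, tied_state)    # one parameter set per tied state
```

In this sketch, a hypothesis whose segment durations are implausible for their tied states receives a low duration_score. One conceivable use is to compare that score with a threshold when deciding whether to reject a word, but the rejection rule actually used in the letter is not described in the summary.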