For Full-Text PDF, please login, if you are a member of IEICE,|
or go to Pay Per View on menu list, if you are a nonmember of IEICE.
Outlier Detection and Removal for HMM-Based Speech Synthesis with an Insufficient Speech Database
Doo Hwa HONG June Sig SUNG Kyung Hwan OH Nam Soo KIM
IEICE TRANSACTIONS on Information and Systems
Publication Date: 2012/09/01
Online ISSN: 1745-1361
Print ISSN: 0916-8532
Type of Manuscript: LETTER
Category: Speech and Hearing
HMM-based speech synthesis, decision tree-based clustering, outlier detection, insufficient speech database,
Full Text: PDF>>
Decision tree-based clustering and parameter estimation are essential steps in the training part of an HMM-based speech synthesis system. These two steps are usually performed based on the maximum likelihood (ML) criterion. However, one of the drawbacks of the ML criterion is that it is sensitive to outliers which usually result in quality degradation of the synthesized speech. In this letter, we propose an approach to detect and remove outliers for HMM-based speech synthesis. Experimental results show that the proposed approach can improve the synthetic speech, particularly when the available training speech database is insufficient.