|
|
Please login using the form on menu list.
It is required to login for Full-Text PDF.
|
An Unsupervised Speaker Adaptation Method for Lecture-Style Spontaneous Speech Recognition Using Multiple Recognition Systems
Seiichi NAKAGAWA
Tomohiro WATANABE
Hiromitsu NISHIZAKI
Takehito UTSURO
Publication
IEICE TRANSACTIONS on Information and Systems Vol.E88-D No.3 pp.463-471
Publication Date: 2005/03/01
Online ISSN:
Print ISSN: 0916-8532
Type of Manuscript: Special Section PAPER (Special Section on Corpus-Based Speech Technologies)
Category: Spoken Language Systems
Keyword: spontaneous speech recognition,
unsupervised speaker adaptation,
confidence measure,
multiple LVCSR models,
Full Text: PDF(640.8KB)
Summary: This paper describes an accurate unsupervised speaker adaptation method for lecture style spontaneous speech recognition using multiple LVCSR systems. In an unsupervised speaker adaptation framework, the improvement of recognition performance by adapting acoustic models remarkably depends on the accuracy of labels such as phonemes and syllables. Therefore, extraction of the adaptation data guided by confidence measure is effective for unsupervised adaptation. In this paper, we looked for the high confidence portions based on the agreement between two LVCSR systems, adapted acoustic models using the portions attached with high accurate labels, and then improved the recognition accuracy. We applied our method to the Corpus of Spontaneous Japanese (CSJ) and the method improved the recognition rate by about 2.1% in comparison with a traditional method.
|
|