Phoneme Set Design for Speech Recognition of English by Japanese

Xiaoyun WANG  Jinsong ZHANG  Masafumi NISHIDA  Seiichi YAMAMOTO  

IEICE TRANSACTIONS on Information and Systems   Vol.E98-D   No.1   pp.148-156
Publication Date: 2015/01/01
Publicized: 2014/10/01
Online ISSN: 1745-1361
DOI: 10.1587/transinf.2014EDP7168
Type of Manuscript: PAPER
Category: Speech and Hearing
phonetic decision tree (PDT),  Phoneme set,  Second language speech recognition,  

Full Text: PDF(1023.6KB)>>
Buy this Article

This paper describes a novel method to improve the performance of second language speech recognition when the mother tongue of users is known. Considering that second language speech usually includes less fluent pronunciation and more frequent pronunciation mistakes, the authors propose using a reduced phoneme set generated by a phonetic decision tree (PDT)-based top-down sequential splitting method instead of the canonical one of the second language. The authors verify the efficacy of the proposed method using second language speech collected with a translation game type dialogue-based English CALL system. Experiments show that a speech recognizer achieved higher recognition accuracy with the reduced phoneme set than with the canonical phoneme set.