Classification of Utterances Based on Multiple BLEU Scores for Translation-Game-Type CALL Systems

Reiko KUWA  Tsuneo KATO  Seiichi YAMAMOTO  

Publication
IEICE TRANSACTIONS on Information and Systems   Vol.E101-D   No.3   pp.750-757
Publication Date: 2018/03/01
Online ISSN: 1745-1361
DOI: 10.1587/transinf.2017EDP7151
Type of Manuscript: PAPER
Category: Speech and Hearing
Keyword: 
interactive CALL,  classification,  learner corpus,  BLEU,  

Full Text: PDF(660.6KB)
>>Buy this Article


Summary: 
This paper proposes a classification method of second-language-learner utterances for interactive computer-assisted language learning systems. This classification method uses three types of bilingual evaluation understudy (BLEU) scores as features for a classifier. The three BLEU scores are calculated in accordance with three subsets of a learner corpus divided according to the quality of utterances. For the purpose of overcoming the data-sparseness problem, this classification method uses the BLEU scores calculated using a mixture of word and part-of-speech (POS)-tag sequences converted from word sequences based on a POS-replacement rule according to which words are replaced with POS tags in n-grams. Experiments of classifying English utterances by Japanese demonstrated that the proposed classification method achieved classification accuracy of 78.2% which was 12.3 points higher than a baseline with one BLEU score.