Class-Dependent Modeling for Dialog Translation

Andrew FINCH  Eiichiro SUMITA  Satoshi NAKAMURA  

IEICE TRANSACTIONS on Information and Systems   Vol.E92-D   No.12   pp.2469-2477
Publication Date: 2009/12/01
Online ISSN: 1745-1361
DOI: 10.1587/transinf.E92.D.2469
Print ISSN: 0916-8532
Type of Manuscript: PAPER
Category: Speech and Hearing
dialog translation,  class-dependent,  mixture model,  

Full Text: PDF>>
Buy this Article

This paper presents a technique for class-dependent decoding for statistical machine translation (SMT). The approach differs from previous methods of class-dependent translation in that the class-dependent forms of all models are integrated directly into the decoding process. We employ probabilistic mixture weights between models that can change dynamically on a sentence-by-sentence basis depending on the characteristics of the source sentence. The effectiveness of this approach is demonstrated by evaluating its performance on travel conversation data. We used this approach to tackle the translation of questions and declarative sentences using class-dependent models. To achieve this, our system integrated two sets of models specifically built to deal with sentences that fall into one of two classes of dialog sentence: questions and declarations, with a third set of models built with all of the data to handle the general case. The technique was thoroughly evaluated on data from 16 language pairs using 6 machine translation evaluation metrics. We found the results were corpus-dependent, but in most cases our system was able to improve translation performance, and for some languages the improvements were substantial.