Multilingual Closed Caption Translation System for Digital Television

Sanghwa YUH  Kongjoo LEE  Jungyun SEO  

IEICE TRANSACTIONS on Information and Systems   Vol.E89-D   No.6   pp.1885-1892
Publication Date: 2006/06/01
Online ISSN: 1745-1361
DOI: 10.1093/ietisy/e89-d.6.1885
Print ISSN: 0916-8532
Type of Manuscript: Special Section PAPER (Special Section on Human Communication II)
Category: Service and System
closed caption,  machine translation,  digital TV,  named entity,  translingual equivalence,  domain identifier,  

Full Text: PDF(574.3KB)>>
Buy this Article

In this paper, we present a Korean to Chinese/English/Japanese multilingual Machine Translation (MT) system of closed captions for Digital Television (DTV). Preliminary experiments of our closed caption translation with existing base MT systems had shown unsatisfactory result. In order to achieve more accurate translation with the base MT systems, we adopted live resources of multilingual Named Entities and their translingual equivalences from the Web. We also utilize the program information, which the terrestrial broadcasters offer through DTV transport stream, in order to use program specific dictionaries, including the names of characters, locations and organizations. Two more components are adopted for reducing the ambiguities of parsing and word sense disambiguation; sentence simplification for long sentence segmentation and dynamic domain identification for automatic domain dictionary stacking. With these integrated approaches, we could raise the Mean Opinion Score (MOS) of translation accuracy by 0.40 higher than the base MT systems.