Sentiment Classification in Under-Resourced Languages Using Graph-Based Semi-Supervised Learning Methods

Yong REN  Nobuhiro KAJI  Naoki YOSHINAGA  Masaru KITSUREGAWA  

IEICE TRANSACTIONS on Information and Systems   Vol.E97-D   No.4   pp.790-797
Publication Date: 2014/04/01
Online ISSN: 1745-1361
DOI: 10.1587/transinf.E97.D.790
Type of Manuscript: Special Section PAPER (Special Section on Data Engineering and Information Management)
sentiment classification,  graph-based semi-supervised learning,  

Full Text: FreePDF(1.4MB)

In sentiment classification, conventional supervised approaches heavily rely on a large amount of linguistic resources, which are costly to obtain for under-resourced languages. To overcome this scarce resource problem, there exist several methods that exploit graph-based semi-supervised learning (SSL). However, fundamental issues such as controlling label propagation, choosing the initial seeds, selecting edges have barely been studied. Our evaluation on three real datasets demonstrates that manipulating the label propagating behavior and choosing labeled seeds appropriately play a critical role in adopting graph-based SSL approaches for this task.