Automatic Detection of Mis-Spelled Japanese Expressions Using a New Method for Automatic Extraction of Negative Examples Based on Positive Examples

Masaki MURATA  Hitoshi ISAHARA  

Publication
IEICE TRANSACTIONS on Information and Systems   Vol.E85-D   No.9   pp.1416-1424
Publication Date: 2002/09/01
Online ISSN: 
DOI: 
Print ISSN: 0916-8532
Type of Manuscript: PAPER
Category: Natural Language Processing
Keyword: 
negative examples,  spelling errors,  spell checker,  

Full Text: PDF(279.5KB)>>
Buy this Article




Summary: 
We developed a method for extracting negative examples when only positive examples are given as supervised data. This method calculates the probability of occurrence of an input example, which should be judged to be positive or negative. It considers an input example that has a high probability of occurrence but does not appear in the set of positive examples as a negative example. We used this method for one of important tasks in natural language processing: automatic detection of misspelled Japanese expressions. The results showed that the method is effective. In this study, we also described two other methods we developed for the detection of misspelled expressions: a combined method and a "leaving-one-out" method. In our experiments, we found that these methods are also effective.