For Full-Text PDF, please login, if you are a member of IEICE,|
or go to Pay Per View on menu list, if you are a nonmember of IEICE.
Automatic Detection of Mis-Spelled Japanese Expressions Using a New Method for Automatic Extraction of Negative Examples Based on Positive Examples
Masaki MURATA Hitoshi ISAHARA
IEICE TRANSACTIONS on Information and Systems
Publication Date: 2002/09/01
Print ISSN: 0916-8532
Type of Manuscript: PAPER
Category: Natural Language Processing
negative examples, spelling errors, spell checker,
Full Text: PDF(279.5KB)>>
We developed a method for extracting negative examples when only positive examples are given as supervised data. This method calculates the probability of occurrence of an input example, which should be judged to be positive or negative. It considers an input example that has a high probability of occurrence but does not appear in the set of positive examples as a negative example. We used this method for one of important tasks in natural language processing: automatic detection of misspelled Japanese expressions. The results showed that the method is effective. In this study, we also described two other methods we developed for the detection of misspelled expressions: a combined method and a "leaving-one-out" method. In our experiments, we found that these methods are also effective.