Constraint Satisfaction Approach to Extraction of Japanese Character Regions from Unformatted Document Image

Keiji GYOHTEN  Noboru BABAGUCHI  Tadahiro KITAHASHI  

Publication
IEICE TRANSACTIONS on Information and Systems   Vol.E78-D   No.4   pp.466-475
Publication Date: 1995/04/25
Print ISSN: 0916-8532
Type of Manuscript: PAPER
Category: Image Processing, Computer Graphics and Pattern Recognition
Keywords: character extraction, unformatted document, bottom-up approach, constraint satisfaction, simulated annealing

Summary:
In this paper, we present a method for extracting printed Japanese characters from unformatted document images. The method takes into account multiple general features specific to printed Japanese characters. Within a constraint satisfaction approach, these features are treated as constraints on the regions to be extracted. Extraction is achieved by minimizing a constraint function that quantitatively estimates how well the features are satisfied. Because it requires no a priori knowledge of the document layout, the method is applicable to all kinds of Japanese documents. Experiments gave favorable results for the effectiveness of the method.
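
The idea of minimizing a constraint function by simulated annealing (listed among the keywords) can be illustrated with a small Python sketch. It is not the authors' formulation: the summary does not give the constraint function, so the two penalties used here (near-square bounding boxes and uniform box size), the `reward` term, the cooling schedule, and the names `cost` and `anneal` are all assumptions made for illustration. Candidate regions are represented only by hypothetical (width, height) bounding boxes, and each box is labeled as kept or discarded.

```python
import math
import random


def cost(boxes, labels, reward=1.0):
    """Hypothetical constraint function: lower values mean the kept boxes
    look more like a set of printed characters (assumed penalties only)."""
    chosen = [box for box, keep in zip(boxes, labels) if keep]
    if not chosen:
        return 0.0
    mean_size = sum(max(w, h) for w, h in chosen) / len(chosen)
    total = 0.0
    for w, h in chosen:
        size = max(w, h)
        squareness = abs(w - h) / size                  # 0 for a square box
        uniformity = abs(size - mean_size) / mean_size  # 0 when sizes agree
        # Each kept box earns a fixed reward, so discarding everything
        # is not automatically the minimum.
        total += squareness + uniformity - reward
    return total


def anneal(boxes, steps=5000, t_start=1.0, t_end=0.01, seed=0):
    """Minimize cost() over keep/discard labels with simulated annealing."""
    rng = random.Random(seed)
    labels = [False] * len(boxes)
    current = cost(boxes, labels)
    best, best_cost = labels[:], current
    for step in range(steps):
        # Geometric cooling from t_start down to t_end (an assumed schedule).
        t = t_start * (t_end / t_start) ** (step / max(steps - 1, 1))
        i = rng.randrange(len(boxes))
        labels[i] = not labels[i]            # propose flipping one label
        candidate = cost(boxes, labels)
        delta = candidate - current
        # Metropolis rule: always accept improvements, and accept a worse
        # labeling with probability exp(-delta / t).
        if delta <= 0 or rng.random() < math.exp(-delta / t):
            current = candidate
            if current < best_cost:
                best, best_cost = labels[:], current
        else:
            labels[i] = not labels[i]        # reject: undo the flip
    return best, best_cost
```

Calling `anneal(boxes)` with a list of (width, height) pairs returns the lowest-cost labeling found and its cost. No layout model enters the cost, which mirrors the claim in the summary that no a priori knowledge of the document layout is required; a realistic constraint function would need further terms, for example for the spacing and alignment of neighboring regions.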