Extraction of Inclined Character Strings from Unformed Document Images Using the Confidence Value of a Character Recognizer

Kei TAKIZAWA  Daisaku ARITA  Michihiko MINOH  Katsuo IKEDA  

Publication
IEICE TRANSACTIONS on Information and Systems   Vol.E77-D   No.7   pp.839-845
Publication Date: 1994/07/25
Online ISSN: 
DOI: 
Print ISSN: 0916-8532
Type of Manuscript: Special Section PAPER (Special Issue on Document Analysis and Recognition)
Category: 
Keyword: 
string segmentation,  string extraction,  layout analysis,  

Full Text: PDF>>
Buy this Article




Summary: 
A method for extracting and recognizing character strings from unformed document images, which have inclined character strings and have no structure at all, is described. To process such kinds of unformed documents, previous schemes, which are intended only to deal with documents containing nothing but horizontal or vertical strings of characters, do not work well. Our method is based on the idea that the processes of recognition and extraction of character patterns should operate together, and on the characteristic that the character patterns are located close to each other when they belong to the same string. The method has been implemented and applied to several images. The experimental results show the robustness of our method.