Representing, Utilizing and Acquiring Knowledge for Document Image Understanding

Koichi KISE, Noboru BABAGUCHI

Publication
IEICE TRANSACTIONS on Information and Systems   Vol.E77-D   No.7   pp.770-777
Publication Date: 1994/07/25
Print ISSN: 0916-8532
Type of Manuscript: Special Section PAPER (Special Issue on Document Analysis and Recognition)
Keyword: document image understanding, document image analysis, knowledge based system, knowledge acquisition

Summary: 
This paper discusses the role of knowledge in document image understanding from the viewpoints of representation, utilization and acquisition. For the representation of knowledge, we propose two models, a layout model and a content model, which represent knowledge about the layout structure and the content of a document, respectively. For the utilization of knowledge, we implement layout analysis and content analysis, which utilize the layout model and the content model, respectively. A strategy of hypothesis generation and verification is introduced to integrate these two kinds of analysis. For the acquisition of knowledge, we propose a method for incrementally acquiring a layout model from a stream of example documents. Experiments on document image understanding and knowledge acquisition using 50 sample visiting cards verified the effectiveness of the proposed method.
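
The summary does not give implementation details, so the following is only an illustrative sketch of how hypothesis generation and verification might combine a layout model with a content model, and of how a layout model might be widened incrementally from examples. Everything in it is an assumption made for illustration: the `Block` structure, the vertical-band layout model, the regular-expression content model, and the function names are hypothetical and are not taken from the paper.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Block:
    """A text block from a card image (hypothetical structure)."""
    y: float    # normalized vertical position, 0.0 (top) to 1.0 (bottom)
    text: str   # recognized text, assumed to come from a prior OCR step


# Hypothetical layout model: label -> vertical band where the field may appear.
LAYOUT_MODEL = {
    "name": (0.3, 0.6),
    "phone": (0.6, 1.0),
    "organization": (0.0, 0.4),
}

# Hypothetical content model: label -> pattern the field's text should satisfy.
CONTENT_MODEL = {
    "name": re.compile(r"^[A-Z][a-z]+ [A-Z]+$"),
    "phone": re.compile(r"^\+?[\d\- ]{7,}$"),
    "organization": re.compile(r"(University|Inc\.|Ltd\.)"),
}


def generate_hypotheses(block, layout_model):
    """Layout analysis: propose every label whose band contains the block."""
    return [label for label, (lo, hi) in layout_model.items()
            if lo <= block.y <= hi]


def verify(label, block, content_model):
    """Content analysis: accept a hypothesis only if the text fits the label."""
    return content_model[label].search(block.text) is not None


def understand(blocks, layout_model=LAYOUT_MODEL, content_model=CONTENT_MODEL):
    """Assign each block the first unused label that survives verification."""
    result = {}
    for block in blocks:
        for label in generate_hypotheses(block, layout_model):
            if label not in result and verify(label, block, content_model):
                result[label] = block.text
                break
    return result


def update_layout_model(layout_model, label, y):
    """Incremental acquisition (assumed rule): widen a band to cover a new example."""
    lo, hi = layout_model.get(label, (y, y))
    updated = dict(layout_model)
    updated[label] = (min(lo, y), max(hi, y))
    return updated


# Made-up example card, not data from the paper.
card = [
    Block(0.35, "Example University"),
    Block(0.50, "Taro YAMADA"),
    Block(0.80, "06-1234-5678"),
]
labels = understand(card)
```

In this sketch the first block falls inside both the `name` and `organization` bands, so layout analysis alone is ambiguous; the content check rejects `name` and accepts `organization`, which is the kind of integration the summary attributes to hypothesis generation and verification.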