Segmentation of Horizontal and Vertical Touching Thai Characters


IEICE TRANSACTIONS on Fundamentals of Electronics, Communications and Computer Sciences   Vol.E83-A    No.6    pp.987-995
Publication Date: 2000/06/25
Online ISSN: 
Print ISSN: 0916-8508
Type of Manuscript: Special Section PAPER (Special Section of Papers Selected from 1999 International Technical Conference on Circuits/Systems, Computers and Communications (ITC-CSCC'99))
touching characters,  segmentation,  Thai characters,  projection profile,  

Full Text: PDF(1.3MB)>>
Buy this Article

This paper proposes a scheme which combines the conventional technique with a multi-level structure of Thai sentences for detection and segmentation for touching Thai printed characters. The proposed scheme solves problems of both horizontally and vertically touching characters. The complexity of a multi-level structure is employed to classify characters into three zones. The edge detection technique is applied to separate overlapping characters. Then, the horizontal touching characters are determined by using a statistical width of characters. The segmentation point of horizontal touching characters is determined using vertical projection combined with a statistical width of characters. The vertical touching characters are determined by considering the overlapping area of character boundary between zones. The height of line is used to separate the segment of vertical touching characters. Ambiguities are handle by using distinctive features of Thai characters. The effectiveness of the proposed scheme is tested with data from both newspapers and printed documents. The accuracy of 97 and 98 percents are obtained for newspaper and printed documents respectively.