Title: Complex document image segmentation using localized histogram analysis with multi-layer matching and clustering
Authors: Chen, YL
Chiu, CC
Wu, BF
電控工程研究所
Institute of Electrical and Control Engineering
Keywords: image segmentation;multilevel thresholding;region-based segmentation;document analysis
Issue Date: 2004
Abstract: This paper proposes a new segmentation method to separate the text from various complex document images. An automatic multilevel thresholding method, based on discriminant analysis, is utilized to recursively segment a specified block region into several layered image sub-blocks. Then the multi-layer region-based clustering method is performed to process the layered image sub-blocks to form several object layers. Hence character strings with different illuminations, non-text objects and background components are segmented into separate object layers. After performed text extraction process, the text objects with different sizes, styles and illuminations are properly extracted. Experimental results on the extraction of text strings from complex document images demonstrate the effectiveness of the proposed region-based segmentation method.
URI: http://hdl.handle.net/11536/18197
ISBN: 0-7803-8566-7
ISSN: 1062-922X
Journal: 2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7
Begin Page: 3063
End Page: 3070
Appears in Collections:Conferences Paper