標題: | Complex document image segmentation using localized histogram analysis with multi-layer matching and clustering |
作者: | Chen, YL Chiu, CC Wu, BF 電控工程研究所 Institute of Electrical and Control Engineering |
關鍵字: | image segmentation;multilevel thresholding;region-based segmentation;document analysis |
公開日期: | 2004 |
摘要: | This paper proposes a new segmentation method to separate the text from various complex document images. An automatic multilevel thresholding method, based on discriminant analysis, is utilized to recursively segment a specified block region into several layered image sub-blocks. Then the multi-layer region-based clustering method is performed to process the layered image sub-blocks to form several object layers. Hence character strings with different illuminations, non-text objects and background components are segmented into separate object layers. After performed text extraction process, the text objects with different sizes, styles and illuminations are properly extracted. Experimental results on the extraction of text strings from complex document images demonstrate the effectiveness of the proposed region-based segmentation method. |
URI: | http://hdl.handle.net/11536/18197 |
ISBN: | 0-7803-8566-7 |
ISSN: | 1062-922X |
期刊: | 2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7 |
起始頁: | 3063 |
結束頁: | 3070 |
Appears in Collections: | Conferences Paper |