Full metadata record
DC FieldValueLanguage
dc.contributor.authorChen, Yen-Linen_US
dc.contributor.authorWu, Bing-Feien_US
dc.date.accessioned2014-12-08T15:09:12Z-
dc.date.available2014-12-08T15:09:12Z-
dc.date.issued2009-07-01en_US
dc.identifier.issn0031-3203en_US
dc.identifier.urihttp://dx.doi.org/10.1016/j.patcog.2008.10.032en_US
dc.identifier.urihttp://hdl.handle.net/11536/7018-
dc.description.abstractThis study presents a new method, namely the multiplane segmentation approach, for segmenting and extracting textual objects from various real-life complex document images. The proposed multi-plane segmentation approach first decomposes the document image into distinct object planes to extract and separate homogeneous objects including textual regions of interest, non-text objects such as graphics and pictures, and background textures. This process consists of two stages-localized histogram multilevel thresholding and multi-plane region matching and assembling. Then a text extraction procedure is applied Oil the resultant planes to detect and extract textual objects with different characteristics in the respective planes. The proposed approach processes document images regionally and adaptively according to their respective local features. Hence detailed characteristics of the extracted textual objects, Particularly small characters with thin strokes, as well as gradational illuminations of characters, can be well-preserved. Moreover, this way also allows background objects with uneven, gradational, and sharp variations in contrast, illumination, and texture to be handled easily and well. Experimental results on real-life complex document images demonstrate that the proposed approach is effective in extracting textual objects with Various illuminations, sizes, and font styles from various types of complex document images. (C) 2008 Elsevier Ltd. All rights reserved.en_US
dc.language.isoen_USen_US
dc.subjectDocument image processingen_US
dc.subjectText extractionen_US
dc.subjectImage segmentationen_US
dc.subjectMultilevel thresholdingen_US
dc.subjectRegion segmentationen_US
dc.subjectComplex document imagesen_US
dc.titleA multi-plane approach for text segmentation of complex document imagesen_US
dc.typeArticleen_US
dc.identifier.doi10.1016/j.patcog.2008.10.032en_US
dc.identifier.journalPATTERN RECOGNITIONen_US
dc.citation.volume42en_US
dc.citation.issue7en_US
dc.citation.spage1419en_US
dc.citation.epage1444en_US
dc.contributor.department電控工程研究所zh_TW
dc.contributor.departmentInstitute of Electrical and Control Engineeringen_US
dc.identifier.wosnumberWOS:000265365500020-
dc.citation.woscount11-
Appears in Collections:Articles


Files in This Item:

  1. 000265365500020.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.