標題: | Binarization of color document images via luminance and saturation color features |
作者: | Tsai, CM Lee, HJ 資訊工程學系 Department of Computer Science |
關鍵字: | color document;color feature;decision-tree;luminance;name-card;saturation;uniform invoice |
公開日期: | 1-Apr-2002 |
摘要: | This paper presents a novel binarization algorithm for color document images. Conventional thresholding methods do not produce satisfactory binarization results for documents with close or mixed foreground colors and background colors. Initially, statistical image features are extracted from the luminance distribution. Then, a decision-tree based binarization method is proposed, which selects various color features to binarize color document images. First, if the document image colors are concentrated within a limited range, saturation is employed. Second, if the image foreground colors are significant, luminance is adopted. Third, if the image background colors are concentrated within a limited range, luminance is also applied. Fourth, if the total number of pixels with low luminance (less than 60) is limited, saturation is applied; else both luminance and saturation are employed. Our experiments include 519 color images, most of which are uniform invoice and name-card document images. The proposed binarization method generates better results than other available methods in shape and connected-component measurements. Also, the binarization method obtains higher recognition accuracy in a commercial OCR system than other comparable methods. |
URI: | http://dx.doi.org/10.1109/TIP.2002.999677 http://hdl.handle.net/11536/28907 |
ISSN: | 1057-7149 |
DOI: | 10.1109/TIP.2002.999677 |
期刊: | IEEE TRANSACTIONS ON IMAGE PROCESSING |
Volume: | 11 |
Issue: | 4 |
起始頁: | 434 |
結束頁: | 451 |
Appears in Collections: | Articles |
Files in This Item:
If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.