複雜型複合式文件影像壓縮方法之研究

標題:	複雜型複合式文件影像壓縮方法之研究 THE STUDY OF THE COMPRESSION ALGORITHMS FOR COMPLEX COMPOUND DOCUMENT IMAGES
作者:	瞿忠正 Chung-Cheng Chiu 吳炳飛 Bing-Fei Wu 電控工程研究所
關鍵字:	圖文分離;影像壓縮;複雜型複合文件影像;text segmentation;image compression;complex compound document image
公開日期:	2004
摘要:	由於複合式文件影像中包含許多文字資訊，當文件影像以傳統壓縮方法壓縮時，文字資訊會產生大量的失真，文字和一些屬於高頻的資訊會變的模糊。所以，傳統壓縮方法並不適合拿來直接對複合式文件影像作壓縮處理，同時，壓縮後文件影像中的文字，也無法容易的被電腦辨識或被我們閱讀。因為文件中文字資訊的重要性，所以文件影像的文字切割技術已經發展了十多年，但是針對複合式文件影像的研究，仍是一個新鮮的研究課題。目前已有許多學者針對複合式文件影像研究文字切割的方法，但是這些方法依然不能適用於目前報章雜誌上圖文交疊、背景變化多端的複雜型複合式文件影像。像這類複雜型複合式文件影像的文字切割技術，可以說是文件影像處理的一大挑戰。如果可以從不同複雜程度的影像中，將文字切割出來，那就可以適用於所有的文件影像處理。本篇論文研究目標就是發展出一種可以解決複雜型複合式文件影像的文字切割方法，使文件影像壓縮可以達到更高的壓縮倍數與視覺品質。本篇論文提出三個文字切割的方法，這三個方法所處理的複合式文件影像難度依章節順序增高。本文中提出的切割方法應用於文件影像壓縮，可以明顯的看出壓縮倍數與視覺品質優於JPEG或DjVu，而且在本文第三個切割方法(MLSM)中，提出新的區域性區塊特徵分離與拼圖式全圖整合的方法，在解決複雜型複合式文件影像的文字切割問題時，即使在同一張完整的文件影像中，包含各種不同程度的複雜狀況，也可以順利的將不同顏色、不同複雜背景與不同交疊程度的文字切割出來，提高各種複雜型複合式文件影像的壓縮品質。 Traditional image compression methods are not suitable for compound document images because such images include much text. These image data are high-frequency components, many of which are lost in compression. Text and the high-frequency components thus become blurred. Then, the text cannot be recognized easily by the human eye or a computer. The text contains most information, separating the text from a compound document image is one of the most significant areas of research into document images. Document image segmentation, which separates the text from the monochromatic background, has been studied for over ten years. Segmenting compound document images is still an open research field. Many techniques have been developed to segment document images. However, they are insufficient when the background includes sharply varying contours or overlaps with text. Finding a text segmentation method of complex compound documents remains a great challenge and the research field is still young. This dissertation presents three segmentation algorithms for compressing image documents, with a high compression ratio of both color and monochromatic compound document images. The proposed algorithms greatly outperform the famous image compression methods, JPEG and DjVu, and enable the effective extraction of the text from a complex background, achieving a high compression ratio for compound document images.
URI:	http://140.113.39.130/cdrfb3/record/nctu/#GT008712806 http://hdl.handle.net/11536/43001
Appears in Collections:	Thesis

Files in This Item:

280601.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.