標題: Recognition of handwritten Lanna Dhamma characters using a set of optimally designed moment features
作者: Inkeaw, Papangkorn
Charoenkwan, Phasit
Huang, Hui-Ling
Marukatat, Sanparith
Ho, Shinn-Ying
Chaijaruwanich, Jeerayut
生物科技學系
生物資訊及系統生物研究所
Department of Biological Science and Technology
Institude of Bioinformatics and Systems Biology
關鍵字: Lanna Dhamma alphabet;Optical character recognition;Image moments;Writer-independent recognition
公開日期: 1-十二月-2017
摘要: Lanna Dhamma alphabet was used mainly for religious communication in the ancient Lanna Kingdom of Thailand. The old manuscripts using this alphabet are gradually decayed. It is desirable to preserve these valuable manuscripts in machine-encoded text files. Existing works used optical character recognition (OCR) methods based on wavelet transform for recognition of handwritten Lanna Dhamma characters. However, the test accuracy of writer-independent recognition is not satisfactory. This work proposes an OCR method, called LDIMS, for recognition of handwritten Lanna Dhamma characters using a set of optimally designed moment features. The LDIMS using an optimization approach to feature selection consists of three main phases: (1) determination of moment orders for each of eight effective moment descriptors, (2) the best combination of selected moment descriptors and (3) the optimized selection of moment features using an inheritable bi-objective genetic algorithm. The LDIMS has three individual feature sets for the recognition of handwritten Lanna Dhamma characters in upper, middle and lower levels. The character images gleaned from previous work were used as a training dataset. A new character image dataset from different writers was established for evaluating ability of writer-independent recognition. The experimental results show that the LDIMS using four moment descriptors, Meixner, Charlier, Tchebichef and Hahn, has test accuracies of 86.60, 74.38 and 85.82% for the characters in upper, middle and lower levels, respectively. The LDIMS with a mean accuracy of 82.27% performed well in recognizing the handwritten Lanna Dhamma characters from new writers, compared to existing methods using generic descriptors in terms of both accuracy and feature number used. Experimental results show that the generalized OCR method, LDIMS, is also effective for character recognition of digit and English alphabets, compared to existing methods.
URI: http://dx.doi.org/10.1007/s10032-017-0290-x
http://hdl.handle.net/11536/144216
ISSN: 1433-2833
DOI: 10.1007/s10032-017-0290-x
期刊: INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION
Volume: 20
起始頁: 259
結束頁: 274
顯示於類別:期刊論文