標題: Development of Novel Lip-Reading Recognition Algorithm
作者: Lin, Bor-Shing
Yao, Yu-Hsien
Liu, Ching-Feng
Lien, Ching-Feng
Lin, Bor-Shyh
影像與生醫光電研究所
Institute of Imaging and Biomedical Photonics
關鍵字: Laryngectomy;lip-reading recogonition;mouth region of interest;visual-only speech recognition;vowels recognition
公開日期: 1-一月-2017
摘要: Total laryngectomy is a common treatment for patients with advanced laryngeal and hypopharyngeal cancer, but it is also a result from the loss of the natural voice and directly affects the basic communication functions in daily life. Reconstructing the basic communication function is an important issue for these patients after total laryngectomy surgery. Recently, the image processing technique for lip-reading recognition has been widely developed and applied in various kinds of applications. It is also one of the possibly alternative approaches to reconstructing the basic communication function for these patients after total laryngectomy surgery. Although many human lip-reading recognition methods have been developed to detect lip contour precisely, detecting pronouncing lip contour effectively is still a difficult challenge. In this paper, a novel lip-reading recognition algorithm was proposed to recognize English vowels from the lip contour when speaking. Here, several criteria for detecting the mouth region of interest (ROI) were designed to reduce the error rate of detecting the mouth ROI and lip contour. Moreover, several lip parameters, including the width, height, contour points, area, and the ratio (width/height) of lips, were used to recognize the lip contour and English vowels when speaking. The advantages of the proposed method are that it could detect the mouth ROI automatically, reduce the influence of individual differences, such as the individual lip shape or makeup effect, and it also could perform a good performance without pretraining. Finally, the performance of lip-reading recognition under different backgrounds and individual differences was also tested, and the accuracy of the proposed algorithm on lip-reading recognition was over 80%.
URI: http://dx.doi.org/10.1109/ACCESS.2017.2649838
http://hdl.handle.net/11536/144819
ISSN: 2169-3536
DOI: 10.1109/ACCESS.2017.2649838
期刊: IEEE ACCESS
Volume: 5
起始頁: 794
結束頁: 801
顯示於類別:期刊論文


文件中的檔案:

  1. d980a2ff49ee33dfc6306ea97e636882.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。