Full metadata record
DC Field | Value | Language
dc.contributor.author | 許頌伶 | zh_TW
dc.contributor.author | 蔡淳仁 | zh_TW
dc.contributor.author | Hsu, Sung-Ling | en_US
dc.contributor.author | Tsai, Chun-Jen | en_US
dc.date.accessioned | 2018-01-24T07:37:57Z | -
dc.date.available | 2018-01-24T07:37:57Z | -
dc.date.issued | 2016 | en_US
dc.identifier.uri | http://etd.lib.nctu.edu.tw/cdrfb3/record/nctu/#GT070356074 | en_US
dc.identifier.uri | http://hdl.handle.net/11536/139377 | -
dc.description.abstract | 本論文使用三維手部模型訓練類神經網路,再利用訓練完成的類神經網路模型對二維手勢影像進行辨識。 用於訓練的三維模型是由Blender三維繪圖軟體繪製而成,再加上手指轉動角度的限制,讓三維模型更貼近真實手掌與手指的運動方式。此外,根據使用性質的不同,設計了三種不同的手勢組合,分別將手勢分為:243種、36種、32種。243種手勢組合提供了所有手指旋轉角度的集合,可以用來更準確地估測每根手指的彎曲度;36種手勢組合是根據常見的手勢動作而設計;32種手勢組合則可以單純地用來辨識每根手指是否有彎曲。三維模型可依照不同的手勢組合產生相對應的二維影像(Blender Images)輸出,作為類神經網路的訓練資料。 類神經網路的部分是採用Caffe提供的AlexNet架構。將三維模型產生的二維影像(Blender Images)、加上少量的真實影像作為訓練數據集,藉由調整類神經網路的訓練參數與訓練數據集的大小、特性等等,讓三維手部模型訓練而成的類神經網路能夠成功辨識真實的二維手勢影像,以求降低深度學習對真實訓練資料量的要求。 | zh_TW
dc.description.abstract | This thesis presents a hand gesture recognition method, Synthetically Trained Deep Learning (STDL). STDL uses a 3D hand model to train a neural network, which is then used to classify real hand gesture images. STDL is composed of two major parts. The first is 3D hand modeling done with Blender, an open-source 3D computer graphics package. To make the 3D hand model more realistic, motion constraints on the finger joints are added. Furthermore, three different sets of hand gestures are designed for distinct purposes. The sets contain 243, 36, and 32 gestures, respectively. The set with 243 gestures covers the entire range of finger rotation angles and can be used to estimate the pose of each finger more accurately. The set with 36 gestures is designed around common gestures. Finally, the set with 32 gestures focuses on whether each finger is bent or not. For each set of hand gestures to be recognized, Blender is used to generate the 2D training images from the 3D hand models. The second part of STDL is the deep learning module. The neural networks used in STDL adopt the AlexNet architecture provided by Caffe, a widely used deep learning framework. With a large number of 2D Blender images plus a few real images as the training and validation sets, and with the neural network training parameters and the sizes of the training and validation sets adjusted, the trained model can successfully classify real hand gesture images. STDL can substantially reduce the amount of real image data required for deep learning training. | en_US
dc.language.iso | zh_TW | en_US
dc.subject | 手勢辨識 | zh_TW
dc.subject | 類神經網路 | zh_TW
dc.subject | 深度學習 | zh_TW
dc.subject | 三維手部模型 | zh_TW
dc.subject | Hand Gesture Recognition | en_US
dc.subject | Neural Networks | en_US
dc.subject | Deep Learning | en_US
dc.subject | 3D Hand Modeling | en_US
dc.title | 利用三維模型訓練類神經網路的手勢辨識技術 | zh_TW
dc.title | Hand Gesture Recognition with Synthetically Trained Deep Learning | en_US
dc.type | Thesis | en_US
dc.contributor.department | 資訊科學與工程研究所 | zh_TW
Appears in Collections: Thesis
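
Note on the gesture-set sizes quoted in the abstract: the 32-gesture set, which records only whether each finger is bent, matches 2^5 for five fingers. The abstract does not say how the 243-gesture set is built; the sketch below assumes three bend levels per finger, which is consistent with 3^5 = 243 but is not confirmed by the record. The finger names, level labels, and the `gesture_set` helper are hypothetical, and the 36-gesture set is not enumerated because the abstract describes it as designed around common gestures.

```python
from itertools import product

# Hypothetical finger labels; the record does not name them.
FINGERS = ("thumb", "index", "middle", "ring", "pinky")


def gesture_set(levels):
    """Enumerate every assignment of one bend level to each finger."""
    return [
        dict(zip(FINGERS, combo))
        for combo in product(levels, repeat=len(FINGERS))
    ]


# Bent or not bent per finger: 2**5 combinations.
binary_set = gesture_set(("straight", "bent"))

# ASSUMPTION: three bend levels per finger, giving 3**5 combinations.
three_level_set = gesture_set(("straight", "half", "bent"))

print(len(binary_set), len(three_level_set))
```

Each entry maps a finger to a bend level, so a list built this way could serve as the label set for rendered training images under the stated assumption.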