標題: | Multiclass machine learning classification of functional brain images for Parkinson's disease stage prediction |
作者: | Huang, Guan-Hua Lin, Chih-Hsuan Cai, Yu-Ren Chen, Tai-Been Hsu, Shih-Yen Lu, Nan-Han Chen, Huei-Yung Wu, Yi-Chen 統計學研究所 Institute of Statistics |
關鍵字: | deep neural network;functional brain image;machine learning;supervised classification |
公開日期: | 1-Oct-2020 |
摘要: | We analyzed a data set containing functional brain images from 6 healthy controls and 196 individuals with Parkinson's disease (PD), who were divided into five stages according to illness severity. The goal was to predict patients' PD illness stages by using their functional brain images. We employed the following prediction approaches: multivariate statistical methods (linear discriminant analysis, support vector machine, decision tree, and multilayer perceptron [MLP]), ensemble learning models (random forest [RF] and adaptive boosting), and deep convolutional neural network (CNN). For statistical and ensemble models, various feature extraction approaches (principal component analysis [PCA], multilinear PCA, intensity summary statistics [IStat], and Laws' texture energy measure) were employed to extract features, the synthetic minority over-sampling technique was used to address imbalanced data, and the optimal combination of hyperparameters was found using a grid search. For CNN modeling, we applied an image augmentation technique to increase and balance data sizes over different disease stages. We adopted transfer learning to incorporate pretrained VGG16 weights and architecture into the model fitting, and we also tested a state-of-the-art machine learning model that could automatically generate an optimal neural architecture. We found that IStat consistently outperformed other feature extraction approaches. MLP and RF were the analytic approaches with the highest prediction accuracy rate for multivariate statistical and ensemble learning models, respectively. Overall, the deep CNN model with pretrained VGG16 weights and architecture outperformed other approaches; it captured critical features from imaging, effectively distinguished between normal controls and patients with PD, and achieved the highest classification accuracy. |
URI: | http://dx.doi.org/10.1002/sam.11480 http://hdl.handle.net/11536/155368 |
ISSN: | 1932-1864 |
DOI: | 10.1002/sam.11480 |
期刊: | STATISTICAL ANALYSIS AND DATA MINING |
Volume: | 13 |
Issue: | 5 |
起始頁: | 508 |
結束頁: | 523 |
Appears in Collections: | Articles |