標題: | Supervised learning of multivariate skew normal mixture models with missing information |
作者: | Lin, Tzy-Chy Lin, Tsung-I 統計學研究所 Institute of Statistics |
關鍵字: | Classifier;EM algorithm;Ignorable;Incomplete data;MSN model;Multivariate truncated normal |
公開日期: | 1-六月-2010 |
摘要: | We establish computationally flexible tools for the analysis of multivariate skew normal mixtures when missing values occur in data. To facilitate the computation and simplify the theoretical derivation, two auxiliary permutation matrices are incorporated into the model for the determination of observed and missing components of each observation and are manifestly effective in reducing the computational complexity. We present an analytically feasible EM algorithm for the supervised learning of parameters as well as missing observations. The proposed mixture analyzer, including the most commonly used Gaussian mixtures as a special case, allows practitioners to handle incomplete multivariate data sets in a wide range of considerations. The methodology is illustrated through a real data set with varying proportions of synthetic missing values generated by MCAR and MAR mechanisms and shown to perform well on classification tasks. |
URI: | http://dx.doi.org/10.1007/s00180-009-0169-5 http://hdl.handle.net/11536/5339 |
ISSN: | 0943-4062 |
DOI: | 10.1007/s00180-009-0169-5 |
期刊: | COMPUTATIONAL STATISTICS |
Volume: | 25 |
Issue: | 2 |
起始頁: | 183 |
結束頁: | 201 |
顯示於類別: | 期刊論文 |