Title: Supervised learning of multivariate skew normal mixture models with missing information
Authors: Lin, Tzy-Chy
Lin, Tsung-I
Institute of Statistics
Keywords: Classifier;EM algorithm;Ignorable;Incomplete data;MSN model;Multivariate truncated normal
Issue Date: 1-Jun-2010
Abstract: We establish computationally flexible tools for the analysis of multivariate skew normal mixtures when missing values occur in data. To facilitate the computation and simplify the theoretical derivation, two auxiliary permutation matrices are incorporated into the model for the determination of observed and missing components of each observation and are manifestly effective in reducing the computational complexity. We present an analytically feasible EM algorithm for the supervised learning of parameters as well as missing observations. The proposed mixture analyzer, including the most commonly used Gaussian mixtures as a special case, allows practitioners to handle incomplete multivariate data sets in a wide range of considerations. The methodology is illustrated through a real data set with varying proportions of synthetic missing values generated by MCAR and MAR mechanisms and shown to perform well on classification tasks.
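The abstract mentions two auxiliary matrices that pick out the observed and missing components of each observation. The record does not give their construction, so the following is only a minimal sketch under an assumed form: 0/1 row-selection matrices built from the positions of missing entries. The names `selection_matrices`, `matvec`, `O`, and `M` are hypothetical and are not taken from the paper.

```python
# Illustrative sketch only: an assumed construction of the "observed" and
# "missing" selection matrices, not the authors' implementation.

def selection_matrices(y):
    """Return (O, M) for a vector y in which missing entries are None.

    O has one row per observed entry and M one row per missing entry;
    each row is a unit vector selecting that coordinate of y.
    """
    p = len(y)
    observed = [j for j, v in enumerate(y) if v is not None]
    missing = [j for j, v in enumerate(y) if v is None]
    O = [[1 if k == j else 0 for k in range(p)] for j in observed]
    M = [[1 if k == j else 0 for k in range(p)] for j in missing]
    return O, M


def matvec(A, x):
    """Plain matrix-vector product on nested lists."""
    return [sum(a * b for a, b in zip(row, x)) for row in A]


# One observation with its second and fourth components missing.
y = [2.5, None, 0.7, None]
O, M = selection_matrices(y)

# Use a placeholder value for missing entries so the product is defined;
# O discards those positions, so the placeholder never reaches y_obs.
filled = [0.0 if v is None else v for v in y]
y_obs = matvec(O, filled)
```

Stacking `O` on top of `M` gives a full permutation matrix, which is one plausible reading of the abstract's wording. Under this assumption, the observed part of a component mean or covariance would be obtained by the same products (for example, `O` applied to the mean vector).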
URI: http://dx.doi.org/10.1007/s00180-009-0169-5
http://hdl.handle.net/11536/5339
ISSN: 0943-4062
DOI: 10.1007/s00180-009-0169-5
Journal: COMPUTATIONAL STATISTICS
Volume: 25
Issue: 2
Start Page: 183
End Page: 201
Appears in Collections: Journal Articles


Files in This Item:

  1. 000276653900001.pdf
