標題: | Clustering data with partial background information |
作者: | Liu, Chien-Liang Hsaio, Wen-Hoar Chang, Tao-Hsing Li, Hsuan-Hsun 資訊工程學系 工業工程與管理學系 Department of Computer Science Department of Industrial Engineering and Management |
關鍵字: | Clustering;Fuzzy clustering;Semi-supervised learning;Non-negative matrix factorization (NMF) |
公開日期: | 1-五月-2019 |
摘要: | Clustering with partial supervision background information or semi-supervised clustering, learning from a combination of both labeled and unlabeled data, has received a lot of attention over the last decade. The supervisory information is usually used as the constraints to bias clustering towards a good region of search space. This paper proposes a semi-supervised algorithm, called constrained non-negative matrix factorization (Constrained-NMF), with a few labeled examples as constraints to improve performance. The proposed algorithm is a matrix factorization algorithm, in which initialization of matrices is required at the beginning. Although the benefits of good initialization are well-known, randomized seeding of basis matrix and coefficient matrix is still the standard approach for many non-negative matrix factorization (NMF) algorithms. This work devises an algorithm called entropy-based weighted semi-supervised fuzzy c-means (EWSS-FCM) algorithm to initialize the seeds. The experimental results indicate that the proposed Constrained-NMF can benefit from the initialization obtained from EWSS-FCM, which emphasizes the role of labeled examples and automatically weights them during the course of clustering. This work considers labeled examples in the objective functions to devise the two algorithms, in which the labeled information is propagated to unlabeled examples iteratively. We further analyze the proposed Constrained-NMF and give convergence justifications. The experiments are conducted on five real data sets, and experimental results indicate that the proposed algorithm generally outperforms the other alternatives. |
URI: | http://dx.doi.org/10.1007/s13042-018-0790-0 http://hdl.handle.net/11536/151919 |
ISSN: | 1868-8071 |
DOI: | 10.1007/s13042-018-0790-0 |
期刊: | INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS |
Volume: | 10 |
Issue: | 5 |
起始頁: | 1123 |
結束頁: | 1138 |
顯示於類別: | 期刊論文 |