標題: Clustering data with partial background information
作者: Liu, Chien-Liang
Hsaio, Wen-Hoar
Chang, Tao-Hsing
Li, Hsuan-Hsun
資訊工程學系
工業工程與管理學系
Department of Computer Science
Department of Industrial Engineering and Management
關鍵字: Clustering;Fuzzy clustering;Semi-supervised learning;Non-negative matrix factorization (NMF)
公開日期: 1-五月-2019
摘要: Clustering with partial supervision background information or semi-supervised clustering, learning from a combination of both labeled and unlabeled data, has received a lot of attention over the last decade. The supervisory information is usually used as the constraints to bias clustering towards a good region of search space. This paper proposes a semi-supervised algorithm, called constrained non-negative matrix factorization (Constrained-NMF), with a few labeled examples as constraints to improve performance. The proposed algorithm is a matrix factorization algorithm, in which initialization of matrices is required at the beginning. Although the benefits of good initialization are well-known, randomized seeding of basis matrix and coefficient matrix is still the standard approach for many non-negative matrix factorization (NMF) algorithms. This work devises an algorithm called entropy-based weighted semi-supervised fuzzy c-means (EWSS-FCM) algorithm to initialize the seeds. The experimental results indicate that the proposed Constrained-NMF can benefit from the initialization obtained from EWSS-FCM, which emphasizes the role of labeled examples and automatically weights them during the course of clustering. This work considers labeled examples in the objective functions to devise the two algorithms, in which the labeled information is propagated to unlabeled examples iteratively. We further analyze the proposed Constrained-NMF and give convergence justifications. The experiments are conducted on five real data sets, and experimental results indicate that the proposed algorithm generally outperforms the other alternatives.
URI: http://dx.doi.org/10.1007/s13042-018-0790-0
http://hdl.handle.net/11536/151919
ISSN: 1868-8071
DOI: 10.1007/s13042-018-0790-0
期刊: INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS
Volume: 10
Issue: 5
起始頁: 1123
結束頁: 1138
顯示於類別:期刊論文