Full metadata record
DC FieldValueLanguage
dc.contributor.authorLiu, Chien-Liangen_US
dc.contributor.authorHsaio, Wen-Hoaren_US
dc.contributor.authorChang, Tao-Hsingen_US
dc.contributor.authorLi, Hsuan-Hsunen_US
dc.date.accessioned2019-06-03T01:08:32Z-
dc.date.available2019-06-03T01:08:32Z-
dc.date.issued2019-05-01en_US
dc.identifier.issn1868-8071en_US
dc.identifier.urihttp://dx.doi.org/10.1007/s13042-018-0790-0en_US
dc.identifier.urihttp://hdl.handle.net/11536/151919-
dc.description.abstractClustering with partial supervision background information or semi-supervised clustering, learning from a combination of both labeled and unlabeled data, has received a lot of attention over the last decade. The supervisory information is usually used as the constraints to bias clustering towards a good region of search space. This paper proposes a semi-supervised algorithm, called constrained non-negative matrix factorization (Constrained-NMF), with a few labeled examples as constraints to improve performance. The proposed algorithm is a matrix factorization algorithm, in which initialization of matrices is required at the beginning. Although the benefits of good initialization are well-known, randomized seeding of basis matrix and coefficient matrix is still the standard approach for many non-negative matrix factorization (NMF) algorithms. This work devises an algorithm called entropy-based weighted semi-supervised fuzzy c-means (EWSS-FCM) algorithm to initialize the seeds. The experimental results indicate that the proposed Constrained-NMF can benefit from the initialization obtained from EWSS-FCM, which emphasizes the role of labeled examples and automatically weights them during the course of clustering. This work considers labeled examples in the objective functions to devise the two algorithms, in which the labeled information is propagated to unlabeled examples iteratively. We further analyze the proposed Constrained-NMF and give convergence justifications. The experiments are conducted on five real data sets, and experimental results indicate that the proposed algorithm generally outperforms the other alternatives.en_US
dc.language.isoen_USen_US
dc.subjectClusteringen_US
dc.subjectFuzzy clusteringen_US
dc.subjectSemi-supervised learningen_US
dc.subjectNon-negative matrix factorization (NMF)en_US
dc.titleClustering data with partial background informationen_US
dc.typeArticleen_US
dc.identifier.doi10.1007/s13042-018-0790-0en_US
dc.identifier.journalINTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICSen_US
dc.citation.volume10en_US
dc.citation.issue5en_US
dc.citation.spage1123en_US
dc.citation.epage1138en_US
dc.contributor.department資訊工程學系zh_TW
dc.contributor.department工業工程與管理學系zh_TW
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.contributor.departmentDepartment of Industrial Engineering and Managementen_US
dc.identifier.wosnumberWOS:000465596100024en_US
dc.citation.woscount0en_US
Appears in Collections:Articles