标题: | Clustering documents with labeled and unlabeled documents using fuzzy semi-Kmeans |
作者: | Liu, Chien-Liang Chang, Tao-Hsing Li, Hsuan-Hsun 資訊工程學系 Department of Computer Science |
关键字: | Fuzzy clustering;Semi-supervised learning;Text mining;Fuzzy semi-Kmeans |
公开日期: | 16-六月-2013 |
摘要: | While focusing on document clustering, this work presents a fuzzy semi-supervised clustering algorithm called fuzzy semi-Kmeans. The fuzzy semi-Kmeans is an extension of K-means clustering model, and it is inspired by an EM algorithm and a Gaussian mixture model. Additionally, the fuzzy semi-Kmeans provides the flexibility to employ different fuzzy membership functions to measure the distance between data. This work employs Gaussian weighting function to conduct experiments, but cosine similarity function can be used as well. This work conducts experiments on three data sets and compares fuzzy semi-Kmeans with several methods. The experimental results indicate that fuzzy semi-Kmeans can generally outperform the other methods. (C) 2013 Elsevier B.V. All rights reserved. |
URI: | http://dx.doi.org/10.1016/j.fss.2013.01.004 http://hdl.handle.net/11536/21844 |
ISSN: | 0165-0114 |
DOI: | 10.1016/j.fss.2013.01.004 |
期刊: | FUZZY SETS AND SYSTEMS |
Volume: | 221 |
Issue: | |
起始页: | 48 |
结束页: | 64 |
显示于类别: | Articles |
文件中的档案:
If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.