完整後設資料紀錄
DC 欄位語言
dc.contributor.authorChen, Chun-Lingen_US
dc.contributor.authorTseng, Frank S. C.en_US
dc.contributor.authorLiang, Tyneen_US
dc.date.accessioned2019-04-02T06:00:22Z-
dc.date.available2019-04-02T06:00:22Z-
dc.date.issued2010-11-01en_US
dc.identifier.issn0169-023Xen_US
dc.identifier.urihttp://dx.doi.org/10.1016/j.datak.2010.08.003en_US
dc.identifier.urihttp://hdl.handle.net/11536/150143-
dc.description.abstractWith the rapid growth of text documents, document clustering has become one of the main techniques for organizing large amount of documents into a small number of meaningful clusters. However, there still exist several challenges for document clustering, such as high dimensionality, scalability, accuracy, meaningful cluster labels, overlapping clusters, and extracting semantics from texts. In order to improve the quality of document clustering results, we propose an effective Fuzzy-based Multi-label Document Clustering (FMDC) approach that integrates fuzzy association rule mining with an existing ontology WordNet to alleviate these problems. In our approach, the key terms will be extracted from the document set, and the initial representation of all documents is further enriched by using hypernyms of WordNet in order to exploit the semantic relations between terms. Then, a fuzzy association rule mining algorithm for texts is employed to discover a set of highly-related fuzzy frequent itemsets, which contain key terms to be regarded as the labels of the candidate clusters. Finally, each document is dispatched into more than one target cluster by referring to these candidate clusters, and then the highly similar target clusters are merged. We conducted experiments to evaluate the performance based on Classic, Re0, R8, and WebKB datasets. The experimental results proved that our approach outperforms the influential document clustering methods with higher accuracy. Therefore, our approach not only provides more general and meaningful labels for documents, but also effectively generates overlapping clusters. (C) 2010 Elsevier B.V. All rights reserved.en_US
dc.language.isoen_USen_US
dc.subjectFuzzy association rule miningen_US
dc.subjectText miningen_US
dc.subjectDocument clusteringen_US
dc.subjectWordNeten_US
dc.subjectFrequent itemsetsen_US
dc.titleAn integration of Word Net and fuzzy association rule mining for multi-label document clusteringen_US
dc.typeArticleen_US
dc.identifier.doi10.1016/j.datak.2010.08.003en_US
dc.identifier.journalDATA & KNOWLEDGE ENGINEERINGen_US
dc.citation.volume69en_US
dc.citation.spage1208en_US
dc.citation.epage1226en_US
dc.contributor.department資訊工程學系zh_TW
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.identifier.wosnumberWOS:000283975800009en_US
dc.citation.woscount27en_US
顯示於類別:期刊論文