標題: An integration of fuzzy association rules and WordNet for document clustering
作者: Chen, Chun-Ling
Tseng, Frank S. C.
Liang, Tyne
資訊工程學系
Department of Computer Science
關鍵字: Fuzzy association rule mining;Text mining;Document clustering;Frequent itemsets;WordNet
公開日期: 1-Sep-2011
摘要: With the rapid growth of text documents, document clustering technique is emerging for efficient document retrieval and better document browsing. Recently, some methods had been proposed to resolve the problems of high dimensionality, scalability, accuracy, and meaningful cluster labels by using frequent itemsets derived from association rule mining for clustering documents. In order to improve the quality of document clustering results, we propose an effective Fuzzy Frequent Itemset-based Document Clustering (F(2)IDC) approach that combines fuzzy association rule mining with the background knowledge embedded in WordNet. A term hierarchy generated from WordNet is applied to discover generalized frequent itemsets as candidate cluster labels for grouping documents. We have conducted experiments to evaluate our approach on Classic4, Re0, R8, and WebKB datasets. Our experimental results show that our proposed approach indeed provide more accurate clustering results than prior influential clustering methods presented in recent literature.
URI: http://dx.doi.org/10.1007/s10115-010-0364-2
http://hdl.handle.net/11536/19892
ISSN: 0219-1377
DOI: 10.1007/s10115-010-0364-2
期刊: KNOWLEDGE AND INFORMATION SYSTEMS
Volume: 28
Issue: 3
起始頁: 687
結束頁: 708
Appears in Collections:Articles


Files in This Item:

  1. 000294229000009.pdf

If it is a zip file, please download the file and unzip it, then open index.html in a browser to view the full text content.