標題: Efficient Algorithms for Mining Top-K High Utility Itemsets
作者: Tseng, Vincent S.
Wu, Cheng-Wei
Fournier-Viger, Philippe
Yu, Philip S.
資訊工程學系
Department of Computer Science
關鍵字: Utility mining;high utility itemset mining;top-k pattern mining;top-k high utility itemset mining
公開日期: 1-一月-2016
摘要: High utility itemsets (HUIs) mining is an emerging topic in data mining, which refers to discovering all itemsets having a utility meeting a user-specified minimum utility threshold min_util. However, setting min_util appropriately is a difficult problem for users. Generally speaking, finding an appropriate minimum utility threshold by trial and error is a tedious process for users. If min_util is set too low, too many HUIs will be generated, which may cause the mining process to be very inefficient. On the other hand, if min_util is set too high, it is likely that no HUIs will be found. In this paper, we address the above issues by proposing a new framework for top-k high utility itemset mining, where k is the desired number of HUIs to be mined. Two types of efficient algorithms named TKU (mining Top-K Utility itemsets) and TKO (mining Top-K utility itemsets in One phase) are proposed for mining such itemsets without the need to set min_util. We provide a structural comparison of the two algorithms with discussions on their advantages and limitations. Empirical evaluations on both real and synthetic datasets show that the performance of the proposed algorithms is close to that of the optimal case of state-of-the-art utility mining algorithms.
URI: http://dx.doi.org/10.1109/TKDE.2015.2458860
http://hdl.handle.net/11536/129515
ISSN: 1041-4347
DOI: 10.1109/TKDE.2015.2458860
期刊: IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING
Volume: 28
起始頁: 54
結束頁: 67
顯示於類別:期刊論文