標題: On Incremental High Utility Sequential Pattern Mining
作者: Wang, Jun-Zhe
Huang, Jiun-Long
資訊工程學系
Department of Computer Science
關鍵字: High utility sequential pattern mining;incremental mining;incremental high utility sequential pattern mining;utility mining
公開日期: 1-Jul-2018
摘要: High utility sequential pattern (HUSP) mining is an emerging topic in pattern mining, and only a few algorithms have been proposed to address it. In practice, most sequence databases usually grow over time, and it is inefficient for existing algorithms to mine HUSPs from scratch when databases grow with a small portion of updates. In view of this, we propose the IncUSP-Miner(+) algorithm to mine HUSPs incrementally. Specifically, to avoid redundant re-computations, we propose a tighter upper bound of the utility of a sequence, called Tight Sequence Utility (TSU), and then we design a novel data structure, called the candidate pattern tree, to buffer the sequences whose TSU values are greater than or equal to the minimum utility threshold in the original database. Accordingly, to avoid keeping a huge amount of utility information for each sequence, a set of concise utility information is designed to be stored in each tree node. To improve the mining efficiency, several strategies are proposed to reduce the amount of computation for utility update and the scopes of database scans. Moreover, several strategies are also proposed to properly adjust the candidate pattern tree for the support of multiple database updates. Experimental results on some real and synthetic datasets show that IncUSP-Miner + is able to efficiently mine HUSPs incrementally.
URI: http://dx.doi.org/10.1145/3178114
http://hdl.handle.net/11536/151685
ISSN: 2157-6904
DOI: 10.1145/3178114
期刊: ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY
Volume: 9
Issue: 5
起始頁: 0
結束頁: 0
Appears in Collections:Articles