Fast algorithms for mining high-utility itemsets with various discount strategies

doi:10.1016/j.aei.2016.02.003

Full metadata record

DC Field	Value	Language
dc.contributor.author	Lin, Jerry Chun-Wei	en_US
dc.contributor.author	Gan, Wensheng	en_US
dc.contributor.author	Fournier-Viger, Philippe	en_US
dc.contributor.author	Hong, Tzung-Pei	en_US
dc.contributor.author	Tseng, Vincent S.	en_US
dc.date.accessioned	2017-04-21T06:55:18Z	-
dc.date.available	2017-04-21T06:55:18Z	-
dc.date.issued	2016-04	en_US
dc.identifier.issn	1474-0346	en_US
dc.identifier.uri	http://dx.doi.org/10.1016/j.aei.2016.02.003	en_US
dc.identifier.uri	http://hdl.handle.net/11536/133815	-
dc.description.abstract	In recent years, mining high-utility itemsets (HUIs) has emerged as a key topic in data mining. It consists of discovering sets of items that generate a high profit in a transactional database by considering both the purchase quantities and the unit profits of items. Many algorithms have been proposed for this task. However, most of them make the unrealistic assumption that the unit profits of items remain unchanged over time. In real life, the profit of an item or itemset varies as a function of cost prices, sales prices and sales strategies. Recently, a three-phase algorithm has been proposed to mine HUIs while considering that each item may have different discount strategies. However, the complete set of HUIs cannot be retrieved based on the traditional TWU model with its defined discount strategies. Moreover, it suffers from the well-known drawbacks of Apriori-based algorithms, such as maintaining a huge number of candidates in memory and repeatedly performing time-consuming database scans. In this paper, the HUI-DTP algorithm for mining HUIs when considering discount strategies of items is introduced. HUI-DTP is designed as a two-phase algorithm to mine the complete set of HUIs based on a novel downward closure property and a vertical TID-list structure. Furthermore, HUI-DMiner is an algorithm relying on a compact data structure (Positive-and-Negative Utility-list, PNU-list) and the properties of two new pruning strategies to efficiently discover HUIs without candidate generation, while considerably reducing the size of the search space. Moreover, a strategy named Estimated Utility Co-occurrence Strategy, which stores the relationships between 2-itemsets, is also applied in the improved HUI-DEMiner algorithm to speed up computation. An extensive experimental study carried out on several real-life datasets shows that the proposed algorithms outperform the previous best algorithm in terms of runtime, memory consumption and scalability. 
(C) 2016 Elsevier Ltd. All rights reserved.	en_US
dc.language.iso	en_US	en_US
dc.subject	High-utility itemsets	en_US
dc.subject	Discount strategies	en_US
dc.subject	Downward closure property	en_US
dc.subject	Pruning strategies	en_US
dc.subject	PNU-list	en_US
dc.title	Fast algorithms for mining high-utility itemsets with various discount strategies	en_US
dc.identifier.doi	10.1016/j.aei.2016.02.003	en_US
dc.identifier.journal	ADVANCED ENGINEERING INFORMATICS	en_US
dc.citation.volume	30	en_US
dc.citation.issue	2	en_US
dc.citation.spage	109	en_US
dc.citation.epage	126	en_US
dc.contributor.department	資訊工程學系	zh_TW
dc.contributor.department	Department of Computer Science	en_US
dc.identifier.wosnumber	WOS:000376694600002	en_US
Appears in Collections:	Articles
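
The abstract defines high-utility itemset mining as finding itemsets whose profit, computed from purchase quantities and unit profits, is high. The following is a minimal brute-force sketch of that definition only. It is not HUI-DTP, HUI-DMiner or HUI-DEMiner, and it models no discount strategy. The toy transactions, the unit profits, the threshold and all function names are assumptions made for illustration.

```python
from itertools import combinations

# Hypothetical toy data: each transaction maps item -> purchased quantity.
transactions = [
    {"a": 2, "b": 1},
    {"a": 1, "c": 3},
    {"a": 1, "b": 2, "c": 1},
]
# Hypothetical unit profits, assumed fixed (no discount strategy is modeled).
unit_profit = {"a": 5, "b": 2, "c": 1}


def itemset_utility(itemset, transaction):
    """Utility of an itemset in one transaction; 0 if it is not fully contained."""
    if not all(item in transaction for item in itemset):
        return 0
    return sum(transaction[item] * unit_profit[item] for item in itemset)


def total_utility(itemset):
    """Utility of an itemset summed over the whole database."""
    return sum(itemset_utility(itemset, t) for t in transactions)


def mine_huis(min_utility):
    """Enumerate every itemset and keep those reaching the threshold.

    Exhaustive enumeration is exponential in the number of items; it is
    used here only to make the definition concrete.
    """
    items = sorted(unit_profit)
    result = {}
    for size in range(1, len(items) + 1):
        for itemset in combinations(items, size):
            utility = total_utility(itemset)
            if utility >= min_utility:
                result[itemset] = utility
    return result


if __name__ == "__main__":
    for itemset, utility in mine_huis(14).items():
        print(itemset, utility)
```

In this toy database the itemset (a, b) has utility 21 while its subset (a) has utility 20 and its superset (a, b, c) has utility 10, so utility is neither monotone nor anti-monotone. That is why pruning cannot rely on utility itself and instead needs an upper bound with a downward closure property, such as the TWU model the abstract mentions.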