Full metadata record
DC FieldValueLanguage
dc.contributor.authorLin, Jerry Chun-Weien_US
dc.contributor.authorGan, Wenshengen_US
dc.contributor.authorFournier-Viger, Philippeen_US
dc.contributor.authorHong, Tzung-Peien_US
dc.contributor.authorTseng, Vincent S.en_US
dc.date.accessioned2017-04-21T06:55:18Z-
dc.date.available2017-04-21T06:55:18Z-
dc.date.issued2016-04en_US
dc.identifier.issn1474-0346en_US
dc.identifier.urihttp://dx.doi.org/10.1016/j.aei.2016.02.003en_US
dc.identifier.urihttp://hdl.handle.net/11536/133815-
dc.description.abstractIn recent years, mining high-utility itemsets (HUIs) has emerged as a key topic in data mining. It consists of discovering sets of items generating a high profit in a transactional database by considering both purchase quantities and unit profits of items. Many algorithms have been proposed for this task. However, most of them assume the unrealistic assumption that unit profits of items remain unchanged over time. But in real-life, the profit of an item or itemset varies as a function of cost prices, sales prices and sale strategies. Recently, a three-phase algorithm has been proposed to mine HUIs, while considering that each item may have different discount strategies. However, the complete set of HUIs cannot be retrieved based on the traditional TWU model with its defined discount strategies. Moreover, it suffers from the well-known drawbacks of Apriori-based algorithms such as maintaining a huge amount of candidates in memory and repeatedly performing time-consuming database scans. In this paper, a HUI-DTP algorithm for mining HUIs when considering discount strategies of items is introduced. The HUI-DTP is designed as a two-phase algorithm to mine the complete set of HUIs based on a novel downward closure property and a vertical TID-list structure. Furthermore, the HUI-DMiner is an algorithm relying on a compact data structure (Positive-and-Negative Utility-list, PNU-list) and properties of two new pruning strategies to efficiently discover HUIs without candidate generation, while considerably reducing the size of the search space. Moreover, a strategy named Estimated Utility Co-occurrence Strategy which stores the relationships between 2-itemsets is also applied in the improved HUI-DEMiner algorithm to speed up computation. An extensive experimental study carried on several real-life datasets shows that the proposed algorithms outperform the previous.best algorithm in terms of runtime, memory consumption and scalability. (C) 2016 Elsevier Ltd. All rights reserved.en_US
dc.language.isoen_USen_US
dc.subjectHigh-utility itemsetsen_US
dc.subjectDiscount strategiesen_US
dc.subjectDownward closure propertyen_US
dc.subjectPruning strategiesen_US
dc.subjectPNU-listen_US
dc.titleFast algorithms for mining high-utility itemsets with various discount strategiesen_US
dc.identifier.doi10.1016/j.aei.2016.02.003en_US
dc.identifier.journalADVANCED ENGINEERING INFORMATICSen_US
dc.citation.volume30en_US
dc.citation.issue2en_US
dc.citation.spage109en_US
dc.citation.epage126en_US
dc.contributor.department資訊工程學系zh_TW
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.identifier.wosnumberWOS:000376694600002en_US
Appears in Collections:Articles