Full metadata record
DC FieldValueLanguage
dc.contributor.authorZida, Souleymaneen_US
dc.contributor.authorFournier-Viger, Philippeen_US
dc.contributor.authorLin, Jerry Chun-Weien_US
dc.contributor.authorWu, Cheng-Weien_US
dc.contributor.authorTseng, Vincent S.en_US
dc.date.accessioned2018-08-21T05:53:57Z-
dc.date.available2018-08-21T05:53:57Z-
dc.date.issued2017-05-01en_US
dc.identifier.issn0219-1377en_US
dc.identifier.urihttp://dx.doi.org/10.1007/s10115-016-0986-0en_US
dc.identifier.urihttp://hdl.handle.net/11536/145377-
dc.description.abstractIn recent years, high-utility itemset mining has emerged as an important data mining task. However, it remains computationally expensive both in terms of runtime and memory consumption. It is thus an important challenge to design more efficient algorithms for this task. In this paper, we address this issue by proposing a novel algorithm named EFIM (EFficient high-utility Itemset Mining), which introduces several new ideas to more efficiently discover high-utility itemsets. EFIM relies on two new upper bounds named revised sub-tree utility and local utility to more effectively prune the search space. It also introduces a novel array-based utility counting technique named Fast Utility Counting to calculate these upper bounds in linear time and space. Moreover, to reduce the cost of database scans, EFIM proposes efficient database projection and transaction merging techniques named High-utility Database Projection and High-utility Transaction Merging (HTM), also performed in linear time. An extensive experimental study on various datasets shows that EFIM is in general two to three orders of magnitude faster than the state-of-art algorithms HUP, HUI-Miner, HUP-Miner, FHM and UP-Growth+ on dense datasets and performs quite well on sparse datasets. Moreover, a key advantage of EFIM is its low memory consumption.en_US
dc.language.isoen_USen_US
dc.subjectPattern miningen_US
dc.subjectItemset mining, High-utility miningen_US
dc.subjectFast Utility Counting, High-utility database merging and projectionen_US
dc.titleEFIM: a fast and memory efficient algorithm for high-utility itemset miningen_US
dc.typeArticleen_US
dc.identifier.doi10.1007/s10115-016-0986-0en_US
dc.identifier.journalKNOWLEDGE AND INFORMATION SYSTEMSen_US
dc.citation.volume51en_US
dc.citation.spage595en_US
dc.citation.epage625en_US
dc.contributor.department資訊工程學系zh_TW
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.identifier.wosnumberWOS:000399408200009en_US
Appears in Collections:Articles