Full metadata record
DC FieldValueLanguage
dc.contributor.authorLin, Jerry Chun-Weien_US
dc.contributor.authorGan, Wenshengen_US
dc.contributor.authorFournier-Viger, Philippeen_US
dc.contributor.authorHong, Tzung-Peien_US
dc.contributor.authorTseng, Vincent S.en_US
dc.date.accessioned2017-04-21T06:55:47Z-
dc.date.available2017-04-21T06:55:47Z-
dc.date.issued2016-01en_US
dc.identifier.issn0924-669Xen_US
dc.identifier.urihttp://dx.doi.org/10.1007/s10489-015-0703-9en_US
dc.identifier.urihttp://hdl.handle.net/11536/133425-
dc.description.abstractFrequent itemset mining (FIM) is a fundamental research topic, which consists of discovering useful and meaningful relationships between items in transaction databases. However, FIM suffers from two important limitations. First, it assumes that all items have the same importance. Second, it ignores the fact that data collected in a real-life environment is often inaccurate, imprecise, or incomplete. To address these issues and mine more useful and meaningful knowledge, the problems of weighted and uncertain itemset mining have been respectively proposed, where a user may respectively assign weights to items to specify their relative importance, and specify existential probabilities to represent uncertainty in transactions. However, no work has addressed both of these issues at the same time. In this paper, we address this important research problem by designing a new type of patterns named high expected weighted itemset (HEWI) and the HEWI-Uapriori algorithm to efficiently discover HEWIs. The HEWI-Uapriori finds HEWIs using an Apriori-like two-phase approach. The algorithm introduces a property named high upper-bound expected weighted downward closure (HUBEWDC) to early prune the search space and unpromising itemsets. Substantial experiments on real-life and synthetic datasets are conducted to evaluate the performance of the proposed algorithm in terms of runtime, memory consumption, and number of patterns found. Results show that the proposed algorithm has excellent performance and scalability compared with traditional methods for weighted-itemset mining and uncertain itemset mining.en_US
dc.language.isoen_USen_US
dc.subjectData miningen_US
dc.subjectUncertain databasesen_US
dc.subjectWeighted frequent itemsetsen_US
dc.subjectTwo-phaseen_US
dc.subjectUpper-bounden_US
dc.titleWeighted frequent itemset mining over uncertain databasesen_US
dc.identifier.doi10.1007/s10489-015-0703-9en_US
dc.identifier.journalAPPLIED INTELLIGENCEen_US
dc.citation.volume44en_US
dc.citation.issue1en_US
dc.citation.spage232en_US
dc.citation.epage250en_US
dc.contributor.department資訊工程學系zh_TW
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.identifier.wosnumberWOS:000368149500014en_US
Appears in Collections:Articles