Full metadata record
DC FieldValueLanguage
dc.contributor.authorZihayat, Mortezaen_US
dc.contributor.authorWu, Cheng-Weien_US
dc.contributor.authorAn, Aijunen_US
dc.contributor.authorTseng, Vincent S.en_US
dc.contributor.authorLin, Chienen_US
dc.date.accessioned2018-08-21T05:53:57Z-
dc.date.available2018-08-21T05:53:57Z-
dc.date.issued2017-01-01en_US
dc.identifier.issn1088-467Xen_US
dc.identifier.urihttp://dx.doi.org/10.3233/IDA-170874en_US
dc.identifier.urihttp://hdl.handle.net/11536/145380-
dc.description.abstractHigh utility sequential pattern (HUSP) mining has emerged as a novel topic in data mining. Although some preliminary works have been conducted on this topic, they incur the problem of producing a large search space for high utility sequential patterns. In addition, they mainly focus on mining HUSPs in static databases and do not take streaming data into account, where unbounded data come continuously and often at a high speed. To efficiently deal with both problems, we propose a novel framework for mining high utility sequential patterns over static and streaming databases. In this regard, two efficient data structures named ItemUtilLists (Item Utility Lists) and HUSP-Tree (High Utility Sequential Pattern Tree) are proposed to maintain essential information for mining HUSPs in both offline and online fashions. In addition, a novel utility model called Sequence-Suffix Utility is proposed for effectively pruning the search space in HUSP mining. We propose an algorithm named HUSP-Miner (High Utility Sequential Pattern Miner) to find HUSPs in static databases efficiently. Then, a one-pass algorithm named HUSP-Stream (High Utility Sequential Pattern mining over Data Streams) is proposed to incrementally update ItemUtilLists and HUSP-Tree online and find HUSPs over data streams. To the best of our knowledge, HUSP-Stream is the first method to find HUSPs over data streams. Experimental results on both real and synthetic datasets show that HUSP-Miner outperforms the compared algorithms substantially in terms of execution time, memory usage and number of generated candidates. The experiments also demonstrate impressive performance of HUSP-Stream to update the data structures and discover HUSPs over data streams.en_US
dc.language.isoen_USen_US
dc.subjectHigh utility sequential pattern miningen_US
dc.subjectdata streamsen_US
dc.subjectsliding windowen_US
dc.titleEfficiently mining high utility sequential patterns in static and streaming dataen_US
dc.typeArticleen_US
dc.identifier.doi10.3233/IDA-170874en_US
dc.identifier.journalINTELLIGENT DATA ANALYSISen_US
dc.citation.volume21en_US
dc.citation.issue1en_US
dc.contributor.department資訊工程學系zh_TW
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.identifier.wosnumberWOS:000399455600007en_US
Appears in Collections:Articles