完整後設資料紀錄
DC 欄位語言
dc.contributor.authorPeng, Wen-Chihen_US
dc.contributor.authorLiao, Zhung-Xunen_US
dc.date.accessioned2014-12-08T15:08:34Z-
dc.date.available2014-12-08T15:08:34Z-
dc.date.issued2009-10-01en_US
dc.identifier.issn0169-023Xen_US
dc.identifier.urihttp://dx.doi.org/10.1016/j.datak.2009.04.009en_US
dc.identifier.urihttp://hdl.handle.net/11536/6585-
dc.description.abstractIn this paper, given a set of sequence databases across multiple domains, we aim at mining multi-domain sequential patterns, where a multi-domain sequential pattern is a sequence of events whose occurrence time is within a pre-defined time window. We first: propose algorithm Naive in which multiple sequence databases are joined as one sequence database for utilizing traditional sequential pattern mining algorithms (e.g., PrefixSpan). Due to the nature of join operations, algorithm Naive is costly and is developed for comparison purposes. Thus, we propose two algorithms without any join operations for mining multidomain sequential patterns. Explicitly, algorithm IndividualMine derives sequential patterns in each domain and then iteratively combines sequential patterns among sequence databases of multiple domains to derive candidate multi-domain sequential patterns. However, not all sequential patterns mined in the sequence database of each domain are able to form multi-domain sequential patterns. To avoid the mining cost incurred in algorithm IndividualMine, algorithm PropagatedMine is developed. Algorithm PropagatedMine first performs one sequential pattern mining from one sequence database. In light of sequential patterns mined, algorithm PropagatedMine propagates sequential patterns mined to other sequence databases. Furthermore, sequential patterns mined are represented as a lattice structure for further reducing the number of sequential patterns to be propagated. In addition. we develop some mechanisms to allow some empty sets in multi-domain sequential patterns. Performance of the proposed algorithms is comparatively analyzed and sensitivity analysis is conducted. Experimental results show that by exploring propagation and lattice structures, algorithm PropagatedMine outperforms algorithm IndividualMine in terms of efficiency (i.e., the execution time). (C) 2009 Elsevier B.V. All rights reserved.en_US
dc.language.isoen_USen_US
dc.subjectData miningen_US
dc.subjectSequential pattern miningen_US
dc.subjectMulti-domain sequential patternsen_US
dc.titleMining sequential patterns across multiple sequence databasesen_US
dc.typeArticleen_US
dc.identifier.doi10.1016/j.datak.2009.04.009en_US
dc.identifier.journalDATA & KNOWLEDGE ENGINEERINGen_US
dc.citation.volume68en_US
dc.citation.issue10en_US
dc.citation.spage1014en_US
dc.citation.epage1033en_US
dc.contributor.department資訊工程學系zh_TW
dc.contributor.departmentDepartment of Computer Scienceen_US
dc.identifier.wosnumberWOS:000270603900008-
dc.citation.woscount6-
顯示於類別:期刊論文


文件中的檔案:

  1. 000270603900008.pdf

若為 zip 檔案,請下載檔案解壓縮後,用瀏覽器開啟資料夾中的 index.html 瀏覽全文。