Early Classification of Multivariate Time Series on Distributed and In-Memory Platforms

doi:10.1007/978-3-319-67274-8_1

Full metadata record

DC Field	Value	Language
dc.contributor.author	Tseng, Vincent S.	en_US
dc.contributor.author	Huang, Huai-Shuo	en_US
dc.contributor.author	Huang, Chia-Wei	en_US
dc.contributor.author	Wang, Ping-Feng	en_US
dc.contributor.author	Li, Chu-Feng	en_US
dc.date.accessioned	2019-04-02T06:04:52Z	-
dc.date.available	2019-04-02T06:04:52Z	-
dc.date.issued	2017-01-01	en_US
dc.identifier.issn	0302-9743	en_US
dc.identifier.uri	http://dx.doi.org/10.1007/978-3-319-67274-8_1	en_US
dc.identifier.uri	http://hdl.handle.net/11536/150801	-
dc.description.abstract	With the popularity of Internet of Things (IOT) applications, various kinds of active sensors are deployed and multivariate time series datasets are generated rapidly. Early classification of multivariate time series is an emerging topic in data mining due to the wide applications in many domains. The unique part of early classification lies in that it uses only earlier part of time series data to reach classification results with the same accuracy as by methods using complete time series information. Although a number of relevant studies have been presented recently, most of them didn't consider the issues of data scale and execution efficiency simultaneously. The main research issue of this paper falls in how to mine interpretable patterns from multivariate time series data, with which effective classification models can be constructed to ensure the accuracy as well as earliness. To take into account the issues of data scale and execution efficiency simultaneously, we explore distributed in-memory computing techniques and multivariate shapelets-based approaches to construct a Spark-based inmemory mining framework to parallelize the feature extraction process. We implement a framework with Multivariate Shapelets Detection (MSD) method as a based example. Through empirical evaluation on various kinds of sensory datasets, the scalability of the framework is evaluated in terms of efficiency while ensuring the same accuracy and reliability in early classification of multivariate time series. This work is the first one to realize multivariate time series early classification on Spark distributed in-memory computing platform, which can serve as a good base for an extension to other time series classification methods based on shapelet feature extraction.	en_US
dc.language.iso	en_US	en_US
dc.subject	Early classification	en_US
dc.subject	Multivariate time series	en_US
dc.subject	Parallel and distributed computing	en_US
dc.subject	Shapelets	en_US
dc.title	Early Classification of Multivariate Time Series on Distributed and In-Memory Platforms	en_US
dc.type	Proceedings Paper	en_US
dc.identifier.doi	10.1007/978-3-319-67274-8_1	en_US
dc.identifier.journal	TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, 2017	en_US
dc.citation.volume	10526	en_US
dc.citation.spage	3	en_US
dc.citation.epage	14	en_US
dc.contributor.department	交大名義發表	zh_TW
dc.contributor.department	National Chiao Tung University	en_US
dc.identifier.wosnumber	WOS:000449978200001	en_US
dc.citation.woscount	0	en_US
Appears in Collections:	Conferences Paper