基于平稳性分析的时间序列分割改进新样本预测-2021年

ID：37259

阅读量：0

大小：0.62 MB

页数：22页

时间：2023-03-03

金币：10

上传者：战必胜

sensors

Article

Time Series Segmentation Based on Stationarity Analysis

to Improve New Samples Prediction

Ricardo Petri Silva

* , Bruno Bogaz Zarpelão

, Alberto Cano

and Sylvio Barbon Junior



 

Citation: Silva, R.P.; Zarpelão, B.B.;

Cano, A.; Junior, S.B. Time Series

Segmentation Based on Stationarity

Analysis to Improve New Samples

Prediction. Sensors 2021, 21, 7333.

https://doi.org/10.3390/s21217333

Academic Editors: YangQuan Chen,

Subhas Mukhopadhyay, Nunzio

Cennamo, M. Jamal Deen, Junseop

Lee and Simone Morais

Received: 9 August 2021

Accepted: 2 November 2021

Published: 4 November 2021

Publisher’s Note: MDPI stays neutral

with regard to jurisdictional claims in

published maps and institutional afﬁl-

iations.

Licensee MDPI, Basel, Switzerland.

This article is an open access article

distributed under the terms and

conditions of the Creative Commons

Attribution (CC BY) license (https://

creativecommons.org/licenses/by/

4.0/).

Department of Electrical Engineering, State University of Londrina, Londrina 86057-970, Brazil

Department of Computer Science, State University of Londrina, Londrina 86057-970, Brazil;

brunozarpelao@uel.br (B.B.Z.); barbon@uel.br (S.B.J.)

Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA;

acano@vcu.edu

* Correspondence: petri@uel.br

Abstract:

A wide range of applications based on sequential data, named time series, have become

increasingly popular in recent years, mainly those based on the Internet of Things (IoT). Several

different machine learning algorithms exploit the patterns extracted from sequential data to support

multiple tasks. However, this data can suffer from unreliable readings that can lead to low accuracy

models due to the low-quality training sets available. Detecting the change point between high

representative segments is an important ally to ﬁnd and thread biased subsequences. By constructing

a framework based on the Augmented Dickey-Fuller (ADF) test for data stationarity, two proposals

to automatically segment subsequences in a time series were developed. The former proposal, called

Change Detector segmentation, relies on change detection methods of data stream mining. The latter,

called ADF-based segmentation, is constructed on a new change detector derived from the ADF test

only. Experiments over real-ﬁle IoT databases and benchmarks showed the improvement provided

by our proposals for prediction tasks with traditional Autoregressive integrated moving average

(ARIMA) and Deep Learning (Long short-term memory and Temporal Convolutional Networks)

methods. Results obtained by the Long short-term memory predictive model reduced the relative

prediction error from 1 to 0.67, compared to time series without segmentation.

Keywords:

time series segmentation; stationarity analysis; time series prediction improvement; size

reduction in time series

1. Introduction

The growth of data generation increases daily due to the advancement of

technology [1]

With the advent of sensors that are capable of capturing precious data, there is also the

need to transform this data into information. The most common data structure in the era of

automatic sensor data processing is time series. A time series can be deﬁned as a set of se-

quential data ordered in time [

]. Traditionally, stochastic processes are used to model time

series behavior with great success [

]. In addition, machine learning-based approaches

are also employed to perform the identiﬁcation of complex behaviors of nonlinear patterns,

optimization of unconventional functions, and even establishing connections with long

dependencies through recurrent neural networks [

]. These patterns can be veriﬁed in

different areas, such as climatic data [

], sales [

], medical diagnosis [

–

], security [

and even the change in share values on the stock exchange [13].

From time series analyses, it is possible to examine these patterns and create predictions

of future samples, as discussed in Mahalakshmi et al. [

]. Models based on machine learning,

e.g., Long short-term memory (LSTM) and Temporal Convolutional Network (TCN), have

shown promising results, [

], as an alternative to statistical models. Approaches that apply

machine learning concepts can adapt their settings to improve predictive ability [

]. This can

be done by adjusting their hyperparameters so that the time series modeling is better suited

Sensors 2021, 21, 7333. https://doi.org/10.3390/s21217333 https://www.mdpi.com/journal/sensors

资源描述：

当前文档最多预览五页，下载文档查看全文

侵权申诉



1 1 2 3 4 5 / 22



此文档下载收益归作者所有

当前文档最多预览五页，下载文档查看全文

版权提示

温馨提示：
1. 部分包含数学公式或PPT动画的文件，查看预览时可能会显示错乱或异常，文件下载后无此问题，请放心下载。
2. 本文档由用户上传，版权归属用户，天天文库负责整理代发布。如果您对本文档版权有争议请及时联系客服。
3. 下载前请仔细阅读文档内容，确认文档内容符合您的需求后进行下载，若出现内容与标题不符可向本站投诉处理。
4. 下载文档时可能由于网络波动等原因无法下载或下载错误，付费完成后未能成功下载的用户请联系客服处理。

大家都在看

近期热门

基于平稳性分析的时间序列分割改进新样本预测-2021年

最近更新

大家都在看

相关文章

相关标签