A distance based time series classification framework

doi:10.1016/j.is.2015.02.005

Information Systems

Volume 51, July 2015, Pages 27-42

https://doi.org/10.1016/j.is.2015.02.005 Get rights and content

Highlights

•
A time series classification framework is proposed.
•
Some alignment techniques are implemented in it.
•
k-NN and SVM classification modules are available.
•
40 different datasets are classified by using the framework.
•
The performance of the alignment techniques is compared.

Abstract

One of the challenging tasks in machine learning is the classification of time series. It is not very different from standard classification except that the time shifts across time series should be corrected by using a suitable alignment algorithm. In this study, we proposed a framework designed for distance based time series classification which enables users to easily apply different alignment and classification methods to different time series datasets. The framework can be extended to implement new alignment and classification algorithms. Using the framework, we implemented the k-Nearest Neighbor and Support Vector Machines classifiers as well as the alignment methods Dynamic Time Warping, Signal Alignment via Genetic Algorithm, Parametric Time Warping and Canonical Time Warping. We also evaluated the framework on UCR time series repository for which we can conclude that a suitable alignment method enhances the time series classification performance on nearly every dataset.

Introduction

A time series is produced by recording successive measurements of some quantity over time. There are many application fields working with time series [1], [2], [3]. For instance, in the search for exoplanets a time series is created by periodically recording the brightness of a target star, which is called a star light. The time series is then classified with respect to a set of references in order to find unexpected dips due to a planet passing directly between the star and the observer. More than a thousand potential exoplanet has been discovered in the Kepler project by time series analysis [4].

Time series classification plays an important role in many applications such as signature verification [5], speech and handwriting recognition or change detection in mutation analysis. It has been a topic of great interest which has two fundamental components: (i) classification and (ii) alignment. In the classification component, the well known methods, such as the k-Nearest Neighbor (k-NN) and Support Vector Machines (SVM) [6], can be used in their original forms. In these methods, the distance between a pair of time series is calculated by using standard Euclidean metric because of its easy and fast implementation. A time series is created by recording measurements over time. The imperfections of measurement device or differences in the examined subjects result in distortion in the time axis called time drifts or retention time difference [7] which reduces the classification performance. In order to eliminate the distortions, one should make non-linear adjustments often called alignment [8].

Alignment involves the elimination of temporal variations by stretching or compressing the time axis of one or both time series [9]. Another equivalent definition of alignment is to create a mapping between a pair of time series by fitting a warping function. Although the alignment methods may follow very different strategies, they all aim to produce a mapping which is then used to correct the time drifts in the time series. The newly produced and time corrected time series are called aligned time series.

Alignment methods can be used to improve the performance of a classifier by integrating the alignment method into the distance calculation. In this setup, whenever a distance calculation between a pair of time series is requested by the classifier, the time series are first aligned, then the Euclidean distance of the aligned time series is returned to the classifier. By using such an approach, the alignment becomes an integral part of the distance calculation so that it is usually considered as a distance measure [10].

The methods proposed in the studies on alignment techniques are usually considered as new alternatives to the standard Euclidean distance metric or other distance measures based on different alignment methods. For instance, Dynamic Time Warping (DTW) [11], one of the first alignment techniques emerging from the spoken word recognition field, is perceived as a new distance measure. DTW is indeed a generalization of the Minkowski distance which can handle time series of different lengths [12]. However, many other distance measures, such as the Euclidean distance, cannot be applied to time series of different lengths. In such cases, the time series first need to be “aligned” by re-interpolating them to equal length. This implies that a distance measure does not always behave as an alignment technique but rather works in tandem with alignment methods. Therefore, in this study, it is preferred to distinguish alignment methods from distance measures even if the alignment is a variant of some distance measure.

Another challenge in time series classification is the application specific adjustments either in alignment or data processing steps. For instance, multi-dimensional time series are often converted to one dimension by summing, averaging or using other dimensional reduction algorithms because of the fact that the majority of the alignment algorithms are designed to work only with one-dimensional time series [5]. Amplitude normalization, baseline correction or measuring the quality of the alignment are the other minor tasks to be handled in times series classification [13], [14]. As a result, many alignment techniques proposed in the literature are entangled with the application specific tunings which make them unusable for standalone alignment purposes. For the same reason, the performance of the proposed alignment methods can be evaluated on a very limited number of application domains. The studies about time series classification have also a reasonable tendency to focus only on the field of study. Therefore, there are only a few studies trying to test their methods on datasets from different application domains.

In order to overcome the difficulties highlighted above, a public time series classification framework is proposed to make a clear distinction between classification and alignment.¹ The proposed framework enables one to freely change the alignment method and observe its performance without dealing with classification. Likewise, different classification techniques can be tested by keeping alignment method fixed. New classification or alignment algorithms can also be integrated to the framework by implementing the related interfaces. The framework also benefits parallel computing resources if available.

By using the proposed framework, two classification and four alignment methods were tested on 40 different datasets kindly provided by Eamonn Keogh of University of California, Riverside [15]. The most significant outcome of the experiments is that using an alignment method dramatically improves the classification performance on nearly every dataset. The second finding is that the performance of alignment techniques heavily relies on the characteristics of investigated dataset as such an example is analyzed in the experiments. Parallel programming feature of the framework was also tested by utilizing the facilities in High Performance Computing Center of Turkey. The framework has been designed as an open source project, so that researchers can implement their own algorithms.

The rest of this paper is organized as follows: In Section 2 we surveyed the literature on time series classification algorithms. In Section 3, we presented the framework and its current classification and alignment implementations. In Section 4, we experimented the framework with the dataset. The experimental setup and the results were also given. In the last section, we gave a conclusion and future work.

Section snippets

Related work

The studies in time series classification can be analyzed in three main categories with respect to the classification scheme: feature based, model based and distance based [16]. Our classification framework falls into the third category.

The classification framework

The main objective of this work is to propose a flexible and easy to use framework suitable for distance based classification of time series. The framework has classification and alignment components that are flexible to allow implementing different classification or alignment methods. The connections between the two components are handled within the framework as well as the other tasks required in time series classification such as performance evaluation of classifier and visualization of data

Datasets

The collection of time series datasets used in this study is kindly provided by Eamonn Keogh from University of California, Riverside (UCR) [15]. The datasets in this collection come from many diverse applications including person tracking with computer vision [51], monitoring of fish migration [52] and classification of leaves of Swedish trees [53]. The time series in each dataset is provided with class labels as well as training and testing sets. A summary of the datasets is shown in Table 1.

Experimental results

Using the proposed framework, we conducted the following experiments:

•
In Section 5.1, we compared the alignment methods in terms of the smoothness of the warping functions.
•
In Section 5.2, we tested the parallel processing ability of the framework.
•
In Section 5.3, we made a detailed analysis on Gun-Point dataset in order to explain the dramatic performance difference between the alignment methods in this dataset.
•
In Section 5.4, lastly we presented the performance of alignment methods on all

Conclusion and future work

In this study, we proposed a new framework for time series classification composed of two main components, alignment and classification. It allows us to implement custom designed alignments and classification techniques.

The choice of a classifier and the alignment method is not obvious to a researcher working on the classification of time series datasets. Consequently, the common methodology adopted by researchers is to experiment their dataset with a few combinations of well-known

References (81)

L. Andrade et al.
Robust normalization of DNA chromatograms by regression for improved base-calling
J. Frankl. Inst.—Eng. Appl. Math.
(2004)
K.J. Johnson et al.
High-speed peak matching algorithm for retention time alignment of gas chromatographic data for chemometric analysis
J. Chromatogr. A
(2003)
N.P.V. Nielsen et al.
Aligning of single and multiple wavelength chromatographic profiles for chemometric data analysis using correlation optimised warping
J. Chromatogr. A
(1998)
H. Kaya et al.
SAGAa novel signal alignment method based on genetic algorithm
Inf. Sci.
(2013)
A. van Nederkassel et al.
A comparison of three algorithms for chromatograms alignment
J. Chromatogr. A
(2006)
J. Liu et al.
uwaveAccelerometer-based personalized gesture recognition and its applications
Pervasive Mob. Comput.
(2009)
J.O. Ramsay et al.
Functional Data Analysis
(1997)
F. Sanger et al.
DNA sequencing with chain-terminating inhibitors
Proc. Natl. Acad. Sci. USA
(1977)
N.K.S. Thalange et al.
Model of normal prepubertal growth
Arch. Dis. Child.
(1996)
C. Middour, T.C. Klaus, J. Jenkins, D. Pletcher, M. Cote, H. Chandrasekaran, B. Wohler, F. Girouard, J.P. Gunter, K....

M. Bashir et al.

Reduced dynamic time warping for handwriting recognition based on multidimensional time series of a novel pen device

Int. J. Intell. Syst. Technol.,WASET

(2008)

C. Cortes et al.

Support-vector networks

Mach. Learn.

(1995)

L.S. Ettre

Nomenclature for chromatography

Pure Appl. Chem.

(1993)

R. Smith et al.

LC-MS alignment in theory and practice: a comprehensive algorithmic review

Brief. Bioinform.

(2015)

K. Coakley et al.

Alignment of noisy signals

IEEE Trans. Instrum. Meas.

(2001)

H. Ding et al.

Querying and mining of time series dataexperimental comparison of representations and distance measures

Proc. VLDB Endow.

(2008)

H. Sakoe et al.

Dynamic-programming algorithm optimization for spoken word recognition

IEEE Trans. Acoust. Speech Signal Process.

(1978)

Z. Prekopcsak et al.

Time series classification by class-specific Mahalanobis distance measures

Adv. Data Anal. Classif.

(2012)

E. Keogh, Q. Zhu, B. Hu, Y. Hao, X. Xi, L. Wei, C.A. Ratanamahatana, The UCR Time Series Classification/Clustering...

Z. Xing et al.

A brief survey on sequence classification

ACM SIGKDD Explor. Newslett.

(2010)

C. Faloutsos et al.

Fast subsequence matching in time-series databases

SIGMOD Rec.

(1994)

I. Popivanov, R. Miller, Similarity search over time-series data using wavelets, in: Proceedings of the 18th...

F. Korn, H.V. Jagadish, C. Faloutsos, Efficiently supporting ad hoc queries in large datasets of time sequences, in:...

J. Listgarten et al.

Difference detection in LC-MS data for protein biomarker discovery

Bioinformatics

(2007)

A. Nanopoulos et al.

Feature-based classification of time-series data

Int. J. Comput. Res.

(2001)

D. Pham et al.

Control chart pattern recognition using neural networks

J. Syst. Eng.

(1992)

D.T. Pham et al.

Control chart pattern-recognition using learning vector quantization networks

Int. J. Prod. Res.

(1994)

S. Gauri

Control chart pattern recognition using feature-based learning vector quantization

Int. J. Adv. Manuf. Technol.

(2010)

E. Alpaydin

Introduction to Machine Learning

(2010)

H. Ney et al.

Dynamic programming search for continuous speech recognition

IEEE Signal Process. Mag.

(1999)

B.S. Atal et al.

Speech analysis and synthesis by linear prediction of the speech wave

J. Acoust. Soc. Am.

(1971)

T. Vintsyuk

Speech discrimination by dynamic programming

Cybernetics

(1968)

R. Bellman

Dynamic Programming

(2003)

T. Rakthanmanon, B. Campana, A. Mueen, G. Batista, B. Westover, Q. Zhu, J. Zakaria, E. Keogh, Searching and mining...

E. Keogh, L. Wei, X. Xi, S. hee Lee, M. Vlachos, Lb-keogh supports exact indexing of shapes under rotation invariance...

S. Salvadora et al.

Toward accurate dynamic time warping in linear time and space

Intell. Data Anal.

(2007)

F. Itakura

Minimum prediction residual principle applied to speech recognition

IEEE Trans. Acoust. Speech Signal Process.

(1975)

C.A. Ratanamahatana et al.

Three myths about dynamic time warping data mining

P.H.C. Eilers

Parametric time warping

Anal. Chem.

(2004)

J.O. Ramsay et al.

Curve registration

J. R. Stat. Soc. Ser. B—Stat. Methodol.

(1998)

Cited by (31)

A novel distance measure based on dynamic time warping to improve time series classification
2024, Information Sciences
Dynamic time warping (DTW) is the most widely used method to evaluate the similarity between time series. However, the DTW distance only takes into account the difference in amplitude, but does not reflect the time distortion information between them. In this paper, we propose a novel time similarity metric, called the time distortion coefficient, based on the DTW warping path to quantify the time distortion between time series. It is able to characterize the type and degree of time distortion between two time series at each point. By summing the absolute values of the time distortion coefficients, the overall time distortion is introduced to quantify time distortion between two time series. For the Nearest Neighbor (NN) based time series classification, a fusional similarity measure combining the DTW distance and the overall time distortion measure is proposed, which is able to evaluate the similarity in both amplitude and time domains. The experimental results conducted on the UCR time series classification archive datasets demonstrate that the proposed fusional similarity measure can significantly improve the classification accuracy of the 1-NN classifier with only a small amount of additional computational cost compared to the DTW distance and other metrics.
Unsupervised clustering for the anomaly diagnosis of plunger lift operations
2023, Geoenergy Science and Engineering
Plunger lift is effective for dealing with liquid accumulation in gas wells. With the development of digital plunger control systems and large databases, machine learning has demonstrated significant advantages in plunger lift parameter optimization and pattern recognition. Generating high-quality labbels for massive field data, which is crucial for training machine learning models, is still a challenge due to the lack of a cost-effective labeling method. This paper presents an unsupervised clustering method to distinguish patterns for the plunger lift data. A model based on the transformer encoder is trained to identify periodic points in the continuous data. The periodic points are then used to partition the data into periodic sub-datasets with four distinct labels, including anomaly labels and three distinct operational conditions. Comparison of the effectiveness of data dimension reduction techniques is conducted, including Principal component analysis, Multidimensional scaling, statistical methods, and Autoencoder, in improving the clustering quality. Results show that the deep-neural-networks-based autoencoder achieves the highest clustering accuracy due to its ability to learn more compact representations through joint optimization of clustering loss and reconstruction loss. Moreover, the cyclic-feature-based algorithm is found to outperform the sliding window to obtain the input data for clustering.
TS-Evolutionary_Prototyping: A Python module for finding the prototype in large sets of time series[Formula presented]
2023, Software Impacts
Time series analysis has become one of the basic building blocks for the technological fields of science and engineering. Therefore, there are a large number of software tools that encompass the preparation of the data, the performance of a large number of processing tasks with the data, the generation of datasets and finally the implementation of the necessary evaluation techniques. Of particular importance within the above tasks is the prototyping or summarisation of sets of time series as they have direct application in the resolution of clustering problems. In this work, we introduce a Python package that implements an evolutionary strategy to find prototypes. Given a set of time series, the implemented software finds prototypes using dynamic time warping (DTW) as the distance measure between series and does not restrict the search space for the prototype to the series of the input set. The software also includes use cases for clustering and classification.
Spatial-temporal alignment of time series with different sampling rates based on cellular multi-objective whale optimization
2023, Information Processing and Management
Citation Excerpt :
Another class of profile-based alignment algorithm is the meta-heuristic and evolutionary algorithms, which can be divided into two types: single-objective optimization (SOO) and multi-objective optimization (MOO). The SOO methods include genetic algorithm (Kaya & Gunduz-Oguducu, 2013, Kaya, 2015), differential evolution (Wei, Ding & Zhou, 2020) et al. The MOO methods include Multi-Objective Particle Swarm Optimization (MOPSO) (Zhang, Pu & Schonfeld, 2020), Multi-Objective Evolutionary Algorithms with Inverse Model (IM-MOEA) (Xue, Jiang & Wang, 2021), Non-dominated Sorting Genetic Algorithm II (NSGA-II) (Zambrano-Vega, Nebro & García-Nieto, 2017, Ortuño et al., 2013, Feng & Zhang, 2021, Huang, Xue & Jiang, 2020, Acampora, Ishibuchi & Vitiello, 2014) et al.
Aligning time series of different sampling rates is an important but challenging task. Current commonly used dynamic time warping methods usually suffer from pathological temporal singularity problem. In order to overcome this, we transform the alignment task to a spatial-temporal multi-objective optimization (MOO) problem. Existing MOO algorithms are always inefficient in finding Pareto optimal alignment solutions due to their insufficiency in maintaining convergence and diversity among the obtained Pareto solutions. In light of this, we propose a novel and efficient MOO algorithm Cell-MOWOA which integrates Cellular automata with the rising Whale Optimization Algorithm to find Pareto optimal alignment solutions. Innovative multi-variate non-linear cell state evolutionary rules are designed within Pareto solution external archive to improve the convergence and diversity of the Pareto solutions, and novel whale population updating mechanism is designed to accelerate the convergence to the Pareto front. Besides, new integer whale updating mechanism is presented to transform real-number whale solutions to integer whale solutions. Experimental results on 85 gold-standard UCR time series datasets showed that Cell-MOWOA outperformed six state-of-the-art baselines by 24.53% in average in increasing alignment accuracy and 42.66% in average in reducing singularity. Besides, it achieved outstanding runtime efficiency, especially on long time series datasets.
Dual-PISA: An index for aggregation operations on time series data
2020, Information Systems
Citation Excerpt :
In recent years, more and more sensor data is collected for monitoring, analysis, and forecasting. This data is usually organized along the time dimension to form a considerable amount of time series data [1–4]. For example, there are more than 100,000 meteorological ground stations in China.
Aggregation operations play an essential role in time series database management. As the number of data increases, it is difficult for current solutions, such as summary table and MapReduce-based methods to respond to such queries with low latency. Other approaches, such as segment tree-based methods, have a poor insertion performance when the data size exceeds the available memory. This paper proposes a Persistent Index for Segmented Aggregations (PISA), which has fast insertion performance and low latency for aggregation queries. PISA uses a forest to overcome the performance disadvantage of insertion in traditional segment trees. By defining two kinds of tags, namely code number and serial number, we propose an algorithm to accelerate queries by avoiding unnecessary reading data on disk. Additionally, we extend it to Dual-PISA to tolerate a range of unordered data, which is very important in the real world. Dual-PISA is stored on disk and is hugely memory-efficient — only takes a few hundred bytes of memory for billions of data points. Dual-PISA can be easily implemented on both traditional databases and NoSQL systems. It handles aggregation queries within milliseconds on a commodity server, for a time range that contains tens of billions of data points.
A hybrid dynamic exploitation barebones particle swarm optimisation algorithm for time series segmentation
2019, Neurocomputing
Citation Excerpt :
Time series can be obtained from different areas, such as climate [3], hydrology [4], finances [5], satellite images [6], etc. They are used for different tasks depending on the objective of the researchers and the application areas, e.g. classification [7,8], forecasting [9,10], tipping point detection [11], clustering [12], similarity assessment [13,14] or segmentation [15]. Specifically, time series segmentation is an important task, which consists of cutting the time series in some specific points trying to achieve different objectives, which are generally related to two points of view.
Large time series are difficult to be mined and preprocessed, hence reducing their number of points with minimum information loss is an active field of study. This paper proposes new methods based on time series segmentation, including the adaptation of the particle swarm optimisation algorithm (PSO) to this problem, and more advanced PSO versions, such as barebones PSO (BBPSO) and its exploitation version (BBePSO). Moreover, a novel algorithm is derived, referred to as dynamic exploitation barebones PSO (DBBePSO), which updates the importance of the social and cognitive components throughout the generations. All these algorithms are further improved by considering a final local search step based on the combination of two well-known standard segmentation algorithms (Bottom-Up and Top-Down). The performance of the different methods is evaluated using 15 time series from various application fields, and the results show that the novel algorithm (DBBePSO) and its hybrid version (HDBBePSO) outperform the rest of segmentation techniques.

View all citing articles on Scopus

View full text

A distance based time series classification framework

Highlights

Abstract

Introduction

Section snippets

Related work

The classification framework

Datasets

Experimental results

Conclusion and future work

J. Frankl. Inst.—Eng. Appl. Math.

J. Chromatogr. A

J. Chromatogr. A

Inf. Sci.

J. Chromatogr. A

Pervasive Mob. Comput.

Functional Data Analysis

DNA sequencing with chain-terminating inhibitors

Proc. Natl. Acad. Sci. USA

Model of normal prepubertal growth

Arch. Dis. Child.

Reduced dynamic time warping for handwriting recognition based on multidimensional time series of a novel pen device

Int. J. Intell. Syst. Technol.,WASET

Support-vector networks

Mach. Learn.

Nomenclature for chromatography

Pure Appl. Chem.

LC-MS alignment in theory and practice: a comprehensive algorithmic review

Brief. Bioinform.

Alignment of noisy signals

IEEE Trans. Instrum. Meas.

Querying and mining of time series dataexperimental comparison of representations and distance measures

Proc. VLDB Endow.

Dynamic-programming algorithm optimization for spoken word recognition

IEEE Trans. Acoust. Speech Signal Process.

Time series classification by class-specific Mahalanobis distance measures

Adv. Data Anal. Classif.

A brief survey on sequence classification

ACM SIGKDD Explor. Newslett.

Fast subsequence matching in time-series databases

SIGMOD Rec.

Difference detection in LC-MS data for protein biomarker discovery

Bioinformatics

Feature-based classification of time-series data

Int. J. Comput. Res.

Control chart pattern recognition using neural networks

J. Syst. Eng.

Control chart pattern-recognition using learning vector quantization networks

Int. J. Prod. Res.

Control chart pattern recognition using feature-based learning vector quantization

Int. J. Adv. Manuf. Technol.

Introduction to Machine Learning

Dynamic programming search for continuous speech recognition

IEEE Signal Process. Mag.

Speech analysis and synthesis by linear prediction of the speech wave

J. Acoust. Soc. Am.

Speech discrimination by dynamic programming

Cybernetics

Dynamic Programming

Toward accurate dynamic time warping in linear time and space

Intell. Data Anal.

Minimum prediction residual principle applied to speech recognition

IEEE Trans. Acoust. Speech Signal Process.

Three myths about dynamic time warping data mining

Parametric time warping

Anal. Chem.

Curve registration

J. R. Stat. Soc. Ser. B—Stat. Methodol.