Skip to main content
Top
Published in: Evolutionary Intelligence 4/2021

21-05-2020 | Research Paper

Genetic algorithm-based fuzzy clustering applied to multivariate time series

Authors: Karine do Prado Ribeiro, Cristiano Hora Fontes, Gabriel Jesus Alves de Melo

Published in: Evolutionary Intelligence | Issue 4/2021

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Despite the fact that the fuzzy clustering of time series based on genetic algorithm (GA) is mostly used in applications involving univariate time series, this paper presents an approach based on GA and Fuzzy C-Means (FCM) for clustering multivariate time series. Each chromosome is an individual or solution which encodes the clusters' centroids (patterns) and a bi-criterion constrained clustering is proposed to maximize both the similarity of objects in the same cluster (based on the SPCA metric) and the distance between the centers of the clusters. The proposed method is applied in two case studies involving a real industrial case which comprises pattern recognition for detecting operation failures in a gas turbine and a well-known benchmark industrial system (Tennessee Eastman process) used to evaluate techniques for detecting and diagnosing failures. The proposed approach was able to obtain better classification results compared to FCM based on classical optimization methods.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Aghabozorgi S, Shirkhorshid AS, Wah TY (2015) Time-series clustering—a decade review. Inf Syst 53:16–38CrossRef Aghabozorgi S, Shirkhorshid AS, Wah TY (2015) Time-series clustering—a decade review. Inf Syst 53:16–38CrossRef
2.
go back to reference Bandyppadhyay S, Baragona R, Maulik U (2010) Clustering multivariate time series by genetic multiobjective optimization. METRON Int J Stat LXVIII(2):161–183MathSciNetMATH Bandyppadhyay S, Baragona R, Maulik U (2010) Clustering multivariate time series by genetic multiobjective optimization. METRON Int J Stat LXVIII(2):161–183MathSciNetMATH
3.
go back to reference Bankó Z, Abonyi J (2012) Correlation based dynamic time warping of multivariate time series. Expert Syst Appl 39:12814–12823CrossRef Bankó Z, Abonyi J (2012) Correlation based dynamic time warping of multivariate time series. Expert Syst Appl 39:12814–12823CrossRef
4.
go back to reference Baragona R (2001) A simulation study on clustering time series with metaheuristic methods. Quad Stat 3:1–26MathSciNet Baragona R (2001) A simulation study on clustering time series with metaheuristic methods. Quad Stat 3:1–26MathSciNet
5.
go back to reference Baragona R (2011) Clusters of multivariate stationary time series by differential evolution and autoregressive distance. In: Kuznetsov SO, et al. (eds) Clusters of time series by differential evolution. Springer, Berlin, pp 382–387 Baragona R (2011) Clusters of multivariate stationary time series by differential evolution and autoregressive distance. In: Kuznetsov SO, et al. (eds) Clusters of time series by differential evolution. Springer, Berlin, pp 382–387
6.
go back to reference Bezdek JC et al (2005) Fuzzy models and algorithms for pattern recognition and image processing. Springer, New York Bezdek JC et al (2005) Fuzzy models and algorithms for pattern recognition and image processing. Springer, New York
7.
go back to reference Chawla NV (2010) Data mining for imbalanced data sets: an overview. In: Maimon L, Rokach O (eds) Data mining and knowledge discovery handbook. Springer, US Chawla NV (2010) Data mining for imbalanced data sets: an overview. In: Maimon L, Rokach O (eds) Data mining and knowledge discovery handbook. Springer, US
9.
go back to reference Dobos L, Abonyi J (2012) On-line detection of homogeneous operation ranges by dynamic principal component analysis based time-series segmentation. Chem Eng Sci 2(75):96–105CrossRef Dobos L, Abonyi J (2012) On-line detection of homogeneous operation ranges by dynamic principal component analysis based time-series segmentation. Chem Eng Sci 2(75):96–105CrossRef
10.
go back to reference Dohare, D. and Devi, V. S. (2011) Combination of similarity measures for time series classification using genetic algorithms. In: IEEE Congress of evolutionary computation (CEC), pp 401–408 Dohare, D. and Devi, V. S. (2011) Combination of similarity measures for time series classification using genetic algorithms. In: IEEE Congress of evolutionary computation (CEC), pp 401–408
12.
go back to reference Downs J, Vogel EF (1993) A plant-wide industrial process control problem. Comput Chem Eng 17(3):245–255CrossRef Downs J, Vogel EF (1993) A plant-wide industrial process control problem. Comput Chem Eng 17(3):245–255CrossRef
13.
go back to reference Durán-Rosal AM et al (2017) Detection and prediction of segments containing extreme significant wave heights. Ocean Eng 142:268–279CrossRef Durán-Rosal AM et al (2017) Detection and prediction of segments containing extreme significant wave heights. Ocean Eng 142:268–279CrossRef
14.
go back to reference Eslamloueyan R (2011) Designing a hierarchical neural network based on fuzzy clustering for fault diagnosis of the Tennessee-Eastman process. Appl Soft Comput 11(1):1407–1415CrossRef Eslamloueyan R (2011) Designing a hierarchical neural network based on fuzzy clustering for fault diagnosis of the Tennessee-Eastman process. Appl Soft Comput 11(1):1407–1415CrossRef
15.
go back to reference Fontes CHO, Budman HM (2017) A hybrid clustering approach for multivariate time series—a case study applied to failure analysis in a gas turbine. ISA Trans 71:513–529CrossRef Fontes CHO, Budman HM (2017) A hybrid clustering approach for multivariate time series—a case study applied to failure analysis in a gas turbine. ISA Trans 71:513–529CrossRef
16.
go back to reference Fontes CHO, Budman HM (2018) Evaluation of a hybrid clustering approach for a benchmark industrial system. Ind Eng Chem Res 57:11039–11049CrossRef Fontes CHO, Budman HM (2018) Evaluation of a hybrid clustering approach for a benchmark industrial system. Ind Eng Chem Res 57:11039–11049CrossRef
17.
go back to reference Fontes CH, Pereira O (2016) Pattern recognition in multivariate time series—a case study applied to fault detection in a gas turbine. Eng Appl Artif Intell 49:10–18CrossRef Fontes CH, Pereira O (2016) Pattern recognition in multivariate time series—a case study applied to fault detection in a gas turbine. Eng Appl Artif Intell 49:10–18CrossRef
18.
go back to reference Fu T (2011) A review on time series data mining. Eng Appl Artif Intell 24:164–181CrossRef Fu T (2011) A review on time series data mining. Eng Appl Artif Intell 24:164–181CrossRef
19.
go back to reference García MLL, García-Ródenas R, Gómez AG (2014) Hybrid meta-heuristic optimization algorithms fortime-domain-constrained data clustering. Appl Soft Comput 23:319–332CrossRef García MLL, García-Ródenas R, Gómez AG (2014) Hybrid meta-heuristic optimization algorithms fortime-domain-constrained data clustering. Appl Soft Comput 23:319–332CrossRef
20.
go back to reference Ghaemi R et al (2012) A novel Fuzzy C-Means algorithm to generate diverse and desirable cluster solutions used by genetic-based clustering ensemble algorithms. Memet Comp 4:49–71CrossRef Ghaemi R et al (2012) A novel Fuzzy C-Means algorithm to generate diverse and desirable cluster solutions used by genetic-based clustering ensemble algorithms. Memet Comp 4:49–71CrossRef
21.
go back to reference Izakian H, Pedrycz W, Jamal I (2015) Fuzzy clustering of time series data using dynamic time warping distance. Eng Appl Artif Intell 39:235–244CrossRef Izakian H, Pedrycz W, Jamal I (2015) Fuzzy clustering of time series data using dynamic time warping distance. Eng Appl Artif Intell 39:235–244CrossRef
22.
go back to reference Ji L, Ailing G, Lianyong Y (2011) Application of product cluster method based on fuzzy cluster and genetic algorithm. Procedia Eng 15:4930–4935CrossRef Ji L, Ailing G, Lianyong Y (2011) Application of product cluster method based on fuzzy cluster and genetic algorithm. Procedia Eng 15:4930–4935CrossRef
23.
go back to reference Kavitha V, Punithavalli M (2010) Clustering Time series data stream—a literature survey. Int J Comput Sci Inf Secur 8(1):289–294 Kavitha V, Punithavalli M (2010) Clustering Time series data stream—a literature survey. Int J Comput Sci Inf Secur 8(1):289–294
24.
go back to reference Lau CK et al (2013) Fault diagnosis of Tennessee Eastman process with multi-scale PCA and ANFIS. Chemom Intell Lab Syst 120:1–14CrossRef Lau CK et al (2013) Fault diagnosis of Tennessee Eastman process with multi-scale PCA and ANFIS. Chemom Intell Lab Syst 120:1–14CrossRef
25.
go back to reference Li D, Gu H, Zhang L (2013) A hybrid genetic algorithm–Fuzzy C-Means approach for incomplete data clustering based on nearest-neighbor intervals. Soft Comput 17:1787–1796CrossRef Li D, Gu H, Zhang L (2013) A hybrid genetic algorithm–Fuzzy C-Means approach for incomplete data clustering based on nearest-neighbor intervals. Soft Comput 17:1787–1796CrossRef
26.
27.
go back to reference Liao TW, Ting C-F, Chang P-C (2006) An adaptive genetic clustering method for exploratory mining of feature vector and time series data. Int J Prod Res 44(14):2731–2748MATHCrossRef Liao TW, Ting C-F, Chang P-C (2006) An adaptive genetic clustering method for exploratory mining of feature vector and time series data. Int J Prod Res 44(14):2731–2748MATHCrossRef
29.
go back to reference Mishra S, Sahu P, Senapati MR (2019) MASCA–PSO based LLRBFNN model and improved fast and robust FCM algorithm for detection and classification of brain tumor from MR image. Evol Intell 12(4):647–663CrossRef Mishra S, Sahu P, Senapati MR (2019) MASCA–PSO based LLRBFNN model and improved fast and robust FCM algorithm for detection and classification of brain tumor from MR image. Evol Intell 12(4):647–663CrossRef
30.
go back to reference Pérez-Ortiz M et al (2019) On the use of evolutionary time series analysis for segmenting paleoclimate data. Neurocomputing 326–327:3–14CrossRef Pérez-Ortiz M et al (2019) On the use of evolutionary time series analysis for segmenting paleoclimate data. Neurocomputing 326–327:3–14CrossRef
31.
go back to reference Plant, C., Wohlschlager, A. M. and Zherdin, A. (2009) Interaction-based clustering of multivariate time series. In: Proceedings of the ninth IEEE international conference on data mining. Miami-Floria (USA), pp 914–919 Plant, C., Wohlschlager, A. M. and Zherdin, A. (2009) Interaction-based clustering of multivariate time series. In: Proceedings of the ninth IEEE international conference on data mining. Miami-Floria (USA), pp 914–919
32.
go back to reference Rahman MA, Islam MZ (2014) A hybrid clustering technique combining a novel genetic algorithm with K-means. Knowl Based Syst 71:345–365CrossRef Rahman MA, Islam MZ (2014) A hybrid clustering technique combining a novel genetic algorithm with K-means. Knowl Based Syst 71:345–365CrossRef
33.
go back to reference Rani S, Sikka G (2012) Recent techniques of clustering of time series data: a survey. Int J Comput Appl 52(15):1–9 Rani S, Sikka G (2012) Recent techniques of clustering of time series data: a survey. Int J Comput Appl 52(15):1–9
34.
go back to reference Rato TJ, Reis MS (2013) Fault detection in the Tennessee Eastman benchmark process using dynamic principal components analysis based on decorrelated residuals (DPCA-DR). Chemom Intell Lab Syst 125:101–108CrossRef Rato TJ, Reis MS (2013) Fault detection in the Tennessee Eastman benchmark process using dynamic principal components analysis based on decorrelated residuals (DPCA-DR). Chemom Intell Lab Syst 125:101–108CrossRef
35.
go back to reference Ricardez-Sandoval LA, Budman HM, Douglas PL (2009) Simultaneous design and control of chemical processes with application to the Tennessee Eastman process. J Process Control 19(6):1377–1391CrossRef Ricardez-Sandoval LA, Budman HM, Douglas PL (2009) Simultaneous design and control of chemical processes with application to the Tennessee Eastman process. J Process Control 19(6):1377–1391CrossRef
36.
go back to reference Ricker NL (1996) Decentralized control of the Tennessee Eastman challenge process. J Process Control 6(4):205–221CrossRef Ricker NL (1996) Decentralized control of the Tennessee Eastman challenge process. J Process Control 6(4):205–221CrossRef
37.
go back to reference Rolls-Royce (2010) Training Manual RB 211-G62 DF Rolls-Royce (2010) Training Manual RB 211-G62 DF
38.
go back to reference Singhal A, Seborg DE (2005) Clustering multivariate time-series data. J Chemom 19(8):427–438CrossRef Singhal A, Seborg DE (2005) Clustering multivariate time-series data. J Chemom 19(8):427–438CrossRef
39.
go back to reference Singhal A, Seborg DE (2006) Evaluation of a pattern matching method for the Tennessee Eastman challenge process. J Process Control 16:601–613CrossRef Singhal A, Seborg DE (2006) Evaluation of a pattern matching method for the Tennessee Eastman challenge process. J Process Control 16:601–613CrossRef
40.
go back to reference Tang J et al (2015) A hybrid approach to integrate Fuzzy C-Means based imputation method with genetic algorithm for missing traffic volume data estimation. Transp Res Part C 51:29–40CrossRef Tang J et al (2015) A hybrid approach to integrate Fuzzy C-Means based imputation method with genetic algorithm for missing traffic volume data estimation. Transp Res Part C 51:29–40CrossRef
41.
go back to reference Tseng VS et al (2009) Cluster-based genetic segmentation of time series with DWT. Pattern Recogn Lett 30:1190–1197CrossRef Tseng VS et al (2009) Cluster-based genetic segmentation of time series with DWT. Pattern Recogn Lett 30:1190–1197CrossRef
43.
go back to reference Wang X et al (2013) Experimental comparison of representation methods and distance measures for time series data. Data Min Knowl Discov 26:275–309MathSciNetCrossRef Wang X et al (2013) Experimental comparison of representation methods and distance measures for time series data. Data Min Knowl Discov 26:275–309MathSciNetCrossRef
44.
go back to reference Wang Z et al (2012) Flood season division with an improved Fuzzy C-Mean clustering method in the Taihu Lake Basin in China. Procedia Eng 28:66–74CrossRef Wang Z et al (2012) Flood season division with an improved Fuzzy C-Mean clustering method in the Taihu Lake Basin in China. Procedia Eng 28:66–74CrossRef
45.
go back to reference Wikaisuksakul S (2014) A multi-objective genetic algorithm with Fuzzy C-Means for automatic data clustering. Appl Soft Comput 24:679–691CrossRef Wikaisuksakul S (2014) A multi-objective genetic algorithm with Fuzzy C-Means for automatic data clustering. Appl Soft Comput 24:679–691CrossRef
46.
go back to reference Wu J et al (2014) Prediction of solar radiation with genetic approach combing multi-model framework. Renew Energy 66:132–139CrossRef Wu J et al (2014) Prediction of solar radiation with genetic approach combing multi-model framework. Renew Energy 66:132–139CrossRef
47.
go back to reference Xun, L. and Zhishu, L. (2010) The similarity of multivariate time series and its application. In: Proceedings of the international conference on management of e-Commerce and e-Government, Sichuan, China, pp 76–81 Xun, L. and Zhishu, L. (2010) The similarity of multivariate time series and its application. In: Proceedings of the international conference on management of e-Commerce and e-Government, Sichuan, China, pp 76–81
48.
go back to reference Yang C-L et al (2015) Non-dominated sorting genetic algorithm using fuzzy membershipchromosome for categorical data clustering. Appl Soft Comput 30:113–122CrossRef Yang C-L et al (2015) Non-dominated sorting genetic algorithm using fuzzy membershipchromosome for categorical data clustering. Appl Soft Comput 30:113–122CrossRef
49.
go back to reference Yang, K. and Shahabi, C. (2004) A PCA-based similarity measure for multivariate time series. In: Proceedings of the International workshop on multimedia databases, ACM-MMDB, Washington DC, USA, pp 1–10 Yang, K. and Shahabi, C. (2004) A PCA-based similarity measure for multivariate time series. In: Proceedings of the International workshop on multimedia databases, ACM-MMDB, Washington DC, USA, pp 1–10
50.
go back to reference Yin S et al (2012) A comparison study of basic data-driven fault diagnosis and process monitoring methods on the benchmark Tennessee Eastman process. J Process Control 22(9):1567–1581CrossRef Yin S et al (2012) A comparison study of basic data-driven fault diagnosis and process monitoring methods on the benchmark Tennessee Eastman process. J Process Control 22(9):1567–1581CrossRef
Metadata
Title
Genetic algorithm-based fuzzy clustering applied to multivariate time series
Authors
Karine do Prado Ribeiro
Cristiano Hora Fontes
Gabriel Jesus Alves de Melo
Publication date
21-05-2020
Publisher
Springer Berlin Heidelberg
Published in
Evolutionary Intelligence / Issue 4/2021
Print ISSN: 1864-5909
Electronic ISSN: 1864-5917
DOI
https://doi.org/10.1007/s12065-020-00422-8

Other articles of this Issue 4/2021

Evolutionary Intelligence 4/2021 Go to the issue

Premium Partner