Skip to main content
Erschienen in: Evolutionary Intelligence 4/2021

21.05.2020 | Research Paper

Genetic algorithm-based fuzzy clustering applied to multivariate time series

verfasst von: Karine do Prado Ribeiro, Cristiano Hora Fontes, Gabriel Jesus Alves de Melo

Erschienen in: Evolutionary Intelligence | Ausgabe 4/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Despite the fact that the fuzzy clustering of time series based on genetic algorithm (GA) is mostly used in applications involving univariate time series, this paper presents an approach based on GA and Fuzzy C-Means (FCM) for clustering multivariate time series. Each chromosome is an individual or solution which encodes the clusters' centroids (patterns) and a bi-criterion constrained clustering is proposed to maximize both the similarity of objects in the same cluster (based on the SPCA metric) and the distance between the centers of the clusters. The proposed method is applied in two case studies involving a real industrial case which comprises pattern recognition for detecting operation failures in a gas turbine and a well-known benchmark industrial system (Tennessee Eastman process) used to evaluate techniques for detecting and diagnosing failures. The proposed approach was able to obtain better classification results compared to FCM based on classical optimization methods.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Aghabozorgi S, Shirkhorshid AS, Wah TY (2015) Time-series clustering—a decade review. Inf Syst 53:16–38CrossRef Aghabozorgi S, Shirkhorshid AS, Wah TY (2015) Time-series clustering—a decade review. Inf Syst 53:16–38CrossRef
2.
Zurück zum Zitat Bandyppadhyay S, Baragona R, Maulik U (2010) Clustering multivariate time series by genetic multiobjective optimization. METRON Int J Stat LXVIII(2):161–183MathSciNetMATH Bandyppadhyay S, Baragona R, Maulik U (2010) Clustering multivariate time series by genetic multiobjective optimization. METRON Int J Stat LXVIII(2):161–183MathSciNetMATH
3.
Zurück zum Zitat Bankó Z, Abonyi J (2012) Correlation based dynamic time warping of multivariate time series. Expert Syst Appl 39:12814–12823CrossRef Bankó Z, Abonyi J (2012) Correlation based dynamic time warping of multivariate time series. Expert Syst Appl 39:12814–12823CrossRef
4.
Zurück zum Zitat Baragona R (2001) A simulation study on clustering time series with metaheuristic methods. Quad Stat 3:1–26MathSciNet Baragona R (2001) A simulation study on clustering time series with metaheuristic methods. Quad Stat 3:1–26MathSciNet
5.
Zurück zum Zitat Baragona R (2011) Clusters of multivariate stationary time series by differential evolution and autoregressive distance. In: Kuznetsov SO, et al. (eds) Clusters of time series by differential evolution. Springer, Berlin, pp 382–387 Baragona R (2011) Clusters of multivariate stationary time series by differential evolution and autoregressive distance. In: Kuznetsov SO, et al. (eds) Clusters of time series by differential evolution. Springer, Berlin, pp 382–387
6.
Zurück zum Zitat Bezdek JC et al (2005) Fuzzy models and algorithms for pattern recognition and image processing. Springer, New York Bezdek JC et al (2005) Fuzzy models and algorithms for pattern recognition and image processing. Springer, New York
7.
Zurück zum Zitat Chawla NV (2010) Data mining for imbalanced data sets: an overview. In: Maimon L, Rokach O (eds) Data mining and knowledge discovery handbook. Springer, US Chawla NV (2010) Data mining for imbalanced data sets: an overview. In: Maimon L, Rokach O (eds) Data mining and knowledge discovery handbook. Springer, US
9.
Zurück zum Zitat Dobos L, Abonyi J (2012) On-line detection of homogeneous operation ranges by dynamic principal component analysis based time-series segmentation. Chem Eng Sci 2(75):96–105CrossRef Dobos L, Abonyi J (2012) On-line detection of homogeneous operation ranges by dynamic principal component analysis based time-series segmentation. Chem Eng Sci 2(75):96–105CrossRef
10.
Zurück zum Zitat Dohare, D. and Devi, V. S. (2011) Combination of similarity measures for time series classification using genetic algorithms. In: IEEE Congress of evolutionary computation (CEC), pp 401–408 Dohare, D. and Devi, V. S. (2011) Combination of similarity measures for time series classification using genetic algorithms. In: IEEE Congress of evolutionary computation (CEC), pp 401–408
11.
12.
Zurück zum Zitat Downs J, Vogel EF (1993) A plant-wide industrial process control problem. Comput Chem Eng 17(3):245–255CrossRef Downs J, Vogel EF (1993) A plant-wide industrial process control problem. Comput Chem Eng 17(3):245–255CrossRef
13.
Zurück zum Zitat Durán-Rosal AM et al (2017) Detection and prediction of segments containing extreme significant wave heights. Ocean Eng 142:268–279CrossRef Durán-Rosal AM et al (2017) Detection and prediction of segments containing extreme significant wave heights. Ocean Eng 142:268–279CrossRef
14.
Zurück zum Zitat Eslamloueyan R (2011) Designing a hierarchical neural network based on fuzzy clustering for fault diagnosis of the Tennessee-Eastman process. Appl Soft Comput 11(1):1407–1415CrossRef Eslamloueyan R (2011) Designing a hierarchical neural network based on fuzzy clustering for fault diagnosis of the Tennessee-Eastman process. Appl Soft Comput 11(1):1407–1415CrossRef
15.
Zurück zum Zitat Fontes CHO, Budman HM (2017) A hybrid clustering approach for multivariate time series—a case study applied to failure analysis in a gas turbine. ISA Trans 71:513–529CrossRef Fontes CHO, Budman HM (2017) A hybrid clustering approach for multivariate time series—a case study applied to failure analysis in a gas turbine. ISA Trans 71:513–529CrossRef
16.
Zurück zum Zitat Fontes CHO, Budman HM (2018) Evaluation of a hybrid clustering approach for a benchmark industrial system. Ind Eng Chem Res 57:11039–11049CrossRef Fontes CHO, Budman HM (2018) Evaluation of a hybrid clustering approach for a benchmark industrial system. Ind Eng Chem Res 57:11039–11049CrossRef
17.
Zurück zum Zitat Fontes CH, Pereira O (2016) Pattern recognition in multivariate time series—a case study applied to fault detection in a gas turbine. Eng Appl Artif Intell 49:10–18CrossRef Fontes CH, Pereira O (2016) Pattern recognition in multivariate time series—a case study applied to fault detection in a gas turbine. Eng Appl Artif Intell 49:10–18CrossRef
18.
Zurück zum Zitat Fu T (2011) A review on time series data mining. Eng Appl Artif Intell 24:164–181CrossRef Fu T (2011) A review on time series data mining. Eng Appl Artif Intell 24:164–181CrossRef
19.
Zurück zum Zitat García MLL, García-Ródenas R, Gómez AG (2014) Hybrid meta-heuristic optimization algorithms fortime-domain-constrained data clustering. Appl Soft Comput 23:319–332CrossRef García MLL, García-Ródenas R, Gómez AG (2014) Hybrid meta-heuristic optimization algorithms fortime-domain-constrained data clustering. Appl Soft Comput 23:319–332CrossRef
20.
Zurück zum Zitat Ghaemi R et al (2012) A novel Fuzzy C-Means algorithm to generate diverse and desirable cluster solutions used by genetic-based clustering ensemble algorithms. Memet Comp 4:49–71CrossRef Ghaemi R et al (2012) A novel Fuzzy C-Means algorithm to generate diverse and desirable cluster solutions used by genetic-based clustering ensemble algorithms. Memet Comp 4:49–71CrossRef
21.
Zurück zum Zitat Izakian H, Pedrycz W, Jamal I (2015) Fuzzy clustering of time series data using dynamic time warping distance. Eng Appl Artif Intell 39:235–244CrossRef Izakian H, Pedrycz W, Jamal I (2015) Fuzzy clustering of time series data using dynamic time warping distance. Eng Appl Artif Intell 39:235–244CrossRef
22.
Zurück zum Zitat Ji L, Ailing G, Lianyong Y (2011) Application of product cluster method based on fuzzy cluster and genetic algorithm. Procedia Eng 15:4930–4935CrossRef Ji L, Ailing G, Lianyong Y (2011) Application of product cluster method based on fuzzy cluster and genetic algorithm. Procedia Eng 15:4930–4935CrossRef
23.
Zurück zum Zitat Kavitha V, Punithavalli M (2010) Clustering Time series data stream—a literature survey. Int J Comput Sci Inf Secur 8(1):289–294 Kavitha V, Punithavalli M (2010) Clustering Time series data stream—a literature survey. Int J Comput Sci Inf Secur 8(1):289–294
24.
Zurück zum Zitat Lau CK et al (2013) Fault diagnosis of Tennessee Eastman process with multi-scale PCA and ANFIS. Chemom Intell Lab Syst 120:1–14CrossRef Lau CK et al (2013) Fault diagnosis of Tennessee Eastman process with multi-scale PCA and ANFIS. Chemom Intell Lab Syst 120:1–14CrossRef
25.
Zurück zum Zitat Li D, Gu H, Zhang L (2013) A hybrid genetic algorithm–Fuzzy C-Means approach for incomplete data clustering based on nearest-neighbor intervals. Soft Comput 17:1787–1796CrossRef Li D, Gu H, Zhang L (2013) A hybrid genetic algorithm–Fuzzy C-Means approach for incomplete data clustering based on nearest-neighbor intervals. Soft Comput 17:1787–1796CrossRef
26.
Zurück zum Zitat Liao TW (2005) Clustering of time series data—a survey. Pattern Recogn 38:1857–1874MATHCrossRef Liao TW (2005) Clustering of time series data—a survey. Pattern Recogn 38:1857–1874MATHCrossRef
27.
Zurück zum Zitat Liao TW, Ting C-F, Chang P-C (2006) An adaptive genetic clustering method for exploratory mining of feature vector and time series data. Int J Prod Res 44(14):2731–2748MATHCrossRef Liao TW, Ting C-F, Chang P-C (2006) An adaptive genetic clustering method for exploratory mining of feature vector and time series data. Int J Prod Res 44(14):2731–2748MATHCrossRef
29.
Zurück zum Zitat Mishra S, Sahu P, Senapati MR (2019) MASCA–PSO based LLRBFNN model and improved fast and robust FCM algorithm for detection and classification of brain tumor from MR image. Evol Intell 12(4):647–663CrossRef Mishra S, Sahu P, Senapati MR (2019) MASCA–PSO based LLRBFNN model and improved fast and robust FCM algorithm for detection and classification of brain tumor from MR image. Evol Intell 12(4):647–663CrossRef
30.
Zurück zum Zitat Pérez-Ortiz M et al (2019) On the use of evolutionary time series analysis for segmenting paleoclimate data. Neurocomputing 326–327:3–14CrossRef Pérez-Ortiz M et al (2019) On the use of evolutionary time series analysis for segmenting paleoclimate data. Neurocomputing 326–327:3–14CrossRef
31.
Zurück zum Zitat Plant, C., Wohlschlager, A. M. and Zherdin, A. (2009) Interaction-based clustering of multivariate time series. In: Proceedings of the ninth IEEE international conference on data mining. Miami-Floria (USA), pp 914–919 Plant, C., Wohlschlager, A. M. and Zherdin, A. (2009) Interaction-based clustering of multivariate time series. In: Proceedings of the ninth IEEE international conference on data mining. Miami-Floria (USA), pp 914–919
32.
Zurück zum Zitat Rahman MA, Islam MZ (2014) A hybrid clustering technique combining a novel genetic algorithm with K-means. Knowl Based Syst 71:345–365CrossRef Rahman MA, Islam MZ (2014) A hybrid clustering technique combining a novel genetic algorithm with K-means. Knowl Based Syst 71:345–365CrossRef
33.
Zurück zum Zitat Rani S, Sikka G (2012) Recent techniques of clustering of time series data: a survey. Int J Comput Appl 52(15):1–9 Rani S, Sikka G (2012) Recent techniques of clustering of time series data: a survey. Int J Comput Appl 52(15):1–9
34.
Zurück zum Zitat Rato TJ, Reis MS (2013) Fault detection in the Tennessee Eastman benchmark process using dynamic principal components analysis based on decorrelated residuals (DPCA-DR). Chemom Intell Lab Syst 125:101–108CrossRef Rato TJ, Reis MS (2013) Fault detection in the Tennessee Eastman benchmark process using dynamic principal components analysis based on decorrelated residuals (DPCA-DR). Chemom Intell Lab Syst 125:101–108CrossRef
35.
Zurück zum Zitat Ricardez-Sandoval LA, Budman HM, Douglas PL (2009) Simultaneous design and control of chemical processes with application to the Tennessee Eastman process. J Process Control 19(6):1377–1391CrossRef Ricardez-Sandoval LA, Budman HM, Douglas PL (2009) Simultaneous design and control of chemical processes with application to the Tennessee Eastman process. J Process Control 19(6):1377–1391CrossRef
36.
Zurück zum Zitat Ricker NL (1996) Decentralized control of the Tennessee Eastman challenge process. J Process Control 6(4):205–221CrossRef Ricker NL (1996) Decentralized control of the Tennessee Eastman challenge process. J Process Control 6(4):205–221CrossRef
37.
Zurück zum Zitat Rolls-Royce (2010) Training Manual RB 211-G62 DF Rolls-Royce (2010) Training Manual RB 211-G62 DF
38.
Zurück zum Zitat Singhal A, Seborg DE (2005) Clustering multivariate time-series data. J Chemom 19(8):427–438CrossRef Singhal A, Seborg DE (2005) Clustering multivariate time-series data. J Chemom 19(8):427–438CrossRef
39.
Zurück zum Zitat Singhal A, Seborg DE (2006) Evaluation of a pattern matching method for the Tennessee Eastman challenge process. J Process Control 16:601–613CrossRef Singhal A, Seborg DE (2006) Evaluation of a pattern matching method for the Tennessee Eastman challenge process. J Process Control 16:601–613CrossRef
40.
Zurück zum Zitat Tang J et al (2015) A hybrid approach to integrate Fuzzy C-Means based imputation method with genetic algorithm for missing traffic volume data estimation. Transp Res Part C 51:29–40CrossRef Tang J et al (2015) A hybrid approach to integrate Fuzzy C-Means based imputation method with genetic algorithm for missing traffic volume data estimation. Transp Res Part C 51:29–40CrossRef
41.
Zurück zum Zitat Tseng VS et al (2009) Cluster-based genetic segmentation of time series with DWT. Pattern Recogn Lett 30:1190–1197CrossRef Tseng VS et al (2009) Cluster-based genetic segmentation of time series with DWT. Pattern Recogn Lett 30:1190–1197CrossRef
43.
Zurück zum Zitat Wang X et al (2013) Experimental comparison of representation methods and distance measures for time series data. Data Min Knowl Discov 26:275–309MathSciNetCrossRef Wang X et al (2013) Experimental comparison of representation methods and distance measures for time series data. Data Min Knowl Discov 26:275–309MathSciNetCrossRef
44.
Zurück zum Zitat Wang Z et al (2012) Flood season division with an improved Fuzzy C-Mean clustering method in the Taihu Lake Basin in China. Procedia Eng 28:66–74CrossRef Wang Z et al (2012) Flood season division with an improved Fuzzy C-Mean clustering method in the Taihu Lake Basin in China. Procedia Eng 28:66–74CrossRef
45.
Zurück zum Zitat Wikaisuksakul S (2014) A multi-objective genetic algorithm with Fuzzy C-Means for automatic data clustering. Appl Soft Comput 24:679–691CrossRef Wikaisuksakul S (2014) A multi-objective genetic algorithm with Fuzzy C-Means for automatic data clustering. Appl Soft Comput 24:679–691CrossRef
46.
Zurück zum Zitat Wu J et al (2014) Prediction of solar radiation with genetic approach combing multi-model framework. Renew Energy 66:132–139CrossRef Wu J et al (2014) Prediction of solar radiation with genetic approach combing multi-model framework. Renew Energy 66:132–139CrossRef
47.
Zurück zum Zitat Xun, L. and Zhishu, L. (2010) The similarity of multivariate time series and its application. In: Proceedings of the international conference on management of e-Commerce and e-Government, Sichuan, China, pp 76–81 Xun, L. and Zhishu, L. (2010) The similarity of multivariate time series and its application. In: Proceedings of the international conference on management of e-Commerce and e-Government, Sichuan, China, pp 76–81
48.
Zurück zum Zitat Yang C-L et al (2015) Non-dominated sorting genetic algorithm using fuzzy membershipchromosome for categorical data clustering. Appl Soft Comput 30:113–122CrossRef Yang C-L et al (2015) Non-dominated sorting genetic algorithm using fuzzy membershipchromosome for categorical data clustering. Appl Soft Comput 30:113–122CrossRef
49.
Zurück zum Zitat Yang, K. and Shahabi, C. (2004) A PCA-based similarity measure for multivariate time series. In: Proceedings of the International workshop on multimedia databases, ACM-MMDB, Washington DC, USA, pp 1–10 Yang, K. and Shahabi, C. (2004) A PCA-based similarity measure for multivariate time series. In: Proceedings of the International workshop on multimedia databases, ACM-MMDB, Washington DC, USA, pp 1–10
50.
Zurück zum Zitat Yin S et al (2012) A comparison study of basic data-driven fault diagnosis and process monitoring methods on the benchmark Tennessee Eastman process. J Process Control 22(9):1567–1581CrossRef Yin S et al (2012) A comparison study of basic data-driven fault diagnosis and process monitoring methods on the benchmark Tennessee Eastman process. J Process Control 22(9):1567–1581CrossRef
Metadaten
Titel
Genetic algorithm-based fuzzy clustering applied to multivariate time series
verfasst von
Karine do Prado Ribeiro
Cristiano Hora Fontes
Gabriel Jesus Alves de Melo
Publikationsdatum
21.05.2020
Verlag
Springer Berlin Heidelberg
Erschienen in
Evolutionary Intelligence / Ausgabe 4/2021
Print ISSN: 1864-5909
Elektronische ISSN: 1864-5917
DOI
https://doi.org/10.1007/s12065-020-00422-8

Weitere Artikel der Ausgabe 4/2021

Evolutionary Intelligence 4/2021 Zur Ausgabe

Premium Partner