Skip to main content
Top
Published in: International Journal of Machine Learning and Cybernetics 2/2023

17-09-2022 | Original Article

A novel adaptive methodology for removing spurious components in a modified incremental Gaussian mixture model

Authors: Shuping Sun, Yaonan Tong, Biqiang Zhang, Bowen Yang, Long Yan, Peiguang He, Hong Xu

Published in: International Journal of Machine Learning and Cybernetics | Issue 2/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Regarding the computational complexity of the update procedure in the fast incremental Gaussian mixture model (FIGMM) and no efficiency for removing the spurious component in the incremental Gaussian mixture model (IGMM), this study proposes a novel algorithm called the modified incremental Gaussian mixture model (MIGMM) which is an improvement of FIGMM, and a novel adaptive methodology for removing spurious components in the MIGMM. The major contributions in this study are twofold. Firstly, a more simple and efficient prediction matrix update, which is the core of the update procedure in the MIGMM algorithm, is proposed compared to that described in FIGMM. Secondly, an effective exponential model (\(p_{\mathrm {_{Thv}}}\)) related to the number of output components generated in MIGMM, combined with the Mahalanobis distance-based logical matrix (LM), is proposed to remove spurious components and determine the correct components. Based on the highlighted contributions, regarding the removal of spurious components, comparative experiments studied on synthetic and real data sets show that the proposed framework performs robustly compared with other famous information criteria used to determine the number of components. The performance evaluation of IGMM compared with other efficient unsupervised algorithms is verified by conducting on both synthetic and real-world data sets.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Show more products
Literature
1.
go back to reference Zhang J, Yin Z, Wang R (2017) Pattern classification of instantaneous cognitive task-load through GMM clustering, Laplacian Eigenmap, and ensemble SVMs. IEEE/ACM Trans Comput Biol Bioinf 14:947–965CrossRef Zhang J, Yin Z, Wang R (2017) Pattern classification of instantaneous cognitive task-load through GMM clustering, Laplacian Eigenmap, and ensemble SVMs. IEEE/ACM Trans Comput Biol Bioinf 14:947–965CrossRef
2.
go back to reference Li Z, Xia Y, Ji Z, Zhang Y (2017) Brain voxel classification in magnetic resonance images using niche differential evolution based Bayesian inference of variational mixture of Gaussians. Neurocomputing 269:47–57CrossRef Li Z, Xia Y, Ji Z, Zhang Y (2017) Brain voxel classification in magnetic resonance images using niche differential evolution based Bayesian inference of variational mixture of Gaussians. Neurocomputing 269:47–57CrossRef
3.
go back to reference Ortiz-Rosario A, Adeli H, Buford JA (2017) MUSIC-expected maximization gaussian mixture methodology for clustering and detection of task-related neuronal firing rates. Behav Brain Res 317:226–236CrossRef Ortiz-Rosario A, Adeli H, Buford JA (2017) MUSIC-expected maximization gaussian mixture methodology for clustering and detection of task-related neuronal firing rates. Behav Brain Res 317:226–236CrossRef
4.
go back to reference Davari A, Aptoula E, Yanikoglu B, Maier A, Riess C (2018) GMM-based synthetic samples for classification of hyperspectral images with limited training data. IEEE Geosci Remote Sens Lett 15:942–946CrossRef Davari A, Aptoula E, Yanikoglu B, Maier A, Riess C (2018) GMM-based synthetic samples for classification of hyperspectral images with limited training data. IEEE Geosci Remote Sens Lett 15:942–946CrossRef
5.
go back to reference Simms LM, Blair B, Ruz J, Wurtz R, Kaplan AD, Glenn A (2018) A pulse discrimination with a Gaussian mixture model on an FPGA. Nucl Inst Methods Phys Res A 900:1–7CrossRef Simms LM, Blair B, Ruz J, Wurtz R, Kaplan AD, Glenn A (2018) A pulse discrimination with a Gaussian mixture model on an FPGA. Nucl Inst Methods Phys Res A 900:1–7CrossRef
6.
go back to reference Xue W, Jiang T (2018) An adaptive algorithm for target recognition using Gaussian mixture models. Meas J Int Meas Confed 124:233–240CrossRef Xue W, Jiang T (2018) An adaptive algorithm for target recognition using Gaussian mixture models. Meas J Int Meas Confed 124:233–240CrossRef
7.
go back to reference Heinen MR, Engel PM, Pinto RC (2012) Using a gaussian mixture neural network for incremental learning and robotics. In: The 2012 international joint conference on neural networks (IJCNN), pp 1–8 Heinen MR, Engel PM, Pinto RC (2012) Using a gaussian mixture neural network for incremental learning and robotics. In: The 2012 international joint conference on neural networks (IJCNN), pp 1–8
8.
go back to reference Heinen MR, Engel PM, Pinto RC (2011) IGMN: an incremental gaussian mixture network that learns instantaneously from data flows. In: Proc VIII Encontro Nacional de Inteligência Artificial (ENIA2011) Heinen MR, Engel PM, Pinto RC (2011) IGMN: an incremental gaussian mixture network that learns instantaneously from data flows. In: Proc VIII Encontro Nacional de Inteligência Artificial (ENIA2011)
9.
go back to reference Dempster AP (1977) Maximum likelihood estimation from incomplete data via the EM algorithm. J R Stat Soc Ser B (Stat Methodol) 39:1–38MATH Dempster AP (1977) Maximum likelihood estimation from incomplete data via the EM algorithm. J R Stat Soc Ser B (Stat Methodol) 39:1–38MATH
10.
go back to reference Engel PM, Heinen MR Incremental learning of multivariate gaussian mixture models. In: Brazilian symposium on artificial intelligence. Springer, pp 82–91 Engel PM, Heinen MR Incremental learning of multivariate gaussian mixture models. In: Brazilian symposium on artificial intelligence. Springer, pp 82–91
11.
go back to reference Grossberg S (1987) Competitive learning: from interactive activation to adaptive resonance. Cogn Sci 11:23–63CrossRef Grossberg S (1987) Competitive learning: from interactive activation to adaptive resonance. Cogn Sci 11:23–63CrossRef
12.
go back to reference Pinto RC, Engel PM (2015) A fast incremental gaussian mixture model. PLOS ONE 10(10):e0141942CrossRef Pinto RC, Engel PM (2015) A fast incremental gaussian mixture model. PLOS ONE 10(10):e0141942CrossRef
13.
go back to reference Pragr M, Cizek P (2018) Cost of transport estimation for legged robot based on terrain features inference from aerial scan. In: IEEE international conference on intelligent robots and systems, pp 1745–1750 Pragr M, Cizek P (2018) Cost of transport estimation for legged robot based on terrain features inference from aerial scan. In: IEEE international conference on intelligent robots and systems, pp 1745–1750
14.
go back to reference Prágr M, Čížek P (2019) Incremental learning of traversability cost for aerial reconnaissance support to ground units, lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics) 11472 LNCS, pp 412–421 Prágr M, Čížek P (2019) Incremental learning of traversability cost for aerial reconnaissance support to ground units, lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics) 11472 LNCS, pp 412–421
16.
17.
go back to reference Pinto RC, Engel PM (2015) A fast incremental gaussian mixture model. PLoS ONE 10:1–12 Pinto RC, Engel PM (2015) A fast incremental gaussian mixture model. PLoS ONE 10:1–12
18.
go back to reference Chamby-Diaz CJ, Recamonde-Mendoza M, Bazzan LCA, Grunitzki R (2018) Adaptive incremental gaussian mixture network for non-stationary data stream classification. In: Proceedings of the international joint conference on neural networks 2018-July, pp 1–8 Chamby-Diaz CJ, Recamonde-Mendoza M, Bazzan LCA, Grunitzki R (2018) Adaptive incremental gaussian mixture network for non-stationary data stream classification. In: Proceedings of the international joint conference on neural networks 2018-July, pp 1–8
19.
go back to reference Koert D, Trick S, Ewerton M, Lutter (2019) Online learning of an open-ended skill library for collaborative tasks. In: IEEE-RAS international conference on humanoid robots 2018-November, pp 599–606 Koert D, Trick S, Ewerton M, Lutter (2019) Online learning of an open-ended skill library for collaborative tasks. In: IEEE-RAS international conference on humanoid robots 2018-November, pp 599–606
20.
go back to reference Drumond DA, Rolo RM, Costa JFCL (2019) Using Mahalanobis distance to detect and remove outliers in experimental covariograms. Nat Resour Res 28:145–152CrossRef Drumond DA, Rolo RM, Costa JFCL (2019) Using Mahalanobis distance to detect and remove outliers in experimental covariograms. Nat Resour Res 28:145–152CrossRef
21.
go back to reference Singh R, Pal BC, Jabr RA (2009) Statistical representation of distribution system loads using gaussian mixture model. IEEE Transa Power Syst 25:29–37CrossRef Singh R, Pal BC, Jabr RA (2009) Statistical representation of distribution system loads using gaussian mixture model. IEEE Transa Power Syst 25:29–37CrossRef
22.
go back to reference Salmond DJ, Atherton DP, Bather JA (1989) Mixture reduction algorithms for uncertain tracking. IFAC Proc Ser 2:775–780 Salmond DJ, Atherton DP, Bather JA (1989) Mixture reduction algorithms for uncertain tracking. IFAC Proc Ser 2:775–780
23.
go back to reference Proïa F, Pernet A, Thouroude T, Michel G, Clotault J (2016) On the characterization of flowering curves using Gaussian mixture models. J Theor Biol 402:75–88CrossRefMATH Proïa F, Pernet A, Thouroude T, Michel G, Clotault J (2016) On the characterization of flowering curves using Gaussian mixture models. J Theor Biol 402:75–88CrossRefMATH
24.
go back to reference Mungai PK (2017) Using keystroke dynamics in a multi-level architecture to protect online examinations from impersonation. In: 2017 IEEE 2nd international conference on big data analysis, pp 622–627 Mungai PK (2017) Using keystroke dynamics in a multi-level architecture to protect online examinations from impersonation. In: 2017 IEEE 2nd international conference on big data analysis, pp 622–627
25.
go back to reference Aryafar A, Mikaeil R, Ardejani FD, Haghshenas SS, Jafarpour A (2019) Application of non-linear regression and soft computing techniques for modeling process of pollutant adsorption from industrial wastewaters. J Min Environ 10(2):327–337 Aryafar A, Mikaeil R, Ardejani FD, Haghshenas SS, Jafarpour A (2019) Application of non-linear regression and soft computing techniques for modeling process of pollutant adsorption from industrial wastewaters. J Min Environ 10(2):327–337
26.
go back to reference Sun S, Wang H, Chang Z, Mao B, Liu Y (2019) On the Mahalanobis distance classification criterion for a ventricular septal defect diagnosis system. IEEE Sens J 19:2665–2674CrossRef Sun S, Wang H, Chang Z, Mao B, Liu Y (2019) On the Mahalanobis distance classification criterion for a ventricular septal defect diagnosis system. IEEE Sens J 19:2665–2674CrossRef
27.
go back to reference Xie C, Chang J, Liu Y (2013) Estimating the number of components in Gaussian mixture models adaptively. J Inf Comput Sci 10:4453–4460CrossRef Xie C, Chang J, Liu Y (2013) Estimating the number of components in Gaussian mixture models adaptively. J Inf Comput Sci 10:4453–4460CrossRef
28.
go back to reference Keribin C (2000) Consistent estimate of the order of mixture models. Sankhy A Ser A 62:49–66MATH Keribin C (2000) Consistent estimate of the order of mixture models. Sankhy A Ser A 62:49–66MATH
29.
go back to reference Akaike H (1974) A new look at the statistical model identification. IEEE Trans Autom Control 19:716–723CrossRefMATH Akaike H (1974) A new look at the statistical model identification. IEEE Trans Autom Control 19:716–723CrossRefMATH
30.
go back to reference Wang H, Luo B, Zhang Q, Wei S (2004) Estimation for the number of components in a mixture model using stepwise split-and-merge EM algorithm. Pattern Recognit Lett 25:1799–1809CrossRef Wang H, Luo B, Zhang Q, Wei S (2004) Estimation for the number of components in a mixture model using stepwise split-and-merge EM algorithm. Pattern Recognit Lett 25:1799–1809CrossRef
31.
go back to reference Shi L, Xu L (2006) Local factor analysis with automatic model selection: a comparative study and digits recognition application. In: Kollias S, Stafylopatis A, Duch W, Oja E (eds) Artificial neural networks—ICANN 2006. Springer, Berlin, pp 260–269CrossRef Shi L, Xu L (2006) Local factor analysis with automatic model selection: a comparative study and digits recognition application. In: Kollias S, Stafylopatis A, Duch W, Oja E (eds) Artificial neural networks—ICANN 2006. Springer, Berlin, pp 260–269CrossRef
33.
go back to reference Deb S, Tian Z, Fong S, Wong R, Millham R, Wong KK (2018) Elephant search algorithm applied to data clustering. Soft Comput 22:6035–6046CrossRef Deb S, Tian Z, Fong S, Wong R, Millham R, Wong KK (2018) Elephant search algorithm applied to data clustering. Soft Comput 22:6035–6046CrossRef
34.
go back to reference Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V et al (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825–2830MATH Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Blondel M, Prettenhofer P, Weiss R, Dubourg V et al (2011) Scikit-learn: machine learning in python. J Mach Learn Res 12:2825–2830MATH
35.
go back to reference Lecun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86:2278–2324CrossRef Lecun Y, Bottou L, Bengio Y, Haffner P (1998) Gradient-based learning applied to document recognition. Proc IEEE 86:2278–2324CrossRef
36.
go back to reference Krizhevsky HGLA (2009) Learning multiple layers of features from tiny images. Technical Report, Computer Science Department, University of Toronto Krizhevsky HGLA (2009) Learning multiple layers of features from tiny images. Technical Report, Computer Science Department, University of Toronto
37.
go back to reference Kusetogullari H, Yavariabdi A, Cheddad A, Grahn H, Hall J (2020) ARDIS: a Swedish historical handwritten digit dataset. Neural Comput Appl 32:16505–16518CrossRef Kusetogullari H, Yavariabdi A, Cheddad A, Grahn H, Hall J (2020) ARDIS: a Swedish historical handwritten digit dataset. Neural Comput Appl 32:16505–16518CrossRef
38.
go back to reference Wang Y, Chaib-draa B (2016) KNN-based Kalman filter: an efficient and non-stationary method for Gaussian process regression. Knowl-Based Syst 114:148–155CrossRef Wang Y, Chaib-draa B (2016) KNN-based Kalman filter: an efficient and non-stationary method for Gaussian process regression. Knowl-Based Syst 114:148–155CrossRef
Metadata
Title
A novel adaptive methodology for removing spurious components in a modified incremental Gaussian mixture model
Authors
Shuping Sun
Yaonan Tong
Biqiang Zhang
Bowen Yang
Long Yan
Peiguang He
Hong Xu
Publication date
17-09-2022
Publisher
Springer Berlin Heidelberg
Published in
International Journal of Machine Learning and Cybernetics / Issue 2/2023
Print ISSN: 1868-8071
Electronic ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-022-01649-w

Other articles of this Issue 2/2023

International Journal of Machine Learning and Cybernetics 2/2023 Go to the issue