Published in: Advances in Data Analysis and Classification 1/2019

28-08-2018 | Regular Article

Clustering via finite nonparametric ICA mixture models

Authors: Xiaotian Zhu, David R. Hunter



Abstract

We propose a novel extension of nonparametric multivariate finite mixture models by dropping the standard conditional independence assumption and incorporating the independent component analysis (ICA) structure instead. This innovation extends nonparametric mixture model estimation methods to situations in which conditional independence, a necessary assumption for the unique identifiability of the parameters in such models, is clearly violated. We formulate an objective function in terms of penalized smoothed Kullback–Leibler distance and introduce the nonlinear smoothed majorization-minimization independent component analysis algorithm for optimizing this function and estimating the model parameters. Our algorithm does not require any labeled observations a priori; it may be used for fully unsupervised clustering problems in a multivariate setting. We have implemented a practical version of this algorithm, which utilizes the FastICA algorithm, in the R package icamix. We illustrate this new methodology using several applications in unsupervised learning and image processing.
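To make the model structure in the abstract concrete, the following is a minimal sketch of how a finite ICA mixture density and the resulting cluster membership probabilities could be evaluated. It is not the authors' algorithm and not the icamix API. It assumes that, within component j, an observation is X = A_j S with the coordinates of S independent, and it assumes the source densities are known functions, whereas the paper estimates them nonparametrically. The names `ica_mixture_density`, `weights`, `mixing`, `source_pdfs`, `laplace_pdf`, and `normal_pdf` are hypothetical.

```python
import numpy as np


def normal_pdf(s):
    """Standard normal density (illustrative source density)."""
    return np.exp(-0.5 * s ** 2) / np.sqrt(2.0 * np.pi)


def laplace_pdf(s):
    """Standard Laplace density (illustrative non-Gaussian source density)."""
    return 0.5 * np.exp(-np.abs(s))


def ica_mixture_density(x, weights, mixing, source_pdfs):
    """Evaluate an assumed finite ICA mixture at the rows of x.

    Assumption: within component j, X = A_j S, where the coordinates of S
    are independent and coordinate k has density source_pdfs[j][k].
    Returns (density, posterior), where posterior[i, j] is the probability
    that row i belongs to component j.
    """
    x = np.atleast_2d(np.asarray(x, dtype=float))
    n, d = x.shape
    comp = np.empty((n, len(weights)))
    for j, (lam, A) in enumerate(zip(weights, mixing)):
        W = np.linalg.inv(A)              # unmixing matrix for component j
        s = x @ W.T                       # recovered sources, one row per point
        prod = np.ones(n)
        for k in range(d):
            prod *= source_pdfs[j][k](s[:, k])
        # Change of variables: f_X(x) = |det W| * prod_k f_k((W x)_k)
        comp[:, j] = lam * abs(np.linalg.det(W)) * prod
    density = comp.sum(axis=1)
    posterior = comp / density[:, None]   # assumes density > 0 at every row
    return density, posterior


# Hypothetical two-component example in two dimensions.
weights = [0.4, 0.6]
mixing = [np.eye(2), np.array([[2.0, 1.0], [0.0, 1.0]])]
source_pdfs = [[normal_pdf, normal_pdf], [laplace_pdf, laplace_pdf]]
points = np.array([[0.0, 0.0], [3.0, 1.0]])
density, posterior = ica_mixture_density(points, weights, mixing, source_pdfs)
labels = posterior.argmax(axis=1)         # hard cluster assignment per point
```

Assigning each point to the component with the largest posterior probability gives an unsupervised clustering. In an actual estimation procedure the weights, mixing matrices, and source densities would all be updated iteratively rather than fixed in advance as they are here.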


Metadata
Title: Clustering via finite nonparametric ICA mixture models
Authors: Xiaotian Zhu, David R. Hunter
Publication date: 28-08-2018
Publisher: Springer Berlin Heidelberg
Published in: Advances in Data Analysis and Classification, Issue 1/2019
Print ISSN: 1862-5347
Electronic ISSN: 1862-5355
DOI: https://doi.org/10.1007/s11634-018-0338-x
