Skip to main content
Erschienen in: Advances in Data Analysis and Classification 2/2020

21.11.2019 | Regular Article

Mixtures of skewed matrix variate bilinear factor analyzers

verfasst von: Michael P. B. Gallaugher, Paul D. McNicholas

Erschienen in: Advances in Data Analysis and Classification | Ausgabe 2/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In recent years, data have become increasingly higher dimensional and, therefore, an increased need has arisen for dimension reduction techniques for clustering. Although such techniques are firmly established in the literature for multivariate data, there is a relative paucity in the area of matrix variate, or three-way, data. Furthermore, the few methods that are available all assume matrix variate normality, which is not always sensible if cluster skewness or excess kurtosis is present. Mixtures of bilinear factor analyzers using skewed matrix variate distributions are proposed. In all, four such mixture models are presented, based on matrix variate skew-t, generalized hyperbolic, variance-gamma, and normal inverse Gaussian distributions, respectively.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
Zurück zum Zitat Anderlucci L, Viroli C (2015) Covariance pattern mixture models for the analysis of multivariate heterogeneous longitudinal data. Ann Appl Stat 9(2):777–800MathSciNetMATH Anderlucci L, Viroli C (2015) Covariance pattern mixture models for the analysis of multivariate heterogeneous longitudinal data. Ann Appl Stat 9(2):777–800MathSciNetMATH
Zurück zum Zitat Andrews JL, McNicholas PD (2011) Extending mixtures of multivariate t-factor analyzers. Stat Comput 21(3):361–373MathSciNetMATH Andrews JL, McNicholas PD (2011) Extending mixtures of multivariate t-factor analyzers. Stat Comput 21(3):361–373MathSciNetMATH
Zurück zum Zitat Andrews JL, McNicholas PD (2012) Model-based clustering, classification, and discriminant analysis via mixtures of multivariate \(t\)-distributions: the \(t\)EIGEN family. Stat Comput 22(5):1021–1029MathSciNetMATH Andrews JL, McNicholas PD (2012) Model-based clustering, classification, and discriminant analysis via mixtures of multivariate \(t\)-distributions: the \(t\)EIGEN family. Stat Comput 22(5):1021–1029MathSciNetMATH
Zurück zum Zitat Baum LE, Petrie T, Soules G, Weiss N (1970) A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Ann Math Stat 41:164–171MathSciNetMATH Baum LE, Petrie T, Soules G, Weiss N (1970) A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains. Ann Math Stat 41:164–171MathSciNetMATH
Zurück zum Zitat Bezanson J, Edelman A, Karpinski S, Shah V B (2017) Julia: a fresh approach to numerical computing. SIAM Rev 59(1):65–98MathSciNetMATH Bezanson J, Edelman A, Karpinski S, Shah V B (2017) Julia: a fresh approach to numerical computing. SIAM Rev 59(1):65–98MathSciNetMATH
Zurück zum Zitat Browne RP, McNicholas PD (2015) A mixture of generalized hyperbolic distributions. Can J Stat 43(2):176–198MathSciNetMATH Browne RP, McNicholas PD (2015) A mixture of generalized hyperbolic distributions. Can J Stat 43(2):176–198MathSciNetMATH
Zurück zum Zitat Chen JT, Gupta AK (2005) Matrix variate skew normal distributions. Statistics 39(3):247–253MathSciNetMATH Chen JT, Gupta AK (2005) Matrix variate skew normal distributions. Statistics 39(3):247–253MathSciNetMATH
Zurück zum Zitat Dang UJ, Browne RP, McNicholas PD (2015) Mixtures of multivariate power exponential distributions. Biometrics 71(4):1081–1089MathSciNetMATH Dang UJ, Browne RP, McNicholas PD (2015) Mixtures of multivariate power exponential distributions. Biometrics 71(4):1081–1089MathSciNetMATH
Zurück zum Zitat Domínguez-Molina JA, González-Farías G, Ramos-Quiroga R, Gupta AK (2007) A matrix variate closed skew-normal distribution with applications to stochastic frontier analysis. Commun Stat Theory Methods 36(9):1691–1703MathSciNetMATH Domínguez-Molina JA, González-Farías G, Ramos-Quiroga R, Gupta AK (2007) A matrix variate closed skew-normal distribution with applications to stochastic frontier analysis. Commun Stat Theory Methods 36(9):1691–1703MathSciNetMATH
Zurück zum Zitat Franczak BC, Browne RP, McNicholas PD (2014) Mixtures of shifted asymmetric Laplace distributions. IEEE Trans Pattern Anal Mach Intell 36(6):1149–1157 Franczak BC, Browne RP, McNicholas PD (2014) Mixtures of shifted asymmetric Laplace distributions. IEEE Trans Pattern Anal Mach Intell 36(6):1149–1157
Zurück zum Zitat Gallaugher MPB, McNicholas PD (2017) A matrix variate skew-t distribution. Stat 6(1):160–170MathSciNet Gallaugher MPB, McNicholas PD (2017) A matrix variate skew-t distribution. Stat 6(1):160–170MathSciNet
Zurück zum Zitat Gallaugher MPB, McNicholas PD (2018a) Finite mixtures of skewed matrix variate distributions. Pattern Recogn 80:83–93 Gallaugher MPB, McNicholas PD (2018a) Finite mixtures of skewed matrix variate distributions. Pattern Recogn 80:83–93
Zurück zum Zitat Gallaugher MPB, McNicholas PD (2018b) Mixtures of matrix variate bilinear factor analyzers. In: Proceedings of the joint statistical meetings. American Statistical Association, Alexandria, VA. arXiv:1712.08664 Gallaugher MPB, McNicholas PD (2018b) Mixtures of matrix variate bilinear factor analyzers. In: Proceedings of the joint statistical meetings. American Statistical Association, Alexandria, VA. arXiv:​1712.​08664
Zurück zum Zitat Gallaugher MPB, McNicholas PD (2019) Three skewed matrix variate distributions. Stat Probab Lett 145:103–109MathSciNetMATH Gallaugher MPB, McNicholas PD (2019) Three skewed matrix variate distributions. Stat Probab Lett 145:103–109MathSciNetMATH
Zurück zum Zitat Ghahramani Z, Hinton GE (1997) The EM algorithm for factor analyzers. Technical report CRG-TR-96-1, University of Toronto, Toronto, Canada Ghahramani Z, Hinton GE (1997) The EM algorithm for factor analyzers. Technical report CRG-TR-96-1, University of Toronto, Toronto, Canada
Zurück zum Zitat Harrar SW, Gupta AK (2008) On matrix variate skew-normal distributions. Statistics 42(2):179–194MathSciNetMATH Harrar SW, Gupta AK (2008) On matrix variate skew-normal distributions. Statistics 42(2):179–194MathSciNetMATH
Zurück zum Zitat Karlis D, Santourian A (2009) Model-based clustering with non-elliptically contoured distributions. Stat Comput 19(1):73–83MathSciNet Karlis D, Santourian A (2009) Model-based clustering with non-elliptically contoured distributions. Stat Comput 19(1):73–83MathSciNet
Zurück zum Zitat Lee S, McLachlan GJ (2014) Finite mixtures of multivariate skew t-distributions: some recent and new results. Stat Comput 24:181–202MathSciNetMATH Lee S, McLachlan GJ (2014) Finite mixtures of multivariate skew t-distributions: some recent and new results. Stat Comput 24:181–202MathSciNetMATH
Zurück zum Zitat Lin T-I (2010) Robust mixture modeling using multivariate skew t distributions. Stat Comput 20(3):343–356MathSciNet Lin T-I (2010) Robust mixture modeling using multivariate skew t distributions. Stat Comput 20(3):343–356MathSciNet
Zurück zum Zitat Lin T-I, McNicholas PD, Hsiu JH (2014) Capturing patterns via parsimonious t mixture models. Stat Probab Lett 88:80–87MathSciNetMATH Lin T-I, McNicholas PD, Hsiu JH (2014) Capturing patterns via parsimonious t mixture models. Stat Probab Lett 88:80–87MathSciNetMATH
Zurück zum Zitat McNicholas PD (2010) Model-based classification using latent Gaussian mixture models. J Stat Plan Inference 140(5):1175–1181MathSciNetMATH McNicholas PD (2010) Model-based classification using latent Gaussian mixture models. J Stat Plan Inference 140(5):1175–1181MathSciNetMATH
Zurück zum Zitat McNicholas PD (2016) Mixture model-based classification. Chapman & Hall/CRC Press, Boca RatonMATH McNicholas PD (2016) Mixture model-based classification. Chapman & Hall/CRC Press, Boca RatonMATH
Zurück zum Zitat McNicholas PD, Murphy TB (2008) Parsimonious Gaussian mixture models. Stat Comput 18(3):285–296MathSciNet McNicholas PD, Murphy TB (2008) Parsimonious Gaussian mixture models. Stat Comput 18(3):285–296MathSciNet
Zurück zum Zitat McNicholas PD, Murphy TB (2010) Model-based clustering of microarray expression data via latent Gaussian mixture models. Bioinformatics 26(21):2705–2712 McNicholas PD, Murphy TB (2010) Model-based clustering of microarray expression data via latent Gaussian mixture models. Bioinformatics 26(21):2705–2712
Zurück zum Zitat McNicholas PD, Tait PA (2019) Data science with Julia. Chapman & Hall/CRC Press, Boca Raton McNicholas PD, Tait PA (2019) Data science with Julia. Chapman & Hall/CRC Press, Boca Raton
Zurück zum Zitat McNicholas SM, McNicholas PD, Browne RP (2017) A mixture of variance-gamma factor analyzers. In: Ahmed SE (ed) Big and complex data analysis: methodologies and applications. Springer, Cham, pp 369–385 McNicholas SM, McNicholas PD, Browne RP (2017) A mixture of variance-gamma factor analyzers. In: Ahmed SE (ed) Big and complex data analysis: methodologies and applications. Springer, Cham, pp 369–385
Zurück zum Zitat Melnykov V, Zhu X (2018) On model-based clustering of skewed matrix data. J Multivar Anal 167:181–194MathSciNetMATH Melnykov V, Zhu X (2018) On model-based clustering of skewed matrix data. J Multivar Anal 167:181–194MathSciNetMATH
Zurück zum Zitat Melnykov V, Zhu X (2019) Studying crime trends in the USA over the years 2000–2012. Adv Data Anal Classif 13(1):325–341MathSciNetMATH Melnykov V, Zhu X (2019) Studying crime trends in the USA over the years 2000–2012. Adv Data Anal Classif 13(1):325–341MathSciNetMATH
Zurück zum Zitat Meng X-L, van Dyk D (1997) The EM algorithm—an old folk song sung to a fast new tune (with discussion). J R Stat Soc B 59(3):511–567MATH Meng X-L, van Dyk D (1997) The EM algorithm—an old folk song sung to a fast new tune (with discussion). J R Stat Soc B 59(3):511–567MATH
Zurück zum Zitat Morris K, McNicholas PD (2013) Dimension reduction for model-based clustering via mixtures of shifted asymmetric Laplace distributions. Stat Probab Lett 83(9):2088–2093MathSciNetMATH Morris K, McNicholas PD (2013) Dimension reduction for model-based clustering via mixtures of shifted asymmetric Laplace distributions. Stat Probab Lett 83(9):2088–2093MathSciNetMATH
Zurück zum Zitat Murray PM, Browne RB, McNicholas PD (2014a) Mixtures of skew-t factor analyzers. Comput Stat Data Anal 77:326–335MathSciNetMATH Murray PM, Browne RB, McNicholas PD (2014a) Mixtures of skew-t factor analyzers. Comput Stat Data Anal 77:326–335MathSciNetMATH
Zurück zum Zitat Murray PM, McNicholas PD, Browne RB (2014b) A mixture of common skew-\(t\) factor analyzers. Stat 3(1):68–82MathSciNetMATH Murray PM, McNicholas PD, Browne RB (2014b) A mixture of common skew-\(t\) factor analyzers. Stat 3(1):68–82MathSciNetMATH
Zurück zum Zitat Murray PM, Browne RB, McNicholas PD (2017) Hidden truncation hyperbolic distributions, finite mixtures thereof, and their application for clustering. J Multivar Anal 161:141–156MathSciNetMATH Murray PM, Browne RB, McNicholas PD (2017) Hidden truncation hyperbolic distributions, finite mixtures thereof, and their application for clustering. J Multivar Anal 161:141–156MathSciNetMATH
Zurück zum Zitat Peel D, McLachlan GJ (2000) Robust mixture modelling using the t distribution. Stat Comput 10(4):339–348 Peel D, McLachlan GJ (2000) Robust mixture modelling using the t distribution. Stat Comput 10(4):339–348
Zurück zum Zitat Scott AJ, Symons MJ (1971) Clustering methods based on likelihood ratio criteria. Biometrics 27:387–397 Scott AJ, Symons MJ (1971) Clustering methods based on likelihood ratio criteria. Biometrics 27:387–397
Zurück zum Zitat Tait PA, McNicholas PD (2019) Clustering higher order data: finite mixtures of multidimensional arrays. arXiv preprint arXiv:1907.08566 Tait PA, McNicholas PD (2019) Clustering higher order data: finite mixtures of multidimensional arrays. arXiv preprint arXiv:​1907.​08566
Zurück zum Zitat Tang Y, Browne RP, McNicholas PD (2018) Flexible clustering of high-dimensional data via mixtures of joint generalized hyperbolic distributions. Stat 7(1):e177MathSciNet Tang Y, Browne RP, McNicholas PD (2018) Flexible clustering of high-dimensional data via mixtures of joint generalized hyperbolic distributions. Stat 7(1):e177MathSciNet
Zurück zum Zitat Tiedeman DV (1955) On the study of types. In: Sells SB (ed) Symposium on pattern analysis. Air University, U.S.A.F. School of Aviation Medicine, Randolph Field Tiedeman DV (1955) On the study of types. In: Sells SB (ed) Symposium on pattern analysis. Air University, U.S.A.F. School of Aviation Medicine, Randolph Field
Zurück zum Zitat Tortora C, Franczak BC, Browne RP, McNicholas PD (2019) A mixture of coalesced generalized hyperbolic distributions. J Classif 36(1):26–57MathSciNetMATH Tortora C, Franczak BC, Browne RP, McNicholas PD (2019) A mixture of coalesced generalized hyperbolic distributions. J Classif 36(1):26–57MathSciNetMATH
Zurück zum Zitat Viroli C (2011) Finite mixtures of matrix normal distributions for classifying three-way data. Stat Comput 21(4):511–522MathSciNetMATH Viroli C (2011) Finite mixtures of matrix normal distributions for classifying three-way data. Stat Comput 21(4):511–522MathSciNetMATH
Zurück zum Zitat Viroli C (2011) Model based clustering for three-way data structures. Bayesian Anal 6:573–602MathSciNetMATH Viroli C (2011) Model based clustering for three-way data structures. Bayesian Anal 6:573–602MathSciNetMATH
Zurück zum Zitat Vrbik I, McNicholas PD (2012) Analytic calculations for the EM algorithm for multivariate skew-t mixture models. Stat Probab Lett 82(6):1169–1174MathSciNetMATH Vrbik I, McNicholas PD (2012) Analytic calculations for the EM algorithm for multivariate skew-t mixture models. Stat Probab Lett 82(6):1169–1174MathSciNetMATH
Zurück zum Zitat Vrbik I, McNicholas PD (2014) Parsimonious skew mixture models for model-based clustering and classification. Comput Stat Data Anal 71:196–210MathSciNetMATH Vrbik I, McNicholas PD (2014) Parsimonious skew mixture models for model-based clustering and classification. Comput Stat Data Anal 71:196–210MathSciNetMATH
Zurück zum Zitat Wishart J (1928) The generalised product moment distribution in samples from a normal multivariate population. Biometrika 20A(1/2):32–52MATH Wishart J (1928) The generalised product moment distribution in samples from a normal multivariate population. Biometrika 20A(1/2):32–52MATH
Zurück zum Zitat Wolfe JH (1965) A computer program for the maximum likelihood analysis of types. Technical bulletin 65-15, U.S. Naval Personnel Research Activity Wolfe JH (1965) A computer program for the maximum likelihood analysis of types. Technical bulletin 65-15, U.S. Naval Personnel Research Activity
Zurück zum Zitat Xie X, Yan S, Kwok JT, Huang TS (2008) Matrix-variate factor analysis and its applications. IEEE Trans Neural Netw 19(10):1821–1826 Xie X, Yan S, Kwok JT, Huang TS (2008) Matrix-variate factor analysis and its applications. IEEE Trans Neural Netw 19(10):1821–1826
Zurück zum Zitat Yu S, Bi J, Ye J (2008) Probabilistic interpretations and extensions for a family of 2D PCA-style algorithms. In: Workshop data mining using matrices and tensors (DMMT 08): proceedings of a workshop held in conjunction with the 14th ACM SIGKDD international conference on knowledge discovery and data mining (SIGKDD 2008) Yu S, Bi J, Ye J (2008) Probabilistic interpretations and extensions for a family of 2D PCA-style algorithms. In: Workshop data mining using matrices and tensors (DMMT 08): proceedings of a workshop held in conjunction with the 14th ACM SIGKDD international conference on knowledge discovery and data mining (SIGKDD 2008)
Zurück zum Zitat Zhao J, Philip L, Kwok JT (2012) Bilinear probabilistic principal component analysis. IEEE Trans Neural Netw Learn Syst 23(3):492–503 Zhao J, Philip L, Kwok JT (2012) Bilinear probabilistic principal component analysis. IEEE Trans Neural Netw Learn Syst 23(3):492–503
Metadaten
Titel
Mixtures of skewed matrix variate bilinear factor analyzers
verfasst von
Michael P. B. Gallaugher
Paul D. McNicholas
Publikationsdatum
21.11.2019
Verlag
Springer Berlin Heidelberg
Erschienen in
Advances in Data Analysis and Classification / Ausgabe 2/2020
Print ISSN: 1862-5347
Elektronische ISSN: 1862-5355
DOI
https://doi.org/10.1007/s11634-019-00377-4

Weitere Artikel der Ausgabe 2/2020

Advances in Data Analysis and Classification 2/2020 Zur Ausgabe

Premium Partner