Published in: Journal of Classification 3/2022

Published: 23 August 2022

Infinite Mixtures of Multivariate Normal-Inverse Gaussian Distributions for Clustering of Skewed Data

Authors: Yuan Fang, Dimitris Karlis, Sanjeena Subedi

Abstract

Mixtures of multivariate normal inverse Gaussian (MNIG) distributions can be used to cluster data that exhibit features such as skewness and heavy tails. In cluster analysis with a traditional finite mixture model framework, the number of components must either be known a priori or be estimated a posteriori, using a model selection criterion, after fitting models over a range of possible numbers of components. However, different model selection criteria can select different numbers of components, which creates uncertainty. Here, an infinite mixture model framework, also known as a Dirichlet process mixture model, is proposed for mixtures of MNIG distributions. This approach allows the number of components to grow or decay freely from 1 to \(\infty\) (in practice from 1 to N, the number of observations), and the number of components is inferred along with the parameter estimates in a Bayesian framework, which alleviates the need for model selection criteria. We run our algorithm on simulated as well as real benchmark datasets and compare it with other clustering approaches. The proposed method provides competitive results for both simulations and real data.
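
To illustrate how a Dirichlet process prior lets the number of occupied components range from 1 to N, the following minimal sketch simulates cluster labels from the Chinese restaurant process, a standard representation of the partition that a Dirichlet process induces. It is not the authors' sampler and fits no MNIG components; the function name `sample_crp_labels` and the concentration parameter `alpha` are assumptions introduced here for illustration.

```python
import random


def sample_crp_labels(n_obs, alpha, rng):
    """Draw cluster labels for n_obs >= 1 observations from a Chinese restaurant process.

    alpha is the (assumed) concentration parameter: larger values make
    opening a new cluster more likely.
    """
    labels = [0]   # the first observation always opens cluster 0
    counts = [1]   # counts[k] = number of observations currently in cluster k
    for _ in range(1, n_obs):
        # An existing cluster k is chosen with weight counts[k];
        # a brand-new cluster is chosen with weight alpha.
        weights = counts + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(counts):
            counts.append(1)   # open a new cluster
        else:
            counts[k] += 1     # join an existing cluster
        labels.append(k)
    return labels


if __name__ == "__main__":
    rng = random.Random(0)
    for alpha in (0.1, 1.0, 10.0):
        labels = sample_crp_labels(200, alpha, rng)
        print(f"alpha={alpha}: {len(set(labels))} occupied clusters out of 200 observations")
```

Because each observation either joins an existing cluster or opens a new one, the number of occupied clusters is always between 1 and the number of observations, so no upper bound on the number of components has to be fixed in advance.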

Metadata
Title
Infinite Mixtures of Multivariate Normal-Inverse Gaussian Distributions for Clustering of Skewed Data
Authors
Yuan Fang
Dimitris Karlis
Sanjeena Subedi
Publication date
23 August 2022
Publisher
Springer US
Published in
Journal of Classification / Issue 3/2022
Print ISSN: 0176-4268
Electronic ISSN: 1432-1343
DOI
https://doi.org/10.1007/s00357-022-09417-9
