Skip to main content
Top
Published in: Advances in Data Analysis and Classification 1/2019

12-10-2018 | Regular Article

Random effects clustering in multilevel modeling: choosing a proper partition

Authors: Claudio Conversano, Massimo Cannas, Francesco Mola, Emiliano Sironi

Published in: Advances in Data Analysis and Classification | Issue 1/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

A novel criterion for estimating a latent partition of the observed groups based on the output of a hierarchical model is presented. It is based on a loss function combining the Gini income inequality ratio and the predictability index of Goodman and Kruskal in order to achieve maximum heterogeneity of random effects across groups and maximum homogeneity of predicted probabilities inside estimated clusters. The index is compared with alternative approaches in a simulation study and applied in a case study concerning the role of hospital level variables in deciding for a cesarean section.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees. Wadsworth, BelmontMATH Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees. Wadsworth, BelmontMATH
go back to reference Caceres IA, Arcaya M, Declercq E, Belanoff CM, Janakiraman V, Cohen B, Ecker J, Smith LA, Subramanian SV (2013) Hospital differences in cesarean deliveries in Massachusetts (US) 2004–2006: the case against case-mix artifact. PLoS ONE 8(3):e57817CrossRef Caceres IA, Arcaya M, Declercq E, Belanoff CM, Janakiraman V, Cohen B, Ecker J, Smith LA, Subramanian SV (2013) Hospital differences in cesarean deliveries in Massachusetts (US) 2004–2006: the case against case-mix artifact. PLoS ONE 8(3):e57817CrossRef
go back to reference Cannas M, Conversano C, Mola F, Sironi E (2017) Variation in caesarean delivery rates across hospitals: a Bayesian semi-parametric approach. J Appl Stat 44(12):2095–2107MathSciNetCrossRef Cannas M, Conversano C, Mola F, Sironi E (2017) Variation in caesarean delivery rates across hospitals: a Bayesian semi-parametric approach. J Appl Stat 44(12):2095–2107MathSciNetCrossRef
go back to reference Dagum C (1997) A new approach to the decomposition of the Gini income inequality ratio. Empir Econ 22:515–531CrossRef Dagum C (1997) A new approach to the decomposition of the Gini income inequality ratio. Empir Econ 22:515–531CrossRef
go back to reference Dahl DB (2006) Model-based clustering for expression data via a Dirichlet process mixture model. In: Do KA, Muller P, Vannucci M (eds) Bayesian inference for gene expression and proteomics. Cambridge University Press, Cambridge, pp 201–218CrossRef Dahl DB (2006) Model-based clustering for expression data via a Dirichlet process mixture model. In: Do KA, Muller P, Vannucci M (eds) Bayesian inference for gene expression and proteomics. Cambridge University Press, Cambridge, pp 201–218CrossRef
go back to reference Duncan C, Jones K, Moon G (1998) Context, composition and heterogeneity: using multilevel models in health research. Soc Sci Med 46:97–117CrossRef Duncan C, Jones K, Moon G (1998) Context, composition and heterogeneity: using multilevel models in health research. Soc Sci Med 46:97–117CrossRef
go back to reference Dunson D (2008) Nonparametric Bayes applications to biostatistics (Tech. Rep.). Biostatistics Branch, National Institute of Environmental Health Sciences, U.S. National, Institute of Health, USA Dunson D (2008) Nonparametric Bayes applications to biostatistics (Tech. Rep.). Biostatistics Branch, National Institute of Environmental Health Sciences, U.S. National, Institute of Health, USA
go back to reference Egidi L, Pappadá R, Pauli F, Torelli N (2018) Relabelling in Bayesian mixture models by pivotal units. Stat Comput 28(4):957–969MathSciNetCrossRefMATH Egidi L, Pappadá R, Pauli F, Torelli N (2018) Relabelling in Bayesian mixture models by pivotal units. Stat Comput 28(4):957–969MathSciNetCrossRefMATH
go back to reference European Perinatal Health Report (2013) The health and care of pregnant women and babies in Europe in 2010. EURO-PERISTAT Project with SCPE and EUROCAT, Bruxelles European Perinatal Health Report (2013) The health and care of pregnant women and babies in Europe in 2010. EURO-PERISTAT Project with SCPE and EUROCAT, Bruxelles
go back to reference Goodman LA, Kruskal WH (1954) Measures of association for cross classification. J Am Stat Assoc 48:732–762MATH Goodman LA, Kruskal WH (1954) Measures of association for cross classification. J Am Stat Assoc 48:732–762MATH
go back to reference Grilli L, Panzera A, Rampichini C (2018) Clustering upper level units in multilevel models for ordinal data. In: Mola F, Conversano C, Vichi M (eds) Classification, (big) data analysis and statistical learning. Springer, Cham, pp 137–144CrossRef Grilli L, Panzera A, Rampichini C (2018) Clustering upper level units in multilevel models for ordinal data. In: Mola F, Conversano C, Vichi M (eds) Classification, (big) data analysis and statistical learning. Springer, Cham, pp 137–144CrossRef
go back to reference Guglielmi A, Ieva F, Paganoni AM, Ruggeri F, Soriano J (2014) Semiparametric bayesian models for clustering and classification in the presence of unbalanced in-hospital survival. J R Stat Soc C (Appl Stat) 63:25–46MathSciNetCrossRef Guglielmi A, Ieva F, Paganoni AM, Ruggeri F, Soriano J (2014) Semiparametric bayesian models for clustering and classification in the presence of unbalanced in-hospital survival. J R Stat Soc C (Appl Stat) 63:25–46MathSciNetCrossRef
go back to reference Jara A, Hanson T, Quintana F, Mueller P, Rosner G (2011) DPpackage: Bayesian semi-and nonparametric modeling in R. J Stat Softw 40(5):1–30CrossRef Jara A, Hanson T, Quintana F, Mueller P, Rosner G (2011) DPpackage: Bayesian semi-and nonparametric modeling in R. J Stat Softw 40(5):1–30CrossRef
go back to reference Kleinman KP, Ibrahim JG (1998) A semi-parametric Bayesian approach to generalized linear mixed models. Stat Med 17:2579–2596CrossRef Kleinman KP, Ibrahim JG (1998) A semi-parametric Bayesian approach to generalized linear mixed models. Stat Med 17:2579–2596CrossRef
go back to reference Kozhimannil KB, Law MR, Virnig BA (2013) Cesarean delivery rates vary among US hospitals: reducing variation may address quality and cost issues. Health Aff 32(3):527–535CrossRef Kozhimannil KB, Law MR, Virnig BA (2013) Cesarean delivery rates vary among US hospitals: reducing variation may address quality and cost issues. Health Aff 32(3):527–535CrossRef
go back to reference Lee Y, Roberts CL, Patterson JA, Simpson JM, Nicholl MC, Morris JM, Ford JB (2013) Unexplained variation in hospital caesarean section rates. Med J Aust 199(5):348–353CrossRef Lee Y, Roberts CL, Patterson JA, Simpson JM, Nicholl MC, Morris JM, Ford JB (2013) Unexplained variation in hospital caesarean section rates. Med J Aust 199(5):348–353CrossRef
go back to reference MacEachern SN (2000) Dependent nonparametric processes, Technical report. Dept. of Statistics, Ohio State University, Ohio MacEachern SN (2000) Dependent nonparametric processes, Technical report. Dept. of Statistics, Ohio State University, Ohio
go back to reference Medvedovic M, Yeung K, Bumgarner R (2004) Bayesian mixture model based clustering of replicated microarray data. Bioinformatics 20:1222–1232CrossRef Medvedovic M, Yeung K, Bumgarner R (2004) Bayesian mixture model based clustering of replicated microarray data. Bioinformatics 20:1222–1232CrossRef
go back to reference Mola F, Siciliano R (1997) A fast splitting procedure for classification trees. Stat Comput 7:209–216CrossRef Mola F, Siciliano R (1997) A fast splitting procedure for classification trees. Stat Comput 7:209–216CrossRef
go back to reference Pitman J, Yor M (1997) The two-parameter Poisson Dirichlet distribution derived from a stable subordinator. Ann Probab 25:855–900MathSciNetCrossRefMATH Pitman J, Yor M (1997) The two-parameter Poisson Dirichlet distribution derived from a stable subordinator. Ann Probab 25:855–900MathSciNetCrossRefMATH
go back to reference Rastelli R, Friel N (2017) Optimal Bayesian estimators for latent variable cluster models. Stat Comput 28(6):1169–1186MathSciNetCrossRef Rastelli R, Friel N (2017) Optimal Bayesian estimators for latent variable cluster models. Stat Comput 28(6):1169–1186MathSciNetCrossRef
go back to reference Roberts CL, Nippita TA (2015) International caesarean section rates: the rising tide. Lancet Glob Health 3(5):111–117CrossRef Roberts CL, Nippita TA (2015) International caesarean section rates: the rising tide. Lancet Glob Health 3(5):111–117CrossRef
go back to reference Sturtz S, Ligges U, Gelman A (2005) R2WinBUGS: a package for running WinBUGS from R. J Stat Softw 12(3):1–16CrossRef Sturtz S, Ligges U, Gelman A (2005) R2WinBUGS: a package for running WinBUGS from R. J Stat Softw 12(3):1–16CrossRef
go back to reference Tutz G, Oelker M (2017) Modeling clustered heterogeneity: fixed effects, random effects and mixtures. Int Stat Rev 85(2):204–227MathSciNetCrossRef Tutz G, Oelker M (2017) Modeling clustered heterogeneity: fixed effects, random effects and mixtures. Int Stat Rev 85(2):204–227MathSciNetCrossRef
Metadata
Title
Random effects clustering in multilevel modeling: choosing a proper partition
Authors
Claudio Conversano
Massimo Cannas
Francesco Mola
Emiliano Sironi
Publication date
12-10-2018
Publisher
Springer Berlin Heidelberg
Published in
Advances in Data Analysis and Classification / Issue 1/2019
Print ISSN: 1862-5347
Electronic ISSN: 1862-5355
DOI
https://doi.org/10.1007/s11634-018-0347-9

Other articles of this Issue 1/2019

Advances in Data Analysis and Classification 1/2019 Go to the issue

Premium Partner