Skip to main content
Erschienen in: Advances in Data Analysis and Classification 1/2019

24.08.2018 | Regular Article

Finite mixture of regression models for censored data based on scale mixtures of normal distributions

verfasst von: Camila Borelli Zeller, Celso Rômulo Barbosa Cabral, Víctor Hugo Lachos, Luis Benites

Erschienen in: Advances in Data Analysis and Classification | Ausgabe 1/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In statistical analysis, particularly in econometrics, the finite mixture of regression models based on the normality assumption is routinely used to analyze censored data. In this work, an extension of this model is proposed by considering scale mixtures of normal distributions (SMN). This approach allows us to model data with great flexibility, accommodating multimodality and heavy tails at the same time. The main virtue of considering the finite mixture of regression models for censored data under the SMN class is that this class of models has a nice hierarchical representation which allows easy implementation of inferences. We develop a simple EM-type algorithm to perform maximum likelihood inference of the parameters in the proposed model. To examine the performance of the proposed method, we present some simulation studies and analyze a real dataset. The proposed algorithm and methods are implemented in the new R package CensMixReg.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Andrews DF, Mallows CL (1974) Scale mixtures of normal distributions. J R Stat Soc Ser B 36:99–102MathSciNetMATH Andrews DF, Mallows CL (1974) Scale mixtures of normal distributions. J R Stat Soc Ser B 36:99–102MathSciNetMATH
Zurück zum Zitat Arellano-Valle RB, Castro L, González-Farías G, Muños Gajardo K (2012) Student-t censored regression model: properties and inference. Stat Methods Appl 21:453–473MathSciNetCrossRefMATH Arellano-Valle RB, Castro L, González-Farías G, Muños Gajardo K (2012) Student-t censored regression model: properties and inference. Stat Methods Appl 21:453–473MathSciNetCrossRefMATH
Zurück zum Zitat Ateya SF (2014) Maximum likelihood estimation under a finite mixture of generalized exponential distributions based on censored data. Stat Pap 55:311–325MathSciNetCrossRefMATH Ateya SF (2014) Maximum likelihood estimation under a finite mixture of generalized exponential distributions based on censored data. Stat Pap 55:311–325MathSciNetCrossRefMATH
Zurück zum Zitat Basso RM, Lachos VH, Cabral CRB, Ghosh P (2010) Robust mixture modeling based on scale mixtures of skew-normal distributions. Comput Stat Data Anal 54:2926–2941MathSciNetCrossRefMATH Basso RM, Lachos VH, Cabral CRB, Ghosh P (2010) Robust mixture modeling based on scale mixtures of skew-normal distributions. Comput Stat Data Anal 54:2926–2941MathSciNetCrossRefMATH
Zurück zum Zitat Cabral CRB, Lachos VH, Prates MO (2012) Multivariate mixture modeling using skew-normal independent distributions. Comput Stat Data Anal 56:126–142MathSciNetCrossRefMATH Cabral CRB, Lachos VH, Prates MO (2012) Multivariate mixture modeling using skew-normal independent distributions. Comput Stat Data Anal 56:126–142MathSciNetCrossRefMATH
Zurück zum Zitat Caudill SB (2012) A partially adaptive estimator for the censored regression model based on a mixture of normal distributions. Stat Methods Appl 21:121–137MathSciNetCrossRef Caudill SB (2012) A partially adaptive estimator for the censored regression model based on a mixture of normal distributions. Stat Methods Appl 21:121–137MathSciNetCrossRef
Zurück zum Zitat Cuesta-Albertos JA, Gordaliza A, Matrán C (1997) Trimmed \(k\)-means: an attempt to robustify quantizers. Annal Stat 25:553–576MathSciNetCrossRefMATH Cuesta-Albertos JA, Gordaliza A, Matrán C (1997) Trimmed \(k\)-means: an attempt to robustify quantizers. Annal Stat 25:553–576MathSciNetCrossRefMATH
Zurück zum Zitat Depraetere N, Vandebroek M (2014) Order selection in finite mixtures of linear regressions: literature review and a simulation study. Stat Pap 55:871–911MathSciNetCrossRefMATH Depraetere N, Vandebroek M (2014) Order selection in finite mixtures of linear regressions: literature review and a simulation study. Stat Pap 55:871–911MathSciNetCrossRefMATH
Zurück zum Zitat Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B 39:1–38MathSciNetMATH Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B 39:1–38MathSciNetMATH
Zurück zum Zitat Fagundes RA, de Souza RM, Cysneiros FJA (2013) Robust regression with application to symbolic interval data. Eng Appl Artif Intell 26:564–573CrossRef Fagundes RA, de Souza RM, Cysneiros FJA (2013) Robust regression with application to symbolic interval data. Eng Appl Artif Intell 26:564–573CrossRef
Zurück zum Zitat Frühwirth-Schnatter S (2006) Finite mixture and Markov switching models. Springer, New YorkMATH Frühwirth-Schnatter S (2006) Finite mixture and Markov switching models. Springer, New YorkMATH
Zurück zum Zitat Galimberti G, Soffritti G (2014) A multivariate linear regression analysis using finite mixtures of t distributions. Comput Stat Data Anal 71:138–150MathSciNetCrossRefMATH Galimberti G, Soffritti G (2014) A multivariate linear regression analysis using finite mixtures of t distributions. Comput Stat Data Anal 71:138–150MathSciNetCrossRefMATH
Zurück zum Zitat Garay AM, Lachos VH, Bolfarine H, Cabral CRB (2015) Linear censored regression models with scale mixtures of normal distributions. Stat Pap 58:247–278MathSciNetCrossRefMATH Garay AM, Lachos VH, Bolfarine H, Cabral CRB (2015) Linear censored regression models with scale mixtures of normal distributions. Stat Pap 58:247–278MathSciNetCrossRefMATH
Zurück zum Zitat Garay AM, Lachos VH, Lin TI (2016) Nonlinear censored regression models with heavy-tailed distributions. Stat Interface 9:281–293MathSciNetCrossRefMATH Garay AM, Lachos VH, Lin TI (2016) Nonlinear censored regression models with heavy-tailed distributions. Stat Interface 9:281–293MathSciNetCrossRefMATH
Zurück zum Zitat Greene WH (2012) Econometric analysis, 7th edn. Pearson, Harlow Greene WH (2012) Econometric analysis, 7th edn. Pearson, Harlow
Zurück zum Zitat Grün B, Leisch F (2008) Finite mixtures of generalized linear regression models. In: Recent advances in linear models and related areas: essays in honour of helge toutenburg. Physica-Verlag HD, Heidelberg, pp 205–230 Grün B, Leisch F (2008) Finite mixtures of generalized linear regression models. In: Recent advances in linear models and related areas: essays in honour of helge toutenburg. Physica-Verlag HD, Heidelberg, pp 205–230
Zurück zum Zitat He J (2013) Mixture model based multivariate statistical analysis of multiply censored environmental data. Adv Water Res 59:15–24CrossRef He J (2013) Mixture model based multivariate statistical analysis of multiply censored environmental data. Adv Water Res 59:15–24CrossRef
Zurück zum Zitat Hennig C (2000) Identifiablity of models for clusterwise linear regression. J Classif 17:273–296CrossRefMATH Hennig C (2000) Identifiablity of models for clusterwise linear regression. J Classif 17:273–296CrossRefMATH
Zurück zum Zitat Lachos VH, Moreno EJL, Chen K, Cabral CRB (2017) Finite mixture modeling of censored data using the multivariate student-t distribution. J Multivar Anal 159:151–167MathSciNetCrossRefMATH Lachos VH, Moreno EJL, Chen K, Cabral CRB (2017) Finite mixture modeling of censored data using the multivariate student-t distribution. J Multivar Anal 159:151–167MathSciNetCrossRefMATH
Zurück zum Zitat Lange KL, Sinsheimer JS (1993) Normal/independent distributions and their applications in robust regression. J Comput Graph Stat 2:175–198MathSciNet Lange KL, Sinsheimer JS (1993) Normal/independent distributions and their applications in robust regression. J Comput Graph Stat 2:175–198MathSciNet
Zurück zum Zitat Lin TI, Ho HJ, Lee CR (2014) Flexible mixture modelling using the multivariate skew-t-normal distribution. Stat Comput 24:531–546MathSciNetCrossRefMATH Lin TI, Ho HJ, Lee CR (2014) Flexible mixture modelling using the multivariate skew-t-normal distribution. Stat Comput 24:531–546MathSciNetCrossRefMATH
Zurück zum Zitat Liu C, Rubin DB (1994) The ECME algorithm: a simple extension of EM and ECM with faster monotone convergence. Biometrika 81:633–648MathSciNetCrossRefMATH Liu C, Rubin DB (1994) The ECME algorithm: a simple extension of EM and ECM with faster monotone convergence. Biometrika 81:633–648MathSciNetCrossRefMATH
Zurück zum Zitat Louis T (1982) Finding the observed information matrix when using the em algorithm. J R Stat Soc Ser B 44:226–233MathSciNetMATH Louis T (1982) Finding the observed information matrix when using the em algorithm. J R Stat Soc Ser B 44:226–233MathSciNetMATH
Zurück zum Zitat Massuia MB, Cabral CRB, Matos LA, Lachos VH (2015) Influence diagnostics for student-t censored linear regression models. Statistics 49:1074–1094MathSciNetCrossRefMATH Massuia MB, Cabral CRB, Matos LA, Lachos VH (2015) Influence diagnostics for student-t censored linear regression models. Statistics 49:1074–1094MathSciNetCrossRefMATH
Zurück zum Zitat MATLAB (2016) version 9.0 (R2016a). The MathWorks Inc., Natick, Massachusetts MATLAB (2016) version 9.0 (R2016a). The MathWorks Inc., Natick, Massachusetts
Zurück zum Zitat McLachlan GJ, Krishnan T (2008) The EM algorithm and extensions. John Wiley & Sons, New JerseyCrossRefMATH McLachlan GJ, Krishnan T (2008) The EM algorithm and extensions. John Wiley & Sons, New JerseyCrossRefMATH
Zurück zum Zitat Melenberg B, Soest AV (1996) Parametric and semi-parametric modeling of vacation expenditures. J Appl Econ 11:59–76CrossRef Melenberg B, Soest AV (1996) Parametric and semi-parametric modeling of vacation expenditures. J Appl Econ 11:59–76CrossRef
Zurück zum Zitat Mroz TA (1987) The sensitivity of an empirical model of married women’s hours of work to economic and statistical assumptions. Econometrica 55:765–799CrossRef Mroz TA (1987) The sensitivity of an empirical model of married women’s hours of work to economic and statistical assumptions. Econometrica 55:765–799CrossRef
Zurück zum Zitat Raftery AE (1995) Bayesian model selection in social research. Sociol Methodol 25:111–163CrossRef Raftery AE (1995) Bayesian model selection in social research. Sociol Methodol 25:111–163CrossRef
Zurück zum Zitat Tzortzis G, Likas A (2014) The MinMax k-Means clustering algorithm. Pattern Recognit 47:2505–2516CrossRef Tzortzis G, Likas A (2014) The MinMax k-Means clustering algorithm. Pattern Recognit 47:2505–2516CrossRef
Zurück zum Zitat Vaida F, Liu L (2009) Fast implementation for normal mixed effects models with censored response. J Comput Graph Stat 18:797–817MathSciNetCrossRef Vaida F, Liu L (2009) Fast implementation for normal mixed effects models with censored response. J Comput Graph Stat 18:797–817MathSciNetCrossRef
Zurück zum Zitat Vuong QH (1989) Likelihood ratio tests for model selection and non-nested hypotheses. Econom J Econom Soc 57:307–333MATH Vuong QH (1989) Likelihood ratio tests for model selection and non-nested hypotheses. Econom J Econom Soc 57:307–333MATH
Zurück zum Zitat Witte A (1980) Estimating an economic model of crime with individual data. Q J Econ 94:57–84CrossRef Witte A (1980) Estimating an economic model of crime with individual data. Q J Econ 94:57–84CrossRef
Zurück zum Zitat Zhang B (2003) Regression clustering. In: Proceedings of the third IEEE international conference on data mining, Melbourne Zhang B (2003) Regression clustering. In: Proceedings of the third IEEE international conference on data mining, Melbourne
Zurück zum Zitat Zeller CB, Cabral CRB, Lachos VH (2016) Robust mixture regression modeling based on scale mixtures of skew-normal distributions. Test 25:375–396MathSciNetCrossRefMATH Zeller CB, Cabral CRB, Lachos VH (2016) Robust mixture regression modeling based on scale mixtures of skew-normal distributions. Test 25:375–396MathSciNetCrossRefMATH
Metadaten
Titel
Finite mixture of regression models for censored data based on scale mixtures of normal distributions
verfasst von
Camila Borelli Zeller
Celso Rômulo Barbosa Cabral
Víctor Hugo Lachos
Luis Benites
Publikationsdatum
24.08.2018
Verlag
Springer Berlin Heidelberg
Erschienen in
Advances in Data Analysis and Classification / Ausgabe 1/2019
Print ISSN: 1862-5347
Elektronische ISSN: 1862-5355
DOI
https://doi.org/10.1007/s11634-018-0337-y

Weitere Artikel der Ausgabe 1/2019

Advances in Data Analysis and Classification 1/2019 Zur Ausgabe

Premium Partner