Skip to main content
Top
Published in: Advances in Data Analysis and Classification 1/2019

24-08-2018 | Regular Article

Finite mixture of regression models for censored data based on scale mixtures of normal distributions

Authors: Camila Borelli Zeller, Celso Rômulo Barbosa Cabral, Víctor Hugo Lachos, Luis Benites

Published in: Advances in Data Analysis and Classification | Issue 1/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In statistical analysis, particularly in econometrics, the finite mixture of regression models based on the normality assumption is routinely used to analyze censored data. In this work, an extension of this model is proposed by considering scale mixtures of normal distributions (SMN). This approach allows us to model data with great flexibility, accommodating multimodality and heavy tails at the same time. The main virtue of considering the finite mixture of regression models for censored data under the SMN class is that this class of models has a nice hierarchical representation which allows easy implementation of inferences. We develop a simple EM-type algorithm to perform maximum likelihood inference of the parameters in the proposed model. To examine the performance of the proposed method, we present some simulation studies and analyze a real dataset. The proposed algorithm and methods are implemented in the new R package CensMixReg.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Andrews DF, Mallows CL (1974) Scale mixtures of normal distributions. J R Stat Soc Ser B 36:99–102MathSciNetMATH Andrews DF, Mallows CL (1974) Scale mixtures of normal distributions. J R Stat Soc Ser B 36:99–102MathSciNetMATH
go back to reference Arellano-Valle RB, Castro L, González-Farías G, Muños Gajardo K (2012) Student-t censored regression model: properties and inference. Stat Methods Appl 21:453–473MathSciNetCrossRefMATH Arellano-Valle RB, Castro L, González-Farías G, Muños Gajardo K (2012) Student-t censored regression model: properties and inference. Stat Methods Appl 21:453–473MathSciNetCrossRefMATH
go back to reference Ateya SF (2014) Maximum likelihood estimation under a finite mixture of generalized exponential distributions based on censored data. Stat Pap 55:311–325MathSciNetCrossRefMATH Ateya SF (2014) Maximum likelihood estimation under a finite mixture of generalized exponential distributions based on censored data. Stat Pap 55:311–325MathSciNetCrossRefMATH
go back to reference Basso RM, Lachos VH, Cabral CRB, Ghosh P (2010) Robust mixture modeling based on scale mixtures of skew-normal distributions. Comput Stat Data Anal 54:2926–2941MathSciNetCrossRefMATH Basso RM, Lachos VH, Cabral CRB, Ghosh P (2010) Robust mixture modeling based on scale mixtures of skew-normal distributions. Comput Stat Data Anal 54:2926–2941MathSciNetCrossRefMATH
go back to reference Cabral CRB, Lachos VH, Prates MO (2012) Multivariate mixture modeling using skew-normal independent distributions. Comput Stat Data Anal 56:126–142MathSciNetCrossRefMATH Cabral CRB, Lachos VH, Prates MO (2012) Multivariate mixture modeling using skew-normal independent distributions. Comput Stat Data Anal 56:126–142MathSciNetCrossRefMATH
go back to reference Caudill SB (2012) A partially adaptive estimator for the censored regression model based on a mixture of normal distributions. Stat Methods Appl 21:121–137MathSciNetCrossRef Caudill SB (2012) A partially adaptive estimator for the censored regression model based on a mixture of normal distributions. Stat Methods Appl 21:121–137MathSciNetCrossRef
go back to reference Depraetere N, Vandebroek M (2014) Order selection in finite mixtures of linear regressions: literature review and a simulation study. Stat Pap 55:871–911MathSciNetCrossRefMATH Depraetere N, Vandebroek M (2014) Order selection in finite mixtures of linear regressions: literature review and a simulation study. Stat Pap 55:871–911MathSciNetCrossRefMATH
go back to reference Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B 39:1–38MathSciNetMATH Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B 39:1–38MathSciNetMATH
go back to reference Fagundes RA, de Souza RM, Cysneiros FJA (2013) Robust regression with application to symbolic interval data. Eng Appl Artif Intell 26:564–573CrossRef Fagundes RA, de Souza RM, Cysneiros FJA (2013) Robust regression with application to symbolic interval data. Eng Appl Artif Intell 26:564–573CrossRef
go back to reference Frühwirth-Schnatter S (2006) Finite mixture and Markov switching models. Springer, New YorkMATH Frühwirth-Schnatter S (2006) Finite mixture and Markov switching models. Springer, New YorkMATH
go back to reference Galimberti G, Soffritti G (2014) A multivariate linear regression analysis using finite mixtures of t distributions. Comput Stat Data Anal 71:138–150MathSciNetCrossRefMATH Galimberti G, Soffritti G (2014) A multivariate linear regression analysis using finite mixtures of t distributions. Comput Stat Data Anal 71:138–150MathSciNetCrossRefMATH
go back to reference Garay AM, Lachos VH, Bolfarine H, Cabral CRB (2015) Linear censored regression models with scale mixtures of normal distributions. Stat Pap 58:247–278MathSciNetCrossRefMATH Garay AM, Lachos VH, Bolfarine H, Cabral CRB (2015) Linear censored regression models with scale mixtures of normal distributions. Stat Pap 58:247–278MathSciNetCrossRefMATH
go back to reference Garay AM, Lachos VH, Lin TI (2016) Nonlinear censored regression models with heavy-tailed distributions. Stat Interface 9:281–293MathSciNetCrossRefMATH Garay AM, Lachos VH, Lin TI (2016) Nonlinear censored regression models with heavy-tailed distributions. Stat Interface 9:281–293MathSciNetCrossRefMATH
go back to reference Greene WH (2012) Econometric analysis, 7th edn. Pearson, Harlow Greene WH (2012) Econometric analysis, 7th edn. Pearson, Harlow
go back to reference Grün B, Leisch F (2008) Finite mixtures of generalized linear regression models. In: Recent advances in linear models and related areas: essays in honour of helge toutenburg. Physica-Verlag HD, Heidelberg, pp 205–230 Grün B, Leisch F (2008) Finite mixtures of generalized linear regression models. In: Recent advances in linear models and related areas: essays in honour of helge toutenburg. Physica-Verlag HD, Heidelberg, pp 205–230
go back to reference He J (2013) Mixture model based multivariate statistical analysis of multiply censored environmental data. Adv Water Res 59:15–24CrossRef He J (2013) Mixture model based multivariate statistical analysis of multiply censored environmental data. Adv Water Res 59:15–24CrossRef
go back to reference Hennig C (2000) Identifiablity of models for clusterwise linear regression. J Classif 17:273–296CrossRefMATH Hennig C (2000) Identifiablity of models for clusterwise linear regression. J Classif 17:273–296CrossRefMATH
go back to reference Lachos VH, Moreno EJL, Chen K, Cabral CRB (2017) Finite mixture modeling of censored data using the multivariate student-t distribution. J Multivar Anal 159:151–167MathSciNetCrossRefMATH Lachos VH, Moreno EJL, Chen K, Cabral CRB (2017) Finite mixture modeling of censored data using the multivariate student-t distribution. J Multivar Anal 159:151–167MathSciNetCrossRefMATH
go back to reference Lange KL, Sinsheimer JS (1993) Normal/independent distributions and their applications in robust regression. J Comput Graph Stat 2:175–198MathSciNet Lange KL, Sinsheimer JS (1993) Normal/independent distributions and their applications in robust regression. J Comput Graph Stat 2:175–198MathSciNet
go back to reference Lin TI, Ho HJ, Lee CR (2014) Flexible mixture modelling using the multivariate skew-t-normal distribution. Stat Comput 24:531–546MathSciNetCrossRefMATH Lin TI, Ho HJ, Lee CR (2014) Flexible mixture modelling using the multivariate skew-t-normal distribution. Stat Comput 24:531–546MathSciNetCrossRefMATH
go back to reference Liu C, Rubin DB (1994) The ECME algorithm: a simple extension of EM and ECM with faster monotone convergence. Biometrika 81:633–648MathSciNetCrossRefMATH Liu C, Rubin DB (1994) The ECME algorithm: a simple extension of EM and ECM with faster monotone convergence. Biometrika 81:633–648MathSciNetCrossRefMATH
go back to reference Louis T (1982) Finding the observed information matrix when using the em algorithm. J R Stat Soc Ser B 44:226–233MathSciNetMATH Louis T (1982) Finding the observed information matrix when using the em algorithm. J R Stat Soc Ser B 44:226–233MathSciNetMATH
go back to reference Massuia MB, Cabral CRB, Matos LA, Lachos VH (2015) Influence diagnostics for student-t censored linear regression models. Statistics 49:1074–1094MathSciNetCrossRefMATH Massuia MB, Cabral CRB, Matos LA, Lachos VH (2015) Influence diagnostics for student-t censored linear regression models. Statistics 49:1074–1094MathSciNetCrossRefMATH
go back to reference MATLAB (2016) version 9.0 (R2016a). The MathWorks Inc., Natick, Massachusetts MATLAB (2016) version 9.0 (R2016a). The MathWorks Inc., Natick, Massachusetts
go back to reference McLachlan GJ, Krishnan T (2008) The EM algorithm and extensions. John Wiley & Sons, New JerseyCrossRefMATH McLachlan GJ, Krishnan T (2008) The EM algorithm and extensions. John Wiley & Sons, New JerseyCrossRefMATH
go back to reference Melenberg B, Soest AV (1996) Parametric and semi-parametric modeling of vacation expenditures. J Appl Econ 11:59–76CrossRef Melenberg B, Soest AV (1996) Parametric and semi-parametric modeling of vacation expenditures. J Appl Econ 11:59–76CrossRef
go back to reference Mroz TA (1987) The sensitivity of an empirical model of married women’s hours of work to economic and statistical assumptions. Econometrica 55:765–799CrossRef Mroz TA (1987) The sensitivity of an empirical model of married women’s hours of work to economic and statistical assumptions. Econometrica 55:765–799CrossRef
go back to reference Raftery AE (1995) Bayesian model selection in social research. Sociol Methodol 25:111–163CrossRef Raftery AE (1995) Bayesian model selection in social research. Sociol Methodol 25:111–163CrossRef
go back to reference Tzortzis G, Likas A (2014) The MinMax k-Means clustering algorithm. Pattern Recognit 47:2505–2516CrossRef Tzortzis G, Likas A (2014) The MinMax k-Means clustering algorithm. Pattern Recognit 47:2505–2516CrossRef
go back to reference Vaida F, Liu L (2009) Fast implementation for normal mixed effects models with censored response. J Comput Graph Stat 18:797–817MathSciNetCrossRef Vaida F, Liu L (2009) Fast implementation for normal mixed effects models with censored response. J Comput Graph Stat 18:797–817MathSciNetCrossRef
go back to reference Vuong QH (1989) Likelihood ratio tests for model selection and non-nested hypotheses. Econom J Econom Soc 57:307–333MATH Vuong QH (1989) Likelihood ratio tests for model selection and non-nested hypotheses. Econom J Econom Soc 57:307–333MATH
go back to reference Witte A (1980) Estimating an economic model of crime with individual data. Q J Econ 94:57–84CrossRef Witte A (1980) Estimating an economic model of crime with individual data. Q J Econ 94:57–84CrossRef
go back to reference Zhang B (2003) Regression clustering. In: Proceedings of the third IEEE international conference on data mining, Melbourne Zhang B (2003) Regression clustering. In: Proceedings of the third IEEE international conference on data mining, Melbourne
go back to reference Zeller CB, Cabral CRB, Lachos VH (2016) Robust mixture regression modeling based on scale mixtures of skew-normal distributions. Test 25:375–396MathSciNetCrossRefMATH Zeller CB, Cabral CRB, Lachos VH (2016) Robust mixture regression modeling based on scale mixtures of skew-normal distributions. Test 25:375–396MathSciNetCrossRefMATH
Metadata
Title
Finite mixture of regression models for censored data based on scale mixtures of normal distributions
Authors
Camila Borelli Zeller
Celso Rômulo Barbosa Cabral
Víctor Hugo Lachos
Luis Benites
Publication date
24-08-2018
Publisher
Springer Berlin Heidelberg
Published in
Advances in Data Analysis and Classification / Issue 1/2019
Print ISSN: 1862-5347
Electronic ISSN: 1862-5355
DOI
https://doi.org/10.1007/s11634-018-0337-y

Other articles of this Issue 1/2019

Advances in Data Analysis and Classification 1/2019 Go to the issue

Premium Partner