Skip to main content
Erschienen in: Journal of Classification 3/2022

08.07.2022

Finite Mixture of Censored Linear Mixed Models for Irregularly Observed Longitudinal Data

verfasst von: Francisco H. C. de Alencar, Larissa A Matos, Víctor H. Lachos

Erschienen in: Journal of Classification | Ausgabe 3/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Linear mixed-effects models are commonly used when multiple correlated measurements are made for each unit of interest. Some inherent features of these data can make the analysis challenging, such as when the series of responses are repeatedly collected for each subject at irregular intervals over time or when the data are subject to some upper and/or lower detection limits of the experimental equipment. Moreover, if units are suspected of forming distinct clusters over time, i.e., heterogeneity, then the class of finite mixtures of linear mixed-effects models is required. This paper considers the problem of clustering heterogeneous longitudinal data in a mixture framework and proposes a finite mixture of multivariate normal linear mixed-effects model. This model allows us to accommodate more complex features of longitudinal data, such as measurement at irregular intervals over time and censored data. Furthermore, we consider a damped exponential correlation structure for the random error to deal with serial correlation among the within-subject errors. An efficient expectation-maximization algorithm is employed to compute the maximum likelihood estimation of the parameters. The algorithm has closed-form expressions at the E-step that rely on formulas for the mean and variance of the multivariate truncated normal distributions. Furthermore, a general information-based method to approximate the asymptotic covariance matrix is also presented. Results obtained from the analysis of both simulated and real HIV/AIDS datasets are reported to demonstrate the effectiveness of the proposed method.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
Zurück zum Zitat Acosta EP, Wu H, Hammer SM, Yu S, Kuritzkes DR, Walawander A, Eron JJ, Fichtenbaum CJ, Pettinelli C, Neath D, et al. (2004) Comparison of two indinavir/ritonavir regimens in the treatment of hiv-infected individuals. JAIDS Journal of Acquired Immune Deficiency Syndromes 37(3), 1358–1366.CrossRef Acosta EP, Wu H, Hammer SM, Yu S, Kuritzkes DR, Walawander A, Eron JJ, Fichtenbaum CJ, Pettinelli C, Neath D, et al. (2004) Comparison of two indinavir/ritonavir regimens in the treatment of hiv-infected individuals. JAIDS Journal of Acquired Immune Deficiency Syndromes 37(3), 1358–1366.CrossRef
Zurück zum Zitat Bai X, Chen K, Yao W (2016) Mixture of linear mixed models using multivariate t distribution. Journal of Statistical Computation and Simulation 86(4), 771–787.MathSciNetCrossRefMATH Bai X, Chen K, Yao W (2016) Mixture of linear mixed models using multivariate t distribution. Journal of Statistical Computation and Simulation 86(4), 771–787.MathSciNetCrossRefMATH
Zurück zum Zitat Basso RM, Lachos VH, Cabral CRB, Ghosh P (2010) Robust mixture modeling based on scale mixtures of skew-normal distributions. Computational Statistics & Data Analysis 54(12), 2926–2941.MathSciNetCrossRefMATH Basso RM, Lachos VH, Cabral CRB, Ghosh P (2010) Robust mixture modeling based on scale mixtures of skew-normal distributions. Computational Statistics & Data Analysis 54(12), 2926–2941.MathSciNetCrossRefMATH
Zurück zum Zitat Booth JG, Casella G, Hobert JP (2008) Clustering using objective functions and stochastic search. Journal of the Royal Statistical Society, Series B (Statistical Methodology) 70(1):119–139 Booth JG, Casella G, Hobert JP (2008) Clustering using objective functions and stochastic search. Journal of the Royal Statistical Society, Series B (Statistical Methodology) 70(1):119–139
Zurück zum Zitat Celeux G, Martin O, Lavergne C (2005) Mixture of linear mixed models for clustering gene expression profiles from repeated microarray experiments. Statistical Modelling 5(3), 243–267.MathSciNetCrossRefMATH Celeux G, Martin O, Lavergne C (2005) Mixture of linear mixed models for clustering gene expression profiles from repeated microarray experiments. Statistical Modelling 5(3), 243–267.MathSciNetCrossRefMATH
Zurück zum Zitat De la Cruz-Mesía R, Quintana FA, Marshall G (2008) Model-based clustering for longitudinal data. Computational Statistics & Data Analysis 52(3), 1441–1457.MathSciNetCrossRefMATH De la Cruz-Mesía R, Quintana FA, Marshall G (2008) Model-based clustering for longitudinal data. Computational Statistics & Data Analysis 52(3), 1441–1457.MathSciNetCrossRefMATH
Zurück zum Zitat Cuesta-Albertos JA, Gordaliza A, Matrán C, et al. (1997) Trimmed \(k\)-means: An attempt to robustify quantizers. The Annals of Statistics 25(2), 553–576. Cuesta-Albertos JA, Gordaliza A, Matrán C, et al. (1997) Trimmed \(k\)-means: An attempt to robustify quantizers. The Annals of Statistics 25(2), 553–576.
Zurück zum Zitat Dempster A, Laird N, Rubin D (1977) Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B, 39:1–38.MathSciNetMATH Dempster A, Laird N, Rubin D (1977) Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B, 39:1–38.MathSciNetMATH
Zurück zum Zitat Faria S, Soromenho G (2010) Fitting mixtures of linear regressions. Journal of Statistical Computation and Simulation 80(2), 201–225.MathSciNetCrossRefMATH Faria S, Soromenho G (2010) Fitting mixtures of linear regressions. Journal of Statistical Computation and Simulation 80(2), 201–225.MathSciNetCrossRefMATH
Zurück zum Zitat Fitzgerald AP, DeGruttola VG, Vaida F (2002) Modelling hiv viral rebound using non-linear mixed effects models. Statistics in Medicine 21(14), 2093–2108.CrossRef Fitzgerald AP, DeGruttola VG, Vaida F (2002) Modelling hiv viral rebound using non-linear mixed effects models. Statistics in Medicine 21(14), 2093–2108.CrossRef
Zurück zum Zitat Gaffney S, Smyth P (2003) Curve clustering with random effects regression mixtures. In: AISTATS Gaffney S, Smyth P (2003) Curve clustering with random effects regression mixtures. In: AISTATS
Zurück zum Zitat Gałecki AT, Burzykowski T (2013) Linear mixed-effects models using R: a step-by-step approach. New York: Springer.CrossRefMATH Gałecki AT, Burzykowski T (2013) Linear mixed-effects models using R: a step-by-step approach. New York: Springer.CrossRefMATH
Zurück zum Zitat Hathaway RJ, et al. (1985) A constrained formulation of maximum-likelihood estimation for normal mixture distributions. The Annals of Statistics 13(2), 795–800.MathSciNetCrossRefMATH Hathaway RJ, et al. (1985) A constrained formulation of maximum-likelihood estimation for normal mixture distributions. The Annals of Statistics 13(2), 795–800.MathSciNetCrossRefMATH
Zurück zum Zitat Hughes J (1999) Mixed effects models with censored data with application to HIV RNA levels. Biometrics 55:625–629.CrossRefMATH Hughes J (1999) Mixed effects models with censored data with application to HIV RNA levels. Biometrics 55:625–629.CrossRefMATH
Zurück zum Zitat Kiefer NM (1978) Discrete parameter variation: Efficient estimation of a switching regression model. Econometrica: Journal of the Econometric Society 46:427–434 Kiefer NM (1978) Discrete parameter variation: Efficient estimation of a switching regression model. Econometrica: Journal of the Econometric Society 46:427–434
Zurück zum Zitat Lachos VH, Ghosh P, Arellano-Valle RB (2010) Likelihood based inference for skew-normal independent linear mixed models. Statistica Sinica 20:303–322.MathSciNetMATH Lachos VH, Ghosh P, Arellano-Valle RB (2010) Likelihood based inference for skew-normal independent linear mixed models. Statistica Sinica 20:303–322.MathSciNetMATH
Zurück zum Zitat Lachos, V. H., Matos, A. L., Castro, L. M., & Chen, M. H. (2019) Flexible longitudinal linear mixed models for multiple censored responses data. Statistics in Medicine, 38(6), 1074–1102.MathSciNetCrossRef Lachos, V. H., Matos, A. L., Castro, L. M., & Chen, M. H. (2019) Flexible longitudinal linear mixed models for multiple censored responses data. Statistics in Medicine, 38(6), 1074–1102.MathSciNetCrossRef
Zurück zum Zitat Laird NM, Ware JH, et al. (1982) Random-effects models for longitudinal data. Biometrics 38(4), 963–974.CrossRefMATH Laird NM, Ware JH, et al. (1982) Random-effects models for longitudinal data. Biometrics 38(4), 963–974.CrossRefMATH
Zurück zum Zitat Lin TI (2010) Robust mixture modeling using multivariate skew t distributions. Statistics and Computing 20(3), 343–356.MathSciNetCrossRef Lin TI (2010) Robust mixture modeling using multivariate skew t distributions. Statistics and Computing 20(3), 343–356.MathSciNetCrossRef
Zurück zum Zitat Lin TI (2014) Learning from incomplete data via parameterized t mixture models through eigenvalue decomposition. Computational Statistics & Data Analysis 71:183–195.MathSciNetCrossRefMATH Lin TI (2014) Learning from incomplete data via parameterized t mixture models through eigenvalue decomposition. Computational Statistics & Data Analysis 71:183–195.MathSciNetCrossRefMATH
Zurück zum Zitat Lin TI, Wang WL (2013) Multivariate skew-normal linear mixed models for multi-outcome longitudinal data. Statistical Modelling 13(3), 199–221.MathSciNetCrossRefMATH Lin TI, Wang WL (2013) Multivariate skew-normal linear mixed models for multi-outcome longitudinal data. Statistical Modelling 13(3), 199–221.MathSciNetCrossRefMATH
Zurück zum Zitat Lin TI, Wang WL (2020) Multivariate-t linear mixed models with censored responses, intermittent missing values and heavy tails. Statistical Methods in Medical Research 29(5), 1288–1304.MathSciNetCrossRef Lin TI, Wang WL (2020) Multivariate-t linear mixed models with censored responses, intermittent missing values and heavy tails. Statistical Methods in Medical Research 29(5), 1288–1304.MathSciNetCrossRef
Zurück zum Zitat Lin TI, McLachlan GJ, Lee SX (2016) Extending mixtures of factor models using the restricted multivariate skew-normal distribution. Journal of Multivariate Analysis 143:398–413.MathSciNetCrossRefMATH Lin TI, McLachlan GJ, Lee SX (2016) Extending mixtures of factor models using the restricted multivariate skew-normal distribution. Journal of Multivariate Analysis 143:398–413.MathSciNetCrossRefMATH
Zurück zum Zitat Lin TI, Lachos VH, Wang WL (2018) Multivariate longitudinal data analysis with censored and intermittent missing responses. Statistics in Medicine 37(19), 2822–2835.MathSciNetCrossRef Lin TI, Lachos VH, Wang WL (2018) Multivariate longitudinal data analysis with censored and intermittent missing responses. Statistics in Medicine 37(19), 2822–2835.MathSciNetCrossRef
Zurück zum Zitat Louis TA (1982) Finding the observed information matrix when using the em algorithm. Journal of the Royal Statistical Society, Series B (Methodological) 44(2):226–233 Louis TA (1982) Finding the observed information matrix when using the em algorithm. Journal of the Royal Statistical Society, Series B (Methodological) 44(2):226–233
Zurück zum Zitat Matos LA, Lachos VH, Balakrishnan N, Labra FV (2013a) Influence diagnostics in linear and nonlinear mixed-effects models with censored data. Computational Statistics & Data Analysis 57(1):450–464 Matos LA, Lachos VH, Balakrishnan N, Labra FV (2013a) Influence diagnostics in linear and nonlinear mixed-effects models with censored data. Computational Statistics & Data Analysis 57(1):450–464
Zurück zum Zitat Matos LA, Prates MO, Chen MH, Lachos VH (2013b) Likelihood-based inference for mixed-effects models with censored response using the multivariate-t distribution. Statistica Sinica 23(3):1323–1345 Matos LA, Prates MO, Chen MH, Lachos VH (2013b) Likelihood-based inference for mixed-effects models with censored response using the multivariate-t distribution. Statistica Sinica 23(3):1323–1345
Zurück zum Zitat Matos LA, Bandyopadhyay D, Castro LM, Lachos VH (2015) Influence assessment in censored mixed-effects models using the multivariate Student t distribution. Journal of Multivariate Analysis 141:104–117.MathSciNetCrossRefMATH Matos LA, Bandyopadhyay D, Castro LM, Lachos VH (2015) Influence assessment in censored mixed-effects models using the multivariate Student t distribution. Journal of Multivariate Analysis 141:104–117.MathSciNetCrossRefMATH
Zurück zum Zitat Matos LA, Castro LM, Lachos VH (2016) Censored mixed-effects models for irregularly observed repeated measures with applications to HIV viral loads. TEST 25(4), 627–653.MathSciNetCrossRefMATH Matos LA, Castro LM, Lachos VH (2016) Censored mixed-effects models for irregularly observed repeated measures with applications to HIV viral loads. TEST 25(4), 627–653.MathSciNetCrossRefMATH
Zurück zum Zitat McLachlan GJ, Peel D (2000) Finite mixture models. John Wiley & Sons. McLachlan GJ, Peel D (2000) Finite mixture models. John Wiley & Sons.
Zurück zum Zitat Meng XL, Rubin DB (1993) Maximum likelihood estimation via the ECM algorithm: A general framework. Biometrika 80(2), 267–278.MathSciNetCrossRefMATH Meng XL, Rubin DB (1993) Maximum likelihood estimation via the ECM algorithm: A general framework. Biometrika 80(2), 267–278.MathSciNetCrossRefMATH
Zurück zum Zitat Muñoz A, Carey V, Schouten JP, Segal M, Rosner B (1992) A parametric family of correlation structures for the analysis of longitudinal data. Biometrics 48(3), 733–742.CrossRef Muñoz A, Carey V, Schouten JP, Segal M, Rosner B (1992) A parametric family of correlation structures for the analysis of longitudinal data. Biometrics 48(3), 733–742.CrossRef
Zurück zum Zitat Ng SK, McLachlan GJ, Wang K, Ben-Tovim Jones L, Ng SW (2006) A mixture model with random-effects components for clustering correlated gene-expression profiles. Bioinformatics 22(14), 1745–1752.CrossRef Ng SK, McLachlan GJ, Wang K, Ben-Tovim Jones L, Ng SW (2006) A mixture model with random-effects components for clustering correlated gene-expression profiles. Bioinformatics 22(14), 1745–1752.CrossRef
Zurück zum Zitat Rousseeuw PJ, Kaufman L (1990) Finding groups in data. Hoboken: Wiley Online Library.MATH Rousseeuw PJ, Kaufman L (1990) Finding groups in data. Hoboken: Wiley Online Library.MATH
Zurück zum Zitat Spiessens B, Verbeke G, Komárek A (2002) A sas-macro for the classification of longitudinal profiles using mixtures of normal distributions in nonlinear and generalised linear mixed models. Arbeitspapier, Katholische Universität Leuven: Biostatistisches Zentrum, Belgien. Spiessens B, Verbeke G, Komárek A (2002) A sas-macro for the classification of longitudinal profiles using mixtures of normal distributions in nonlinear and generalised linear mixed models. Arbeitspapier, Katholische Universität Leuven: Biostatistisches Zentrum, Belgien.
Zurück zum Zitat Tzortzis G, Likas A (2014) The minmax k-means clustering algorithm. Pattern Recognition 47(7), 2505–2516.CrossRef Tzortzis G, Likas A (2014) The minmax k-means clustering algorithm. Pattern Recognition 47(7), 2505–2516.CrossRef
Zurück zum Zitat Vaida F, Liu L (2009) Fast implementation for normal mixed effects models with censored response. Journal of Computational and Graphical Statistics 18(4), 797–817.MathSciNetCrossRef Vaida F, Liu L (2009) Fast implementation for normal mixed effects models with censored response. Journal of Computational and Graphical Statistics 18(4), 797–817.MathSciNetCrossRef
Zurück zum Zitat Vaida F, Fitzgerald A, DeGruttola V (2007) Efficient hybrid EM for linear and nonlinear mixed effects models with censored response. Computational Statistics & Data Analysis 51:5718–5730 Vaida F, Fitzgerald A, DeGruttola V (2007) Efficient hybrid EM for linear and nonlinear mixed effects models with censored response. Computational Statistics & Data Analysis 51:5718–5730
Zurück zum Zitat Verbeke G, Lesaffre E (1996) A linear mixed-effects model with heterogeneity in the random-effects population. Journal of the American Statistical Association 91(433), 217–221.CrossRefMATH Verbeke G, Lesaffre E (1996) A linear mixed-effects model with heterogeneity in the random-effects population. Journal of the American Statistical Association 91(433), 217–221.CrossRefMATH
Zurück zum Zitat Wang WL (2013) Multivariate t linear mixed models for irregularly observed multiple repeated measures with missing outcomes. Biometrical Journal 55(4), 554–571.MathSciNetCrossRefMATH Wang WL (2013) Multivariate t linear mixed models for irregularly observed multiple repeated measures with missing outcomes. Biometrical Journal 55(4), 554–571.MathSciNetCrossRefMATH
Zurück zum Zitat Wang WL (2017) Mixture of multivariate t linear mixed models for multi-outcome longitudinal data with heterogeneity. Statistica Sinica 27:733–760.MathSciNetMATH Wang WL (2017) Mixture of multivariate t linear mixed models for multi-outcome longitudinal data with heterogeneity. Statistica Sinica 27:733–760.MathSciNetMATH
Zurück zum Zitat Wang WL (2019) Mixture of multivariate t nonlinear mixed models for multiple longitudinal data with heterogeneity and missing values. TEST 28(1), 196–222.MathSciNetCrossRefMATH Wang WL (2019) Mixture of multivariate t nonlinear mixed models for multiple longitudinal data with heterogeneity and missing values. TEST 28(1), 196–222.MathSciNetCrossRefMATH
Zurück zum Zitat Wang WL, Fan TH (2011) Estimation in multivariate t linear mixed models for multiple longitudinal data. Statistica Sinica 21:1857–1880.MathSciNetCrossRefMATH Wang WL, Fan TH (2011) Estimation in multivariate t linear mixed models for multiple longitudinal data. Statistica Sinica 21:1857–1880.MathSciNetCrossRefMATH
Zurück zum Zitat Wang WL, Lin TI (2015) Robust model-based clustering via mixtures of skew-t distributions with missing information. Advances in Data Analysis and Classification 9(4), 423–445.MathSciNetCrossRefMATH Wang WL, Lin TI (2015) Robust model-based clustering via mixtures of skew-t distributions with missing information. Advances in Data Analysis and Classification 9(4), 423–445.MathSciNetCrossRefMATH
Zurück zum Zitat Wu L (2009) Mixed effects models for complex data. Chapman and Hall/CRC Wu L (2009) Mixed effects models for complex data. Chapman and Hall/CRC
Zurück zum Zitat Yang YC, Lin TI, Castro LM, Wang WL (2020) Extending finite mixtures of t linear mixed-effects models with concomitant covariates. Computational Statistics & Data Analysis 148:106961, 1–20.MathSciNetMATH Yang YC, Lin TI, Castro LM, Wang WL (2020) Extending finite mixtures of t linear mixed-effects models with concomitant covariates. Computational Statistics & Data Analysis 148:106961, 1–20.MathSciNetMATH
Zurück zum Zitat Zeller CB, Cabral CR, Lachos VH (2016) Robust mixture regression modeling based on scale mixtures of skew-normal distributions. Test 25(2), 375–396.MathSciNetCrossRefMATH Zeller CB, Cabral CR, Lachos VH (2016) Robust mixture regression modeling based on scale mixtures of skew-normal distributions. Test 25(2), 375–396.MathSciNetCrossRefMATH
Zurück zum Zitat Zhang B (2003) Regression clustering. In: Data Mining, 2003. ICDM 2003. Third IEEE International Conference on, IEEE, pp 451–458 Zhang B (2003) Regression clustering. In: Data Mining, 2003. ICDM 2003. Third IEEE International Conference on, IEEE, pp 451–458
Metadaten
Titel
Finite Mixture of Censored Linear Mixed Models for Irregularly Observed Longitudinal Data
verfasst von
Francisco H. C. de Alencar
Larissa A Matos
Víctor H. Lachos
Publikationsdatum
08.07.2022
Verlag
Springer US
Erschienen in
Journal of Classification / Ausgabe 3/2022
Print ISSN: 0176-4268
Elektronische ISSN: 1432-1343
DOI
https://doi.org/10.1007/s00357-022-09415-x

Weitere Artikel der Ausgabe 3/2022

Journal of Classification 3/2022 Zur Ausgabe

Premium Partner