Skip to main content
Top
Published in: Evolutionary Intelligence 4/2022

24-11-2020 | Special Issue

Variable selection for generalized partially linear models with longitudinal data

Authors: Jinghua Zhang, Liugen Xue

Published in: Evolutionary Intelligence | Issue 4/2022

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Variables selection and parameter estimation are of great significance in all regression analysis. A variety of approaches have been proposed to tackle this problem. Among those, the penalty-based shrinkage approach has been most popular for the ability to carry out the variable selection and parameter estimation simultaneously. However, not much work is available on the variable selection for the generalized partially models (GPLMs) with longitudinal data. In this paper, we proposed a variable selection procedure for GPLMs with longitudinal data. The inference is based on the SCAD-penalized quadratic inference functions, which is obtained after the B-spline approximating to non-parametric function in the model. The proposed approach efficiently utilized the within-cluster correlation information, which can improve estimating efficiency. The proposed approach also has the virtue of low computational cost. With the tuning parameter chosen by BIC, the correct model is identified with probability tends to 1. The resulted estimator of the parametric component is asymptotic to a normal distribution, and that of the non-parametric function achieves the optimal convergence rate. The performance of the proposed methods is evaluated through extensive simulation studies. A real data analysis shows that the proposed approach succeeds in excluding the insignificant variable.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Literature
1.
go back to reference Breiman L (1995) Better subset selection using nonnegative garrote. Techonometrics 37:373–384MATHCrossRef Breiman L (1995) Better subset selection using nonnegative garrote. Techonometrics 37:373–384MATHCrossRef
2.
go back to reference Tibshirani R (1996) Regression shrinkage and selection via the LASSO. J R Stat Soc Ser B 58:267–288MathSciNetMATH Tibshirani R (1996) Regression shrinkage and selection via the LASSO. J R Stat Soc Ser B 58:267–288MathSciNetMATH
3.
go back to reference Fu WJ (1998) Penalized regression: the bridge versus the LASSO. J Comput Graph Stat 7:397–416MathSciNet Fu WJ (1998) Penalized regression: the bridge versus the LASSO. J Comput Graph Stat 7:397–416MathSciNet
4.
go back to reference Fan J, Li R (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. J Am Stat Assoc 96:1348–1360MathSciNetMATHCrossRef Fan J, Li R (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. J Am Stat Assoc 96:1348–1360MathSciNetMATHCrossRef
6.
go back to reference Wang L, Li H, Huang JZ (2008) Variable selection in non-parametric varying coefficient models for analysis of repeated measurements. J Am Stat Assoc 103:1556–1569MATHCrossRef Wang L, Li H, Huang JZ (2008) Variable selection in non-parametric varying coefficient models for analysis of repeated measurements. J Am Stat Assoc 103:1556–1569MATHCrossRef
7.
go back to reference Xue L, Qu A, Zhou J (2010) Consistent model selection in marginal generalized additive models for correlated data. J Am Stat Assoc 105:1518–1530MATHCrossRef Xue L, Qu A, Zhou J (2010) Consistent model selection in marginal generalized additive models for correlated data. J Am Stat Assoc 105:1518–1530MATHCrossRef
8.
go back to reference Tian RQ, Xue LG, Liu CL (2014) Penalized quadratic functions for semiparametric varying coefficient partially linear models with longitudinal data. J Multivar Anal 132:94–110MathSciNetMATHCrossRef Tian RQ, Xue LG, Liu CL (2014) Penalized quadratic functions for semiparametric varying coefficient partially linear models with longitudinal data. J Multivar Anal 132:94–110MathSciNetMATHCrossRef
9.
go back to reference Fan J, Li R (2004) New estimation and model selection procedure for semiparametric modeling in longitudinal data analysis. J Am Stat Assoc 99:710–723MathSciNetMATHCrossRef Fan J, Li R (2004) New estimation and model selection procedure for semiparametric modeling in longitudinal data analysis. J Am Stat Assoc 99:710–723MathSciNetMATHCrossRef
11.
go back to reference Zhao PX, Xue LG (2009) Variable selection for semi-parametric varying coefficient partially linear models. Stat Probab Lett 79:2148–2157MATHCrossRef Zhao PX, Xue LG (2009) Variable selection for semi-parametric varying coefficient partially linear models. Stat Probab Lett 79:2148–2157MATHCrossRef
12.
go back to reference Wang L, Xue L, Qu A, Liang H (2014) Estimation and model selection in generalized additive partial linear models for correlated data with diverging number of covariates. Ann Stat 42:592–694MathSciNetMATHCrossRef Wang L, Xue L, Qu A, Liang H (2014) Estimation and model selection in generalized additive partial linear models for correlated data with diverging number of covariates. Ann Stat 42:592–694MathSciNetMATHCrossRef
14.
go back to reference Qu A, Lindsay BG, Li B (2000) Improving generalized estimating equations using quadratic inference functions. Biometrika 87:823–836MathSciNetMATHCrossRef Qu A, Lindsay BG, Li B (2000) Improving generalized estimating equations using quadratic inference functions. Biometrika 87:823–836MathSciNetMATHCrossRef
16.
17.
go back to reference Bai Y, Zhu ZY, Fung WK (2008) Partially linear models for longitudinal data based on quadratic inference functions. Scand J Stat 35:104–118MATHCrossRef Bai Y, Zhu ZY, Fung WK (2008) Partially linear models for longitudinal data based on quadratic inference functions. Scand J Stat 35:104–118MATHCrossRef
18.
go back to reference Zhang JH, Xue LG (2017) Quadratic inference functions for generalized partially models with longitudinal data. Chin J Appl Probab Stat 33:417–432MathSciNetMATH Zhang JH, Xue LG (2017) Quadratic inference functions for generalized partially models with longitudinal data. Chin J Appl Probab Stat 33:417–432MathSciNetMATH
19.
go back to reference Bai Y, Fung WK, Zhu ZY (2009) Penalized quadratic inference functions for single-index models with longitudinal data. J Multivar Anal 100:152–161MathSciNetMATHCrossRef Bai Y, Fung WK, Zhu ZY (2009) Penalized quadratic inference functions for single-index models with longitudinal data. J Multivar Anal 100:152–161MathSciNetMATHCrossRef
20.
go back to reference Cho H, Qu A (2013) Model selection for correlated data with diverging number of parameters. Stat Sin 23:901–927MathSciNetMATH Cho H, Qu A (2013) Model selection for correlated data with diverging number of parameters. Stat Sin 23:901–927MathSciNetMATH
21.
go back to reference Lin XH, Carroll RJ (2001) Non-parametric function estimation for clustered data when the predictor is measured without/with error. J Am Stat Assoc 95:520–534MATHCrossRef Lin XH, Carroll RJ (2001) Non-parametric function estimation for clustered data when the predictor is measured without/with error. J Am Stat Assoc 95:520–534MATHCrossRef
22.
go back to reference Lin XH, Carroll RJ (2001) Semiparametric regression for clustered data with generalized estimating equations. J Am Stat Assoc 96:1045–1056MathSciNetMATHCrossRef Lin XH, Carroll RJ (2001) Semiparametric regression for clustered data with generalized estimating equations. J Am Stat Assoc 96:1045–1056MathSciNetMATHCrossRef
23.
go back to reference He XM, Fung WK, Zhu ZY (2005) Robust estimation in a generalized partially linear model for cluster data. J Am Stat Assoc 34:391–410 He XM, Fung WK, Zhu ZY (2005) Robust estimation in a generalized partially linear model for cluster data. J Am Stat Assoc 34:391–410
24.
go back to reference Qin GY, Bai Y, Zhu ZY (2012) Robust empirical likelihood inference for generalized partially linear models with longitudinal data. J Multivar Anal 105:32–44MATHCrossRef Qin GY, Bai Y, Zhu ZY (2012) Robust empirical likelihood inference for generalized partially linear models with longitudinal data. J Multivar Anal 105:32–44MATHCrossRef
25.
go back to reference Qu A, Song XK (2004) Assessing robustness of generalized estimating equations and quadratic inference functions. Biometrika 91:447–459MathSciNetMATHCrossRef Qu A, Song XK (2004) Assessing robustness of generalized estimating equations and quadratic inference functions. Biometrika 91:447–459MathSciNetMATHCrossRef
26.
27.
28.
go back to reference Wang HS, Xia YC (2009) Shrinkage estimator of the varying coefficient model. J Am Stat Assoc 104:747–757MATHCrossRef Wang HS, Xia YC (2009) Shrinkage estimator of the varying coefficient model. J Am Stat Assoc 104:747–757MATHCrossRef
30.
go back to reference Oman SD (2009) Easily simulated multivariate binary distributions with given positive and negative correlations. Comput Stat Data Anal 53(4):999–1005MathSciNetMATHCrossRef Oman SD (2009) Easily simulated multivariate binary distributions with given positive and negative correlations. Comput Stat Data Anal 53(4):999–1005MathSciNetMATHCrossRef
31.
go back to reference Zeger SL, Karim MR (2001) Generalized linear models with random effects: a Gibbs sampling approach. J Am Stat Assoc 86:79–86MathSciNetCrossRef Zeger SL, Karim MR (2001) Generalized linear models with random effects: a Gibbs sampling approach. J Am Stat Assoc 86:79–86MathSciNetCrossRef
32.
go back to reference Diggle PJ, Liang KY, Zeger SL (1994) Analysis of longitudinal data. Oxford University Press, OxfordMATH Diggle PJ, Liang KY, Zeger SL (1994) Analysis of longitudinal data. Oxford University Press, OxfordMATH
33.
go back to reference Chang XJ, Ma ZG, Yang Y, Zeng ZQ, Hauptmann AG (2017) Bi-level semantic representation analysis for multimedia event detection. IEEE Trans on Cybern 47(5):1180–1197CrossRef Chang XJ, Ma ZG, Yang Y, Zeng ZQ, Hauptmann AG (2017) Bi-level semantic representation analysis for multimedia event detection. IEEE Trans on Cybern 47(5):1180–1197CrossRef
34.
go back to reference Galiautdinov R (2020) The math model of drone behavior in the hive, providing algorithmic architecture. Int J Softw Sci Comput Intell 12(2):15–33CrossRef Galiautdinov R (2020) The math model of drone behavior in the hive, providing algorithmic architecture. Int J Softw Sci Comput Intell 12(2):15–33CrossRef
35.
go back to reference Zhang L (2019) Evaluating the effects of size and precision of training data on ANN training performance for the prediction of chaotic time series patterns. Int J Softw Sci Comput Intell 11(1):16–30CrossRef Zhang L (2019) Evaluating the effects of size and precision of training data on ANN training performance for the prediction of chaotic time series patterns. Int J Softw Sci Comput Intell 11(1):16–30CrossRef
Metadata
Title
Variable selection for generalized partially linear models with longitudinal data
Authors
Jinghua Zhang
Liugen Xue
Publication date
24-11-2020
Publisher
Springer Berlin Heidelberg
Published in
Evolutionary Intelligence / Issue 4/2022
Print ISSN: 1864-5909
Electronic ISSN: 1864-5917
DOI
https://doi.org/10.1007/s12065-020-00521-6

Other articles of this Issue 4/2022

Evolutionary Intelligence 4/2022 Go to the issue

Premium Partner