Skip to main content
Top
Published in: Lifetime Data Analysis 4/2016

01-10-2016

Low-dimensional confounder adjustment and high-dimensional penalized estimation for survival analysis

Authors: Xiaochao Xia, Binyan Jiang, Jialiang Li, Wenyang Zhang

Published in: Lifetime Data Analysis | Issue 4/2016

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

High-throughput profiling is now common in biomedical research. In this paper we consider the layout of an etiology study composed of a failure time response, and gene expression measurements. In current practice, a widely adopted approach is to select genes according to a preliminary marginal screening and a follow-up penalized regression for model building. Confounders, including for example clinical risk factors and environmental exposures, usually exist and need to be properly accounted for. We propose covariate-adjusted screening and variable selection procedures under the accelerated failure time model. While penalizing the high-dimensional coefficients to achieve parsimonious model forms, our procedure also properly adjust the low-dimensional confounder effects to achieve more accurate estimation of regression coefficients. We establish the asymptotic properties of our proposed methods and carry out simulation studies to assess the finite sample performance. Our methods are illustrated with a real gene expression data analysis where proper adjustment of confounders produces more meaningful results.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Literature
go back to reference Bradic J, Fan J, Jiang J (2011) Regularization for Cox’s proportional hazards model with NP-dimensionality. Ann Stat 39:3092–3120MathSciNetCrossRefMATH Bradic J, Fan J, Jiang J (2011) Regularization for Cox’s proportional hazards model with NP-dimensionality. Ann Stat 39:3092–3120MathSciNetCrossRefMATH
go back to reference Chen HY, Yu SL et al (2007) A five-gene signature and clinical outcome in non-small-cell lung cancer. N Engl J Med 356:11–20CrossRef Chen HY, Yu SL et al (2007) A five-gene signature and clinical outcome in non-small-cell lung cancer. N Engl J Med 356:11–20CrossRef
go back to reference Cheng MY, Zhang W, Chen LH (2009) Statistical estimation in generalized multiparameter likelihood models. J Am Stat Assoc 104:1179–1191MathSciNetCrossRefMATH Cheng MY, Zhang W, Chen LH (2009) Statistical estimation in generalized multiparameter likelihood models. J Am Stat Assoc 104:1179–1191MathSciNetCrossRefMATH
go back to reference Cheng MY, Honda T, Li J, Peng H (2014) Nonparametric independence screening and structure identification for ultra-high dimensional longitudinal/clustered data. Ann Stat 42:1819–1849MathSciNetCrossRefMATH Cheng MY, Honda T, Li J, Peng H (2014) Nonparametric independence screening and structure identification for ultra-high dimensional longitudinal/clustered data. Ann Stat 42:1819–1849MathSciNetCrossRefMATH
go back to reference Cheng MY, Honda T, Zhang JT (2015) Forward variable selection for sparse ultra-high dimensional varying coefficient models. J Am Stat Assoc. arXiv:1410.6556 Cheng MY, Honda T, Zhang JT (2015) Forward variable selection for sparse ultra-high dimensional varying coefficient models. J Am Stat Assoc. arXiv:​1410.​6556
go back to reference Fan J, Feng Y, Song R (2001) Nonparametric independence screening in sparse ultra-high dimensional additive models. J Am Stat Assoc 106:544–555MathSciNetCrossRefMATH Fan J, Feng Y, Song R (2001) Nonparametric independence screening in sparse ultra-high dimensional additive models. J Am Stat Assoc 106:544–555MathSciNetCrossRefMATH
go back to reference Fan J, Li R (2001) Variable selection via noncancave penalized likelihood and its oracle properties. J Am Stat Assoc 96:1348–1360MathSciNetCrossRefMATH Fan J, Li R (2001) Variable selection via noncancave penalized likelihood and its oracle properties. J Am Stat Assoc 96:1348–1360MathSciNetCrossRefMATH
go back to reference Fan J, Lv J (2008) Sure independence screening for ultrahigh dimensional feature space. J R Stat Soc Ser B 70:849–911MathSciNetCrossRef Fan J, Lv J (2008) Sure independence screening for ultrahigh dimensional feature space. J R Stat Soc Ser B 70:849–911MathSciNetCrossRef
go back to reference Fan J, Samworth R, Wu Y (2009) Ultrahigh dimensional feature selection: beyond the linear model. J Mach Learn Res 10:2013–2038MathSciNetMATH Fan J, Samworth R, Wu Y (2009) Ultrahigh dimensional feature selection: beyond the linear model. J Mach Learn Res 10:2013–2038MathSciNetMATH
go back to reference Gordis L (2008) Epidemiology, 4th edn. Saunders, Philadelphia Gordis L (2008) Epidemiology, 4th edn. Saunders, Philadelphia
go back to reference Hu J, Chai H (2013) Adjusted regularized estimation in the accelerated failure time model with high dimensional covariates. J Multivar Anal 122:96–114MathSciNetCrossRefMATH Hu J, Chai H (2013) Adjusted regularized estimation in the accelerated failure time model with high dimensional covariates. J Multivar Anal 122:96–114MathSciNetCrossRefMATH
go back to reference Huang J, Ma S (2010) Variable selection in the accelerated failure time model via the bridge method. Lifetime Data Analysis 16:176–195MathSciNetCrossRefMATH Huang J, Ma S (2010) Variable selection in the accelerated failure time model via the bridge method. Lifetime Data Analysis 16:176–195MathSciNetCrossRefMATH
go back to reference Huang J, Ma S, Xie H (2006) Regularized estimation in the accelerated failure time model with high dimensional covariate. Biometrics 62:813–820MathSciNetCrossRefMATH Huang J, Ma S, Xie H (2006) Regularized estimation in the accelerated failure time model with high dimensional covariate. Biometrics 62:813–820MathSciNetCrossRefMATH
go back to reference Huang JZ, Wu CO, Zhou L (2004) Polynomial spline estimation and inference for varying-coefficient models with longitudinal data. Statistica Sinica 14:763–788MathSciNetMATH Huang JZ, Wu CO, Zhou L (2004) Polynomial spline estimation and inference for varying-coefficient models with longitudinal data. Statistica Sinica 14:763–788MathSciNetMATH
go back to reference Johnson BA, Lin DY, Zeng D (2008) Penalized estimating functions and variable selection in semiparametric regression models. J Am Stat Assoc 103:672–680MathSciNetCrossRefMATH Johnson BA, Lin DY, Zeng D (2008) Penalized estimating functions and variable selection in semiparametric regression models. J Am Stat Assoc 103:672–680MathSciNetCrossRefMATH
go back to reference Li J, Ma S (2010) Interval-censored data with repeated measurements and a cured subgroup. Appl Stat 59:693–705MathSciNet Li J, Ma S (2010) Interval-censored data with repeated measurements and a cured subgroup. Appl Stat 59:693–705MathSciNet
go back to reference Lian H, Li J, Tang X (2014) SCAD-penalized regression in additive partially linear proportional hazards models with an ultra-high-dimensional linear part. J Multivar Anal 125:50–64MathSciNetCrossRefMATH Lian H, Li J, Tang X (2014) SCAD-penalized regression in additive partially linear proportional hazards models with an ultra-high-dimensional linear part. J Multivar Anal 125:50–64MathSciNetCrossRefMATH
go back to reference Liu X, Wang L, Liang H (2011) Estimation and variable selection for semiparametric additive partially linear models. Statistica Sinica 21:1225–1248MathSciNetCrossRefMATH Liu X, Wang L, Liang H (2011) Estimation and variable selection for semiparametric additive partially linear models. Statistica Sinica 21:1225–1248MathSciNetCrossRefMATH
go back to reference Lu Y, Lemon W et al (2006) A gene expression signature predicts survival of subjects with state i non-small cell lung cancer. PLoS Med 3:2229–2243CrossRef Lu Y, Lemon W et al (2006) A gene expression signature predicts survival of subjects with state i non-small cell lung cancer. PLoS Med 3:2229–2243CrossRef
go back to reference Shao F, Li J, Ma S, Lee M-LT (2014) Semiparametric varying-coefficient model for interval censored data with a cured proportion. Stat Med 33:1700–1712MathSciNetCrossRef Shao F, Li J, Ma S, Lee M-LT (2014) Semiparametric varying-coefficient model for interval censored data with a cured proportion. Stat Med 33:1700–1712MathSciNetCrossRef
go back to reference Shedden K, Taylor JM et al (2008) Gene expression-based survival prediction in lung adenocarcinoma: a multi-site, blinded validation study. Nat Med 14:822–827CrossRef Shedden K, Taylor JM et al (2008) Gene expression-based survival prediction in lung adenocarcinoma: a multi-site, blinded validation study. Nat Med 14:822–827CrossRef
go back to reference Stute W (1996) Distributional convergence under random censorship when covariables are present. Scand J Stat 23:461–471MathSciNetMATH Stute W (1996) Distributional convergence under random censorship when covariables are present. Scand J Stat 23:461–471MathSciNetMATH
go back to reference Wang H, Li B, Leng C (2009) Shrinkage tuning parameter selection with a diverging number of parameters. J R Stat Soc Ser B 71:671–683MathSciNetCrossRefMATH Wang H, Li B, Leng C (2009) Shrinkage tuning parameter selection with a diverging number of parameters. J R Stat Soc Ser B 71:671–683MathSciNetCrossRefMATH
go back to reference Xie Y, Xiao G et al (2011) Robust gene expression signature from formalin-fixed paraffin- embedded samples predicts prognosis of non-small-cell lung cancer patients. Clin Cancer Res 17:5705–5714CrossRef Xie Y, Xiao G et al (2011) Robust gene expression signature from formalin-fixed paraffin- embedded samples predicts prognosis of non-small-cell lung cancer patients. Clin Cancer Res 17:5705–5714CrossRef
Metadata
Title
Low-dimensional confounder adjustment and high-dimensional penalized estimation for survival analysis
Authors
Xiaochao Xia
Binyan Jiang
Jialiang Li
Wenyang Zhang
Publication date
01-10-2016
Publisher
Springer US
Published in
Lifetime Data Analysis / Issue 4/2016
Print ISSN: 1380-7870
Electronic ISSN: 1572-9249
DOI
https://doi.org/10.1007/s10985-015-9350-z

Other articles of this Issue 4/2016

Lifetime Data Analysis 4/2016 Go to the issue