
2022 | OriginalPaper | Book chapter

Robust Feature Screening for Ultrahigh-Dimensional Censored Data Subject to Measurement Error

Authors: Li-Pang Chen, Grace Y. Yi

Published in: Advances and Innovations in Statistics and Data Science

Publisher: Springer International Publishing


Abstract

Feature screening is commonly used to reduce ultrahigh-dimensional data before a formal data analysis is conducted. Although various feature screening methods have been developed in the literature, research gaps remain. Existing methods usually assume, implicitly, that the data are accurately measured, an assumption that is frequently violated in applications. In this chapter, we consider error-prone ultrahigh-dimensional survival data and propose a robust feature screening method. We develop an iterative algorithm to improve performance in retaining all informative covariates. Theoretical results are established for the proposed method. Simulation studies are reported to assess its performance, and the method is applied to a mantle cell lymphoma microarray dataset.
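
The abstract does not spell out the screening statistic, so the following is only a generic sketch of what marginal feature screening looks like in its simplest form: score each covariate by the absolute sample Pearson correlation with the response and keep the top `d`. It is not the method proposed in the chapter, and it deliberately ignores both censoring and measurement error, which are the complications the chapter addresses. The function names `pearson` and `screen`, the cutoff `d`, and the simulated data are all hypothetical choices made for illustration.

```python
import math
import random


def pearson(x, y):
    """Sample Pearson correlation of two equal-length sequences."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0  # a constant column carries no marginal signal
    return sxy / math.sqrt(sxx * syy)


def screen(columns, y, d):
    """Keep the indices of the d covariates with the largest |correlation| with y."""
    scores = [abs(pearson(col, y)) for col in columns]
    ranked = sorted(range(len(columns)), key=lambda j: scores[j], reverse=True)
    return sorted(ranked[:d])


# Toy data (assumption): p = 200 covariates, n = 100 subjects,
# and only covariates 0 and 1 actually drive the response.
rng = random.Random(0)
n, p = 100, 200
columns = [[rng.gauss(0, 1) for _ in range(n)] for _ in range(p)]
y = [3 * columns[0][i] - 3 * columns[1][i] + rng.gauss(0, 0.5) for i in range(n)]

kept = screen(columns, y, d=10)
print(kept)
```

The point of the sketch is the overall shape of screening: one cheap marginal statistic per covariate, followed by a hard cutoff, so that a formal analysis only has to deal with `d` covariates rather than `p`. A robust method for censored, error-prone data would replace the plain correlation score with a statistic designed for that setting.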


Metadata
Title
Robust Feature Screening for Ultrahigh-Dimensional Censored Data Subject to Measurement Error
Authors
Li-Pang Chen
Grace Y. Yi
Copyright year
2022
DOI
https://doi.org/10.1007/978-3-031-08329-7_2
