Skip to main content

2018 | OriginalPaper | Buchkapitel

Sampling Defective Pathways in Phenotype Prediction Problems via the Holdout Sampler

verfasst von : Juan Luis Fernández-Martínez, Ana Cernea, Enrique J. deAndrés-Galiana, Francisco Javier Fernández-Ovies, Zulima Fernández-Muñiz, Oscar Alvarez-Machancoses, Leorey Saligan, Stephen T. Sonis

Erschienen in: Bioinformatics and Biomedical Engineering

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we introduce the holdout sampler to find the defective pathways in high underdetermined phenotype prediction problems. This sampling algorithm is inspired by the bootstrapping procedure used in regression analysis to established confidence bounds. We show that working with partial information (data bags) serves to sample the linear uncertainty region in a simple regression problem, mainly along the axis of greatest uncertainty that corresponds to the smallest singular value of the system matrix. This procedure applied to a phenotype prediction problem, considered as a generalized prediction problem between the set of genetic signatures and the set of classes in which the phenotype is divided, serves to unravel the ensemble of altered pathways in the transcriptome that are involved in the disease development. The algorithm looks for the minimum-scale genetic signature in each random holdout and the likelihood (predictive accuracy) is established using the validation dataset via a nearest-neighbor classifier. The posterior analysis serves to identify the header genes that most-frequently appear in the different hold-outs and are therefore robust to a partial lack of samples. These genes are used to establish the genetic pathways and the biological processes involved in the disease progression. This algorithm is much faster, robust and simpler than Bayesian Networks. We show its application to a microarray dataset concerning a type of breast cancers with poor prognoses (TNBC).

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat de Andrés Galiana, E.J., Fernández-Martínez, J.L., Sonis, S.: Design of biomedical robots for phenotype prediction problems. J. Comput. Biol. 23(8), 678–92 (2016)CrossRef de Andrés Galiana, E.J., Fernández-Martínez, J.L., Sonis, S.: Design of biomedical robots for phenotype prediction problems. J. Comput. Biol. 23(8), 678–92 (2016)CrossRef
2.
Zurück zum Zitat Efron, B., Tibshirani, R.: An Introduction to the Bootstrap. Chapman & Hall/CRC, Boca Raton (1993). ISBN 0-412-04231-2CrossRefMATH Efron, B., Tibshirani, R.: An Introduction to the Bootstrap. Chapman & Hall/CRC, Boca Raton (1993). ISBN 0-412-04231-2CrossRefMATH
4.
Zurück zum Zitat Fernández-Martínez, J.L., Pallero, J.L.G., Fernández-Muñiz, Z., Pedruelo-González, L.M.: From Bayes to Tarantola: new insights to understand uncertainty in inverse problems. J. Appl. Geophys. 98, 62–72 (2013)CrossRef Fernández-Martínez, J.L., Pallero, J.L.G., Fernández-Muñiz, Z., Pedruelo-González, L.M.: From Bayes to Tarantola: new insights to understand uncertainty in inverse problems. J. Appl. Geophys. 98, 62–72 (2013)CrossRef
5.
Zurück zum Zitat de Andrés-Galiana, E.J., Fernández-Martínez, J.L., Sonis, S.: Sensitivity analysis of gene ranking methods in phenotype prediction. J. Biomed. Inform. 64, 255–264 (2016)CrossRef de Andrés-Galiana, E.J., Fernández-Martínez, J.L., Sonis, S.: Sensitivity analysis of gene ranking methods in phenotype prediction. J. Biomed. Inform. 64, 255–264 (2016)CrossRef
6.
Zurück zum Zitat Saeys, Y., Inza, I., Larrañaga, P.: A review of feature selection techniques in bioinformatics. Bioinformatics 23(19), 2507–2517 (2007)CrossRef Saeys, Y., Inza, I., Larrañaga, P.: A review of feature selection techniques in bioinformatics. Bioinformatics 23(19), 2507–2517 (2007)CrossRef
7.
Zurück zum Zitat Jiang, X., Barmada, M.M., Visweswaran, S.: Identifying genetic interactions in genome-wide data using Bayesian networks. Genet. Epidemiol. 34(6), 575–581 (2010)CrossRef Jiang, X., Barmada, M.M., Visweswaran, S.: Identifying genetic interactions in genome-wide data using Bayesian networks. Genet. Epidemiol. 34(6), 575–581 (2010)CrossRef
8.
Zurück zum Zitat Jézéquel, P., Loussouarn, D., Guérin-Charbonnel, C., Campion, L., et al.: Gene-expression molecular subtyping of triple-negative breast cancer tumours: importance of immune response. Breast Cancer Res. 20(17), 43 (2015)CrossRef Jézéquel, P., Loussouarn, D., Guérin-Charbonnel, C., Campion, L., et al.: Gene-expression molecular subtyping of triple-negative breast cancer tumours: importance of immune response. Breast Cancer Res. 20(17), 43 (2015)CrossRef
9.
Zurück zum Zitat Saligan, L.N., Fernández-Martínez, J.L., de Andrés Galiana, E.J., Sonis, S.: Supervised classification by filter methods and recursive feature elimination predicts risk of radiotherapy-related fatigue in patients with prostate cancer. Cancer Inform. 13(141–152), 2014 (2014) Saligan, L.N., Fernández-Martínez, J.L., de Andrés Galiana, E.J., Sonis, S.: Supervised classification by filter methods and recursive feature elimination predicts risk of radiotherapy-related fatigue in patients with prostate cancer. Cancer Inform. 13(141–152), 2014 (2014)
10.
Zurück zum Zitat Fernández-Martínez, J.L., de Andrés-Galiana, E.J., Sonis, S.: Genomic data integration in chronic lymphocytic leukemia. J. Gene Med. 19, 1–2 (2017)CrossRef Fernández-Martínez, J.L., de Andrés-Galiana, E.J., Sonis, S.: Genomic data integration in chronic lymphocytic leukemia. J. Gene Med. 19, 1–2 (2017)CrossRef
11.
Zurück zum Zitat Stelzer, G., Inger, A., Olender, T., Iny-Stein, T., Dalah, I., Harel, A., et al.: GeneDecks: paralog hunting and gene-set distillation with GeneCards annotation. OMICS 13(6), 477 (2009)CrossRef Stelzer, G., Inger, A., Olender, T., Iny-Stein, T., Dalah, I., Harel, A., et al.: GeneDecks: paralog hunting and gene-set distillation with GeneCards annotation. OMICS 13(6), 477 (2009)CrossRef
12.
Zurück zum Zitat Jeon, M., Han, J., Nam, S.J., Lee, J.E., Kim, S.: STC-1 expression is upregulated through an Akt/NF-κB-dependent pathway in triple-negative breast cancer cells. Oncol. Rep. 36(3), 1717–1722 (2016). Epub 25 July 2016CrossRef Jeon, M., Han, J., Nam, S.J., Lee, J.E., Kim, S.: STC-1 expression is upregulated through an Akt/NF-κB-dependent pathway in triple-negative breast cancer cells. Oncol. Rep. 36(3), 1717–1722 (2016). Epub 25 July 2016CrossRef
13.
Zurück zum Zitat Han, J., Jeon, M., Shin, I., Kim, S.: Elevated STC-1 augments the invasiveness of triple-negative breast cancer cells through activation of the JNK/c-Jun signaling pathway. Oncol. Rep. 36(3), 1764–71 (2016). Epub 26 July 2016CrossRef Han, J., Jeon, M., Shin, I., Kim, S.: Elevated STC-1 augments the invasiveness of triple-negative breast cancer cells through activation of the JNK/c-Jun signaling pathway. Oncol. Rep. 36(3), 1764–71 (2016). Epub 26 July 2016CrossRef
18.
Zurück zum Zitat Cernea, A., Fernández-Martínez, J.L., de Andrés-Galiana, E.J., Fernández-Ovies, F.J., Fernández-Muñiz, Z., Álvarez-Machancoses, O., Saligan, L., Sonis, S.: Sampling defective pathways in phenotype prediction problems via the Fisher’s ratio sampler. In: IWBBIO 2018 (2018)CrossRef Cernea, A., Fernández-Martínez, J.L., de Andrés-Galiana, E.J., Fernández-Ovies, F.J., Fernández-Muñiz, Z., Álvarez-Machancoses, O., Saligan, L., Sonis, S.: Sampling defective pathways in phenotype prediction problems via the Fisher’s ratio sampler. In: IWBBIO 2018 (2018)CrossRef
Metadaten
Titel
Sampling Defective Pathways in Phenotype Prediction Problems via the Holdout Sampler
verfasst von
Juan Luis Fernández-Martínez
Ana Cernea
Enrique J. deAndrés-Galiana
Francisco Javier Fernández-Ovies
Zulima Fernández-Muñiz
Oscar Alvarez-Machancoses
Leorey Saligan
Stephen T. Sonis
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-78759-6_3