Skip to main content
Top

2015 | OriginalPaper | Chapter

28. Assessing a Spatial Boost Model for Quantitative Trait GWAS

Authors : Ian Johnston, Yang Jin, Luis Carvalho

Published in: Interdisciplinary Bayesian Statistics

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Bayesian variable selection provides a principled framework for incorporating prior information to regularize parameters in high-dimensional large-p-small-n regression models such as genomewide association studies (GWAS). Although these models produce more informative results, researchers often disregard them in favor of simpler models because of their high computational cost. We explore our recently proposed spatial boost model for GWAS on quantitative traits to assess the computational efficiency of a more representative model. The spatial boost model is a Bayesian hierarchical model that exploits spatial information on the genome to uniquely define prior probabilities of association of genetic markers based on their proximities to relevant genes. We propose analyzing large data sets by first applying an expectation–maximization filter to reduce the dimensionality of the space and then applying an efficient Gibbs sampler on the remaining markers. Finally we conduct a thorough simulation study based on real genotypes provided by the Wellcome Trust Case Control Consortium and compare our model to single association tests.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Ardlie, K.G., Kruglyak, L., Seielstad, M.: Patterns of linkage disequilibrium in the human genome. Nat. Rev. Genet. 3(4), 299–309 (2002)CrossRef Ardlie, K.G., Kruglyak, L., Seielstad, M.: Patterns of linkage disequilibrium in the human genome. Nat. Rev. Genet. 3(4), 299–309 (2002)CrossRef
2.
go back to reference Bradley, A.P.: The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit. 30(7), 1145–1159 (1997)CrossRef Bradley, A.P.: The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit. 30(7), 1145–1159 (1997)CrossRef
3.
go back to reference Brooks, S.P., Gelman, A.: General methods for monitoring convergence of iterative simulations. J. Comput. Graph. Stat. 7(4), 434–455 (1998)MathSciNet Brooks, S.P., Gelman, A.: General methods for monitoring convergence of iterative simulations. J. Comput. Graph. Stat. 7(4), 434–455 (1998)MathSciNet
4.
go back to reference Carvalho, L.E., Lawrence, C.E.: Centroid estimation in discrete high-dimensional spaces with applications in biology. Proc. Natl. Acad. Sci. 105(9), 3209–3214 (2008)CrossRef Carvalho, L.E., Lawrence, C.E.: Centroid estimation in discrete high-dimensional spaces with applications in biology. Proc. Natl. Acad. Sci. 105(9), 3209–3214 (2008)CrossRef
5.
go back to reference Guan, Y., Stephens, M.: Bayesian variable selection regression for genome-wide association studies and other large-scale problems. Ann. Appl. Stat. 5(3), 1780–1815 (2011)CrossRefMATHMathSciNet Guan, Y., Stephens, M.: Bayesian variable selection regression for genome-wide association studies and other large-scale problems. Ann. Appl. Stat. 5(3), 1780–1815 (2011)CrossRefMATHMathSciNet
6.
go back to reference Ishwaran, H., Rao, J.S.: Spike and slab variable selection: frequentist and Bayesian strategies. Ann. Stat. 33(2), 730–773 (2005)CrossRefMATHMathSciNet Ishwaran, H., Rao, J.S.: Spike and slab variable selection: frequentist and Bayesian strategies. Ann. Stat. 33(2), 730–773 (2005)CrossRefMATHMathSciNet
7.
go back to reference Lewis, B.: irlba: Fast partial SVD by implicitly-restarted Lanczos bidiagonalization. R package version 0.1 1, 1520 (2009) Lewis, B.: irlba: Fast partial SVD by implicitly-restarted Lanczos bidiagonalization. R package version 0.1 1, 1520 (2009)
8.
go back to reference Price, A.L., Patterson, N.J., Plenge, R.M., Weinblatt, M.E., Shadick, N.A., Reich, D.: Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38(8), 904–909 (2006)CrossRef Price, A.L., Patterson, N.J., Plenge, R.M., Weinblatt, M.E., Shadick, N.A., Reich, D.: Principal components analysis corrects for stratification in genome-wide association studies. Nat. Genet. 38(8), 904–909 (2006)CrossRef
9.
go back to reference Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M.A., Bender, D., Maller, J., Sklar, P., De Bakker, P.I., Daly, M.J., et al.: PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81(3), 559–575 (2007)CrossRef Purcell, S., Neale, B., Todd-Brown, K., Thomas, L., Ferreira, M.A., Bender, D., Maller, J., Sklar, P., De Bakker, P.I., Daly, M.J., et al.: PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81(3), 559–575 (2007)CrossRef
10.
go back to reference Ročková, V., George, E.I.: EMVS: The EM approach to Bayesian variable selection. J. Am. Stat. Assoc. 109(506), 828–846 (2014)CrossRef Ročková, V., George, E.I.: EMVS: The EM approach to Bayesian variable selection. J. Am. Stat. Assoc. 109(506), 828–846 (2014)CrossRef
11.
go back to reference Wigginton, J.E., Cutler, D.J., Abecasis, G.R.: A note on exact tests of Hardy-Weinberg equilibrium. Am. J. Hum. Genet. 76(5), 887–893 (2005)CrossRef Wigginton, J.E., Cutler, D.J., Abecasis, G.R.: A note on exact tests of Hardy-Weinberg equilibrium. Am. J. Hum. Genet. 76(5), 887–893 (2005)CrossRef
12.
go back to reference Wu, T.T., Chen, Y.F., Hastie, T., Sobel, E., Lange, K.: Genome-wide association analysis by lasso penalized logistic regression. Bioinformatics 25(6), 714–721 (2009)CrossRef Wu, T.T., Chen, Y.F., Hastie, T., Sobel, E., Lange, K.: Genome-wide association analysis by lasso penalized logistic regression. Bioinformatics 25(6), 714–721 (2009)CrossRef
Metadata
Title
Assessing a Spatial Boost Model for Quantitative Trait GWAS
Authors
Ian Johnston
Yang Jin
Luis Carvalho
Copyright Year
2015
DOI
https://doi.org/10.1007/978-3-319-12454-4_28

Premium Partner