Skip to main content

2017 | OriginalPaper | Buchkapitel

EVE: Cloud-Based Annotation of Human Genetic Variants

verfasst von : Brian S. Cole, Jason H. Moore

Erschienen in: Applications of Evolutionary Computation

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Annotation of human genetic variants enables genotype-phenotype association studies at the gene, pathway, and tissue level. Annotation results are difficult to reproduce across study sites due to shifting software versions and a lack of a unified hardware interface between study sites. Cloud computing offers a promising solution by integrating hardware and software into reproducible virtual appliances which may be utilized on-demand and shared across institutions. We developed ENSEMBL VEP on EC2 (EVE), a cloud-based virtual appliance for annotation of human genetic variants built around the ENSEMBL Variant Effect Predictor. We integrated virtual hardware infrastructure, open-source software, and publicly available genomic datasets to provide annotation capability for genetic variants in the context of genes/transcripts, Gene Ontology pathways, tissue-specific expression from the Gene Expression Atlas, miRNA annotations, minor allele frequencies from the 1000 Genomes Project and the Exome Aggregation Consortium, and deleteriousness scores from Combined Annotation Dependent Depletion. We demonstrate the utility of EVE by annotating the genetic variants in a case-control study of glaucoma. Cloud computing can reduce the difficulty of replicating complex software pipelines such as annotation pipelines across study sites. We provide a publicly available CloudFormation template of the EVE virtual appliance which can automatically provision and deploy a parameterized, preconfigured hardware/software stack ready for annotation of human genetic variants (github.com/epistasislab/EVE). This approach offers increased reproducibility in human genetic studies by providing a unified appliance to researchers across the world.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Welter, D., MacArthur, J., Morales, J., et al.: The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res. 42(D1), 1001–1006 (2014). doi:10.1093/nar/gkt1229CrossRef Welter, D., MacArthur, J., Morales, J., et al.: The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res. 42(D1), 1001–1006 (2014). doi:10.​1093/​nar/​gkt1229CrossRef
6.
Zurück zum Zitat Greene, C.S., Voight, B.F.: Pathway and network-based strategies to translate genetic discoveries into effective therapies. Hum. Mol. Genet., 1–5 (2016). doi:10.1093/hmg/ddw160 Greene, C.S., Voight, B.F.: Pathway and network-based strategies to translate genetic discoveries into effective therapies. Hum. Mol. Genet., 1–5 (2016). doi:10.​1093/​hmg/​ddw160
7.
Zurück zum Zitat Greene, C.S., Krishnan, A., Wong, A.K., et al.: Understanding multicellular function and disease with human tissue-specific networks. Nat. Genet. 47(6) (2015). doi:10.1038/ng.3259 Greene, C.S., Krishnan, A., Wong, A.K., et al.: Understanding multicellular function and disease with human tissue-specific networks. Nat. Genet. 47(6) (2015). doi:10.​1038/​ng.​3259
9.
Zurück zum Zitat Wang, K., Li, M., Hakonarson, H.: ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38(16), e164 (2010). doi:10.1093/nar/gkq603CrossRef Wang, K., Li, M., Hakonarson, H.: ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38(16), e164 (2010). doi:10.​1093/​nar/​gkq603CrossRef
10.
Zurück zum Zitat Evangelou, E., Ioannidis, J.P.A.: Meta-analysis methods for genome-wide association studies and beyond. Nat. Rev. Genet. 14(6), 379–389 (2013). doi:10.1038/nrg3472CrossRef Evangelou, E., Ioannidis, J.P.A.: Meta-analysis methods for genome-wide association studies and beyond. Nat. Rev. Genet. 14(6), 379–389 (2013). doi:10.​1038/​nrg3472CrossRef
11.
Zurück zum Zitat Purcell, S., Neale, B., Todd-Brown, K., et al.: PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81(3), 559–575 (2007). doi:10.1086/519795CrossRef Purcell, S., Neale, B., Todd-Brown, K., et al.: PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81(3), 559–575 (2007). doi:10.​1086/​519795CrossRef
12.
15.
Zurück zum Zitat Kapushesky, M., Adamusiak, T., Burdett, T., et al.: Gene Expression Atlas update–a value-added database of microarray and sequencing-based functional genomics experiments. Nucleic Acids Res. 40(Database issue), D1077-81 (2012). doi:10.1093/nar/gkr913 Kapushesky, M., Adamusiak, T., Burdett, T., et al.: Gene Expression Atlas update–a value-added database of microarray and sequencing-based functional genomics experiments. Nucleic Acids Res. 40(Database issue), D1077-81 (2012). doi:10.​1093/​nar/​gkr913
16.
Zurück zum Zitat Wiggs, J.L., Hauser, M.A., Abdrabou, W., et al.: The NEIGHBOR consortium primary open angle glaucoma genome-wide association study: rationale, study design and clinical variables. J. Glaucoma 22(7), 517–525 (2013). doi:10.1097/IJG.0b013e31824d4fd8CrossRef Wiggs, J.L., Hauser, M.A., Abdrabou, W., et al.: The NEIGHBOR consortium primary open angle glaucoma genome-wide association study: rationale, study design and clinical variables. J. Glaucoma 22(7), 517–525 (2013). doi:10.​1097/​IJG.​0b013e31824d4fd8​CrossRef
17.
Zurück zum Zitat Wiggs, J.L., Yaspan, B.L., Hauser, M.A., et al.: Common variants at 9p21 and 8q22 are associated with increased susceptibility to optic nerve degeneration in glaucoma. PLoS Genet. 8(4) (2012). doi:10.1371/journal.pgen.1002654 Wiggs, J.L., Yaspan, B.L., Hauser, M.A., et al.: Common variants at 9p21 and 8q22 are associated with increased susceptibility to optic nerve degeneration in glaucoma. PLoS Genet. 8(4) (2012). doi:10.​1371/​journal.​pgen.​1002654
18.
Zurück zum Zitat Anderson, C.A., Pettersson, F.H., Clarke, G.M., Cardon, L.R., Morris, A.P., Zondervan, K.T.: Data quality control in genetic case-control association studies. Nat. Protoc. 5(9), 1564–1573 (2010). doi:10.1038/nprot.2010.116CrossRef Anderson, C.A., Pettersson, F.H., Clarke, G.M., Cardon, L.R., Morris, A.P., Zondervan, K.T.: Data quality control in genetic case-control association studies. Nat. Protoc. 5(9), 1564–1573 (2010). doi:10.​1038/​nprot.​2010.​116CrossRef
22.
Zurück zum Zitat Project Consortium G, Consortium Participants are arranged by project role G, by institution alphabetically then, et al.: An integrated map of genetic variation from 1,092 human genomes. Nature 490(7422), 56–65 (2012). doi:10.1038/nature11632 Project Consortium G, Consortium Participants are arranged by project role G, by institution alphabetically then, et al.: An integrated map of genetic variation from 1,092 human genomes. Nature 490(7422), 56–65 (2012). doi:10.​1038/​nature11632
23.
Zurück zum Zitat McLendon, R., Friedman, A., Bigner, D., et al.: Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature 455(7216), 1061–1068 (2008). doi:10.1038/nature07385CrossRef McLendon, R., Friedman, A., Bigner, D., et al.: Comprehensive genomic characterization defines human glioblastoma genes and core pathways. Nature 455(7216), 1061–1068 (2008). doi:10.​1038/​nature07385CrossRef
24.
Zurück zum Zitat Li, J., Doyle, M.A., Saeed, I., et al.: Bioinformatics pipelines for targeted resequencing and whole-exome sequencing of human and mouse genomes: a virtual appliance approach for instant deployment. PLoS One 9(4) (2014). doi:10.1371/journal.pone.0095217 Li, J., Doyle, M.A., Saeed, I., et al.: Bioinformatics pipelines for targeted resequencing and whole-exome sequencing of human and mouse genomes: a virtual appliance approach for instant deployment. PLoS One 9(4) (2014). doi:10.​1371/​journal.​pone.​0095217
Metadaten
Titel
EVE: Cloud-Based Annotation of Human Genetic Variants
verfasst von
Brian S. Cole
Jason H. Moore
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-55849-3_6