1. Bromberg Y. Building a genome analysis pipeline to predict disease risk and prevent disease. J Mol Biol 2013;425:3993–4005. PMID:
23928561.
2. Moreau Y, Tranchevent LC. Computational tools for prioritizing candidate genes: boosting disease gene discovery. Nat Rev Genet 2012;13:523–536. PMID:
22751426.
3. Altmann A, Weber P, Bader D, Preuss M, Binder EB, Müller-Myhsok B. A beginners guide to SNP calling from high-throughput DNA-sequencing data. Hum Genet 2012;131:1541–1554. PMID:
22886560.
4. Lyon GJ, Wang K. Identifying disease mutations in genomic medicine settings: current challenges and how to accelerate progress. Genome Med 2012;4:58. PMID:
22830651.
5. Pabinger S, Dander A, Fischer M, Snajder R, Sperk M, Efremova M,
et al. A survey of tools for variant analysis of next-generation genome sequencing data. Brief Bioinform 2013 1 21 [Epub].
http://dx.doi.org/10.1093/bib/bbs086.
6. Ng SB, Turner EH, Robertson PD, Flygare SD, Bigham AW, Lee C,
et al. Targeted capture and massively parallel sequencing of 12 human exomes. Nature 2009;461:272–276. PMID:
19684571.
7. Lee WP, Stromberg M, Ward A, Stewart C, Garrison E, Marth GT. MOSAIK: a hash-based algorithm for accurate next-generation sequencing read mapping [database]. Ithaca: arXiv, Cornell University, 2013. arXiv:1309.1149.
8. Alkan C, Kidd JM, Marques-Bonet T, Aksay G, Antonacci F, Hormozdiari F,
et al. Personalized copy number and segmental duplication maps using next-generation sequencing. Nat Genet 2009;41:1061–1067. PMID:
19718026.
9. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 2009;25:1754–1760. PMID:
19451168.
10. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods 2012;9:357–359. PMID:
22388286.
11. Li R, Yu C, Li Y, Lam TW, Yiu SM, Kristiansen K,
et al. SOAP2: an improved ultrafast tool for short read alignment. Bioinformatics 2009;25:1966–1967. PMID:
19497933.
12. Liao Y, Smyth GK, Shi W. The Subread aligner: fast, accurate and scalable read mapping by seed-and-vote. Nucleic Acids Res 2013;41:e108. PMID:
23558742.
13. Hatem A, Bozdğ D, Toland AE, Çatalyürek ÜV. Benchmarking short sequence mapping tools. BMC Bioinformatics 2013;14:184. PMID:
23758764.
14. Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM [database]. Ithaca: arXiv, Cornell University, 2013. arXiv:1303.3997.
15. McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A,
et al. The Genome Analysis Toolkit: a Map-Reduce framework for analyzing next-generation DNA sequencing data. Genome Res 2010;20:1297–1303. PMID:
20644199.
16. 1000 Genomes Project Consortium. Abecasis GR, Auton A, Brooks LD, DePristo MA, Durbin RM,
et al. An integrated map of genetic variation from 1,092 human genomes. Nature 2012;491:56–65. PMID:
23128226.
17. Lam HY, Pan C, Clark MJ, Lacroute P, Chen R, Haraksingh R,
et al. Detecting and annotating genetic variations using the HugeSeq pipeline. Nat Biotechnol 2012;30:226–229. PMID:
22398614.
18. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N,
et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 2009;25:2078–2079. PMID:
19505943.
19. DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C,
et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet 2011;43:491–498. PMID:
21478889.
20. Wang Y, Lu J, Yu J, Gibbs RA, Yu F. An integrative variant analysis pipeline for accurate genotype/haplotype inference in population NGS data. Genome Res 2013;23:833–842. PMID:
23296920.
21. You N, Murillo G, Su X, Zeng X, Xu J, Ning K,
et al. SNP calling using genotype model selection on high-throughput sequencing data. Bioinformatics 2012;28:643–650. PMID:
22253293.
22. Nielsen R, Paul JS, Albrechtsen A, Song YS. Genotype and SNP calling from next-generation sequencing data. Nat Rev Genet 2011;12:443–451. PMID:
21587300.
23. Browning BL, Browning SR. A unified approach to genotype imputation and haplotype-phase inference for large data sets of trios and unrelated individuals. Am J Hum Genet 2009;84:210–223. PMID:
19200528.
24. Browning BL, Yu Z. Simultaneous genotype calling and haplotype phasing improves genotype accuracy and reduces false-positive associations for genome-wide association studies. Am J Hum Genet 2009;85:847–861. PMID:
19931040.
25. Howie BN, Donnelly P, Marchini J. A flexible and accurate genotype imputation method for the next generation of genome-wide association studies. PLoS Genet 2009;5:e1000529. PMID:
19543373.
26. Li Y, Willer CJ, Ding J, Scheet P, Abecasis GR. MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes. Genet Epidemiol 2010;34:816–834. PMID:
21058334.
27. Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res 2010;38:e164. PMID:
20601685.
28. Quinlan AR, Hall IM. BEDTools: a flexible suite of utilities for comparing genomic features. Bioinformatics 2010;26:841–842. PMID:
20110278.
29. Reese MG, Moore B, Batchelor C, Salas F, Cunningham F, Marth GT,
et al. A standard variation file format for human genome sequences. Genome Biol 2010;11:R88. PMID:
20796305.
30. Hu H, Huff CD, Moore B, Flygare S, Reese MG, Yandell M. VAAST 2.0: improved variant classification and disease-gene identification using a conservation-controlled amino acid substitution matrix. Genet Epidemiol 2013;37:622–634. PMID:
23836555.
31. Yandell M, Huff C, Hu H, Singleton M, Moore B, Xing J,
et al. A probabilistic disease-gene finder for personal genomes. Genome Res 2011;21:1529–1542. PMID:
21700766.
32. Ng PC, Henikoff S. Predicting the effects of amino acid substitutions on protein function. Annu Rev Genomics Hum Genet 2006;7:61–80. PMID:
16824020.
33. Henikoff S, Henikoff JG. Amino acid substitution matrices from protein blocks. Proc Natl Acad Sci U S A 1992;89:10915–10919. PMID:
1438297.
34. Hubisz MJ, Pollard KS, Siepel A. PHAST and RPHAST: phylogenetic analysis with space/time models. Brief Bioinform 2011;12:41–51. PMID:
21278375.
35. Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P,
et al. A method and server for predicting damaging missense mutations. Nat Methods 2010;7:248–249. PMID:
20354512.
36. Bromberg Y, Rost B. SNAP: predict effect of non-synonymous polymorphisms on function. Nucleic Acids Res 2007;35:3823–3835. PMID:
17526529.
37. Olivier M. A haplotype map of the human genome. Physiol Genomics 2003;13:3–9. PMID:
12644628.
38. International HapMap Consortium. Frazer KA, Ballinger DG, Cox DR, Hinds DA, Stuve LL,
et al. A second generation human haplotype map of over 3.1 million SNPs. Nature 2007;449:851–861. PMID:
17943122.
39. Neale BM, Rivas MA, Voight BF, Altshuler D, Devlin B, Orho-Melander M,
et al. Testing for an unusual distribution of rare variants. PLoS Genet 2011;7:e1001322. PMID:
21408211.
40. Wu MC, Lee S, Cai T, Li Y, Boehnke M, Lin X. Rare-variant association testing for sequencing data with the sequence kernel association test. Am J Hum Genet 2011;89:82–93. PMID:
21737059.
41. Li B, Leal SM. Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. Am J Hum Genet 2008;83:311–321. PMID:
18691683.
42. Stitziel NO, Kiezun A, Sunyaev S. Computational and statistical approaches to analyzing variants identified by exome sequencing. Genome Biol 2011;12:227. PMID:
21920052.
43. Viswanathan GA, Seto J, Patil S, Nudelman G, Sealfon SC. Getting started in biological pathway construction and analysis. PLoS Comput Biol 2008;4:e16. PMID:
18463709.
44. Khatri P, Sirota M, Butte AJ. Ten years of pathway analysis: current approaches and outstanding challenges. PLoS Comput Biol 2012;8:e1002375. PMID:
22383865.
45. Mi H, Muruganujan A, Thomas PD. PANTHER in 2013: modeling the evolution of gene function, and other gene attributes, in the context of phylogenetic trees. Nucleic Acids Res 2013;41:D377–D386. PMID:
23193289.
46. Joshi-Tope G, Gillespie M, Vastrik I, D'Eustachio P, Schmidt E, de Bono B,
et al. Reactome: a knowledgebase of biological pathways. Nucleic Acids Res 2005;33:D428–D432. PMID:
15608231.
47. Huang da W, Sherman BT, Lempicki RA. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nat Protoc 2009;4:44–57. PMID:
19131956.
48. Goecks J, Nekrutenko A, Taylor J. Galaxy Team. Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol 2010;11:R86. PMID:
20738864.
49. Blankenberg D, Von Kuster G, Coraor N, Ananda G, Lazarus R, Mangan M,
et al. Galaxy: a web-based genome analysis tool for experimentalists. Curr Protoc Mol Biol 2010;Chapter 19:Unit 19.10.1–Unit 19.10.21.
50. Giardine B, Riemer C, Hardison RC, Burhans R, Elnitski L, Shah P,
et al. Galaxy: a platform for interactive large-scale genome analysis. Genome Res 2005;15:1451–1455. PMID:
16169926.
51. Loman NJ, Misra RV, Dallman TJ, Constantinidou C, Gharbia SE, Wain J,
et al. Performance comparison of benchtop highthroughput sequencing platforms. Nat Biotechnol 2012;30:434–439. PMID:
22522955.
52. Sanders SJ, Murtha MT, Gupta AR, Murdoch JD, Raubeson MJ, Willsey AJ,
et al.
De novo mutations revealed by whole-exome sequencing are strongly associated with autism. Nature 2012;485:237–241. PMID:
22495306.
53. Neale BM, Kou Y, Liu L, Ma'ayan A, Samocha KE, Sabo A,
et al. Patterns and rates of exonic
de novo mutations in autism spectrum disorders. Nature 2012;485:242–245. PMID:
22495311.
54. O'Roak BJ, Deriziotis P, Lee C, Vives L, Schwartz JJ, Girirajan S,
et al. Exome sequencing in sporadic autism spectrum disorders identifies severe
de novo mutations. Nat Genet 2011;43:585–589. PMID:
21572417.