Skip to main content
Top

2016 | OriginalPaper | Chapter

GENESIS—Cloud-Based System for Next Generation Sequencing Analysis: A Proof of Concept

Authors : Maider Alberich, Arkaitz Artetxe, Eduardo Santamaría-Navarro, Alfons Nonell-Canals, Grégory Maclair

Published in: Innovation in Medicine and Healthcare 2016

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

With the advent of the technology, the DNA sequencing has become cheaper and faster. Next-Generation Sequencing platforms are providing new opportunities to address biological and medical issues. However, they present new challenges of storing, handling and processing, as they produce massive amounts of data. Powerful computational infrastructure, new bioinformatics softwares and skilled people in programming are required to work with the analysis tools. This project aims to design and develop an intelligent system that analyses high-throughput datasets, with the purpose of improving the effectiveness in the biological and medical research fields. The target is to make a user-friendly tool that allows the user to automatically or manually design the desired analysis workflow. Therefore, the technological challenges consist in: (i) an interface between clinician and bioinformatics language, (ii) an intelligent tool that selects the appropriate analysis workflow and (iii) a solution that can handle, store and manage big datasets at a reasonable-price. In order to tackle these bottlenecks, a cloud-based prototype enhanced by a graphical user-friendly interface and implemented using Amazon Web Service.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Mardis, E.R.: The impact of next-generation sequencing technology on genetics. Trends Genet. 24(3), 133–141 (2008)CrossRef Mardis, E.R.: The impact of next-generation sequencing technology on genetics. Trends Genet. 24(3), 133–141 (2008)CrossRef
2.
go back to reference Quail, M., Smith, M.E., Coupland, P., Otto, T.D., Harris, S.R., Connor, T.R., Bertoni, A., Swerdlow, H.P., Gu, Y.: A tale of three next generation sequencing platforms: comparison of ion torrent, pacific biosciences and illumina MiSeq sequencers. BMC Genomics 13(1). 341 (2012) Quail, M., Smith, M.E., Coupland, P., Otto, T.D., Harris, S.R., Connor, T.R., Bertoni, A., Swerdlow, H.P., Gu, Y.: A tale of three next generation sequencing platforms: comparison of ion torrent, pacific biosciences and illumina MiSeq sequencers. BMC Genomics 13(1). 341 (2012)
3.
go back to reference Shendure, J., Ji, Hanlee: Next-generation DNA sequencing. Nat. Biotechnol. 26(10), 1135–1145 (2008)CrossRef Shendure, J., Ji, Hanlee: Next-generation DNA sequencing. Nat. Biotechnol. 26(10), 1135–1145 (2008)CrossRef
4.
go back to reference Bhuvaneshwar, K., Sulakhe, D., Gauba, R., Rodriguez, A., Madduri, R., Dave, U., Lacinski, L., Foster, I., Gusev, Y., Madhavan, S.: A case study for cloud based high throughput analysis of NGS data using the globus genomics system. Comput. Struct. Biotechnol. J. 13, 64–74 (2015)CrossRef Bhuvaneshwar, K., Sulakhe, D., Gauba, R., Rodriguez, A., Madduri, R., Dave, U., Lacinski, L., Foster, I., Gusev, Y., Madhavan, S.: A case study for cloud based high throughput analysis of NGS data using the globus genomics system. Comput. Struct. Biotechnol. J. 13, 64–74 (2015)CrossRef
5.
go back to reference Thakur, R.S., Bandopadhyay, R., Chaudhary, B., Chatterjee, S.: Now and next-generation sequencing techniques: future of sequence analysis using cloud computing. Front. Gene 3 (2012) Thakur, R.S., Bandopadhyay, R., Chaudhary, B., Chatterjee, S.: Now and next-generation sequencing techniques: future of sequence analysis using cloud computing. Front. Gene 3 (2012)
6.
go back to reference Nagasaki, H., Mochizuki, T., Kodama, Y., Saruhashi, S., Morizaki, S., Sugawara, H., Ohyanagi, H., Kurata, N., Okubo, K., Takagi, T., Kaminuma, E., Nakamura, Y.: DDBJ read annotation pipeline: A cloud computing-based pipeline for high-throughput analysis of next-generation sequencing data. DNA Res. 20(4), 383–390 (2013)CrossRef Nagasaki, H., Mochizuki, T., Kodama, Y., Saruhashi, S., Morizaki, S., Sugawara, H., Ohyanagi, H., Kurata, N., Okubo, K., Takagi, T., Kaminuma, E., Nakamura, Y.: DDBJ read annotation pipeline: A cloud computing-based pipeline for high-throughput analysis of next-generation sequencing data. DNA Res. 20(4), 383–390 (2013)CrossRef
7.
go back to reference Goecks, J., Nekrutenko, A., Taylor, J.: Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol. 11, R86 (2010)CrossRef Goecks, J., Nekrutenko, A., Taylor, J.: Galaxy: a comprehensive approach for supporting accessible, reproducible, and transparent computational research in the life sciences. Genome Biol. 11, R86 (2010)CrossRef
8.
go back to reference Rex, D.E., Ma, J.Q., Toga, A.W.: The LONI pipeline processing environment. Neuroimage 19, 1033–1048 (2003)CrossRef Rex, D.E., Ma, J.Q., Toga, A.W.: The LONI pipeline processing environment. Neuroimage 19, 1033–1048 (2003)CrossRef
9.
go back to reference Hull, D., Wolstencroft, K., Stevens, R., et al.: Taverna: a tool for building and running workflows of services. Nucleic Acids Res. 34, W729–W732 (2006)CrossRef Hull, D., Wolstencroft, K., Stevens, R., et al.: Taverna: a tool for building and running workflows of services. Nucleic Acids Res. 34, W729–W732 (2006)CrossRef
10.
go back to reference Pabinger, S., Dander, A., Fischer, M., Snajder, R., Sperk, M., Efremova, M., Krabichler, B., Speicher, M.R., Zschocke, J., Trajanoski, Z.: A survey of tools for variant analysis of next-generation genome sequencing data. Briefings Bioinform. 15(2), 256–278 (2013)CrossRef Pabinger, S., Dander, A., Fischer, M., Snajder, R., Sperk, M., Efremova, M., Krabichler, B., Speicher, M.R., Zschocke, J., Trajanoski, Z.: A survey of tools for variant analysis of next-generation genome sequencing data. Briefings Bioinform. 15(2), 256–278 (2013)CrossRef
11.
go back to reference Torri, F., Dinov, I.D., Zamanyan, A. et al.: Next generation sequence analysis and computational genomics using graphical pipeline workflows. Genes 3(4), 545–575 (2012) Torri, F., Dinov, I.D., Zamanyan, A. et al.: Next generation sequence analysis and computational genomics using graphical pipeline workflows. Genes 3(4), 545–575 (2012)
17.
go back to reference Langmead, B., Trapnell, C., Pop, M., Salzberg, S.L.: Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Fenome Biol. 10(3), R25 (2009) Langmead, B., Trapnell, C., Pop, M., Salzberg, S.L.: Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Fenome Biol. 10(3), R25 (2009)
18.
go back to reference Li, H.: A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27(21), 2987–2993 (2011) Li, H.: A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter estimation from sequencing data. Bioinformatics 27(21), 2987–2993 (2011)
19.
go back to reference Danecek, P., Auton, A., et al.: The variant call format and VCFtools. Bioinformatics 27(15), 2156–2158 (2011)CrossRef Danecek, P., Auton, A., et al.: The variant call format and VCFtools. Bioinformatics 27(15), 2156–2158 (2011)CrossRef
Metadata
Title
GENESIS—Cloud-Based System for Next Generation Sequencing Analysis: A Proof of Concept
Authors
Maider Alberich
Arkaitz Artetxe
Eduardo Santamaría-Navarro
Alfons Nonell-Canals
Grégory Maclair
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-39687-3_28

Premium Partner