Skip to main content
Top
Published in: The Journal of Supercomputing 2/2023

22-07-2022

Parallel protein multiple sequence alignment approaches: a systematic literature review

Authors: Sergio H. Almanza-Ruiz, Arturo Chavoya, Hector A. Duran-Limon

Published in: The Journal of Supercomputing | Issue 2/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Multiple sequence alignment approaches refer to algorithmic solutions for the alignment of biological sequences. Since multiple sequence alignment has exponential time complexity when a dynamic programming approach is applied, a substantial number of parallel computing approaches have been implemented in the last two decades to improve their performance. In this paper, we present a systematic literature review of parallel computing approaches applied to multiple sequence alignment algorithms for proteins, published in the open literature from 1988 to 2022; we extracted articles from four scientific databases: ACM Digital Library, IEEE Xplore, Science Direct and SpringerLink, and four journals: Bioinformatics, PLOS Computational Biology, PLOS ONE, and Scientific Reports. Additionally, in order to cover other potential databases and journals, we performed a transversal search through Google Scholar. We conducted a selection process that yielded 106 research articles; then, we analyzed these articles and defined a classification framework. Additionally, we point out some directions and trends for parallel computing approaches for multiple sequence alignment, as well as some unsolved problems.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Literature
22.
go back to reference Yap TK, Munson PJ, Frieder O, Martino RL (1995) Parallel multiple sequence alignment using speculative computation. In: Proceedings of the 1995 International Conference on Parallel Processing ICPP Yap TK, Munson PJ, Frieder O, Martino RL (1995) Parallel multiple sequence alignment using speculative computation. In: Proceedings of the 1995 International Conference on Parallel Processing ICPP
24.
go back to reference Martino RL, Yap TK, Suh EB (1997) Parallel algorithms in molecular biology. In: Hertzberger B, Sloot P (eds) High-Performance Computing and Networking. Springer, Berlin, Heidelberg, pp 232–240CrossRef Martino RL, Yap TK, Suh EB (1997) Parallel algorithms in molecular biology. In: Hertzberger B, Sloot P (eds) High-Performance Computing and Networking. Springer, Berlin, Heidelberg, pp 232–240CrossRef
26.
go back to reference Anbarasu LA, Narayanasamy P, Sundararajan V (1999) Multiple sequence alignment using parallel genetic algorithms. In: McKay B, Yao X, Newton CS, Kim J-H, Furuhashi T (eds) Simulated Evolution and Learning. Springer, Berlin, Heidelberg, pp 130–137CrossRef Anbarasu LA, Narayanasamy P, Sundararajan V (1999) Multiple sequence alignment using parallel genetic algorithms. In: McKay B, Yao X, Newton CS, Kim J-H, Furuhashi T (eds) Simulated Evolution and Learning. Springer, Berlin, Heidelberg, pp 130–137CrossRef
27.
go back to reference Anbarasu LA, Narayanasamy P, Sundararajan V (2000) Multiple molecular sequence alignment by island parallel genetic algorithm. Curr Sci 78(7):858–863 Anbarasu LA, Narayanasamy P, Sundararajan V (2000) Multiple molecular sequence alignment by island parallel genetic algorithm. Curr Sci 78(7):858–863
28.
30.
go back to reference Catalyurek U, Gray M, Kurc T, Saltz J, Stahlberg E, Ferreira R (2003) A component-based implementation of multiple sequence alignment. In: Proceedings of the 2003 ACM Symposium on Applied Computing. SAC ’03, pp. 122–126. Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/952532.952559 Catalyurek U, Gray M, Kurc T, Saltz J, Stahlberg E, Ferreira R (2003) A component-based implementation of multiple sequence alignment. In: Proceedings of the 2003 ACM Symposium on Applied Computing. SAC ’03, pp. 122–126. Association for Computing Machinery, New York, NY, USA. https://​doi.​org/​10.​1145/​952532.​952559
31.
go back to reference Cheetham J, Dehne F, Pitre S, Rau-Chaplin A, Taillon PJ (2003) Parallel CLUSTAL W for PC clusters. In: Kumar, V., Gavrilova, M.L., Tan, C.J.K., L’Ecuyer, P. (eds.) International Conference on Computational Science and Its Applications — ICCSA 2003, pp. 300–309. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44843-8_32 Cheetham J, Dehne F, Pitre S, Rau-Chaplin A, Taillon PJ (2003) Parallel CLUSTAL W for PC clusters. In: Kumar, V., Gavrilova, M.L., Tan, C.J.K., L’Ecuyer, P. (eds.) International Conference on Computational Science and Its Applications — ICCSA 2003, pp. 300–309. Springer, Berlin, Heidelberg. https://​doi.​org/​10.​1007/​3-540-44843-8_​32
33.
go back to reference Zhihua D, Feng L (2003) Parallel computation for multiple sequence alignments. In: Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint, vol. 1, pp. 300–3031. https://doi.org/10.1109/ICICS.2003.1292464 Zhihua D, Feng L (2003) Parallel computation for multiple sequence alignments. In: Fourth International Conference on Information, Communications and Signal Processing, 2003 and the Fourth Pacific Rim Conference on Multimedia. Proceedings of the 2003 Joint, vol. 1, pp. 300–3031. https://​doi.​org/​10.​1109/​ICICS.​2003.​1292464
37.
39.
51.
57.
go back to reference Zola J, Yang X, Rospondek A, Aluru S (2007) PARALLEL-TCOFFEE: A parallel multiple sequence aligner. In: Proceedings of the ISCA 20th International Conference on Parallel and Distributed Computing Systems, September 24-26, 2007, Las Vegas, Nevada, USA, pp. 248–253 Zola J, Yang X, Rospondek A, Aluru S (2007) PARALLEL-TCOFFEE: A parallel multiple sequence aligner. In: Proceedings of the ISCA 20th International Conference on Parallel and Distributed Computing Systems, September 24-26, 2007, Las Vegas, Nevada, USA, pp. 248–253
58.
go back to reference Helal M, El-Gindy H, Mullin L, Gaeta B (2008) Parallelizing optimal multiple sequence alignment by dynamic programming. In: 2008 IEEE International Symposium on Parallel and Distributed Processing with Applications, pp. 669–674. https://doi.org/10.1109/ISPA.2008.93 Helal M, El-Gindy H, Mullin L, Gaeta B (2008) Parallelizing optimal multiple sequence alignment by dynamic programming. In: 2008 IEEE International Symposium on Parallel and Distributed Processing with Applications, pp. 669–674. https://​doi.​org/​10.​1109/​ISPA.​2008.​93
62.
go back to reference Liu Y, Schmidt B, Maskell DL (2009) MSA-CUDA: Multiple sequence alignment on graphics processing units with CUDA. In: 2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors, pp. 121–128. https://doi.org/10.1109/ASAP.2009.14 Liu Y, Schmidt B, Maskell DL (2009) MSA-CUDA: Multiple sequence alignment on graphics processing units with CUDA. In: 2009 20th IEEE International Conference on Application-specific Systems, Architectures and Processors, pp. 121–128. https://​doi.​org/​10.​1109/​ASAP.​2009.​14
65.
go back to reference Wirawan A, Schmidt B, Kwoh CK (2009) Pairwise distance matrix computation for multiple sequence alignment on the cell broadband engine. In: Allen G, Nabrzyski J, Seidel E, van Albada GD, Dongarra J, Sloot PMA (eds) Computational Science - ICCS 2009. Springer, Berlin, Heidelberg, pp 954–963CrossRef Wirawan A, Schmidt B, Kwoh CK (2009) Pairwise distance matrix computation for multiple sequence alignment on the cell broadband engine. In: Allen G, Nabrzyski J, Seidel E, van Albada GD, Dongarra J, Sloot PMA (eds) Computational Science - ICCS 2009. Springer, Berlin, Heidelberg, pp 954–963CrossRef
67.
71.
go back to reference Miranda LA, Caetano MAF, Melo ACMA, Correa JM, Bordim JL (2010) Multiple biological sequence alignment with a parallel island injection genetic algorithm. In: 2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC), pp. 314–321. https://doi.org/10.1109/HPCC.2010.31 Miranda LA, Caetano MAF, Melo ACMA, Correa JM, Bordim JL (2010) Multiple biological sequence alignment with a parallel island injection genetic algorithm. In: 2010 IEEE 12th International Conference on High Performance Computing and Communications (HPCC), pp. 314–321. https://​doi.​org/​10.​1109/​HPCC.​2010.​31
73.
go back to reference de Araujo Macedo E, Magalhaes Alves de Melo AC, Pfitscher GH, Boukerche A (2011) Hybrid MPI/OpenMP strategy for biological multiple sequence alignment with DIALIGN-TX in heterogeneous multicore clusters. In: 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum, pp. 418–425. https://doi.org/10.1109/IPDPS.2011.169 de Araujo Macedo E, Magalhaes Alves de Melo AC, Pfitscher GH, Boukerche A (2011) Hybrid MPI/OpenMP strategy for biological multiple sequence alignment with DIALIGN-TX in heterogeneous multicore clusters. In: 2011 IEEE International Symposium on Parallel and Distributed Processing Workshops and Phd Forum, pp. 418–425. https://​doi.​org/​10.​1109/​IPDPS.​2011.​169
78.
81.
go back to reference Mahram A, Herbordt MC (2012) FMSA: FPGA-accelerated ClustalW-based multiple sequence alignment through pipelined prefiltering. In: 2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines, pp. 177–183. https://doi.org/10.1109/FCCM.2012.38 Mahram A, Herbordt MC (2012) FMSA: FPGA-accelerated ClustalW-based multiple sequence alignment through pipelined prefiltering. In: 2012 IEEE 20th International Symposium on Field-Programmable Custom Computing Machines, pp. 177–183. https://​doi.​org/​10.​1109/​FCCM.​2012.​38
83.
go back to reference Orobitg M, Cores F, Guirado F, Kemena C, Notredame C, Ripoll A (2012) Enhancing the scalability of consistency-based progressive multiple sequences alignment applications. In: 2012 IEEE 26th International Parallel and Distributed Processing Symposium, pp. 71–82. https://doi.org/10.1109/IPDPS.2012.17 Orobitg M, Cores F, Guirado F, Kemena C, Notredame C, Ripoll A (2012) Enhancing the scalability of consistency-based progressive multiple sequences alignment applications. In: 2012 IEEE 26th International Parallel and Distributed Processing Symposium, pp. 71–82. https://​doi.​org/​10.​1109/​IPDPS.​2012.​17
86.
go back to reference Esteban FJ, Díaz D, Hernández P, Caballero JA, Dorado G, Gálvez S (2013) Direct approaches to exploit many-core architecture in bioinformatics. Future Gener Comput Syst 29(1), 15–26. https://doi.org/10.1016/j.future.2012.03.018. Including Special section: AIRCC-NetCoM 2009 and Special section: Clouds and Service-Oriented Architectures Esteban FJ, Díaz D, Hernández P, Caballero JA, Dorado G, Gálvez S (2013) Direct approaches to exploit many-core architecture in bioinformatics. Future Gener Comput Syst 29(1), 15–26. https://​doi.​org/​10.​1016/​j.​future.​2012.​03.​018. Including Special section: AIRCC-NetCoM 2009 and Special section: Clouds and Service-Oriented Architectures
87.
go back to reference Hatem M, Ruml W (2013) External memory best-first search for multiple sequence alignment. Proc AAAI Conf Artif Intell 27(1):409–416 Hatem M, Ruml W (2013) External memory best-first search for multiple sequence alignment. Proc AAAI Conf Artif Intell 27(1):409–416
90.
go back to reference Orobitg M, Lladós J, Guirado F, Cores F, Notredame C (2013) Scalability and accuracy improvements of consistency-based multiple sequence alignment tools. In: Proceedings of the 20th European MPI Users’ Group Meeting. EuroMPI ’13, pp. 259–264. Association for Computing Machinery, New York, NY, USA. https://doi.org/10.1145/2488551.2488583 Orobitg M, Lladós J, Guirado F, Cores F, Notredame C (2013) Scalability and accuracy improvements of consistency-based multiple sequence alignment tools. In: Proceedings of the 20th European MPI Users’ Group Meeting. EuroMPI ’13, pp. 259–264. Association for Computing Machinery, New York, NY, USA. https://​doi.​org/​10.​1145/​2488551.​2488583
102.
go back to reference Sundfeld D, Teodoro G, Magalhaes Alves de Melo AC (2015) Parallel A-Star multiple sequence alignment with locality-sensitive hash functions. In: 2015 Ninth International Conference on Complex, Intelligent, and Software Intensive Systems, pp. 342–347. https://doi.org/10.1109/CISIS.2015.50 Sundfeld D, Teodoro G, Magalhaes Alves de Melo AC (2015) Parallel A-Star multiple sequence alignment with locality-sensitive hash functions. In: 2015 Ninth International Conference on Complex, Intelligent, and Software Intensive Systems, pp. 342–347. https://​doi.​org/​10.​1109/​CISIS.​2015.​50
105.
go back to reference Amorim AR, Visotaky JMV, de Godoi Contessoto A, Neves LA, Gratão De Souza RC, Valêncio CR, Zafalon GFD (2016) Performance improvement of genetic algorithm for multiple sequence alignment. In: 2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT), pp. 69–72. https://doi.org/10.1109/PDCAT.2016.029 Amorim AR, Visotaky JMV, de Godoi Contessoto A, Neves LA, Gratão De Souza RC, Valêncio CR, Zafalon GFD (2016) Performance improvement of genetic algorithm for multiple sequence alignment. In: 2016 17th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT), pp. 69–72. https://​doi.​org/​10.​1109/​PDCAT.​2016.​029
Metadata
Title
Parallel protein multiple sequence alignment approaches: a systematic literature review
Authors
Sergio H. Almanza-Ruiz
Arturo Chavoya
Hector A. Duran-Limon
Publication date
22-07-2022
Publisher
Springer US
Published in
The Journal of Supercomputing / Issue 2/2023
Print ISSN: 0920-8542
Electronic ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-022-04697-9

Other articles of this Issue 2/2023

The Journal of Supercomputing 2/2023 Go to the issue

Premium Partner