Skip to main content
Top

2015 | OriginalPaper | Chapter

Parallelizing the Smith-Waterman Algorithm Using OpenSHMEM and MPI-3 One-Sided Interfaces

Authors : Matthew Baker, Aaron Welch, Manjunath Gorentla Venkata

Published in: OpenSHMEM and Related Technologies. Experiences, Implementations, and Technologies

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The Smith-Waterman algorithm is used for determining the similarity between two very long data streams. A popular application of the Smith-Waterman algorithm is for sequence alignment in DNA sequences. Like many computational algorithms, the Smith-Waterman algorithm is constrained by the memory resources and the computational capacity of the system. As such, it can be accelerated and run at larger scales by parallelizing the implementation, allowing the work to be distributed to exploit HPC systems. A central part of the algorithm is computing the similarity matrix which is the mechanism that evaluates the quality of the matching sequences. This access pattern to the matrix to compute the similarity is non-uniform; as such, it better suits the Partioned Global Address Space (PGAS) programming model. In this paper, we explore parallelizing the Smith-Waterman algorithm using the OpenSHMEM model and interfaces in OpenSHMEM 1.2 as well as the one-sided communication interfaces in MPI-3. Further, we also explore the advantages of using non-blocking communication interfaces, which are proposed as extensions for a future OpenSHMEM specification. We evaluate the parallel implementation on Titan, a Cray XK7 system at the Oak Ridge Leadership Computing Facility (OLCF). Our results demonstrate good weak and strong scaling characteristics for both of the OpenSHMEM and MPI-3 implementations.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Smith, T.F., Waterman, M.S.: Identification of common molecular subsequences. J. Mol. Biol. 147, 195–197 (1981)CrossRef Smith, T.F., Waterman, M.S.: Identification of common molecular subsequences. J. Mol. Biol. 147, 195–197 (1981)CrossRef
2.
go back to reference Jaleel, A., Mattina, M., Jacob, B.: Last level cache (llc) performance of data mining workloads on a CMP - a case study of parallel bioinformatics workloads. In: 2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA), pp. 88–98 (2006) Jaleel, A., Mattina, M., Jacob, B.: Last level cache (llc) performance of data mining workloads on a CMP - a case study of parallel bioinformatics workloads. In: 2014 IEEE 20th International Symposium on High Performance Computer Architecture (HPCA), pp. 88–98 (2006)
3.
go back to reference Wang, Y., Lu, J., Yu, J., Gibbs, R.A., Yu, F.: An integrative variant analysis pipeline for accurate genotype/haplotype inference in population NGS data. Genome Res. 23, 833–842 (2013)CrossRef Wang, Y., Lu, J., Yu, J., Gibbs, R.A., Yu, F.: An integrative variant analysis pipeline for accurate genotype/haplotype inference in population NGS data. Genome Res. 23, 833–842 (2013)CrossRef
4.
go back to reference El-Saghir, Z., Kelash, H., Elnazly, S., Faheem, H.: Parallel implementation of smith-waterman algorithm using MPI, openmp and hybrid model. Int. J. Innovative Technol. Exploring Eng. 4, 1–5 (2014) El-Saghir, Z., Kelash, H., Elnazly, S., Faheem, H.: Parallel implementation of smith-waterman algorithm using MPI, openmp and hybrid model. Int. J. Innovative Technol. Exploring Eng. 4, 1–5 (2014)
5.
go back to reference Hamidouche, K., Mendonca, F., Falcou, J., de Melo, A., Etiemble, D.: Parallel smith-waterman comparison on multicore and manycore computing platforms with BSP++. Int. J. Parallel Prog. 41, 111–136 (2013)CrossRef Hamidouche, K., Mendonca, F., Falcou, J., de Melo, A., Etiemble, D.: Parallel smith-waterman comparison on multicore and manycore computing platforms with BSP++. Int. J. Parallel Prog. 41, 111–136 (2013)CrossRef
6.
go back to reference Noorian, M., Pooshfam, H., Noorian, Z., Abdullah, R.: Performance enhancement of smith-waterman algorithm using hybrid model: Comparing the mpi and hybrid programming paradigm on smp clusters. In: SMC 2009, IEEE International Conference on Systems, Man and Cybernetics, pp. 492–497 (2009) Noorian, M., Pooshfam, H., Noorian, Z., Abdullah, R.: Performance enhancement of smith-waterman algorithm using hybrid model: Comparing the mpi and hybrid programming paradigm on smp clusters. In: SMC 2009, IEEE International Conference on Systems, Man and Cybernetics, pp. 492–497 (2009)
7.
go back to reference Khajeh-Saeed, A., Poole, S., Perot, J.B.: Acceleration of the smith-waterman algorithm using single and multiple graphics processors. J. Comput. Phys. 229, 4247–4258 (2010)CrossRefMathSciNetMATH Khajeh-Saeed, A., Poole, S., Perot, J.B.: Acceleration of the smith-waterman algorithm using single and multiple graphics processors. J. Comput. Phys. 229, 4247–4258 (2010)CrossRefMathSciNetMATH
9.
go back to reference The MPI Forum: MPI: A Message Passing Interface. Technical report, Version 3.0 (2012) The MPI Forum: MPI: A Message Passing Interface. Technical report, Version 3.0 (2012)
10.
go back to reference Gotoh, O.: An improved algorithm for matching biological sequences. J. Mol. Biol. 162, 705–708 (1982)CrossRef Gotoh, O.: An improved algorithm for matching biological sequences. J. Mol. Biol. 162, 705–708 (1982)CrossRef
11.
go back to reference ten Bruggencate, M.: Cray shmem update. Presentation regarding extensions to OpenSHMEM by Cray (2014). Accessed 23 June 2015 ten Bruggencate, M.: Cray shmem update. Presentation regarding extensions to OpenSHMEM by Cray (2014). Accessed 23 June 2015
12.
go back to reference Bader, D., Madduri, K., Gilbert, J., Shah, V., Kepner, J., Meuse, T., Krishnamurthy, A.: Designing scalable synthetic compact applications for benchmarking high productivity computing systems. Cyberinfrastructure Tech. Watch 2, 1–10 (2006) Bader, D., Madduri, K., Gilbert, J., Shah, V., Kepner, J., Meuse, T., Krishnamurthy, A.: Designing scalable synthetic compact applications for benchmarking high productivity computing systems. Cyberinfrastructure Tech. Watch 2, 1–10 (2006)
Metadata
Title
Parallelizing the Smith-Waterman Algorithm Using OpenSHMEM and MPI-3 One-Sided Interfaces
Authors
Matthew Baker
Aaron Welch
Manjunath Gorentla Venkata
Copyright Year
2015
DOI
https://doi.org/10.1007/978-3-319-26428-8_12

Premium Partner