Skip to main content

2018 | OriginalPaper | Buchkapitel

10. Massively Parallel Searching of 3D Protein Structure Similarities on CUDA-Enabled GPU Devices

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Finding common molecular substructures in complex 3D protein structures is still challenging. This is especially visible when scanning entire databases containing tens or even hundreds of thousands protein structures. Graphics processing units (GPUs) and general purpose graphics processing units (GPGPUs) promise to give a high speedup of many time-consuming and computationally demanding processes over their original implementations on CPUs. In this chapter, we will see that massive parallelization of the 3D structure similarity searching on many core CUDA-enabled GPU devices leads to reduction of the execution time of the process and allows to perform it in real time.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., Bourne, P.E.: The Protein Data Bank. Nucleic Acids Res. 28, 235–242 (2000) Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., Bourne, P.E.: The Protein Data Bank. Nucleic Acids Res. 28, 235–242 (2000)
4.
Zurück zum Zitat Brożek, M.: Protein structure similarity searching with the use of CUDA. Master’s thesis, Institute of Informatics, Silesian University of Technology, Gliwice, Poland (2012) Brożek, M.: Protein structure similarity searching with the use of CUDA. Master’s thesis, Institute of Informatics, Silesian University of Technology, Gliwice, Poland (2012)
6.
Zurück zum Zitat Burkowski, F.: Structural Bioinformatics: An Algorithmic Approach, 1st edn. Chapman and Hall/CRC, Boca Raton (2008)MATHCrossRef Burkowski, F.: Structural Bioinformatics: An Algorithmic Approach, 1st edn. Chapman and Hall/CRC, Boca Raton (2008)MATHCrossRef
7.
Zurück zum Zitat Can, T., Wang, Y.F.: CTSS: a robust and efficient method for protein structure alignment based on local geometrical and biological features. In: Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference (CSB2003), pp. 169–179 (2003) Can, T., Wang, Y.F.: CTSS: a robust and efficient method for protein structure alignment based on local geometrical and biological features. In: Computational Systems Bioinformatics. CSB2003. Proceedings of the 2003 IEEE Bioinformatics Conference (CSB2003), pp. 169–179 (2003)
12.
Zurück zum Zitat Friedrichs, M.S., Eastman, P., Vaidyanathan, V., Houston, M., Legrand, S., Beberg, A.L., Ensign, D.L., Bruns, C.M., Pande, V.S.: Accelerating molecular dynamic simulation on graphics processing units. J. Comput. Chem. 30(6), 864–872 (2009). https://doi.org/10.1002/jcc.21209CrossRef Friedrichs, M.S., Eastman, P., Vaidyanathan, V., Houston, M., Legrand, S., Beberg, A.L., Ensign, D.L., Bruns, C.M., Pande, V.S.: Accelerating molecular dynamic simulation on graphics processing units. J. Comput. Chem. 30(6), 864–872 (2009). https://​doi.​org/​10.​1002/​jcc.​21209CrossRef
13.
Zurück zum Zitat Gibrat, J., Madej, T., Bryant, S.: Surprising similarities in structure comparison. Curr. Opin. Struct. Biol. 6(3), 377–385 (1996)CrossRef Gibrat, J., Madej, T., Bryant, S.: Surprising similarities in structure comparison. Curr. Opin. Struct. Biol. 6(3), 377–385 (1996)CrossRef
14.
Zurück zum Zitat Gu, J., Bourne, P.: Structural Bioinformatics (Methods of Biochemical Analysis), 2nd edn. Wiley, Hoboken (2009) Gu, J., Bourne, P.: Structural Bioinformatics (Methods of Biochemical Analysis), 2nd edn. Wiley, Hoboken (2009)
16.
Zurück zum Zitat Holm, L., Kaariainen, S., Rosenstrom, P., Schenkel, A.: Searching protein structure databases with DaliLite v. 3. Bioinformatics 24, 2780–2781 (2008)CrossRef Holm, L., Kaariainen, S., Rosenstrom, P., Schenkel, A.: Searching protein structure databases with DaliLite v. 3. Bioinformatics 24, 2780–2781 (2008)CrossRef
17.
Zurück zum Zitat Holm, L., Sander, C.: Protein structure comparison by alignment of distance matrices. J. Mol. Biol. 233(1), 123–38 (1993)CrossRef Holm, L., Sander, C.: Protein structure comparison by alignment of distance matrices. J. Mol. Biol. 233(1), 123–38 (1993)CrossRef
22.
Zurück zum Zitat Leinweber, M., Baumgärtner, L., Mernberger, M., Fober, T., Hüllermeier, E., Klebe, G., Freisleben, B.: GPU-based cloud computing for comparing the structure of protein binding sites. In: 2012 6th IEEE International Conference on Digital Ecosystems and Technologies (DEST), pp. 1–6 (2012) Leinweber, M., Baumgärtner, L., Mernberger, M., Fober, T., Hüllermeier, E., Klebe, G., Freisleben, B.: GPU-based cloud computing for comparing the structure of protein binding sites. In: 2012 6th IEEE International Conference on Digital Ecosystems and Technologies (DEST), pp. 1–6 (2012)
23.
Zurück zum Zitat Leinweber, M., Fober, T., Freisleben, B.: GPU-based point cloud superpositioning for structural comparisons of protein binding sites. IEEE/ACM Trans. Comput. Biol. Bioinform. PP(99), 1–14 (2018) Leinweber, M., Fober, T., Freisleben, B.: GPU-based point cloud superpositioning for structural comparisons of protein binding sites. IEEE/ACM Trans. Comput. Biol. Bioinform. PP(99), 1–14 (2018)
24.
Zurück zum Zitat Leinweber, M., Fober, T., Strickert, M., Baumgärtner, L., Klebe, G., Freisleben, B., Hüllermeier, E.: CavSimBase: a database for large scale comparison of protein binding sites. IEEE Trans. Knowl. Data Eng. 28(6), 1423–1434 (2016)CrossRef Leinweber, M., Fober, T., Strickert, M., Baumgärtner, L., Klebe, G., Freisleben, B., Hüllermeier, E.: CavSimBase: a database for large scale comparison of protein binding sites. IEEE Trans. Knowl. Data Eng. 28(6), 1423–1434 (2016)CrossRef
25.
Zurück zum Zitat Lesk, A.: Introduction to Protein Science: Architecture, Function, and Genomics, 2nd edn. Oxford University Press, USA (2010) Lesk, A.: Introduction to Protein Science: Architecture, Function, and Genomics, 2nd edn. Oxford University Press, USA (2010)
30.
Zurück zum Zitat Małysiak-Mrozek, B., Momot, A., Mrozek, D., Hera, Ł., Kozielski, S., Momot, M.: Scalable system for protein structure similarity searching. In: Jedrzejowicz, P., Nguyen, N.T., Hoang, K. (eds.) Computational Collective Intelligence. Technologies and Applications. Lecture Notes Computer Science, vol. 6923, pp. 271–280. Springer, Berlin (2011)CrossRef Małysiak-Mrozek, B., Momot, A., Mrozek, D., Hera, Ł., Kozielski, S., Momot, M.: Scalable system for protein structure similarity searching. In: Jedrzejowicz, P., Nguyen, N.T., Hoang, K. (eds.) Computational Collective Intelligence. Technologies and Applications. Lecture Notes Computer Science, vol. 6923, pp. 271–280. Springer, Berlin (2011)CrossRef
33.
Zurück zum Zitat Minami, S., Sawada, K., Chikenji, G.: MICAN: a protein structure alignment algorithm that can handle multiple-chains, inverse alignments, Ca only models, alternative alignments, and non-sequential alignments. BMC Bioinform. 14(24), 1–22 (2013)CrossRef Minami, S., Sawada, K., Chikenji, G.: MICAN: a protein structure alignment algorithm that can handle multiple-chains, inverse alignments, Ca only models, alternative alignments, and non-sequential alignments. BMC Bioinform. 14(24), 1–22 (2013)CrossRef
34.
Zurück zum Zitat Momot, A., Małysiak-Mrozek, B., Kozielski, S., Mrozek, D., Hera, Ł., Górczyńska-Kosiorz, S., Momot, M.: Improving Performance of Protein Structure Similarity Searching by Distributing Computations in Hierarchical Multi-Agent System. Lecture Notes in Computer Science, vol. 6421, pp. 320–329. Springer, Berlin (2010)CrossRef Momot, A., Małysiak-Mrozek, B., Kozielski, S., Mrozek, D., Hera, Ł., Górczyńska-Kosiorz, S., Momot, M.: Improving Performance of Protein Structure Similarity Searching by Distributing Computations in Hierarchical Multi-Agent System. Lecture Notes in Computer Science, vol. 6421, pp. 320–329. Springer, Berlin (2010)CrossRef
36.
Zurück zum Zitat Mrozek, D., Małysiak-Mrozek, B.: CASSERT: a two-phase alignment algorithm for matching 3D structures of proteins. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) Computer Networks. Communications in Computer and Information Science, vol. 370, pp. 334–343. Springer International Publishing, New York (2013)CrossRef Mrozek, D., Małysiak-Mrozek, B.: CASSERT: a two-phase alignment algorithm for matching 3D structures of proteins. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) Computer Networks. Communications in Computer and Information Science, vol. 370, pp. 334–343. Springer International Publishing, New York (2013)CrossRef
37.
Zurück zum Zitat Mrozek,D., Brozek,M.,Małysiak-Mrozek, B.: Parallel implementation of 3D protein structure similarity searches using a GPU and the CUDA. J. Mol. Model. 20, 2067 (2014) Mrozek,D., Brozek,M.,Małysiak-Mrozek, B.: Parallel implementation of 3D protein structure similarity searches using a GPU and the CUDA. J. Mol. Model. 20, 2067 (2014)
44.
Zurück zum Zitat Pawłowski, R., Małysiak-Mrozek, B., Kozielski, S., Mrozek, D.: Fast and accurate similarity searching of biopolymer sequences with GPU and CUDA. In: Xiang, Y., Cuzzocrea, A., Hobbs, M., Zhou, W. (eds.) Algorithms and Architectures for Parallel Processing. Lecture Notes in Computer Science, vol. 7016, pp. 230–243. Springer, Berlin Heidelberg, Berlin, Heidelberg (2011)CrossRef Pawłowski, R., Małysiak-Mrozek, B., Kozielski, S., Mrozek, D.: Fast and accurate similarity searching of biopolymer sequences with GPU and CUDA. In: Xiang, Y., Cuzzocrea, A., Hobbs, M., Zhou, W. (eds.) Algorithms and Architectures for Parallel Processing. Lecture Notes in Computer Science, vol. 7016, pp. 230–243. Springer, Berlin Heidelberg, Berlin, Heidelberg (2011)CrossRef
45.
Zurück zum Zitat Roberts, E., Stone, J.E., Sepulveda, L., Hwu, W.M.W., Luthey-Schulten, Z.: Long time-scale simulations of in vivo diffusion using GPU hardware. In: 2009 IEEE International Symposium on Parallel Distributed Processing, pp. 1–8 (2009) Roberts, E., Stone, J.E., Sepulveda, L., Hwu, W.M.W., Luthey-Schulten, Z.: Long time-scale simulations of in vivo diffusion using GPU hardware. In: 2009 IEEE International Symposium on Parallel Distributed Processing, pp. 1–8 (2009)
49.
Zurück zum Zitat Schmidt, B.: Bioinformatics: High Performance Parallel Computer Architectures (Embedded Multi-Core Systems), 1st edn. CRC Press, Boca Raton (2010) Schmidt, B.: Bioinformatics: High Performance Parallel Computer Architectures (Embedded Multi-Core Systems), 1st edn. CRC Press, Boca Raton (2010)
50.
Zurück zum Zitat Shapiro, J., Brutlag, D.: FoldMiner and LOCK2: protein structure comparison and motif discovery on the Web. Nucleic Acids Res. 32, 536–41 (2004)CrossRef Shapiro, J., Brutlag, D.: FoldMiner and LOCK2: protein structure comparison and motif discovery on the Web. Nucleic Acids Res. 32, 536–41 (2004)CrossRef
51.
Zurück zum Zitat Shindyalov, I., Bourne, P.: Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Eng. 11(9), 739–747 (1998)CrossRef Shindyalov, I., Bourne, P.: Protein structure alignment by incremental combinatorial extension (CE) of the optimal path. Protein Eng. 11(9), 739–747 (1998)CrossRef
52.
Zurück zum Zitat Stanek, D., Mrozek, D., Małysiak-Mrozek, B.: MViewer: Visualization of protein molecular structures stored in the PDB, mmCIF and PDBML data formats. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) Computer Networks. Communications in Computer and Information Science, vol. 370, pp. 323–333. Springer, Berlin (2013)CrossRef Stanek, D., Mrozek, D., Małysiak-Mrozek, B.: MViewer: Visualization of protein molecular structures stored in the PDB, mmCIF and PDBML data formats. In: Kwiecień, A., Gaj, P., Stera, P. (eds.) Computer Networks. Communications in Computer and Information Science, vol. 370, pp. 323–333. Springer, Berlin (2013)CrossRef
54.
Zurück zum Zitat Striemer, G.M., Akoglu, A.: Sequence alignment with GPU: performance and design challenges. In: 2009 IEEE International Symposium on Parallel Distributed Processing, pp. 1–10 (2009) Striemer, G.M., Akoglu, A.: Sequence alignment with GPU: performance and design challenges. In: 2009 IEEE International Symposium on Parallel Distributed Processing, pp. 1–10 (2009)
56.
Zurück zum Zitat Ye, Y., Godzik, A.: Flexible structure alignment by chaining aligned fragment pairs allowing twists. Bioinformatics 19(2), 246–255 (2003)CrossRef Ye, Y., Godzik, A.: Flexible structure alignment by chaining aligned fragment pairs allowing twists. Bioinformatics 19(2), 246–255 (2003)CrossRef
61.
Zurück zum Zitat Zhu, J., Weng, Z.: FAST: A novel protein structure alignment algorithm. Proteins 58, 618–627 (2005)CrossRef Zhu, J., Weng, Z.: FAST: A novel protein structure alignment algorithm. Proteins 58, 618–627 (2005)CrossRef
Metadaten
Titel
Massively Parallel Searching of 3D Protein Structure Similarities on CUDA-Enabled GPU Devices
verfasst von
Dariusz Mrozek
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-98839-9_10