Skip to main content
Erschienen in: The Journal of Supercomputing 10/2020

07.02.2020

A novel parallel Markov clustering method in biological interaction network analysis under multi-GPU computing environment

verfasst von: You Fu, Wei Zhou

Erschienen in: The Journal of Supercomputing | Ausgabe 10/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The various clustering methods are widely applied in analyzing biological interaction networks, such as the protein–protein interaction and the genetic interaction networks. With the rapid growth of these biological datasets in scale, much longer runtime is required to make cluster analyses on them. In this paper, we propose a novel parallel Markov clustering (MCL) method based on the ELLPACK-R sparse matrix format that can run on multiple graphic processing units (GPUs) equipped standalone computers. The method is implemented using the Compute Unified Device Architecture (CUDA) programming framework, and fine-grained warp-level optimization is introduced for improving the performance. The BioGRID, a large-scale and freely accessible database of protein and genetic interactions, is adopted as the dataset in the experiment. The method has been assessed on a desktop computer equipped with two NVIDIA GTX 1070 GPUs. The results show that the proposed multi-GPU method can conduct MCL clustering on the full-size BioGRID database with about 6.5 min, that is much faster than the CPU serial MCL implementation which needs almost an hour and a half execution time.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Van Dongen SM (2000) Graph clustering by flow simulation. University of Utrecht, Utrecht Van Dongen SM (2000) Graph clustering by flow simulation. University of Utrecht, Utrecht
2.
Zurück zum Zitat Brohee S, Van Helden J (2006) Evaluation of clustering algorithms for protein–protein interaction networks. BMC Bioinform 7(1):488CrossRef Brohee S, Van Helden J (2006) Evaluation of clustering algorithms for protein–protein interaction networks. BMC Bioinform 7(1):488CrossRef
3.
Zurück zum Zitat Sharan R, Ulitsky I, Shamir R (2007) Network-based prediction of protein function. Mol Syst Biol 3(1):88CrossRef Sharan R, Ulitsky I, Shamir R (2007) Network-based prediction of protein function. Mol Syst Biol 3(1):88CrossRef
4.
Zurück zum Zitat Vlasblom J, Wodak SJ (2009) Markov clustering versus affinity propagation for the partitioning of protein interaction graphs. BMC Bioinform 10(1):99CrossRef Vlasblom J, Wodak SJ (2009) Markov clustering versus affinity propagation for the partitioning of protein interaction graphs. BMC Bioinform 10(1):99CrossRef
5.
Zurück zum Zitat Keckler SW, Dally WJ, Khailany B et al (2011) GPUs and the future of parallel computing. IEEE Micro 31(5):7–17CrossRef Keckler SW, Dally WJ, Khailany B et al (2011) GPUs and the future of parallel computing. IEEE Micro 31(5):7–17CrossRef
8.
Zurück zum Zitat Zhou W, Cai Z, Lian B et al (2017) Protein database search of hybrid alignment algorithm based on GPU parallel acceleration. J Supercomput 73(10):4517–4534CrossRef Zhou W, Cai Z, Lian B et al (2017) Protein database search of hybrid alignment algorithm based on GPU parallel acceleration. J Supercomput 73(10):4517–4534CrossRef
9.
Zurück zum Zitat Zhou W, Cai Z, Lian B et al (2018) A multi-GPU protein database search model with hybrid alignment manner on distributed GPU clusters. Concurr Comput Pract Exp 30(18):e4522CrossRef Zhou W, Cai Z, Lian B et al (2018) A multi-GPU protein database search model with hybrid alignment manner on distributed GPU clusters. Concurr Comput Pract Exp 30(18):e4522CrossRef
10.
Zurück zum Zitat Rani S, Gupta OP (2017) CLUS\_GPU-BLASTP: accelerated protein sequence alignment using GPU-enabled cluster. J Supercomput 73(10):4580–4595CrossRef Rani S, Gupta OP (2017) CLUS\_GPU-BLASTP: accelerated protein sequence alignment using GPU-enabled cluster. J Supercomput 73(10):4580–4595CrossRef
11.
Zurück zum Zitat Anderson JA, Lorenz CD, Travesset A (2008) General purpose molecular dynamics simulations fully implemented on graphics processing units. J Comput Phys 227(10):5342–5359CrossRef Anderson JA, Lorenz CD, Travesset A (2008) General purpose molecular dynamics simulations fully implemented on graphics processing units. J Comput Phys 227(10):5342–5359CrossRef
12.
Zurück zum Zitat Bustamam A, Burrage K, Hamilton NA (2012) Fast parallel Markov clustering in bioinformatics using massively parallel computing on GPU with CUDA and ELLPACK-R sparse format. IEEE/ACM Trans Comput Biol Bioinform (TCBB) 9(3):679–692CrossRef Bustamam A, Burrage K, Hamilton NA (2012) Fast parallel Markov clustering in bioinformatics using massively parallel computing on GPU with CUDA and ELLPACK-R sparse format. IEEE/ACM Trans Comput Biol Bioinform (TCBB) 9(3):679–692CrossRef
14.
Zurück zum Zitat Escobar JJ, Ortega J, Díaz AF et al (2019) Time-energy analysis of multilevel parallelism in heterogeneous clusters: the case of EEG classification in BCI tasks. J Supercomput 75(7):3397–3425CrossRef Escobar JJ, Ortega J, Díaz AF et al (2019) Time-energy analysis of multilevel parallelism in heterogeneous clusters: the case of EEG classification in BCI tasks. J Supercomput 75(7):3397–3425CrossRef
15.
Zurück zum Zitat Cheng J, Grossman M, McKercher T (2014) Professional Cuda C programming. Wiley, Indianapolis Cheng J, Grossman M, McKercher T (2014) Professional Cuda C programming. Wiley, Indianapolis
16.
Zurück zum Zitat Saad Y (2003) Iterative methods for sparse linear systems. SIAM, PhiladelphiaCrossRef Saad Y (2003) Iterative methods for sparse linear systems. SIAM, PhiladelphiaCrossRef
17.
Zurück zum Zitat Vazquez F, Ortega G, Fernández JJ et al (2010) Improving the performance of the sparse matrix vector product with GPUs. In: 2010 10th IEEE International Conference on Computer and Information Technology. IEEE, pp 1146–1151 Vazquez F, Ortega G, Fernández JJ et al (2010) Improving the performance of the sparse matrix vector product with GPUs. In: 2010 10th IEEE International Conference on Computer and Information Technology. IEEE, pp 1146–1151
19.
Zurück zum Zitat Stark C, Breitkreutz BJ, Reguly T et al (2006) BioGRID: a general repository for interaction datasets. Nucleic Acids Res 34(suppl-1):D535–D539CrossRef Stark C, Breitkreutz BJ, Reguly T et al (2006) BioGRID: a general repository for interaction datasets. Nucleic Acids Res 34(suppl-1):D535–D539CrossRef
20.
Zurück zum Zitat He L, Lu L, Wang Q (2017) An optimal parallel implementation of Markov Clustering based on the coordination of CPU and GPU. J Intell Fuzzy Syst 32(5):3609–3617CrossRef He L, Lu L, Wang Q (2017) An optimal parallel implementation of Markov Clustering based on the coordination of CPU and GPU. J Intell Fuzzy Syst 32(5):3609–3617CrossRef
21.
Zurück zum Zitat Azad A, Pavlopoulos GA, Ouzounis CA et al (2018) HipMCL: a high-performance parallel implementation of the Markov clustering algorithm for large-scale networks. Nucleic Acids Res 46(6):e33–e33CrossRef Azad A, Pavlopoulos GA, Ouzounis CA et al (2018) HipMCL: a high-performance parallel implementation of the Markov clustering algorithm for large-scale networks. Nucleic Acids Res 46(6):e33–e33CrossRef
22.
Zurück zum Zitat Liu Y, Schmidt B (2018) Lightspmv: faster cuda-compatible sparse matrix-vector multiplication using compressed sparse rows. J Signal Process Syst 90(1):69–86CrossRef Liu Y, Schmidt B (2018) Lightspmv: faster cuda-compatible sparse matrix-vector multiplication using compressed sparse rows. J Signal Process Syst 90(1):69–86CrossRef
Metadaten
Titel
A novel parallel Markov clustering method in biological interaction network analysis under multi-GPU computing environment
verfasst von
You Fu
Wei Zhou
Publikationsdatum
07.02.2020
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 10/2020
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-020-03193-2

Weitere Artikel der Ausgabe 10/2020

The Journal of Supercomputing 10/2020 Zur Ausgabe