Published in: Neural Computing and Applications 5/2019

19.05.2018 | S.I. : Emerging Intelligent Algorithms for Edge-of-Things Computing

Module overlapping structure detection in PPI using an improved link similarity-based Markov clustering algorithm

Authors: L. Gu, Y. Han, C. Wang, Wei Chen, Jun Jiao, X. Yuan

Abstract

Identifying and analyzing functional modules in protein–protein interaction (PPI) networks provides insight into the organization and function of biological systems. Functional modules in PPI networks share many overlapping structures, which indicates that some proteins play indispensable roles in several different biological processes. Markov clustering (MCL) is a popular algorithm for clustering networks in bioinformatics. In this paper, to identify the overlapping structures among functional modules and to find more biologically significant modules in PPI networks, we propose a Markov clustering algorithm based on link similarity (MLS). First, the weighted link similarity is calculated, yielding a link similarity matrix that measures the association strength between protein interactions. Then, the link similarity matrix is partitioned by applying Markov clustering, and the clustering results are mapped back to the original network to analyze the protein modules. The method was evaluated on three datasets: DIP, Gavin and Krogan. Our results show that MLS not only identifies functional modules accurately but also outperforms the original MCL algorithm, improving the F-measure by 5–10% compared with MCL.
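
The pipeline summarized in the abstract (link similarity matrix, Markov clustering over links, mapping link clusters back to proteins) can be illustrated with a minimal sketch. This is not the authors' implementation. The abstract does not give the weighted link similarity formula, so the sketch assumes an unweighted Jaccard similarity between links that share an endpoint; the function names, the inflation value of 2.0 and the toy graph are likewise illustrative assumptions.

```python
from collections import Counter
from itertools import combinations

import numpy as np


def link_similarity_matrix(edges):
    """Similarity between links of a simple graph (no self-loops, no duplicates).

    ASSUMPTION: unweighted Jaccard similarity of inclusive neighbourhoods,
    used as a stand-in for the paper's weighted link similarity.
    """
    # Inclusive neighbourhood: each node together with its neighbours.
    nbrs = {}
    for u, v in edges:
        nbrs.setdefault(u, {u}).add(v)
        nbrs.setdefault(v, {v}).add(u)

    m = len(edges)
    S = np.eye(m)  # a link is fully similar to itself (acts as a self-loop)
    for a, b in combinations(range(m), 2):
        shared = set(edges[a]) & set(edges[b])
        if len(shared) != 1:
            continue  # links without a common endpoint keep similarity 0
        (i,) = set(edges[a]) - shared
        (j,) = set(edges[b]) - shared
        S[a, b] = S[b, a] = len(nbrs[i] & nbrs[j]) / len(nbrs[i] | nbrs[j])
    return S


def markov_cluster(S, inflation=2.0, max_iter=100, tol=1e-9):
    """Plain MCL: alternate expansion and inflation on a column-stochastic matrix."""
    M = S / S.sum(axis=0, keepdims=True)
    for _ in range(max_iter):
        prev = M
        M = M @ M                              # expansion: two-step random walk
        M = M ** inflation                     # inflation: favour strong flows
        M = M / M.sum(axis=0, keepdims=True)   # renormalize each column
        if np.allclose(M, prev, atol=tol):
            break
    # Rows that still carry flow are attractors; their nonzero columns form a cluster.
    clusters = set()
    for row in M:
        members = np.flatnonzero(row > 1e-6)
        if members.size:
            clusters.add(frozenset(members.tolist()))
    return clusters


def links_to_modules(edges, link_clusters):
    """Map each cluster of links back to the set of nodes those links touch."""
    return [set().union(*(edges[k] for k in cluster)) for cluster in link_clusters]


# Toy network (hypothetical): two triangles that share node 2.
edges = [(0, 1), (0, 2), (1, 2), (2, 3), (2, 4), (3, 4)]
S = link_similarity_matrix(edges)
modules = links_to_modules(edges, markov_cluster(S))

# A node that appears in more than one module is an overlapping node.
counts = Counter(node for module in modules for node in module)
overlapping = {node for node, c in counts.items() if c > 1}
print(sorted(sorted(m) for m in modules), overlapping)
```

Because the clustering runs on links rather than nodes, every link lands in exactly one cluster while a node can belong to several modules through its different links. That is how a hard partition of the link similarity matrix can still yield overlapping protein modules.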

Metadata
Title
Module overlapping structure detection in PPI using an improved link similarity-based Markov clustering algorithm
Authors
L. Gu
Y. Han
C. Wang
Wei Chen
Jun Jiao
X. Yuan
Publication date
19.05.2018
Publisher
Springer London
Published in
Neural Computing and Applications / Issue 5/2019
Print ISSN: 0941-0643
Electronic ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-018-3508-z
