Skip to main content

2018 | OriginalPaper | Buchkapitel

Cohesive Sub-network Mining in Protein Interaction Networks Using Score-Based Co-clustering with MapReduce Model (MR-CoC)

verfasst von : R. Gowri, R. Rathipriya

Erschienen in: Advances in Big Data and Cloud Computing

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Nowadays, due to the data deluge situation, every computation has to be carried out in voluminous data. The sub-network mining from the complex and voluminous interaction data is one of the research challenges. The highly connected sub-networks will be more cohesive in the network. They are responsible for communication among the network, which is useful for studying their functionalities. A novel score-based co-clustering (MR-CoC) technique with MapReduce is proposed to mine the highly connected sub-network from interaction networks. The MapReduce environment is chosen to cope with complex, voluminous data and to parallelize the computation process. This approach is used to mine cliques, non-cliques, and overlapping sub-network patterns from the adjacency matrix of the network. The complexity of the proposed work is O (Es + log Ns), which is minimal than the existing approaches like MCODE and spectral clustering.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Gowri, R., Rathipriya, R.: A study on clustering the protein interaction networks using bio-inspired optimization. IJCII 3, 89–95 (2013) Gowri, R., Rathipriya, R.: A study on clustering the protein interaction networks using bio-inspired optimization. IJCII 3, 89–95 (2013)
3.
Zurück zum Zitat Ekanayake, J., Pallickara, S., Fox, G.: MapReduce for data intensive scientific analyses. In: Proceeding ESCIENCE ‘08 Proceedings of the 2008 Fourth IEEE International Conference on eScience, pp. 277–284 (2008) Ekanayake, J., Pallickara, S., Fox, G.: MapReduce for data intensive scientific analyses. In: Proceeding ESCIENCE ‘08 Proceedings of the 2008 Fourth IEEE International Conference on eScience, pp. 277–284 (2008)
4.
Zurück zum Zitat Chen, S., Schlosser, S.W.: Map-reduce meets wider varieties of applications. Intel Research Pittsburgh, Technical Report 2008, IRP-TR-08-05 Chen, S., Schlosser, S.W.: Map-reduce meets wider varieties of applications. Intel Research Pittsburgh, Technical Report 2008, IRP-TR-08-05
5.
Zurück zum Zitat Rosen, J., Polyzotis, N., Borkar, V., Bu, Y., Carey, M.J., Weimer, M., Condie, T., Ramakrishnan, R.: Iterative mapreduce for large scale machine learning Rosen, J., Polyzotis, N., Borkar, V., Bu, Y., Carey, M.J., Weimer, M., Condie, T., Ramakrishnan, R.: Iterative mapreduce for large scale machine learning
6.
Zurück zum Zitat Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. Commun ACM 51, 107–113 (2008) Dean, J., Ghemawat, S.: MapReduce: simplified data processing on large clusters. Commun ACM 51, 107–113 (2008)
7.
Zurück zum Zitat Aridhi, S., D’Orazio, L., Maddouri, M., Mephu, E.: A Novel MapReduce-Based Approach for Distributed Frequent Subgraph Mining. RFIA (2014) Aridhi, S., D’Orazio, L., Maddouri, M., Mephu, E.: A Novel MapReduce-Based Approach for Distributed Frequent Subgraph Mining. RFIA (2014)
8.
Zurück zum Zitat Hill, S., Srichandan, B., Sunderraman, R.: An iterative MapReduce approach to frequent subgraph mining in biological datasets. In: ACM-BCB’12, pp. 7–10 (2012) Hill, S., Srichandan, B., Sunderraman, R.: An iterative MapReduce approach to frequent subgraph mining in biological datasets. In: ACM-BCB’12, pp. 7–10 (2012)
9.
Zurück zum Zitat Gowri, R., Rathipriya, R.: Extraction of protein sequence motif information using PSO K-Means. J. Netw. Inf. Secur. (2014) Gowri, R., Rathipriya, R.: Extraction of protein sequence motif information using PSO K-Means. J. Netw. Inf. Secur. (2014)
10.
Zurück zum Zitat Gowri, R., Sivabalan, S., Rathipriya, R.: Biclustering using venus flytrap optimization algorithm. In: Proceedings of International Conference on Computational Intelligence in Data Mining CIDM, Advances in Intelligent Systems and Computing series, vol. 410, pp. 199–207 (2015) Gowri, R., Sivabalan, S., Rathipriya, R.: Biclustering using venus flytrap optimization algorithm. In: Proceedings of International Conference on Computational Intelligence in Data Mining CIDM, Advances in Intelligent Systems and Computing series, vol. 410, pp. 199–207 (2015)
11.
Zurück zum Zitat Gowri, R., Rathipriya, R.: Protein motif comparator using PSO k-means. Int. J. Appl. Metaheuristic Comput. (IJAMC) 7 (2016) Gowri, R., Rathipriya, R.: Protein motif comparator using PSO k-means. Int. J. Appl. Metaheuristic Comput. (IJAMC) 7 (2016)
12.
Zurück zum Zitat Bader, G.D., Hogue, C.W.V.: An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinform. 4 (2003) Bader, G.D., Hogue, C.W.V.: An automated method for finding molecular complexes in large protein interaction networks. BMC Bioinform. 4 (2003)
13.
Zurück zum Zitat Laix, L., Qinzx, L., Linx, X., Chang, L.: Scalable subgraph enumeration in MapReduce. In: Proceedings of the VLDB Endowment, vol. 8, pp. 974–985 Laix, L., Qinzx, L., Linx, X., Chang, L.: Scalable subgraph enumeration in MapReduce. In: Proceedings of the VLDB Endowment, vol. 8, pp. 974–985
14.
Zurück zum Zitat Pereira-Leal, J.B., Enright, A.J., Ouzounis, C.A.: Detection of functional modules from protein interaction networks. In: PROTEINS: Struct. Funct. Bioinform. 54, 49–57 (2004) Pereira-Leal, J.B., Enright, A.J., Ouzounis, C.A.: Detection of functional modules from protein interaction networks. In: PROTEINS: Struct. Funct. Bioinform. 54, 49–57 (2004)
15.
Zurück zum Zitat Dittrich, M.T., Klau, G.W., Rosenwald, A., Dandekar, T., Müller, T.: Identifying functional modules in protein-protein interaction networks: an integrated exact approach. ISMB 24, 223–231 (2008) Dittrich, M.T., Klau, G.W., Rosenwald, A., Dandekar, T., Müller, T.: Identifying functional modules in protein-protein interaction networks: an integrated exact approach. ISMB 24, 223–231 (2008)
16.
Zurück zum Zitat Pinkert, S., Schultz, J., Reichardt, J.: Protein interaction networks—more than mere modules. PLoS Comput. Biol. 6 (2010) Pinkert, S., Schultz, J., Reichardt, J.: Protein interaction networks—more than mere modules. PLoS Comput. Biol. 6 (2010)
17.
Zurück zum Zitat Zhang, Y., Zeng, E., Li, T., Narasimhan, G.: Weighted Consensus Clustering for Identifying Functional Modules In Protein-Protein Interaction Networks Zhang, Y., Zeng, E., Li, T., Narasimhan, G.: Weighted Consensus Clustering for Identifying Functional Modules In Protein-Protein Interaction Networks
18.
Zurück zum Zitat A Faster Algorithm for Detecting Motifs. In: 5th WABI-05, vol. 3692, pp. 165–177. Springer (2005) A Faster Algorithm for Detecting Motifs. In: 5th WABI-05, vol. 3692, pp. 165–177. Springer (2005)
19.
Zurück zum Zitat Kashtan, N., Itzkovitz, S., Milo, R., Alon, U.: Efficient sampling algorithm for estimating subgraph concentrations and detecting network motifs. Bioinformatics 20 (11), 1746–1758 (2004) Kashtan, N., Itzkovitz, S., Milo, R., Alon, U.: Efficient sampling algorithm for estimating subgraph concentrations and detecting network motifs. Bioinformatics 20 (11), 1746–1758 (2004)
20.
Zurück zum Zitat Gursoy, A., Keskin, O., Nussinov, R.: Topological properties of protein interaction networks from a structural perspective. Biochem. Soc. Trans. 1398–1403 (2008) Gursoy, A., Keskin, O., Nussinov, R.: Topological properties of protein interaction networks from a structural perspective. Biochem. Soc. Trans. 1398–1403 (2008)
21.
Zurück zum Zitat Schaeffer, S.E.: Graph clustering. Comput. Sci. Rev. 27–64 (2007) Schaeffer, S.E.: Graph clustering. Comput. Sci. Rev. 27–64 (2007)
22.
Zurück zum Zitat Diestel, R.: Graph Theory. Springer (2016) Diestel, R.: Graph Theory. Springer (2016)
23.
Zurück zum Zitat Ray, S.S.: Subgraphs, paths and connected graphs. In: Graph Theory with Algorithms and its Applications, pp. 11–24 (2013) Ray, S.S.: Subgraphs, paths and connected graphs. In: Graph Theory with Algorithms and its Applications, pp. 11–24 (2013)
24.
Zurück zum Zitat Bapat, R.B.: Graphs and Matrices. Springer, Hindustan Book Agency (2010) Bapat, R.B.: Graphs and Matrices. Springer, Hindustan Book Agency (2010)
25.
Zurück zum Zitat Ke, H., Li, P., Guo, S., Guo, M.: On traffic-aware partition and aggregation in MapReduce for Big Data applications. IEEE Trans. Parallel Distrib. Syst. (2015) Ke, H., Li, P., Guo, S., Guo, M.: On traffic-aware partition and aggregation in MapReduce for Big Data applications. IEEE Trans. Parallel Distrib. Syst. (2015)
26.
Zurück zum Zitat Szklarczyk, D., Franceschini, A., Wyder, S., Forslund, K., Heller, D., Huerta-Cepas, J., Simonovic, M., Roth, A., Santos, A., Tsafou, K.P., Kuhn, M., Bork, P., Jensen, L.J., von Mering, C.: STRING v10: protein–protein interaction networks, integrated over the tree of life. Nucl. Acids Res. 43, 447–452 (2015) Szklarczyk, D., Franceschini, A., Wyder, S., Forslund, K., Heller, D., Huerta-Cepas, J., Simonovic, M., Roth, A., Santos, A., Tsafou, K.P., Kuhn, M., Bork, P., Jensen, L.J., von Mering, C.: STRING v10: protein–protein interaction networks, integrated over the tree of life. Nucl. Acids Res. 43, 447–452 (2015)
Metadaten
Titel
Cohesive Sub-network Mining in Protein Interaction Networks Using Score-Based Co-clustering with MapReduce Model (MR-CoC)
verfasst von
R. Gowri
R. Rathipriya
Copyright-Jahr
2018
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-7200-0_20

Premium Partner