Skip to main content
Erschienen in: Artificial Intelligence Review 4/2017

18.06.2016

Weighted-spectral clustering algorithm for detecting community structures in complex networks

verfasst von: Tzy-Shiah Wang, Hui-Tang Lin, Ping Wang

Erschienen in: Artificial Intelligence Review | Ausgabe 4/2017

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The community structure is a common non-trivial topological feature of many complex real-world networks. Existing methods for identifying the community structure are generally based on statistical-type properties, such as the degree of centrality, the shortest path betweenness centrality, the modularity, and so forth. However, the form of the community structure may vary widely, even if the number of vertices and edges are fixed. Consequently, it is difficult to be certain of the exact number of clusters within the network. Clustering schemes which require the number of clusters to be specified in advance often misjudge the community structure and yield a poor clustering performance as a result. Accordingly, the present study proposes a clustering algorithm, designated as the Weighted-Spectral Clustering Algorithm, capable of detecting the community structure of a network with no prior knowledge of the cluster number. The proposed method is tested on both computer-generated networks and several real-world networks for which the community structures are already known. The results confirm the ability of the proposed algorithm to partition the network into an appropriate number of clusters in every case.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Belkin M, Niyogi P (2001) Laplacian eigenmaps and spectral techniques for embedding and clustering. NIPS 14:585–591 Belkin M, Niyogi P (2001) Laplacian eigenmaps and spectral techniques for embedding and clustering. NIPS 14:585–591
Zurück zum Zitat Biemann C (2006) Chinese whispers: an efficient graph clustering algorithm and its application to natural language processing problems. In: Proceedings of the first workshop on graph based methods for natural language processing. Association for Computational Linguistics, pp 73–80 Biemann C (2006) Chinese whispers: an efficient graph clustering algorithm and its application to natural language processing problems. In: Proceedings of the first workshop on graph based methods for natural language processing. Association for Computational Linguistics, pp 73–80
Zurück zum Zitat Chung FR (1997) Spectral graph theory, vol. 92. American Mathematical Soc Chung FR (1997) Spectral graph theory, vol. 92. American Mathematical Soc
Zurück zum Zitat Clauset A, Newman ME, Moore C (2004) Finding community structure in very large networks. Phys Rev E 70(6):066111CrossRef Clauset A, Newman ME, Moore C (2004) Finding community structure in very large networks. Phys Rev E 70(6):066111CrossRef
Zurück zum Zitat Coppersmith D, Winograd S (1987) Matrix multiplication via arithmetic progressions. In: Proceedings of the nineteenth annual ACM symposium on Theory of computing, pp 1–6 Coppersmith D, Winograd S (1987) Matrix multiplication via arithmetic progressions. In: Proceedings of the nineteenth annual ACM symposium on Theory of computing, pp 1–6
Zurück zum Zitat Fay D, Haddadi H, Thomason A, Moore AW, Mortier R, Jamakovic A, Rio M (2010) Weighted spectral distribution for internet topology analysis: theory and applications. IEEE/ACM Trans Netw 18(1):164–176. doi:10.1109/TNET.2009.2022369 CrossRef Fay D, Haddadi H, Thomason A, Moore AW, Mortier R, Jamakovic A, Rio M (2010) Weighted spectral distribution for internet topology analysis: theory and applications. IEEE/ACM Trans Netw 18(1):164–176. doi:10.​1109/​TNET.​2009.​2022369 CrossRef
Zurück zum Zitat Good BH, de Montjoye YA, Clauset A (2010) Performance of modularity maximization in practical contexts. Phys Rev E 81(4):046106MathSciNetCrossRef Good BH, de Montjoye YA, Clauset A (2010) Performance of modularity maximization in practical contexts. Phys Rev E 81(4):046106MathSciNetCrossRef
Zurück zum Zitat Hecking T, Steinert L, Gohnert T, Hoppe HU (2014) Incremental clustering of dynamic bipartite networks. In: Network intelligence conference (ENIC), 2014 European. IEEE, pp. 9–16 Hecking T, Steinert L, Gohnert T, Hoppe HU (2014) Incremental clustering of dynamic bipartite networks. In: Network intelligence conference (ENIC), 2014 European. IEEE, pp. 9–16
Zurück zum Zitat Huang Z (2010) Link prediction based on graph topology: the predictive value of generalized clustering coefficient. SSRN 1634014 Huang Z (2010) Link prediction based on graph topology: the predictive value of generalized clustering coefficient. SSRN 1634014
Zurück zum Zitat Knuth DE (1993) The Stanford GraphBase: a platform for combinatorial computing, vol 37. Addison-Wesley, ReadingMATH Knuth DE (1993) The Stanford GraphBase: a platform for combinatorial computing, vol 37. Addison-Wesley, ReadingMATH
Zurück zum Zitat LaSalle D, Karypis G (2015) Multi-threaded modularity based graph clustering using the multilevel paradigm. J Parallel Distrib Comput 76:66–80CrossRef LaSalle D, Karypis G (2015) Multi-threaded modularity based graph clustering using the multilevel paradigm. J Parallel Distrib Comput 76:66–80CrossRef
Zurück zum Zitat Leskovec J, Huttenlocher D, Kleinberg J (2010) Predicting positive and negative links in online social networks. In: Proceedings of the 19th international conference on World wide web. ACM, pp 641–650 Leskovec J, Huttenlocher D, Kleinberg J (2010) Predicting positive and negative links in online social networks. In: Proceedings of the 19th international conference on World wide web. ACM, pp 641–650
Zurück zum Zitat Leskovec J, Lang KJ, Dasgupta A, Mahoney MW (2008) Statistical properties of community structure in large social and information networks. In: Proceedings of the 17th international conference on World Wide Web. ACM, pp 695–704. doi:10.1145/1367497.1367591 Leskovec J, Lang KJ, Dasgupta A, Mahoney MW (2008) Statistical properties of community structure in large social and information networks. In: Proceedings of the 17th international conference on World Wide Web. ACM, pp 695–704. doi:10.​1145/​1367497.​1367591
Zurück zum Zitat Leskovec J, Mcauley JJ (2012) Learning to discover social circles in ego networks. In: Advances in neural information processing systems, pp 539–547 Leskovec J, Mcauley JJ (2012) Learning to discover social circles in ego networks. In: Advances in neural information processing systems, pp 539–547
Zurück zum Zitat Li Z (2012) A non-MCMC procedure for fitting dirichlet process mixture models. Doctoral dissertation. University of Saskatchewan Li Z (2012) A non-MCMC procedure for fitting dirichlet process mixture models. Doctoral dissertation. University of Saskatchewan
Zurück zum Zitat Lusseau D, Schneider K, Boisseau OJ, Haase P, Slooten E, Dawson SM (2003) The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations. Behav Ecol Sociobiol 54(4):396–405. doi:10.1007/s00265-003-0651-y CrossRef Lusseau D, Schneider K, Boisseau OJ, Haase P, Slooten E, Dawson SM (2003) The bottlenose dolphin community of Doubtful Sound features a large proportion of long-lasting associations. Behav Ecol Sociobiol 54(4):396–405. doi:10.​1007/​s00265-003-0651-y CrossRef
Zurück zum Zitat Meila M, Shi J (2001) A random walks view of spectral segmentation. In: Proceedings of the 8th international workshop on artificial intelligence and statistics Meila M, Shi J (2001) A random walks view of spectral segmentation. In: Proceedings of the 8th international workshop on artificial intelligence and statistics
Zurück zum Zitat Nascimento MC, Carvalho AC (2011) A graph clustering algorithm based on a clustering coefficient for weighted graphs. J Braz Comput Soc 17(1):19–29CrossRefMATH Nascimento MC, Carvalho AC (2011) A graph clustering algorithm based on a clustering coefficient for weighted graphs. J Braz Comput Soc 17(1):19–29CrossRefMATH
Zurück zum Zitat Nascimento MC, Pitsoulis L (2013) Community detection by modularity maximization using GRASP with path relinking. Comput Oper Res 40(12):3121–3131MathSciNetCrossRefMATH Nascimento MC, Pitsoulis L (2013) Community detection by modularity maximization using GRASP with path relinking. Comput Oper Res 40(12):3121–3131MathSciNetCrossRefMATH
Zurück zum Zitat Newman ME (2004) Fast algorithm for detecting community structure in networks. Phys Rev E 69(6):066133CrossRef Newman ME (2004) Fast algorithm for detecting community structure in networks. Phys Rev E 69(6):066133CrossRef
Zurück zum Zitat Pelleg D, Moore AW (2000) X-means: extending K-means with efficient estimation of the number of clusters. In: ICML, pp 727–734 Pelleg D, Moore AW (2000) X-means: extending K-means with efficient estimation of the number of clusters. In: ICML, pp 727–734
Zurück zum Zitat Pons P, Latapy M (2005) Computing communities in large networks using random walks. In: Computer and information sciences-ISCIS 2005. Springer, Berlin Heidelberg, pp 284–293. doi:10.1007/11569596_31 Pons P, Latapy M (2005) Computing communities in large networks using random walks. In: Computer and information sciences-ISCIS 2005. Springer, Berlin Heidelberg, pp 284–293. doi:10.​1007/​11569596_​31
Zurück zum Zitat Van Dongen SM (2001) Graph clustering by flow simulation. Ph.D. Thesis, Dutch National Research Institute for Mathematics and Computer Science, University of Utrecht, Netherlands Van Dongen SM (2001) Graph clustering by flow simulation. Ph.D. Thesis, Dutch National Research Institute for Mathematics and Computer Science, University of Utrecht, Netherlands
Zurück zum Zitat Wang J, Li M, Wang H, Pan Y (2012) Identification of essential proteins based on edge clustering coefficient. IEEE/ACM Trans Comput Biol Bioinform 9(4):1070–1080CrossRef Wang J, Li M, Wang H, Pan Y (2012) Identification of essential proteins based on edge clustering coefficient. IEEE/ACM Trans Comput Biol Bioinform 9(4):1070–1080CrossRef
Zurück zum Zitat Wasserman S (1994) Social network analysis: methods and applications, vol 8. Cambridge University Press, CambridgeCrossRef Wasserman S (1994) Social network analysis: methods and applications, vol 8. Cambridge University Press, CambridgeCrossRef
Zurück zum Zitat Wehmuth K, Gomes ATA, Ziviani A, Da Silva APC (2010) On the joint dynamics of network diameter and spectral gap under node removal. In: LAWDN-Latin-American workshop on dynamic networks Wehmuth K, Gomes ATA, Ziviani A, Da Silva APC (2010) On the joint dynamics of network diameter and spectral gap under node removal. In: LAWDN-Latin-American workshop on dynamic networks
Zurück zum Zitat Wehmuth K, Ziviani A (2011) Distributed location of the critical nodes to network robustness based on spectral analysis. Network operations and management symposium (LANOMS) (2011) 7th Latin American. IEEE. doi:10.1109/LANOMS.2011.6102259 Wehmuth K, Ziviani A (2011) Distributed location of the critical nodes to network robustness based on spectral analysis. Network operations and management symposium (LANOMS) (2011) 7th Latin American. IEEE. doi:10.​1109/​LANOMS.​2011.​6102259
Zurück zum Zitat Xiang B, Chen EH, Zhou T (2009) Finding community structure based on subgraph similarity. Complex networks. Springer, Berlin Heidelberg, pp 73–81CrossRef Xiang B, Chen EH, Zhou T (2009) Finding community structure based on subgraph similarity. Complex networks. Springer, Berlin Heidelberg, pp 73–81CrossRef
Zurück zum Zitat Zachary WW (1977) An information flow model for conflict and fission in small groups. J Anthropol Res 33:452–473 Zachary WW (1977) An information flow model for conflict and fission in small groups. J Anthropol Res 33:452–473
Metadaten
Titel
Weighted-spectral clustering algorithm for detecting community structures in complex networks
verfasst von
Tzy-Shiah Wang
Hui-Tang Lin
Ping Wang
Publikationsdatum
18.06.2016
Verlag
Springer Netherlands
Erschienen in
Artificial Intelligence Review / Ausgabe 4/2017
Print ISSN: 0269-2821
Elektronische ISSN: 1573-7462
DOI
https://doi.org/10.1007/s10462-016-9488-4

Weitere Artikel der Ausgabe 4/2017

Artificial Intelligence Review 4/2017 Zur Ausgabe