Skip to main content
Erschienen in: Social Network Analysis and Mining 1/2023

01.12.2023 | Original Article

Community detection in directed graphs using stationary distribution and hitting times methods

verfasst von: Tien Dat Dang, Duy Hieu Do, Thi Ha Duong Phan

Erschienen in: Social Network Analysis and Mining | Ausgabe 1/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Community detection has been extensively developed using various algorithms. One of the most powerful algorithms for undirected graphs is Walktrap, which determines the distance between vertices by employing random walk and evaluates clusters using modularity based on vertex degrees. Although several directions have been explored to extend this method to directed graphs, none of them have been effective. In this paper, we investigate the Walktrap algorithm (Pons and Latapy in J Graph Algorithms Appl 10:191–218, 2006) and the spectral method (Newman in Phys Rev E 88:042822, 2013) and extend them to directed graphs. We propose a novel approach in which the distance between vertices is defined using hitting time, and modularity is computed based on the stationary distribution of a random walk. These definitions are highly effective, as algorithms for hitting time and stationary distribution have been developed, allowing for good computational complexity. Our proposed method is particularly useful for directed graphs, with the well-known results for undirected graphs being special cases. Additionally, we utilize the spectral method for these problems, and we have implemented our algorithms to demonstrate their plausibility and effectiveness.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Adamic LA, Glance N (2005) The political blogosphere and the 2004 US Election. In: Proceedings of the WWW-2005 workshop on the weblogging ecosystem Adamic LA, Glance N (2005) The political blogosphere and the 2004 US Election. In: Proceedings of the WWW-2005 workshop on the weblogging ecosystem
Zurück zum Zitat Adelman JS, Moyers SC, Farine DR, Hawley DM (2015) Feeder use predicts both acquisition and transmission of a contagious pathogen in a North American songbird. Proc R Soc B 282(1815):20151429 CrossRef Adelman JS, Moyers SC, Farine DR, Hawley DM (2015) Feeder use predicts both acquisition and transmission of a contagious pathogen in a North American songbird. Proc R Soc B 282(1815):20151429 CrossRef
Zurück zum Zitat Arenas A, Duch J, Fernández A, Gómez S (2007) Size reduction of complex networks preserving modularity. New J Phys 9(6):176MathSciNetCrossRef Arenas A, Duch J, Fernández A, Gómez S (2007) Size reduction of complex networks preserving modularity. New J Phys 9(6):176MathSciNetCrossRef
Zurück zum Zitat Aldenderfer MS, Blashfleld RK (1984) Cluster Analysis. Number 07-044 in Sage University Paper Series on Quantitative Applications in the Social Sciences. Sage, Beverly Hills Aldenderfer MS, Blashfleld RK (1984) Cluster Analysis. Number 07-044 in Sage University Paper Series on Quantitative Applications in the Social Sciences. Sage, Beverly Hills
Zurück zum Zitat Bagrow J, Bollt E (2005) A local method for detecting communities. Phys Rev E 72(42):046108CrossRef Bagrow J, Bollt E (2005) A local method for detecting communities. Phys Rev E 72(42):046108CrossRef
Zurück zum Zitat Bremaud P (2020) Markov chains: Gibbs fields, Monte Carlo simulation and queues (Texts in applied mathematics), 2nd edn. Springer, BerlinCrossRef Bremaud P (2020) Markov chains: Gibbs fields, Monte Carlo simulation and queues (Texts in applied mathematics), 2nd edn. Springer, BerlinCrossRef
Zurück zum Zitat Brin S, Page L (1998) The anatomy of a large-scale hypertextual web search engine. Comput Netw ISDN Syst 30(1–7):107–117CrossRef Brin S, Page L (1998) The anatomy of a large-scale hypertextual web search engine. Comput Netw ISDN Syst 30(1–7):107–117CrossRef
Zurück zum Zitat Capoccia A, Servedioa VDP, Caldarellia G, Colaiori F (2005) Detecting communities in large networks. Phys Stat Mech Appl 352(2–4):669–676CrossRef Capoccia A, Servedioa VDP, Caldarellia G, Colaiori F (2005) Detecting communities in large networks. Phys Stat Mech Appl 352(2–4):669–676CrossRef
Zurück zum Zitat Chung F (1997) Spectral graph theory. American Mathematical Society. ISBN 978-0821803158. [1992] Chung F (1997) Spectral graph theory. American Mathematical Society. ISBN 978-0821803158. [1992]
Zurück zum Zitat Cohen MB, Kelner J, Peebles J, Peng R, Sidford A, Vladu A (2016) Faster algorithms for computing the stationary distribution, simulating random walks, and more. In: Annual IEEE symposium on foundations of computer science, pp 583–592 Cohen MB, Kelner J, Peebles J, Peng R, Sidford A, Vladu A (2016) Faster algorithms for computing the stationary distribution, simulating random walks, and more. In: Annual IEEE symposium on foundations of computer science, pp 583–592
Zurück zum Zitat Clauset A (2005) Finding local community structure in networks. Phys Rev E 72:026132CrossRef Clauset A (2005) Finding local community structure in networks. Phys Rev E 72:026132CrossRef
Zurück zum Zitat Clauset A, Newman MEJ, Moore C (2004) Finding community structure in very large networks. Phys Rev E 70:066111CrossRef Clauset A, Newman MEJ, Moore C (2004) Finding community structure in very large networks. Phys Rev E 70:066111CrossRef
Zurück zum Zitat Dongen SV (2000) Graph Clustering by Flow Simulation. PhD thesis, University of Utrecht Dongen SV (2000) Graph Clustering by Flow Simulation. PhD thesis, University of Utrecht
Zurück zum Zitat Dugué N, Perez A (2015) Directed Louvain: maximizing modularity in directed networks. [Research Report] Université d’Orléans Dugué N, Perez A (2015) Directed Louvain: maximizing modularity in directed networks. [Research Report] Université d’Orléans
Zurück zum Zitat Enzhi L, Zhengyi L (2020) Frustrated random walks: a faster algorithm to evaluate node distances on connected and undirected graphs. Phys Rev E 102:052135MathSciNetCrossRef Enzhi L, Zhengyi L (2020) Frustrated random walks: a faster algorithm to evaluate node distances on connected and undirected graphs. Phys Rev E 102:052135MathSciNetCrossRef
Zurück zum Zitat Everitt BS, Landau S, Leese M (2001) Cluster analysis, 4th edn. Hodder Arnold, London Everitt BS, Landau S, Leese M (2001) Cluster analysis, 4th edn. Hodder Arnold, London
Zurück zum Zitat Hubert L, Arabie P (1985) Comparing partitions. J Classif 2:193–218CrossRef Hubert L, Arabie P (1985) Comparing partitions. J Classif 2:193–218CrossRef
Zurück zum Zitat Jaccard P (1912) The Distribution of the Flora in the Alpine Zone. New Phytol 11(2):37–50CrossRef Jaccard P (1912) The Distribution of the Flora in the Alpine Zone. New Phytol 11(2):37–50CrossRef
Zurück zum Zitat Lancichinetti A, Fortunato S (2011) Limits of modularity maximization in community detection. Phys Rev E 84:066122CrossRef Lancichinetti A, Fortunato S (2011) Limits of modularity maximization in community detection. Phys Rev E 84:066122CrossRef
Zurück zum Zitat Lai D, Lu H, Nardini C (2010) Finding communities in directed networks by pagerank random walk induced network embedding. Phys A 389:2443–2454CrossRef Lai D, Lu H, Nardini C (2010) Finding communities in directed networks by pagerank random walk induced network embedding. Phys A 389:2443–2454CrossRef
Zurück zum Zitat Leicht EA, Newman MEJ (2008) Community structure in directed networks. Phys Rev Lett 100:118703CrossRef Leicht EA, Newman MEJ (2008) Community structure in directed networks. Phys Rev Lett 100:118703CrossRef
Zurück zum Zitat Moore C (2017) The computer science and physics of community detection: landscapes, phase transitions, and hardness. Bull. EATCS 121 Moore C (2017) The computer science and physics of community detection: landscapes, phase transitions, and hardness. Bull. EATCS 121
Zurück zum Zitat Newman MEJ (2013) Spectral methods for network community detection and graph partitioning. Phys Rev E 88:042822CrossRef Newman MEJ (2013) Spectral methods for network community detection and graph partitioning. Phys Rev E 88:042822CrossRef
Zurück zum Zitat Newman MEJ (2006) Finding community structure in networks using the eigenvectors of matrices. Phys Rev E 74:036104MathSciNetCrossRef Newman MEJ (2006) Finding community structure in networks using the eigenvectors of matrices. Phys Rev E 74:036104MathSciNetCrossRef
Zurück zum Zitat Newman MEJ (2004) Fast algorithm for detecting community structure in networks. Phys Rev E 69:066133CrossRef Newman MEJ (2004) Fast algorithm for detecting community structure in networks. Phys Rev E 69:066133CrossRef
Zurück zum Zitat Newman MEJ, Girvan M (2004) Finding and evaluating community structure in networks. Phys Rev E 69:026–113 Newman MEJ, Girvan M (2004) Finding and evaluating community structure in networks. Phys Rev E 69:026–113
Zurück zum Zitat Page L, Brin S, Motwani R, Winograd T (1998) The pagerank citation ranking: bringing order to the web. In: WWW ’98: Proceedings of the 7th international world wide web conference, pp 161–172 Page L, Brin S, Motwani R, Winograd T (1998) The pagerank citation ranking: bringing order to the web. In: WWW ’98: Proceedings of the 7th international world wide web conference, pp 161–172
Zurück zum Zitat Pons P, Latapy M (2006) Computing communities in large networks using random walks. J Graph Algorithms Appl 10(2):191–218MathSciNetCrossRef Pons P, Latapy M (2006) Computing communities in large networks using random walks. J Graph Algorithms Appl 10(2):191–218MathSciNetCrossRef
Zurück zum Zitat Rand WM (1971) Objective criteria for the evaluation of clustering methods. J Am Stat Assoc 66:846–850CrossRef Rand WM (1971) Objective criteria for the evaluation of clustering methods. J Am Stat Assoc 66:846–850CrossRef
Zurück zum Zitat Reichardt J, Bornholdt S (2004) Detecting fuzzy community structures in complex networks with a potts model. Phys Rev Lett 93:218701CrossRef Reichardt J, Bornholdt S (2004) Detecting fuzzy community structures in complex networks with a potts model. Phys Rev Lett 93:218701CrossRef
Zurück zum Zitat Satuluri V, Parthasarathy S (2011) Symmetrizations for clustering directed graphs. In: EDBT ’11: Proceedings of the 14th international conference on extending database technology, pp 343–354 Satuluri V, Parthasarathy S (2011) Symmetrizations for clustering directed graphs. In: EDBT ’11: Proceedings of the 14th international conference on extending database technology, pp 343–354
Zurück zum Zitat Takacs C (2006) On the fundamental matrix of finite state Markov chains, its eigensystem and its relation to hitting times. Math Pannon 17(2):183–193MathSciNet Takacs C (2006) On the fundamental matrix of finite state Markov chains, its eigensystem and its relation to hitting times. Math Pannon 17(2):183–193MathSciNet
Zurück zum Zitat Tanimoto TT (1958) An elementary mathematical theory of classification and prediction. Internal IBM Technical Report Tanimoto TT (1958) An elementary mathematical theory of classification and prediction. Internal IBM Technical Report
Zurück zum Zitat Wang Z, Liang Y, Ji P (2020) Spectral algorithms for community detection in directed networks. J Mach Learn Res 21:1–45MathSciNet Wang Z, Liang Y, Ji P (2020) Spectral algorithms for community detection in directed networks. J Mach Learn Res 21:1–45MathSciNet
Zurück zum Zitat Watts DJ, Strogatz SH (1998) Nature 393:440–442. Original experimental data taken from J. G. White, E. Southgate, J. N. Thompson, and S. Brenner, Phil. Trans. R. Soc. London 314:1–340 (1986) Watts DJ, Strogatz SH (1998) Nature 393:440–442. Original experimental data taken from J. G. White, E. Southgate, J. N. Thompson, and S. Brenner, Phil. Trans. R. Soc. London 314:1–340 (1986)
Metadaten
Titel
Community detection in directed graphs using stationary distribution and hitting times methods
verfasst von
Tien Dat Dang
Duy Hieu Do
Thi Ha Duong Phan
Publikationsdatum
01.12.2023
Verlag
Springer Vienna
Erschienen in
Social Network Analysis and Mining / Ausgabe 1/2023
Print ISSN: 1869-5450
Elektronische ISSN: 1869-5469
DOI
https://doi.org/10.1007/s13278-023-01080-1

Weitere Artikel der Ausgabe 1/2023

Social Network Analysis and Mining 1/2023 Zur Ausgabe

Premium Partner