Skip to main content

2014 | OriginalPaper | Buchkapitel

The Randomized Greedy Modularity Clustering Algorithm and the Core Groups Graph Clustering Scheme

verfasst von : Andreas Geyer-Schulz, Michael Ovelgönne

Erschienen in: German-Japanese Interchange of Data Analysis Results

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The modularity measure of Newman and Girvan is a popular formal cluster criterium for graph clustering. Although the modularity maximization problem has been shown to be NP-hard, a large number of heuristic modularity maximization algorithms have been developed. In the 10th DIMACS Implementation Challenge of the Center for Discrete Mathematics & Theoretical Computer Science (DIMACS) for graph clustering our core groups graph clustering scheme combined with a randomized greedy modularity clustering algorithm won both modularity optimization challenges: the Modularity (highest modularity) and the Pareto Challenge (tradeoff between modularity and performance). The core groups graph clustering scheme is an ensemble learning clustering method which combines the local solutions of several base algorithms to form a good start solution for the final algorithm. The randomized greedy modularity algorithm is a non-deterministic agglomerative hierarchical clustering approach which finds locally optimal solutions. In this contribution we analyze the similarity of the randomized greedy modularity algorithm with incomplete solvers for the satisfiability problem and we establish an analogy between the cluster core group heuristic used in core groups graph clustering and a sampling of restart points on the Morse graph of a continuous optimization problem with the same local optima.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Adamic LA, Glance N (2005) The political blogosphere and the 2004 U.S. election: divided they blog. In: Proceedings of the 3rd international workshop on link discovery, LinkKDD’05, Chicago. ACM, New York, pp 36–43 Adamic LA, Glance N (2005) The political blogosphere and the 2004 U.S. election: divided they blog. In: Proceedings of the 3rd international workshop on link discovery, LinkKDD’05, Chicago. ACM, New York, pp 36–43
Zurück zum Zitat Aloise D, Caporossi G, Hansen P, Liberti L, Ruiz M (2012) Modularity maximization in networks by variable neighborhood search. In: 10th DIMACS implementation challenge – graph partitioning and graph clustering. http://www.cc.gatech.edu/dimacs10/papers/[11]-VNS_DIMACS.pdf Aloise D, Caporossi G, Hansen P, Liberti L, Ruiz M (2012) Modularity maximization in networks by variable neighborhood search. In: 10th DIMACS implementation challenge – graph partitioning and graph clustering. http://​www.​cc.​gatech.​edu/​dimacs10/​papers/​[11]-VNS_DIMACS.pdf
Zurück zum Zitat Biere A, Heule M, van Maaren H, Walsh T (eds) (2009) Handbook of satisfiability. Frontiers in artificial intelligence and applications, vol 185. IOS, Amsterdam Biere A, Heule M, van Maaren H, Walsh T (eds) (2009) Handbook of satisfiability. Frontiers in artificial intelligence and applications, vol 185. IOS, Amsterdam
Zurück zum Zitat Boguñá M, Pastor-Satorras R, Díaz-Guilera A, Arenas A (2004) Models of social networks based on social distance attachment. Phys Rev E 70(5):056,122CrossRef Boguñá M, Pastor-Satorras R, Díaz-Guilera A, Arenas A (2004) Models of social networks based on social distance attachment. Phys Rev E 70(5):056,122CrossRef
Zurück zum Zitat Boldi P, Rosa M, Santini M, Vigna S (2011) Layered label propagation: a multiresolution coordinate-free ordering for compressing social networks. In: Proceedings of the 20th international conference on world wide web, Hyderabad. ACM, New York Boldi P, Rosa M, Santini M, Vigna S (2011) Layered label propagation: a multiresolution coordinate-free ordering for compressing social networks. In: Proceedings of the 20th international conference on world wide web, Hyderabad. ACM, New York
Zurück zum Zitat Brandes U, Delling D, Gaertler M, Görke R, Hoefer M, Nikoloski Z, Wagner D (2007) On finding graph clusterings with maximum modularity. In: Graph-theoretic concepts in computer science. Springer, Berlin/New York, pp 121–132 Brandes U, Delling D, Gaertler M, Görke R, Hoefer M, Nikoloski Z, Wagner D (2007) On finding graph clusterings with maximum modularity. In: Graph-theoretic concepts in computer science. Springer, Berlin/New York, pp 121–132
Zurück zum Zitat Brandes U, Delling D, Gaertler M, Görke R, Hoefer M, Nikoloski Z, Wagner D (2008) On modularity clustering. IEEE Trans Knowl Data Eng 20(2):172–188CrossRef Brandes U, Delling D, Gaertler M, Görke R, Hoefer M, Nikoloski Z, Wagner D (2008) On modularity clustering. IEEE Trans Knowl Data Eng 20(2):172–188CrossRef
Zurück zum Zitat Duch J, Arenas A (2005) Community detection in complex networks using extremal optimization. Phys Rev E 72(2):027,104CrossRef Duch J, Arenas A (2005) Community detection in complex networks using extremal optimization. Phys Rev E 72(2):027,104CrossRef
Zurück zum Zitat Geisberger R, Sanders P, Schultes D (2008) Better approximation of betweenness centrality. In: 10th workshop on algorithm engineering and experimentation, San Francisco. SIAM, pp 90–108 Geisberger R, Sanders P, Schultes D (2008) Better approximation of betweenness centrality. In: 10th workshop on algorithm engineering and experimentation, San Francisco. SIAM, pp 90–108
Zurück zum Zitat Goldberg A (1979) On the complexity of the satisfiability problem. In: 4th workshop of automatic deduction, Austin, pp 1–6 Goldberg A (1979) On the complexity of the satisfiability problem. In: 4th workshop of automatic deduction, Austin, pp 1–6
Zurück zum Zitat Guimerà R, Danon L, Díaz-Guilera A, Giralt F, Arenas A (2003) Self-similar community structure in a network of human interactions. Phys Rev E 68(6):065,103CrossRef Guimerà R, Danon L, Díaz-Guilera A, Giralt F, Arenas A (2003) Self-similar community structure in a network of human interactions. Phys Rev E 68(6):065,103CrossRef
Zurück zum Zitat Jongen H, Jonker P, Twilt F (2000) Nonlinear optimization in finite dimensions. Kluwer Academic, DordrechtMATH Jongen H, Jonker P, Twilt F (2000) Nonlinear optimization in finite dimensions. Kluwer Academic, DordrechtMATH
Zurück zum Zitat Kautz H, Sabharwal A, Selman B (2009) Incomplete algorithms. In: Biere A, Heule M, van Maaren H, Walsh T (eds) Handbook of satisfiability. Frontiers in artificial intelligence and applications, vol 185. IOS, Amsterdam, chap 6, pp 185–203 Kautz H, Sabharwal A, Selman B (2009) Incomplete algorithms. In: Biere A, Heule M, van Maaren H, Walsh T (eds) Handbook of satisfiability. Frontiers in artificial intelligence and applications, vol 185. IOS, Amsterdam, chap 6, pp 185–203
Zurück zum Zitat Newman MEJ (2004) Fast algorithm for detecting community structure in networks. Phys Rev E 69(6):066,133CrossRef Newman MEJ (2004) Fast algorithm for detecting community structure in networks. Phys Rev E 69(6):066,133CrossRef
Zurück zum Zitat Newman MEJ (2006) Finding community structure in networks using the eigenvectors of matrices. Phys Rev E 74(3):036,104CrossRef Newman MEJ (2006) Finding community structure in networks using the eigenvectors of matrices. Phys Rev E 74(3):036,104CrossRef
Zurück zum Zitat Newman MEJ, Girvan M (2004) Finding and evaluating community structure in networks. Phys Rev E 69(2):026,113CrossRef Newman MEJ, Girvan M (2004) Finding and evaluating community structure in networks. Phys Rev E 69(2):026,113CrossRef
Zurück zum Zitat Ovelgönne M, Geyer-Schulz A (2010) Cluster cores and modularity maximization. In: Fan W, Hsu W, Webb GI, Liu B, Zhang C, Gunopulos D, Wu X (eds) 10th IEEE international conference on data mining workshops, ICDMW’10, Sydney. IEEE Computer Society, Los Alamitos, pp 1204–1213CrossRef Ovelgönne M, Geyer-Schulz A (2010) Cluster cores and modularity maximization. In: Fan W, Hsu W, Webb GI, Liu B, Zhang C, Gunopulos D, Wu X (eds) 10th IEEE international conference on data mining workshops, ICDMW’10, Sydney. IEEE Computer Society, Los Alamitos, pp 1204–1213CrossRef
Zurück zum Zitat Ovelgönne M, Geyer-Schulz A (2012a) A comparison of agglomerative hierarchical algorithms for modularity clustering. In: Gaul W, Geyer-Schulz A, Schmidt-Thieme L, Kunze J (eds) Proceedings of the 34th conference of the German Classification Society, Karlsruhe. Studies in classification, data analysis, and knowledge organization. Springer, Heidelberg, pp 225–232 Ovelgönne M, Geyer-Schulz A (2012a) A comparison of agglomerative hierarchical algorithms for modularity clustering. In: Gaul W, Geyer-Schulz A, Schmidt-Thieme L, Kunze J (eds) Proceedings of the 34th conference of the German Classification Society, Karlsruhe. Studies in classification, data analysis, and knowledge organization. Springer, Heidelberg, pp 225–232
Zurück zum Zitat Ovelgönne M, Geyer-Schulz A (2012b) An ensemble-learning strategy for graph-clustering. In: Bader DA, Meyerhenke H, Sanders P, Wagner D (eds) 10th DIMACS implementation challenge – graph partitioning and graph clustering, Rutgers University. http://www.cc.gatech.edu/dimacs10/papers/[18]-dimacs10_ovelgoennegeyer%schulz.pdf Ovelgönne M, Geyer-Schulz A (2012b) An ensemble-learning strategy for graph-clustering. In: Bader DA, Meyerhenke H, Sanders P, Wagner D (eds) 10th DIMACS implementation challenge – graph partitioning and graph clustering, Rutgers University. http://​www.​cc.​gatech.​edu/​dimacs10/​papers/​[18]-dimacs10_ovelgoennegeyer%schulz.pdf
Zurück zum Zitat Ovelgönne M, Geyer-Schulz A (2013) An ensemble learning strategy for graph clustering. In: Bader DA, Meyerhenke H, Sanders P, Wagner D (eds) Graph partitioning and graph clustering. Contemporary mathematics, vol 588. American Mathematical Society, Providence, pp 187–205CrossRef Ovelgönne M, Geyer-Schulz A (2013) An ensemble learning strategy for graph clustering. In: Bader DA, Meyerhenke H, Sanders P, Wagner D (eds) Graph partitioning and graph clustering. Contemporary mathematics, vol 588. American Mathematical Society, Providence, pp 187–205CrossRef
Zurück zum Zitat Ovelgönne M, Geyer-Schulz A, Stein M (2010) Randomized greedy modularity optimization for group detection in huge social networks. In: 4th ACM SNA-KDD workshop on social network mining and analysis, Washington, DC Ovelgönne M, Geyer-Schulz A, Stein M (2010) Randomized greedy modularity optimization for group detection in huge social networks. In: 4th ACM SNA-KDD workshop on social network mining and analysis, Washington, DC
Zurück zum Zitat Sakallah KA (2009) Symmetry and satisfiability. In: Biere A, Heule M, van Maaren H, Walsh T (eds) Handbook of satisfiability. Frontiers in artificial intelligence and applications, vol 185. IOS, Amsterdam, chap 10, pp 289–338 Sakallah KA (2009) Symmetry and satisfiability. In: Biere A, Heule M, van Maaren H, Walsh T (eds) Handbook of satisfiability. Frontiers in artificial intelligence and applications, vol 185. IOS, Amsterdam, chap 10, pp 289–338
Zurück zum Zitat Schuetz P, Caflisch A (2008) Efficient modularity optimization by multistep greedy algorithm and vertex mover refinement. Phys Rev E 77:046,112CrossRef Schuetz P, Caflisch A (2008) Efficient modularity optimization by multistep greedy algorithm and vertex mover refinement. Phys Rev E 77:046,112CrossRef
Zurück zum Zitat Shang Y, Wah BW (1998) A discrete Lagrangian-based global-search method for solving satisfiability problems. J Glob Optim 12(1):61–99MathSciNetCrossRefMATH Shang Y, Wah BW (1998) A discrete Lagrangian-based global-search method for solving satisfiability problems. J Glob Optim 12(1):61–99MathSciNetCrossRefMATH
Zurück zum Zitat Watts DJ, Strogatz SH (1998) Collective dynamics of “small-world” networks. Nature 393(6684):440–442CrossRef Watts DJ, Strogatz SH (1998) Collective dynamics of “small-world” networks. Nature 393(6684):440–442CrossRef
Zurück zum Zitat Zhu Z, Wang C, Ma L, Pan Y, Ding Z (2008) Scalable community discovery of large networks. In: Proceedings of the 2008 ninth international conference on web-age information management, WAIM’08, Zhangjiajie, IEEE Computer Society, Los Alamitos, pp 381–388 Zhu Z, Wang C, Ma L, Pan Y, Ding Z (2008) Scalable community discovery of large networks. In: Proceedings of the 2008 ninth international conference on web-age information management, WAIM’08, Zhangjiajie, IEEE Computer Society, Los Alamitos, pp 381–388
Metadaten
Titel
The Randomized Greedy Modularity Clustering Algorithm and the Core Groups Graph Clustering Scheme
verfasst von
Andreas Geyer-Schulz
Michael Ovelgönne
Copyright-Jahr
2014
DOI
https://doi.org/10.1007/978-3-319-01264-3_2

Premium Partner