Skip to main content
Erschienen in: Social Network Analysis and Mining 1/2016

01.12.2016 | Original Article

A scalable geometric algorithm for community detection from social networks with incremental update

verfasst von: Subu Surendran, D. Chithraprasad, M. Ramachandra Kaimal

Erschienen in: Social Network Analysis and Mining | Ausgabe 1/2016

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In recent years, a series of algorithms have been proposed to detect community from social networks. Most of the algorithms are based on traditional spectral clustering algorithms such as k-means. One of the major limitations of such algorithms is that entire eigenvalues of the similarity matrix of the network need to be calculated in advance. In the case of a massive network, calculating entire eigenvalues is computationally expensive. This paper proposes a scalable geometric algorithm to find communities from large social networks. The major contributions of this work are: (1) We transform the network data into points by preserving the intrinsic properties and structure of the original data. (2) A novel geometric clustering is derived. And we use the data structure C-Tree and Voronoi diagram for identifying communities from the points in the Euclidean plane. (3) Since social networks grow dynamically, we further extend the algorithm to incrementally identify the community membership of newly introduced members. Experiments on both synthetic and real-world datasets show that the algorithm, in terms of objective matrices, is equally good as spectral clustering algorithm.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Stat Mech Theory Exp 10:P10008CrossRef Blondel VD, Guillaume J-L, Lambiotte R, Lefebvre E (2008) Fast unfolding of communities in large networks. J Stat Mech Theory Exp 10:P10008CrossRef
Zurück zum Zitat Brand M, Huang K (2003) A unifying theorem for spectral embedding and clustering. In: AISTATS Brand M, Huang K (2003) A unifying theorem for spectral embedding and clustering. In: AISTATS
Zurück zum Zitat Chen M, Kuzmin K, Szymanski BK (2014) Community detection via maximization of modularity and its variants. IEEE Trans Comput Soc Syst 1(1):46–65CrossRef Chen M, Kuzmin K, Szymanski BK (2014) Community detection via maximization of modularity and its variants. IEEE Trans Comput Soc Syst 1(1):46–65CrossRef
Zurück zum Zitat Chung FRK (1997) Spectral graph theory, vol 92. American Mathematical Society, ProvidenceMATH Chung FRK (1997) Spectral graph theory, vol 92. American Mathematical Society, ProvidenceMATH
Zurück zum Zitat Condon A, Karp RM (2001) Algorithms for graph partitioning on the planted partition model. Random Struct Algorithms 18:116–140 CiteseerMathSciNetCrossRefMATH Condon A, Karp RM (2001) Algorithms for graph partitioning on the planted partition model. Random Struct Algorithms 18:116–140 CiteseerMathSciNetCrossRefMATH
Zurück zum Zitat Dhanjal C, Gaudel R, Clémençon S (2014) Efficient eigen-updating for spectral graph clustering. Neurocomputing 131:440–452 ElsevierCrossRef Dhanjal C, Gaudel R, Clémençon S (2014) Efficient eigen-updating for spectral graph clustering. Neurocomputing 131:440–452 ElsevierCrossRef
Zurück zum Zitat Dhillon IS, Guan Y, Kulis B (2007) Weighted graph cuts without eigenvectors a multilevel approach. IEEE Trans Pattern Anal Mach Intell 29:1944–1957CrossRef Dhillon IS, Guan Y, Kulis B (2007) Weighted graph cuts without eigenvectors a multilevel approach. IEEE Trans Pattern Anal Mach Intell 29:1944–1957CrossRef
Zurück zum Zitat Duchon J (1977) Splines minimizing rotation-invariant semi-norms in Sobolev spaces. In: Schempp W, Zeller K(eds) Constructive theory of functions of several variables. Springer, Berlin Heidelberg, pp 85–100 Duchon J (1977) Splines minimizing rotation-invariant semi-norms in Sobolev spaces. In: Schempp W, Zeller K(eds) Constructive theory of functions of several variables. Springer, Berlin Heidelberg, pp 85–100
Zurück zum Zitat Ganti V, Ramakrishnan R, Gehrke J, Powell A, French J (1999) Clustering large datasets in arbitrary metric spaces. In: Proceedings of the 15th IEEE international conference on data engineering, 1999, pp 502–511 Ganti V, Ramakrishnan R, Gehrke J, Powell A, French J (1999) Clustering large datasets in arbitrary metric spaces. In: Proceedings of the 15th IEEE international conference on data engineering, 1999, pp 502–511
Zurück zum Zitat Hartigan JA, Wong MA (1979) Algorithm as 136: A k-means clustering algorithm. J R Stat Soc. Series C (Applied Statistics) 28(1):100–108MATH Hartigan JA, Wong MA (1979) Algorithm as 136: A k-means clustering algorithm. J R Stat Soc. Series C (Applied Statistics) 28(1):100–108MATH
Zurück zum Zitat Karypis G, Kumar V (1998) A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM J Sci Comput 20(1):359–392MathSciNetCrossRefMATH Karypis G, Kumar V (1998) A fast and high quality multilevel scheme for partitioning irregular graphs. SIAM J Sci Comput 20(1):359–392MathSciNetCrossRefMATH
Zurück zum Zitat Karypis G, Kumar V (1999) Parallel multilevel series k-way partitioning scheme for irregular graphs. SIAM Rev 41(2):278–300MathSciNetCrossRefMATH Karypis G, Kumar V (1999) Parallel multilevel series k-way partitioning scheme for irregular graphs. SIAM Rev 41(2):278–300MathSciNetCrossRefMATH
Zurück zum Zitat Karypis G, Vipin K (1997) A coarse-grain parallel formulation of multilevel k-way graph partitioning algorithm. In: PPSC Karypis G, Vipin K (1997) A coarse-grain parallel formulation of multilevel k-way graph partitioning algorithm. In: PPSC
Zurück zum Zitat Kernighan BW, Lin S (1970) An efficient heuristic procedure for partitioning graphs. Bell Syst Tech J 49(2):291–307CrossRefMATH Kernighan BW, Lin S (1970) An efficient heuristic procedure for partitioning graphs. Bell Syst Tech J 49(2):291–307CrossRefMATH
Zurück zum Zitat Lancichinetti Andrea, Fortunato Santo, Radicchi Filippo (2008) Benchmark graphs for testing community detection algorithms. Phys Rev E 78(4):46–110CrossRef Lancichinetti Andrea, Fortunato Santo, Radicchi Filippo (2008) Benchmark graphs for testing community detection algorithms. Phys Rev E 78(4):46–110CrossRef
Zurück zum Zitat Lanczos C (1950) An iteration method for the solution of the eigen- value problem of linear differential and integral operators. United States Government. Press Office Lanczos C (1950) An iteration method for the solution of the eigen- value problem of linear differential and integral operators. United States Government. Press Office
Zurück zum Zitat Medus A, Acuna G, Dorso CO (2005) Detection of community structures in networks via global optimization. Phys A Stat Mech Appl 358(2):593–604 ElsevierCrossRef Medus A, Acuna G, Dorso CO (2005) Detection of community structures in networks via global optimization. Phys A Stat Mech Appl 358(2):593–604 ElsevierCrossRef
Zurück zum Zitat Min W, Ke L, He X (2004) Locality pursuit embedding. Pattern Recognit 37(4):781–788CrossRefMATH Min W, Ke L, He X (2004) Locality pursuit embedding. Pattern Recognit 37(4):781–788CrossRefMATH
Zurück zum Zitat Newman MEJ (2004) Fast algorithm for detecting community structure in networks. Phys Rev E 69(6):066–133CrossRef Newman MEJ (2004) Fast algorithm for detecting community structure in networks. Phys Rev E 69(6):066–133CrossRef
Zurück zum Zitat Newman MEJ (2006) Modularity and community structure in networks. Proc Natl Acad Sci 103(23):8577–8582CrossRef Newman MEJ (2006) Modularity and community structure in networks. Proc Natl Acad Sci 103(23):8577–8582CrossRef
Zurück zum Zitat Newman MEJ (2013) Spectral methods for community detection and graph partitioning. Phys Rev E 88(4):042822CrossRef Newman MEJ (2013) Spectral methods for community detection and graph partitioning. Phys Rev E 88(4):042822CrossRef
Zurück zum Zitat Ng AY, Jordan MI, Weiss Y et al (2002) On spectral clustering: Analysis and an algorithm. Adv Neural Inf Process Syst 2:849–856 Ng AY, Jordan MI, Weiss Y et al (2002) On spectral clustering: Analysis and an algorithm. Adv Neural Inf Process Syst 2:849–856
Zurück zum Zitat Nguyen NP, Dinh TN, Xuan Y, Thai MT (2011) Adaptive algorithms for detecting community structure in dynamic social networks. In: Proceedings of the IEEE INFOCOM, pp 2282–2290 Nguyen NP, Dinh TN, Xuan Y, Thai MT (2011) Adaptive algorithms for detecting community structure in dynamic social networks. In: Proceedings of the IEEE INFOCOM, pp 2282–2290
Zurück zum Zitat Nguyen NP, Dinh TN, Shen Y, Thai MT (2014) Dynamic social community detection and its applications. PloS One 9(4):e91431CrossRef Nguyen NP, Dinh TN, Shen Y, Thai MT (2014) Dynamic social community detection and its applications. PloS One 9(4):e91431CrossRef
Zurück zum Zitat Ning H, Xu W, Chi Y, Gong Y, Huang TS (2007) Incremental spectral clustering with application to monitoring of evolving blog communities. In: SIAM, pp 261–272 Ning H, Xu W, Chi Y, Gong Y, Huang TS (2007) Incremental spectral clustering with application to monitoring of evolving blog communities. In: SIAM, pp 261–272
Zurück zum Zitat Preparata FP, Shamos M (2012) Computational geometry: an introduction. Springer, New YorkMATH Preparata FP, Shamos M (2012) Computational geometry: an introduction. Springer, New YorkMATH
Zurück zum Zitat Rattigan MJ, Maier M, Jensen D (2007) Graph clustering with network structure indices. In: Proceedings of the 24th international conference on machine learning, ACM, pp 783–790 Rattigan MJ, Maier M, Jensen D (2007) Graph clustering with network structure indices. In: Proceedings of the 24th international conference on machine learning, ACM, pp 783–790
Zurück zum Zitat Sales-Pardo M, Guimera R, Moreira AA, Nunes Amaral LA (2007) Extracting the hierarchical organization of complex systems. Proc Natl Acad Sci 104(39):15224–15229CrossRef Sales-Pardo M, Guimera R, Moreira AA, Nunes Amaral LA (2007) Extracting the hierarchical organization of complex systems. Proc Natl Acad Sci 104(39):15224–15229CrossRef
Zurück zum Zitat Shen H-W, Cheng X-Q, Wang Y-Z, Chen Y (2012) A dimensionality reduction framework for detection of multiscale structure in heterogeneous networks. J Comput Sci Technol 27(2):341–357CrossRefMATH Shen H-W, Cheng X-Q, Wang Y-Z, Chen Y (2012) A dimensionality reduction framework for detection of multiscale structure in heterogeneous networks. J Comput Sci Technol 27(2):341–357CrossRefMATH
Zurück zum Zitat Surendran S, Chitraprasad D, Kaimal MR (2014) Voronoi diagram-based geometric approach for social network analysis. In: Computational intelligence, cyber security and computational models, Springer, pp 359–369 Surendran S, Chitraprasad D, Kaimal MR (2014) Voronoi diagram-based geometric approach for social network analysis. In: Computational intelligence, cyber security and computational models, Springer, pp 359–369
Zurück zum Zitat van der Maaten Laurens JP, Postma Eric O, van den Herik H Jaap (2009) Dimensionality reduction: a comparative review. J Mach Learn Res 10(1–41):66–71 van der Maaten Laurens JP, Postma Eric O, van den Herik H Jaap (2009) Dimensionality reduction: a comparative review. J Mach Learn Res 10(1–41):66–71
Zurück zum Zitat Whang JJ, Sui X, Dhillon IS (2012) Scalable and memory efficient clustering of large-scale social networks. In: IEEE 12th international conference on data mining (ICDM), pp 705–714 Whang JJ, Sui X, Dhillon IS (2012) Scalable and memory efficient clustering of large-scale social networks. In: IEEE 12th international conference on data mining (ICDM), pp 705–714
Zurück zum Zitat Xiang Shiming, Nie Feiping, Song Yangqiu, Zhang Changshui, Zhang Chunxia (2009) Embedding new data points for manifold learning via coordinate propagation. Knowl Inf Syst 19(2):159–184CrossRef Xiang Shiming, Nie Feiping, Song Yangqiu, Zhang Changshui, Zhang Chunxia (2009) Embedding new data points for manifold learning via coordinate propagation. Knowl Inf Syst 19(2):159–184CrossRef
Zurück zum Zitat Xie J, Chen M, Szymanski BK (2013) LabelrankT: incremental community detection in dynamic networks via label propagation. In: Proceedings of the workshop on dynamic networks management and mining, ACM, pp 25–32 Xie J, Chen M, Szymanski BK (2013) LabelrankT: incremental community detection in dynamic networks via label propagation. In: Proceedings of the workshop on dynamic networks management and mining, ACM, pp 25–32
Zurück zum Zitat Zachary WW (1977) An information flow model for conflict and fission in small groups. J Anthropol Res 33:452–473CrossRef Zachary WW (1977) An information flow model for conflict and fission in small groups. J Anthropol Res 33:452–473CrossRef
Metadaten
Titel
A scalable geometric algorithm for community detection from social networks with incremental update
verfasst von
Subu Surendran
D. Chithraprasad
M. Ramachandra Kaimal
Publikationsdatum
01.12.2016
Verlag
Springer Vienna
Erschienen in
Social Network Analysis and Mining / Ausgabe 1/2016
Print ISSN: 1869-5450
Elektronische ISSN: 1869-5469
DOI
https://doi.org/10.1007/s13278-016-0399-9

Weitere Artikel der Ausgabe 1/2016

Social Network Analysis and Mining 1/2016 Zur Ausgabe

Premium Partner