Skip to main content
Top

2017 | OriginalPaper | Chapter

Local Lanczos Spectral Approximation for Community Detection

Authors : Pan Shi, Kun He, David Bindel, John E. Hopcroft

Published in: Machine Learning and Knowledge Discovery in Databases

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

We propose a novel approach called the Local Lanczos Spectral Approximation (LLSA) for identifying all latent members of a local community from very few seed members. To reduce the computation complexity, we first apply a fast heat kernel diffusing to sample a comparatively small subgraph covering almost all possible community members around the seeds. Then starting from a normalized indicator vector of the seeds and by a few steps of Lanczos iteration on the sampled subgraph, a local eigenvector is gained for approximating the eigenvector of the transition matrix with the largest eigenvalue. Elements of this local eigenvector is a relaxed indicator for the affiliation probability of the corresponding nodes to the target community. We conduct extensive experiments on real-world datasets in various domains as well as synthetic datasets. Results show that the proposed method outperforms state-of-the-art local community detection algorithms. To the best of our knowledge, this is the first work to adapt the Lanczos method for local community detection, which is natural and potentially effective. Also, we did the first attempt of using heat kernel as a sampling method instead of detecting communities directly, which is proved empirically to be very efficient and effective.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Ahn, Y.Y., Bagrow, J.P., Lehmann, S.: Link communities reveal multiscale complexity in networks. Nature 466(7307), 761–764 (2010)CrossRef Ahn, Y.Y., Bagrow, J.P., Lehmann, S.: Link communities reveal multiscale complexity in networks. Nature 466(7307), 761–764 (2010)CrossRef
2.
go back to reference Andersen, R., Chung, F., Lang, K.: Local graph partitioning using pagerank vectors. In: FOCS, pp. 475–486 (2006) Andersen, R., Chung, F., Lang, K.: Local graph partitioning using pagerank vectors. In: FOCS, pp. 475–486 (2006)
3.
go back to reference Andersen, R., Lang, K.J.: Communities from seed sets. In: WWW, pp. 223–232 (2006) Andersen, R., Lang, K.J.: Communities from seed sets. In: WWW, pp. 223–232 (2006)
4.
go back to reference Andersen, R., Lang, K.J.: An algorithm for improving graph partitions. In: SODA, pp. 651–660 (2008) Andersen, R., Lang, K.J.: An algorithm for improving graph partitions. In: SODA, pp. 651–660 (2008)
5.
go back to reference Andersen, R., Peres, Y.: Finding sparse cuts locally using evolving sets. In: STOC, pp. 235–244 (2009) Andersen, R., Peres, Y.: Finding sparse cuts locally using evolving sets. In: STOC, pp. 235–244 (2009)
6.
go back to reference Barnes, E.R.: An algorithm for partitioning the nodes of a graph. SIAM J. Algebraic Discrete Methods 3(4), 303–304 (1982)MathSciNetCrossRef Barnes, E.R.: An algorithm for partitioning the nodes of a graph. SIAM J. Algebraic Discrete Methods 3(4), 303–304 (1982)MathSciNetCrossRef
7.
go back to reference Bentbib, A.H., El Guide, M., Jbilou, K., Reichel, L.: A global Lanczos method for image restoration. J. Comput. Appl. Math. 300, 233–244 (2016)MathSciNetCrossRefMATH Bentbib, A.H., El Guide, M., Jbilou, K., Reichel, L.: A global Lanczos method for image restoration. J. Comput. Appl. Math. 300, 233–244 (2016)MathSciNetCrossRefMATH
8.
go back to reference Chung, F.: The heat kernel as the pagerank of a graph. PNAS 104(50), 19735–19740 (2007)CrossRef Chung, F.: The heat kernel as the pagerank of a graph. PNAS 104(50), 19735–19740 (2007)CrossRef
9.
go back to reference Chung, F., Simpson, O.: Solving linear systems with boundary conditions using heat kernel pagerank. In: Algorithms and Models for the Web Graph (WAW), pp. 203–219 (2013) Chung, F., Simpson, O.: Solving linear systems with boundary conditions using heat kernel pagerank. In: Algorithms and Models for the Web Graph (WAW), pp. 203–219 (2013)
11.
go back to reference Chung, F.: Spectral Graph Theory, vol. 92. American Mathematical Society, Providence (1997)MATH Chung, F.: Spectral Graph Theory, vol. 92. American Mathematical Society, Providence (1997)MATH
12.
go back to reference Coscia, M., Giannotti, F., Pedreschi, D.: A classification for community discovery methods in complex networks. Stastical Anal. Data Min. 4(5), 512–546 (2011)MathSciNetCrossRef Coscia, M., Giannotti, F., Pedreschi, D.: A classification for community discovery methods in complex networks. Stastical Anal. Data Min. 4(5), 512–546 (2011)MathSciNetCrossRef
13.
go back to reference Coscia, M., Rossetti, G., Giannotti, F., Pedreschi, D.: DEMON: a local-first discovery method for overlapping communities. In: KDD, pp. 615–623 (2012) Coscia, M., Rossetti, G., Giannotti, F., Pedreschi, D.: DEMON: a local-first discovery method for overlapping communities. In: KDD, pp. 615–623 (2012)
14.
go back to reference He, K., Sun, Y., Bindel, D., Hopcroft, J., Li, Y.: Detecting overlapping communities from local spectral subspaces. In: ICDM, pp. 769–774 (2015) He, K., Sun, Y., Bindel, D., Hopcroft, J., Li, Y.: Detecting overlapping communities from local spectral subspaces. In: ICDM, pp. 769–774 (2015)
15.
go back to reference Kloster, K., Gleich, D.F.: Heat kernel based community detection. In: KDD, pp. 1386–1395 (2014) Kloster, K., Gleich, D.F.: Heat kernel based community detection. In: KDD, pp. 1386–1395 (2014)
16.
go back to reference Kloumann, I.M., Kleinberg, J.M.: Community membership identification from small seed sets. In: KDD, pp. 1366–1375 (2014) Kloumann, I.M., Kleinberg, J.M.: Community membership identification from small seed sets. In: KDD, pp. 1366–1375 (2014)
18.
go back to reference Komzsik, L.: The Lanczos Method: Evolution and Application, vol. 15. SIAM, Philadelphia (2003)CrossRefMATH Komzsik, L.: The Lanczos Method: Evolution and Application, vol. 15. SIAM, Philadelphia (2003)CrossRefMATH
19.
go back to reference Lancichinetti, A., Fortunato, S.: Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. Phys. Rev. E 80(1), 016118 (2009)CrossRef Lancichinetti, A., Fortunato, S.: Benchmarks for testing community detection algorithms on directed and weighted graphs with overlapping communities. Phys. Rev. E 80(1), 016118 (2009)CrossRef
20.
go back to reference Lancichinetti, A., Fortunato, S., Radicchi, F.: Benchmark graphs for testing community detection algorithms. Phys. Rev. E 78(4), 046110 (2008)CrossRef Lancichinetti, A., Fortunato, S., Radicchi, F.: Benchmark graphs for testing community detection algorithms. Phys. Rev. E 78(4), 046110 (2008)CrossRef
21.
go back to reference Lanczos, C.: An iteration method for the solution of the eigenvalue problem of linear differential and integral operators. J. Res. Nat. Bur. Stan. 45, 255–282 (1950)MathSciNetCrossRef Lanczos, C.: An iteration method for the solution of the eigenvalue problem of linear differential and integral operators. J. Res. Nat. Bur. Stan. 45, 255–282 (1950)MathSciNetCrossRef
22.
go back to reference Li, Y., He, K., Bindel, D., Hopcroft, J.: Uncovering the small community structure in large networks. In: WWW, pp. 658–668 (2015) Li, Y., He, K., Bindel, D., Hopcroft, J.: Uncovering the small community structure in large networks. In: WWW, pp. 658–668 (2015)
23.
go back to reference Lütkepohl, H.: Handbook of Matrices, vol. 2. Wiley, Hoboken (1997)MATH Lütkepohl, H.: Handbook of Matrices, vol. 2. Wiley, Hoboken (1997)MATH
25.
go back to reference Mahoney, M.W., Orecchia, L., Vishnoi, N.K.: A local spectral method for graphs: with applications to improving graph partitions and exploring data graphs locally. J. Mach. Learn. Res. 13(1), 2339–2365 (2012)MathSciNetMATH Mahoney, M.W., Orecchia, L., Vishnoi, N.K.: A local spectral method for graphs: with applications to improving graph partitions and exploring data graphs locally. J. Mach. Learn. Res. 13(1), 2339–2365 (2012)MathSciNetMATH
26.
go back to reference Orecchia, L., Zhu, Z.A.: Flow-based algorithms for local graph clustering. In: SODA, pp. 1267–1286 (2014) Orecchia, L., Zhu, Z.A.: Flow-based algorithms for local graph clustering. In: SODA, pp. 1267–1286 (2014)
27.
go back to reference Paige, C.: The computation of eigenvalues and eigenvectors of very large sparse matrices. Ph.D. thesis, University of London (1971) Paige, C.: The computation of eigenvalues and eigenvectors of very large sparse matrices. Ph.D. thesis, University of London (1971)
28.
go back to reference Palla, G., Derenyi, I., Farkas, I., Vicsek, T.: Uncovering the overlapping community structure of complex networks in nature and society. Nature 435(7043), 814–818 (2005)CrossRef Palla, G., Derenyi, I., Farkas, I., Vicsek, T.: Uncovering the overlapping community structure of complex networks in nature and society. Nature 435(7043), 814–818 (2005)CrossRef
29.
go back to reference Parlett, B.N., Poole Jr., W.G.: A geometric theory for the QR, LU and power iterations. SIAM J. Numer. Anal. 10(2), 389–412 (1973)MathSciNetCrossRefMATH Parlett, B.N., Poole Jr., W.G.: A geometric theory for the QR, LU and power iterations. SIAM J. Numer. Anal. 10(2), 389–412 (1973)MathSciNetCrossRefMATH
30.
go back to reference Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 22(8), 888–905 (2000)CrossRef Shi, J., Malik, J.: Normalized cuts and image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 22(8), 888–905 (2000)CrossRef
31.
go back to reference Spielman, D.A., Teng, S.: Nearly-linear time algorithms for graph partitioning, graph sparsification, and solving linear systems. In: STOC, pp. 81–90 (2004) Spielman, D.A., Teng, S.: Nearly-linear time algorithms for graph partitioning, graph sparsification, and solving linear systems. In: STOC, pp. 81–90 (2004)
33.
go back to reference Wu, G., Xu, W., Leng, H.: Inexact and incremental bilinear Lanczos components algorithms for high dimensionality reduction and image reconstruction. Pattern Recogn. 48(1), 244–263 (2015)CrossRefMATH Wu, G., Xu, W., Leng, H.: Inexact and incremental bilinear Lanczos components algorithms for high dimensionality reduction and image reconstruction. Pattern Recogn. 48(1), 244–263 (2015)CrossRefMATH
34.
go back to reference Xie, J., Kelley, S., Szymanski, B.K.: Overlapping community detection in networks: the state-of-the-art and comparative study. ACM Comput. Surv. (CSUR) 45(4), 43 (2013)CrossRefMATH Xie, J., Kelley, S., Szymanski, B.K.: Overlapping community detection in networks: the state-of-the-art and comparative study. ACM Comput. Surv. (CSUR) 45(4), 43 (2013)CrossRefMATH
35.
go back to reference Yang, J., Leskovec, J.: Defining and evaluating network communities based on ground-truth. In: ICDM, pp. 745–754 (2012) Yang, J., Leskovec, J.: Defining and evaluating network communities based on ground-truth. In: ICDM, pp. 745–754 (2012)
Metadata
Title
Local Lanczos Spectral Approximation for Community Detection
Authors
Pan Shi
Kun He
David Bindel
John E. Hopcroft
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-71249-9_39

Premium Partner