skip to main content
10.1145/3487351.3488350acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
short-paper
Public Access

Constant community identification in million scale networks using image thresholding algorithms

Published:19 January 2022Publication History

ABSTRACT

Constant communities, i.e., groups of vertices that are always clustered together, independent of the community detection algorithm used, are necessary for reducing the inherent stochasticity of community detection results. Current methods for identifying constant communities require multiple runs of community detection algorithm(s). This process is extremely time consuming and not scalable to large networks. We propose a novel approach for finding the constant communities, by transforming the problem to a binary classification of edges. We apply the Otsu method from image thresholding to classify edges based on whether they are always within a community or not. Our algorithm does not require any explicit detection of communities and can thus scale to very large networks of the order of millions of vertices. Our results on real-world graphs show that our method is significantly faster and the constant communities produced have higher accuracy (as per F1 and NMI scores) than state-of-the-art baseline methods.

References

  1. T. Chakraborty, S. Srinivasan, N. Ganguly, S. Bhowmick, and A. Mukherjee, "Constant communities in complex networks," Scientific Reports, vol. 3, no. 1, 2013.Google ScholarGoogle Scholar
  2. N. Otsu, IEEE Transactions on Systems, Man and Cybernetics, no. 1.Google ScholarGoogle ScholarCross RefCross Ref
  3. D.-Y. Huang and C.-H. Wang, "Optimal multi-level thresholding using a two-stage otsu optimization approach," Pattern Recognition Letters, vol. 30, pp. 275--284, February 2009.Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. V. D. Blondel, J.-L. Guillaume, R. Lambiotte, and E. Lefebvre, "Fast unfolding of communities in large networks," J. of Stat. Mech., no. 10, p. P10008, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  5. M. Rosvall and C. T. Bergstrom, "Maps of random walks on complex networks reveal community structure," PNAS, vol. 105, no. 4, pp. 1118--1123, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  6. U. N. Raghavan, R. Albert, and S. Kumara, "Near linear time algorithm to detect community structures in large-scale networks," Phys. Rev. E, vol. 76, no. 3, Sep 2007.Google ScholarGoogle ScholarCross RefCross Ref
  7. S. P. Kolodziej, M. Aznaveh, M. Bullock, J. David, T. A. Davis, M. Henderson, Y. Hu, and R. Sandstrom, "The suitesparse matrix collection website interface," Journal of Open Source Software, vol. 4, no. 35, p. 1244, 2019.Google ScholarGoogle ScholarCross RefCross Ref
  8. R. Guimerà, L. Danon, A. Díaz-Guilera, F. Giralt, and A. Arenas, "Self-similar community structure in a network of human interactions," Phys. Rev. E, vol. 68, no. 6, Dec 2003.Google ScholarGoogle ScholarCross RefCross Ref
  9. T. Chakraborty, S. Srinivasan, N. Ganguly, A. Mukherjee, and S. Bhowmick, "On the permanence of vertices in network communities." in KDD. ACM, 2014, pp. 1396--1405.Google ScholarGoogle Scholar
  10. J. Leskovec and A. Krevl, "SNAP Datasets: Stanford large network dataset collection," http://snap.stanford.edu/data, Jun. 2014.Google ScholarGoogle Scholar
  11. A. Tandon, A. Albeshri, V. Thayananthan, W. Alhalabi, and S. Fortunato, "Fast consensus clustering in complex networks," Phys. Rev. E, no. 4, Apr 2019.Google ScholarGoogle ScholarCross RefCross Ref
  12. W. Weir, S. Emmons, R. Gibson, D. Taylor, and P. Mucha, "Post-processing partitions to identify domains of modularity optimization," Algorithms, vol. 10, no. 3.Google ScholarGoogle Scholar
  13. "A comprehensive literature review on community detection: Approaches and applications," Procedia Computer Science.Google ScholarGoogle Scholar
  14. M. A. Riolo and M. E. J. Newman, "Consistency of community structure in complex networks," Phys. Rev. E, vol. 101, no. 5, May 2020.Google ScholarGoogle ScholarCross RefCross Ref
  15. A. Lancichinetti and S. Fortunato, "Consensus clustering in complex networks," Sci. Rep., vol. 2, no. 1, Mar 2012.Google ScholarGoogle Scholar
  16. L. Jeub, O. Sporns, and S. Fortunato, "Multiresolution consensus clustering in networks," Scientific Reports, vol. 8, February 2018.Google ScholarGoogle Scholar
  17. V. Poulin and F. Théberge, "Ensemble clustering for graphs: comparisons and applications," Applied Network Science, vol. 4, no. 1, p. 51, 2019.Google ScholarGoogle ScholarCross RefCross Ref
  18. T. Chakraborty, N. Park, and V. S. Subrahmanian, "Ensemble-based algorithms to detect disjoint and overlapping communities in networks," pp. 73--80, Aug 2016.Google ScholarGoogle Scholar
  19. T. Chakraborty, "On the discovery of invariant groups in complex networks," Journal of Complex Networks, vol. 5, pp. 734--749, 10 2017.Google ScholarGoogle Scholar
  20. N. Nguyen, A. Alim, T. Dinh, and M. Thai, "A method to detect communities with stability in social network," Social Network Analysis and Mining, vol. 4, 08 2014.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Constant community identification in million scale networks using image thresholding algorithms
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          ASONAM '21: Proceedings of the 2021 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
          November 2021
          693 pages
          ISBN:9781450391283
          DOI:10.1145/3487351

          Copyright © 2021 ACM

          © 2021 Association for Computing Machinery. ACM acknowledges that this contribution was authored or co-authored by an employee, contractor or affiliate of the United States government. As such, the United States Government retains a nonexclusive, royalty-free right to publish or reproduce this article, or to allow others to do so, for Government purposes only.

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 19 January 2022

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • short-paper

          Acceptance Rates

          ASONAM '21 Paper Acceptance Rate22of118submissions,19%Overall Acceptance Rate116of549submissions,21%

          Upcoming Conference

          KDD '24

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader