skip to main content
10.1145/2505515.2505545acmconferencesArticle/Chapter ViewAbstractPublication PagescikmConference Proceedingsconference-collections
research-article

PATRIC: a parallel algorithm for counting triangles in massive networks

Authors Info & Claims
Published:27 October 2013Publication History

ABSTRACT

Massive networks arising in numerous application areas poses significant challenges for network analysts as these networks grow to billions of nodes and are prohibitively large to fit in the main memory. Finding the number of triangles in a network is an important problem in the analysis of complex networks. Several interesting graph mining applications depend on the number of triangles in the graph. In this paper, we present an efficient MPI-based distributed memory parallel algorithm, called PATRIC, for counting triangles in massive networks. PATRIC scales well to networks with billions of nodes and can compute the exact number of triangles in a network with one billion nodes and 10 billion edges in 16 minutes. Balancing computational loads among processors for a graph problem like counting triangles is a challenging issue. We present and analyze several schemes for balancing load among processors for the triangle counting problem. These schemes achieve very good load balancing. We also show how our parallel algorithm can adapt an existing edge sparsification technique to approximate the number of triangles with very high accuracy. This modification allows us to count triangles in even larger networks.

References

  1. M. Alam and M. Khan. Efficient algorithms for generating massive random networks. Technical Report 13-064, NDSSL at Virginia Tech, May 2013.Google ScholarGoogle Scholar
  2. N. Alon, R. Yuster, and U. Zwick. Finding and counting given length cycles. Algorithmica, 17:209--223, 1997.Google ScholarGoogle ScholarCross RefCross Ref
  3. C. Apte, B. Liu, E. Pednault, and P. Smyth. Business applications of data mining. Commun. ACM, 45(8):49--53, Aug. 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Z. Bar-Yosseff, R. Kumar, and D. Sivakumar. Reductions in streaming algorithms, with an application to counting triangles in graphs. In SODA, pages 623--632, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. A. Barabasi and R. Albert. Emergence of scaling in random networks. Science, 286:509--512, 1999.Google ScholarGoogle ScholarCross RefCross Ref
  6. C. Barrett, R. Beckman, et al. Generation and analysis of large synthetic social contact networks. In WSC, pages 103--114, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. L. Becchetti, P. Boldi, C. Castillo, and A. Gionis. Efficient semi-streaming algorithms for local triangle counting in massive graphs. In KDD, pages 16--24, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. B. Bollobas. Random Graphs. Cambridge Univ. Press, 2001.Google ScholarGoogle Scholar
  9. J. Chen and S. Lonardi. Biological Data Mining. Chapman & Hall/CRC, 1st edition, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. S. Chu and J. Cheng. Triangle listing in massive networks and its applications. In KDD, pages 672--680, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. F. Chung and L. Lu. Complex Graphs and Networks. American Mathematical Society, Aug. 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. D. Coppersmith and S. Winograd. Matrix multiplication via arithmetic progressions. In STOC, pages 1--6, 1987. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Dean and S. Ghemawat. Mapreduce: Simplified data processing on large clusters. In OSDI, pages 137--150, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. J. Eckmann and E. Moses. Curvature of co-links uncovers hidden thematic layers in the world wide web. Proc. Natl. Acad. of Sci. USA, 99(9):5825--5829, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  15. M. Girvan and M. Newman. Community structure in social and biological networks. Proc. Natl. Acad. of Sci. USA, 99(12):7821--7826, June 2002.Google ScholarGoogle ScholarCross RefCross Ref
  16. M. Jha, C. Seshadhri, and A. Pinar. A space efficient streaming algorithm for triangle counting using the birthday paradox. In KDD, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. H. Kwak, C. Lee, et al. What is twitter, a social network or a news media? In WWW, pages 591--600, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. M. Latapy. Main-memory triangle computations for very large (sparse (power-law)) graphs. Theor. Comput. Sci., 407:458--473, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. M. McPherson, L. Smith-Lovin, and J. Cook. Birds of a feather: Homophily in social networks. Annual Rev. of Soc., 27(1):415--444, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  20. R. Milo, S. Shen-Orr, et al. Network motifs: simple building blocks of complex networks. Science, 298(5594):824--827, October 2002.Google ScholarGoogle ScholarCross RefCross Ref
  21. M. Newman. Coauthorship networks and patterns of scientific collaboration. Proc. Natl. Acad. of Sci. USA, 101(1):5200--5205, April 2004.Google ScholarGoogle ScholarCross RefCross Ref
  22. R. Pagh and C. Tsourakakis. Colorful triangle counting and a mapreduce implementation. Information Processing Letters, 112(7):277--281, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. T. Schank. Algorithmic Aspects of Triangle-Based Network Analysis. PhD thesis, University of Karlsruhe, 2007.Google ScholarGoogle Scholar
  24. T. Schank and D. Wagner. Finding, counting and listing all triangles in large graphs, an experimental study. In Exp. and Efficient Algorithms, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. C. Seshadhri, A. Pinar, and T. Kolda. Triadic measures on graphs: the power of wedge sampling. In SDM, pages 10--18, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  26. SNAP. Stanford network analysis project. http://snap.stanford.edu/, 2012.Google ScholarGoogle Scholar
  27. S. Suri and S. Vassilvitskii. Counting triangles and the curse of the last reducer. In WWW, pages 607--614, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. C. Tsourakakis, U. Kang, G. Miller, and C. Faloutsos. Doulion: counting triangles in massive graphs with a coin. In KDD, pages 837--846, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. PATRIC: a parallel algorithm for counting triangles in massive networks

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          CIKM '13: Proceedings of the 22nd ACM international conference on Information & Knowledge Management
          October 2013
          2612 pages
          ISBN:9781450322638
          DOI:10.1145/2505515

          Copyright © 2013 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 27 October 2013

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article

          Acceptance Rates

          CIKM '13 Paper Acceptance Rate143of848submissions,17%Overall Acceptance Rate1,861of8,427submissions,22%

          Upcoming Conference

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader