skip to main content
10.1145/1835804.1835873acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Neighbor query friendly compression of social networks

Published:25 July 2010Publication History

ABSTRACT

Compressing social networks can substantially facilitate mining and advanced analysis of large social networks. Preferably, social networks should be compressed in a way that they still can be queried efficiently without decompression. Arguably, neighbor queries, which search for all neighbors of a query vertex, are the most essential operations on social networks. Can we compress social networks effectively in a neighbor query friendly manner, that is, neighbor queries still can be answered in sublinear time using the compression? In this paper, we develop an effective social network compression approach achieved by a novel Eulerian data structure using multi-position linearizations of directed graphs. Our method comes with a nontrivial theoretical bound on the compression rate. To the best of our knowledge, our approach is the first that can answer both out-neighbor and in-neighbor queries in sublinear time. An extensive empirical study on more than a dozen benchmark real data sets verifies our design.

Skip Supplemental Material Section

Supplemental Material

kdd2010_maserrat_nqfcsn_01.mov

mov

128.8 MB

References

  1. M. Adler and M. Mitzenmacher. Towards compressing web graphs. In Data Compression Conference, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Apostolico and G. Drovandi. Graph compression by BFS. Algorithms, 2(3):1031--1044, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  3. P. Boldi, M. Santini, and S. Vigna. Permuting web graphs. In Proceedings of the 6th International Workshop on Algorithms and Models for the Web-Graph (WAW'09), Berlin, Heidelberg, 2009. Springer-Verlag. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. P. Boldi and S. Vigna. The webgraph framework I: compression techniques. In WWW, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. P. Boldi and S. Vigna. The webgraph framework II: Codes for the world-wide web. In Data Compression Conference, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. A. Z. Broder, M. Charikar, A. M. Frieze, and M. Mitzenmacher. Min-wise independent permutations. J. Comput. Syst. Sci., 60(3):630--659, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. G. Buehrer and K. Chellapilla. A scalable pattern mining approach to web graph compression with communities. In WSDM, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. F. Chierichetti, R. Kumar, S. Lattanzi, M. Mitzenmacher, A. Panconesi, and P. Raghavan. On compressing social networks. In KDD, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. T. H. Cormen, C. E. Leiserson, R. L. Rivest, and C. Stein. Introduction to Algorithms. MIT Press, 2nd edition, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. J. D'ıaz, J. Petit, and M. J. Serna. A survey of graph layout problems. ACM Comput. Surv., 34(3):313--356, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. C. H. Papadimitriou. The NP-completeness of the bandwidth minimization problem. Computing, 16(3):263--270, 1976.Google ScholarGoogle ScholarCross RefCross Ref
  12. S. Raghavan and H. Garcia-Molina. Representing web graphs. In ICDE, 2003.Google ScholarGoogle Scholar
  13. K. H. Randall, R. Stata, J. L. Wiener, and R. Wickremesinghe. The link database: Fast access to graphs of the web. In DCC, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. K. F. Stanley Wasserman. Social networks analysis: Methods and Applications. Cambridge Press, 1994.Google ScholarGoogle ScholarCross RefCross Ref
  15. T. Suel and J. Yuan. Compressing the graph structure of the web. In Data Compression Conference, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. D. J. Watts and S. H. Strogatz. Collective dynamics of 'small world' networks. Nature, 393(6684):440--442, 1998.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Neighbor query friendly compression of social networks

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
      July 2010
      1240 pages
      ISBN:9781450300551
      DOI:10.1145/1835804

      Copyright © 2010 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 25 July 2010

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,133of8,635submissions,13%

      Upcoming Conference

      KDD '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader