skip to main content
10.1145/2339530.2339539acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

PageRank on an evolving graph

Published:12 August 2012Publication History

ABSTRACT

One of the most important features of the Web graph and social networks is that they are constantly evolving. The classical computational paradigm, which assumes a fixed data set as an input to an algorithm that terminates, is inadequate for such settings. In this paper we study the problem of computing PageRank on an evolving graph. We propose an algorithm that, at any moment in the time and by crawling a small portion of the graph, provides an estimate of the PageRank that is close to the true PageRank of the graph at that moment. We will also evaluate our algorithm experimentally on real data sets and on randomly generated inputs. Under a stylized model of graph evolution, we show that our algorithm achieves a provable performance guarantee that is significantly better than the naive algorithm that crawls the nodes in a round-robin fashion.

Skip Supplemental Material Section

Supplemental Material

306_m_talk_3.mp4

mp4

177.4 MB

References

  1. A. Anagnostopoulos, R. Kumar, M. Mahdian, and E. Upfal. Sort me if you can: How to sort dynamic data. In ICALP, pages 339--350, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Anagnostopoulos, R. Kumar, M. Mahdian, and E. Upfal. Algorithms on evolving graphs. In ITCS, pages 149--160, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. K. Avrachenkov, N. Litvak, D. Nemirovsky, and N. Osipova. Monte Carlo methods in Pagerank computation: When one iteration is sufficient. SIAM J. Numer. Anal., 45(2):890--904, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. B. Bahmani, K. Chakrabarti, and D. Xin. Fast personalized PageRank on MapReduce. In SIGMOD, pages 973--984, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. B. Bahmani, A. Chowdhury, and A. Goel. Fast incremental and personalized PageRank. PVLDB, 4:173--184, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. P. Berkhin. Survey: A survey on PageRank computing. Internet Mathematics, 2(1):73--120, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  7. Y.-Y. Chen, Q. Gan, and T. Suel. Local methods for estimating PageRank values. In CIKM, pages 381--389, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. S. Chien, C. Dwork, S. Kumar, D. Simon, and D. Sivakumar. Link evolution: Analysis and algorithms. Internet Mathematics, 1(3):277--304, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  9. A. Das Sarma, S. Gollapudi, and R. Panigrahy. Estimating PageRank on graph streams. In PODS, pages 69--78, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. Dasgupta, A. Ghosh, R. Kumar, C. Olston, S. Pandey, and A. Tomkins. The discoverability of the web. In WWW, pages 421--430, 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. J. V. Davis and I. S. Dhillon. Estimating the global PageRank of web communities. In KDD, pages 116--125, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. D. Eppstein, Z. Galil, and G. F. Italiano. Dynamic graph algorithms. In M. J. Atallah, editor, Algorithms and Theoretical Computing Handbook. CRC Press, 1999.Google ScholarGoogle Scholar
  13. S. Kamvar, T. Haveliwala, and G. Golub. Adaptive methods for the computation of PageRank. Technical report, Stanford University, 2003.Google ScholarGoogle Scholar
  14. S. Kamvar, T. Haveliwala, C. Manning, and G. Golub. Exploiting the block structure of the web for computing PageRank. Technical report, Stanford University, 2003.Google ScholarGoogle Scholar
  15. S. D. Kamvar, T. H. Haveliwala, C. D. Manning, and G. H. Golub. Extrapolation methods for accelerating PageRank computations. In WWW, pages 261--270, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. R. Kleinberg. Online Decision Problems with Large Strategy Sets. PhD thesis, MIT, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. A. N. Langville and C. D. Meyer. Updating PageRank using the group inverse and stochastic complementation. Technical report, NCSU, Mathematics Department, 2002.Google ScholarGoogle Scholar
  18. A. N. Langville and C. D. Meyer. Updating PageRank with iterative aggregation. In WWW (posters), pages 392--393, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. A. N. Langville and C. D. Meyer. Updating Markov Chains with an eye on Google's PageRank. SIAM J. Matrix Anal. Appl., 27(4):968--987, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. F. McSherry. A uniform approach to accelerated PageRank computation. In WWW, pages 575--582, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. S. Muthukrishnan. Data Streams: Algorithms and Applications. Now Publishers Inc, 2005.Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. C. Olston and S. Pandey. Recrawl scheduling based on information longevity. In WWW, pages 437--446, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. L. Page, S. Brin, R. Motwani, and T. Winograd. The PageRank citation ranking: Bringing order to the web. Technical report, Stanford InfoLab, 1999.Google ScholarGoogle Scholar
  24. S. Pandey and C. Olston. User-centric web crawling. In WWW, pages 401--411, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. S. Pandey and C. Olston. Crawl ordering by search impact. In WSDM, pages 3--14, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. D. Ron. Property testing: A learning theory perspective. Found. Trends Mach. Learn., 1:307--402, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. D. Ron. Algorithmic and analysis techniques in property testing. Found. Trends Theor. Comput. Sci., 5:73--205, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. H. H. Song, T. W. Cho, V. Dave, Y. Zhang, and L. Qiu. Scalable proximity estimation and link prediction in online social networks. In IMC, pages 322--335, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. H. Tong, S. Papadimitriou, P. S. Yu, and C. Faloutsos. Proximity tracking on time-evolving bipartite graphs. In SDM, pages 704--715, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  30. J. L. Wolf, M. S. Squillante, P. S. Yu, J. Sethuraman, and L. Ozsen. Optimal crawling strategies for web search engines. In WWW, pages 136--147, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. PageRank on an evolving graph

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      KDD '12: Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
      August 2012
      1616 pages
      ISBN:9781450314626
      DOI:10.1145/2339530

      Copyright © 2012 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 12 August 2012

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,133of8,635submissions,13%

      Upcoming Conference

      KDD '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader