skip to main content
10.1145/1060745.1060827acmconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
Article

PageRank as a function of the damping factor

Published:10 May 2005Publication History

ABSTRACT

PageRank is defined as the stationary state of a Markov chain. The chain is obtained by perturbing the transition matrix induced by a web graph with a damping factor α that spreads uniformly part of the rank. The choice of α is eminently empirical, and in most cases the original suggestion α = 0.85 by Brin and Page is still used. Recently, however, the behaviour of PageRank with respect to changes in α was discovered to be useful in link-spam detection[21]. Moreover, an analytical justification of the value chosen for α is still missing. In this paper, we give the first mathematical analysis of PageRank when α changes. In particular, we show that, contrarily to popular belief, for real-world graphs values of α close to 1 do not give a more meaningful ranking. Then, we give closed-form formulae for PageRank derivatives of any order, and an extension of the Power Method that approximates them with convergence O (tk αt) for the k-th derivative. Finally, we show a tight connection between iterated computation and analytical behaviour by proving that the k-th iteration of the Power Method gives exactly the PageRank value obtained using a Maclaurin polynomial of degree k. The latter result paves the way towards the application of analytical methods to the study of PageRank.

References

  1. Paolo Boldi, Bruno Codenotti, Massimo Santini, and Sebastiano Vigna. Ubicrawler: A scalable fully distributed web crawler. Software: Practice & Experience, 34(8):711--726, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Paolo Boldi and Sebastiano Vigna. The WebGraph framework I: Compression techniques. In Proc. WWW 13, pages 595--601, Manhattan, USA, 2004. ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Soumen Chakrabarti, Byron Dom, David Gibson, Jon Kleinberg, S. Ravi Kumar, Prabhakar Raghavan, Sridhar Rajagopalan, and Andrew Tomkins. Hypersearching the web. Scientific American, June 1999.Google ScholarGoogle Scholar
  4. Scott C. Deerwester, Susan T. Dumais, Thomas K. Landauer, George W. Furnas, and Richard A. Harshman. Indexing by latent semantic analysis. Journal of the American Society of Information Science, 41(6):391--407, 1990.Google ScholarGoogle ScholarCross RefCross Ref
  5. Nadav Eiron, Kevin S. McCurley, and John A. Tomlin. Ranking the web frontier. In Proceedings of the thirteenth international conference on World--Wide Web, pages 309--318. ACM Press, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Gene H. Golub and Chen Greif. Arnoldi-type algorithms for computing stationary distribution vectors, with application to PageRank. Technical Report SCCM-04-15, Stanford University Technical Report, 2004.Google ScholarGoogle Scholar
  7. Taher Haveliwala. Efficient computation of PageRank. Technical report, Stanford University Technical Report, October 1999.Google ScholarGoogle Scholar
  8. Taher Haveliwala and Sepandar Kamvar. The condition number of the PageRank problem. Technical Report~36, Stanford University Technical Report, June 2003.Google ScholarGoogle Scholar
  9. Taher Haveliwala and Sepandar Kamvar. The second eigenvalue of the Google matrix. Technical Report~20, Stanford University Technical Report, March 2003.Google ScholarGoogle Scholar
  10. Sepandar Kamvar, Taher Haveliwala, Christopher Manning, and Gene Golub. Exploiting the block structure of the web for computing PageRank. Technical Report 17, Stanford University Technical Report, March 2003.Google ScholarGoogle Scholar
  11. Sepandar D. Kamvar, Taher H. Haveliwala, Christopher D. Manning, and Gene H. Golub. Extrapolation methods for accelerating PageRank computations. In Proceedings of the twelfth international conference on World Wide Web, pages 261--270. ACM Press, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Jon M. Kleinberg. Authoritative sources in a hyperlinked environment. Journal of the ACM, 46(5):604--632, September 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Ravi Kumar, Prabhakar Raghavan, Sridhar Rajagopalan, D. Sivakumar, Andrew Tomkins, and Eli Upfal. The Web as a graph. In Proc. 19th ACM SIGACT-SIGMOD-AIGART Symp. Principles of Database Systems, PODS, pages 1--10. ACM Press, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Amy N. Langville and Carl D. Meyer. Deeper inside PageRank. Internet Mathematics, 1(3):355--400, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  15. Chris Pan-Chi Lee, Gene H. Golub, and Stefanos A. Zenios. A fast two-stage algorithm for computing PageRank and its extensions. Technical report, Stanford University Technical Report, 2004.Google ScholarGoogle Scholar
  16. Ronny Lempel and Shlomo Moran. SALSA: the stochastic approach for link-structure analysis. ACM Trans. Inf. Syst., 19(2):131--160, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Lawrence Page, Sergey Brin, Rajeev Motwani, and Terry Winograd. The PageRank citation ranking: Bringing order to the web. Technical report, Stanford Digital Library Technologies Project, Stanford University, Stanford, CA, USA, 1998.Google ScholarGoogle Scholar
  18. Luca Pretto. A theoretical analysis of google's PageRank. In Proceedings of the Ninth Symposium on String Processing and Information Retrieval, pages 131--144, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Luca Pretto. A theoretical approach to link analysis algorithms, 2002. PhD Thesis.Google ScholarGoogle Scholar
  20. Eugene Seneta. Non-Negative Matrices and Markov Chains. Springer Series in Statistics. Springer-Verlag, 1981.Google ScholarGoogle Scholar
  21. Hui Zhang, Ashish Goel, Ramesh Govindan, Kahn Mason, and Benjamin~Van Roy. Making eigenvector-based reputation systems robust to collusion. In Stefano Leonardi, editor, Proceedings WAW 2004, number 3243 in LNCS, pages 92--104. Springer-Verlag, 2004.Google ScholarGoogle Scholar

Index Terms

  1. PageRank as a function of the damping factor

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            WWW '05: Proceedings of the 14th international conference on World Wide Web
            May 2005
            781 pages
            ISBN:1595930469
            DOI:10.1145/1060745

            Copyright © 2005 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 10 May 2005

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • Article

            Acceptance Rates

            Overall Acceptance Rate1,899of8,196submissions,23%

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader