skip to main content
10.1145/371920.371965acmconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
Article

Breadth-first crawling yields high-quality pages

Authors Info & Claims
Published:01 April 2001Publication History
First page image

References

  1. 1.K.Bharat,A.Broder,M.Henzinger,P.Kumar,and S.Venkatasubramanian.The connecti ity ser er:Fast access to linkage information on the web.In Proceedings of the 7th International Worl d Wide Web Conference pages 469 -477,Brisbane,Australia,April 1998.Elsevier Science. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. 2.S.Brin and L.Page.The anatomy of a large-scale hypertextual web search engine.In Proceedings of the 7th International World Wide Web Conference pages 107 -117,Brisbane,Australia,April 1998.Elsevier Science. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. 3.M.Burner.Crawling towards eternity:Building an archive of the world wide web.Web Techniques Magazine 2(5):37 -40,May 1997.Google ScholarGoogle Scholar
  4. 4.J.Cho,H.Garcia-Molina,and L.Page.E .cient crawling through URL ordering.In Proceedings of the 7th International World Wide Web Conference pages 161 -172,Brisbane,Australia,April 1998.Elsevier Science. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. 5.Google Inc.Press release:"Google launches world 's largest search engine."June 26,2000.A ailable at http://www.google.com/press/pressrel/pressrelease26.htmlGoogle ScholarGoogle Scholar
  6. 6.M.Henzinger,A.Heydon,M.Mitzenmacher,and M.Najork.On near-uniform URL sampling.In Proceedings of the 9th International Worl d Wide Web Conference pages 295 -308,Amsterdam,Netherlands, May 2000.Elsevier Science. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. 7.A.Heydon and M.Najork.Mercator:A scalable, extensible web crawler.World Wide Web 2(4):219 -229,Dec.1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. 8.J.Kleinberg.Authoritati e sources in a hyperlinked en ironment.In Proceedings of the 9th ACM-SIAM Symposium on Discrete Algorithms pages 668 -677, San Francisco,CA,Jan.1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. 9.P.Lyman,H.Varian,J.Dunn,A.Strygin,and K.Swearingen.How much information?School of Information Management and Systems,Uni .of California at Berkeley,2000.A ailable at http://www.sims.berkeley.edu/how-much-infoGoogle ScholarGoogle Scholar
  10. 10.Mercator Home Page. http://www.research.digital.com/SRC/mercatorGoogle ScholarGoogle Scholar
  11. 11.J.L.Wiener,R.Wickremesinghe,M.Burrows, K.Randall,and R.Stata.Better link compression. Manuscript in progress.Compaq Systems Research Center,2001.Google ScholarGoogle Scholar

Index Terms

  1. Breadth-first crawling yields high-quality pages

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in
            • Published in

              cover image ACM Conferences
              WWW '01: Proceedings of the 10th international conference on World Wide Web
              May 2001
              770 pages
              ISBN:1581133480
              DOI:10.1145/371920

              Copyright © 2001 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 1 April 2001

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • Article

              Acceptance Rates

              Overall Acceptance Rate1,899of8,196submissions,23%

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader