Skip to main content

2004 | OriginalPaper | Buchkapitel

Minimizing the Network Distance in Distributed Web Crawling

verfasst von : Odysseas Papapetrou, George Samaras

Erschienen in: On the Move to Meaningful Internet Systems 2004: CoopIS, DOA, and ODBASE

Verlag: Springer Berlin Heidelberg

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Distributed crawling has shown that it can overcome important limitations of the centralized crawling paradigm. However, the distributed nature of current distributed crawlers is currently not fully utilized. The optimal benefits of this approach are usually limited to the sites hosting the crawler. In this work we describe IPMicra, a distributed location aware web crawler that utilizes an IP address hierarchy and allows crawling of links in a near optimal location aware manner. The crawler outperforms earlier distributed crawling approaches without a significant overhead.

Metadaten
Titel
Minimizing the Network Distance in Distributed Web Crawling
verfasst von
Odysseas Papapetrou
George Samaras
Copyright-Jahr
2004
Verlag
Springer Berlin Heidelberg
DOI
https://doi.org/10.1007/978-3-540-30468-5_36

Premium Partner