skip to main content
article

An analysis of Internet content delivery systems

Published:31 December 2002Publication History
Skip Abstract Section

Abstract

In the span of only a few years, the Internet has experienced an astronomical increase in the use of specialized content delivery systems, such as content delivery networks and peer-to-peer file sharing systems. Therefore, an understanding of content delivery on the lnternet now requires a detailed understanding of how these systems are used in practice.This paper examines content delivery from the point of view of four content delivery systems: HTTP web traffic, the Akamai content delivery network, and Kazaa and Gnutella peer-to-peer file sharing traffic. We collected a trace of all incoming and outgoing network traffic at the University of Washington, a large university with over 60,000 students, faculty, and staff. From this trace, we isolated and characterized traffic belonging to each of these four delivery classes. Our results (1) quantify, the rapidly increasing importance of new content delivery systems, particularly peer-to-peer networks, (2) characterize the behavior of these systems from the perspectives of clients, objects, and servers, and (3) derive implications for caching in these systems.

References

  1. Akamai. http://www.akamai.com.]]Google ScholarGoogle Scholar
  2. J. Almeida, V. Almeida, and D. Yates. Measuring the behavior of a world-wide web server. Technical Report 1996-025, Boston University, Oct. 1996.]] Google ScholarGoogle Scholar
  3. L. Breslau, P. Cao, L. Fan, G. Phillips, and S. Shenker. Web caching and zipf-like distributions: Evidence and implications. In Proc. of IEEE INFOCOM 1999, March 1999.]]Google ScholarGoogle ScholarCross RefCross Ref
  4. R. Caceres, F. Douglis, A. Feldmann, G. Glass, and M. Rabinovich. Web proxy caching: The devil is in the details. In Workshop on Internet Server Performance, June 1998.]]Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. P. Cao, J. Zhang, and K. Beach. Active cache: Caching dynamic contents on the web. In Proc. of IFIP Int. Conf. on Distributed Systems Platforms and Open Distributed Processing, Sep. 1998.]]Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Chakrabarti, B.E. Dom, S. R. Kumar, P. Raghavan, S. Rajagopalan, A. Tomkins, D. Gibson, and J. Kleinberg. Mining the Web's link structure. Computer, 32(8), 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. Chankhunthod, P. B. Danzig, C. Neerdaels, M. F. Schwartz, and K. J. Worrell. A hierarchical internet object cache. In Proc. of the 1996 USENIX Annual Technical Conf., Jan. 1996.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. Chesire, A. Wolman, G. Voelker, and H. Levy. Measurement and analysis of a streaming media workload. In Proc. of the 2001 USENIX Symp. on lnternet Technologies and Systems, March 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. I. Clarke, O. Sandberg, B. Wiley, and T. Hong. Freenet: A distributed anonymous information storage and retrieval system. In Proc. of the ICSI Workshop on Design Issues in Anonymity and Unobservability, 2000.]]Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Clip2. The Gnutella protocol specification v.0.4, March 2001. http://www.Clip2.com/GnutellaProtoco104.pdf.]]Google ScholarGoogle Scholar
  11. F. Douglis, A. Feldmann, B. Krishnamurthy, and J. C. Mogul. Rate of change and other metrics: a live study of the world wide web. In Proc. of the 1997 USENIX Symp. on Internet Technologies and Systems, Dec. 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. B. Duska, D. Marwood, and M. J. Feeley. The measured access characteristics of World Wide Web client proxy caches. In Proc. of the 1st USENIX Symp. on Internet Technologies and Systems, Dec. 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. L. Fan, P. Cao, J. Almeida, and A. Z. Broder. Summary cache: A scalable wide-area web cache sharing protocol. In Proc. of ACM SIGCOMM '98, Aug. 1998.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. S. Gadde, J. Chase, and M. Rabinovich. Web caching and content distribution: A view from the interior. In Proc. of the 5th International Web Caching and Content Delivery Workshop, May 2000.]]Google ScholarGoogle Scholar
  15. S. D. Gribble and E. A. Brewer. System design issues for internet middleware services: Deductions from a large client trace. In Proc. of the 1997 USENIX Symp. on Internet Technologies and Systems, Dec. 1997.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Internet Engineering Task Force. Hypertext transfer protocol - http 1.1. RFC 2068, March 1997.]]Google ScholarGoogle Scholar
  17. Internet2. http://netflow.internet2.edu/weekly/20020422.]]Google ScholarGoogle Scholar
  18. K. L. Johnson, J. F. Carr, M. S. Day, and M. Frans Kaashoek. The measured performance of content distribution networks. Computer Communications, 24(2), 2001.]]Google ScholarGoogle Scholar
  19. J. Kangasharju, K.W. Ross, and J.W. Roberts. Performance evaluation of redirection schemes in content distribution networks. Computer Communications, 24(2):207--214, 2001.]]Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Kazaa. http://www. kazaa, com.]]Google ScholarGoogle Scholar
  21. J. M. Kleinberg, R. Kumar, P. Raghavan, S. Rajagopalan, and A. S. Tomkins. The Web as a graph: Measurements, models, and methods. In T. Asano, H. Imai, D. T. Lee, S. Nakano, and T. Tokuyama, editors, Proc. of the 5th Annual Int. Conf. Computing and Combinatorics, number 1627. Springer-Verlag, 1999.]]Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. M. Koletsou and G. M. Voelker. The Medusa proxy: A tool for exploring user-perceived web performance. In Proc. of the Sixth Int. Workshop on Web Caching and Content Distribution, June 2001.]]Google ScholarGoogle Scholar
  23. B. Krishnamurthy, C. Wills, and Y. Zhang. On the use and performance of content distribution networks. In Proc. of SIGCOMM IMW 2001, Nov. 2001.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. N. Leibowitz, A. Bergman, R. Ben-Shaul, and A. Shavit. Are file swapping networks cacheable? Characterizing P2P traffic. In Proc. of the 7th Int. WWW Caching Workshop, August 2002.]]Google ScholarGoogle Scholar
  25. B. Maggs. Global Internet Content Delivery. Talk delivered in the Internet and Distributed Systems Seminar at Stanford University. http://www.stanford.edu/class/cs548/abstracts.shtml#bruce.]]Google ScholarGoogle Scholar
  26. S. McCanne and V. Jacobson. The BSD packet filter: A new architecture for user-level packet capture. In Proc. of the USENIX Technical Conf., Winter 1993.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. J.-M. Menaud, V. Issarny, and M. Banatre. A new protocol for efficient transversal Web caching. In Proc. of the 12th Int. Symp. on Distributed Computing, Sep. 1998.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Napster. http://www.napster.com.]]Google ScholarGoogle Scholar
  29. V. N. Padmanabhan and L. Qiu. The content and access dynamics of a busy web site: Findings and implications. In Proc. of ACM SIGCOMM 2000, August 2000.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. M. Rabinovich, J. Chase, and S. Gadde. Not all hits are created equal: Cooperative proxy caching over a wide area network. In Proc. of the 3rd Int. WWW Caching Workshop, June 1998.]]Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. S. Saroiu, P. K. Gummadi, and S. D. Gribble. A measurement study of peer-to-peer file sharing systems. In Proc. of Multimedia Computing and Networking 2002, Jan. 2002.]]Google ScholarGoogle Scholar
  32. A. Shaikh, R. Tewari, and M. Agrawal. On the effectiveness of DNS-based server selection. In Proc. of IEEE INFOCOM 2001, Anchorage, AK, USA 2001.]]Google ScholarGoogle Scholar
  33. R. Tewari, M. Dahlin, H. Vin, aud J. Kay. Design considerations for distributed caching on the Internet. In The 19th IEEE Int. Conf. on Distributed Computing Systems, May 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. D. Wessels, K. Claffy, and H.-W. Braun. NLANR prototype web caching system, http://ircache.nlanr.net/.]]Google ScholarGoogle Scholar
  35. A. Wolman, G. Voelker, N. Sharma, N. Cardwell, M. Brown, T. Landray, D. Pinnel, A. Karlin, and H. Levy. Organization-based analysis of web-object sharing and caching. In Proc. of the 2nd USENIX Conf. on Internet Technologies and Systems, Oct. 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. A. Wolman, G. Voelker, N. Sharma, N. Cardwell, A. Karlin, and H. Levy. The scale and performance of cooperative web proxy caching. In Proc. of the 17th ACM Symp. on Operating Systems Principles, Dec. 1999.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. B. Yang and H. Garcia-Molina. Efficient search in peer-to-peer networks. In Proc. of the 22nd Int. Conf. on Distributed Computing Systems, July 2002.]] Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. An analysis of Internet content delivery systems

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in

          Full Access

          • Published in

            cover image ACM SIGOPS Operating Systems Review
            ACM SIGOPS Operating Systems Review  Volume 36, Issue SI
            OSDI '02: Proceedings of the 5th Symposium on Operating Systems Design and Implementation
            Winter 2002
            398 pages
            ISSN:0163-5980
            DOI:10.1145/844128
            Issue’s Table of Contents

            Copyright © 2002 Copyright is held by the owner/author(s)

            Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 31 December 2002

            Check for updates

            Qualifiers

            • article

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader