skip to main content
10.1145/1298306.1298310acmconferencesArticle/Chapter ViewAbstractPublication PagesimcConference Proceedingsconference-collections
Article

Youtube traffic characterization: a view from the edge

Published:24 October 2007Publication History

ABSTRACT

This paper presents a traffic characterization study of the popular video sharing service, YouTube. Over a three month period we observed almost 25 million transactions between users on an edge network and YouTube, including more than 600,000 video downloads. We also monitored the globally popular videos over this period of time.

In the paper we examine usage patterns, file properties, popularity and referencing characteristics, and transfer behaviors of YouTube, and compare them to traditional Web and media streaming workload characteristics. We conclude the paper with a discussion of the implications of the observed characteristics. For example, we find that as with the traditional Web, caching could improve the end user experience, reduce network bandwidth consumption, and reduce the load on YouTube's core server infrastructure. Unlike traditional Web caching, Web 2.0 provides additional meta-data that should be exploited to improve the effectiveness of strategies like caching.

References

  1. About the U of C. http://ucalgary.ca/about/.Google ScholarGoogle Scholar
  2. S. Acharya and B. Smith. An Experiment to Characterize Videos Stored on the Web. In Proc. SPIE/ACM MMCN, San Jose, USA, Jan. 1998.Google ScholarGoogle Scholar
  3. S. Acharya, B. Smith, and P. Parnes. Characterizing User Access to Videos on the World Wide Web. In Proc. SPIE/ACM MMCN, San Jose, USA, Jan. 2000.Google ScholarGoogle Scholar
  4. J. Almeida, J. Krueger, D. Eager, and M. Vernon. Analysis of Educational Media Server Workloads. In Proc. ACM NOSSDAV, Port Jefferson, USA, June 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. M. Ames and M. Naaman. Why We Tag: Motivations for Annotation in Mobile and Online Media. In Proc. ACM CHI, San Jose, USA, May 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Arlitt and T. Jin. Workload Characterization of the 1998 World Cup Website. IEEE Network, 14(3), 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Arlitt and C. Williamson. Internet Web Servers: Workload Characterization and Performance Implications. IEEE/ACM Trans. on Networking, 5(5), 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. L. Breslau, P. Cao, L. Fan, G. Phillips, and S. Shenker. Web Caching and Zipf-like Distributions: Evidence and Implications. In Proc. IEEE INFOCOM, New York, USA, Mar. 1999.Google ScholarGoogle ScholarCross RefCross Ref
  9. Bro. http://www.bro-ids.org.Google ScholarGoogle Scholar
  10. R. Bunt and J. Murphy. The Measurement of Locality and the Behaviour of Programs. Computer Journal, 27(3), 1984. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. CBC. YouTube's Bride Wig Out Revealed as 'net seed' for Ad Campaign. CBC Arts, Feb. 2007.Google ScholarGoogle Scholar
  12. E. Chang, M. Davis, P. Schmitz, and S. Boll. Panel Discussion: Web 2.0 and Multimedia: Challenge, Hype, Synergy. In Proc. ACM MULTIMEDIA, Santa Barbara, USA, Oct. 2006.Google ScholarGoogle Scholar
  13. X. Cheng, C. Dale, and J. Liu. Understanding the Characteristics of Internet Short Video Sharing: YouTube as a Case Study. Technical Report arXiv:0707.3670v1 {cs.NI}, Cornell University, arXiv e-prints, July 2007.Google ScholarGoogle Scholar
  14. L. Cherkasova and M. Gupta. Analysis of Enterprise Media Server Workloads: Access Patterns, Locality, Content Evolution, and Rate of Change. IEEE/ACM Trans. on Networking, 12(5), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. M. Chesire, A. Wolman, G. Voelker, and H. Levy. Measurement and Analysis of a Streaming Media Workload. In Proc. USITS, San Francisco, USA, Mar. 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. C. Costa, I. Cunha, A. Borges, C. Ramos, M. Rocha, J. Almeida, and B. Ribeiro-Neto. Analyzing Client Interactivity in Streaming Media. In Proc. WWW, NY, USA, May 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. M. Crovella and A. Bestavros. Self-Similarity in World Wide Web Traffic: Evidence and Possible Causes. IEEE/ACM Trans. on Networking, 5(6), 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. C. Cunha, A. Bestavros, and M. Crovella. Characteristics of World Wide Web Client-based Traces. Technical Report BUCS-TR-1995-010, Boston University, USA, Apr. 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. P. Denning. Working sets past and present. IEEE Trans. Software Eng., 6(1), 1980. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. B. Duska, D. Marwood, and M. Freeley. The Measured Access Characteristics of World-Wide-Web Client Proxy Caches. In Proc. USITS, Monterey, USA, Mar. 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Facebook. http://www.facebook.com.Google ScholarGoogle Scholar
  22. Flickr. http://www.flickr.com.Google ScholarGoogle Scholar
  23. FlixHunt. http://www.flixhunt.com.Google ScholarGoogle Scholar
  24. S. Gribble and E. Brewer. System Design Issues for Internet Middleware Services: Deductions from a Large Client Trace. In Proc. USITS, Monterey, USA, Mar. 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. L. Guo, E. Tan, S. Chen, Z. Xiao, O. Spatscheck, and X. Zhang. Delving into Internet Streaming Media Delivery: A Quality and Resource Utilization Perspective. In Proc. ACM IMC, Rio de Janeriro, Brazil, Oct. 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. M. Halvey and M. Keane. Exploring Social Dynamics in Online Media Sharing. In Proc. of WWW, Banff, Canada, May 2007. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. N. Harel, V. Vellanki, A. Chervenak, G. Abowd, and U. Ramachandran. Characterizing A Media-Enhanced Classroom Server. In Proc. of IEEE Workshop on Workload Characterization (WCC), Austin, USA, Oct. 1999.Google ScholarGoogle Scholar
  28. M. Li, M. Claypool, R. Kinicki, and J. Nichols. Characteristics of streaming media stored on the web. ACM Trans. Inter. Tech., 5(4), 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Business Intelligence Lowdown. Top 10 Largest Databases in the World, Feb. 2007.ÜGoogle ScholarGoogle Scholar
  30. A. Mahanti, D. Eager, and C. Williamson. Temporal Locality and its Impact on Web Proxy Cache Performance. Perform. Eval., 42(2--3), 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. A. Mahanti, C. Williamson, and D. Eager. Traffic Analysis of a Web Proxy Caching Hierarchy. IEEE Network, 14(3), 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. S. Majumdar and R. Bunt. Measurement and Analysis of Locality Phases in File Referencing Behaviour. In Proc ACM SIGMETRICS/PERFORMANCE, Raleigh, USA, June 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. J. Milani. Coming to Your Screen: DIY TV. BBC Money Programme, 2007.Google ScholarGoogle Scholar
  34. M. Musgrove. Viacom Decides YouTube Is a Foe. Washington Post, Feb. 2007.Google ScholarGoogle Scholar
  35. My Space. http://www.myspace.com.Google ScholarGoogle Scholar
  36. V. Paxson. Empirically-Derived Analytic Models of Wide-Area TCP Connections. IEEE/ACM Trans. on Net., 2(4), 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. P. Schmitz. Leveraging Community Annotations for Image Adaptation to Small Presentation Formats. In Proc. ACM MULTIMEDIA, Santa Barbara, USA, Oct. 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. K. Sripanidkulchai, B. Maggs, and H. Zhang. An Analysis of Live Streaming Workloads on the Internet. In Proc. ACM IMC, Taormina, Italy, Oct. 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  39. Tcpdump. http://www.tcpdump.org.Google ScholarGoogle Scholar
  40. USA Today. YouTube Serves up 100 million Videos a Day Online, July 2006.Google ScholarGoogle Scholar
  41. Wordpress. http://www.wordpress.com.Google ScholarGoogle Scholar
  42. YouTube. http://www.youtube.com.Google ScholarGoogle Scholar
  43. H. Yu, D. Zheng, B. Zhao, and W. Zheng. Understanding User Behavior in Large-Scale Video-on-Demand Systems. SIGOPS Oper. Syst. Rev., 40(4), 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. G. Zipf. Human Behavior and the Principle of Least Effort. Addison-Wesley (Reading MA), 1949.Google ScholarGoogle Scholar

Index Terms

  1. Youtube traffic characterization: a view from the edge

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        IMC '07: Proceedings of the 7th ACM SIGCOMM conference on Internet measurement
        October 2007
        390 pages
        ISBN:9781595939081
        DOI:10.1145/1298306

        Copyright © 2007 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 24 October 2007

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate277of1,083submissions,26%

        Upcoming Conference

        IMC '24
        ACM Internet Measurement Conference
        November 4 - 6, 2024
        Madrid , AA , Spain

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader