skip to main content
article

Characteristics of streaming media stored on the Web

Published:01 November 2005Publication History
Skip Abstract Section

Abstract

Despite the growth in multimedia, there have been few studies that focus on characterizing streaming audio and video stored on the Web. This investigation used a customized Web crawler to traverse 17 million Web pages from diverse geographic locations and identify nearly 30,000 streaming audio and video clips available for analysis. Using custom-built extraction tools, these streaming media objects were analyzed to determine attributes such as media type, encoding format, playout duration, bitrate, resolution, and codec. The streaming media content encountered is dominated by proprietary audio and video formats with the top four commercial products being RealPlayer, Windows Media Player, MP3 and QuickTime. The distribution of the stored playout durations of streaming audio and video clips are long-tailed. More than half of the streaming media clips encountered are video, encoded primarily for broadband connections and at resolutions considerably smaller than the resolutions of typical monitors.

References

  1. Acharya, S. and Smith, B. 1998. An experiment to characterize videos stored on the Web. In Proceedings of the ACM/SPIE Multimedia Computing and Networking (MMCN). San Jose, CA. 166--178.Google ScholarGoogle Scholar
  2. Baker, M., Hartman, J., Kupfer, M., Shirriff, K., and Ousterhout, J. 1991. Measurements of a distributed file system. In Proceedings of the 13th Symposium on Operating System Principles (SOSP). Pacific Grove, CA. 198--212. Google ScholarGoogle Scholar
  3. Bray, T. 1996. Measuring the Web. In Proceedings of the 4th International World Wide Web Conference. Paris, France, 994--1005. Google ScholarGoogle Scholar
  4. Brown, E. S. 2001. Broadband walks the last mile. Tech. Rev. (Online). Available at http://www.technologyreview.com/articles/print_version/brown060501.asp.Google ScholarGoogle Scholar
  5. CAIDA (Cooperative Association for Internet Data Analysis). 2000. www.caida.org.Google ScholarGoogle Scholar
  6. Cao, Z., Wang, Z., and Zegura, E. 2000. Rainbow fair queuing: Fair bandwidth sharing without per-flow state. In Proceedings of IEEE Infocom. Tel-Aviv, Israel, 922--931.Google ScholarGoogle Scholar
  7. Chesire, M., Wolman, A., Voelker, G., and Levy, H. 2001. Measurement and analysis of a streaming media workload. In Proceedings of the USENIX Symposium on Internet Technologies and Systems (USITS). San Francisco, CA. 1--12. Google ScholarGoogle Scholar
  8. Chung, J., Claypool, M., and Zhu, Y. 2003. Measurement of the congestion responsiveness of RealPlayer streaming video over UDP. In Proceedings of the Packet Video Workshop (PV). Nantes, France.Google ScholarGoogle Scholar
  9. Crovella, M. E. and Taqqu, M. S. 1999. Estimating the heavy tail index from scaling properties. Methodol. Comput. Appl. Probab. 1, 1, 55--79. Google ScholarGoogle Scholar
  10. Downey, A. B. 2001. Evidence for long-tailed distributions in the Internet. In Proceedings of the ACM SIGCOMM Internet Measurement Workshop. San Francisco, CA. 229--241. Google ScholarGoogle Scholar
  11. Feldmann, A., Gilbert, A., Huang, P., and Willinger, W. 1995. Dynamics of IP traffic: A study of the role of variability and the impact of control. In Proceedings of ACM SIGCOMM. Cambridge, MA. 301--313. Google ScholarGoogle Scholar
  12. Feng, W., Kandlur, D., Saha, D., and Shin, K. 2001. Stochastic fair blue: A queue management algorithm for enforcing fairness. In Proceedings of IEEE Infocom. Anchorage, AK. 1520--1529.Google ScholarGoogle Scholar
  13. Floyd, S., Handley, M., Padhye, J., and Widmer, J. 2000. Equation-based congestion control for unicast applications. In Proceedings of ACM SIGCOMM Conference. Stockholm, Sweden, 43--56. Google ScholarGoogle Scholar
  14. for Internet Data Analysis (CAIDA), C. A. 2002. Characterization of Internet traffic loads, segregated by application (Online). Available at http://www.caida.org/analysis/workload/byapplication/.Google ScholarGoogle Scholar
  15. Jupiter Media Metrix. 2001. Users of media player applications increased 33 percent since last year. Press Release. Available at http://www.jup.com/company/pressrelease-.jsp?doc=pr01040.Google ScholarGoogle Scholar
  16. Kuang, T. and Williamson, C. 2002a. A measurement study of RealMedia audio/video streaming traffic. In Proceedings of ITCOM. Boston, MA. 68--79.Google ScholarGoogle Scholar
  17. Kuang, T. and Williamson, C. 2002b. RealMedia streaming performance on an IEEE 802.11b wireless LAN. In Proceedings of IASTED Wireless and Optical Communications (WOC). 306--311.Google ScholarGoogle Scholar
  18. Li, M., Claypool, M., and Kinicki, R. 2002. MediaPlayer versus RealPlayer---A comparison of network turbulence. In Proceedings of the ACM SIGCOMM Internet Measurement Workshop (IMW). Marseille, France, 131--136. Google ScholarGoogle Scholar
  19. Li, M., Claypool, M., Kinicki, R., and Nichols, J. 2003. Characteristics of streaming media stored on the web. Tech. Rep. WPI-CS-TR-03-18, CS Department, Worcester Polytechnic Institute (May).Google ScholarGoogle Scholar
  20. Mahajan, R., Floyd, S., and Wetherall, D. 2001. Controlling high-bandwidth flows at the congested routers. In Proceedings of the 9th International Conference on Network Protocols (ICNP). Mission Inn, Riverside, CA. 192--201. Google ScholarGoogle Scholar
  21. Mena, A. and Heidemann, J. 2000. An empirical study of real audio traffic. In Proceedings of IEEE Infocom. Tel-Aviv, Israel, 101--110.Google ScholarGoogle Scholar
  22. Merwe, J. V. D., Caceres, R., hua Chu, Y., and Sreenan, C. 2000. mmdump---A tool for monitoring Internet multimedia traffic. ACM Comput. Comm. Rev. 30, 5 (Oct.), 48--59. Google ScholarGoogle Scholar
  23. Merwe, J. V. D., Sen, S., and Kalmanek, C. 2002. Streaming video traffic: Characterization and network impact. In Proceedings of the 7th International Workshop on Web Content Caching and Distribution. Boulder, CO.Google ScholarGoogle Scholar
  24. Ousterhout, J., DaCosta, H., Harrison, D., Kunze, J., Kupfer, M., and Thompson, J. 1985. A trace-driven analysis of the Unix 4.2 BSD file system. In Proceedings of the 10th Symposium on Operating System Principles (SOSP). Orcas Island, WA. 15--24. Google ScholarGoogle Scholar
  25. Park, K. and Willinger, W. 2000. Self-similar network traffic and performance. In Self-Similar Network Traffic: An Overview. (Chapter 1) John Wiley Interscience. Google ScholarGoogle Scholar
  26. Paxson, V. and Floyd, S. 1995. Wide-area traffic: The failure of poisson modeling. IEEE/ACM Trans. Netw. 3, 226--244. Google ScholarGoogle Scholar
  27. Real Networks Incorporated. 2001. RealNetworks facts. URL: http://www.reanetworks.com/gcompany/index.html.Google ScholarGoogle Scholar
  28. RealNetworks. 2003. RealNetworks and major media companies launch streaming news, sports and entertainment content to mobile devices. Press Release. Available at http://www.realnetworks.com/company/press/releases/2003/mediaguides.html.Google ScholarGoogle Scholar
  29. Rejaie, R., Handley, M., and Estrin, D. 1999. RAP: An end-to-end rate-based congestion control mechanism for realtime streams in the Internet. In Proceedings of IEEE Infocom. New York, NY. 1337--1345.Google ScholarGoogle Scholar
  30. Saroiu, S., Gummadi, K. P., Dunn, R. J., Gribble, S. D., and Levy, H. M. 2002. An analysis of Internet content delivery systems. In Usenix Operating Systems Design and Implementation (OSDI). Boston, MA. 315--327. Google ScholarGoogle Scholar
  31. Saroiu, S., Gummadi, P., and Gribble, S. 2003. Measuring and analyzing the characteristics of Napster and Gnutella hosts. Multimedia Syst. J. 9, 2 (Aug.), 170--184. Google ScholarGoogle Scholar
  32. Stoica, I., Shenker, S., and Zhang, H. 1998. Core-stateless fair queueing: Achieving approximately fair bandwidth allocations in high speed networks. In Proceedings of ACM SIGCOMM Conference. Vancouver, British Columbia, Canada, 118--130. Google ScholarGoogle Scholar
  33. Sullivan, D. Search engine sizes. Available at http://searchenginewatch.com/reports/sizes.html.Google ScholarGoogle Scholar
  34. Topic, P. 2002. DSL Passes 30m lines worldwide. Available at http://www.point-topic.com/analysis.htm.Google ScholarGoogle Scholar
  35. Veloso, E., Almeida, V., Meira, W., Bestavros, A., and Jin, S. 2002. A hierarchical characterization of a live streaming media workload. In Proceedings of the ACM SIGCOMM Internet Measurement Workshop. Marseille, France, 117--130. Google ScholarGoogle Scholar
  36. Wang, Y., Claypool, M., and Zuo, Z. 2001. An empirical study of RealVideo performance across the Internet. In Proceedings of the ACM SIGCOMM Internet Measurement Workshop (IMW). San Francisco, CA. 295--309. Google ScholarGoogle Scholar
  37. Wang, Z., Banerjee, S., and Jamin, S. 2003. Studying streaming video quality: From an application point of view. In Proceedings of ACM Multimedia. Berkeley, CA. 327--330. Google ScholarGoogle Scholar
  38. Willinger, W., Taqqu, M., Sherman, R., and Wilson, D. 1995. Self-similarity through high-variability: statistical analysis of eithernet LAN traffic at the source level. In Proceedings of ACM SIGCOMM. Cambridge, MA. 100--113. Google ScholarGoogle Scholar
  39. Wills, C. E., Mikhailov, M., and Shang, H. 2003. Inferring relative popularity of Internet applications by actively querying DNS caches. In Proceedings of the Internet Measurement Conference (IMC). 78--90. Google ScholarGoogle Scholar
  40. Woodruff, A., Aoki, P., Brewer, E., Gautheir, P., and Rowe, L. 1996. An investigation of documents from the World Wide Web. In Proceedings of the 4th International World Wide Web Conference. Paris, France, 963--979. Google ScholarGoogle Scholar

Index Terms

  1. Characteristics of streaming media stored on the Web

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in

      Full Access

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader