Abstract
Despite the growth in multimedia, few studies have focused on characterizing streaming audio and video stored on the Web. This investigation used a customized Web crawler to traverse 17 million Web pages from diverse geographic locations and identify nearly 30,000 streaming audio and video clips available for analysis. Using custom-built extraction tools, these streaming media objects were analyzed to determine attributes such as media type, encoding format, playout duration, bitrate, resolution, and codec. The streaming media content encountered is dominated by proprietary audio and video formats, with the top four commercial products being RealPlayer, Windows Media Player, MP3, and QuickTime. The distribution of the stored playout durations of streaming audio and video clips is long-tailed. More than half of the streaming media clips encountered are video, encoded primarily for broadband connections and at resolutions considerably smaller than those of typical monitors.