ABSTRACT
The mismatch between increasingly large video resolution and constrained screen size of mobile devices has led to the proposal of zoomable video systems based on tiled video. In the current system, a tiled video frame is constructed from multiple tiles in a single resolution stream. In this paper, we explore the perceptual effect of mixed-resolution tiles in tiled video, in which tiles within a video frame could come from streams with different resolutions, with the aim to tradeoff bandwidth and perceptual video quality. To understand how users perceive the video quality of mixed-resolution tiled video, we conducted a psychophysical study with 50 participants on tiled videos where the tile resolutions are randomly chosen from two resolution levels with equal probability. The experiment results show that in many cases, we can mix tiles from HD (1920×1080p) stream and tiles from 1600×900p stream without being noticed by the viewers. Even when participants notice quality degradation in videos combined with tiles from HD stream and tiles from 960×540p stream, the majority of participants still accept the degradation when viewing videos with low and medium motion; and greater than 40% of participants accept the quality degradation when viewing video with dense motion.
- D Varuna SX De Silva, Warnakulasuriya Anil Chandana Fernando, Gokce Nur, Erhan Ekmekcioglu, and Stewart T Worrall. 3d video assessment with just noticeable difference in depth evaluation. In Image Processing (ICIP), 2010 17th IEEE International Conference on, pages 4013--4016. IEEE, 2010.Google ScholarCross Ref
- Wu-chi Feng, Thanh Dang, John Kassebaum, and Tim Bauman. Supporting region-of-interest cropping through constrained compression. In Proceedings of the 16th ACM international conference on Multimedia, pages 745--748. ACM, 2008. Google ScholarDigital Library
- Wu-chi Feng, Thanh Dang, John Kassebaum, and Tim Bauman. Supporting region-of-interest cropping through constrained compression. ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP), 7(3):17, 2011. Google ScholarDigital Library
- George A Gescheider. Psychophysics: the fundamentals. Psychology Press, 1997.Google Scholar
- Aditya Mavlankar, Pierpaolo Baccichet, David Varodayan, and Bernd Girod. Optimal slice size for streaming regions of high resolution video with virtual pan/tilt/zoom functionality. In EUSIPCO, 2007.Google Scholar
- Aditya Mavlankar, David Varodayan, and Bernd Girod. Region-of-interest prediction for interactively streaming regions of high resolution video. In Packet Video. IEEE, 2007.Google Scholar
- Ngo Quang Minh Khiem, Guntur Ravindra, Axel Carlier, and Wei Tsang Ooi. Supporting zoomable video streams with dynamic region-of-interest cropping. In MMSys. ACM, 2010. Google ScholarDigital Library
- Ngo Quang Minh Khiem, Guntur Ravindra, and Wei Tsang Ooi. Adaptive encoding of zoomable video streams based on user access pattern. In MMSys. ACM, 2011. Google ScholarDigital Library
- Aniruddha Sinha, Gaurav Agarwal, and Alwin Anbu. Region-of-interest based compressed domain video transcoding scheme. In Acoustics, Speech, and Signal Processing, 2004. Proceedings.(ICASSP'04). IEEE International Conference on, volume 3, pages iii--161. IEEE, 2004.Google Scholar
- Ray van Brandenburg, Omar Niamut, Martin Prins, and Hans Stokking. Spatial segmentation for immersive media delivery. In ICIN. IEEE, 2011.Google ScholarCross Ref
- Haohong Wang and Khaled El-Maleh. Joint adaptive background skipping and weighted bit allocation for wireless video telephony. In Wireless Networks, Communications and Mobile Computing, 2005 International Conference on, volume 2, pages 1243--1248. IEEE, 2005.Google Scholar
- Haohong Wang, Yi Liang, and Khaled El-Maleh. Real-time region-of-interest video coding using content-adaptive background skipping with dynamic bit reallocation. In Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on, volume 2, pages II--II. IEEE, 2006.Google Scholar
- Wanmin Wu, Ahsan Arefin, Gregorij Kurillo, Pooja Agarwal, Klara Nahrstedt, and Ruzena Bajcsy. Color-plus-depth level-of-detail in 3d tele-immersive video: a psychophysical approach. In Proceedings of the 19th ACM international conference on Multimedia, pages 13--22. ACM, 2011. Google ScholarDigital Library
Index Terms
- Mixing Tile Resolutions in Tiled Video: A Perceptual Quality Assessment
Recommendations
Tile Rate Allocation for 360-Degree Tiled Adaptive Video Streaming
MM '20: Proceedings of the 28th ACM International Conference on Multimedia360-degree video streaming commonly encodes and transmits the video as independently-decodable tiles to conserve bandwidth of regions out of the viewer's field of view (FoV). The bitrate of the tiles, however, can vary significantly across the tiles, ...
Mixing Tile Resolutions in Tiled Video: A Perceptual Quality Assessment
NOSSDAV '14: Proceedings of Network and Operating System Support on Digital Audio and Video WorkshopThe mismatch between increasingly large video resolution and constrained screen size of mobile devices has led to the proposal of zoomable video systems based on tiled video. In the current system, a tiled video frame is constructed from multiple tiles ...
On tile assignment for region-of-interest video streaming in a wireless LAN
NOSSDAV '12: Proceedings of the 22nd international workshop on Network and Operating System Support for Digital Audio and VideoWe consider the following problem in this paper: A video is encoded as a set of tiles T and is streamed to multiple users via a one-hop wireless LAN. Each user selects a region-of-interest (RoI), represented as a subset of T, in the video to watch. The ...
Comments