ABSTRACT
Because of unique temporal and spatial properties of video data, different techniques for summarizing videos have been proposed. Key frames extracted directly from video inform users about content without requiring them to view the entire video. As part of ongoing work to develop video browsing interfaces, several interface displays based on key frames were investigated. Variations on dynamic key frame "slide shows" were examined and compared to a static key frame "filmstrip" display. The slide show mechanism displays key frames in rapid succession and is designed to facilitate visual browsing by exploiting human perceptual capabilities. User studies were conducted in a series of three experiments. Key frame display rate, number of simultaneous displays, and user perception were investigated as a function of user performance in object recognition and gist determination tasks. No significant performance degradation was detected at display rates up to 8 key frames per second, but performance degraded significantly at higher rates. Performance on gist determination tasks degraded less severely than performance on object recognition tasks as display rates increased. Furthermore, gist determination performance dropped significantly between three and four simultaneous slide shows in a single display. Users generally preferred key frame filmstrips to dynamic displays, although objective measures of performance were mixed. Implications for visual interface design and further questions for future research are provided.
- Beauchemin, S. S. and J. L. Barron. (1995). The computation of optical flow. ACM Computing Surveys, 27(3): 433--466. Google ScholarDigital Library
- Boyce, S. J., A. Pollatsek, and K. Rayner. (1989). Effect of background information on object identification. Journal of Experimental Psychology: Human Perception and Performance, 15(3): 556--566.Google ScholarCross Ref
- Christel, M. G., D. B. Winkler, and C. R. Taylor. (1997). Improving access to a digital video library. In Human-Computer Interaction: INTERACT97, Sydney Australia. Google ScholarDigital Library
- Ding, W., G. Marchionini, and T. Tse. (1997). Previewing video data: Browsing key frames at high rates using a video slide show interface. Proceedings of the International Symposium on Research, Development, and Practice in Digital Libraries (ISDL '97), Tsukuba, Japan.Google Scholar
- Ellis, H. C. and R. R. Hunt. (1989). Fundamentals of human memory and cognition. Wm. C. Brown Publishers: Dubuque, IA.Google Scholar
- Elliot, E. (1993). Watch, grab, arrange, see: Thinking with motion images via streams and collages. MSVS Thesis Document. Cambridge, MA: MIT Media Lab.Google Scholar
- Healey, C. G., K. S. Booth, and J. T. Enns. (1996). Highspeed visual estimation using pre-attentive processing. ACM Transactions on Computer-Human Interaction, 3(2), 107--135. Google ScholarDigital Library
- Komlodi, A. (1997). Visual surrogates for motion picture documents: Presentation techniques for key frames. CLIS-TR-97-15, College Park, MD: Digital Library Research Group (http://www.glue.umd.edu/~dlrg/).Google Scholar
- Marchionini, G. (1995). Information seeking in electronic environments. Cambridge Series on Human Computer Interaction. Cambridge University Press: New York. Google ScholarDigital Library
- O'Connor, B. C. (1991). Selecting key frames of moving image documents: A digital environment for analysis and navigation. Microcomputers for Information Management, 8(2), 119--133.Google Scholar
- Potter, M. C. (1976). Short-term conceptual memory for pictures. Journal of Experimental Psychology: Human Learning and Memory, 2(5), 509--522.Google ScholarCross Ref
- Slaughter, L., B. Shneiderman, and G. Marchionini. (1997). Comprehension and object recognition capabilities for presentations of simulataneous video key frame surrogates. In Peters C. and C. Thanos (Eds.), Research and Advanced Technology for Digital Libraries: Proceedings of the First European Conference (pp. 41--54). ECDL'97, Pisa, Italy. Google ScholarDigital Library
- Teodosio, L. and W. Bender. (1993). Salient stills from video. Proceedings of ACM Multimedia '93, Anaheim, CA: 39--46. Google ScholarDigital Library
- Watclar, H. D., T. Kanade, M. A. Smith, and S. M. Stevens. (1996). Intelligent access to digital video: Informedia project. Computer, 29(5), 46--52. Google ScholarDigital Library
- Wickens, C. D. (1992). Engineering psychology and human performance. Second Edition. New York: HarperCollins.Google Scholar
- Yeung, M. M, B. L. Yeo, W. Wolf, and B. Liu. (1995). Video browsing using clustering and scene transition on compressed sequences. Proceedings of Multimedia Computing and Networking, San Jose.Google ScholarCross Ref
- Yow, D, B. L. Yeo, M. M. Yeung, and B. Liu. (1995). Analysis and presentation of soccer highlights from digital video. Proceedings of the Second Asian Conference on Computer Vision (ACCV '95).Google Scholar
- Zhang, H. J., C. Y. Low, and S. W. Smoliar. (1995). Video parsing and browsing using compressed data. Multimedia Tools and Applications, 1, 89--111. Google ScholarDigital Library
- Dynamic key frame presentation techniques for augmenting video browsing
Recommendations
Video Interaction on Tablet Computers: Browsing with Pinch Gesture and Pen Tilt
MUM '22: Proceedings of the 21st International Conference on Mobile and Ubiquitous MultimediaDigital video technologies have been changing dramatically in the field of media production but reduced the tangibility of working with film. One interesting approach to get back such tangibility in digital systems is using pen and touch interactions. ...
Interactive video browsing on mobile devices
MM '07: Proceedings of the 15th ACM international conference on MultimediaToday, videos can be replayed on modern handheld devices, such as multimedia cellphones and personal digital assistants (PDAs), due to significant improvements in their processing power. However, screen size remains a limiting resource making it hard, ...
A new interface for video browsing on PDAs
MobileHCI '07: Proceedings of the 9th international conference on Human computer interaction with mobile devices and servicesWe present an interface for interactive video browsing on pen-based handheld devices. Our solution enables users to navigate through a video along the timeline at different granularity levels. In addition, one can skim a file's content by continuous ...
Comments