Dynamic key frame presentation techniques for augmenting video browsing

Authors:
Tony Tse

University of Maryland, College Park, MD

University of Maryland, College Park, MD
View Profile

,
Gary Marchionini

University of Maryland, College Park, MD

University of Maryland, College Park, MD
View Profile

,
Wei Ding

University of Maryland, College Park, MD

University of Maryland, College Park, MD
View Profile

,
Laura Slaughter

University of Maryland, College Park, MD

University of Maryland, College Park, MD
View Profile

,
Anita Komlodi

University of Maryland, College Park, MD

University of Maryland, College Park, MD
View Profile

AVI '98: Proceedings of the working conference on Advanced visual interfacesMay 1998Pages 185–194https://doi.org/10.1145/948496.948522

Published:24 May 1998Publication History

AVI '98: Proceedings of the working conference on Advanced visual interfaces

Pages 185–194

ABSTRACT

Because of unique temporal and spatial properties of video data, different techniques for summarizing videos have been proposed. Key frames extracted directly from video inform users about content without requiring them to view the entire video. As part of ongoing work to develop video browsing interfaces, several interface displays based on key frames were investigated. Variations on dynamic key frame "slide shows" were examined and compared to a static key frame "filmstrip" display. The slide show mechanism displays key frames in rapid succession and is designed to facilitate visual browsing by exploiting human perceptual capabilities. User studies were conducted in a series of three experiments. Key frame display rate, number of simultaneous displays, and user perception were investigated as a function of user performance in object recognition and gist determination tasks. No significant performance degradation was detected at display rates up to 8 key frames per second, but performance degraded significantly at higher rates. Performance on gist determination tasks degraded less severely than performance on object recognition tasks as display rates increased. Furthermore, gist determination performance dropped significantly between three and four simultaneous slide shows in a single display. Users generally preferred key frame filmstrips to dynamic displays, although objective measures of performance were mixed. Implications for visual interface design and further questions for future research are provided.

References

Beauchemin, S. S. and J. L. Barron. (1995). The computation of optical flow. ACM Computing Surveys, 27(3): 433--466. Google ScholarDigital Library
Boyce, S. J., A. Pollatsek, and K. Rayner. (1989). Effect of background information on object identification. Journal of Experimental Psychology: Human Perception and Performance, 15(3): 556--566.Google ScholarCross Ref
Christel, M. G., D. B. Winkler, and C. R. Taylor. (1997). Improving access to a digital video library. In Human-Computer Interaction: INTERACT97, Sydney Australia. Google ScholarDigital Library
Ding, W., G. Marchionini, and T. Tse. (1997). Previewing video data: Browsing key frames at high rates using a video slide show interface. Proceedings of the International Symposium on Research, Development, and Practice in Digital Libraries (ISDL '97), Tsukuba, Japan.Google Scholar
Ellis, H. C. and R. R. Hunt. (1989). Fundamentals of human memory and cognition. Wm. C. Brown Publishers: Dubuque, IA.Google Scholar
Elliot, E. (1993). Watch, grab, arrange, see: Thinking with motion images via streams and collages. MSVS Thesis Document. Cambridge, MA: MIT Media Lab.Google Scholar
Healey, C. G., K. S. Booth, and J. T. Enns. (1996). Highspeed visual estimation using pre-attentive processing. ACM Transactions on Computer-Human Interaction, 3(2), 107--135. Google ScholarDigital Library
Komlodi, A. (1997). Visual surrogates for motion picture documents: Presentation techniques for key frames. CLIS-TR-97-15, College Park, MD: Digital Library Research Group (http://www.glue.umd.edu/~dlrg/).Google Scholar
Marchionini, G. (1995). Information seeking in electronic environments. Cambridge Series on Human Computer Interaction. Cambridge University Press: New York. Google ScholarDigital Library
O'Connor, B. C. (1991). Selecting key frames of moving image documents: A digital environment for analysis and navigation. Microcomputers for Information Management, 8(2), 119--133.Google Scholar
Potter, M. C. (1976). Short-term conceptual memory for pictures. Journal of Experimental Psychology: Human Learning and Memory, 2(5), 509--522.Google ScholarCross Ref
Slaughter, L., B. Shneiderman, and G. Marchionini. (1997). Comprehension and object recognition capabilities for presentations of simulataneous video key frame surrogates. In Peters C. and C. Thanos (Eds.), Research and Advanced Technology for Digital Libraries: Proceedings of the First European Conference (pp. 41--54). ECDL'97, Pisa, Italy. Google ScholarDigital Library
Teodosio, L. and W. Bender. (1993). Salient stills from video. Proceedings of ACM Multimedia '93, Anaheim, CA: 39--46. Google ScholarDigital Library
Watclar, H. D., T. Kanade, M. A. Smith, and S. M. Stevens. (1996). Intelligent access to digital video: Informedia project. Computer, 29(5), 46--52. Google ScholarDigital Library
Wickens, C. D. (1992). Engineering psychology and human performance. Second Edition. New York: HarperCollins.Google Scholar
Yeung, M. M, B. L. Yeo, W. Wolf, and B. Liu. (1995). Video browsing using clustering and scene transition on compressed sequences. Proceedings of Multimedia Computing and Networking, San Jose.Google ScholarCross Ref
Yow, D, B. L. Yeo, M. M. Yeung, and B. Liu. (1995). Analysis and presentation of soccer highlights from digital video. Proceedings of the Second Asian Conference on Computer Vision (ACCV '95).Google Scholar
Zhang, H. J., C. Y. Low, and S. W. Smoliar. (1995). Video parsing and browsing using compressed data. Multimedia Tools and Applications, 1, 89--111. Google ScholarDigital Library

Dynamic key frame presentation techniques for augmenting video browsing
1. Human-centered computing

Recommendations

Video Interaction on Tablet Computers: Browsing with Pinch Gesture and Pen Tilt
MUM '22: Proceedings of the 21st International Conference on Mobile and Ubiquitous Multimedia

Digital video technologies have been changing dramatically in the field of media production but reduced the tangibility of working with film. One interesting approach to get back such tangibility in digital systems is using pen and touch interactions. ...
Read More
Interactive video browsing on mobile devices
MM '07: Proceedings of the 15th ACM international conference on Multimedia

Today, videos can be replayed on modern handheld devices, such as multimedia cellphones and personal digital assistants (PDAs), due to significant improvements in their processing power. However, screen size remains a limiting resource making it hard, ...
Read More
A new interface for video browsing on PDAs
MobileHCI '07: Proceedings of the 9th international conference on Human computer interaction with mobile devices and services

We present an interface for interactive video browsing on pen-based handheld devices. Our solution enables users to navigate through a video along the timeline at different granularity levels. In addition, one can skim a file's content by continuous ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
AVI '98: Proceedings of the working conference on Advanced visual interfaces
May 1998
295 pages
ISBN:9781450374354
DOI:10.1145/948496
Editors:
Tiziana Catarci
Università degli Studi di Roma "La Sapienza", Roma, Italy
,
Maria Francesca Costabile
Università di Bari, Bari, Italy
,
Giuseppe Santucci
Università degli Studi di Roma "La Sapienza", Roma, Italy
,
Laura Taranfino
Università dell'Aquila, L'Aquila, Italy
,
General Chair:
Stefano Levialdi
Copyright © 1998 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 24 May 1998
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
display rate
divided attention
dynamic displays
interface design
key frames
representations
video browsing
Qualifiers
- Article
Conference

Acceptance Rates
Overall Acceptance Rate107of408submissions,26%
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 39
  Total Citations
  View Citations
- 738
  Total Downloads
- Downloads (Last 12 months)37
- Downloads (Last 6 weeks)4
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Dynamic key frame presentation techniques for augmenting video browsing

AVI '98: Proceedings of the working conference on Advanced visual interfaces

ABSTRACT

References

Cited By

Recommendations

Video Interaction on Tablet Computers: Browsing with Pinch Gesture and Pen Tilt

Interactive video browsing on mobile devices

A new interface for video browsing on PDAs