skip to main content
10.1145/948496.948522acmconferencesArticle/Chapter ViewAbstractPublication PagesaviConference Proceedingsconference-collections
Article
Free Access

Dynamic key frame presentation techniques for augmenting video browsing

Published:24 May 1998Publication History

ABSTRACT

Because of unique temporal and spatial properties of video data, different techniques for summarizing videos have been proposed. Key frames extracted directly from video inform users about content without requiring them to view the entire video. As part of ongoing work to develop video browsing interfaces, several interface displays based on key frames were investigated. Variations on dynamic key frame "slide shows" were examined and compared to a static key frame "filmstrip" display. The slide show mechanism displays key frames in rapid succession and is designed to facilitate visual browsing by exploiting human perceptual capabilities. User studies were conducted in a series of three experiments. Key frame display rate, number of simultaneous displays, and user perception were investigated as a function of user performance in object recognition and gist determination tasks. No significant performance degradation was detected at display rates up to 8 key frames per second, but performance degraded significantly at higher rates. Performance on gist determination tasks degraded less severely than performance on object recognition tasks as display rates increased. Furthermore, gist determination performance dropped significantly between three and four simultaneous slide shows in a single display. Users generally preferred key frame filmstrips to dynamic displays, although objective measures of performance were mixed. Implications for visual interface design and further questions for future research are provided.

References

  1. Beauchemin, S. S. and J. L. Barron. (1995). The computation of optical flow. ACM Computing Surveys, 27(3): 433--466. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Boyce, S. J., A. Pollatsek, and K. Rayner. (1989). Effect of background information on object identification. Journal of Experimental Psychology: Human Perception and Performance, 15(3): 556--566.Google ScholarGoogle ScholarCross RefCross Ref
  3. Christel, M. G., D. B. Winkler, and C. R. Taylor. (1997). Improving access to a digital video library. In Human-Computer Interaction: INTERACT97, Sydney Australia. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Ding, W., G. Marchionini, and T. Tse. (1997). Previewing video data: Browsing key frames at high rates using a video slide show interface. Proceedings of the International Symposium on Research, Development, and Practice in Digital Libraries (ISDL '97), Tsukuba, Japan.Google ScholarGoogle Scholar
  5. Ellis, H. C. and R. R. Hunt. (1989). Fundamentals of human memory and cognition. Wm. C. Brown Publishers: Dubuque, IA.Google ScholarGoogle Scholar
  6. Elliot, E. (1993). Watch, grab, arrange, see: Thinking with motion images via streams and collages. MSVS Thesis Document. Cambridge, MA: MIT Media Lab.Google ScholarGoogle Scholar
  7. Healey, C. G., K. S. Booth, and J. T. Enns. (1996). Highspeed visual estimation using pre-attentive processing. ACM Transactions on Computer-Human Interaction, 3(2), 107--135. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Komlodi, A. (1997). Visual surrogates for motion picture documents: Presentation techniques for key frames. CLIS-TR-97-15, College Park, MD: Digital Library Research Group (http://www.glue.umd.edu/~dlrg/).Google ScholarGoogle Scholar
  9. Marchionini, G. (1995). Information seeking in electronic environments. Cambridge Series on Human Computer Interaction. Cambridge University Press: New York. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. O'Connor, B. C. (1991). Selecting key frames of moving image documents: A digital environment for analysis and navigation. Microcomputers for Information Management, 8(2), 119--133.Google ScholarGoogle Scholar
  11. Potter, M. C. (1976). Short-term conceptual memory for pictures. Journal of Experimental Psychology: Human Learning and Memory, 2(5), 509--522.Google ScholarGoogle ScholarCross RefCross Ref
  12. Slaughter, L., B. Shneiderman, and G. Marchionini. (1997). Comprehension and object recognition capabilities for presentations of simulataneous video key frame surrogates. In Peters C. and C. Thanos (Eds.), Research and Advanced Technology for Digital Libraries: Proceedings of the First European Conference (pp. 41--54). ECDL'97, Pisa, Italy. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Teodosio, L. and W. Bender. (1993). Salient stills from video. Proceedings of ACM Multimedia '93, Anaheim, CA: 39--46. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Watclar, H. D., T. Kanade, M. A. Smith, and S. M. Stevens. (1996). Intelligent access to digital video: Informedia project. Computer, 29(5), 46--52. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Wickens, C. D. (1992). Engineering psychology and human performance. Second Edition. New York: HarperCollins.Google ScholarGoogle Scholar
  16. Yeung, M. M, B. L. Yeo, W. Wolf, and B. Liu. (1995). Video browsing using clustering and scene transition on compressed sequences. Proceedings of Multimedia Computing and Networking, San Jose.Google ScholarGoogle ScholarCross RefCross Ref
  17. Yow, D, B. L. Yeo, M. M. Yeung, and B. Liu. (1995). Analysis and presentation of soccer highlights from digital video. Proceedings of the Second Asian Conference on Computer Vision (ACCV '95).Google ScholarGoogle Scholar
  18. Zhang, H. J., C. Y. Low, and S. W. Smoliar. (1995). Video parsing and browsing using compressed data. Multimedia Tools and Applications, 1, 89--111. Google ScholarGoogle ScholarDigital LibraryDigital Library
  1. Dynamic key frame presentation techniques for augmenting video browsing

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      AVI '98: Proceedings of the working conference on Advanced visual interfaces
      May 1998
      295 pages
      ISBN:9781450374354
      DOI:10.1145/948496

      Copyright © 1998 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 24 May 1998

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate107of408submissions,26%

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader