skip to main content
10.1145/1449715.1449719acmconferencesArticle/Chapter ViewAbstractPublication PagesuistConference Proceedingsconference-collections
research-article

Video object annotation, navigation, and composition

Published:19 October 2008Publication History

ABSTRACT

We explore the use of tracked 2D object motion to enable novel approaches to interacting with video. These include moving annotations, video navigation by direct manipulation of objects, and creating an image composite from multiple video frames. Features in the video are automatically tracked and grouped in an off-line preprocess that enables later interactive manipulation. Examples of annotations include speech and thought balloons, video graffiti, path arrows, video hyperlinks, and schematic storyboards. We also demonstrate a direct-manipulation interface for random frame access using spatial constraints, and a drag-and-drop interface for assembling still images from videos. Taken together, our tools can be employed in a variety of applications including film and video editing, visual tagging, and authoring rich media such as hyperlinked video.

Skip Supplemental Material Section

Supplemental Material

p3-goldman.mov

mov

45.6 MB

References

  1. A. Agarwala, M. Dontcheva, M. Agrawala, S. Drucker, A. Colburn, B. Curless, D. Salesin, and M. Cohen. Interactive digital photomontage. ACM Trans. Graph. (Proc. SIGGRAPH), 23(4):294--301, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Agarwala, A. Hertzmann, D. H. Salesin, and S. M. Seitz. Keyframe-based tracking for rotoscoping and animation. ACM Trans. Graph. (Proc. SIGGRAPH), 23(3):584--591, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. ASTAR Learning Systems. http://www.astarls.com, 2006. {Online; accessed 5-January-2008}.Google ScholarGoogle Scholar
  4. C. Bregler, M. Covell, and M. Slaney. Video rewrite: Driving visual speech with audio. In Proc. SIGGRAPH 97, pages 353--360, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Y.-Y. Chuang, A. Agarwala, B. Curless, D. H. Salesin, and R. Szeliski. Video matting of complex scenes. ACM Trans. Graph., 21(3):243--248, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. J. Dakss, S. Agamanolis, E. Chalom, and V. M. Bove Jr. Hyperlinked video. In Proc. SPIE, volume 3528, pages 2--10, 1999.Google ScholarGoogle Scholar
  7. P. Dragicevic, G. Ramos, J. Bibliowicz, D. Nowrouzezahrai, R. Balakrishnan, and K. Singh. Video browsing by direct manipulation. In CHI, pages 237--246, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. D. B Goldman, B. Curless, S. M. Seitz, and D. Salesin. Schematic storyboarding for video visualization and editing. ACM Trans. Graph. (Proc. SIGGRAPH), 25(3):862--871, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. J. Goldman. Kind of a Blur. http://phobos.apple.com/WebObjects/MZStore.woa/wa/viewMovie?id=197994758, 2005. {Short film available online; accessed 5-January-2008}.Google ScholarGoogle Scholar
  10. R. I. Hartley and A. Zisserman. Multiple View Geometry in Computer Vision, page 109. Cambridge University Press, ISBN: 0521540518, second edition, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. T. Karrer, M. Weiss, E. Lee, and J. Borchers. DRAGON: A direct manipulation interface for frame-accurate in-scene video navigation. In CHI, pages 247--250, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. D. Kimber, T. Dunnigan, A. Girgensohn, F. Shipman, T. Turner, and T. Yang. Trailblazing: Video playback control by direct object manipulation. In ICME, pages 1015--1018, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  13. D. Kurlander, T. Skelly, and D. Salesin. Comic chat. In SIGGRAPH '96, pages 225--236, 1996. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Y. Li, J. Sun, and H.-Y. Shum. Video object cut and paste. ACM Trans. Graph. (Proc. SIGGRAPH), 24(3):595--600, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. C. Morningstar and R. F. Farmer. The lessons of Lucasfilm's Habitat. In M. Benedikt, editor, Cyberspace: First Steps, pages 273--301. MIT Press, Cambridge, MA, 1991. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Y. Pritch, A. Rav-Acha, A. Gutman, and S. Peleg. Webcam synopsis: Peeking around the world. In Proc. ICCV, pages 1--8, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  17. Y. Pritch, A. Rav-Acha, and S. Peleg. Non-chronological video synopsis and indexing. IEEE Trans. PAMI, 2008. (to appear). Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. G. Ramos and R. Balakrishnan. Fluid interaction techniques for the control and annotation of digital video. In Proc. UIST '03, pages 105--114, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. A. Rav-Acha, Y. Pritch, D. Lischinski, and S. Peleg. Dynamosaicing: Video mosaics with non-chronological time. In Proc. CVPR, pages 58--65, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. A. Rav-Acha, Y. Pritch, and S. Peleg. Making a long video short: Dynamic video synopsis. In Proc. CVPR, pages 435--441, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. E. Rosten, G. Reitmayr, and T. Drummond. Real-time video annotations for augmented reality. In Proc. Intl. Symp. on Visual Computing, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. P. Sand. Long-Range Video Motion Estimation using Point Trajectories. PhD thesis, MIT, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. P. Sand and S. Teller. Particle video: Long-range motion estimation using point trajectories. In Proc. CVPR '06, pages 2195--2202, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. A. Schödl and I. A. Essa. Controlled animation of video sprites. In Proc. ACM/Eurographics Symp. on Comp. Animation, pages 121--127, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. A. Schödl, R. Szeliski, D. H. Salesin, and I. Essa. Video textures. In SIGGRAPH '00, pages 489--498, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. J. Sivic, F. Schaffalitzky, and A. Zisserman. Object level grouping for video shots. Intl. J. of Comp. Vis., 67(2):189--210, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. J. M. Smith, D. Stotts, and S.-U. Kum. An orthogonal taxonomy for hyperlink anchor generation in video streams using OvalTine. In Proc. ACM Conf. on Hypertext and Hypermedia, pages 11--18, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Sportvision. Changing The Game. http://www.sportvision.com, 2006. {Online; accessed 5-January-2008}.Google ScholarGoogle Scholar
  29. J. Wang, P. Bhat, R. A. Colburn, M. Agrawala, and M. F. Cohen. Interactive video cutout. ACM Trans. Graph. (Proc. SIGGRAPH), 24(3):585--594, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Wikipedia. Telestrator. http://en.wikipedia.org/w/index.php?title=Telestrator&oldid=180785495, 2006. {Online; accessed 5-January-2008}.Google ScholarGoogle Scholar
  31. A. Yilmaz, O. Javed, and M. Shah. Object tracking: A survey. ACM Computing Surveys, 38(4):13, December 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. L. Zhang, N. Snavely, B. Curless, and S. Seitz. Spacetime faces: high resolution capture for modeling and animation. ACM Trans. Graph. (Proc. SIGGRAPH), 23(3):548--558, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Video object annotation, navigation, and composition

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      UIST '08: Proceedings of the 21st annual ACM symposium on User interface software and technology
      October 2008
      308 pages
      ISBN:9781595939753
      DOI:10.1145/1449715

      Copyright © 2008 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 19 October 2008

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate842of3,967submissions,21%

      Upcoming Conference

      UIST '24

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader