research-article

Video object annotation, navigation, and composition

Authors:
Dan B. Goldman, Adobe Systems, Inc., Seattle, WA, USA
Chris Gonterman, University of Washington, Seattle, WA, USA
Brian Curless, University of Washington, Seattle, WA, USA
David Salesin, Adobe Systems, Inc., and University of Washington, Seattle, WA, USA
Steven M. Seitz, University of Washington, Seattle, WA, USA

UIST '08: Proceedings of the 21st annual ACM symposium on User interface software and technology, October 2008, Pages 3–12
https://doi.org/10.1145/1449715.1449719

Published: 19 October 2008

ABSTRACT

We explore the use of tracked 2D object motion to enable novel approaches to interacting with video. These include moving annotations, video navigation by direct manipulation of objects, and creating an image composite from multiple video frames. Features in the video are automatically tracked and grouped in an off-line preprocess that enables later interactive manipulation. Examples of annotations include speech and thought balloons, video graffiti, path arrows, video hyperlinks, and schematic storyboards. We also demonstrate a direct-manipulation interface for random frame access using spatial constraints, and a drag-and-drop interface for assembling still images from videos. Taken together, our tools can be employed in a variety of applications including film and video editing, visual tagging, and authoring rich media such as hyperlinked video.
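
As an illustration of the direct-manipulation navigation idea, the sketch below picks the frame in which a tracked object lies closest to the cursor position during a drag. It is a minimal example under stated assumptions, not the method used in the paper: the function name `nearest_frame`, the `track` dictionary of per-frame positions, and the `time_weight` penalty are all hypothetical, and the abstract does not specify how the spatial constraints are actually solved.

```python
import math


def nearest_frame(track, cursor, current_frame=None, time_weight=0.0):
    """Return the frame whose tracked position is closest to the cursor.

    track:         dict mapping frame index -> (x, y) object position
                   (assumed output of an off-line tracking preprocess)
    cursor:        (x, y) position of the mouse during the drag
    current_frame: frame currently displayed, or None
    time_weight:   hypothetical penalty per frame of temporal distance,
                   used to prefer nearby frames when the path revisits a spot
    """
    best_frame, best_cost = None, math.inf
    for frame, (x, y) in sorted(track.items()):
        # Spatial term: Euclidean distance from the cursor to the object.
        cost = math.hypot(x - cursor[0], y - cursor[1])
        # Optional temporal term: discourage large jumps in time.
        if current_frame is not None:
            cost += time_weight * abs(frame - current_frame)
        if cost < best_cost:
            best_frame, best_cost = frame, cost
    return best_frame


# Hypothetical trajectory of one tracked object over four frames.
track = {0: (10.0, 50.0), 1: (20.0, 50.0), 2: (30.0, 52.0), 3: (40.0, 55.0)}
frame = nearest_frame(track, cursor=(31.0, 51.0))
```

Here `frame` is the index to display while the user holds the object at the cursor position. The temporal penalty is one possible way to disambiguate trajectories that pass through the same screen location more than once; setting `time_weight` to zero reduces the lookup to a pure nearest-position search.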

Supplemental Material

p3-goldman.mov (MOV, 45.6 MB)

Index Terms

Video object annotation, navigation, and composition
1. Human-centered computing
  1. Human computer interaction (HCI)
    1. Interaction paradigms

Published in
UIST '08: Proceedings of the 21st annual ACM symposium on User interface software and technology
October 2008, 308 pages
ISBN: 9781595939753
DOI: 10.1145/1449715
General Chair: Steve Cousins, Willow Garage, USA
Program Chair: Michel Beaudouin-Lafon, Université Paris-Sud, France
Copyright © 2008 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
direct manipulation
video annotation
video interaction
video navigation
Qualifiers
- research-article
Acceptance Rates
Overall Acceptance Rate: 842 of 3,967 submissions, 21%