ABSTRACT
Watching a long unedited video is usually a boring experience. In this paper we examine a particular subset of videos, tour videos, in which the video is captured by walking about with a running camera with the goal of conveying the essence of some place. We present a system that makes the process of sharing and watching a long tour video easier, less boring, and more informative. To achieve this, we augment the tour video with a map-based storyboard, where the tour path is reconstructed, and coherent shots at different locations are directly visualized on the map. This allows the viewer to navigate the video in the joint location-time space. To create such a storyboard we employ an automatic pre-processing component to parse the video into coherent shots, and an authoring tool to enable the user to tie the shots with landmarks on the map. The browser-based viewing tool allows users to navigate the video in a variety of creative modes with a rich set of controls, giving each viewer a unique, personal viewing experience. Informal evaluation shows that our approach works well for tour videos compared with conventional media players.
Supplemental Material
- M. Cooper and J. Foote. Scene boundary detection via video self-similarity analysis. Image Processing, 2001. Proceedings. 2001 International Conference on, 3:378--381 vol.3, 2001.Google ScholarCross Ref
- J. Foote, J. Boreczhy, A. Girgensohn, and L. Wilcox. An intelligent media browser using automatic multimodal analysis. In MULTIMEDIA '98: Proceedings of the sixth ACM international conference on Multimedia, pages 375---380, New York, NY, USA, 1998. ACM. Google ScholarDigital Library
- A. Girgensohn, J. Boreczky, P. Chiu, J. Doherty, J. Foote, G. Golovchinsky, S. Uchihashi, and L.Wilcox. A semi-automatic approach to home video editing. In Proceedings of UIST '00, pages 81--89, 2000. Google ScholarDigital Library
- D. R. Goldman. A Framework for Video Annotation, Visualization, and Interaction. PhD thesis, University of Washington, 2007. Section 2.4. Google ScholarDigital Library
- R. Kadobayashi and K. Tanaka. 3d viewpoint-based photo search and information browsing. In Proceedings of SIGIR '05, pages 621--622, 2005. Google ScholarDigital Library
- A. Komlodi and G. Marchionini. Key frame preview techniques for video browsing. In Proc. of DL '98, pages 118--125, New York, NY, USA, 1998. ACM. Google ScholarDigital Library
- Y. Li, T. Zhang, and D. Tretter. An overview of video abstraction techniques. Technical Report HPL-2001-191, HP Laboratory, July 2001.Google Scholar
- R. Lienhart. Abstracting home video automatically. In Proc. of MULTIMEDIA '99, pages 37--40, New York, NY, USA, 1999. ACM. Google ScholarDigital Library
- A. Lippman. Movie-maps: An application of the optical videodisc to computer graphics. SIGGRAPH Comput. Graph., 14(3):32--42, 1980. Google ScholarDigital Library
- D. Lowe. Distinctive image features from scaleinvariant keypoints. International Journal of Computer Vision, 60(2):91--110, 2004. Google ScholarDigital Library
- J. Matas, O. Chum, M. Urban, and T. Pajdla. Robust wide baseline stereo from maximally stable extremal regions. In Proc. of the British Machine Vision Conference, pages 384--393, 2002.Google ScholarCross Ref
- K. Mikolajczyk and C. Schmid. Scale and affine invariant interest point detectors. International Journal of Computer Vision, 60(1):63--86, 2004. Google ScholarDigital Library
- M. Naaman, Y. J. Song, A. Paepcke, and H. Garcia-Molina. Automatic organization for digital photographs with geographic coordinates. In Proceedings of JCDL '04, pages 53--62, 2004. Google ScholarDigital Library
- J. Nam and A. H. Tewfik. Video abstract of video. In Proc. of IEEE 3rdWorkshop on Multimedia Signal Processing, pages 117--122, 1999.Google Scholar
- S. Pfeiffer, R. Lienhart, S. Fischer, and W. Effelsberg. Abstracting digital movies automatically. Technical Report TR-96-005, 1, 1996. Google ScholarDigital Library
- M. A. Smith and T. Kanade. Video skimming and characterization through the combination of image and language understanding. In CAIVD, pages 61--70, 1998. Google ScholarDigital Library
- N. Snavely, S.M. Seitz, and R. Szeliski. Photo tourism: exploring photo collections in 3d. In Proc. of ACMSIGGRAPH 2006, pages 835--846, 2006. Google ScholarDigital Library
- Y. Tonomura, A. Akutsu, K. Otsuji, and T. Sadakata. Videomap and videospaceicon: tools for anatomizing video content. In Proceedings of CHI '93, pages 131--136, 1993. Google ScholarDigital Library
- K. Toyama, R. Logan, and A. Roseway. Geographic location tags on digital images. In Proceedings of MULTIMEDIA '03, pages 156--166, 2003. Google ScholarDigital Library
- B. T. Truong and S. Venkatesh. Video abstraction: A systematic review and classification. ACM Trans. Multimedia Comput. Commun. Appl., 3(1):3, 2007. Google ScholarDigital Library
Index Terms
- Creating map-based storyboards for browsing tour videos
Recommendations
Scanning News Videos With An Interactive Filmstrip
TVX '17 Adjunct: Adjunct Publication of the 2017 ACM International Conference on Interactive Experiences for TV and Online VideoDetermining whether a (news) video is of-interest and what it is about is a time-consuming process. This is a problem when users quickly want to catch up with the latest news and don't spend time to see something they already know or is not of interest ...
Multi-Device Storyboards for Cinematic Narratives in VR
UIST '16: Proceedings of the 29th Annual Symposium on User Interface Software and TechnologyVirtual Reality (VR) narratives have the unprecedented potential to connect with an audience through presence, placing viewers within the narrative. The onset of consumer VR has resulted in an explosion of interest in immersive storytelling. Planning ...
Video browsing using motion visualization
ICME'09: Proceedings of the 2009 IEEE international conference on Multimedia and ExpoWe present a video browsing tool that uses a novel and powerful visualization technique of video motion. The tool provides an interactive navigation index that allows users to quickly and easily recognize content semantics like scenes with fast/slow ...
Comments