Abstract
Due to the rapid proliferation of both user-generated and broadcasted content, the interfaces for search and browsing of visual media have become increasingly important. This paper presents a novel intuitive interactive interface for browsing of large-scale image and video collections. It visualises underlying structure of the dataset by the size and spatial relations of displayed images. In order to achieve this, images or video key-frames are initially clustered using an unsupervised graph-based clustering algorithm. By selecting images that are hierarchically laid out on the screen, user can intuitively navigate through the collection or search for specific content. The extensive experimental results based on user evaluation of photo search, browsing and selection as well as interactive video search demonstrate good usability of the presented system and improvement when compared to the standard methods for interaction with large-scale image and video collections.
Similar content being viewed by others
References
Apple Ltd., iPhoto09 (2009) http://www.apple.com/ilife/iphoto/
Bederson B (2001) PhotoMesa: a zoomable image browser using quantum treemaps and bubblemaps. In: Proceedings of the 14th annual ACM symposium on User interface software and technology, pp 71–80
Calic J, Campbell NW (2007) Compact visualisation of video summaries. EURASIP Journal on Advances in Signal Processing, vol 2007, Article ID 19496
Calic J, Gibson DP, Campbell NW (2007) Efficient layout of comic-like video summaries. IEEE Trans Circuits Syst Video Technol 17(7):931–936
Chong-Wah N, Yu-Fei M, Hong-Jiang Z (2005) Video summarization and scene detection by graph modelling. IEEE Trans Circuits Syst Video Technol 15(2):296–305
Cooper M, Foote J, Girgensohn A, Wilcox L (2003) Temporal event clustering for digital photo collections. In: Proceedings of the eleventh ACM international conference on Multimedia. ACM Press, pp 364–373
Felzenszwalb PF, Huttenlocher DP (2004) Efficient graph-based image segmentation. Int J Comput Vis 59(2):167–181
Ferman M, Tekalp M (1997) Multiscale content extraction and representation for video indexing. Proc SPIE Int Soc Opt Eng 3229:23–31. doi:10.1117/12.290352
Graham A, Garcia-Molina H, Paepcke A, Winograd T (2002) Time as essence for photo browsing through personal digital libraries. In: Proceedings of the second ACM/IEEE-CS joint conference on digital libraries
Huynh DF, Drucker SM, Baudisch P, Wong C (2005) Time quilt: scaling up zoomable photo browsers for large, unstructured photo collections. In: CHI ’05 Extended abstracts on human factors in computing systems, ACM, New York, NY, pp 1937–1940
Jung CG (1971) Psychological types. Princeton University Press, Princeton, New Jersey
Loui A, Savakis AE (2000) Automatic image event segmentation and quality screening for albuming applications. In: IEEE international conference on multimedia and expo
Ren K, Calic J (2009) FreeEye - interactive intuitive interface for large-scale image browsing. In: Proc of ACM multimedia
Ren K, Sarvas R, Calic J (2009) FreeEye - intuitive summarisation of photo collections. In: Proc of ACM multimedia multimedia grand challenge
Toyama K, Logan R, Roseway A (2003) Geographic location tags on digital images. In: Proc of 11th annual ACM international conference on multimedia (MM2003), Berkeley, CA, November 2–8, 2003. ACM Press, New York, NY, pp 156–166
Zhong D, Zhang H, Chang S (1996) Clustering methods for video browsing and annotation. Proc SPIE Int Soc Opt Eng 2670:239. doi:10.1117/12.234800
Zhuang Y, Rui Y, Huang TS, Mehrotra S (1998) Adaptive key frame extraction using unsupervised clustering. Image Processing, 1998. ICIP 98. Proceedings. 1998 International Conference on, vol 1, pp 866–870, 4–7 Oct 1998
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Ren, K., Sarvas, R. & Ćalić, J. Interactive search and browsing interface for large-scale visual repositories. Multimed Tools Appl 49, 513–528 (2010). https://doi.org/10.1007/s11042-009-0445-y
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-009-0445-y