Skip to main content
Erschienen in: International Journal of Multimedia Information Retrieval 3/2013

01.09.2013 | Regular Paper

Searching for images by video

verfasst von: Linjun Yang, Yang Cai, Alan Hanjalic, Xian-Sheng Hua, Shipeng Li

Erschienen in: International Journal of Multimedia Information Retrieval | Ausgabe 3/2013

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Image retrieval based on the query-by-example (QBE) principle is still not reliable enough, largely because of the likely variations in the capture conditions (e.g. light, blur, scale, occlusion) and viewpoint between the query image and the images in the collection. In this paper, we propose a framework in which this problem is explicitly addressed to improve the reliability of QBE-based image retrieval. We aim at the use scenario involving the user capturing the query object by his/her mobile device and requesting information augmenting the query from the database. Reliability improvement is achieved by allowing the user to submit not a single image but a short video clip as a query. Since a video clip may combine object or scene appearances captured from different viewpoints and under different conditions, the rich information contained therein can be exploited to discover the proper query representation and to improve the relevance of the retrieved results. The experimental results show that video-based image retrieval (VBIR) is significantly more reliable than the retrieval using a single image as query. Furthermore, to make the proposed framework deployable in a practical mobile image retrieval system, where realtime query response is required, we also propose the priority queue-based feature description scheme and cache-based bi-quantization algorithm for an efficient parallel implementation of the VBIR concept.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Fußnoten
4
The experiments about the computational cost in this paper are performed on a workstation with two dual-core Intel Xeon 2.67 GHz CPUs and 12 GB memory.
 
Literatur
2.
Zurück zum Zitat Bay H, Tuytelaars T, Van Gool L (2006) SURF: speeded up robust features. In: ECCV Bay H, Tuytelaars T, Van Gool L (2006) SURF: speeded up robust features. In: ECCV
3.
Zurück zum Zitat Bradski G, Kaehler A (2008) Learning openCV: computer vision with the openCV library. O’Reilly, Cambridge Bradski G, Kaehler A (2008) Learning openCV: computer vision with the openCV library. O’Reilly, Cambridge
4.
Zurück zum Zitat Chum O, Philbin J, Sivic J, Isard M, Zisserman A (2007) Total recall: automatic query expansion with a generative feature model for object retrieval. In: CVPR Chum O, Philbin J, Sivic J, Isard M, Zisserman A (2007) Total recall: automatic query expansion with a generative feature model for object retrieval. In: CVPR
5.
Zurück zum Zitat Datta R, Joshi D, Li J, Wang JZ (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Comput Surv 40(2):5:1–5:60 Datta R, Joshi D, Li J, Wang JZ (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Comput Surv 40(2):5:1–5:60
6.
Zurück zum Zitat Heymann S, Muller K, Smolic A, Frohlich B, Wiegand T (2007) SIFT implementation and optimization for general-purpose GPU. In: Proceedings of the international conference in Central Europe on computer graphics, visualization and computer vision Heymann S, Muller K, Smolic A, Frohlich B, Wiegand T (2007) SIFT implementation and optimization for general-purpose GPU. In: Proceedings of the international conference in Central Europe on computer graphics, visualization and computer vision
7.
Zurück zum Zitat Li D, Yang L, Hua XS, Zhang HJ (2010) Large-scale robust visual codebook construction. In: ACM multimedia Li D, Yang L, Hua XS, Zhang HJ (2010) Large-scale robust visual codebook construction. In: ACM multimedia
9.
Zurück zum Zitat Lucas BD, Kanade T (1981) An iterative image registration technique with an application to stereo vision. In: Proceedings of the 1981 DARPA imaging understanding, workshop Lucas BD, Kanade T (1981) An iterative image registration technique with an application to stereo vision. In: Proceedings of the 1981 DARPA imaging understanding, workshop
10.
Zurück zum Zitat Makadia A (2010) Feature tracking for wide-baseline image retrieval. In: ECCV Makadia A (2010) Feature tracking for wide-baseline image retrieval. In: ECCV
11.
Zurück zum Zitat Manning CD, Raghavan P, Schütze H (2008) Introduction to information retrieval, 1 edn. Cambridge University Press, Cambridge Manning CD, Raghavan P, Schütze H (2008) Introduction to information retrieval, 1 edn. Cambridge University Press, Cambridge
12.
Zurück zum Zitat Muja M, Lowe DG (2009) Fast approximate nearest neighbors with automatic algorithm configuration. In: VISSAPP Muja M, Lowe DG (2009) Fast approximate nearest neighbors with automatic algorithm configuration. In: VISSAPP
13.
Zurück zum Zitat Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2007) Object retrieval with large vocabularies and fast spatial matching. In: CVPR Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2007) Object retrieval with large vocabularies and fast spatial matching. In: CVPR
14.
Zurück zum Zitat Sivic J, Schaffalitzky F, Zisserman A (2006) Object level grouping for video shots. Int J Comput Vis 67(2):189–210 Sivic J, Schaffalitzky F, Zisserman A (2006) Object level grouping for video shots. Int J Comput Vis 67(2):189–210
15.
Zurück zum Zitat Sivic J, Zisserman A (2003) Video google: a text retrieval approach to object matching in videos. In: ICCV Sivic J, Zisserman A (2003) Video google: a text retrieval approach to object matching in videos. In: ICCV
16.
Zurück zum Zitat Smeulders AWM, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years. IEEE Trans Patt Anal Mach Intell 22:1349–1380CrossRef Smeulders AWM, Worring M, Santini S, Gupta A, Jain R (2000) Content-based image retrieval at the end of the early years. IEEE Trans Patt Anal Mach Intell 22:1349–1380CrossRef
17.
Zurück zum Zitat Stavens D, Thrun S (2010) Unsupervised learning of invariant features using video. In: CVPR Stavens D, Thrun S (2010) Unsupervised learning of invariant features using video. In: CVPR
18.
Zurück zum Zitat Turcot P, Lowe D (2009) Better matching with fewer features: the selection of useful features in large database recognition problems. In: ICCV workshop (WS-LAVD) Turcot P, Lowe D (2009) Better matching with fewer features: the selection of useful features in large database recognition problems. In: ICCV workshop (WS-LAVD)
19.
Zurück zum Zitat Wagner D, Schmalstieg D, Bischof H (2009) Multiple target detection and tracking with guaranteed framerates on mobile phones. In: ISMAR Wagner D, Schmalstieg D, Bischof H (2009) Multiple target detection and tracking with guaranteed framerates on mobile phones. In: ISMAR
20.
Zurück zum Zitat Wu X, Hauptmann AG, Ngo CW (2007) Practical elimination of near-duplicates from web video search. In: ACM multimedia Wu X, Hauptmann AG, Ngo CW (2007) Practical elimination of near-duplicates from web video search. In: ACM multimedia
21.
Zurück zum Zitat Yang L, Geng B, Hanjalic A, Hua XS (2010) Contextual image retrieval model. In: CIVR Yang L, Geng B, Hanjalic A, Hua XS (2010) Contextual image retrieval model. In: CIVR
Metadaten
Titel
Searching for images by video
verfasst von
Linjun Yang
Yang Cai
Alan Hanjalic
Xian-Sheng Hua
Shipeng Li
Publikationsdatum
01.09.2013
Verlag
Springer London
Erschienen in
International Journal of Multimedia Information Retrieval / Ausgabe 3/2013
Print ISSN: 2192-6611
Elektronische ISSN: 2192-662X
DOI
https://doi.org/10.1007/s13735-012-0023-3

Weitere Artikel der Ausgabe 3/2013

International Journal of Multimedia Information Retrieval 3/2013 Zur Ausgabe