Skip to main content
Erschienen in: International Journal of Multimedia Information Retrieval 1/2012

01.04.2012 | Invited Paper

Large-scale near-duplicate image retrieval by kernel density estimation

verfasst von: Wei Tong, Fengjie Li, Rong Jin, Anil Jain

Erschienen in: International Journal of Multimedia Information Retrieval | Ausgabe 1/2012

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Bag-of-words model is one of the most widely used methods in the recent studies of multimedia data retrieval. The key idea of the bag-of-words model is to quantize the bag of local features, for example SIFT, to a histogram of visual words and then standard information retrieval technologies developed from text retrieval can be applied directly. Despite its success, one problem of the bag-of-words model is that the two key steps, i.e., feature quantization and retrieval, are separated. In other words, the step of generating bag-of-words representation is not optimized for the step of retrieval which often leads to a sub-optimal performance. In this paper we propose a statistical framework for large-scale near-duplication image retrieval which unifies the two steps by introducing kernel density function. The central idea of the proposed method is to represent each image by a kernel density function and the similarity between the query image and a database image is then estimated as the query likelihood. In order to make the proposed method applicable to large-scale data sets, we have developed efficient algorithms for both estimating the density function of each image and computing the query likelihood. Our empirical studies confirm that the proposed method is not only more effective but also more efficient than the bag-of-words model.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Boughorbel S, Tarel JP, Fleuret F (2004) Non-mercer kernels for svm object recognition. In: BMVC Boughorbel S, Tarel JP, Fleuret F (2004) Non-mercer kernels for svm object recognition. In: BMVC
2.
Zurück zum Zitat Carson C, Belongie S, Greenspan H, Malik J (1997) Region-based image querying. In: Proceedings of IEEE workshop on content-based access of image and video libraries, pp 42–49 Carson C, Belongie S, Greenspan H, Malik J (1997) Region-based image querying. In: Proceedings of IEEE workshop on content-based access of image and video libraries, pp 42–49
3.
Zurück zum Zitat Csurka G, Dance C, Fan L, Willamowski J, Bray C (2004) Visual categorization with bags of keypoints. In: Workshop on statistical learning in computer vision Csurka G, Dance C, Fan L, Willamowski J, Bray C (2004) Visual categorization with bags of keypoints. In: Workshop on statistical learning in computer vision
4.
Zurück zum Zitat Datar M, Immorlica N, Indyk P, Mirrokni VS (2004) Locality-sensitive hashing scheme based on p-stable distributions. In: Proceedings of the twentieth annual symposium on computational geometry Datar M, Immorlica N, Indyk P, Mirrokni VS (2004) Locality-sensitive hashing scheme based on p-stable distributions. In: Proceedings of the twentieth annual symposium on computational geometry
5.
Zurück zum Zitat Eakins J, Graham M (1999) Content-based image retrieval. Tech. Rep. JTAP-039, Institute for Image Data Research, University of Northumbria Newcastle Eakins J, Graham M (1999) Content-based image retrieval. Tech. Rep. JTAP-039, Institute for Image Data Research, University of Northumbria Newcastle
6.
Zurück zum Zitat Fei-Fei L, Perona P (2005) A bayesian hierarchical model for learning natural scene categories. In: CVPR Fei-Fei L, Perona P (2005) A bayesian hierarchical model for learning natural scene categories. In: CVPR
7.
Zurück zum Zitat Felzenszwalb PF, Huttenlocher DP (2003) Pictorial structures for object recognition. In: IJCV Felzenszwalb PF, Huttenlocher DP (2003) Pictorial structures for object recognition. In: IJCV
8.
Zurück zum Zitat Gionis A, Indyk P, Motwani R (1999) Similarity search in high dimensions via hashing. In: VLDB Gionis A, Indyk P, Motwani R (1999) Similarity search in high dimensions via hashing. In: VLDB
9.
Zurück zum Zitat Grauman K, Darrell T (2006) Approximate correspondences in high dimensions. In: NIPSnewpage Grauman K, Darrell T (2006) Approximate correspondences in high dimensions. In: NIPSnewpage
10.
Zurück zum Zitat Grauman K, Darrell T (2007) Pyramid match hashing: Sub-linear time indexing over partial correspondences. In: CVPR Grauman K, Darrell T (2007) Pyramid match hashing: Sub-linear time indexing over partial correspondences. In: CVPR
11.
Zurück zum Zitat Grauman K, Darrell T, Perona P (2007) The pyramid match kernel: efficient learning with sets of features. J Mach Learn Res 8:725–760MATH Grauman K, Darrell T, Perona P (2007) The pyramid match kernel: efficient learning with sets of features. J Mach Learn Res 8:725–760MATH
12.
Zurück zum Zitat Hirata K, Kato T (1992) Query by visual example—content based image retrieval. In: Third international conference on extending database technology Hirata K, Kato T (1992) Query by visual example—content based image retrieval. In: Third international conference on extending database technology
13.
Zurück zum Zitat Jegou H, Douze M, Schmid C (2008) Hamming embedding and weak geometric consistency for large scale image search. In: ECCV Jegou H, Douze M, Schmid C (2008) Hamming embedding and weak geometric consistency for large scale image search. In: ECCV
14.
Zurück zum Zitat Kaplan LM, Murenz IR, Namuduri KR (1998) Fast texture database retrieval using extended fractal features. In: Storage and retrieval for image and video databases VI, pp 162–173 Kaplan LM, Murenz IR, Namuduri KR (1998) Fast texture database retrieval using extended fractal features. In: Storage and retrieval for image and video databases VI, pp 162–173
15.
Zurück zum Zitat Ke Y, Sukthankar R, Huston L (2004) Efficient near-duplicate detection and sub-image retrieval. In: ACM Multimedia Ke Y, Sukthankar R, Huston L (2004) Efficient near-duplicate detection and sub-image retrieval. In: ACM Multimedia
16.
Zurück zum Zitat Kivinen J, Sudderth E, Jordan M (2007) Learning multiscale representations of natural scenes using dirichlet processes. In: ICCV Kivinen J, Sudderth E, Jordan M (2007) Learning multiscale representations of natural scenes using dirichlet processes. In: ICCV
17.
Zurück zum Zitat Kondor RI, Jebara T (2003) A kernel between sets of vectors. In: ICML Kondor RI, Jebara T (2003) A kernel between sets of vectors. In: ICML
18.
Zurück zum Zitat Lazebnik S et al. (2003) A sparse texture representation using affine-invariant regions. In: CVPR Lazebnik S et al. (2003) A sparse texture representation using affine-invariant regions. In: CVPR
19.
Zurück zum Zitat Lepetit V, Lagger P, Fua P (2005) Randomized trees for real-time keypoint recognition. In: CVPR Lepetit V, Lagger P, Fua P (2005) Randomized trees for real-time keypoint recognition. In: CVPR
20.
Zurück zum Zitat Li F, Tong W, Jin R, Jain A (2009) An efficient key point quantization algorithm for large scale image retrieval. In: ACM multimedia international conference workshop on large-scale multimedia retrieval and mining Li F, Tong W, Jin R, Jain A (2009) An efficient key point quantization algorithm for large scale image retrieval. In: ACM multimedia international conference workshop on large-scale multimedia retrieval and mining
22.
Zurück zum Zitat Liu T, Moore A, Gray A, Yang K (2004) An investigation of practical approximate nearest neighbor algorithms. In: NIPS Liu T, Moore A, Gray A, Yang K (2004) An investigation of practical approximate nearest neighbor algorithms. In: NIPS
24.
Zurück zum Zitat Lyu S (2005) Mercer kernels for object recognition with local features. In: CVPR Lyu S (2005) Mercer kernels for object recognition with local features. In: CVPR
25.
Zurück zum Zitat Ma WY, Manjunath BS (1998) A texture thesaurus for browsing large aerial photographs. J Am Soc Inf Sci 49(7):633–648 Ma WY, Manjunath BS (1998) A texture thesaurus for browsing large aerial photographs. J Am Soc Inf Sci 49(7):633–648
26.
Zurück zum Zitat Mallapragada PK, Jin R, Jain AK (2010) Online visual vocabulary pruning using pairwise constraints. In: CVPR Mallapragada PK, Jin R, Jain AK (2010) Online visual vocabulary pruning using pairwise constraints. In: CVPR
27.
Zurück zum Zitat Manning CD, Raghavan P, Schntze H (2008) Introduction to information retrieval. Cambridge University Press, CambridgeMATH Manning CD, Raghavan P, Schntze H (2008) Introduction to information retrieval. Cambridge University Press, CambridgeMATH
29.
Zurück zum Zitat Moon H, Phillips PJ (2001) Computational and performance aspects of PCA-based face-recognition algorithms. Perception 30(3):303–321 Moon H, Phillips PJ (2001) Computational and performance aspects of PCA-based face-recognition algorithms. Perception 30(3):303–321
30.
Zurück zum Zitat Moreno PJ et al (2003) A Kullback-Leibler divergence based kernel for svm classification in multimedia applications. In: NIPS Moreno PJ et al (2003) A Kullback-Leibler divergence based kernel for svm classification in multimedia applications. In: NIPS
31.
Zurück zum Zitat Muja M, Lowe DG (2009) Fast approximate nearest neighbors with automatic algorithm configuration. In: International conference on computer vision theory and applications Muja M, Lowe DG (2009) Fast approximate nearest neighbors with automatic algorithm configuration. In: International conference on computer vision theory and applications
32.
Zurück zum Zitat Niblack CW, Barber R, Equitz W, Flickner MD, Glasman EH, Petkovic D, Yanker P, Faloutsos C, Taubin G (1993) The qbic project: querying images by color, texture and shape. Tech. Rep. RJ-9203, IBM Research Niblack CW, Barber R, Equitz W, Flickner MD, Glasman EH, Petkovic D, Yanker P, Faloutsos C, Taubin G (1993) The qbic project: querying images by color, texture and shape. Tech. Rep. RJ-9203, IBM Research
33.
Zurück zum Zitat Nister D, Stewenius H (2006) Scalable recognition with a vocabulary tree. In: CVPR Nister D, Stewenius H (2006) Scalable recognition with a vocabulary tree. In: CVPR
34.
Zurück zum Zitat Parzen E (1962) On estimation of a probability density function and mode. Ann Math Stat 33(3):1065–1076 Parzen E (1962) On estimation of a probability density function and mode. Ann Math Stat 33(3):1065–1076
35.
Zurück zum Zitat Pavlidis T (2008) Limitations of cbir. In: ICPR Pavlidis T (2008) Limitations of cbir. In: ICPR
36.
Zurück zum Zitat Perdoch M et al (2009) Efficient representation of local geometry for large scale object retrieval. In: CVPR Perdoch M et al (2009) Efficient representation of local geometry for large scale object retrieval. In: CVPR
37.
Zurück zum Zitat Perronnin F, Dance C, Csurka G, Bressian M (2006) Adopted vocabularies for generic visual categorization. In: ECCV Perronnin F, Dance C, Csurka G, Bressian M (2006) Adopted vocabularies for generic visual categorization. In: ECCV
38.
Zurück zum Zitat Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2007) Object retrieval with large vocabularies and fast spatial matching. In: CVPR Philbin J, Chum O, Isard M, Sivic J, Zisserman A (2007) Object retrieval with large vocabularies and fast spatial matching. In: CVPR
39.
Zurück zum Zitat Robertson SE, Walker S, Hancock-Beaulieu M (1998) Okapi at trec-7. In: Proceedings of the seventh text retrieval conference Robertson SE, Walker S, Hancock-Beaulieu M (1998) Okapi at trec-7. In: Proceedings of the seventh text retrieval conference
40.
Zurück zum Zitat Schmid C (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: CVPR Schmid C (2006) Beyond bags of features: spatial pyramid matching for recognizing natural scene categories. In: CVPR
41.
Zurück zum Zitat Sebe N, Lew MS (2001) Color-based retrieval. Pattern Recognit Lett 22(2):223–230 Sebe N, Lew MS (2001) Color-based retrieval. Pattern Recognit Lett 22(2):223–230
42.
Zurück zum Zitat Shakhnarovich G, Darrell T, Indyk P (2006) Nearest-neighbor methods in learning and vision: theory and practice. MIT Press, Cambridge Shakhnarovich G, Darrell T, Indyk P (2006) Nearest-neighbor methods in learning and vision: theory and practice. MIT Press, Cambridge
43.
Zurück zum Zitat Silpa-Anan C, Hartley R (2008) Optimised kd-trees for fast image descriptor matching. In: CVPR Silpa-Anan C, Hartley R (2008) Optimised kd-trees for fast image descriptor matching. In: CVPR
44.
Zurück zum Zitat Sivic J, Zisserman A (2003) Video Google: A text retrieval approach to object matching in videos. In: ICCV Sivic J, Zisserman A (2003) Video Google: A text retrieval approach to object matching in videos. In: ICCV
45.
Zurück zum Zitat Stricker M, Orengo M (1995) Similarity of color images. In: Proceedings of SPIE, vol 2420, pp 381–392 Stricker M, Orengo M (1995) Similarity of color images. In: Proceedings of SPIE, vol 2420, pp 381–392
46.
Zurück zum Zitat Swain MJ, Ballard DH (1991) Color indexing. Int J Comput Vis 7:11–32 Swain MJ, Ballard DH (1991) Color indexing. Int J Comput Vis 7:11–32
47.
Zurück zum Zitat Tamura H, Mori S, Yamawaki T (1978) Textural features corresponding to visual perception. IEEE Trans Syst Man Cybern Tamura H, Mori S, Yamawaki T (1978) Textural features corresponding to visual perception. IEEE Trans Syst Man Cybern
48.
Zurück zum Zitat Tirilly P, Claveau V, Gros P (2008) Language modeling for bag-of-visual words image categorization. In: CIVR Tirilly P, Claveau V, Gros P (2008) Language modeling for bag-of-visual words image categorization. In: CIVR
49.
Zurück zum Zitat Viitaniemi V, Laaksonen J (2009) Spatial extensions to bag of visual words. In: Proceeding of the ACM international conference on image and video retrieval Viitaniemi V, Laaksonen J (2009) Spatial extensions to bag of visual words. In: Proceeding of the ACM international conference on image and video retrieval
50.
Zurück zum Zitat Wallraven C, Caputo B, Graf A (2003) Recognition with local features: the kernel recipe. In: CVPR Wallraven C, Caputo B, Graf A (2003) Recognition with local features: the kernel recipe. In: CVPR
51.
Zurück zum Zitat Winn J, Criminisi A, Minka T (2005) Object categorization by learned universal visual dictionary. In: ICCV Winn J, Criminisi A, Minka T (2005) Object categorization by learned universal visual dictionary. In: ICCV
52.
Zurück zum Zitat Wu L, Li M, Li Z, Ma W, Yu N (2007) Visual language modeling for image classification. In: CIVR Wu L, Li M, Li Z, Ma W, Yu N (2007) Visual language modeling for image classification. In: CIVR
53.
Zurück zum Zitat Wu Z et al (2009) Bundling features for large scale partial-duplicate web image search. In: CVPR Wu Z et al (2009) Bundling features for large scale partial-duplicate web image search. In: CVPR
54.
Zurück zum Zitat Yuan J, Wu Y, Yang M (2007) Discovery of collocation patterns: from visual words to visual phrases. In: CVPR Yuan J, Wu Y, Yang M (2007) Discovery of collocation patterns: from visual words to visual phrases. In: CVPR
55.
Zurück zum Zitat Zhang Y, Chen T (2009) Effcient kernels for identifying unbounded-order spatial features. In CVPR Zhang Y, Chen T (2009) Effcient kernels for identifying unbounded-order spatial features. In CVPR
56.
Zurück zum Zitat Zhang Y, Jia Z, Chen T (2011) Image retrieval with geometry-preserving visual phrases. In: CVPR Zhang Y, Jia Z, Chen T (2011) Image retrieval with geometry-preserving visual phrases. In: CVPR
Metadaten
Titel
Large-scale near-duplicate image retrieval by kernel density estimation
verfasst von
Wei Tong
Fengjie Li
Rong Jin
Anil Jain
Publikationsdatum
01.04.2012
Verlag
Springer-Verlag
Erschienen in
International Journal of Multimedia Information Retrieval / Ausgabe 1/2012
Print ISSN: 2192-6611
Elektronische ISSN: 2192-662X
DOI
https://doi.org/10.1007/s13735-012-0012-6

Weitere Artikel der Ausgabe 1/2012

International Journal of Multimedia Information Retrieval 1/2012 Zur Ausgabe