Skip to main content
Erschienen in: International Journal of Multimedia Information Retrieval 4/2016

01.11.2016 | Regular Paper

Improving content-based image retrieval with compact global and local multi-features

verfasst von: Ahmad Alzu’bi, Abbes Amira, Naeem Ramzan, Tareq Jaber

Erschienen in: International Journal of Multimedia Information Retrieval | Ausgabe 4/2016

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The accuracy of content-based image retrieval (CBIR) systems is significantly affected by the discriminatory power of image features and distance measures. This paper performs an investigation towards finding the best local and global features and distance measures for content-based image retrieval. It provides insights into the trade-offs regarding computational costs, memory utilization and accuracy on several standard datasets which include MIRFLICKR, Corel, Holidays and ZuBuD. First, low-dimensional global and local features are extracted individually to generate a bank of small image features. Second, multilevel descriptor forms are utilized to produce highly discriminative image representations based on multi-features aggregation scheme. The relationship is highlighted between features (local and global) and other retrieval factors such as quantization approaches, visual codebooks, distance measures, vectorization methods, memory and retrieval speed. The resulting composite image representations are compact, i.e., only 32–64 vector dimension and 32–128 codebook size, and preserve high discriminative levels which further boost the retrieval accuracy and performance. The experimental results show that the presented multi-features image representations are efficient and outperform many competitive methods of the state-of-the-art.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Alzu’bi A, Amira A, Ramzan N (2015) Semantic content-based image retrieval: a comprehensive study. J Vis Commun Image Represent 32:20–54CrossRef Alzu’bi A, Amira A, Ramzan N (2015) Semantic content-based image retrieval: a comprehensive study. J Vis Commun Image Represent 32:20–54CrossRef
2.
Zurück zum Zitat Li J, Allinson NM (2013) Relevance feedback in content-based image retrieval: a survey. In: Handbook on neural information processing. Springer Berlin, Heidelberg, pp 433–469 Li J, Allinson NM (2013) Relevance feedback in content-based image retrieval: a survey. In: Handbook on neural information processing. Springer Berlin, Heidelberg, pp 433–469
3.
Zurück zum Zitat Datta R, Joshi D, Li J, Wang JZ (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Comput Surv (CSUR) 40(2):5CrossRef Datta R, Joshi D, Li J, Wang JZ (2008) Image retrieval: ideas, influences, and trends of the new age. ACM Comput Surv (CSUR) 40(2):5CrossRef
4.
Zurück zum Zitat Liu Y, Zhang D, Lu G, Ma WY (2007) A survey of content-based image retrieval with high-level semantics. Pattern Recognit 40(1):262–282CrossRefMATH Liu Y, Zhang D, Lu G, Ma WY (2007) A survey of content-based image retrieval with high-level semantics. Pattern Recognit 40(1):262–282CrossRefMATH
5.
Zurück zum Zitat Duanmu X (2010) Image retrieval using color moment invariant. In: The seventh international conference on information technology: new generations (ITNG), 12–14, pp 200–203 Duanmu X (2010) Image retrieval using color moment invariant. In: The seventh international conference on information technology: new generations (ITNG), 12–14, pp 200–203
6.
Zurück zum Zitat Qiu G (2003) Color image indexing using BTC. IEEE Trans Image Process 12(1):93–101CrossRef Qiu G (2003) Color image indexing using BTC. IEEE Trans Image Process 12(1):93–101CrossRef
7.
Zurück zum Zitat Talib A, Mahmuddin M, Husni H, George LE (2013) A weighted dominant color descriptor for content-based image retrieval. J Vis Commun Image Represent 24(3):345–360CrossRef Talib A, Mahmuddin M, Husni H, George LE (2013) A weighted dominant color descriptor for content-based image retrieval. J Vis Commun Image Represent 24(3):345–360CrossRef
8.
Zurück zum Zitat Shrivastava N, Tyagi V (2014) Content based image retrieval based on relative locations of multiple regions of interest using selective regions matching. Inf Sci 259:212–224CrossRef Shrivastava N, Tyagi V (2014) Content based image retrieval based on relative locations of multiple regions of interest using selective regions matching. Inf Sci 259:212–224CrossRef
9.
10.
Zurück zum Zitat Lasmar NE, Berthoumieu Y (2014) Gaussian copula multivariate modeling for texture image retrieval using wavelet transforms. IEEE Trans Image Process 23(5):2246–2261MathSciNetCrossRef Lasmar NE, Berthoumieu Y (2014) Gaussian copula multivariate modeling for texture image retrieval using wavelet transforms. IEEE Trans Image Process 23(5):2246–2261MathSciNetCrossRef
11.
Zurück zum Zitat Haralick RM (1979) Statistical and structural approaches to texture. Proc IEEE 67(5):786–804CrossRef Haralick RM (1979) Statistical and structural approaches to texture. Proc IEEE 67(5):786–804CrossRef
12.
Zurück zum Zitat Tamura H, Mori S, Yamawaki T (1978) Textural features corresponding to visual perception. IEEE Trans Syst Man Cybern 8(6):460–473CrossRef Tamura H, Mori S, Yamawaki T (1978) Textural features corresponding to visual perception. IEEE Trans Syst Man Cybern 8(6):460–473CrossRef
13.
Zurück zum Zitat Vogel J, Schiele B (2006) Performance evaluation and optimization for content-based image retrieval. Pattern Recognit 39(5):897–909CrossRefMATH Vogel J, Schiele B (2006) Performance evaluation and optimization for content-based image retrieval. Pattern Recognit 39(5):897–909CrossRefMATH
14.
Zurück zum Zitat Lee J, Nang J (2011) Content-based image retrieval method using the relative location of multiple ROIs. Adv Electr Comput Eng 11(3):85–90CrossRef Lee J, Nang J (2011) Content-based image retrieval method using the relative location of multiple ROIs. Adv Electr Comput Eng 11(3):85–90CrossRef
15.
Zurück zum Zitat Hoàng N, Gouet-Brunet V, Rukoz M, Manouvrier M (2010) Embedding spatial information into image content description for scene retrieval. Pattern Recognit 43(9):3013–3024CrossRefMATH Hoàng N, Gouet-Brunet V, Rukoz M, Manouvrier M (2010) Embedding spatial information into image content description for scene retrieval. Pattern Recognit 43(9):3013–3024CrossRefMATH
16.
Zurück zum Zitat Wang S, Liu D, Gu F, Feng Yang HL (2012) Similar matching for images with complex spatial relations. J Comput Inf Syst 8:8727–8734 Wang S, Liu D, Gu F, Feng Yang HL (2012) Similar matching for images with complex spatial relations. J Comput Inf Syst 8:8727–8734
17.
Zurück zum Zitat Jaworska T, Kacprzyk J, Marín N, Zadrożny S (2010) On dealing with imprecise information in a content based image retrieval system. In: Computational intelligence for knowledge-based systems design. Springer, Berlin, Heidelberg, pp 149–158 Jaworska T, Kacprzyk J, Marín N, Zadrożny S (2010) On dealing with imprecise information in a content based image retrieval system. In: Computational intelligence for knowledge-based systems design. Springer, Berlin, Heidelberg, pp 149–158
18.
Zurück zum Zitat Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110CrossRef Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110CrossRef
19.
Zurück zum Zitat Bay H, Ess A, Tuytelaars T, Van Gool L (2008) Speeded-up robust features (SURF). Comput Vis Image Underst 110(3):346–359CrossRef Bay H, Ess A, Tuytelaars T, Van Gool L (2008) Speeded-up robust features (SURF). Comput Vis Image Underst 110(3):346–359CrossRef
20.
Zurück zum Zitat Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: IEEE Computer Society conference on computer vision and pattern recognition, CVPR, vol 1, pp 886–893 Dalal N, Triggs B (2005) Histograms of oriented gradients for human detection. In: IEEE Computer Society conference on computer vision and pattern recognition, CVPR, vol 1, pp 886–893
21.
Zurück zum Zitat Rosten E, Drummond T (2006) Machine learning for high-speed corner detection. In: Computer vision-ECCV, pp 430–443 Rosten E, Drummond T (2006) Machine learning for high-speed corner detection. In: Computer vision-ECCV, pp 430–443
22.
Zurück zum Zitat Wang XY, Zhang BB, Yang HY (2014) Content-based image retrieval by integrating color and texture features. Multimed Tools Appl 68(3):545–569MathSciNetCrossRef Wang XY, Zhang BB, Yang HY (2014) Content-based image retrieval by integrating color and texture features. Multimed Tools Appl 68(3):545–569MathSciNetCrossRef
23.
Zurück zum Zitat Liu GH, Zhang L, Hou YK, Li ZY, Yang JY (2010) Image retrieval based on multi-texton histogram. Pattern Recognit 43(7):2380–2389CrossRefMATH Liu GH, Zhang L, Hou YK, Li ZY, Yang JY (2010) Image retrieval based on multi-texton histogram. Pattern Recognit 43(7):2380–2389CrossRefMATH
24.
Zurück zum Zitat Huang ZC, Chan P, Ng W, Yeung DS (2010) Content-based image retrieval using color moment and Gabor texture feature. In: IEEE international conference on machine learning and cybernetics (ICMLC), vol 2, pp 719–724 Huang ZC, Chan P, Ng W, Yeung DS (2010) Content-based image retrieval using color moment and Gabor texture feature. In: IEEE international conference on machine learning and cybernetics (ICMLC), vol 2, pp 719–724
25.
Zurück zum Zitat Harris C, Stephens M (1988) A combined corner and edge detector. In: Alvey vision conference, vol 15, p 50 Harris C, Stephens M (1988) A combined corner and edge detector. In: Alvey vision conference, vol 15, p 50
26.
Zurück zum Zitat Matas J, Chum O, Urban M, Pajdla T (2002) Robust wide baseline stereo from maximally stable extremal regions. In: Proceedings of British machine vision conference, pp 384–393 Matas J, Chum O, Urban M, Pajdla T (2002) Robust wide baseline stereo from maximally stable extremal regions. In: Proceedings of British machine vision conference, pp 384–393
27.
Zurück zum Zitat Sivic J, Zisserman A (2003) Video Google: a text retrieval approach to object matching in videos. In: Ninth IEEE ICCV, pp 1470–1477 Sivic J, Zisserman A (2003) Video Google: a text retrieval approach to object matching in videos. In: Ninth IEEE ICCV, pp 1470–1477
28.
Zurück zum Zitat Ke Y, Sukthankar R (2004) PCA-SIFT: a more distinctive representation for local image descriptors. In: Proceedings of IEEE CVPR, vol 2, p II-506 Ke Y, Sukthankar R (2004) PCA-SIFT: a more distinctive representation for local image descriptors. In: Proceedings of IEEE CVPR, vol 2, p II-506
29.
Zurück zum Zitat Perronnin F, Dance C (2007) Fisher kernels on visual vocabularies for image categorization. In: IEEE CVPR’07, pp 1–8 Perronnin F, Dance C (2007) Fisher kernels on visual vocabularies for image categorization. In: IEEE CVPR’07, pp 1–8
30.
Zurück zum Zitat Jégou H, Perronnin F, Douze M, Sanchez J, Perez P, Schmid C (2012) Aggregating local image descriptors into compact codes. IEEE Trans Pattern Anal Mach Intell 34(9):1704–1716CrossRef Jégou H, Perronnin F, Douze M, Sanchez J, Perez P, Schmid C (2012) Aggregating local image descriptors into compact codes. IEEE Trans Pattern Anal Mach Intell 34(9):1704–1716CrossRef
31.
Zurück zum Zitat Iakovidou C, Anagnostopoulos N, Kapoutsis A, Boutalis Y, Lux M, Chatzichristofis SA (2015) Localizing global descriptors for content-based image retrieval. EURASIP J Adv Signal Process 1:1–20 Iakovidou C, Anagnostopoulos N, Kapoutsis A, Boutalis Y, Lux M, Chatzichristofis SA (2015) Localizing global descriptors for content-based image retrieval. EURASIP J Adv Signal Process 1:1–20
32.
Zurück zum Zitat ElAlami M (2014) A new matching strategy for content based image retrieval system. Appl Soft Comput 14:407–418CrossRef ElAlami M (2014) A new matching strategy for content based image retrieval system. Appl Soft Comput 14:407–418CrossRef
33.
Zurück zum Zitat Zhang Y, Zhaoxing Z, Han X (2009) Category specific SIFT descriptor and its combination with color information for content-based image retrieval. In: Proceedings of the 2nd ACM international conference on interaction sciences: information technology, culture and human, pp 685–690 Zhang Y, Zhaoxing Z, Han X (2009) Category specific SIFT descriptor and its combination with color information for content-based image retrieval. In: Proceedings of the 2nd ACM international conference on interaction sciences: information technology, culture and human, pp 685–690
34.
Zurück zum Zitat Deselaers T, Keysers D, Ney H (2008) Features for image retrieval: an experimental comparison. Inf Retr 11(2):77–107CrossRef Deselaers T, Keysers D, Ney H (2008) Features for image retrieval: an experimental comparison. Inf Retr 11(2):77–107CrossRef
35.
Zurück zum Zitat Walia E, Verma V (2016) Boosting local texture descriptors with Log-Gabor filters response for improved image retrieval. Int J Multimed Inf Retr 5(4):173–184CrossRef Walia E, Verma V (2016) Boosting local texture descriptors with Log-Gabor filters response for improved image retrieval. Int J Multimed Inf Retr 5(4):173–184CrossRef
36.
Zurück zum Zitat Bosch A, Zisserman A, Munoz X (2007) Image classification using random forests and ferns. In: IEEE 11th ICCV, pp 1–8 Bosch A, Zisserman A, Munoz X (2007) Image classification using random forests and ferns. In: IEEE 11th ICCV, pp 1–8
37.
Zurück zum Zitat Alzu’bi A, Amira A, Ramzan N, Jaber T (2015) Robust fusion of color and local descriptors for image retrieval and classification. In: IEEE international conference on systems, signals and image processing (IWSSIP), pp 253–256 Alzu’bi A, Amira A, Ramzan N, Jaber T (2015) Robust fusion of color and local descriptors for image retrieval and classification. In: IEEE international conference on systems, signals and image processing (IWSSIP), pp 253–256
38.
Zurück zum Zitat Lee TS (1996) Image representation using 2D Gabor wavelets. IEEE Trans Pattern Anal Mach Intell 18(10):959–971CrossRef Lee TS (1996) Image representation using 2D Gabor wavelets. IEEE Trans Pattern Anal Mach Intell 18(10):959–971CrossRef
39.
Zurück zum Zitat Mallat S (1998) A wavelet tour of signal processing. Academic Press, San DiegoMATH Mallat S (1998) A wavelet tour of signal processing. Academic Press, San DiegoMATH
40.
Zurück zum Zitat Costa A F, Humpire-Mamani G,Traina A J (2012) An efficient algorithm for fractal analysis of textures. In: 25th IEEE SIBGRAPI, pp 39–46 Costa A F, Humpire-Mamani G,Traina A J (2012) An efficient algorithm for fractal analysis of textures. In: 25th IEEE SIBGRAPI, pp 39–46
41.
Zurück zum Zitat Ojala T, Pietikäinen M, Mäenpää T (2002) Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans Pattern Anal Mach Intell 24(7):971–987CrossRefMATH Ojala T, Pietikäinen M, Mäenpää T (2002) Multiresolution gray-scale and rotation invariant texture classification with local binary patterns. IEEE Trans Pattern Anal Mach Intell 24(7):971–987CrossRefMATH
42.
Zurück zum Zitat Brahnam S, Jain LC, Nanni L, Lumini A (2014) Local binary patterns: new variants and applications. Springer, Berlin, HeidelbergCrossRefMATH Brahnam S, Jain LC, Nanni L, Lumini A (2014) Local binary patterns: new variants and applications. Springer, Berlin, HeidelbergCrossRefMATH
43.
Zurück zum Zitat Oliva A, Torralba A (2001) Modeling the shape of the scene: a holistic representation of the spatial envelope. Int J Comput Vis 42(3):145–175CrossRefMATH Oliva A, Torralba A (2001) Modeling the shape of the scene: a holistic representation of the spatial envelope. Int J Comput Vis 42(3):145–175CrossRefMATH
44.
Zurück zum Zitat Bianconi F, Harvey R, Southam P, Fernández A (2011) Theoretical and experimental comparison of different approaches for color texture classification. J Electron Imaging 20(4):043006–043006CrossRef Bianconi F, Harvey R, Southam P, Fernández A (2011) Theoretical and experimental comparison of different approaches for color texture classification. J Electron Imaging 20(4):043006–043006CrossRef
45.
Zurück zum Zitat Mikolajczyk K, Schmid C (2004) Scale and affine invariant interest point detectors. Int J Comput Vis 60(1):63–86CrossRef Mikolajczyk K, Schmid C (2004) Scale and affine invariant interest point detectors. Int J Comput Vis 60(1):63–86CrossRef
46.
Zurück zum Zitat Wang Z, Fan B, Wu F (2011) Local intensity order pattern for feature description. In: ICCV, pp 603–610 Wang Z, Fan B, Wu F (2011) Local intensity order pattern for feature description. In: ICCV, pp 603–610
47.
Zurück zum Zitat Arandjelović R, Zisserman A (2012) Three things everyone should know to improve object retrieval. In: IEEE CVPR, pp 2911–2918 Arandjelović R, Zisserman A (2012) Three things everyone should know to improve object retrieval. In: IEEE CVPR, pp 2911–2918
48.
Zurück zum Zitat Tola E, Lepetit V, Fua P (2010) An efficient dense descriptor applied to wide-baseline stereo. IEEE Trans Pattern Anal Mach Intell 32(5):815–830CrossRef Tola E, Lepetit V, Fua P (2010) An efficient dense descriptor applied to wide-baseline stereo. IEEE Trans Pattern Anal Mach Intell 32(5):815–830CrossRef
49.
Zurück zum Zitat Huiskes MJ, Thomee B, Lew MS (2010) New trends and ideas in visual concept detection: the MIR flickr retrieval evaluation initiative. I:n Multimedia information retrieval, pp 527– 536 Huiskes MJ, Thomee B, Lew MS (2010) New trends and ideas in visual concept detection: the MIR flickr retrieval evaluation initiative. I:n Multimedia information retrieval, pp 527– 536
50.
Zurück zum Zitat Vedaldi A, Fulkerson B (2010) VLFeat: an open and portable library of computer vision algorithms. In: Proceedings of the 18th ACM conference on multimedia, pp 1469–1472 Vedaldi A, Fulkerson B (2010) VLFeat: an open and portable library of computer vision algorithms. In: Proceedings of the 18th ACM conference on multimedia, pp 1469–1472
51.
Zurück zum Zitat Jegou H, Douze M, Schmid C (2008) Hamming embedding and weak geometric consistency for large scale image search. In: ECCV, pp 304–317 Jegou H, Douze M, Schmid C (2008) Hamming embedding and weak geometric consistency for large scale image search. In: ECCV, pp 304–317
52.
Zurück zum Zitat Perronnin F, Liu Y, Sánchez J, Poirier H (2010) Large-scale image retrieval with compressed fisher vectors. In: Proceedings of CVPR, pp 3384–3391 Perronnin F, Liu Y, Sánchez J, Poirier H (2010) Large-scale image retrieval with compressed fisher vectors. In: Proceedings of CVPR, pp 3384–3391
53.
Zurück zum Zitat Arandjelovic R, Zisserman A (2013) All about VLAD. In: CVPR, pp 1578–1585 Arandjelovic R, Zisserman A (2013) All about VLAD. In: CVPR, pp 1578–1585
54.
Zurück zum Zitat Delhumeau J, Gosselin P H, Jégou H, Pérez P (2013) Revisiting the VLAD image representation. In: ACM multimedia, pp 653–656 Delhumeau J, Gosselin P H, Jégou H, Pérez P (2013) Revisiting the VLAD image representation. In: ACM multimedia, pp 653–656
55.
Zurück zum Zitat Shao H, Svoboda T, Van Gool L (2003) Zubud-zurich buildings database for image based recognition. Technical report 260, Computer Vision Lab, Swiss Federal Institute of Technology, Switzerland Shao H, Svoboda T, Van Gool L (2003) Zubud-zurich buildings database for image based recognition. Technical report 260, Computer Vision Lab, Swiss Federal Institute of Technology, Switzerland
Metadaten
Titel
Improving content-based image retrieval with compact global and local multi-features
verfasst von
Ahmad Alzu’bi
Abbes Amira
Naeem Ramzan
Tareq Jaber
Publikationsdatum
01.11.2016
Verlag
Springer London
Erschienen in
International Journal of Multimedia Information Retrieval / Ausgabe 4/2016
Print ISSN: 2192-6611
Elektronische ISSN: 2192-662X
DOI
https://doi.org/10.1007/s13735-016-0109-4

Weitere Artikel der Ausgabe 4/2016

International Journal of Multimedia Information Retrieval 4/2016 Zur Ausgabe