Skip to main content
Top

2017 | OriginalPaper | Chapter

Image Retrieval Based on Query by Saliency Content

Authors : Adrian G. Bors, Alex Papushoy

Published in: Visual Content Indexing and Retrieval with Psycho-Visual Models

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This chapter outlines a content based image retrieval (CBIR) methodology that takes into account the saliency in images. Natural images are depictions of real-life objects and scenes, usually set in cluttered environments. The performance of image retrieval in these scenarios may suffer because there is no way of knowing which parts of the image are of interest to the user. The human visual system provides a clue to what would be of interest in the image, by involuntarily shifting the focus of attention to salient image areas. The application of computational models of selective visual attention to image understanding can produce better, unsupervised retrieval results by identifying perceptually important areas of the image that usually correspond to its semantic meaning, whilst discarding irrelevant information. This chapter explores the construction of a retrieval system incorporating a visual attention model and proposes a new method for selecting salient image regions, as well as embedding an improved representation for salient image edges for determining global image saliency.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Achanta, R., Hemami, S., Estrada, F., Susstrunk, S.: Frequency-tuned salient region detection. In: Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 1597–1604 (2009) Achanta, R., Hemami, S., Estrada, F., Susstrunk, S.: Frequency-tuned salient region detection. In: Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 1597–1604 (2009)
2.
go back to reference Ashley, J.J., Barber, R., Flickner, M., Hafner, J.L., Lee, D., Niblack, C.W., Petkovic, D.: Automatic and semiautomatic methods for image annotation and retrieval in query by image content (QBIC). In: Proceedings of SPIE 2420, Storage and Retrieval for Image and Video Databases III, San Jose, CA, pp. 24–35 (1995) Ashley, J.J., Barber, R., Flickner, M., Hafner, J.L., Lee, D., Niblack, C.W., Petkovic, D.: Automatic and semiautomatic methods for image annotation and retrieval in query by image content (QBIC). In: Proceedings of SPIE 2420, Storage and Retrieval for Image and Video Databases III, San Jose, CA, pp. 24–35 (1995)
3.
go back to reference Bors, A.G., Nasios, N.: Kernel bandwidth estimation for nonparametric modelling. IEEE Trans. Syst. Man Cybern. B Cybern. 39(6), 1543–1555 (2009)CrossRef Bors, A.G., Nasios, N.: Kernel bandwidth estimation for nonparametric modelling. IEEE Trans. Syst. Man Cybern. B Cybern. 39(6), 1543–1555 (2009)CrossRef
4.
go back to reference Bi J., Chen, Y., Wang, J.: A sparse support vector machine approach to region-based image categorization. In: Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, vol. 19, pp. 1121–1128 (2005) Bi J., Chen, Y., Wang, J.: A sparse support vector machine approach to region-based image categorization. In: Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, vol. 19, pp. 1121–1128 (2005)
5.
go back to reference Carson, C., Belongie, S., Greenspan, H., Malik, J.: Blobworld: image segmentation using expectation-maximization and its application to image querying. IEEE Trans. Pattern Anal. Mach. Intell. 24(8), 1026–1038 (2002)CrossRef Carson, C., Belongie, S., Greenspan, H., Malik, J.: Blobworld: image segmentation using expectation-maximization and its application to image querying. IEEE Trans. Pattern Anal. Mach. Intell. 24(8), 1026–1038 (2002)CrossRef
6.
go back to reference Chang, T., Kuo, C.-C.J.: Texture analysis and classification with tree-structured wavelet transform. IEEE Trans. Image Process. 2(4), 429–441 (1993)CrossRef Chang, T., Kuo, C.-C.J.: Texture analysis and classification with tree-structured wavelet transform. IEEE Trans. Image Process. 2(4), 429–441 (1993)CrossRef
7.
go back to reference Chang, R., Liao, C.C.C.: Region-based image retrieval using edgeflow segmentation and region adjacency graph. In: Proceedings of International Conference on Multimedia and Expo, pp. 1883–1886 (2004) Chang, R., Liao, C.C.C.: Region-based image retrieval using edgeflow segmentation and region adjacency graph. In: Proceedings of International Conference on Multimedia and Expo, pp. 1883–1886 (2004)
8.
go back to reference Chen, Y., Wang, J.Z.: Image categorization by learning and reasoning with images. J. Mach. Learn. Res. 5, 913–939 (2004) Chen, Y., Wang, J.Z.: Image categorization by learning and reasoning with images. J. Mach. Learn. Res. 5, 913–939 (2004)
9.
go back to reference Cheng, Y.: Mean shift, mode seeking, and clustering. IEEE Trans. Pattern Anal. Mach. Intell. 17(8), 790–799 (1995)CrossRef Cheng, Y.: Mean shift, mode seeking, and clustering. IEEE Trans. Pattern Anal. Mach. Intell. 17(8), 790–799 (1995)CrossRef
10.
go back to reference Comaniciu, D., Meer, P.: Mean shift: a robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24(5), 603–619 (2002)CrossRef Comaniciu, D., Meer, P.: Mean shift: a robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 24(5), 603–619 (2002)CrossRef
11.
go back to reference Datta, R., Joshi, D., Li, J., Wan, J.Z.: Image retrieval: Ideas, influences, and trends of the new age. ACM Comput. Surv. 40(2), 5:1–5:60 (2008) Datta, R., Joshi, D., Li, J., Wan, J.Z.: Image retrieval: Ideas, influences, and trends of the new age. ACM Comput. Surv. 40(2), 5:1–5:60 (2008)
12.
go back to reference Derrington, A.M., Krauskopf, J., Lennie, P.: Chromatic mechanisms in lateral geniculate. Nucleus Macaque J. Physiol. 357, 241–265 (1984) Derrington, A.M., Krauskopf, J., Lennie, P.: Chromatic mechanisms in lateral geniculate. Nucleus Macaque J. Physiol. 357, 241–265 (1984)
13.
go back to reference Faloutsos, C., Barber, R., Flickner, M., Hafner, J., Niblack, W., Petkovic, D., Equitz, W.: Efficient and effective querying by image content. J. Intell. Inf. Syst. 3, 231–262 (1994)CrossRef Faloutsos, C., Barber, R., Flickner, M., Hafner, J., Niblack, W., Petkovic, D., Equitz, W.: Efficient and effective querying by image content. J. Intell. Inf. Syst. 3, 231–262 (1994)CrossRef
14.
go back to reference Feng, S., Xu, D., Yang, X.: Attention-driven salient edge(s) and region(s) extraction with application to CBIR. Signal Process. 90(1), 1–15 (2010)CrossRefMATH Feng, S., Xu, D., Yang, X.: Attention-driven salient edge(s) and region(s) extraction with application to CBIR. Signal Process. 90(1), 1–15 (2010)CrossRefMATH
15.
go back to reference Frintrop, S., Klodt, M., Rome, E.: International Conference on Computer Vision Systems (2007) Frintrop, S., Klodt, M., Rome, E.: International Conference on Computer Vision Systems (2007)
16.
go back to reference Frintrop, S., Rome, E., Christensen, H.: Computational visual attention systems and their cognitive foundations. A survey. ACM Trans. Appl. Percept. 7(1), 6.1–6.46 (2010) Frintrop, S., Rome, E., Christensen, H.: Computational visual attention systems and their cognitive foundations. A survey. ACM Trans. Appl. Percept. 7(1), 6.1–6.46 (2010)
17.
go back to reference Gao, D., Han, S., Vasconcelos, N.: Discriminant saliency, the detection of suspicious coincidences, and applications to visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 31(6), 989–1005 (2009)CrossRef Gao, D., Han, S., Vasconcelos, N.: Discriminant saliency, the detection of suspicious coincidences, and applications to visual recognition. IEEE Trans. Pattern Anal. Mach. Intell. 31(6), 989–1005 (2009)CrossRef
18.
go back to reference Harel, J., Koch, C., Perona, P.: Graph-based visual saliency. In: Proceedings of Advances in Neural Information Processing Systems (NIPS), vol. 19, pp. 545–552 (2007) Harel, J., Koch, C., Perona, P.: Graph-based visual saliency. In: Proceedings of Advances in Neural Information Processing Systems (NIPS), vol. 19, pp. 545–552 (2007)
19.
go back to reference Hou, X., Zhang, L.: Saliency detection: a spectral residual approach. In: Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 1–8 (2007) Hou, X., Zhang, L.: Saliency detection: a spectral residual approach. In: Proceedings of IEEE International Conference on Computer Vision and Pattern Recognition, pp. 1–8 (2007)
20.
go back to reference Islam, M.M., Zhang, D., Lu, G.: Comparison of retrieval effectiveness of different region based image representations. In: International Conference on Information, Communications and Signal Processing, pp. 1–6 (2007) Islam, M.M., Zhang, D., Lu, G.: Comparison of retrieval effectiveness of different region based image representations. In: International Conference on Information, Communications and Signal Processing, pp. 1–6 (2007)
21.
go back to reference Itti, L., Ullman, S.: Shifts in selective visual attention, Towards the underlying neural circuitry. Hum. Neurobiol. 4(4), 219–227 (1985) Itti, L., Ullman, S.: Shifts in selective visual attention, Towards the underlying neural circuitry. Hum. Neurobiol. 4(4), 219–227 (1985)
22.
go back to reference Itti, L., Koch, C., Niebur, E.: A model of saliency-based visual attention for rapid scene analysis. IEEE Trans. Pattern Anal. Mach. Intell. 20(11), 1254–1259 (1998)CrossRef Itti, L., Koch, C., Niebur, E.: A model of saliency-based visual attention for rapid scene analysis. IEEE Trans. Pattern Anal. Mach. Intell. 20(11), 1254–1259 (1998)CrossRef
23.
go back to reference Jing, F., Li, M., Zhang, H.-J., Zhang, B.: An efficient and effective region-based image retrieval framework. IEEE Trans. Image Process. 13(5), 699–709 (2004)CrossRef Jing, F., Li, M., Zhang, H.-J., Zhang, B.: An efficient and effective region-based image retrieval framework. IEEE Trans. Image Process. 13(5), 699–709 (2004)CrossRef
24.
go back to reference Kadir, T., Brady, M.: Saliency, scale and image description. Int. J. Comput. Vis. 45(2), 83–105 (2001)CrossRefMATH Kadir, T., Brady, M.: Saliency, scale and image description. Int. J. Comput. Vis. 45(2), 83–105 (2001)CrossRefMATH
25.
go back to reference Kelly, P.M., Cannon, M., Hush, D.R.: Query by image example: the CANDID approach. In: Proceedings of SPIE 2420, San Jose, CA, pp. 238–248 (1995) Kelly, P.M., Cannon, M., Hush, D.R.: Query by image example: the CANDID approach. In: Proceedings of SPIE 2420, San Jose, CA, pp. 238–248 (1995)
26.
go back to reference Mallat, S.G.: A Wavelet Tour of Signal Processing: The Sparse Way. Academic, New York (2009)MATH Mallat, S.G.: A Wavelet Tour of Signal Processing: The Sparse Way. Academic, New York (2009)MATH
27.
go back to reference Ogle, V.E., Stonebraker, M.: Chabot: retrieval from a relational database of images. IEEE Comput. 28(9), 40–48 (1995)CrossRef Ogle, V.E., Stonebraker, M.: Chabot: retrieval from a relational database of images. IEEE Comput. 28(9), 40–48 (1995)CrossRef
28.
go back to reference Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)CrossRef Otsu, N.: A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9(1), 62–66 (1979)CrossRef
29.
go back to reference Papushoy, A., Bors, A.G.: Image retrieval based on query by saliency content. Digital Signal Process. 36(1), 156–173 (2015)MathSciNetCrossRef Papushoy, A., Bors, A.G.: Image retrieval based on query by saliency content. Digital Signal Process. 36(1), 156–173 (2015)MathSciNetCrossRef
30.
go back to reference Papushoy, A., Bors, A.G.: Visual attention for content based image retrieval. In: Proceedings of IEEE International Conference on Image Processing (ICIP), Quebec City, pp. 971–975 (2015) Papushoy, A., Bors, A.G.: Visual attention for content based image retrieval. In: Proceedings of IEEE International Conference on Image Processing (ICIP), Quebec City, pp. 971–975 (2015)
31.
go back to reference Pele, O., Taskar, B.: The tangent earth mover’s distance. In: Proceedings of Geometric Science of Information. Lecture Notes on Computer Science, vol. 8085, pp. 397–404 (2013)MATH Pele, O., Taskar, B.: The tangent earth mover’s distance. In: Proceedings of Geometric Science of Information. Lecture Notes on Computer Science, vol. 8085, pp. 397–404 (2013)MATH
32.
go back to reference Pentland, A., Picard, R.W., Sclaroff, S.: Photobook: content-based manipulation of image databases. Int. J. Comput. Vis. 18(3), 233–254 (1995)CrossRef Pentland, A., Picard, R.W., Sclaroff, S.: Photobook: content-based manipulation of image databases. Int. J. Comput. Vis. 18(3), 233–254 (1995)CrossRef
33.
go back to reference Rahmani, R., Goldman, S.A., Zhang, H., Cholleti, S.R., Fritts, J.E.: Localized content-based image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 30(11), 1902–1912 (2008)CrossRef Rahmani, R., Goldman, S.A., Zhang, H., Cholleti, S.R., Fritts, J.E.: Localized content-based image retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 30(11), 1902–1912 (2008)CrossRef
34.
go back to reference Rubner, Y., Tomasi, C., Guibas, L.J.: The earth mover distance as a metric for image retrieval. Int. J. Comput. Vis. 402, 99–121 (2000)CrossRefMATH Rubner, Y., Tomasi, C., Guibas, L.J.: The earth mover distance as a metric for image retrieval. Int. J. Comput. Vis. 402, 99–121 (2000)CrossRefMATH
35.
go back to reference Rui, Y., Huang, T., Ortega, M., Mehrotra, S.: Relevance feedback: a power tool in interactive content-based image retrieval. IEEE Trans. Circuits Syst. Video Technol. 8(5), 644–655 (1998)CrossRef Rui, Y., Huang, T., Ortega, M., Mehrotra, S.: Relevance feedback: a power tool in interactive content-based image retrieval. IEEE Trans. Circuits Syst. Video Technol. 8(5), 644–655 (1998)CrossRef
36.
go back to reference Sharma, G., Wu, W., Dalal, E.N.: The CIEDE2000 color-difference formula: implementation notes, supplementary test data, and mathematical observations. Color Res. Appl. 30(1), 1–24 (2005)CrossRef Sharma, G., Wu, W., Dalal, E.N.: The CIEDE2000 color-difference formula: implementation notes, supplementary test data, and mathematical observations. Color Res. Appl. 30(1), 1–24 (2005)CrossRef
37.
go back to reference Smith, J.R., Chang, S.-F.: VisualSEEk: a fully automated content-based image query system. In: Proceedings of the ACM International Conference on Multimedia, Boston, pp. 87–98 (1996) Smith, J.R., Chang, S.-F.: VisualSEEk: a fully automated content-based image query system. In: Proceedings of the ACM International Conference on Multimedia, Boston, pp. 87–98 (1996)
38.
go back to reference Smeulders, A., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell. 22(12), 1349–1380 (2000)CrossRef Smeulders, A., Worring, M., Santini, S., Gupta, A., Jain, R.: Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell. 22(12), 1349–1380 (2000)CrossRef
39.
go back to reference Stentiford, F.W.M.: A visual attention estimator applied to image subject enhancement and colour and grey level compression. In: 17th International Conference on Pattern Recognition, vol. 3, Cambridge, pp. 638–641 (2004) Stentiford, F.W.M.: A visual attention estimator applied to image subject enhancement and colour and grey level compression. In: 17th International Conference on Pattern Recognition, vol. 3, Cambridge, pp. 638–641 (2004)
40.
go back to reference Tas, A.C., Luck, S.J., Hollingworth, A.: The relationship between visual attention and visual working memory encoding: a dissociation between covert and overt orienting. J. Exp. Psychol. Hum. Percept. Perform. 42, 1121–1138 (2016)CrossRef Tas, A.C., Luck, S.J., Hollingworth, A.: The relationship between visual attention and visual working memory encoding: a dissociation between covert and overt orienting. J. Exp. Psychol. Hum. Percept. Perform. 42, 1121–1138 (2016)CrossRef
41.
go back to reference Vondrick, C., Khosla, A., Pirsiavash, H., Malisiewicz, T., Torralba, A.: Visualizing object detection features. Int. J. Comput. Vis. 119(2), 145–158 (2016)MathSciNetCrossRef Vondrick, C., Khosla, A., Pirsiavash, H., Malisiewicz, T., Torralba, A.: Visualizing object detection features. Int. J. Comput. Vis. 119(2), 145–158 (2016)MathSciNetCrossRef
42.
go back to reference Wan, J., Wang, D., Hoi, S., Wu, P., Zhu, J., Li, J.: Deep learning for content-based image retrieval: a comprehensive study. In: Proceedings of ACM International Conference on Multimedia, pp. 157–266 (2014) Wan, J., Wang, D., Hoi, S., Wu, P., Zhu, J., Li, J.: Deep learning for content-based image retrieval: a comprehensive study. In: Proceedings of ACM International Conference on Multimedia, pp. 157–266 (2014)
43.
go back to reference Wang, J.Z., Wiederhold, G., Firschein, O., Sha, X.W.: Content-based image indexing and searching using Daubechies’ wavelets. Int. J. Digit. Libr. 1(4), 311–328 (1998)CrossRef Wang, J.Z., Wiederhold, G., Firschein, O., Sha, X.W.: Content-based image indexing and searching using Daubechies’ wavelets. Int. J. Digit. Libr. 1(4), 311–328 (1998)CrossRef
44.
go back to reference Wang, J., Li, J., Wiederhold, G.: Simplicity: semantics-sensitive integrated matching for picture libraries. IEEE Trans. Pattern Anal. Mach. Intell. 23(9), 947–963 (2001)CrossRef Wang, J., Li, J., Wiederhold, G.: Simplicity: semantics-sensitive integrated matching for picture libraries. IEEE Trans. Pattern Anal. Mach. Intell. 23(9), 947–963 (2001)CrossRef
45.
go back to reference Wang, H., Cai,Y., Zhang, Y., Pan, H., Lv, W., Han, H.: Deep learning for image retrieval: what works and what doesn’t. In: Proceedings of IEEE International Conference on Data Mining Workshop, pp. 1576–1583 (2015) Wang, H., Cai,Y., Zhang, Y., Pan, H., Lv, W., Han, H.: Deep learning for image retrieval: what works and what doesn’t. In: Proceedings of IEEE International Conference on Data Mining Workshop, pp. 1576–1583 (2015)
46.
go back to reference Wolfe, J.M.: Visual search. In: Pashler, H. (ed.) Attention. Psychology Press, Hove, East Sussex (1998) Wolfe, J.M.: Visual search. In: Pashler, H. (ed.) Attention. Psychology Press, Hove, East Sussex (1998)
47.
go back to reference Zhang, L., Tong, M.H., Marks, T.K., Shan, H., Cottrell, G.W.: SUN, A Bayesian framework for saliency using natural statistics. J. Vis. 8(7), 32.1–32.20 (2008)CrossRef Zhang, L., Tong, M.H., Marks, T.K., Shan, H., Cottrell, G.W.: SUN, A Bayesian framework for saliency using natural statistics. J. Vis. 8(7), 32.1–32.20 (2008)CrossRef
Metadata
Title
Image Retrieval Based on Query by Saliency Content
Authors
Adrian G. Bors
Alex Papushoy
Copyright Year
2017
DOI
https://doi.org/10.1007/978-3-319-57687-9_8

Premium Partner