Skip to main content

2019 | OriginalPaper | Buchkapitel

Image Retrieval Research Based on Significant Regions

verfasst von : Jie Xu, Shuwei Sheng, Yuhao Cai, Yin Bian, Du Xu

Erschienen in: Communications and Networking

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Deep Convolution neural networks (CNN) has achieved great success in the field of image recognition. But in the image retrieval task, the global CNN features ignore local detail description for paying too much attention to semantic information of images. So the MAP of image retrieval remains to be improved. Aiming at this problem, this paper proposes a local CNN feature extraction algorithm based on image understanding, which includes three steps: significant regions extraction, significant regions description and pool coding. This method overcomes the semantic gap problem in traditional local characteristic and improves the retrieval effect of global CNN features. Then, we apply this local CNN feature in the image retrieval task, including the same category retrieval task by feature fusion strategy and the instance retrieval task by re-ranking strategy. The experimental results show that this method has achieved good performance on the Caltech 101 and Caltech 256 classification datasets, and competitive results on the Oxford 5k and Paris 6k instance retrieval datasets.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Lowe, D.G.: Object recognition from local scale-invariant features. In: IEEE International Conference on Computer Vision (2001) Lowe, D.G.: Object recognition from local scale-invariant features. In: IEEE International Conference on Computer Vision (2001)
2.
Zurück zum Zitat Sivic, J.: A text retrieval approach to object matching in videos. In: Proceedings of IEEE International Conference on Computer Vision (2003) Sivic, J.: A text retrieval approach to object matching in videos. In: Proceedings of IEEE International Conference on Computer Vision (2003)
3.
Zurück zum Zitat Perronnin, F., Liu, Y., Sánchez, J., Poirier, H.: Large-scale image retrieval with compressed Fisher vectors. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Workshops, pp. 3384–3391 (2010) Perronnin, F., Liu, Y., Sánchez, J., Poirier, H.: Large-scale image retrieval with compressed Fisher vectors. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Workshops, pp. 3384–3391 (2010)
4.
Zurück zum Zitat Jegou, H., Perronnin, F., Douze, M., Sánchez, J., Perez, P.: Aggregating local image descriptors into compact codes. IEEE Trans. Pattern Anal. Mach. Intell. 34(9), 1704–1716 (2012)CrossRef Jegou, H., Perronnin, F., Douze, M., Sánchez, J., Perez, P.: Aggregating local image descriptors into compact codes. IEEE Trans. Pattern Anal. Mach. Intell. 34(9), 1704–1716 (2012)CrossRef
5.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems (NIPS 2012), pp. 1097–1105 (2012) Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems (NIPS 2012), pp. 1097–1105 (2012)
6.
7.
Zurück zum Zitat Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Workshops, pp. 1–9 (2015) Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S.: Going deeper with convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Workshops, pp. 1–9 (2015)
8.
Zurück zum Zitat Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Proceedings of the European Conference on Computer Vision, pp. 818–833 (2014) Zeiler, M.D., Fergus, R.: Visualizing and understanding convolutional networks. In: Proceedings of the European Conference on Computer Vision, pp. 818–833 (2014)
9.
Zurück zum Zitat Oquab, M., Bottou, L., Laptev, I., Sivic, J.: Learning and transferring mid-level image representations using convolutional neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Workshops, pp. 1717–1724 (2014) Oquab, M., Bottou, L., Laptev, I., Sivic, J.: Learning and transferring mid-level image representations using convolutional neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Workshops, pp. 1717–1724 (2014)
10.
Zurück zum Zitat Hoang, T., Do, T.T., Tan, D.K.L., Cheung, N.M.: Selective deep convolutional features for image retrieval. In: ACM, pp. 1600–1608 (2017) Hoang, T., Do, T.T., Tan, D.K.L., Cheung, N.M.: Selective deep convolutional features for image retrieval. In: ACM, pp. 1600–1608 (2017)
11.
Zurück zum Zitat Xu, J., Shi, C.Z., Qi, C.Z., Wang, C.H., Xiao, B.H.: Unsupervised part-based weighting aggregation of deep convolutional features for image retrieval. In: AAAI2018 (2018) Xu, J., Shi, C.Z., Qi, C.Z., Wang, C.H., Xiao, B.H.: Unsupervised part-based weighting aggregation of deep convolutional features for image retrieval. In: AAAI2018 (2018)
12.
Zurück zum Zitat Razavian, A.S., Azizpour, H., Sullivan, J., Carlsson, S.: CNN features off-the-shelf: an astounding baseline for recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Workshops, pp. 512–519 (2014) Razavian, A.S., Azizpour, H., Sullivan, J., Carlsson, S.: CNN features off-the-shelf: an astounding baseline for recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Workshops, pp. 512–519 (2014)
13.
Zurück zum Zitat Babenko, A., Slesarev, A., Chigorin, A., Lempitsky, V.: Neural codes for image retrieval. In: Proceedings of the European Conference on Computer Vision, pp. 584–599 (2014) Babenko, A., Slesarev, A., Chigorin, A., Lempitsky, V.: Neural codes for image retrieval. In: Proceedings of the European Conference on Computer Vision, pp. 584–599 (2014)
14.
Zurück zum Zitat Lin, K., Yang, H.F., Hsiao, J.H., Chen, C.S.: Deep learning of binary hash codes for fast image retrieval. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Workshops, pp. 25–37 (2015) Lin, K., Yang, H.F., Hsiao, J.H., Chen, C.S.: Deep learning of binary hash codes for fast image retrieval. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. Workshops, pp. 25–37 (2015)
15.
Zurück zum Zitat Wang, J., Song, Y., Leung, T., Rosenberg, C., Wang, J.: Learning fine-grained image similarity with deep ranking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1386–1393 (2014) Wang, J., Song, Y., Leung, T., Rosenberg, C., Wang, J.: Learning fine-grained image similarity with deep ranking. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1386–1393 (2014)
16.
Zurück zum Zitat Chen, Q., Huang, J., Feris, R., Brown, LM., Dong, J.: Deep domain adaptation for describing people based on fine-grained clothing attributes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5315–5324 (2015) Chen, Q., Huang, J., Feris, R., Brown, LM., Dong, J.: Deep domain adaptation for describing people based on fine-grained clothing attributes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 5315–5324 (2015)
17.
Zurück zum Zitat Mairal, J., Koniusz, P., Harchaoui, Z., Schmid, C.: Convolutional kernel networks. In: International Conference on Neural Information Processing Systems. MIT Press, pp. 2627–2635 (2014) Mairal, J., Koniusz, P., Harchaoui, Z., Schmid, C.: Convolutional kernel networks. In: International Conference on Neural Information Processing Systems. MIT Press, pp. 2627–2635 (2014)
18.
Zurück zum Zitat Paulin, M., Douze, M., Harchaoui, Z., Mairal, J., Perronin, F.: Local convolutional features with unsupervised training for image retrieval. In: IEEE International Conference on Computer Vision, pp. 91–99 (2015) Paulin, M., Douze, M., Harchaoui, Z., Mairal, J., Perronin, F.: Local convolutional features with unsupervised training for image retrieval. In: IEEE International Conference on Computer Vision, pp. 91–99 (2015)
19.
Zurück zum Zitat Krishna, R., Zhu, Y., Groth, O., Johnson, J., Hata, K.: Visual genome: connecting language and vision using crowdsourced dense image annotations. Int. J. Comput. Vis. 123(1), 32–73 (2017)MathSciNetCrossRef Krishna, R., Zhu, Y., Groth, O., Johnson, J., Hata, K.: Visual genome: connecting language and vision using crowdsourced dense image annotations. Int. J. Comput. Vis. 123(1), 32–73 (2017)MathSciNetCrossRef
20.
Zurück zum Zitat Kalantidis, Y., Mellina, C., Osindero, S.: Cross-dimensional weighting for aggregated deep convolutional features. In: Proceedings of the European Conference on Computer Vision. Workshops, pp. 685–701 (2016) Kalantidis, Y., Mellina, C., Osindero, S.: Cross-dimensional weighting for aggregated deep convolutional features. In: Proceedings of the European Conference on Computer Vision. Workshops, pp. 685–701 (2016)
21.
Zurück zum Zitat Tolias, G., Sicre, R., Jegou, H.: Particular object retrieval with integral maxpooling of CNN activations. In: Proceedings of the International Conference on Learning Representations, pp. 1–12 (2016) Tolias, G., Sicre, R., Jegou, H.: Particular object retrieval with integral maxpooling of CNN activations. In: Proceedings of the International Conference on Learning Representations, pp. 1–12 (2016)
22.
Zurück zum Zitat Jegou, H., Zisserman, A.: Triangulation embedding and democratic aggregation for image search. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3310–3317 (2014) Jegou, H., Zisserman, A.: Triangulation embedding and democratic aggregation for image search. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3310–3317 (2014)
23.
Zurück zum Zitat Razavian, A.S., Sullivan, J., Maki, A., Carlsson, S.: Visual instance retrieval with deep convolutional networks (2014). arXiv:1412.6574 Razavian, A.S., Sullivan, J., Maki, A., Carlsson, S.: Visual instance retrieval with deep convolutional networks (2014). arXiv:​1412.​6574
Metadaten
Titel
Image Retrieval Research Based on Significant Regions
verfasst von
Jie Xu
Shuwei Sheng
Yuhao Cai
Yin Bian
Du Xu
Copyright-Jahr
2019
DOI
https://doi.org/10.1007/978-3-030-06161-6_12