Skip to main content
Erschienen in: International Journal of Machine Learning and Cybernetics 9/2023

25.03.2023 | Original Article

Deep hashing via multi-scale and multi-directional pooling for image retrieval

verfasst von: Yunbo Rao, Wang Zhou, Shaoning Zeng, Junmin Xue

Erschienen in: International Journal of Machine Learning and Cybernetics | Ausgabe 9/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Deep Hashing methods have been widely used for large-scale image retrieval due to its advantages in retrieval efficiency and accuracy. Recent methods cannot effectively capture the scale variation and complex distribution of image features in the feature extraction process. As a result, these methods were easily affected by the environment and the retrieval accuracy is not high. In this regard, we propose multi-scale and multi-direction pooling for deep hashing (MMDH). Specifically, the proposed method uses a multi-direction strip pooling module (MSPM) so that it can also fuse the features in the diagonal direction. Also, we combine MSPM with pyramid pooling which capture multi-scale features to build a hybrid pooling module. The input image is subjected to the hybrid pooling operation to extract multi-scale and multi-direction image features which can help the network improve its feature extraction ability. The atrous spatial pyramid pooling operation is used to retain the multi-scale features further to help the model analyze the image structure flexibly and effectively. The experimental results show that the proposed method can perform well on two public datasets. In addition, to verify the generality of the results, we also conducted experiments on the cloth dataset Fabric. The results prove that the proposed method can better extract multi-scale and multi-direction image feature information in various situations.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
1.
Zurück zum Zitat Revaud J, Almazán J, Rezende RS, Souza CR (2019) Learning with average precision: Training image retrieval with a listwise loss. In Proceedings of the IEEE/CVF International Conference on Computer Vision. p 5107–5116 Revaud J, Almazán J, Rezende RS, Souza CR (2019) Learning with average precision: Training image retrieval with a listwise loss. In Proceedings of the IEEE/CVF International Conference on Computer Vision. p 5107–5116
2.
Zurück zum Zitat Vo N, Jiang L, Sun C, Murphy K, Li L-J, Fei-Fei L, Hays J (2019) Composing text and image for image retrieval-an empirical odyssey. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. p 6439–6448 Vo N, Jiang L, Sun C, Murphy K, Li L-J, Fei-Fei L, Hays J (2019) Composing text and image for image retrieval-an empirical odyssey. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. p 6439–6448
3.
Zurück zum Zitat Radenović F, Iscen A, Tolias G, Avrithis Y, Chum O (2018) Revisiting oxford and paris: Large-scale image retrieval benchmarking. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. p 5706–5715 Radenović F, Iscen A, Tolias G, Avrithis Y, Chum O (2018) Revisiting oxford and paris: Large-scale image retrieval benchmarking. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. p 5706–5715
4.
Zurück zum Zitat Haomiao L, Ruiping W, Shiguang S, Xilin C (2016) Deep supervised hashing for fast image retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. p 2064–2072 Haomiao L, Ruiping W, Shiguang S, Xilin C (2016) Deep supervised hashing for fast image retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. p 2064–2072
5.
Zurück zum Zitat Rui Y, Huang T (2000) Optimizing learning in image retrieval. In Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No. PR00662), 1:236–243. IEEE Rui Y, Huang T (2000) Optimizing learning in image retrieval. In Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No. PR00662), 1:236–243. IEEE
6.
Zurück zum Zitat Li J, Wang JZ, Wiederhold G (2000) . Irm: Integrated region matching for image retrieval. In Proceedings of the eighth ACM International Conference on Multimedia. p 147–156 Li J, Wang JZ, Wiederhold G (2000) . Irm: Integrated region matching for image retrieval. In Proceedings of the eighth ACM International Conference on Multimedia. p 147–156
7.
Zurück zum Zitat Schmid C, Mohr R (1997) Local grayvalue invariants for image retrieval. IEEE Trans Pattern Anal Mach Intel 19(5):530–535CrossRef Schmid C, Mohr R (1997) Local grayvalue invariants for image retrieval. IEEE Trans Pattern Anal Mach Intel 19(5):530–535CrossRef
8.
Zurück zum Zitat Jain AK, Vailaya A (1996) Image retrieval using color and shape. Pattern Recognit 29(8):1233–1244CrossRef Jain AK, Vailaya A (1996) Image retrieval using color and shape. Pattern Recognit 29(8):1233–1244CrossRef
9.
Zurück zum Zitat Xia R, Pan Y, Lai H, Liu C, Yan S (2014) Supervised hashing for image retrieval via image representation learning. In Twenty-eighth AAAI Conference on Artificial Intelligence 3:2156–2162 Xia R, Pan Y, Lai H, Liu C, Yan S (2014) Supervised hashing for image retrieval via image representation learning. In Twenty-eighth AAAI Conference on Artificial Intelligence 3:2156–2162
10.
Zurück zum Zitat Erin LV, Lu J, Wang G, Moulin P, Zhou J (2015) Deep hashing for compact binary codes learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. p 2475–2483 Erin LV, Lu J, Wang G, Moulin P, Zhou J (2015) Deep hashing for compact binary codes learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. p 2475–2483
11.
Zurück zum Zitat Lai H, Pan Y, Liu Y, Yan S (2015) Simultaneous feature learning and hash coding with deep neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. p 3270–3278 Lai H, Pan Y, Liu Y, Yan S (2015) Simultaneous feature learning and hash coding with deep neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. p 3270–3278
12.
Zurück zum Zitat Hou Q, Zhang L, Cheng M-M, Feng J (2020) Strip pooling: Rethinking spatial pooling for scene parsing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. p 4003–4012 Hou Q, Zhang L, Cheng M-M, Feng J (2020) Strip pooling: Rethinking spatial pooling for scene parsing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. p 4003–4012
13.
Zurück zum Zitat He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. p 770–778 He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. p 770–778
14.
Zurück zum Zitat Chua T-S, Tang J, Hong R, Li H, Luo Z, Zheng Y (2009) Nus-wide: a real-world web image database from national university of singapore. In Proceedings of the ACM International Conference on Image and Video Retrieval. p 1–9 Chua T-S, Tang J, Hong R, Li H, Luo Z, Zheng Y (2009) Nus-wide: a real-world web image database from national university of singapore. In Proceedings of the ACM International Conference on Image and Video Retrieval. p 1–9
15.
Zurück zum Zitat Alex K, Hinton G et al (2009) Learning multiple layers of features from tiny images. Comput Sci Dep Univ Toronto Tech Rep 1:2009 Alex K, Hinton G et al (2009) Learning multiple layers of features from tiny images. Comput Sci Dep Univ Toronto Tech Rep 1:2009
16.
Zurück zum Zitat Indyk P, Motwani R (1998) Approximate nearest neighbors: towards removing the curse of dimensionality. In Proceedings of the thirtieth annual ACM symposium on Theory of computing. p 604–613 Indyk P, Motwani R (1998) Approximate nearest neighbors: towards removing the curse of dimensionality. In Proceedings of the thirtieth annual ACM symposium on Theory of computing. p 604–613
17.
Zurück zum Zitat Gionis A, Indyk P, Motwani R et al (1999) Similarity search in high dimensions via hashing. Vldb 99:518–529 Gionis A, Indyk P, Motwani R et al (1999) Similarity search in high dimensions via hashing. Vldb 99:518–529
18.
Zurück zum Zitat Lv Q, Josephson W, Wang Z, Charikar M, Li K (2007) Multi-probe lsh: efficient indexing for high-dimensional similarity search. In 33rd International Conference on Very Large Data Bases, VLDB 2007. p 950–961. Association for Computing Machinery, Inc Lv Q, Josephson W, Wang Z, Charikar M, Li K (2007) Multi-probe lsh: efficient indexing for high-dimensional similarity search. In 33rd International Conference on Very Large Data Bases, VLDB 2007. p 950–961. Association for Computing Machinery, Inc
19.
Zurück zum Zitat Bai X, Yang H, Zhou J, Ren P, Cheng J (2014) Data-dependent hashing based on p-stable distribution. IEEE Trans Image Process 23(12):5033–5046MathSciNetCrossRefMATH Bai X, Yang H, Zhou J, Ren P, Cheng J (2014) Data-dependent hashing based on p-stable distribution. IEEE Trans Image Process 23(12):5033–5046MathSciNetCrossRefMATH
20.
Zurück zum Zitat Andoni A, Razenshteyn I (2015) Optimal data-dependent hashing for approximate near neighbors. In Proceedings of the forty-seventh annual ACM symposium on Theory of computing. p 793–801 Andoni A, Razenshteyn I (2015) Optimal data-dependent hashing for approximate near neighbors. In Proceedings of the forty-seventh annual ACM symposium on Theory of computing. p 793–801
21.
Zurück zum Zitat Bai C, Li H, Zhang J, Huang L, Zhang L (2021) Unsupervised adversarial instance-level image retrieval. IEEE Trans Multimed 23:2199–2207CrossRef Bai C, Li H, Zhang J, Huang L, Zhang L (2021) Unsupervised adversarial instance-level image retrieval. IEEE Trans Multimed 23:2199–2207CrossRef
22.
Zurück zum Zitat Sun Y, Ye Y, Li X, Feng S, Zhang B, Kang Jian, Dai Kuai (2022) Unsupervised deep hashing through learning soft pseudo label for remote sensing image retrieval. Knowl Based Syst 239:107807CrossRef Sun Y, Ye Y, Li X, Feng S, Zhang B, Kang Jian, Dai Kuai (2022) Unsupervised deep hashing through learning soft pseudo label for remote sensing image retrieval. Knowl Based Syst 239:107807CrossRef
23.
Zurück zum Zitat Zakariya SM, Atif JM (2021) Unsupervised content based image retrieval at different precision level by combining multiple features. J Phys 1950:012059 (IOP Publishing) Zakariya SM, Atif JM (2021) Unsupervised content based image retrieval at different precision level by combining multiple features. J Phys 1950:012059 (IOP Publishing)
24.
Zurück zum Zitat Wenjin H, Lifang W, Jian M, Chen Y, Hui Y (2021) Cosine metric supervised deep hashing with balanced similarity. Neurocomputing 448:94–105CrossRef Wenjin H, Lifang W, Jian M, Chen Y, Hui Y (2021) Cosine metric supervised deep hashing with balanced similarity. Neurocomputing 448:94–105CrossRef
25.
Zurück zum Zitat Wang X, Zhang Z, Wu B, Shen F, Lu G (2021) Prototype-supervised adversarial network for targeted attack of deep hashing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. p 16357–16366 Wang X, Zhang Z, Wu B, Shen F, Lu G (2021) Prototype-supervised adversarial network for targeted attack of deep hashing. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. p 16357–16366
26.
Zurück zum Zitat Jiang Q-Y, Li W-J (2018) Asymmetric deep supervised hashing. In Proceedings of the AAAI Conference on Artificial Intelligence. 32 Jiang Q-Y, Li W-J (2018) Asymmetric deep supervised hashing. In Proceedings of the AAAI Conference on Artificial Intelligence. 32
27.
Zurück zum Zitat Weiss Y, Torralba A (2009) Fergus R (2008) Spectral hashing. In Advances in Neural Information Processing Systems. 21:1753–1760 Weiss Y, Torralba A (2009) Fergus R (2008) Spectral hashing. In Advances in Neural Information Processing Systems. 21:1753–1760
28.
Zurück zum Zitat Gong Y, Lazebnik S, Gordo A, Perronnin F (2012) Iterative quantization: a procrustean approach to learning binary codes for large-scale image retrieval. IEEE Trans Pattern Anal Mach Intel 35(12):2916–2929CrossRef Gong Y, Lazebnik S, Gordo A, Perronnin F (2012) Iterative quantization: a procrustean approach to learning binary codes for large-scale image retrieval. IEEE Trans Pattern Anal Mach Intel 35(12):2916–2929CrossRef
29.
Zurück zum Zitat Norouzi M, Fleet D (2011) Minimal loss hashing for compact binary codes. In International Conference of Machine Learning. 1: p 353–360 Norouzi M, Fleet D (2011) Minimal loss hashing for compact binary codes. In International Conference of Machine Learning. 1: p 353–360
30.
Zurück zum Zitat Wang J, Kumar S, Chang S-F (2012) Semi-supervised hashing for large-scale search. IEEE Trans Pattern Anal Mach Intel 34(12):2393–2406CrossRef Wang J, Kumar S, Chang S-F (2012) Semi-supervised hashing for large-scale search. IEEE Trans Pattern Anal Mach Intel 34(12):2393–2406CrossRef
31.
Zurück zum Zitat Liu W, Wang J, Ji R, Jiang Y-G, Chang S-F (2012) Supervised hashing with kernels. In 2012 IEEE Conference on Computer Vision and Pattern Recognition. p 2074–2081. IEEE Liu W, Wang J, Ji R, Jiang Y-G, Chang S-F (2012) Supervised hashing with kernels. In 2012 IEEE Conference on Computer Vision and Pattern Recognition. p 2074–2081. IEEE
32.
Zurück zum Zitat Kulis B, Darrell T (2009) Learning to hash with binary reconstructive embeddings. Adv Neural Inf Process Syst 22:2009 Kulis B, Darrell T (2009) Learning to hash with binary reconstructive embeddings. Adv Neural Inf Process Syst 22:2009
33.
Zurück zum Zitat Liu W, Cun M, Kumar S, Chang S-F (2014) Discrete graph hashing. Adv Neural Inf Process Syst 27:2014 Liu W, Cun M, Kumar S, Chang S-F (2014) Discrete graph hashing. Adv Neural Inf Process Syst 27:2014
34.
Zurück zum Zitat Shen F, Shen C, Liu W, Tao SH (2015) Supervised discrete hashing. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. p 37–45 Shen F, Shen C, Liu W, Tao SH (2015) Supervised discrete hashing. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. p 37–45
35.
Zurück zum Zitat Unar S, Wang X, Wang C, Wang Y (2019) A decisive content based image retrieval approach for feature fusion in visual and textual images. Knowl Based Syst 179:8–20CrossRef Unar S, Wang X, Wang C, Wang Y (2019) A decisive content based image retrieval approach for feature fusion in visual and textual images. Knowl Based Syst 179:8–20CrossRef
36.
Zurück zum Zitat Zahid M, Fakhar A, Toqeer M, Arshad JM, Amjad Rehman, Tabassam Nawaz (2018) Content-based image retrieval based on visual words fusion versus features fusion of local and global features. Arab J Sci Eng 43(12):7265–7284CrossRef Zahid M, Fakhar A, Toqeer M, Arshad JM, Amjad Rehman, Tabassam Nawaz (2018) Content-based image retrieval based on visual words fusion versus features fusion of local and global features. Arab J Sci Eng 43(12):7265–7284CrossRef
37.
Zurück zum Zitat Chen Y, Zhao D, Xiongbo L, Xiong S, Wang H (2022) Unsupervised balanced hash codes learning with multichannel feature fusion. IEEE J Selected Topics Appl Earth Observations Remote Sensing 15:2816–2825CrossRef Chen Y, Zhao D, Xiongbo L, Xiong S, Wang H (2022) Unsupervised balanced hash codes learning with multichannel feature fusion. IEEE J Selected Topics Appl Earth Observations Remote Sensing 15:2816–2825CrossRef
38.
Zurück zum Zitat Yang Z, Ge Y, Huang Z, Xiong C (2021) Supervised hashing with kernel based on feature fusion for remote sensing image retrieval. In 2021 IEEE 2nd International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE). p 470–475. IEEE Yang Z, Ge Y, Huang Z, Xiong C (2021) Supervised hashing with kernel based on feature fusion for remote sensing image retrieval. In 2021 IEEE 2nd International Conference on Big Data, Artificial Intelligence and Internet of Things Engineering (ICBAIE). p 470–475. IEEE
39.
Zurück zum Zitat Li J, En Y, Ma J, Chang X, Zhang H, Sun Jiande (2022) Discrete fusion adversarial hashing for cross-modal retrieval. Knowl Based Syst 253:109503CrossRef Li J, En Y, Ma J, Chang X, Zhang H, Sun Jiande (2022) Discrete fusion adversarial hashing for cross-modal retrieval. Knowl Based Syst 253:109503CrossRef
40.
Zurück zum Zitat Zhao H, Shi J, Qi X, Wang X, Jia J (2017) Pyramid scene parsing network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. p 2881–2890 Zhao H, Shi J, Qi X, Wang X, Jia J (2017) Pyramid scene parsing network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. p 2881–2890
41.
Zurück zum Zitat Liang-Chieh C, George P, Iasonas K, Kevin M, Yuille Alan L (2017) Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans Pattern Anal Mach Intel 40(4):834–848 Liang-Chieh C, George P, Iasonas K, Kevin M, Yuille Alan L (2017) Deeplab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans Pattern Anal Mach Intel 40(4):834–848
42.
Zurück zum Zitat Cao Z, Long M, Wang J, Yu P (2017) Hashnet: Deep learning to hash by continuation. In Proceedings of the IEEE international conference on computer vision. p 5608–5617 Cao Z, Long M, Wang J, Yu P (2017) Hashnet: Deep learning to hash by continuation. In Proceedings of the IEEE international conference on computer vision. p 5608–5617
43.
Zurück zum Zitat Su S, Zhang C, Han K (2018) Tian Y (2018) Greedy hash: Towards fast optimization for accurate hash coding in cnn. In Advances in Neural Information Processing Systems. 31:798–807 Su S, Zhang C, Han K (2018) Tian Y (2018) Greedy hash: Towards fast optimization for accurate hash coding in cnn. In Advances in Neural Information Processing Systems. 31:798–807
44.
Zurück zum Zitat Li Yunqiang, Pei Wenjie, Zha Yufei, Gemert Jan van, et al. (2019) Push for Quantization: Deep fisher hashing. In Proceedings of the British Machine Vision Conference (BMVC). p 182.1–182.12  Li Yunqiang,  Pei Wenjie,  Zha Yufei,  Gemert Jan van, et al. (2019) Push for Quantization: Deep fisher hashing. In Proceedings of the British Machine Vision Conference (BMVC). p 182.1–182.12
45.
Zurück zum Zitat Zheng X, Zhang Y, Xiaoqiang L (2020) Deep balanced discrete hashing for image retrieval. Neurocomputing 403:224–236CrossRef Zheng X, Zhang Y, Xiaoqiang L (2020) Deep balanced discrete hashing for image retrieval. Neurocomputing 403:224–236CrossRef
Metadaten
Titel
Deep hashing via multi-scale and multi-directional pooling for image retrieval
verfasst von
Yunbo Rao
Wang Zhou
Shaoning Zeng
Junmin Xue
Publikationsdatum
25.03.2023
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal of Machine Learning and Cybernetics / Ausgabe 9/2023
Print ISSN: 1868-8071
Elektronische ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-023-01819-4

Weitere Artikel der Ausgabe 9/2023

International Journal of Machine Learning and Cybernetics 9/2023 Zur Ausgabe

Neuer Inhalt