Skip to main content
Erschienen in: International Journal of Machine Learning and Cybernetics 1/2022

27.04.2021 | Original Article

Triplet-object loss for large scale deep image retrieval

verfasst von: Jie Zhu, Yang Shu, Junsan Zhang, Xuanye Wang, Shufang Wu

Erschienen in: International Journal of Machine Learning and Cybernetics | Ausgabe 1/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Deep hashing has been widely applied in large scale image retrieval due to its high computation efficiency and retrieval performance. Recently, training deep hashing networks with a triplet ranking loss become a common framework. However, most of the triplet ranking loss based deep hashing methods cannot obtain satisfactory retrieval performance due to their ignoring the relative similarities among the objects. In this paper, we propose a method to learn the discriminative object features and utilize these features to compute the adaptive margins of the proposed loss for learning powerful hash codes. Experimental results show that our learned hash codes can yield state-of-the-art retrieval performance on three challenging datasets

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
1.
Zurück zum Zitat Habimana O, Li Y, Li R et al (2020) Attentive convolutional gated recurrent network: a contextual model to sentiment analysis. Int J Mach Learn Cybern 11:2637–2651CrossRef Habimana O, Li Y, Li R et al (2020) Attentive convolutional gated recurrent network: a contextual model to sentiment analysis. Int J Mach Learn Cybern 11:2637–2651CrossRef
2.
Zurück zum Zitat Zhu J, Wu S, Zhu H et al (2019) Multi-center convolutional descriptor aggregation for image retrieval. Int J Mach Learn Cybern 10(7):1863–1873CrossRef Zhu J, Wu S, Zhu H et al (2019) Multi-center convolutional descriptor aggregation for image retrieval. Int J Mach Learn Cybern 10(7):1863–1873CrossRef
3.
Zurück zum Zitat Yao T, Long F, Mei T, et al (2016) Deep semantic-preserving and ranking-based hashing for image retrieval. In: International joint conference on artificial intelligence, pp 3931–3937 Yao T, Long F, Mei T, et al (2016) Deep semantic-preserving and ranking-based hashing for image retrieval. In: International joint conference on artificial intelligence, pp 3931–3937
4.
Zurück zum Zitat Lai H, Pan Y, Liu Y, et al (2015) Simultaneous feature learning and hash coding with deep neural networks. In: Computer vision and pattern recognition, pp 3270–3278 Lai H, Pan Y, Liu Y, et al (2015) Simultaneous feature learning and hash coding with deep neural networks. In: Computer vision and pattern recognition, pp 3270–3278
5.
Zurück zum Zitat Jin X, Sun W, Jin Z (2020) A discriminative deep association learning for facial expression recognition. Int J Mach Learn Cybern 11(4):779–793CrossRef Jin X, Sun W, Jin Z (2020) A discriminative deep association learning for facial expression recognition. Int J Mach Learn Cybern 11(4):779–793CrossRef
6.
Zurück zum Zitat Kalantidis Y, Mellina C, Osindero S (2016) Cross-dimensional weighting for aggregated deep convolutional features. In: European conference on computer vision, pp 685–701 Kalantidis Y, Mellina C, Osindero S (2016) Cross-dimensional weighting for aggregated deep convolutional features. In: European conference on computer vision, pp 685–701
7.
Zurück zum Zitat Xu J, Shi C, Chengzuo Q, et al (2018) Unsupervised part-based weighting aggregation of deep convolutional features for image retrieval. In: AAAI Conference on artificial intelligence, pp 7436–7443 Xu J, Shi C, Chengzuo Q, et al (2018) Unsupervised part-based weighting aggregation of deep convolutional features for image retrieval. In: AAAI Conference on artificial intelligence, pp 7436–7443
8.
Zurück zum Zitat Shen F, Shen C, Liu W, et al (2015) Supervised discrete hashing. In: Computer vision and pattern recognition, pp 37–45 Shen F, Shen C, Liu W, et al (2015) Supervised discrete hashing. In: Computer vision and pattern recognition, pp 37–45
9.
Zurück zum Zitat Ng WWY, Jiang X, Tian X et al (2020) Incremental hashing with sample selection using dominant sets. Int J Mach Learn Cybern 11(12):2689–2702CrossRef Ng WWY, Jiang X, Tian X et al (2020) Incremental hashing with sample selection using dominant sets. Int J Mach Learn Cybern 11(12):2689–2702CrossRef
10.
Zurück zum Zitat Li S, Chen Z, Lu J, et al (2019) Neighborhood preserving hashing for scalable video retrieval. In: International conference on computer vision, pp: 8212–8221. Li S, Chen Z, Lu J, et al (2019) Neighborhood preserving hashing for scalable video retrieval. In: International conference on computer vision, pp: 8212–8221.
11.
Zurück zum Zitat Li J, Ng WW, Tian X et al (2020) Weighted multi-deep ranking supervised hashing for efficient image retrieval. Int J Mach Learn Cybern 11(4):883–897CrossRef Li J, Ng WW, Tian X et al (2020) Weighted multi-deep ranking supervised hashing for efficient image retrieval. Int J Mach Learn Cybern 11(4):883–897CrossRef
12.
Zurück zum Zitat Zhang J, Peng Y (2018) Query-adaptive image retrieval by deep-weighted hashing. IEEE T Multim 20(9):2400–2414CrossRef Zhang J, Peng Y (2018) Query-adaptive image retrieval by deep-weighted hashing. IEEE T Multim 20(9):2400–2414CrossRef
13.
Zurück zum Zitat Deng C, Yang E, Liu T et al (2019) Unsupervised semantic-preserving adversarial hashing for image search. IEEE T Image Process 28(8):4032–4044MathSciNetCrossRef Deng C, Yang E, Liu T et al (2019) Unsupervised semantic-preserving adversarial hashing for image search. IEEE T Image Process 28(8):4032–4044MathSciNetCrossRef
14.
Zurück zum Zitat Do T, Doan A, Cheung N (2016) Learning to hash with binary deep neural network. In: European conference on computer vision, pp 219–234 Do T, Doan A, Cheung N (2016) Learning to hash with binary deep neural network. In: European conference on computer vision, pp 219–234
15.
Zurück zum Zitat Shen F, Xu Y, Liu L et al (2018) Unsupervised deep hashing with similarity-adaptive and discrete optimization. IEEE T Pattern Anal 40(12):3034–3044CrossRef Shen F, Xu Y, Liu L et al (2018) Unsupervised deep hashing with similarity-adaptive and discrete optimization. IEEE T Pattern Anal 40(12):3034–3044CrossRef
16.
Zurück zum Zitat Shen Y, Liu L, Shao L (2019) Unsupervised binary representation learning with deep variational networks. Int J Comput Vision 127:1614–1628CrossRef Shen Y, Liu L, Shao L (2019) Unsupervised binary representation learning with deep variational networks. Int J Comput Vision 127:1614–1628CrossRef
18.
Zurück zum Zitat Xia R, Pan Y, Lai H, et al (2014) Supervised hashing for image retrieval via image representation learning. In: National conference on artificial intelligence, pp 2156–2162 Xia R, Pan Y, Lai H, et al (2014) Supervised hashing for image retrieval via image representation learning. In: National conference on artificial intelligence, pp 2156–2162
19.
Zurück zum Zitat Wang D, Gao X, Wang X et al (2016) Multimodal discriminative binary embedding for large-scale cross-modal retrieval. IEEE T Image Process 25(10):4540–4554MathSciNetCrossRef Wang D, Gao X, Wang X et al (2016) Multimodal discriminative binary embedding for large-scale cross-modal retrieval. IEEE T Image Process 25(10):4540–4554MathSciNetCrossRef
20.
Zurück zum Zitat Xu X, Shen F, Yang Y et al (2017) Learning discriminative binary codes for large-scale cross-modal retrieval. IEEE T Image Process 26(5):2494–2507MathSciNetCrossRef Xu X, Shen F, Yang Y et al (2017) Learning discriminative binary codes for large-scale cross-modal retrieval. IEEE T Image Process 26(5):2494–2507MathSciNetCrossRef
21.
Zurück zum Zitat Li W, Wang S, Kang W (2016) Feature learning based deep supervised hashing with pairwise labels. In: International joint conference on artificial intelligence, pp 1711–1717 Li W, Wang S, Kang W (2016) Feature learning based deep supervised hashing with pairwise labels. In: International joint conference on artificial intelligence, pp 1711–1717
22.
Zurück zum Zitat Zhu H, Long M, Wang J, et al (2016) Deep hashing network for efficient similarity retrieval. In: National conference on artificial intelligence, pp 2415–2421 Zhu H, Long M, Wang J, et al (2016) Deep hashing network for efficient similarity retrieval. In: National conference on artificial intelligence, pp 2415–2421
23.
Zurück zum Zitat Liu H, Wang R, Shan S, Chen X (2016) Deep Supervised Hashing for Fast Image Retrieval. In: IEEE conference on computer vision and pattern recognition, pp. 2064–2072. Liu H, Wang R, Shan S, Chen X (2016) Deep Supervised Hashing for Fast Image Retrieval. In: IEEE conference on computer vision and pattern recognition, pp. 2064–2072.
24.
Zurück zum Zitat Junjie C, Cheung WK, Wang A (2018) Learning deep unsupervised binary codes for image retrieval. In: International joint conference on artificial intelligence, pp 613–619 Junjie C, Cheung WK, Wang A (2018) Learning deep unsupervised binary codes for image retrieval. In: International joint conference on artificial intelligence, pp 613–619
25.
Zurück zum Zitat Huang C, Loy CC, Tang X (2016) Unsupervised learning of discriminative attributes and visual representations. Computer vision and pattern recognition, pp 5175–5184 Huang C, Loy CC, Tang X (2016) Unsupervised learning of discriminative attributes and visual representations. Computer vision and pattern recognition, pp 5175–5184
26.
Zurück zum Zitat Deng C, Chen Z, Liu X et al (2018) Triplet-based deep hashing network for cross-modal retrieval. IEEE T Image Process 27(8):3893–3903MathSciNetCrossRef Deng C, Chen Z, Liu X et al (2018) Triplet-based deep hashing network for cross-modal retrieval. IEEE T Image Process 27(8):3893–3903MathSciNetCrossRef
27.
Zurück zum Zitat Dai P, Ji R, Wang H, et al (2018) Cross-modality person re-identification with generative adversarial training. In: International joint conference on artificial intelligence, pp 677–683 Dai P, Ji R, Wang H, et al (2018) Cross-modality person re-identification with generative adversarial training. In: International joint conference on artificial intelligence, pp 677–683
29.
Zurück zum Zitat Norouzi M, Fleet D J, Salakhutdinov R (2012) Hamming distance metric learning. In: Neural information processing systems, pp 1061–1069 Norouzi M, Fleet D J, Salakhutdinov R (2012) Hamming distance metric learning. In: Neural information processing systems, pp 1061–1069
30.
Zurück zum Zitat Chen W, Chen X, Zhang J, et al (2017) Beyond triplet loss: a deep quadruplet network for person re-identification. In: Computer vision and pattern recognition, pp 1320–1329 Chen W, Chen X, Zhang J, et al (2017) Beyond triplet loss: a deep quadruplet network for person re-identification. In: Computer vision and pattern recognition, pp 1320–1329
31.
Zurück zum Zitat Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arxiv: 1409.1556 Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arxiv: 1409.1556
32.
Zurück zum Zitat Tolias G, Sicre R, Jegou H (2016) Particular object retrieval with integral max-pooling of CNN activations. International conference on learning representations, pp 1–12 Tolias G, Sicre R, Jegou H (2016) Particular object retrieval with integral max-pooling of CNN activations. International conference on learning representations, pp 1–12
33.
Zurück zum Zitat Wei X, Luo J, Wu J et al (2017) Selective convolutional descriptor aggregation for fine-grained image retrieval. IEEE T Image Process 26(6):2868–2881MathSciNetCrossRef Wei X, Luo J, Wu J et al (2017) Selective convolutional descriptor aggregation for fine-grained image retrieval. IEEE T Image Process 26(6):2868–2881MathSciNetCrossRef
34.
Zurück zum Zitat Fu Z, Robleskelly A, Zhou J (2011) Milis: multiple instance learning with instance selection. IEEE T Pattern Anal 33(5):958–977CrossRef Fu Z, Robleskelly A, Zhou J (2011) Milis: multiple instance learning with instance selection. IEEE T Pattern Anal 33(5):958–977CrossRef
35.
Zurück zum Zitat Krizhevsky A (2009) Learning multiple layers of features from tiny images, Technical Report, Computer Science Department, University of Toronto Krizhevsky A (2009) Learning multiple layers of features from tiny images, Technical Report, Computer Science Department, University of Toronto
36.
Zurück zum Zitat Chua T, Tang J, Hong R, Li H (2009) Nus-wide: a real-world web image database from national university of Singapore. In: ACM conference on image and video retrieval, pp 1–9 Chua T, Tang J, Hong R, Li H (2009) Nus-wide: a real-world web image database from national university of Singapore. In: ACM conference on image and video retrieval, pp 1–9
37.
Zurück zum Zitat Netzer Y, Wang T, Coates A (2012) Reading digits in natural images with unsupervised feature learning. In: NIPS Workshop on Deep Learning and Unsupervised Feature Learning, pp 1–9 Netzer Y, Wang T, Coates A (2012) Reading digits in natural images with unsupervised feature learning. In: NIPS Workshop on Deep Learning and Unsupervised Feature Learning, pp 1–9
38.
Zurück zum Zitat Gong Y, Lazebnik S, Gordo A et al (2013) Iterative quantization: a procrustean approach to learning binary codes for large-scale image retrieval. IEEE T Pattern Anal 35(12):2916–2929CrossRef Gong Y, Lazebnik S, Gordo A et al (2013) Iterative quantization: a procrustean approach to learning binary codes for large-scale image retrieval. IEEE T Pattern Anal 35(12):2916–2929CrossRef
39.
Zurück zum Zitat Zha Y, Qiu Z, Zhang P et al (2020) Unsupervised ensemble hashing: boosting minimum hamming distance. IEEE Access: 42937–42947 Zha Y, Qiu Z, Zhang P et al (2020) Unsupervised ensemble hashing: boosting minimum hamming distance. IEEE Access: 42937–42947
40.
Zurück zum Zitat Liu W, Wang J, Ji R, et al (2012) Supervised hashing with kernels. In: Computer vision and pattern recognition, pp 2074–2081 Liu W, Wang J, Ji R, et al (2012) Supervised hashing with kernels. In: Computer vision and pattern recognition, pp 2074–2081
41.
Zurück zum Zitat Yu T, Yuan J, Fang C, et al (2018) Product quantization network for fast image retrieval. In: European conference on computer vision, pp 191–206 Yu T, Yuan J, Fang C, et al (2018) Product quantization network for fast image retrieval. In: European conference on computer vision, pp 191–206
42.
Zurück zum Zitat Cao Y, Long M, Liu B et al (2018) Deep cauchy hashing for hamming space retrieval. In: Computer vision and pattern recognition, pp 1229–1237 Cao Y, Long M, Liu B et al (2018) Deep cauchy hashing for hamming space retrieval. In: Computer vision and pattern recognition, pp 1229–1237
44.
Zurück zum Zitat Freund Y, Schapire RE (1996) Experiments with a new boosting algorithm. In: International conference on machine learning, pp 148–156 Freund Y, Schapire RE (1996) Experiments with a new boosting algorithm. In: International conference on machine learning, pp 148–156
45.
Zurück zum Zitat Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Neural information processing systems, pp 1097–1105 Krizhevsky A, Sutskever I, Hinton GE (2012) ImageNet classification with deep convolutional neural networks. In: Neural information processing systems, pp 1097–1105
46.
Zurück zum Zitat Lin M, Chen Q, Yan S (2013) Network in network. arXiv:1312.4400 Lin M, Chen Q, Yan S (2013) Network in network. arXiv:1312.4400
Metadaten
Titel
Triplet-object loss for large scale deep image retrieval
verfasst von
Jie Zhu
Yang Shu
Junsan Zhang
Xuanye Wang
Shufang Wu
Publikationsdatum
27.04.2021
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal of Machine Learning and Cybernetics / Ausgabe 1/2022
Print ISSN: 1868-8071
Elektronische ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-021-01330-8

Weitere Artikel der Ausgabe 1/2022

International Journal of Machine Learning and Cybernetics 1/2022 Zur Ausgabe