Skip to main content
Erschienen in: International Journal of Computer Vision 2/2021

26.09.2020

Deep Hashing with Hash-Consistent Large Margin Proxy Embeddings

verfasst von: Pedro Morgado, Yunsheng Li, Jose Costa Pereira, Mohammad Saberian, Nuno Vasconcelos

Erschienen in: International Journal of Computer Vision | Ausgabe 2/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Image hash codes are produced by binarizing the embeddings of convolutional neural networks (CNN) trained for either classification or retrieval. While proxy embeddings achieve good performance on both tasks, they are non-trivial to binarize, due to a rotational ambiguity that encourages non-binary embeddings. The use of a fixed set of proxies (weights of the CNN classification layer) is proposed to eliminate this ambiguity, and a procedure to design proxy sets that are nearly optimal for both classification and hashing is introduced. The resulting hash-consistent large margin (HCLM) proxies are shown to encourage saturation of hashing units, thus guaranteeing a small binarization error, while producing highly discriminative hash-codes. A semantic extension (sHCLM), aimed to improve hashing performance in a transfer scenario, is also proposed. Extensive experiments show that sHCLM embeddings achieve significant improvements over state-of-the-art hashing procedures on several small and large datasets, both within and beyond the set of training classes.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
Zurück zum Zitat Akata, Z., Perronnin, F., Harchaoui, Z., & Schmid, C. (2013). Label-embedding for attribute-based classification. In IEEE conference on computer vision and pattern recognition (CVPR). Akata, Z., Perronnin, F., Harchaoui, Z., & Schmid, C. (2013). Label-embedding for attribute-based classification. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Andoni, A., & Indyk, P. (2006). Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. In IEEE symposium on foundations of computer science (FOCS). Andoni, A., & Indyk, P. (2006). Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. In IEEE symposium on foundations of computer science (FOCS).
Zurück zum Zitat Babenko, A., Slesarev, A., Chigorin, A., & Lempitsky, V. (2014). Neural codes for image retrieval. In European conference on computer vision (ECCV). Babenko, A., Slesarev, A., Chigorin, A., & Lempitsky, V. (2014). Neural codes for image retrieval. In European conference on computer vision (ECCV).
Zurück zum Zitat Bach, J. R., Fuller, C., Gupta, A., Hampapur, A., Horowitz, B., Humphrey, R., Jain, R. C., & Shu, C. F. (1996). Virage image search engine: an open framework for image management. In Storage and retrieval for still image and video databases IV (Vol. 2670). Bach, J. R., Fuller, C., Gupta, A., Hampapur, A., Horowitz, B., Humphrey, R., Jain, R. C., & Shu, C. F. (1996). Virage image search engine: an open framework for image management. In Storage and retrieval for still image and video databases IV (Vol. 2670).
Zurück zum Zitat Banerjee, A., Merugu, S., Dhillon, I. S., & Ghosh, J. (2005). Clustering with bregman divergences. Journal of Machine Learning Research, 6, 1705–1749.MathSciNetMATH Banerjee, A., Merugu, S., Dhillon, I. S., & Ghosh, J. (2005). Clustering with bregman divergences. Journal of Machine Learning Research, 6, 1705–1749.MathSciNetMATH
Zurück zum Zitat Barndorff-Nielsen, O. (2014). Information and exponential families: In statistical theory. New York: Wiley. CrossRef Barndorff-Nielsen, O. (2014). Information and exponential families: In statistical theory. New York: Wiley. CrossRef
Zurück zum Zitat Bell, S., & Bala, K. (2015). Learning visual similarity for product design with convolutional neural networks. ACM Transactions on Graphics (TOG), 34(4), 1–10.CrossRef Bell, S., & Bala, K. (2015). Learning visual similarity for product design with convolutional neural networks. ACM Transactions on Graphics (TOG), 34(4), 1–10.CrossRef
Zurück zum Zitat Bregman, L. M. (1967). The relaxation method of finding the common point of convex sets and its application to the solution of problems in convex programming. USSR Computational Mathematics and Mathematical Physics, 7(3), 200–217. MathSciNetCrossRef Bregman, L. M. (1967). The relaxation method of finding the common point of convex sets and its application to the solution of problems in convex programming. USSR Computational Mathematics and Mathematical Physics, 7(3), 200–217. MathSciNetCrossRef
Zurück zum Zitat Cakir, F., He, K., Adel Bargal, S., & Sclaroff, S. (2017). Mihash: Online hashing with mutual information. In International conference on computer vision (ICCV). Cakir, F., He, K., Adel Bargal, S., & Sclaroff, S. (2017). Mihash: Online hashing with mutual information. In International conference on computer vision (ICCV).
Zurück zum Zitat Cakir, F., He, K., & Sclaroff, S. (2018). Hashing with binary matrix pursuit. In Proceedings of the European conference on computer vision (ECCV) (pp. 332–348). Cakir, F., He, K., & Sclaroff, S. (2018). Hashing with binary matrix pursuit. In Proceedings of the European conference on computer vision (ECCV) (pp. 332–348).
Zurück zum Zitat Cao, Y., Long, M., Wang, J., Zhu, H., & Wen, Q. (2016) Deep quantization network for efficient image retrieval. In AAAI Conference on Artificial Intelligence. Cao, Y., Long, M., Wang, J., Zhu, H., & Wen, Q. (2016) Deep quantization network for efficient image retrieval. In AAAI Conference on Artificial Intelligence.
Zurück zum Zitat Chopra, S., Hadsell, R., & LeCun, Y. (2005). Learning a similarity metric discriminatively, with application to face verification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR). Chopra, S., Hadsell, R., & LeCun, Y. (2005). Learning a similarity metric discriminatively, with application to face verification. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
Zurück zum Zitat Chua, T. S., Tang, J., Hong, R., Li, H., Luo, Z., & Zheng, Y. T. (2009). NUS-WIDE: A real-world web image database from national university of Singapore. In ACM Conferrence on Image and Video Retrieval (CIVR). Chua, T. S., Tang, J., Hong, R., Li, H., Luo, Z., & Zheng, Y. T. (2009). NUS-WIDE: A real-world web image database from national university of Singapore. In ACM Conferrence on Image and Video Retrieval (CIVR).
Zurück zum Zitat Datar, M., Immorlica, N., Indyk, P., & Mirrokni, V. S. (2004). Locality-sensitive hashing scheme based on p-stable distributions. In Symposium on computational geometry (SOCG). Datar, M., Immorlica, N., Indyk, P., & Mirrokni, V. S. (2004). Locality-sensitive hashing scheme based on p-stable distributions. In Symposium on computational geometry (SOCG).
Zurück zum Zitat Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L. (2009). Imagenet: A large-scale hierarchical image database. In IEEE conference on computer vision and pattern recognition (CVPR). Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L. (2009). Imagenet: A large-scale hierarchical image database. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Flickner, M., Sawhney, H., Niblack, W., Ashley, J., Huang, Q., Dom, B., et al. (1995). Query by image and video content: The QBIC system. Computer, 28(9), 23–32.CrossRef Flickner, M., Sawhney, H., Niblack, W., Ashley, J., Huang, Q., Dom, B., et al. (1995). Query by image and video content: The QBIC system. Computer, 28(9), 23–32.CrossRef
Zurück zum Zitat Frome, A., Corrado, G. S., Shlens, J., Bengio, S., Dean, J., Mikolov, T., et al. (2013). Devise: A deep visual-semantic embedding model. In advances in neural information processing systems (NIPS). Frome, A., Corrado, G. S., Shlens, J., Bengio, S., Dean, J., Mikolov, T., et al. (2013). Devise: A deep visual-semantic embedding model. In advances in neural information processing systems (NIPS).
Zurück zum Zitat Goldberger, J., Hinton, G. E., Roweis, S. T., & Salakhutdinov, R. R. (2005) Neighbourhood components analysis. In: Advances in neural information processing systems (NIPS). Goldberger, J., Hinton, G. E., Roweis, S. T., & Salakhutdinov, R. R. (2005) Neighbourhood components analysis. In: Advances in neural information processing systems (NIPS).
Zurück zum Zitat Gong, Y., Lazebnik, S., Gordo, A., & Perronnin, F. (2013). Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 35(12), 2916–2929.CrossRef Gong, Y., Lazebnik, S., Gordo, A., & Perronnin, F. (2013). Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 35(12), 2916–2929.CrossRef
Zurück zum Zitat Gordo, A., Almazán, J., Revaud, J., & Larlus, D. (2016). Deep image retrieval: Learning global representations for image search. In European conference on computer vision (ECCV) Gordo, A., Almazán, J., Revaud, J., & Larlus, D. (2016). Deep image retrieval: Learning global representations for image search. In European conference on computer vision (ECCV)
Zurück zum Zitat Hadsell, R., Chopra, S., & LeCun, Y. (2006). Dimensionality reduction by learning an invariant mapping. In IEEE conference on computer vision and pattern recognition (CVPR) . Hadsell, R., Chopra, S., & LeCun, Y. (2006). Dimensionality reduction by learning an invariant mapping. In IEEE conference on computer vision and pattern recognition (CVPR) .
Zurück zum Zitat He, K., Cakir, F., Bargal, S. A., & Sclaroff, S. (2018). Hashing as tie-aware learning to rank. In IEEE conference on computer vision and pattern recognition (CVPR) He, K., Cakir, F., Bargal, S. A., & Sclaroff, S. (2018). Hashing as tie-aware learning to rank. In IEEE conference on computer vision and pattern recognition (CVPR)
Zurück zum Zitat Huang, S., Xiong, Y., Zhang, Y., & Wang, J. (2017). Unsupervised triplet hashing for fast image retrieval. In Thematic workshops of ACM multimedia Huang, S., Xiong, Y., Zhang, Y., & Wang, J. (2017). Unsupervised triplet hashing for fast image retrieval. In Thematic workshops of ACM multimedia
Zurück zum Zitat Jain, H., Zepeda, J., Pérez, P., & Gribonval, R. (2017). Subic: A supervised, structured binary code for image search. In International conference on computer vision (ICCV) Jain, H., Zepeda, J., Pérez, P., & Gribonval, R. (2017). Subic: A supervised, structured binary code for image search. In International conference on computer vision (ICCV)
Zurück zum Zitat Jegou, H., Douze, M., & Schmid, C. (2011). Product quantization for nearest neighbor search. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 33(1), 117–128.CrossRef Jegou, H., Douze, M., & Schmid, C. (2011). Product quantization for nearest neighbor search. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 33(1), 117–128.CrossRef
Zurück zum Zitat Jiang, Q. Y., & Li, W. J. (2018). Asymmetric deep supervised hashing. In Thirty-second AAAI conference on artificial intelligence. Jiang, Q. Y., & Li, W. J. (2018). Asymmetric deep supervised hashing. In Thirty-second AAAI conference on artificial intelligence.
Zurück zum Zitat Krizhevsky, A. (2009). Learning multiple layers of features from tiny images. University of Toronto. Krizhevsky, A. (2009). Learning multiple layers of features from tiny images. University of Toronto.
Zurück zum Zitat Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems (NIPS). Krizhevsky, A., Sutskever, I., & Hinton, G. E. (2012). Imagenet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems (NIPS).
Zurück zum Zitat Kulis, B., & Darrell, T. (2009). Learning to hash with binary reconstructive embeddings. In Advances in neural information processing systems (NIPS) Kulis, B., & Darrell, T. (2009). Learning to hash with binary reconstructive embeddings. In Advances in neural information processing systems (NIPS)
Zurück zum Zitat Lai, H., Pan, Y., Liu, Y., & Yan, S. (2015). Simultaneous feature learning and hash coding with deep neural networks. In IEEE conference on computer vision and pattern recognition (CVPR). Lai, H., Pan, Y., Liu, Y., & Yan, S. (2015). Simultaneous feature learning and hash coding with deep neural networks. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Lampert, C. H., Nickisch, H., & Harmeling, S. (2009). Learning to detect unseen object classes by between-class attribute transfer. In IEEE conference on computer vision and pattern recognition (CVPR). Lampert, C. H., Nickisch, H., & Harmeling, S. (2009). Learning to detect unseen object classes by between-class attribute transfer. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Li, L., Su, H., Xing, E., & Fei-Fei, L. (2010). Object bank: A high-level image representation for scene classification and semantic feature sparsification. In Advances in neural information processing systems (NIPS). Li, L., Su, H., Xing, E., & Fei-Fei, L. (2010). Object bank: A high-level image representation for scene classification and semantic feature sparsification. In Advances in neural information processing systems (NIPS).
Zurück zum Zitat Li, Q., Sun, Z., He, R., & Tan, T. (2017). Deep supervised discrete hashing. In Advances in neural information processing systems (NIPS). Li, Q., Sun, Z., He, R., & Tan, T. (2017). Deep supervised discrete hashing. In Advances in neural information processing systems (NIPS).
Zurück zum Zitat Li, W. J., Wang, S., & Kang, W. C. (2016). Feature learning based deep supervised hashing with pairwise labels. In AAAI conference on artificial intelligence. Li, W. J., Wang, S., & Kang, W. C. (2016). Feature learning based deep supervised hashing with pairwise labels. In AAAI conference on artificial intelligence.
Zurück zum Zitat Lin, K., Lu, J., Chen, C. S., & Zhou, J. (2016). Learning compact binary descriptors with unsupervised deep neural networks. In IEEE conference on computer vision and pattern recognition (CVPR). Lin, K., Lu, J., Chen, C. S., & Zhou, J. (2016). Learning compact binary descriptors with unsupervised deep neural networks. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Lin, K., Yang, H. F., Hsiao, J. H., & Chen, C.S. (2015) Deep learning of binary hash codes for fast image retrieval. In IEEE conference on Computer vision and pattern recognition (workshops). Lin, K., Yang, H. F., Hsiao, J. H., & Chen, C.S. (2015) Deep learning of binary hash codes for fast image retrieval. In IEEE conference on Computer vision and pattern recognition (workshops).
Zurück zum Zitat Liong, E., Lu, J., Wang, G., Moulin, P., & Zhou, J. (2015). Deep hashing for compact binary codes learning. In IEEE conference on computer vision and pattern recognition (CVPR). Liong, E., Lu, J., Wang, G., Moulin, P., & Zhou, J. (2015). Deep hashing for compact binary codes learning. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Liu, H., Wang, R., Shan, S., & Chen, X. (2016). Deep supervised hashing for fast image retrieval. In IEEE conference on computer vision and pattern recognition (CVPR). Liu, H., Wang, R., Shan, S., & Chen, X. (2016). Deep supervised hashing for fast image retrieval. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Liu, W., Wang, J., Ji, R., Jiang, Y. G., & Chang, S. F. (2012). Supervised hashing with kernels. In IEEE conference on computer vision and pattern recognition (CVPR). Liu, W., Wang, J., Ji, R., Jiang, Y. G., & Chang, S. F. (2012). Supervised hashing with kernels. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Lu, J., Liong, V. E., & Zhou, J. (2017). Deep hashing for scalable image search. IEEE Transactions on Image Processing (TIP), 26(5), 2352–2367.MathSciNetCrossRef Lu, J., Liong, V. E., & Zhou, J. (2017). Deep hashing for scalable image search. IEEE Transactions on Image Processing (TIP), 26(5), 2352–2367.MathSciNetCrossRef
Zurück zum Zitat Morgado, P., & Vasconcelos, N. (2017). Semantically consistent regularization for zero-shot recognition. In IEEE conference on computer vision and pattern recognition (CVPR). Morgado, P., & Vasconcelos, N. (2017). Semantically consistent regularization for zero-shot recognition. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Movshovitz-Attias, Y., Toshev, A., Leung, T. K., Ioffe, S., & Singh, S. (2017). No fuss distance metric learning using proxies. In International conference on computer vision (ICCV). Movshovitz-Attias, Y., Toshev, A., Leung, T. K., Ioffe, S., & Singh, S. (2017). No fuss distance metric learning using proxies. In International conference on computer vision (ICCV).
Zurück zum Zitat Mu, Y., & Yan, S. (2010). Non-metric locality-sensitive hashing. In AAAI conference on artificial intelligence. Mu, Y., & Yan, S. (2010). Non-metric locality-sensitive hashing. In AAAI conference on artificial intelligence.
Zurück zum Zitat Nelder, J. A., & Wedderburn, R. W. (1972). Generalized linear models. Journal of the Royal Statistical Society: Series A (General), 135(3), 370–384.CrossRef Nelder, J. A., & Wedderburn, R. W. (1972). Generalized linear models. Journal of the Royal Statistical Society: Series A (General), 135(3), 370–384.CrossRef
Zurück zum Zitat Norouzi, M., & Blei, D. M. (2011). Minimal loss hashing for compact binary codes. In International conference on machine learning (ICML). Norouzi, M., & Blei, D. M. (2011). Minimal loss hashing for compact binary codes. In International conference  on machine learning (ICML).
Zurück zum Zitat Oh Song, H., Xiang, Y., Jegelka, S., & Savarese, S. (2016). Deep metric learning via lifted structured feature embedding. In IEEE conference on computer vision and pattern recognition (CVPR) Oh Song, H., Xiang, Y., Jegelka, S., & Savarese, S. (2016). Deep metric learning via lifted structured feature embedding. In IEEE conference on computer vision and pattern recognition (CVPR)
Zurück zum Zitat Pereira, J. C., & Vasconcelos, N. (2014). Cross-modal domain adaptation for text-based regularization of image semantics in image retrieval systems. In Computer Vision and Image Understanding (Vol. 124). Pereira, J. C., & Vasconcelos, N. (2014). Cross-modal domain adaptation for text-based regularization of image semantics in image retrieval systems. In Computer Vision and Image Understanding (Vol. 124).
Zurück zum Zitat Rasiwasia, N., Moreno, P., & Vasconcelos, N. (2007). Bridging the gap: Query by semantic example. IEEE Transactions on Multimedia, 9(5), 923–938.CrossRef Rasiwasia, N., Moreno, P., & Vasconcelos, N. (2007). Bridging the gap: Query by semantic example. IEEE Transactions on Multimedia, 9(5), 923–938.CrossRef
Zurück zum Zitat Rohrbach, M., Stark, M., & Schiele, B. (2011). Evaluating knowledge transfer and zero-shot learning in a large-scale setting. In IEEE conference on computer vision and pattern recognition (CVPR). Rohrbach, M., Stark, M., & Schiele, B. (2011). Evaluating knowledge transfer and zero-shot learning in a large-scale setting. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Sablayrolles, A., Douze, M., Usunier, N., & Jégou, H. (2017). How should we evaluate supervised hashing? In IEEE International conference on acoustics, speech and signal processing (ICASSP). Sablayrolles, A., Douze, M., Usunier, N., & Jégou, H. (2017). How should we evaluate supervised hashing? In IEEE International conference on acoustics, speech and signal processing (ICASSP).
Zurück zum Zitat Schroff, F., Kalenichenko, D., & Philbin, J. Facenet (2015). A unified embedding for face recognition and clustering. In IEEE conference on computer vision and pattern recognition (CVPR). Schroff, F., Kalenichenko, D., & Philbin, J. Facenet (2015). A unified embedding for face recognition and clustering. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Shen, F., Shen, C., Liu, W., & Tao Shen, H. (2015). Supervised discrete hashing. In IEEE conference on computer vision and pattern recognition (CVPR) Shen, F., Shen, C., Liu, W., & Tao Shen, H. (2015). Supervised discrete hashing. In IEEE conference on computer vision and pattern recognition (CVPR)
Zurück zum Zitat Smeulders, A., Worring, M., Santini, S., Gupta, A., & Jain, R. (2000). Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 22(12), 1349–1380.CrossRef Smeulders, A., Worring, M., Santini, S., Gupta, A., & Jain, R. (2000). Content-based image retrieval at the end of the early years. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 22(12), 1349–1380.CrossRef
Zurück zum Zitat Smith, J. R., & Chang, S. F. (1997). Visualseek: A fully automated content-based image query system. In ACM international conference on multimedia. Smith, J. R., & Chang, S. F. (1997). Visualseek: A fully automated content-based image query system. In ACM international conference on multimedia.
Zurück zum Zitat Sohn, K. (2016). Improved deep metric learning with multi-class n-pair loss objective. In Advances in neural information processing systems (NIPS) Sohn, K. (2016). Improved deep metric learning with multi-class n-pair loss objective. In Advances in neural information processing systems (NIPS)
Zurück zum Zitat Song, H. O., Jegelka, S., Rathod, V., & Murphy, K. (2016). Learnable structured clustering framework for deep metric learning. arXiv preprint arXiv:1612.01213. Song, H. O., Jegelka, S., Rathod, V., & Murphy, K. (2016). Learnable structured clustering framework for deep metric learning. arXiv preprint arXiv:​1612.​01213.
Zurück zum Zitat Sun, Y., Chen, Y., Wang, X., & Tang, X. (2014). Deep learning face representation by joint identification-verification. In Advances in neural information processing systems (NIPS). Sun, Y., Chen, Y., Wang, X., & Tang, X. (2014). Deep learning face representation by joint identification-verification. In Advances in neural information processing systems (NIPS).
Zurück zum Zitat Tammes, P. (1930). On the origin of number and arrangement of the places of exit on the surface of pollen-grains. Ph.D. thesis, University of Groningen. Tammes, P. (1930). On the origin of number and arrangement of the places of exit on the surface of pollen-grains. Ph.D. thesis, University of Groningen.
Zurück zum Zitat Torresani, L., Szummer, M., & Fitzgibbon, A. (2010). Efficient object category recognition using classemes. In European conference on computer vision (ECCV). Torresani, L., Szummer, M., & Fitzgibbon, A. (2010). Efficient object category recognition using classemes. In European conference on computer vision (ECCV).
Zurück zum Zitat Wang, J., Kumar, S., & Chang, S. F. (2010). Semi-supervised hashing for scalable image retrieval. In IEEE conference on computer vision and pattern recognition (CVPR). Wang, J., Kumar, S., & Chang, S. F. (2010). Semi-supervised hashing for scalable image retrieval. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Wang, J., Song, Y., Leung, T., Rosenberg, C., Wang, J., Philbin, J., Chen, B., & Wu, Y. (2014). Learning fine-grained image similarity with deep ranking. In IEEE conference on computer vision and pattern recognition (CVPR). Wang, J., Song, Y., Leung, T., Rosenberg, C., Wang, J., Philbin, J., Chen, B., & Wu, Y. (2014). Learning fine-grained image similarity with deep ranking. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Wang, X., Shi, Y., & Kitani, K. M. (2016). Deep supervised hashing with triplet labels. In Asian conference on computer vision (ACCV). Wang, X., Shi, Y., & Kitani, K. M. (2016). Deep supervised hashing with triplet labels. In Asian conference on computer vision (ACCV).
Zurück zum Zitat Weinberger, K. Q., & Saul, L. K. (2009). Distance metric learning for large margin nearest neighbor classification. Journal of Machine Learning Research, 10(9), 207–244.MATH Weinberger, K. Q., & Saul, L. K. (2009). Distance metric learning for large margin nearest neighbor classification. Journal of Machine Learning Research, 10(9), 207–244.MATH
Zurück zum Zitat Weiss, Y., Torralba, A., & Fergus, R. (2009). Spectral hashing. In Advances in Neural Information Processing Systems (NIPS). Weiss, Y., Torralba, A., & Fergus, R. (2009). Spectral hashing. In Advances in Neural Information Processing Systems (NIPS).
Zurück zum Zitat Wen, Y., Zhang, K., Li, Z., & Qiao, Y. (2016). A discriminative feature learning approach for deep face recognition. In European conference on computer vision (ECCV) (pp. 499–515). Berlin: Springer. Wen, Y., Zhang, K., Li, Z., & Qiao, Y. (2016). A discriminative feature learning approach for deep face recognition. In European conference on computer vision (ECCV) (pp. 499–515). Berlin: Springer.
Zurück zum Zitat Wright, S., & Nocedal, J. (1999). Numerical optimization. Science, 35, 566.MATH Wright, S., & Nocedal, J. (1999). Numerical optimization. Science, 35, 566.MATH
Zurück zum Zitat Xia, R., Pan, Y., Lai, H., Liu, C., & Yan, S. (2014). Supervised hashing for image retrieval via image representation learning. In AAAI conference on artificial intelligence Xia, R., Pan, Y., Lai, H., Liu, C., & Yan, S. (2014). Supervised hashing for image retrieval via image representation learning. In AAAI conference on artificial intelligence
Zurück zum Zitat Xie, S., & Tu, Z. (2015). Holistically-nested edge detection. In: Proceedings of the IEEE international conference on computer vision (pp. 1395–1403). Xie, S., & Tu, Z. (2015). Holistically-nested edge detection. In: Proceedings of the IEEE international conference on computer vision (pp. 1395–1403).
Zurück zum Zitat Yang, H. F., Lin, K., & Chen, C. S. (2017). Supervised learning of semantics-preserving hash via deep convolutional neural networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(2), 437–451.CrossRef Yang, H. F., Lin, K., & Chen, C. S. (2017). Supervised learning of semantics-preserving hash via deep convolutional neural networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 40(2), 437–451.CrossRef
Zurück zum Zitat Zhang, D., Wu, W., Cheng, H., Zhang, R., Dong, Z., & Cai, Z. (2017). Image-to-video person re-identification with temporally memorized similarity learning. IEEE Transactions on Circuits and Systems for Video Technology, 28(10), 2622–2632.CrossRef Zhang, D., Wu, W., Cheng, H., Zhang, R., Dong, Z., & Cai, Z. (2017). Image-to-video person re-identification with temporally memorized similarity learning. IEEE Transactions on Circuits and Systems for Video Technology, 28(10), 2622–2632.CrossRef
Zurück zum Zitat Zhang, J., Peng, Y., & Zhang, J. (2016). Query-adaptive image retrieval by deep weighted hashing. IEEE Transactions on Multimedia. Zhang, J., Peng, Y., & Zhang, J. (2016). Query-adaptive image retrieval by deep weighted hashing. IEEE Transactions on Multimedia.
Zurück zum Zitat Zhang, R., Li, J., Sun, H., Ge, Y., Luo, P., Wang, X., et al. (2019). Scan: Self-and-collaborative attention network for video person re-identification. IEEE Transactions on Image Processing, 28(10), 4870–4882.MathSciNetCrossRef Zhang, R., Li, J., Sun, H., Ge, Y., Luo, P., Wang, X., et al. (2019). Scan: Self-and-collaborative attention network for video person re-identification. IEEE Transactions on Image Processing, 28(10), 4870–4882.MathSciNetCrossRef
Zurück zum Zitat Zhang, R., Lin, L., Zhang, R., Zuo, W., & Zhang, L. (2015). Bit-scalable deep hashing with regularized similarity learning for image retrieval and person re-identification. IEEE Transactions on Image Processing, 24(12), 4766–4779.MathSciNetCrossRef Zhang, R., Lin, L., Zhang, R., Zuo, W., & Zhang, L. (2015). Bit-scalable deep hashing with regularized similarity learning for image retrieval and person re-identification. IEEE Transactions on Image Processing, 24(12), 4766–4779.MathSciNetCrossRef
Zurück zum Zitat Zhao, F., Huang, Y., Wang, L., & Tan, T. (2015). Deep semantic ranking based hashing for multi-label image retrieval. In IEEE conference on computer vision and pattern recognition (CVPR). Zhao, F., Huang, Y., Wang, L., & Tan, T. (2015). Deep semantic ranking based hashing for multi-label image retrieval. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Zhong, G., Xu, H., Yang, P., Wang, S., & Dong, J. (2016). Deep hashing learning networks. In International joint conference on neural networks (IJCNN). Zhong, G., Xu, H., Yang, P., Wang, S., & Dong, J. (2016). Deep hashing learning networks. In International joint conference on neural networks (IJCNN).
Zurück zum Zitat Zhu, H., Long, M., Wang, J., & Cao, Y. (2016). Deep hashing network for efficient similarity retrieval. In AAAI conference on artificial intelligence. Zhu, H., Long, M., Wang, J., & Cao, Y. (2016). Deep hashing network for efficient similarity retrieval. In AAAI conference on artificial intelligence.
Metadaten
Titel
Deep Hashing with Hash-Consistent Large Margin Proxy Embeddings
verfasst von
Pedro Morgado
Yunsheng Li
Jose Costa Pereira
Mohammad Saberian
Nuno Vasconcelos
Publikationsdatum
26.09.2020
Verlag
Springer US
Erschienen in
International Journal of Computer Vision / Ausgabe 2/2021
Print ISSN: 0920-5691
Elektronische ISSN: 1573-1405
DOI
https://doi.org/10.1007/s11263-020-01362-7

Weitere Artikel der Ausgabe 2/2021

International Journal of Computer Vision 2/2021 Zur Ausgabe