Skip to main content
Erschienen in: International Journal of Computer Vision 8-9/2020

06.05.2020

Hadamard Matrix Guided Online Hashing

verfasst von: Mingbao Lin, Rongrong Ji, Hong Liu, Xiaoshuai Sun, Shen Chen, Qi Tian

Erschienen in: International Journal of Computer Vision | Ausgabe 8-9/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Online image hashing has attracted increasing research attention recently, which receives large-scale data in a streaming manner to update the hash functions on-the-fly. Its key challenge lies in the difficulty of balancing the learning timeliness and model accuracy. To this end, most works follow a supervised setting, i.e., using class labels to boost the hashing performance, which defects in two aspects: first, strong constraints, e.g., orthogonal or similarity preserving, are used, which however are typically relaxed and lead to large accuracy drops. Second, large amounts of training batches are required to learn the up-to-date hash functions, which largely increase the learning complexity. To handle the above challenges, a novel supervised online hashing scheme termed Hadamard Matrix Guided Online Hashing (HMOH) is proposed in this paper. Our key innovation lies in introducing Hadamard matrix, which is an orthogonal binary matrix built via Sylvester method. In particular, to release the need of strong constraints, we regard each column of Hadamard matrix as the target code for each class label, which by nature satisfies several desired properties of hashing codes. To accelerate the online training, LSH is first adopted to align the lengths of target code and to-be-learned binary code. We then treat the learning of hash functions as a set of binary classification problems to fit the assigned target code. Finally, extensive experiments on four widely-used benchmarks demonstrate the superior accuracy and efficiency of HMOH over various state-of-the-art methods. Codes can be available at https://​github.​com/​lmbxmu/​mycode.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Fußnoten
1
Our test with 32-bit on CIFAR-10 shows that classification has less averaged quantization error of 2.861 than regression of 4.543.
 
2
Take the Places205 dataset as an example: There are in total 205 categories. According to Eq. 10, \(r^* = 256\) for the code length r varying from 8 to 128.
 
3
When \(r^* = r\), we set \(\tilde{{\mathbf {W}}}\) as an identity matrix and the above equation still holds.
 
4
Since it is just a matrix-addition operation at each stage.
 
5
\({\tilde{W}}\) is a random matrix that need not be optimized. When \(r=r^*\), we set \({\tilde{W}}\) as an identity matrix
 
Literatur
Zurück zum Zitat Babenko, B., Yang, MH., & Belongie, S. (2009). A family of online boosting algorithms. In International conference on computer vision (ICCV Workshops) (pp. 1346–1353). Babenko, B., Yang, MH., & Belongie, S. (2009). A family of online boosting algorithms. In International conference on computer vision (ICCV Workshops) (pp. 1346–1353).
Zurück zum Zitat Cakir, F., & Sclaroff, S. (2015). Adaptive hashing for fast similarity search. In International conference on computer vision (ICCV) (pp. 1044–1052). Cakir, F., & Sclaroff, S. (2015). Adaptive hashing for fast similarity search. In International conference on computer vision (ICCV) (pp. 1044–1052).
Zurück zum Zitat Cakir, F., Bargal, S. A., & Sclaroff, S. (2017a). Online supervised hashing. Computer Vision and Image Understanding (CVIU), 156, 162–173.CrossRef Cakir, F., Bargal, S. A., & Sclaroff, S. (2017a). Online supervised hashing. Computer Vision and Image Understanding (CVIU), 156, 162–173.CrossRef
Zurück zum Zitat Cakir, F., He, K., Adel Bargal, S., & Sclaroff, S. (2017b). Mihash: Online hashing with mutual information. In International conference on computer vision (ICCV) (pp. 437–445). Cakir, F., He, K., Adel Bargal, S., & Sclaroff, S. (2017b). Mihash: Online hashing with mutual information. In International conference on computer vision (ICCV) (pp. 437–445).
Zurück zum Zitat Chen, X., King, I., & Lyu, MR. (2017). Frosh: Faster online sketching hashing. In The conference on uncertainty in intelligence. Chen, X., King, I., & Lyu, MR. (2017). Frosh: Faster online sketching hashing. In The conference on uncertainty in intelligence.
Zurück zum Zitat Chua, T. S., Tang, J., Hong, R., Li, H., Luo, Z., & Zheng, Y. (2009). Nus-wide: A real-world web image database from national university of Singapore. In International conference on image and video retrieval (CIVR) (pp. 1–9). Chua, T. S., Tang, J., Hong, R., Li, H., Luo, Z., & Zheng, Y. (2009). Nus-wide: A real-world web image database from national university of Singapore. In International conference on image and video retrieval (CIVR) (pp. 1–9).
Zurück zum Zitat Cover, T. M., & Thomas, J. A. (2012). Elements of information theory. Hoboken: Wiley.MATH Cover, T. M., & Thomas, J. A. (2012). Elements of information theory. Hoboken: Wiley.MATH
Zurück zum Zitat Crammer, K., Dekel, O., Keshet, J., Shalev-Shwartz, S., & Singer, Y. (2006). Online passive–aggressive algorithms. Journal of Machine Learning Research (JMLR), 7, 551–585.MathSciNetMATH Crammer, K., Dekel, O., Keshet, J., Shalev-Shwartz, S., & Singer, Y. (2006). Online passive–aggressive algorithms. Journal of Machine Learning Research (JMLR), 7, 551–585.MathSciNetMATH
Zurück zum Zitat Datar, M., Immorlica, N., Indyk, P., & Mirrokni, VS. (2004). Locality-sensitive hashing scheme based on p-stable distributions. In Symposium on computational geometry (SoCG) (pp. 253–262). Datar, M., Immorlica, N., Indyk, P., & Mirrokni, VS. (2004). Locality-sensitive hashing scheme based on p-stable distributions. In Symposium on computational geometry (SoCG) (pp. 253–262).
Zurück zum Zitat Deng, C., Yang, E., Liu, T., Li, J., Liu, W., & Tao, D. (2019). Unsupervised semantic-preserving adversarial hashing for image search. IEEE Transactions on Image Processing (TIP), 28, 4032–4044.MathSciNetCrossRef Deng, C., Yang, E., Liu, T., Li, J., Liu, W., & Tao, D. (2019). Unsupervised semantic-preserving adversarial hashing for image search. IEEE Transactions on Image Processing (TIP), 28, 4032–4044.MathSciNetCrossRef
Zurück zum Zitat Deng, J., Dong, W., Socher, R., Li, L. J., Li, K., & Fei-Fei, L. (2009). Imagenet: A large-scale hierarchical image database. In Computer vision and pattern recognition (CVPR) (pp. 248–255). Deng, J., Dong, W., Socher, R., Li, L. J., Li, K., & Fei-Fei, L. (2009). Imagenet: A large-scale hierarchical image database. In Computer vision and pattern recognition (CVPR) (pp. 248–255).
Zurück zum Zitat Freund, Y., & Schapire, R. E. (1999). Large margin classification using the perceptron algorithm. Machine Learning (ML), 37, 277–296.CrossRef Freund, Y., & Schapire, R. E. (1999). Large margin classification using the perceptron algorithm. Machine Learning (ML), 37, 277–296.CrossRef
Zurück zum Zitat Gionis, A., Indyk, P., Motwani, R., et al. (1999). Similarity search in high dimensions via hashing. Very Large Data Bases Conferences (VLDB), 99, 518–529. Gionis, A., Indyk, P., Motwani, R., et al. (1999). Similarity search in high dimensions via hashing. Very Large Data Bases Conferences (VLDB), 99, 518–529.
Zurück zum Zitat Goldberg, K. (1966). Hadamard matrices of order cube plus one. Transactions of the American Mathematical Society (AMS), 17, 744–746.MathSciNetMATH Goldberg, K. (1966). Hadamard matrices of order cube plus one. Transactions of the American Mathematical Society (AMS), 17, 744–746.MathSciNetMATH
Zurück zum Zitat Gong, Y., Lazebnik, S., Gordo, A., & Perronnin, F. (2012). Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 35, 2916–2929.CrossRef Gong, Y., Lazebnik, S., Gordo, A., & Perronnin, F. (2012). Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 35, 2916–2929.CrossRef
Zurück zum Zitat Gui, J., Liu, T., Sun, Z., Tao, D., & Tan, T. (2017). Fast supervised discrete hashing. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 40, 490–496.CrossRef Gui, J., Liu, T., Sun, Z., Tao, D., & Tan, T. (2017). Fast supervised discrete hashing. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 40, 490–496.CrossRef
Zurück zum Zitat Horadam, K. J. (2012). Hadamard matrices and their applications. Princeton: Princeton University Press.MATH Horadam, K. J. (2012). Hadamard matrices and their applications. Princeton: Princeton University Press.MATH
Zurück zum Zitat Huang, L. K., Yang, Q., & Zheng, W. S. (2013). Online hashing. In International Joint Conference on Artificial Intelligence (IJCAI) (pp. 1422–1428). Huang, L. K., Yang, Q., & Zheng, W. S. (2013). Online hashing. In International Joint Conference on Artificial Intelligence (IJCAI) (pp. 1422–1428).
Zurück zum Zitat Huang, L. K., Yang, Q., & Zheng, W. S. (2017). Online hashing. IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 29, 2309–2322.MathSciNetCrossRef Huang, L. K., Yang, Q., & Zheng, W. S. (2017). Online hashing. IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 29, 2309–2322.MathSciNetCrossRef
Zurück zum Zitat Jiang, J., & Tu, Z. (2009). Efficient scale space auto-context for image segmentation and labeling. In Computer vision and pattern recognition (CVPR) (pp. 1810–1817). Jiang, J., & Tu, Z. (2009). Efficient scale space auto-context for image segmentation and labeling. In Computer vision and pattern recognition (CVPR) (pp. 1810–1817).
Zurück zum Zitat Kittler, J., Ghaderi, R., Windeatt, T., & Matas, J. (2001). Face verification using error correcting output codes. Computer Vision and Pattern Recognition (CVPR), 21, 1163–1169.MATH Kittler, J., Ghaderi, R., Windeatt, T., & Matas, J. (2001). Face verification using error correcting output codes. Computer Vision and Pattern Recognition (CVPR), 21, 1163–1169.MATH
Zurück zum Zitat Krizhevsky, A., & Hinton, G. (2009). Learning multiple layers of features from tiny images (p. 1) . Technical Reports on Computer Science Department: University of Toronto. Krizhevsky, A., & Hinton, G. (2009). Learning multiple layers of features from tiny images (p. 1) . Technical Reports on Computer Science Department: University of Toronto.
Zurück zum Zitat Krizhevsky, A., & Sutskever, I., & Hinton, GE. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (NeurIPS) (pp. 1097–1105). Krizhevsky, A., & Sutskever, I., & Hinton, GE. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (NeurIPS) (pp. 1097–1105).
Zurück zum Zitat LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86, 2278–2324.CrossRef LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86, 2278–2324.CrossRef
Zurück zum Zitat Leng, C., Wu, J., Cheng, J., Bai, X., & Lu, H. (2015). Online sketching hashing. In Computer vision and pattern recognition (pp. 2503–2511). Leng, C., Wu, J., Cheng, J., Bai, X., & Lu, H. (2015). Online sketching hashing. In Computer vision and pattern recognition (pp. 2503–2511).
Zurück zum Zitat Liberty, E. (2013). Simple and deterministic matrix sketching. In ACM SIGKDD international conference on knowledge discovery and data mining (pp. 581–588). Liberty, E. (2013). Simple and deterministic matrix sketching. In ACM SIGKDD international conference on knowledge discovery and data mining (pp. 581–588).
Zurück zum Zitat Lin, M., Ji, R., Liu, H., & Wu, Y. (2018). Supervised online hashing via hadamard codebook learning. In ACM international conference on multimedia (ACM MM) (pp. 1635–1643). Lin, M., Ji, R., Liu, H., & Wu, Y. (2018). Supervised online hashing via hadamard codebook learning. In ACM international conference on multimedia (ACM MM) (pp. 1635–1643).
Zurück zum Zitat Lin, M., Ji, R., Liu, H., Sun, X., Wu, Y., & Wu, Y. (2019). Towards optimal discrete online hashing with balanced similarity. In The AAAI conference on artificial intelligence (AAAI) (pp. 8722–8729). Lin, M., Ji, R., Liu, H., Sun, X., Wu, Y., & Wu, Y. (2019). Towards optimal discrete online hashing with balanced similarity. In The AAAI conference on artificial intelligence (AAAI) (pp. 8722–8729).
Zurück zum Zitat Liu, H., Lin, M., Zhang, S., Wu, Y., Huang, F., & Ji, R. (2018). Dense auto-encoder hashing for robust cross-modality retrieval. In ACM international conference on multimedia (ACM MM) (pp. 1589–1597). Liu, H., Lin, M., Zhang, S., Wu, Y., Huang, F., & Ji, R. (2018). Dense auto-encoder hashing for robust cross-modality retrieval. In ACM international conference on multimedia (ACM MM) (pp. 1589–1597).
Zurück zum Zitat Liu, W., Wang, J., Ji, R., Jiang, Y. G., & Chang, S. F. (2012). Supervised hashing with kernels. In Computer vision and pattern recognition (CVPR) (pp. 2074–2081). Liu, W., Wang, J., Ji, R., Jiang, Y. G., & Chang, S. F. (2012). Supervised hashing with kernels. In Computer vision and pattern recognition (CVPR) (pp. 2074–2081).
Zurück zum Zitat Liu, W., Mu, C., Kumar, S., & Chang, S. F. (2014). Discrete graph hashing. In Advances in neural information processing systems (NeurIPS) (pp. 3419–3427). Liu, W., Mu, C., Kumar, S., & Chang, S. F. (2014). Discrete graph hashing. In Advances in neural information processing systems (NeurIPS) (pp. 3419–3427).
Zurück zum Zitat Lu, Y., Dhillon, P., Foster, D. P., & Ungar, L. (2013). Faster ridge regression via the subsampled randomized hadamard transform. In Advances in neural information processing systems (NeurIPS) (pp. 369–377). Lu, Y., Dhillon, P., Foster, D. P., & Ungar, L. (2013). Faster ridge regression via the subsampled randomized hadamard transform. In Advances in neural information processing systems (NeurIPS) (pp. 369–377).
Zurück zum Zitat Norouzi, M., & Blei, D. M. (2011). Minimal loss hashing for compact binary codes. In International conference on machine learning (ICML). Norouzi, M., & Blei, D. M. (2011). Minimal loss hashing for compact binary codes. In International conference on machine learning (ICML).
Zurück zum Zitat Novikoff, A. B. (1963). On convergence proofs for perceptrons. Technical reports on Stanford Research Inst, Menlo Park, CA. Novikoff, A. B. (1963). On convergence proofs for perceptrons. Technical reports on Stanford Research Inst, Menlo Park, CA.
Zurück zum Zitat Ockwig, N. W., Delgado-Friedrichs, O., O’Keeffe, M., & Yaghi, O. M. (2005). Reticular chemistry: Occurrence and taxonomy of nets and grammar for the design of frameworks. Accounts of Chemical Research, 38, 176–182.CrossRef Ockwig, N. W., Delgado-Friedrichs, O., O’Keeffe, M., & Yaghi, O. M. (2005). Reticular chemistry: Occurrence and taxonomy of nets and grammar for the design of frameworks. Accounts of Chemical Research, 38, 176–182.CrossRef
Zurück zum Zitat Paley, R. E. (1933). On orthogonal matrices. Journal of Mathematics and Physics, 12, 311–320.CrossRef Paley, R. E. (1933). On orthogonal matrices. Journal of Mathematics and Physics, 12, 311–320.CrossRef
Zurück zum Zitat Peterson, W. W., Peterson, W., Weldon, E., & Weldon, E. (1972). Error-correcting codes. Cambridge: MIT Press.MATH Peterson, W. W., Peterson, W., Weldon, E., & Weldon, E. (1972). Error-correcting codes. Cambridge: MIT Press.MATH
Zurück zum Zitat Sablayrolles, A., Douze, M., Usunier, N., & Jégou, H. (2017). How should we evaluate supervised hashing? In International conference on acoustics, speech and signal processing (ICASSP) (pp. 1732–1736) Sablayrolles, A., Douze, M., Usunier, N., & Jégou, H. (2017). How should we evaluate supervised hashing? In International conference on acoustics, speech and signal processing (ICASSP) (pp. 1732–1736)
Zurück zum Zitat Schapire, R. E. (1997). Using output codes to boost multiclass learning problems. International Conference on Machine Learning (ICML), 97, 313–321. Schapire, R. E. (1997). Using output codes to boost multiclass learning problems. International Conference on Machine Learning (ICML), 97, 313–321.
Zurück zum Zitat Shen, F., Shen, C., Liu, W., & Tao Shen, H. (2015). Supervised discrete hashing. In Computer vision and pattern recognition (pp. 37–45). Shen, F., Shen, C., Liu, W., & Tao Shen, H. (2015). Supervised discrete hashing. In Computer vision and pattern recognition (pp. 37–45).
Zurück zum Zitat Sylvester, J. J. (1867). Lx. thoughts on inverse orthogonal matrices, simultaneous signsuccessions, and tessellated pavements in two or more colours, with applications to newton’s rule, ornamental tile-work, and the theory of numbers. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science, 34, 461–475.CrossRef Sylvester, J. J. (1867). Lx. thoughts on inverse orthogonal matrices, simultaneous signsuccessions, and tessellated pavements in two or more colours, with applications to newton’s rule, ornamental tile-work, and the theory of numbers. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science, 34, 461–475.CrossRef
Zurück zum Zitat Wang, J., Kumar, S., & Chang, S. F. (2010). Semi-supervised hashing for scalable image retrieval. In Computer vision and pattern recognition (CVPR) (pp. 3424–3431). Wang, J., Kumar, S., & Chang, S. F. (2010). Semi-supervised hashing for scalable image retrieval. In Computer vision and pattern recognition (CVPR) (pp. 3424–3431).
Zurück zum Zitat Wang, J., Zhang, T., Sebe, N., Shen, H. T., et al. (2017). A survey on learning to hash. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 40, 769–790.CrossRef Wang, J., Zhang, T., Sebe, N., Shen, H. T., et al. (2017). A survey on learning to hash. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 40, 769–790.CrossRef
Zurück zum Zitat Weiss, Y., Torralba, A., & Fergus, R. (2009). Spectral hashing. In Advances in neural information processing systems (NeurIPS) (pp. 1753–1760). Weiss, Y., Torralba, A., & Fergus, R. (2009). Spectral hashing. In Advances in neural information processing systems (NeurIPS) (pp. 1753–1760).
Zurück zum Zitat Williamson, J., et al. (1944). Hadamard’s determinant theorem and the sum of four squares. Duke Mathematical Journal, 11, 65–81.MathSciNetCrossRef Williamson, J., et al. (1944). Hadamard’s determinant theorem and the sum of four squares. Duke Mathematical Journal, 11, 65–81.MathSciNetCrossRef
Zurück zum Zitat Yang, E., Deng, C., Li, C., Liu, W., Li, J., & Tao, D. (2018). Shared predictive cross-modal deep quantization. IEEE Transactions on Neural Networks and Learning Systems, 29, 5292–5303.CrossRef Yang, E., Deng, C., Li, C., Liu, W., Li, J., & Tao, D. (2018). Shared predictive cross-modal deep quantization. IEEE Transactions on Neural Networks and Learning Systems, 29, 5292–5303.CrossRef
Zurück zum Zitat Zhao, B., & Xing, E. P. (2013). Sparse output coding for large-scale visual recognition. In Computer vision and pattern recognition (CVPR) (pp. 3350–3357). Zhao, B., & Xing, E. P. (2013). Sparse output coding for large-scale visual recognition. In Computer vision and pattern recognition (CVPR) (pp. 3350–3357).
Zurück zum Zitat Zhou, B., Lapedriza, A., Xiao, J., Torralba, A., & Oliva, A. (2014a). Learning deep features for scene recognition using places database. In Advances in neural information processing systems (NeurIPS) (pp. 487–495). Zhou, B., Lapedriza, A., Xiao, J., Torralba, A., & Oliva, A. (2014a). Learning deep features for scene recognition using places database. In Advances in neural information processing systems (NeurIPS) (pp. 487–495).
Zurück zum Zitat Zhou, J., Ding, G., & Guo, Y. (2014b). Latent semantic sparse hashing for cross-modal similarity search. In International ACM SIGIR conference on research and development in information retrieval (pp. 415–424). Zhou, J., Ding, G., & Guo, Y. (2014b). Latent semantic sparse hashing for cross-modal similarity search. In International ACM SIGIR conference on research and development in information retrieval (pp. 415–424).
Metadaten
Titel
Hadamard Matrix Guided Online Hashing
verfasst von
Mingbao Lin
Rongrong Ji
Hong Liu
Xiaoshuai Sun
Shen Chen
Qi Tian
Publikationsdatum
06.05.2020
Verlag
Springer US
Erschienen in
International Journal of Computer Vision / Ausgabe 8-9/2020
Print ISSN: 0920-5691
Elektronische ISSN: 1573-1405
DOI
https://doi.org/10.1007/s11263-020-01332-z

Weitere Artikel der Ausgabe 8-9/2020

International Journal of Computer Vision 8-9/2020 Zur Ausgabe