nach oben

International Journal of Computer Vision

Erschienen in:

06.05.2020

Hadamard Matrix Guided Online Hashing

verfasst von: Mingbao Lin, Rongrong Ji, Hong Liu, Xiaoshuai Sun, Shen Chen, Qi Tian

Erschienen in: International Journal of Computer Vision | Ausgabe 8-9/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Online image hashing has attracted increasing research attention recently, which receives large-scale data in a streaming manner to update the hash functions on-the-fly. Its key challenge lies in the difficulty of balancing the learning timeliness and model accuracy. To this end, most works follow a supervised setting, i.e., using class labels to boost the hashing performance, which defects in two aspects: first, strong constraints, e.g., orthogonal or similarity preserving, are used, which however are typically relaxed and lead to large accuracy drops. Second, large amounts of training batches are required to learn the up-to-date hash functions, which largely increase the learning complexity. To handle the above challenges, a novel supervised online hashing scheme termed Hadamard Matrix Guided Online Hashing (HMOH) is proposed in this paper. Our key innovation lies in introducing Hadamard matrix, which is an orthogonal binary matrix built via Sylvester method. In particular, to release the need of strong constraints, we regard each column of Hadamard matrix as the target code for each class label, which by nature satisfies several desired properties of hashing codes. To accelerate the online training, LSH is first adopted to align the lengths of target code and to-be-learned binary code. We then treat the learning of hash functions as a set of binary classification problems to fit the assigned target code. Finally, extensive experiments on four widely-used benchmarks demonstrate the superior accuracy and efficiency of HMOH over various state-of-the-art methods. Codes can be available at https://github.com/lmbxmu/mycode.

Vorheriger Artikel Weakly-supervised Semantic Guided Hashing for Social Image Retrieval

Nächster Artikel Anchor-Based Self-Ensembling for Semi-Supervised Deep Pairwise Hashing

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Our test with 32-bit on CIFAR-10 shows that classification has less averaged quantization error of 2.861 than regression of 4.543.

Take the Places205 dataset as an example: There are in total 205 categories. According to Eq. 10, \(r^* = 256\) for the code length r varying from 8 to 128.

When \(r^* = r\), we set \(\tilde{{\mathbf {W}}}\) as an identity matrix and the above equation still holds.

Since it is just a matrix-addition operation at each stage.

\({\tilde{W}}\) is a random matrix that need not be optimized. When \(r=r^*\), we set \({\tilde{W}}\) as an identity matrix

Babenko, B., Yang, MH., & Belongie, S. (2009). A family of online boosting algorithms. In International conference on computer vision (ICCV Workshops) (pp. 1346–1353).

Cakir, F., & Sclaroff, S. (2015). Adaptive hashing for fast similarity search. In International conference on computer vision (ICCV) (pp. 1044–1052).

Cakir, F., Bargal, S. A., & Sclaroff, S. (2017a). Online supervised hashing. Computer Vision and Image Understanding (CVIU), 156, 162–173.CrossRef

Cakir, F., He, K., Adel Bargal, S., & Sclaroff, S. (2017b). Mihash: Online hashing with mutual information. In International conference on computer vision (ICCV) (pp. 437–445).

Chen, X., King, I., & Lyu, MR. (2017). Frosh: Faster online sketching hashing. In The conference on uncertainty in intelligence.

Chua, T. S., Tang, J., Hong, R., Li, H., Luo, Z., & Zheng, Y. (2009). Nus-wide: A real-world web image database from national university of Singapore. In International conference on image and video retrieval (CIVR) (pp. 1–9).

Cover, T. M., & Thomas, J. A. (2012). Elements of information theory. Hoboken: Wiley.MATH

Crammer, K., Dekel, O., Keshet, J., Shalev-Shwartz, S., & Singer, Y. (2006). Online passive–aggressive algorithms. Journal of Machine Learning Research (JMLR), 7, 551–585.MathSciNetMATH

Datar, M., Immorlica, N., Indyk, P., & Mirrokni, VS. (2004). Locality-sensitive hashing scheme based on p-stable distributions. In Symposium on computational geometry (SoCG) (pp. 253–262).

Deng, C., Yang, E., Liu, T., Li, J., Liu, W., & Tao, D. (2019). Unsupervised semantic-preserving adversarial hashing for image search. IEEE Transactions on Image Processing (TIP), 28, 4032–4044.MathSciNetCrossRef

Deng, J., Dong, W., Socher, R., Li, L. J., Li, K., & Fei-Fei, L. (2009). Imagenet: A large-scale hierarchical image database. In Computer vision and pattern recognition (CVPR) (pp. 248–255).

Freund, Y., & Schapire, R. E. (1999). Large margin classification using the perceptron algorithm. Machine Learning (ML), 37, 277–296.CrossRef

Gionis, A., Indyk, P., Motwani, R., et al. (1999). Similarity search in high dimensions via hashing. Very Large Data Bases Conferences (VLDB), 99, 518–529.

Goldberg, K. (1966). Hadamard matrices of order cube plus one. Transactions of the American Mathematical Society (AMS), 17, 744–746.MathSciNetMATH

Gong, Y., Lazebnik, S., Gordo, A., & Perronnin, F. (2012). Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 35, 2916–2929.CrossRef

Gui, J., Liu, T., Sun, Z., Tao, D., & Tan, T. (2017). Fast supervised discrete hashing. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 40, 490–496.CrossRef

Horadam, K. J. (2012). Hadamard matrices and their applications. Princeton: Princeton University Press.MATH

Huang, L. K., Yang, Q., & Zheng, W. S. (2013). Online hashing. In International Joint Conference on Artificial Intelligence (IJCAI) (pp. 1422–1428).

Huang, L. K., Yang, Q., & Zheng, W. S. (2017). Online hashing. IEEE Transactions on Neural Networks and Learning Systems (TNNLS), 29, 2309–2322.MathSciNetCrossRef

Jiang, J., & Tu, Z. (2009). Efficient scale space auto-context for image segmentation and labeling. In Computer vision and pattern recognition (CVPR) (pp. 1810–1817).

Kittler, J., Ghaderi, R., Windeatt, T., & Matas, J. (2001). Face verification using error correcting output codes. Computer Vision and Pattern Recognition (CVPR), 21, 1163–1169.MATH

Krizhevsky, A., & Hinton, G. (2009). Learning multiple layers of features from tiny images (p. 1) . Technical Reports on Computer Science Department: University of Toronto.

Krizhevsky, A., & Sutskever, I., & Hinton, GE. (2012). Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems (NeurIPS) (pp. 1097–1105).

LeCun, Y., Bottou, L., Bengio, Y., & Haffner, P. (1998). Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86, 2278–2324.CrossRef

Leng, C., Wu, J., Cheng, J., Bai, X., & Lu, H. (2015). Online sketching hashing. In Computer vision and pattern recognition (pp. 2503–2511).

Liberty, E. (2013). Simple and deterministic matrix sketching. In ACM SIGKDD international conference on knowledge discovery and data mining (pp. 581–588).

Lin, M., Ji, R., Liu, H., & Wu, Y. (2018). Supervised online hashing via hadamard codebook learning. In ACM international conference on multimedia (ACM MM) (pp. 1635–1643).

Lin, M., Ji, R., Liu, H., Sun, X., Wu, Y., & Wu, Y. (2019). Towards optimal discrete online hashing with balanced similarity. In The AAAI conference on artificial intelligence (AAAI) (pp. 8722–8729).

Liu, H., Lin, M., Zhang, S., Wu, Y., Huang, F., & Ji, R. (2018). Dense auto-encoder hashing for robust cross-modality retrieval. In ACM international conference on multimedia (ACM MM) (pp. 1589–1597).

Liu, W., Wang, J., Ji, R., Jiang, Y. G., & Chang, S. F. (2012). Supervised hashing with kernels. In Computer vision and pattern recognition (CVPR) (pp. 2074–2081).

Liu, W., Mu, C., Kumar, S., & Chang, S. F. (2014). Discrete graph hashing. In Advances in neural information processing systems (NeurIPS) (pp. 3419–3427).

Lu, Y., Dhillon, P., Foster, D. P., & Ungar, L. (2013). Faster ridge regression via the subsampled randomized hadamard transform. In Advances in neural information processing systems (NeurIPS) (pp. 369–377).

Norouzi, M., & Blei, D. M. (2011). Minimal loss hashing for compact binary codes. In International conference on machine learning (ICML).

Novikoff, A. B. (1963). On convergence proofs for perceptrons. Technical reports on Stanford Research Inst, Menlo Park, CA.

Ockwig, N. W., Delgado-Friedrichs, O., O’Keeffe, M., & Yaghi, O. M. (2005). Reticular chemistry: Occurrence and taxonomy of nets and grammar for the design of frameworks. Accounts of Chemical Research, 38, 176–182.CrossRef

Paley, R. E. (1933). On orthogonal matrices. Journal of Mathematics and Physics, 12, 311–320.CrossRef

Peterson, W. W., Peterson, W., Weldon, E., & Weldon, E. (1972). Error-correcting codes. Cambridge: MIT Press.MATH

Sablayrolles, A., Douze, M., Usunier, N., & Jégou, H. (2017). How should we evaluate supervised hashing? In International conference on acoustics, speech and signal processing (ICASSP) (pp. 1732–1736)

Schapire, R. E. (1997). Using output codes to boost multiclass learning problems. International Conference on Machine Learning (ICML), 97, 313–321.

Shen, F., Shen, C., Liu, W., & Tao Shen, H. (2015). Supervised discrete hashing. In Computer vision and pattern recognition (pp. 37–45).

Simonyan, K., & Zisserman, A. (2014). Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556.

Sylvester, J. J. (1867). Lx. thoughts on inverse orthogonal matrices, simultaneous signsuccessions, and tessellated pavements in two or more colours, with applications to newton’s rule, ornamental tile-work, and the theory of numbers. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science, 34, 461–475.CrossRef

Wang, J., Kumar, S., & Chang, S. F. (2010). Semi-supervised hashing for scalable image retrieval. In Computer vision and pattern recognition (CVPR) (pp. 3424–3431).

Wang, J., Zhang, T., Sebe, N., Shen, H. T., et al. (2017). A survey on learning to hash. IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 40, 769–790.CrossRef

Weiss, Y., Torralba, A., & Fergus, R. (2009). Spectral hashing. In Advances in neural information processing systems (NeurIPS) (pp. 1753–1760).

Williamson, J., et al. (1944). Hadamard’s determinant theorem and the sum of four squares. Duke Mathematical Journal, 11, 65–81.MathSciNetCrossRef

Yang, E., Deng, C., Li, C., Liu, W., Li, J., & Tao, D. (2018). Shared predictive cross-modal deep quantization. IEEE Transactions on Neural Networks and Learning Systems, 29, 5292–5303.CrossRef

Zhao, B., & Xing, E. P. (2013). Sparse output coding for large-scale visual recognition. In Computer vision and pattern recognition (CVPR) (pp. 3350–3357).

Zhou, B., Lapedriza, A., Xiao, J., Torralba, A., & Oliva, A. (2014a). Learning deep features for scene recognition using places database. In Advances in neural information processing systems (NeurIPS) (pp. 487–495).

Zhou, J., Ding, G., & Guo, Y. (2014b). Latent semantic sparse hashing for cross-modal similarity search. In International ACM SIGIR conference on research and development in information retrieval (pp. 415–424).

Titel: Hadamard Matrix Guided Online Hashing
verfasst von: Mingbao Lin
Rongrong Ji
Hong Liu
Xiaoshuai Sun
Shen Chen
Qi Tian
Publikationsdatum: 06.05.2020
Verlag: Springer US
Erschienen in: International Journal of Computer Vision / Ausgabe 8-9/2020
Print ISSN: 0920-5691
Elektronische ISSN: 1573-1405
DOI: https://doi.org/10.1007/s11263-020-01332-z

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Weitere Artikel der Ausgabe 8-9/2020

Unified Binary Generative Adversarial Network for Image Retrieval and Compression

Fine-Grained Multi-human Parsing

A Survey of Deep Facial Attribute Analysis

Learning Multifunctional Binary Codes for Personalized Image Retrieval

HetConv: Beyond Homogeneous Convolution Kernels for Deep CNNs

SSN: Learning Sparse Switchable Normalization via SparsestMax