Skip to main content
Top

2018 | OriginalPaper | Chapter

ForestHash: Semantic Hashing with Shallow Random Forests and Tiny Convolutional Networks

Authors : Qiang Qiu, José Lezama, Alex Bronstein, Guillermo Sapiro

Published in: Computer Vision – ECCV 2018

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper, we introduce a random forest semantic hashing scheme that embeds tiny convolutional neural networks (CNN) into shallow random forests. A binary hash code for a data point is obtained by a set of decision trees, setting ‘1’ for the visited tree leaf, and ‘0’ for the rest. We propose to first randomly group arriving classes at each tree split node into two groups, obtaining a significantly simplified two-class classification problem that can be a handled with a light-weight CNN weak learner. Code uniqueness is achieved via the random class grouping, whilst code consistency is achieved using a low-rank loss in the CNN weak learners that encourages intra-class compactness for the two random class groups. Finally, we introduce an information-theoretic approach for aggregating codes of individual trees into a single hash code, producing a near-optimal unique hash for each class. The proposed approach significantly outperforms state-of-the-art hashing methods for image retrieval tasks on large-scale public datasets, and is comparable to image classification methods while utilizing a more compact, efficient and scalable representation. This work proposes a principled and robust procedure to train and deploy in parallel an ensemble of light-weight CNNs, instead of simply going deeper.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Footnotes
1
Results are taken from the respective papers.
 
Literature
1.
go back to reference Gionis, A., Indyk, P., Motwani, R.: Similarity search in high dimensions via hashing. In: Proceedings of International Conference on Very Large Data Bases (1999) Gionis, A., Indyk, P., Motwani, R.: Similarity search in high dimensions via hashing. In: Proceedings of International Conference on Very Large Data Bases (1999)
2.
go back to reference Kulis, B., Grauman, K.: Kernelized locality-sensitive hashing for scalable image search. In: Proceedings of International Conference on Computer vision (2009) Kulis, B., Grauman, K.: Kernelized locality-sensitive hashing for scalable image search. In: Proceedings of International Conference on Computer vision (2009)
3.
go back to reference Weiss, Y., Torralba, A., Fergus, R.: Spectral hashing. In: Advances in Neural Information Processing Systems (2009) Weiss, Y., Torralba, A., Fergus, R.: Spectral hashing. In: Advances in Neural Information Processing Systems (2009)
4.
go back to reference Masci, J., Bronstein, A.M., Bronstein, M.M., Sprechmann, P., Sapiro, G.: Sparse similarity-preserving hashing. In: International Conference on Learning Representations, Banff, Canada, April 2014 Masci, J., Bronstein, A.M., Bronstein, M.M., Sprechmann, P., Sapiro, G.: Sparse similarity-preserving hashing. In: International Conference on Learning Representations, Banff, Canada, April 2014
5.
go back to reference Liu, W., Wang, J., Ji, R., Jiang, Y., Chang, S.: Supervised hashing with Kernels. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, June 2012 Liu, W., Wang, J., Ji, R., Jiang, Y., Chang, S.: Supervised hashing with Kernels. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, June 2012
6.
go back to reference Liu, W., Wang, J., Chang, S.: Hashing with graphs. In: International Conference on Machine Learning (2011) Liu, W., Wang, J., Chang, S.: Hashing with graphs. In: International Conference on Machine Learning (2011)
7.
go back to reference Zhang, D., Wang, J., Cai, D., Lu, J.: Self-taught hashing for fast similarity search. In: Proceedings of International Conference on Research and Development in Information Retrieval (2010) Zhang, D., Wang, J., Cai, D., Lu, J.: Self-taught hashing for fast similarity search. In: Proceedings of International Conference on Research and Development in Information Retrieval (2010)
8.
go back to reference Liu, H., Wang, R., Shan, S., Chen, X.: Deep supervised hashing for fast image retrieval. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2016) Liu, H., Wang, R., Shan, S., Chen, X.: Deep supervised hashing for fast image retrieval. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2016)
9.
go back to reference Masci, J., Bronstein, M.M., Bronstein, A.M., Schmidhuber, J.: Multimodal similarity-preserving hashing. IEEE Trans. Patt. Anal. Mach. Intell. 36(4), 824–830 (2014)CrossRef Masci, J., Bronstein, M.M., Bronstein, A.M., Schmidhuber, J.: Multimodal similarity-preserving hashing. IEEE Trans. Patt. Anal. Mach. Intell. 36(4), 824–830 (2014)CrossRef
10.
go back to reference Norouzi, M., Fleet, D.J., Salakhutdinov, R.: Hamming distance metric learning. In: Advances in Neural Information Processing Systems (2012) Norouzi, M., Fleet, D.J., Salakhutdinov, R.: Hamming distance metric learning. In: Advances in Neural Information Processing Systems (2012)
13.
go back to reference Shotton, J.: Efficient human pose estimation from single depth images. IEEE Trans. Patt. Anal. Mach. Intell. 35(12), 2821–2840 (2013)CrossRef Shotton, J.: Efficient human pose estimation from single depth images. IEEE Trans. Patt. Anal. Mach. Intell. 35(12), 2821–2840 (2013)CrossRef
14.
go back to reference Gall, J., Lempitsky, V.: Class-specific HOUGH forests for object detection. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2009) Gall, J., Lempitsky, V.: Class-specific HOUGH forests for object detection. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2009)
15.
go back to reference Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers Inc., Burlington (1993) Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers Inc., Burlington (1993)
16.
go back to reference Fazel, M.: Matrix Rank Minimization with Applications. Ph.D Thesis, Stanford University (2002) Fazel, M.: Matrix Rank Minimization with Applications. Ph.D Thesis, Stanford University (2002)
17.
go back to reference Qiu, Q., Sapiro, G.: Learning transformations for clustering and classification. J. Mach. Learn. Res. 16, 187–225 (2015)MathSciNetMATH Qiu, Q., Sapiro, G.: Learning transformations for clustering and classification. J. Mach. Learn. Res. 16, 187–225 (2015)MathSciNetMATH
18.
go back to reference Aharon, M., Elad, M., Bruckstein, A.: K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans. Sig. Process. 54(11), 4311–4322 (2006)CrossRef Aharon, M., Elad, M., Bruckstein, A.: K-SVD: an algorithm for designing overcomplete dictionaries for sparse representation. IEEE Trans. Sig. Process. 54(11), 4311–4322 (2006)CrossRef
19.
go back to reference Qiu, Q., Sapiro, G.: Learning transformations for classification forests. In: ICLR (2014) Qiu, Q., Sapiro, G.: Learning transformations for classification forests. In: ICLR (2014)
20.
go back to reference Watson, G.A.: Characterization of the subdifferential of some matrix norms. Linear Algebra Appl. 170, 1039–1053 (1992)MathSciNetCrossRef Watson, G.A.: Characterization of the subdifferential of some matrix norms. Linear Algebra Appl. 170, 1039–1053 (1992)MathSciNetCrossRef
21.
go back to reference Sriperumbudur, B.K., Lanckriet, G.R.G.: A proof of convergence of the concave-convex procedure using Zangwill’s theory. Neural Comput. 24(6), 1391–1407 (2012)MathSciNetCrossRef Sriperumbudur, B.K., Lanckriet, G.R.G.: A proof of convergence of the concave-convex procedure using Zangwill’s theory. Neural Comput. 24(6), 1391–1407 (2012)MathSciNetCrossRef
22.
go back to reference Yuille, A.L., Rangarajan, A.: The concave-convex procedure. Neural Comput. 4, 915–936 (2003)CrossRef Yuille, A.L., Rangarajan, A.: The concave-convex procedure. Neural Comput. 4, 915–936 (2003)CrossRef
23.
go back to reference Lezama, J., Qiu, Q., Musé, P., Sapiro, G.: Olé: orthogonal low-rank embedding, a plug and play geometric loss for deep learning. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2018) Lezama, J., Qiu, Q., Musé, P., Sapiro, G.: Olé: orthogonal low-rank embedding, a plug and play geometric loss for deep learning. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2018)
24.
go back to reference Krause, A., Singh, A., Guestrin, C.: Near-optimal sensor placements in Gaussian processes: theory, efficient algorithms and empirical studies. J. Mach. Learn. Res. 9, 235–284 (2008)MATH Krause, A., Singh, A., Guestrin, C.: Near-optimal sensor placements in Gaussian processes: theory, efficient algorithms and empirical studies. J. Mach. Learn. Res. 9, 235–284 (2008)MATH
25.
go back to reference Nemhauser, G., Wolsey, L., Fisher, M.: An analysis of approximations for maximizing submodular set functions. Math. Program. 14(1), 265–294 (1978)MathSciNetCrossRef Nemhauser, G., Wolsey, L., Fisher, M.: An analysis of approximations for maximizing submodular set functions. Math. Program. 14(1), 265–294 (1978)MathSciNetCrossRef
26.
go back to reference Hellman, M.E., Raviv, J.: Probability of error, equivocation, and the Chernoff bound. IEEE Trans. Inf. Theory 16, 368–372 (1979)MathSciNetCrossRef Hellman, M.E., Raviv, J.: Probability of error, equivocation, and the Chernoff bound. IEEE Trans. Inf. Theory 16, 368–372 (1979)MathSciNetCrossRef
27.
go back to reference Krizhevsky, A.: Learning multiple layers of features from tiny images. Technical report (2009) Krizhevsky, A.: Learning multiple layers of features from tiny images. Technical report (2009)
28.
go back to reference Lecun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)CrossRef Lecun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)CrossRef
29.
go back to reference Rasiwasia, N., et al.: A new approach to cross-modal multimedia retrieval. In: Proceedings of the International Conference on Multimedia (2010) Rasiwasia, N., et al.: A new approach to cross-modal multimedia retrieval. In: Proceedings of the International Conference on Multimedia (2010)
30.
go back to reference Gong, Y., Lazebnik, S.: Iterative quantization: a procrustean approach to learning binary codes. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2011) Gong, Y., Lazebnik, S.: Iterative quantization: a procrustean approach to learning binary codes. In: Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2011)
31.
go back to reference Norouzi, M., Fleet, D.J.: Minimal loss hashing for compact binary codes. In: International Conference on Machine Learning (2011) Norouzi, M., Fleet, D.J.: Minimal loss hashing for compact binary codes. In: International Conference on Machine Learning (2011)
32.
go back to reference Kulis, B., Darrell, T.: Learning to hash with binary reconstructive embeddings. In: Advances in Neural Information Processing Systems (2009) Kulis, B., Darrell, T.: Learning to hash with binary reconstructive embeddings. In: Advances in Neural Information Processing Systems (2009)
33.
go back to reference Xia, R., Pan, Y., Lai, H., Liu, C., Yan, S.: Supervised hashing for image retrieval via image representation learning. In: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence (2014) Xia, R., Pan, Y., Lai, H., Liu, C., Yan, S.: Supervised hashing for image retrieval via image representation learning. In: Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence (2014)
34.
go back to reference Lin, K., Yang, H.F., Hsiao, J.H., Chen, C.S.: Deep learning of binary hash codes for fast image retrieval. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (2015) Lin, K., Yang, H.F., Hsiao, J.H., Chen, C.S.: Deep learning of binary hash codes for fast image retrieval. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW) (2015)
35.
go back to reference Lai, H., Pan, Y., Liu, Y., Yan, S.: Simultaneous feature learning and hash coding with deep neural networks. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2015) Lai, H., Pan, Y., Liu, Y., Yan, S.: Simultaneous feature learning and hash coding with deep neural networks. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2015)
36.
go back to reference Lin, M., Chen, Q., Yan, S.: Network In Network. In: ICLR (2014) Lin, M., Chen, Q., Yan, S.: Network In Network. In: ICLR (2014)
37.
go back to reference Springenberg, J.T., Dosovitskiy, A., Brox, T., Riedmiller, M.A.: Striving for simplicity: The all convolutional net. CoRR abs/1412.6806 (2014) Springenberg, J.T., Dosovitskiy, A., Brox, T., Riedmiller, M.A.: Striving for simplicity: The all convolutional net. CoRR abs/1412.6806 (2014)
38.
go back to reference Lee, C., Xie, S., Gallagher, P.W., Zhang, Z., Tu, Z.: Deeply-supervised nets. In: AISTATS (2015) Lee, C., Xie, S., Gallagher, P.W., Zhang, Z., Tu, Z.: Deeply-supervised nets. In: AISTATS (2015)
39.
go back to reference Larsson, G., Maire, M., Shakhnarovich, G.: Fractalnet: ultra-deep neural networks without residuals. In: ICLR (2017) Larsson, G., Maire, M., Shakhnarovich, G.: Fractalnet: ultra-deep neural networks without residuals. In: ICLR (2017)
42.
go back to reference Lin, G., Shen, C., Shi, Q., van den Hengel, A., Suter, D.: Fast supervised hashing with decision trees for high-dimensional data. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2014) Lin, G., Shen, C., Shi, Q., van den Hengel, A., Suter, D.: Fast supervised hashing with decision trees for high-dimensional data. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition (2014)
43.
go back to reference Lin, G., Shen, C., Suter, D., van den Hengel, A.: A general two-step approach to learning-based hashing. In: Proceedings of the International Conference on Computer vision (2013) Lin, G., Shen, C., Suter, D., van den Hengel, A.: A general two-step approach to learning-based hashing. In: Proceedings of the International Conference on Computer vision (2013)
44.
go back to reference Wang, J., Kumar, S., Chang, S.F.: Sequential projection learning for hashing with compact codes. In: International Conference on Machine Learning, Haifa, Israel (2010) Wang, J., Kumar, S., Chang, S.F.: Sequential projection learning for hashing with compact codes. In: International Conference on Machine Learning, Haifa, Israel (2010)
45.
go back to reference Raginsky, M., Lazebnik, S.: Locality-sensitive binary codes from shift-invariant kernels. In: Advances in Neural Information Processing Systems (2010) Raginsky, M., Lazebnik, S.: Locality-sensitive binary codes from shift-invariant kernels. In: Advances in Neural Information Processing Systems (2010)
46.
go back to reference Bronstein, M., Bronstein, A., Michel, F., Paragios, N.: Data fusion through cross-modality metric learning using similarity-sensitive hashing. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, June 2010 Bronstein, M., Bronstein, A., Michel, F., Paragios, N.: Data fusion through cross-modality metric learning using similarity-sensitive hashing. In: Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, June 2010
47.
go back to reference Bengio, Y., Courville, A., Vincent, P.: Representation learning: a review and new perspectives. IEEE Trans. Patt. Anal. Mach. Intell. 35(8), 1798–1828 (2013)CrossRef Bengio, Y., Courville, A., Vincent, P.: Representation learning: a review and new perspectives. IEEE Trans. Patt. Anal. Mach. Intell. 35(8), 1798–1828 (2013)CrossRef
Metadata
Title
ForestHash: Semantic Hashing with Shallow Random Forests and Tiny Convolutional Networks
Authors
Qiang Qiu
José Lezama
Alex Bronstein
Guillermo Sapiro
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-030-01216-8_27

Premium Partner