Skip to main content
Erschienen in: International Journal of Computer Vision 11-12/2019

21.02.2019

Unsupervised Binary Representation Learning with Deep Variational Networks

verfasst von: Yuming Shen, Li Liu, Ling Shao

Erschienen in: International Journal of Computer Vision | Ausgabe 11-12/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Learning to hash is regarded as an efficient approach for image retrieval and many other big-data applications. Recently, deep learning frameworks are adopted for image hashing, suggesting an alternative way to formulate the encoding function other than the conventional projections. Although deep learning has been proved to be successful in supervised hashing, existing unsupervised deep hashing techniques still cannot produce leading performance compared with the non-deep methods, as it is hard to unveil the intrinsic structure of the whole sample space by simply regularizing the output codes within each single training batch. To tackle this problem, in this paper, we propose a novel unsupervised deep hashing model, named deep variational binaries (DVB). The conditional auto-encoding variational Bayesian networks are introduced in this work to exploit the feature space structure of the training data using the latent variables. Integrating the probabilistic inference process with hashing objectives, the proposed DVB model estimates the statistics of data representations, and thus produces compact binary codes. Experimental results on three benchmark datasets, i.e., CIFAR-10, SUN-397 and NUS-WIDE, demonstrate that DVB outperforms state-of-the-art unsupervised hashing methods with significant margins.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G. S., Davis, A., Dean, J., & Devin, M., et al. (2016). Tensorflow: Large-scale machine learning on heterogeneous distributed systems. arXiv preprint arXiv:1603.04467. Abadi, M., Agarwal, A., Barham, P., Brevdo, E., Chen, Z., Citro, C., Corrado, G. S., Davis, A., Dean, J., & Devin, M., et al. (2016). Tensorflow: Large-scale machine learning on heterogeneous distributed systems. arXiv preprint arXiv:​1603.​04467.
Zurück zum Zitat Belkin, M., & Niyogi, P. (2001). Laplacian eigenmaps and spectral techniques for embedding and clustering. In Advances in neural information processing systems (NIPS). Belkin, M., & Niyogi, P. (2001). Laplacian eigenmaps and spectral techniques for embedding and clustering. In Advances in neural information processing systems (NIPS).
Zurück zum Zitat Cao, Y., Liu, B., Long, M., & Wang, J. (2018). Hashgan: Deep learning to hash with pair conditional wasserstein gan. In IEEE conference on computer vision and pattern recognition (CVPR). Cao, Y., Liu, B., Long, M., & Wang, J. (2018). Hashgan: Deep learning to hash with pair conditional wasserstein gan. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Cao, Y., Long, M., Wang, J., Zhu, H., & Wen, Q. (2016). Deep quantization network for efficient image retrieval. In AAAI conference on artificial intelligence (AAAI). Cao, Y., Long, M., Wang, J., Zhu, H., & Wen, Q. (2016). Deep quantization network for efficient image retrieval. In AAAI conference on artificial intelligence (AAAI).
Zurück zum Zitat Carreira-Perpinán, M. A., & Raziperchikolaei, R. (2015). Hashing with binary autoencoders. In IEEE conference on computer vision and pattern recognition (CVPR). Carreira-Perpinán, M. A., & Raziperchikolaei, R. (2015). Hashing with binary autoencoders. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Chaidaroon, S., & Fang, Y. (2017). Variational deep semantic hashing for text documents. In ACM conference on research and development in information retrieval (SIGIR). Chaidaroon, S., & Fang, Y. (2017). Variational deep semantic hashing for text documents. In ACM conference on research and development in information retrieval (SIGIR).
Zurück zum Zitat Charikar, M.S. (2002). Similarity estimation techniques from rounding algorithms. In ACM symposium on theory of computing (STOC). Charikar, M.S. (2002). Similarity estimation techniques from rounding algorithms. In ACM symposium on theory of computing (STOC).
Zurück zum Zitat Chua, T. S., Tang, J., Hong, R., Li, H., Luo, Z., & Zheng, Y. (2009). Nus-wide: A real-world web image database from national university of singapore. In ACM international conference on image and video retrieval (CIVR). Chua, T. S., Tang, J., Hong, R., Li, H., Luo, Z., & Zheng, Y. (2009). Nus-wide: A real-world web image database from national university of singapore. In ACM international conference on image and video retrieval (CIVR).
Zurück zum Zitat Dai, B., Guo, R., Kumar, S., He, N., & Song, L. (2017). Stochastic generative hashing. In International conference on machine learning (ICML). Dai, B., Guo, R., Kumar, S., He, N., & Song, L. (2017). Stochastic generative hashing. In International conference on machine learning (ICML).
Zurück zum Zitat Do, T. T., Doan, A. D., & Cheung, N. M. (2016). Learning to hash with binary deep neural network. In European conference on computer vision (ECCV). Do, T. T., Doan, A. D., & Cheung, N. M. (2016). Learning to hash with binary deep neural network. In European conference on computer vision (ECCV).
Zurück zum Zitat Erin Liong, V., Lu, J., Tan, Y. P., & Zhou, J. (2017). Cross-modal deep variational hashing. In IEEE international conference on computer vision (ICCV). Erin Liong, V., Lu, J., Tan, Y. P., & Zhou, J. (2017). Cross-modal deep variational hashing. In IEEE international conference on computer vision (ICCV).
Zurück zum Zitat Erin Liong, V., Lu, J., Wang, G., Moulin, P., & Zhou, J. (2015). Deep hashing for compact binary codes learning. In IEEE conference on computer vision and pattern recognition (CVPR). Erin Liong, V., Lu, J., Wang, G., Moulin, P., & Zhou, J. (2015). Deep hashing for compact binary codes learning. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Eslami, S.A., Heess, N., Weber, T., Tassa, Y., Szepesvari, D., & Hinton, G.E., et al. (2016). Attend, infer, repeat: Fast scene understanding with generative models. In Advances in neural information processing systems (NIPS). Eslami, S.A., Heess, N., Weber, T., Tassa, Y., Szepesvari, D., & Hinton, G.E., et al. (2016). Attend, infer, repeat: Fast scene understanding with generative models. In Advances in neural information processing systems (NIPS).
Zurück zum Zitat Gong, Y., Lazebnik, S., Gordo, A., & Perronnin, F. (2013). Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(12), 2916–2929. As References are repeated twice in reference list, we have deleted the duplicate reference please check and confirm.CrossRef Gong, Y., Lazebnik, S., Gordo, A., & Perronnin, F. (2013). Iterative quantization: A procrustean approach to learning binary codes for large-scale image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(12), 2916–2929. As References are repeated twice in reference list, we have deleted the duplicate reference please check and confirm.CrossRef
Zurück zum Zitat Guo, Y., Ding, G., Liu, L., Han, J., & Shao, L. (2017). Learning to hash with optimized anchor embedding for scalable retrieval. IEEE Transactions on Image Processing, 26(3), 1344–1354.MathSciNetCrossRef Guo, Y., Ding, G., Liu, L., Han, J., & Shao, L. (2017). Learning to hash with optimized anchor embedding for scalable retrieval. IEEE Transactions on Image Processing, 26(3), 1344–1354.MathSciNetCrossRef
Zurück zum Zitat He, K., Wen, F., & Sun, J. (2013). K-means hashing: An affinity-preserving quantization method for learning binary compact codes. In IEEE conference on computer vision and pattern recognition (CVPR). He, K., Wen, F., & Sun, J. (2013). K-means hashing: An affinity-preserving quantization method for learning binary compact codes. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat He, X., & Niyogi, P. (2003). Locality preserving projections. In Advances in neural information processing systems (NIPS). He, X., & Niyogi, P. (2003). Locality preserving projections. In Advances in neural information processing systems (NIPS).
Zurück zum Zitat Heo, J. P., Lee, Y., He, J., Chang, S. F., & Yoon, S. E. (2012). Spherical hashing. In IEEE conference on computer vision and pattern recognition (CVPR). Heo, J. P., Lee, Y., He, J., Chang, S. F., & Yoon, S. E. (2012). Spherical hashing. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Hu, G., Hua, Y., Yuan, Y., Zhang, Z., Lu, Z., Mukherjee, S. S., Hospedales, T. M., Robertson, N. M., & Yang, Y. (2017). Attribute-enhanced face recognition with neural tensor fusion networks. In IEEE International conference on computer vision (ICCV). Hu, G., Hua, Y., Yuan, Y., Zhang, Z., Lu, Z., Mukherjee, S. S., Hospedales, T. M., Robertson, N. M., & Yang, Y. (2017). Attribute-enhanced face recognition with neural tensor fusion networks. In IEEE International conference on computer vision (ICCV).
Zurück zum Zitat Jiang, Q. Y., & Li, W. J. (2017). Deep cross-modal hashing. In IEEE conference on computer vision and pattern recognition (CVPR). Jiang, Q. Y., & Li, W. J. (2017). Deep cross-modal hashing. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Kingma, D., & Ba, J. (2015). Adam: A method for acm symposium on theory of computing (stoc)hastic optimization. In International conference on learning representations (ICLR). Kingma, D., & Ba, J. (2015). Adam: A method for acm symposium on theory of computing (stoc)hastic optimization. In International conference on learning representations (ICLR).
Zurück zum Zitat Kingma, D., & Welling, M. (2014). Auto-encoding variational bayes. In International conference on learning representations (ICLR). Kingma, D., & Welling, M. (2014). Auto-encoding variational bayes. In International conference on learning representations (ICLR).
Zurück zum Zitat Kingma, D. P., Mohamed, S., Rezende, D. J., & Welling, M. (2014). Semi-supervised learning with deep generative models. In Advances in neural information processing systems (NIPS). Kingma, D. P., Mohamed, S., Rezende, D. J., & Welling, M. (2014). Semi-supervised learning with deep generative models. In Advances in neural information processing systems (NIPS).
Zurück zum Zitat Kong, W., & Li, W. J. (2012). Isotropic hashing. In Advances in neural information processing systems (NIPS). Kong, W., & Li, W. J. (2012). Isotropic hashing. In Advances in neural information processing systems (NIPS).
Zurück zum Zitat Krizhevsky, A., & Hinton, G. (2009). Learning multiple layers of features from tiny images. Krizhevsky, A., & Hinton, G. (2009). Learning multiple layers of features from tiny images.
Zurück zum Zitat Kulis, B., & Darrell, T. (2009). Learning to hash with binary reconstructive embeddings. In Advances in neural information processing systems (NIPS). Kulis, B., & Darrell, T. (2009). Learning to hash with binary reconstructive embeddings. In Advances in neural information processing systems (NIPS).
Zurück zum Zitat Kulis, B., & Grauman, K. (2009). Kernelized locality-sensitive hashing for scalable image search. In IEEE international conference on computer vision (ICCV). Kulis, B., & Grauman, K. (2009). Kernelized locality-sensitive hashing for scalable image search. In IEEE international conference on computer vision (ICCV).
Zurück zum Zitat Kulkarni, T. D., Whitney, W. F., Kohli, P., & Tenenbaum, J. (2015). Deep convolutional inverse graphics network. In Advances in neural information processing systems (NIPS). Kulkarni, T. D., Whitney, W. F., Kohli, P., & Tenenbaum, J. (2015). Deep convolutional inverse graphics network. In Advances in neural information processing systems (NIPS).
Zurück zum Zitat Lai, H., Pan, Y., Liu, Y., & Yan, S. (2015). Simultaneous feature learning and hash coding with deep neural networks. In IEEE conference on computer vision and pattern recognition (CVPR). Lai, H., Pan, Y., Liu, Y., & Yan, S. (2015). Simultaneous feature learning and hash coding with deep neural networks. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Lin, K., Lu, J., Chen, C. S., & Zhou, J. (2016). Learning compact binary descriptors with unsupervised deep neural networks. In IEEE conference on computer vision and pattern recognition (CVPR). Lin, K., Lu, J., Chen, C. S., & Zhou, J. (2016). Learning compact binary descriptors with unsupervised deep neural networks. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Liu, L., Lin, Z., Shao, L., Shen, F., Ding, G., & Han, J. (2017). Sequential discrete hashing for scalable cross-modality similarity retrieval. IEEE Transactions on Image Processing, 26(1), 107–118.MathSciNetCrossRef Liu, L., Lin, Z., Shao, L., Shen, F., Ding, G., & Han, J. (2017). Sequential discrete hashing for scalable cross-modality similarity retrieval. IEEE Transactions on Image Processing, 26(1), 107–118.MathSciNetCrossRef
Zurück zum Zitat Liu, L., & Shao, L. (2016). Sequential compact code learning for unsupervised image hashing. IEEE Transactions on Neural Networks and Learning Systems, 27(12), 2526–2536.CrossRef Liu, L., & Shao, L. (2016). Sequential compact code learning for unsupervised image hashing. IEEE Transactions on Neural Networks and Learning Systems, 27(12), 2526–2536.CrossRef
Zurück zum Zitat Liu, L., Shao, L., Shen, F., & Yu, M. (2017). Discretely coding semantic rank orders for supervised image hashing. In IEEE conference on computer vision and pattern recognition (CVPR). Liu, L., Shao, L., Shen, F., & Yu, M. (2017). Discretely coding semantic rank orders for supervised image hashing. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Liu, L., Shen, F., Shen, Y., Liu, X., & Shao, L. (2017). Deep sketch hashing: Fast free-hand sketch-based image retrieval. In IEEE conference on computer vision and pattern recognition (CVPR). Liu, L., Shen, F., Shen, Y., Liu, X., & Shao, L. (2017). Deep sketch hashing: Fast free-hand sketch-based image retrieval. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Liu, L., Yu, M., & Shao, L. (2016). Unsupervised local feature hashing for image similarity search. IEEE Transactions on Cybernetics, 46(11), 2548–2558.CrossRef Liu, L., Yu, M., & Shao, L. (2016). Unsupervised local feature hashing for image similarity search. IEEE Transactions on Cybernetics, 46(11), 2548–2558.CrossRef
Zurück zum Zitat Liu, L., Yu, M., & Shao, L. (2017). Latent structure preserving hashing. International Journal of Computer Vision, 122(3), 439–457.MathSciNetCrossRef Liu, L., Yu, M., & Shao, L. (2017). Latent structure preserving hashing. International Journal of Computer Vision, 122(3), 439–457.MathSciNetCrossRef
Zurück zum Zitat Liu, L., Yu, M., & Shao, L. (2017). Learning short binary codes for large-scale image retrieval. IEEE Transactions on Image Processing, 26(3), 1289–1299.MathSciNetCrossRef Liu, L., Yu, M., & Shao, L. (2017). Learning short binary codes for large-scale image retrieval. IEEE Transactions on Image Processing, 26(3), 1289–1299.MathSciNetCrossRef
Zurück zum Zitat Liu, W., Mu, C., Kumar, S., & Chang, S. F. (2014). Discrete graph hashing. In Advances in neural information processing systems (NIPS). Liu, W., Mu, C., Kumar, S., & Chang, S. F. (2014). Discrete graph hashing. In Advances in neural information processing systems (NIPS).
Zurück zum Zitat Liu, W., Wang, J., Kumar, S., & Chang, S. F. (2011). Hashing with graphs. In International conference on machine learning (ICML). Liu, W., Wang, J., Kumar, S., & Chang, S. F. (2011). Hashing with graphs. In International conference on machine learning (ICML).
Zurück zum Zitat Maaten, Lvd, & Hinton, G. (2008). Visualizing data using t-sne. Journal of Machine Learning Research, 9(Nov), 2579–2605.MATH Maaten, Lvd, & Hinton, G. (2008). Visualizing data using t-sne. Journal of Machine Learning Research, 9(Nov), 2579–2605.MATH
Zurück zum Zitat Norouzi, M., & Blei, D. M. (2011). Minimal loss hashing for compact binary codes. In International conference on machine learning (ICML). Norouzi, M., & Blei, D. M. (2011). Minimal loss hashing for compact binary codes. In International conference on machine learning (ICML).
Zurück zum Zitat Oliva, A., & Torralba, A. (2001). Modeling the shape of the scene: A holistic representation of the spatial envelope. International Journal of Computer Vision, 42(3), 145–175.CrossRef Oliva, A., & Torralba, A. (2001). Modeling the shape of the scene: A holistic representation of the spatial envelope. International Journal of Computer Vision, 42(3), 145–175.CrossRef
Zurück zum Zitat Purushotham, S., Carvalho, W., Nilanon, T., & Liu, Y. (2017). Variational recurrent adversarial deep domain adaptation. In International conference on learning representations (ICLR). Purushotham, S., Carvalho, W., Nilanon, T., & Liu, Y. (2017). Variational recurrent adversarial deep domain adaptation. In International conference on learning representations (ICLR).
Zurück zum Zitat Raginsky, M., & Lazebnik, S. (2009). Locality-sensitive binary codes from shift-invariant kernels. In Advances in neural information processing systems (NIPS). Raginsky, M., & Lazebnik, S. (2009). Locality-sensitive binary codes from shift-invariant kernels. In Advances in neural information processing systems (NIPS).
Zurück zum Zitat Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., et al. (2015). Imagenet large scale visual recognition challenge. International Journal of Computer Vision, 115(3), 211–252.MathSciNetCrossRef Russakovsky, O., Deng, J., Su, H., Krause, J., Satheesh, S., Ma, S., et al. (2015). Imagenet large scale visual recognition challenge. International Journal of Computer Vision, 115(3), 211–252.MathSciNetCrossRef
Zurück zum Zitat Salakhutdinov, R., & Hinton, G. (2009). Semantic hashing. International Journal of Approximate Reasoning, 50(7), 969–978.CrossRef Salakhutdinov, R., & Hinton, G. (2009). Semantic hashing. International Journal of Approximate Reasoning, 50(7), 969–978.CrossRef
Zurück zum Zitat Serban, I. V., Sordoni, A., Lowe, R., Charlin, L., Pineau, J., Courville, A. C., & Bengio, Y. (2017). A hierarchical latent variable encoder-decoder model for generating dialogues. In AAAI conference on artificial intelligence (AAAI). Serban, I. V., Sordoni, A., Lowe, R., Charlin, L., Pineau, J., Courville, A. C., & Bengio, Y. (2017). A hierarchical latent variable encoder-decoder model for generating dialogues. In AAAI conference on artificial intelligence (AAAI).
Zurück zum Zitat Shen, F., Shen, C., Liu, W., & Tao Shen, H. (2015). Supervised discrete hashing. In IEEE conference on computer vision and pattern recognition (CVPR). Shen, F., Shen, C., Liu, W., & Tao Shen, H. (2015). Supervised discrete hashing. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Shen, Y., Liu, l., & Shao, L. (2017). Unsupervised deep generative hashing. In British machine vision conference (BMVC). Shen, Y., Liu, l., & Shao, L. (2017). Unsupervised deep generative hashing. In British machine vision conference (BMVC).
Zurück zum Zitat Shen, Y., Liu, l., Shao, L., & Song, J. (2017). Deep binaries: Encoding semantic-rich cues for efficient textual-visual cross retrieval. In IEEE international conference on computer vision (ICCV). Shen, Y., Liu, l., Shao, L., & Song, J. (2017). Deep binaries: Encoding semantic-rich cues for efficient textual-visual cross retrieval. In IEEE international conference on computer vision (ICCV).
Zurück zum Zitat Simonyan, K., & Zisserman, A. (2015). Very deep convolutional networks for large-scale image recognition. In International conference in learning representations (ICLR). Simonyan, K., & Zisserman, A. (2015). Very deep convolutional networks for large-scale image recognition. In International conference in learning representations (ICLR).
Zurück zum Zitat Sohn, K., Lee, H., & Yan, X. (2015). Learning structured output representation using deep conditional generative models. In Advances in neural information processing systems (NIPS). Sohn, K., Lee, H., & Yan, X. (2015). Learning structured output representation using deep conditional generative models. In Advances in neural information processing systems (NIPS).
Zurück zum Zitat Song, J., Yang, Y., Yang, Y., Huang, Z., & Shen, H. T. (2013). Inter-media hashing for large-scale retrieval from heterogeneous data sources. In ACM international conference on management of data (SIGMOD). Song, J., Yang, Y., Yang, Y., Huang, Z., & Shen, H. T. (2013). Inter-media hashing for large-scale retrieval from heterogeneous data sources. In ACM international conference on management of data (SIGMOD).
Zurück zum Zitat Tucker, L. R. (1966). Some mathematical notes on three-mode factor analysis. Psychometrika, 31(3), 279–311.MathSciNetCrossRef Tucker, L. R. (1966). Some mathematical notes on three-mode factor analysis. Psychometrika, 31(3), 279–311.MathSciNetCrossRef
Zurück zum Zitat Wang, J., Kumar, S., & Chang, S. F. (2012). Semi-supervised hashing for large-scale search. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(12), 2393–2406.CrossRef Wang, J., Kumar, S., & Chang, S. F. (2012). Semi-supervised hashing for large-scale search. IEEE Transactions on Pattern Analysis and Machine Intelligence, 34(12), 2393–2406.CrossRef
Zurück zum Zitat Weiss, Y., Torralba, A., & Fergus, R. (2009). Spectral hashing. In Advances in neural information processing systems (NIPS). Weiss, Y., Torralba, A., & Fergus, R. (2009). Spectral hashing. In Advances in neural information processing systems (NIPS).
Zurück zum Zitat Xia, R., Pan, Y., Lai, H., Liu, C., & Yan, S. (2014). Supervised hashing for image retrieval via image representation learning. In AAAI conference on artificial intelligence (AAAI). Xia, R., Pan, Y., Lai, H., Liu, C., & Yan, S. (2014). Supervised hashing for image retrieval via image representation learning. In AAAI conference on artificial intelligence (AAAI).
Zurück zum Zitat Xiao, J., Hays, J., Ehinger, K. A., Oliva, A., & Torralba, A. (2010). Sun database: Large-scale scene recognition from abbey to zoo. In IEEE conference on computer vision and pattern recognition (CVPR). Xiao, J., Hays, J., Ehinger, K. A., Oliva, A., & Torralba, A. (2010). Sun database: Large-scale scene recognition from abbey to zoo. In IEEE conference on computer vision and pattern recognition (CVPR).
Zurück zum Zitat Xu, K., Ba, J., Kiros, R., Cho, K., Courville, A., Salakhudinov, R., Zemel, R., & Bengio, Y. (2015). Show, attend and tell: Neural image caption generation with visual attention. In International conference on machine learning (ICML). Xu, K., Ba, J., Kiros, R., Cho, K., Courville, A., Salakhudinov, R., Zemel, R., & Bengio, Y. (2015). Show, attend and tell: Neural image caption generation with visual attention. In International conference on machine learning (ICML).
Zurück zum Zitat Yan, X., Yang, J., Sohn, K., & Lee, H. (2016). Attribute2image: Conditional image generation from visual attributes. In European conference on computer vision (ECCV). Yan, X., Yang, J., Sohn, K., & Lee, H. (2016). Attribute2image: Conditional image generation from visual attributes. In European conference on computer vision (ECCV).
Zurück zum Zitat Yang, Z., Hu, Z., Salakhutdinov, R., & Berg-Kirkpatrick, T. (2017). Improved variational autoencoders for text modeling using dilated convolutions. In arXiv preprint arXiv:1702.08139. Yang, Z., Hu, Z., Salakhutdinov, R., & Berg-Kirkpatrick, T. (2017). Improved variational autoencoders for text modeling using dilated convolutions. In arXiv preprint arXiv:​1702.​08139.
Zurück zum Zitat Yu, M., Liu, L., & Shao, L. (2016). Structure-preserving binary representations for rgb-d action recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 38(8), 1651–1664.CrossRef Yu, M., Liu, L., & Shao, L. (2016). Structure-preserving binary representations for rgb-d action recognition. IEEE Transactions on Pattern Analysis and Machine Intelligence, 38(8), 1651–1664.CrossRef
Zurück zum Zitat Zhu, H., Long, M., Wang, J., & Cao, Y. (2016). Deep hashing network for efficient similarity retrieval. In AAAI conference on artificial intelligence (AAAI). Zhu, H., Long, M., Wang, J., & Cao, Y. (2016). Deep hashing network for efficient similarity retrieval. In AAAI conference on artificial intelligence (AAAI).
Zurück zum Zitat Zhu, X., Zhang, L., & Huang, Z. (2014). A sparse embedding and least variance encoding approach to hashing. IEEE Transactions on Image Processing, 23(9), 3737–3750.MathSciNetCrossRef Zhu, X., Zhang, L., & Huang, Z. (2014). A sparse embedding and least variance encoding approach to hashing. IEEE Transactions on Image Processing, 23(9), 3737–3750.MathSciNetCrossRef
Metadaten
Titel
Unsupervised Binary Representation Learning with Deep Variational Networks
verfasst von
Yuming Shen
Li Liu
Ling Shao
Publikationsdatum
21.02.2019
Verlag
Springer US
Erschienen in
International Journal of Computer Vision / Ausgabe 11-12/2019
Print ISSN: 0920-5691
Elektronische ISSN: 1573-1405
DOI
https://doi.org/10.1007/s11263-019-01166-4

Weitere Artikel der Ausgabe 11-12/2019

International Journal of Computer Vision 11-12/2019 Zur Ausgabe