ABSTRACT
This paper proposes a novel recursive hashing scheme, in contrast to conventional "one-off" hashing algorithms. Inspired by the human "nonsalient-to-salient" perception path, the proposed scheme generates a series of binary codes from progressively expanded salient regions. Because it is built on a recurrent deep network, specifically an LSTM structure, the binary codes generated at later output nodes naturally inherit information aggregated from previous codes while exploring novel information from the extended salient region; the scheme therefore scales well. The proposed deep hashing network is trained end-to-end by minimizing a triplet ranking loss. Extensive experimental results on several image retrieval benchmarks demonstrate a good performance gain over state-of-the-art image retrieval methods and confirm the scheme's scalability.
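The abstract names a triplet ranking loss but does not give its exact form. As a rough illustration only, the sketch below uses a common hinge-style formulation on relaxed (real-valued) codes, followed by thresholding and a Hamming-distance comparison. The function names, the margin value, the 0.5 threshold, and the squared Euclidean distance are all assumptions made for this example, not details taken from the paper.

```python
import numpy as np

def triplet_ranking_loss(query, positive, negative, margin=1.0):
    """Hinge-style triplet loss on relaxed codes (assumed form, not the paper's).

    The loss is zero once the negative is at least `margin` farther from
    the query than the positive, measured by squared Euclidean distance.
    """
    d_pos = np.sum((query - positive) ** 2, axis=-1)
    d_neg = np.sum((query - negative) ** 2, axis=-1)
    return np.maximum(0.0, margin + d_pos - d_neg)

def binarize(relaxed, threshold=0.5):
    """Threshold relaxed codes in [0, 1] into bits (threshold is an assumption)."""
    return (np.asarray(relaxed) >= threshold).astype(np.uint8)

def hamming(a, b):
    """Number of differing bits between two binary codes."""
    return int(np.count_nonzero(np.asarray(a) != np.asarray(b)))

# Toy relaxed codes for a query, a similar image, and a dissimilar image.
q = np.array([0.9, 0.1, 0.8, 0.2])
p = np.array([0.8, 0.2, 0.9, 0.1])
n = np.array([0.1, 0.9, 0.2, 0.8])

loss_small_margin = triplet_ranking_loss(q, p, n, margin=1.0)
loss_large_margin = triplet_ranking_loss(q, p, n, margin=3.0)
dist_pos = hamming(binarize(q), binarize(p))
dist_neg = hamming(binarize(q), binarize(n))
```

With these toy values the triplet already satisfies the smaller margin, so only the larger margin produces a nonzero loss; after thresholding, the similar image matches the query in every bit while the dissimilar one differs in every bit.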
- Deep Progressive Hashing for Image Retrieval