skip to main content
10.1145/3123266.3123280acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
research-article

Deep Progressive Hashing for Image Retrieval

Authors Info & Claims
Published:19 October 2017Publication History

ABSTRACT

This paper proposes a novel recursive hashing scheme, in contrast to conventional "one-off" based hashing algorithms. Inspired by human's "nonsalient-to-salient" perception path, the proposed hashing scheme generates a series of binary codes based on progressively expanded salient regions. Built on a recurrent deep network, i.e., LSTM structure, the binary codes generated from later output nodes naturally inherit information aggregated from previously codes while explore novel information from the extended salient region, and therefore it possesses good scalability property. The proposed deep hashing network is trained via minimizing a triplet ranking loss, which is end-to-end trainable. Extensive experimental results on several image retrieval benchmarks demonstrate good performance gain over state-of-the-art image retrieval methods and its scalability property.

References

  1. Neil Bruce and John Tsotsos. 2006. Saliency based on information maximization. Advances in neural information processing systems Vol. 18 (2006), 155.Google ScholarGoogle Scholar
  2. Yue Cao, Mingsheng Long, Jianmin Wang, Qiang Yang, and Philip S. Yu. 2016. Deep Visual-Semantic Hashing for Cross-Modal Retrieval Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, August 13-17, 2016. 1445--1454. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Tat Seng Chua, Jinhui Tang, Richang Hong, Haojie Li, Zhiping Luo, and Yantao Zheng. 2009. NUS-WIDE: a real-world web image database from National University of Singapore ACM International Conference on Image and Video Retrieval. 48. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Qi Dai, Jianguo Li, Jingdong Wang, and Yu-Gang Jiang. 2016. Binary Optimized Hashing. In Proceedings of the 2016 ACM Conference on Multimedia Conference, MM 2016, Amsterdam, The Netherlands, October 15--19, 2016. 1247--1256. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Jeff Donahue, Lisa Anne Hendricks, Sergio Guadarrama, Marcus Rohrbach, Subhashini Venugopalan, Trevor Darrell, and Kate Saenko. 2015. Long-term recurrent convolutional networks for visual recognition and description IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015, Boston, MA, USA, June 7--12, 2015. 2625--2634.Google ScholarGoogle Scholar
  6. Yunchao Gong, Sanjiv Kumar, Henry A. Rowley, and Svetlana Lazebnik. 2013 a. Learning Binary Codes for High-Dimensional Data Using Bilinear Projections 2013 IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, June 23-28, 2013. 484--491. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Yunchao Gong, Svetlana Lazebnik, Albert Gordo, and Florent Perronnin. 2013 b. Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval. IEEE Trans. Pattern Anal. Mach. Intell. Vol. 35, 12 (2013), 2916--2929. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Alex Graves, Abdel-rahman Mohamed, and Geoffrey E. Hinton. 2013. Speech recognition with deep recurrent neural networks IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2013, Vancouver, BC, Canada, May 26--31, 2013. 6645--6649.Google ScholarGoogle Scholar
  9. Yun Gu, Chao Ma, and Jie Yang. 2016. Supervised Recurrent Hashing for Large Scale Video Retrieval Proceedings of the 2016 ACM Conference on Multimedia Conference, MM 2016, Amsterdam, The Netherlands, October 15-19, 2016. 272--276. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Jonathan Harel, Christof Koch, Pietro Perona, et almbox.. 2006. Graph-based visual saliency. In NIPS, Vol. Vol. 1. 5. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long Short-Term Memory. Neural Computation, Vol. 9, 8 (1997), 1735--1780. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Laurent Itti, Christof Koch, and Ernst Niebur. 1998. A model of saliency-based visual attention for rapid scene analysis. IEEE Transactions on pattern analysis and machine intelligence, Vol. 20, 11 (1998), 1254--1259. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Yangqing Jia, Evan Shelhamer, Jeff Donahue, Sergey Karayev, Jonathan Long, Ross B. Girshick, Sergio Guadarrama, and Trevor Darrell. 2014. Caffe: Convolutional Architecture for Fast Feature Embedding Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03 - 07, 2014. 675--678. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Andrej Karpathy and Fei-Fei Li. 2015. Deep visual-semantic alignments for generating image descriptions IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015, Boston, MA, USA, June 7--12, 2015. 3128--3137.Google ScholarGoogle Scholar
  15. Alex Krizhevsky. 2012. Learning Multiple Layers of Features from Tiny Images. (2012).Google ScholarGoogle Scholar
  16. Brian Kulis and Trevor Darrell. 2009. Learning to Hash with Binary Reconstructive Embeddings Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7--10 December 2009, Vancouver, British Columbia, Canada. 1042--1050. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Brian Kulis and Kristen Grauman. 2009. Kernelized locality-sensitive hashing for scalable image search IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27 - October 4, 2009. 2130--2137.Google ScholarGoogle Scholar
  18. Brian Kulis and Kristen Grauman. 2012. Kernelized Locality-Sensitive Hashing. IEEE Trans. Pattern Anal. Mach. Intell. Vol. 34, 6 (2012), 1092--1104. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Hanjiang Lai, Yan Pan, Ye Liu, and Shuicheng Yan. 2015. Simultaneous feature learning and hash coding with deep neural networks IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015, Boston, MA, USA, June 7-12, 2015. 3270--3278.Google ScholarGoogle Scholar
  20. Xi Li, Guosheng Lin, Chunhua Shen, Anton van den Hengel, and Anthony R. Dick. 2013. Learning Hash Functions Using Column Generation. Proceedings of the 30th International Conference on Machine Learning, ICML 2013, Atlanta, GA, USA, 16-21 June 2013. 142--150. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Yin Li, Xiaodi Hou, Christof Koch, James M Rehg, and Alan L Yuille. 2014. The secrets of salient object segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. 280--287. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Guosheng Lin, Chunhua Shen, Qinfeng Shi, Anton van den Hengel, and David Suter. 2014. Fast Supervised Hashing with Decision Trees for High-Dimensional Data 2014 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2014, Columbus, OH, USA, June 23-28, 2014. 1971--1978. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Venice Erin Liong, Jiwen Lu, Gang Wang, Pierre Moulin, and Jie Zhou. 2015. Deep hashing for compact binary codes learning. In IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015, Boston, MA, USA, June 7-12, 2015. 2475--2483.Google ScholarGoogle ScholarCross RefCross Ref
  24. Wei Liu, Jun Wang, Rongrong Ji, Yu-Gang Jiang, and Shih-Fu Chang. 2012. Supervised hashing with kernels. In 2012 IEEE Conference on Computer Vision and Pattern Recognition, Providence, RI, USA, June 16-21, 2012 2074--2081.Google ScholarGoogle ScholarCross RefCross Ref
  25. Xianglong Liu, Junfeng He, Bo Lang, and Shih-Fu Chang. 2013. Hash Bit Selection: A Unified Solution for Selection Problems in Hashing 2013 IEEE Conference on Computer Vision and Pattern Recognition, Portland, OR, USA, June 23--28, 2013. 1570--1577. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Viet Anh Nguyen, Jiwen Lu, and Minh N. Do. 2014. Supervised Discriminative Hashing for Compact Binary Codes Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03-07, 2014. 989--992. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Mohammad Norouzi and David J. Fleet. 2011. Minimal Loss Hashing for Compact Binary Codes. In Proceedings of the 28th International Conference on Machine Learning, ICML 2011, Bellevue, Washington, USA, June 28 - July 2, 2011. 353--360. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Mohammad Norouzi, David J. Fleet, and Ruslan Salakhutdinov. 2012. Hamming Distance Metric Learning. In Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012, Lake Tahoe, Nevada, United States. 1070--1078. Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Ruslan Salakhutdinov and Geoffrey E Hinton. 2007. Learning a Nonlinear Embedding by Preserving Class Neighbourhood Structure. AISTATS, Vol. Vol. 11.Google ScholarGoogle Scholar
  30. Ruslan Salakhutdinov and Geoffrey E. Hinton. 2009. Semantic hashing. Int. J. Approx. Reasoning Vol. 50, 7 (2009), 969--978. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. O Vinyals, A Toshev, S Bengio, and D Erhan. 2015. Show and tell: A neural image caption generator. Computer Science (2015), 3156--3164.Google ScholarGoogle Scholar
  32. Jun Wang, Wei Liu, Andy X. Sun, and Yu-Gang Jiang. 2013. Learning Hash Codes with Listwise Supervision. In IEEE International Conference on Computer Vision, ICCV 2013, Sydney, Australia, December 1-8, 2013. 3032--3039. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. Qifan Wang, Luo Si, and Dan Zhang. 2014. Learning to Hash with Partial Tags: Exploring Correlation between Tags and Hashing Bits for Large Scale Image Retrieval Computer Vision - ECCV 2014 - 13th European Conference, Zurich, Switzerland, September 6--12, 2014, Proceedings, Part III. 378--392.Google ScholarGoogle Scholar
  34. Botong Wu and Yizhou Wang. 2016. Neighborhood-Preserving Hashing for Large-Scale Cross-Modal Search Proceedings of the 2016 ACM Conference on Multimedia Conference, MM 2016, Amsterdam, The Netherlands, October 15-19, 2016. 352--356. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. Rongkai Xia, Yan Pan, Hanjiang Lai, Cong Liu, and Shuicheng Yan. 2014. Supervised Hashing for Image Retrieval via Image Representation Learning Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence, July 27-31, 2014, Québec City, Québec, Canada. 2156--2162. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Junchi Yan, Mengyuan Zhu, Huanxi Liu, and Yuncai Liu. 2010. Visual saliency detection via sparsity pursuit. IEEE Signal Processing Letters Vol. 17, 8 (2010), 739--742.Google ScholarGoogle ScholarCross RefCross Ref
  37. Huei-Fang Yang, Kevin Lin, and Chu-Song Chen. 2015. Supervised Learning of Semantics-Preserving Hashing via Deep Neural Networks for Large-Scale Image Search. CoRR Vol. abs/1507.00101 (2015).Google ScholarGoogle Scholar
  38. Wojciech Zaremba and Ilya Sutskever. 2014. Learning to execute. arXiv preprint arXiv:1410.4615 (2014).Google ScholarGoogle Scholar
  39. Fang Zhao, Yongzhen Huang, Liang Wang, and Tieniu Tan. 2015. Deep semantic ranking based hashing for multi-label image retrieval Computer Vision and Pattern Recognition. 1556--1564.Google ScholarGoogle Scholar
  1. Deep Progressive Hashing for Image Retrieval

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      MM '17: Proceedings of the 25th ACM international conference on Multimedia
      October 2017
      2028 pages
      ISBN:9781450349062
      DOI:10.1145/3123266

      Copyright © 2017 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 19 October 2017

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      MM '17 Paper Acceptance Rate189of684submissions,28%Overall Acceptance Rate995of4,171submissions,24%

      Upcoming Conference

      MM '24
      MM '24: The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader