skip to main content
10.1145/1991996.1992050acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
research-article

NV-Tree: nearest neighbors at the billion scale

Published:18 April 2011Publication History

ABSTRACT

This paper presents the NV-Tree (Nearest Vector Tree). It addresses the specific, yet important, problem of efficiently and effectively finding the approximate k-nearest neighbors within a collection of a few billion high-dimensional data points. The NV-Tree is a very compact index, as only six bytes are kept in the index for each high-dimensional descriptor. It thus scales extremely well when indexing large collections of high-dimensional descriptors. The NV-Tree efficiently produces results of good quality, even at such a large scale that the indices cannot be kept entirely in main memory any more. We demonstrate this with extensive experiments using a collection of 2.5 billion SIFT (Scale Invariant Feature Transform) descriptors.

References

  1. M. Batko, F. Falchi, C. Lucchese, D. Novak, R. Perego, F. Rabitti, J. Sedmidubsky, and P. Zezula. Building a web-scale image similarity search system. Multimedia Tools and Applications, 47, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. Datar, P. Indyk, N. Immorlica, and V. Mirrokni. Locality-sensitive hashing using stable distributions. MIT Press, 2006.Google ScholarGoogle Scholar
  3. M. Douze, H. Jégou, H. Sandhawalia, L. Amsaleg, and C. Schmid. Evaluation of gist descriptors for web-scale image search. In Proc. ACM CIVR, Santorini, Greece, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. R. Fagin, R. Kumar, and D. Sivakumar. Efficient similarity search and classification via rank aggregation. In Proc. ACM SIGMOD, San Diego, CA, USA, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. H. Jégou, M. Douze, C. Schmid, and P. Pérez. Aggregating local descriptors into a compact image representation. In Proc. IEEE CVPR, San Francisco, CA, USA, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  6. A. Joly and O. Buisson. A posteriori multi-probe locality sensitive hashing. In Proc ACM Multimedia, Vancouver, BC, Canada, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. H. Lejsek, F. H. Ásmundsson, B. T. Jónsson, and L. Amsaleg. NV-tree: An efficient disk-based index for approximate search in very large high-dimensional collections. IEEE TPAMI, 31(5), 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. H. Lejsek, H. Thormódsdóttir, F. H. Ásmundsson, K. Dadason, Á. T. Jóhannsson, B. T. Jónsson, and L. Amsaleg. Videntifier#8482; Forensic: Large-scale video identification in practise. In Proc. MiFor, Florence, Italy, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. T. Liu, A. Moore, A. Gray, and K. Yang. An investigation of practical approximate nearest neighbor algorithms. In Proc. NIPS, Vancouver, BC, Canada, 2004.Google ScholarGoogle Scholar
  10. D. G. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2), 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. A. Ólafsson, B. T. Jónsson, L. Amsaleg, and H. Lejsek. Dynamic behavior of balanced NV-trees. Multimedia Systems, 17(2), 2011.Google ScholarGoogle Scholar
  12. F. A. P. Petitcolas et al. A public automated web-based evaluation service for watermarking schemes: StirMark benchmark. In Proc. of Electronic Imaging, Security and Watermarking of Multimedia Contents III, San Jose, CA, USA, 2001.Google ScholarGoogle Scholar
  13. J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Object retrieval with large vocabularies and fast spatial matching. In Proc. CVPR, Minneapolis, MN, USA, 2007.Google ScholarGoogle ScholarCross RefCross Ref
  14. J. Philbin, O. Chum, M. Isard, J. Sivic, and A. Zisserman. Lost in quantization: Improving particular object retrieval in large scale image databases. In Proc. CVPR, Anchorage, AK, USA, 2008.Google ScholarGoogle ScholarCross RefCross Ref
  15. U. Shaft and R. Ramakrishnan. Theory of nearest neighbors indexability. ACM Transactions on Database Systems, 31(3):814--838, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. J. Sivic and A. Zisserman. Video Google: A text retrieval approach to object matching in videos. In Proc. ICCV, Nice, France, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Z. Wu, Q. Ke, M. Isard, and J. Sun. Bundling features for large scale partial-duplicate web image search. In Proc. CVPR, Miami, FL, USA, 2009.Google ScholarGoogle Scholar

Index Terms

  1. NV-Tree: nearest neighbors at the billion scale

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            ICMR '11: Proceedings of the 1st ACM International Conference on Multimedia Retrieval
            April 2011
            512 pages
            ISBN:9781450303361
            DOI:10.1145/1991996

            Copyright © 2011 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 18 April 2011

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article

            Acceptance Rates

            Overall Acceptance Rate254of830submissions,31%

            Upcoming Conference

            ICMR '24
            International Conference on Multimedia Retrieval
            June 10 - 14, 2024
            Phuket , Thailand

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader