ABSTRACT
The traditional near-duplicate detection systems developed for digital photo management and copyright protection are not applicable for the de-duplication of large-scale web image corpus. In this paper, we present a fast, accurate and highly scalable image fingerprinting technique suited for near-duplicate detection at the web-scale. The image fingerprint is a compact 130 bit representation computed using Fourier-Mellin transform. Near-duplicate images are detected in O(1) time using fingerprint equality and is faster than fast approximate near-neighbor searches like LSH.
- J. Barr, B. Bradley, and B. Hannigan. Using digital watermarks with image signatures to mitigate the treat of the copy attack. In ICASSP, 2003.Google ScholarCross Ref
- D. Casasent and D. Psaltis. Scale invariant optical transform. Optical Engineering, 1976.Google Scholar
- E. Chang, J. Z. Wang, C. Li, and G. Wiederhold. Rime: A replicated image detector for the world wide web. In Proceedings of SPIE Multimedia Storage and Archiving Systems, 1998.Google ScholarCross Ref
- B. Coskun and B. Sankur. Robust video hash extraction. In European Conference on Signal Processing, 2004.Google Scholar
- M. Datar, N. Immorlica, P. Indyk, and V. Mirrokni. Locality-sensitive hashing scheme based on p-stable distributions. In Annual Symposium on Computational Geometry, 2004. Google ScholarDigital Library
- S. Derrode. Efficient near-duplicate detection and sub-image retrieval. In ACM Multimedia, 2004.Google Scholar
- J. J. Foo, J. Zobel, R. Sinha, and S. M. M. Tahaghoghi. Detection of near-duplicate images for web search. In CIVR, 2007. Google ScholarDigital Library
- P. Ghosh, B. Manjunath, and K. Ramakrishnan. A compact image signature for rts-invariant image retrieval. In IEE International Conference on Visual Information Engineering, 2006.Google ScholarCross Ref
- A. Jaimes, S.-F. Chang, and A. C. Loui. Detection of non-identical duplicate consumer photographs. In ACM Multimedia, 2003.Google ScholarCross Ref
- Y. Ke, R. Sukthankar, and L. Huston. An efficient parts-based near-duplicate and sub-image retrieval system. In ACM Multimedia, 2004. Google ScholarDigital Library
- Y. Maret, F. Dufaux, and T. Ebrahimi. Image replica detection based on support vector classifier. Optical Information System III, 2005.Google ScholarCross Ref
- A. Oppenheim and J. Lim. The importance of phase in signals. Proceedings of the IEEE, 69(5):529---541, 1981.Google Scholar
- B. Wang, Z. Li, M. Li, and W. ying Ma. Large-scale duplicate detection for web image search. In ICME, 2006.Google ScholarCross Ref
Index Terms
- Finding near-duplicate images on the web using fingerprints
Recommendations
Finding duplicate images in biology papers
SAC '17: Proceedings of the Symposium on Applied ComputingDuplicated images in biology papers are a possible indicator for plagiarism or data fabrication. A manual detection of such duplicates can be time consuming or even infeasible for huge image collections. In this paper, a semi-automatic duplicate ...
Fast Convex Layers Algorithm for Near-Duplicate Image Detection
This paper builds on a novel, fast algorithm for generating the convex layers on grid points with linear time complexity. Convex layers are extracted from the binary image. The obtained convex hulls are characterized by the number of their vertices and ...
Revisiting Gist-PCA Hashing for Near Duplicate Image Detection
This paper presents a scalable method of near duplicate image detection based on Gist-PCA (principal component analysis) hashing. While most of transform coding methods have been interested in nearest neighbor search with applications to similar image ...
Comments