Skip to main content
Log in

Optimized learning instance-based image retrieval

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Image retrieval is a recognition technique in the field of computer vision. In most cases, high-quality retrieval is often supported by adequate learning instances. However, in the process of learning instance selection, some useless, repeated, invalid, and even mistaken learning instances are often selected. Low-quality instances not only add to the computing burden but also decrease the retrieval quality. In this study, we propose a learning instance optimization method. Initially, we classify the images into scene and object images by using the K-means clustering model. We use different methods to handle these two groups of images. For scene images, we use the Euclidean distance of the GIST descriptor to select the optimized learning instances. For object images, we use the improved spatial pyramid matching and optimal instance distance methods to select the optimized learning instances. Finally, we implement experiments using one large image database to check the effectiveness of our proposed algorithm. Results show that our method can not only improve retrieval quality but also decrease the number of learning instances.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

References

  1. Barrow HG, Tenenbaum JM, Bolles RC, Wolf HC (1977) Parametric correspondence and chamfer matching: two new techniques for image matching. tech. rep., DTIC Document

  2. Bart E, Porteous I, Perona P, Welling M (2008) Unsupervised learning of visual taxonomies. In: Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on IEEE, pp 1–8

  3. Bosch A, Zisserman A, Munoz X (2007) Representing shape with a spatial pyramid kernel. In Proceedings of the 6th ACM International Conference on Image and Video Retrieval, ACM, pp 401–408

  4. Boser BE, Guyon IM, Vapnik VN (1992) A training algorithm for optimal margin classifiers. In: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, ACM, pp 144–152

  5. Carson C, Thomas M, Belongie S, Hellerstein JM, Malik J (1999) Blobworld: a system for region-based image indexing and retrieval. In: International Conference on Advances in Visual Information Systems. Springer, pp 509–517

  6. Chong W, Blei D, Li F-F (2009) Simultaneous image classification and annotation. In Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on IEEE, pp 1903–1910

  7. Comaniciu D, Ramesh V, Meer P (2000) Real-time tracking of non-rigid objects using mean shift. In: Computer Vision and Pattern Recognition, 2000. Proceedings. IEEE Conference on, vol 2, IEEE, pp 142–149

  8. Csurka G, Dance C, Fan L, Willamowski J, Bray C (2004) Visual categorization with bags of keypoints. In: Workshop on statistical learning in computer vision, ECCV, vol 1. Prague, pp 1–2

  9. Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L (2009) Imagenet: a large-scale hierarchical image database. In: Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on, IEEE, pp 248–255

  10. Fei-Fei L, Fergus R, Perona P (2006) One-shot learning of object categories. IEEE Trans Pattern Anal Mach Intell 28(4):594–611

    Article  Google Scholar 

  11. Felzenszwalb P, McAllester D, Ramanan D (2008) A discriminatively trained, multiscale, deformable part model. In Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on IEEE, pp 1–8

  12. Fergus R, Bernal H, Weiss Y, Torralba A (2010) Semantic label sharing for learning with many categories. In: European Conference on Computer Vision. Springer, pp 762–775

  13. Fergus R, Weiss Y, Torralba A (2009) Semi-supervised learning in gigantic image collections. In: Advances in neural information processing systems, pp 522–530

  14. Griffin G, Holub A, Perona P (2007) Caltech-256 object category dataset

  15. Hays J, Efros AA (2007) Scene completion using millions of photographs. In ACM Transactions on Graphics (TOG), vol 26, ACM, p 4

  16. He K, Zhang X, Ren S, Sun J (2015) Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In: Proceedings of the IEEE International Conference on Computer Vision, pp 1026–1034

  17. Ioffe S, Szegedy C (2015) Batch normalization: accelerating deep network training by reducing internal covariate shift. arXiv preprint arXiv:1502.03167

  18. Jaakkola TS, Haussler D (1998) Exploiting generative models in discriminative classifiers. Adv Neural Inf Proces Syst 11(11):487–493

    Google Scholar 

  19. Jain AK (2010) Data clustering: 50 years beyond k-means. Pattern Recogn Lett 31(8):651–666

    Article  Google Scholar 

  20. Kanimozhi T, Latha K (2015) An integrated approach to region based image retrieval using firefly algorithm and support vector machine. Neurocomputing 151:1099–1111

    Article  Google Scholar 

  21. Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp 1097–1105

  22. Kumar N, Berg AC, Belhumeur PN, Nayar SK (2009) Attribute and simile classifiers for face verification. In 2009 I.E. 12th International Conference on Computer Vision, IEEE, pp 365–372

  23. Lampert CH, Nickisch H, Harmeling S (2009) Learning to detect unseen object classes by between-class attribute transfer. In: Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on, IEEE, pp 951–958

  24. Lazebnik S, Schmid C, Ponce J (2006) Beyond bags of features: Spatial pyramid matching for recognizing natural scene categories. In: 2006 I.E. Computer Society Conference on Computer Vision and Pattern Recognition (CVPR’06), vol 2. IEEE, pp 2169–2178

  25. Lin Y, Lv F, Zhu S, Yang M, Cour T, Yu K, Cao L, Huang T (2011) Large-scale image classification: fast feature extraction and svm training. In: Computer Vision and Pattern Recognition (CVPR), 2011 I.E. Conference on, IEEE, pp 1689–1696

  26. Lowe DG (2004) Distinctive image features from scale-invariant keypoints. Int J Comput Vis 60(2):91–110

    Article  Google Scholar 

  27. Maji S, Berg AC (2009) Max-margin additive classifiers for detection. In: 2009 I.E. 12th International Conference on Computer Vision, IEEE, pp 40–47

  28. Master S (2014) Large scale object detection. Czech Technical University

  29. Palatucci M, Pomerleau D, Hinton GE, Mitchell TM (2009) Zero-shot learning with semantic output codes. In: Advances in neural information processing systems, pp 1410–1418

  30. Parikh D, Grauman K (2011) Relative attributes. In: 2011 International Conference on Computer Vision, IEEE, pp 503–510

  31. Perronnin F, Sánchez J, Mensink T (2010) Improving the fisher kernel for large-scale image classification. In: European Conference on Computer Vision. Springer, pp 143–156

  32. Rubner Y, Tomasi C, Guibas LJ (2000) The earth mover’s distance as a metric for image retrieval. Int J Comput Vis 40(2):99–121

    Article  MATH  Google Scholar 

  33. Russell BC, Torralba A, Murphy KP, Freeman WT (2008) Labelme: a database and web-based tool for image annotation. Int J Comput Vis 77(1–3):157–173

    Article  Google Scholar 

  34. Swain MJ, Ballard DH (1992) Indexing via color histograms. In: Active Perception and Robot Vision. Springer, pp 261–273

  35. Toyama K, Blake A (2001) Probabilistic tracking in a metric space. In: Computer Vision, 2001. ICCV 2001. Proceedings. Eighth IEEE International Conference on, vol 2, IEEE, pp 50–57

  36. Ulges A, Schulze C, Keysers D, Breuel T (2008) Identifying relevant frames in weakly labeled videos for training concept detectors. In: Proceedings of the 2008 international conference on Content-based image and video retrieval, ACM, pp 9–16

  37. Van de Sande KEA, Gevers T, Snoek CGM (2008) Evaluation of color descriptors for object and scene recognition. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition: CVPR 2008, pp 1–8

  38. Vedaldi A, Zisserman A (2011) Image classification practical. http://www.robots.ox.ac.uk/vgg/share/practical-image-classification.htm

  39. Wu H, Miao Z, Chen J, Yang J, Gao X (2015) Recognition improvement through the optimisation of learning instances. IET Comput Vis 9(3):419–427

    Article  Google Scholar 

  40. Wu H, Miao Z, Wang Y, Lin M (2015) Optimized recognition with few instances based on semantic distance. Vis Comput 31(4):367–375

    Article  Google Scholar 

  41. Wu R, Yan S, Shan Y, Dang Q, Sun G (2015) Deep image: scaling up image recognition. arXiv preprint arXiv:1501.02876, 7(8)

  42. Ye Z, Chen X, Li Z (2010) Video based mobile location search with large set of sift points in cloud. In: Proceedings of the 2010 ACM multimedia workshop on Mobile cloud media computing, ACM, pp 25–30

  43. Zha Z-J, Hua X-S, Mei T, Wang J, Qi G-J, Wang Z (2008) Joint multi-label multi-instance learning for image classification. In: Computer Vision and Pattern Recognition, 2008. CVPR 2008. IEEE Conference on IEEE, pp 1–8

  44. Zhang X, Liu W, Dundar M, Badve S, Zhang S (2015) Towards large-scale histopathological image analysis: hashing-based image retrieval. IEEE Trans Med Imaging 34(2):496–506

    Article  Google Scholar 

  45. Wu H, Miao Z, Wang Y, Chen J, Ma C, Zhou T (2015) Image completion with multi-image based on entropy reduction. Neurocomputing 159:157–171

Download references

Acknowledgements

This research is sponsored by National Natural Science Foundation of China (Nos.61601033,61371185, 61401029, 61571049), China Postdoctoral Science Foundation(212400201, 2016M591109), the Fundamental Research Funds for the Central Universities (Nos. 2014KJJCB32, 2013NT57, 2012LYB46), Research Funds (15ZR003) and by SRF for ROCS, SEM, NSFC61273274, 61370127 and 61201158,NSFB4123104, FRFCU 2014JBZ004, Z131110001913143.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hao Wu.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, Y., Bie, R., Zhang, C. et al. Optimized learning instance-based image retrieval. Multimed Tools Appl 76, 16749–16766 (2017). https://doi.org/10.1007/s11042-016-3950-9

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-016-3950-9

Keywords

Navigation