Skip to main content
Log in

Web image retrieval using majority-based ranking approach

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Web image retrieval has characteristics different from typical content-based image retrieval; web images have associated textual cues. However, a web image retrieval system often yields undesirable results, because it uses limited text information such as surrounding text, URLs, and image filenames. In this paper, we propose a new approach to retrieval, which uses the image content of retrieved results without relying on assistance from the user. Our basic hypothesis is that more popular images have a higher probability of being the ones that the user wishes to retrieve. According to this hypothesis, we propose a retrieval approach that is based on a majority of the images under consideration. We define four methods for finding the visual features of majority of images; (1) majority-first method, (2) centroid-of-all method, (3) centroid-of-top K method, and (4) centroid-of-largest-cluster method. In addition, we implement a graph/picture classifier for improving the effectiveness of web image retrieval. We evaluate the retrieval effectiveness of both our methods and conventional ones by using precision and recall graphs. Experimental results show that the proposed methods are more effective than conventional keyword-based retrieval methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Athitsos V, Swain MJ, Frankel C (1997) Distinguishing photographs and graphics on the world wide web. In: Proceedings of IEEE workshop on content-based access of image and video libraries, pp 10-16

  2. Ballard DH, Brown CM (1982) Computer Vision. Prentice-Hall Inc., pp 181-189

  3. Burges CJC (1998) A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery 2(2):121–167

    Article  Google Scholar 

  4. Carson C, Belongie S, Greenspan H., Malik J (2002) Blobworld: Image segmentation using expectation-maximization and its application to image querying. IEEE Trans. Pattern Anal. Mach. Intell. 24(8):1026–1038

    Article  Google Scholar 

  5. Celebi E, Alpkocak A (2000) Clustering of texture features for content-based image retrieval. In: Proceedings of International Conference on Advances in Information Systems(ADVIS2000), pp 216-225

  6. Chen Y, Wang JZ, Krovetz R (2003 November) Content-Based Image Retrieval by Clustering. 5th ACM SIGMM International Workshop on Multimedia Information Retrieval, pp 193-200

  7. Faloutsos C, Barber R, Flickner M, Hafner J, Niblack W, Petkovic D, Equitz W (1994) Efficient and effective querying by image content. Journal of Intelligent Information Systems 3(3/4):231-262

    Article  Google Scholar 

  8. Frakes WB, Baeza-Yates R (1992) Information Retrieval: Data Structures & Algorithms. Prentice Hall, pp 419 - 443

  9. Frankel C, Swain MJ, Athitsos V (1996) WebSeer: An image search engine for the world wide web. University of Chicago Technical Report TR-96-14

  10. Gevers T, Smeulders AWM (1997) Pictoseek: A content-based image search engine for the WWW. In: Proceedings of International Conference On Visual Information Systems, pp 93-100

  11. Haralick RM, Shapiro LG (1992) Computer and Robot Vision. Addison-Wesley publishing company, pp 453-470

  12. Hartmann A, Lienhart R (2002) Automatic classification of images on the web. In: Proceedings of Storage and Retrieval for Media Databases 2002, Vol. SPIE 4676, pp 31–40

  13. Jain AK, Vailaya A (1996) Image retrieval using color and shape. Pattern Recognition 29(8):1233–1244

    Article  Google Scholar 

  14. Lee KS, Park YC, Choi KS (2001) Re-ranking model based on document clusters. Inf. Process. Manag. 37(1):1–14

    Article  Google Scholar 

  15. Li J, Gray RM (2000) Context-based multiscale classification of document images using wavelet coefficient distributions. IEEE Trans. Image Process. 9(9):1604–1616

    Article  Google Scholar 

  16. Ma WY, Manjunath BS (1999) NeTra: A toolbox for navigating large image databases. ACM Multimedia Syst. 7(3):184–198

    Article  Google Scholar 

  17. Mukherjea S, Hirata K, Hara Y (1998) Using clustering and visualization for refining the results of a WWW image search engine. In: Proceedings of Workshop on New Paradigms in Information Visualization and Manipulation (NPIV 1998), pp 29-35

  18. Mukherjea S, Cho J (1999) Automatically determining semantics for world wide web multimedia information retrieval. J. Vis. Lang. Comput. 10(6):585–606

    Article  Google Scholar 

  19. Park G, Baek Y, Lee HK (2002) A ranking algorithm using dynamic clustering for content-based image retrieval. In: Proceeding of the Challenge of Image and Video Retrieval(CIVR2002): International Conference on Image and Video Retrieval, pp 316-324

  20. Parker JR (1997) Algorithms for Image Processing and Computer Vision. John Wiley & Sons, Inc., pp 23–53

  21. Partio M, Cramariuc B, Gabbouj M, Visa A (2002, October 4-7) Rock texture retrieval using gray level co-occurrence matrix. In: Proceeding of the 5th Nordic Signal Processing Symposium (NORSIG 2002), Norway

  22. Sclaroff S, la Cascia M, Sethi S, Taycher L (1999) Unifying textual and visual cues for content-based image retrieval on the world wide web. Comput. Vis. Image Underst. 75(1/2):86-98

    Article  Google Scholar 

  23. Serrano N, Savakis A, Luo J (2002) A computationally efficient approach to indoor/outdoor scene classification. In: Proceedings of International Conference on Pattern Recognition (ICPR’02), volume 4, pp 146-149

  24. Smith JR (1997) Integrated Spatial and Feature Image Systems: Retrieval, Analysis and Compression. Doctoral Dissertations, Columbia University

  25. Stollnitz EJ, DeRose TD, Salesin DH (1995) Wavelets for computer graphics: A primer, part 1. IEEE Comput. Graph. Appl. 15(3):76–84

    Article  Google Scholar 

  26. Swain MJ, Ballard DH (1991) Color indexing. Int. J. Comput. Vis. 7(1):11–32

    Article  Google Scholar 

  27. Szummer M, Picard RW (1998) Indoor-outdoor image classification. In: Proceedings of IEEE International Workshop on Content-Based Access of Image and Video Databases, pp 42-51

  28. Tombros A, Villa R, Van Rijsbergen CJ (2002) The effectiveness of query-specific hierarchic clustering in information retrieval. Inf. Process. Manag. 38(4):559–582

    Article  MATH  Google Scholar 

  29. Vailaya A, Jain AK, Zhang H-J (1998) On image classification: city images vs. landscapes. Pattern Recogn. 31(12):1921–1936

    Article  Google Scholar 

  30. Vapnik VN (1999) The Nature of Statistical Learning Theory. Springer-Verlag, pp 123 – 169

  31. Voorhees EM (1985) The cluster hypothesis revisited. In: Proceedings of 8th ACM SIGIR International Conference on Research and Development in Information Retrieval, pp 188-196

  32. Voorhees EM (1986) Implementation agglomerative hierarchic clustering algorithms for use in document retrieval. Technical Report TR 86-765 of the Department of Computing Science, Cornell University

  33. Wang JZ, Li J, Wiederhold G (2001) SIMPLIcity: Semantics-sensitive Integrated Matching for Picture Libraries. IEEE Trans. Pattern Anal. Machine Intel. 23(9):947–963

    Article  Google Scholar 

  34. Willett P (1998) Recent trends in hierarchic document clustering: A critical review. Inf. Process. Manag. 24(5):577–587

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gunhan Park.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Park, G., Baek, Y. & Lee, HK. Web image retrieval using majority-based ranking approach. Multimed Tools Appl 31, 195–219 (2006). https://doi.org/10.1007/s11042-006-0039-x

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-006-0039-x

Keywords

Navigation