Abstract
Web image retrieval has characteristics different from typical content-based image retrieval; web images have associated textual cues. However, a web image retrieval system often yields undesirable results, because it uses limited text information such as surrounding text, URLs, and image filenames. In this paper, we propose a new approach to retrieval, which uses the image content of retrieved results without relying on assistance from the user. Our basic hypothesis is that more popular images have a higher probability of being the ones that the user wishes to retrieve. According to this hypothesis, we propose a retrieval approach that is based on a majority of the images under consideration. We define four methods for finding the visual features of majority of images; (1) majority-first method, (2) centroid-of-all method, (3) centroid-of-top K method, and (4) centroid-of-largest-cluster method. In addition, we implement a graph/picture classifier for improving the effectiveness of web image retrieval. We evaluate the retrieval effectiveness of both our methods and conventional ones by using precision and recall graphs. Experimental results show that the proposed methods are more effective than conventional keyword-based retrieval methods.
Similar content being viewed by others
References
Athitsos V, Swain MJ, Frankel C (1997) Distinguishing photographs and graphics on the world wide web. In: Proceedings of IEEE workshop on content-based access of image and video libraries, pp 10-16
Ballard DH, Brown CM (1982) Computer Vision. Prentice-Hall Inc., pp 181-189
Burges CJC (1998) A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery 2(2):121–167
Carson C, Belongie S, Greenspan H., Malik J (2002) Blobworld: Image segmentation using expectation-maximization and its application to image querying. IEEE Trans. Pattern Anal. Mach. Intell. 24(8):1026–1038
Celebi E, Alpkocak A (2000) Clustering of texture features for content-based image retrieval. In: Proceedings of International Conference on Advances in Information Systems(ADVIS2000), pp 216-225
Chen Y, Wang JZ, Krovetz R (2003 November) Content-Based Image Retrieval by Clustering. 5th ACM SIGMM International Workshop on Multimedia Information Retrieval, pp 193-200
Faloutsos C, Barber R, Flickner M, Hafner J, Niblack W, Petkovic D, Equitz W (1994) Efficient and effective querying by image content. Journal of Intelligent Information Systems 3(3/4):231-262
Frakes WB, Baeza-Yates R (1992) Information Retrieval: Data Structures & Algorithms. Prentice Hall, pp 419 - 443
Frankel C, Swain MJ, Athitsos V (1996) WebSeer: An image search engine for the world wide web. University of Chicago Technical Report TR-96-14
Gevers T, Smeulders AWM (1997) Pictoseek: A content-based image search engine for the WWW. In: Proceedings of International Conference On Visual Information Systems, pp 93-100
Haralick RM, Shapiro LG (1992) Computer and Robot Vision. Addison-Wesley publishing company, pp 453-470
Hartmann A, Lienhart R (2002) Automatic classification of images on the web. In: Proceedings of Storage and Retrieval for Media Databases 2002, Vol. SPIE 4676, pp 31–40
Jain AK, Vailaya A (1996) Image retrieval using color and shape. Pattern Recognition 29(8):1233–1244
Lee KS, Park YC, Choi KS (2001) Re-ranking model based on document clusters. Inf. Process. Manag. 37(1):1–14
Li J, Gray RM (2000) Context-based multiscale classification of document images using wavelet coefficient distributions. IEEE Trans. Image Process. 9(9):1604–1616
Ma WY, Manjunath BS (1999) NeTra: A toolbox for navigating large image databases. ACM Multimedia Syst. 7(3):184–198
Mukherjea S, Hirata K, Hara Y (1998) Using clustering and visualization for refining the results of a WWW image search engine. In: Proceedings of Workshop on New Paradigms in Information Visualization and Manipulation (NPIV 1998), pp 29-35
Mukherjea S, Cho J (1999) Automatically determining semantics for world wide web multimedia information retrieval. J. Vis. Lang. Comput. 10(6):585–606
Park G, Baek Y, Lee HK (2002) A ranking algorithm using dynamic clustering for content-based image retrieval. In: Proceeding of the Challenge of Image and Video Retrieval(CIVR2002): International Conference on Image and Video Retrieval, pp 316-324
Parker JR (1997) Algorithms for Image Processing and Computer Vision. John Wiley & Sons, Inc., pp 23–53
Partio M, Cramariuc B, Gabbouj M, Visa A (2002, October 4-7) Rock texture retrieval using gray level co-occurrence matrix. In: Proceeding of the 5th Nordic Signal Processing Symposium (NORSIG 2002), Norway
Sclaroff S, la Cascia M, Sethi S, Taycher L (1999) Unifying textual and visual cues for content-based image retrieval on the world wide web. Comput. Vis. Image Underst. 75(1/2):86-98
Serrano N, Savakis A, Luo J (2002) A computationally efficient approach to indoor/outdoor scene classification. In: Proceedings of International Conference on Pattern Recognition (ICPR’02), volume 4, pp 146-149
Smith JR (1997) Integrated Spatial and Feature Image Systems: Retrieval, Analysis and Compression. Doctoral Dissertations, Columbia University
Stollnitz EJ, DeRose TD, Salesin DH (1995) Wavelets for computer graphics: A primer, part 1. IEEE Comput. Graph. Appl. 15(3):76–84
Swain MJ, Ballard DH (1991) Color indexing. Int. J. Comput. Vis. 7(1):11–32
Szummer M, Picard RW (1998) Indoor-outdoor image classification. In: Proceedings of IEEE International Workshop on Content-Based Access of Image and Video Databases, pp 42-51
Tombros A, Villa R, Van Rijsbergen CJ (2002) The effectiveness of query-specific hierarchic clustering in information retrieval. Inf. Process. Manag. 38(4):559–582
Vailaya A, Jain AK, Zhang H-J (1998) On image classification: city images vs. landscapes. Pattern Recogn. 31(12):1921–1936
Vapnik VN (1999) The Nature of Statistical Learning Theory. Springer-Verlag, pp 123 – 169
Voorhees EM (1985) The cluster hypothesis revisited. In: Proceedings of 8th ACM SIGIR International Conference on Research and Development in Information Retrieval, pp 188-196
Voorhees EM (1986) Implementation agglomerative hierarchic clustering algorithms for use in document retrieval. Technical Report TR 86-765 of the Department of Computing Science, Cornell University
Wang JZ, Li J, Wiederhold G (2001) SIMPLIcity: Semantics-sensitive Integrated Matching for Picture Libraries. IEEE Trans. Pattern Anal. Machine Intel. 23(9):947–963
Willett P (1998) Recent trends in hierarchic document clustering: A critical review. Inf. Process. Manag. 24(5):577–587
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Park, G., Baek, Y. & Lee, HK. Web image retrieval using majority-based ranking approach. Multimed Tools Appl 31, 195–219 (2006). https://doi.org/10.1007/s11042-006-0039-x
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-006-0039-x