skip to main content
10.1145/1180639.1180774acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

Image annotation refinement using random walk with restarts

Published:23 October 2006Publication History

ABSTRACT

Image annotation plays an important role in image retrieval and management. However, the results of the state-of-the-art image annotation methods are often unsatisfactory. Therefore, it is necessary to refine the imprecise annotations obtained by existing annotation methods. In this paper, a novel approach to automatically refine the original annotations of images is proposed. On the one hand, for Web images, textual information, e.g. file name and surrounding text, is used to retrieve a set of candidate annotations. On the other hand, for non-Web images that are lack of textual information, a relevance model-based algorithm using visual information is used to decide the candidate annotations. Then, candidate annotations are re-ranked and only the top ones are reserved as the final annotations. To re-rank the annotations, an algorithm using Random Walk with Restarts (RWR) is proposed to leverage both the corpus information and the original confidence information of the annotations. Experimental results on both non-Web images of Corel dataset and Web images of photo forum sites demonstrate the effectiveness of the proposed method.

References

  1. http://images.google.comGoogle ScholarGoogle Scholar
  2. http://www.photosig.comGoogle ScholarGoogle Scholar
  3. Blei, D. M. and Jordan, M. I. Modeling annotated data. In Proc. SIGIR, Toronto, July. 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Chang, E., Kingshy, G., Sychay, G., and Wu, G. CBSA: content-based soft annotation for multimodal image retrieval using Bayes point machines. IEEE Trans. on CSVT, 13(1):26--38, Jan. 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Cusano, C., Ciocca, G., and Schettini, R. Image annotation using SVM. In Proc. Of Internet imaging IV, Vol. SPIE, 2004Google ScholarGoogle Scholar
  6. Duygulu, P. and Barnard, K. Object recognition as machine translation: learning a lexicon for a fixed image vocabulary. In Proc. of ECCV, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Feng, S. L., Manmatha, R., and Lavrenko, V. Multiple bernoulli relevance models for image and video annotation. In Proc. of CVPR, Washington, DC, June, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Jeon, J., Lavrenko, V., and Manmatha, R. Automatic Image Annotation and Retrieval Using Cross-media Relevance Models. In Proc. of SIGIR, Toronto, July 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Jin, Y., Khan, L., Wang, L., and Awad, M. Image Annotations By Combining Multiple Evidence & Wordnet. Proc. of ACM Multimedia, Singapore, 2005 Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Lavrenko, V. and Croft, W. Relevance-based language models. Proc. of SIGIR, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Lavrenko, V., Manmatha, R., and Jeon, J. A Model for Learning the Semantics of Pictures. In Proc. NIPS, 2003.Google ScholarGoogle Scholar
  12. Li, J. and Wang, J. Z. Automatic linguistic indexing of pictures by a statistical modeling approach. IEEE Trans. on PAMI, 25(10), Oct. 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. Miller, G. A. WordNet: A lexical database for English. Communication of ACM, 38, 11 (Nov. 1995), 39--41. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Mori, Y., Takahashi, H., and Oka, R. Image-to-word transformation based on dividing and vector quantizing images with words. In MISRM, 1999.Google ScholarGoogle Scholar
  15. Page, L., Brin, S., Motwani, R., and Winograd, T. The Pagerank Citation Ranking: Bringing Order to the web. technical report, Stanford University, Stanford, CA, 1998.Google ScholarGoogle Scholar
  16. Zhang, L., Chen, L., Jing, F., Deng, K. F., and Ma, W.Y. EnjoyPhoto-A Vertical Image Search Engine for Enjoying High-Quality Photos. In ACM multimedia 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Image annotation refinement using random walk with restarts

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      MM '06: Proceedings of the 14th ACM international conference on Multimedia
      October 2006
      1072 pages
      ISBN:1595934472
      DOI:10.1145/1180639

      Copyright © 2006 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 23 October 2006

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate995of4,171submissions,24%

      Upcoming Conference

      MM '24
      MM '24: The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader