skip to main content
10.1145/1027527.1027664acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

Multimodal concept-dependent active learning for image retrieval

Authors Info & Claims
Published:10 October 2004Publication History

ABSTRACT

It has been established that active learning is effective for learning complex, subjective query concepts for image retrieval. However, active learning has been applied in a concept independent way, (i.e., the kernel-parameters and the sampling strategy are identically chosen) for learning query concepts of differing <i>complexity</i>. In this work, we first characterize a concept's complexity using three measures: <i>hit-rate</i>, <i>isolation</i> and <i>diversity</i>. We then propose a multimodal learning approach that uses images' semantic labels to guide a <i>concept-dependent</i>, <i>active-learning</i> process. Based on the complexity of a concept, we make intelligent adjustments to the sampling strategy and the sampling pool from which images are to be selected and labeled, to improve concept learnability. Our empirical study on a $300$K-image dataset shows that concept-dependent learning is highly effective for image-retrieval accuracy.

References

  1. R. Agrawal, T. Imielinski, and A. Swami. Mining association rules between sets of items in large databases. Proceedings of ACM SIGMOD, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. K. Barnard, P. Duygulu, and D. Forsyth. Exploiting text and image feature co-occurrence statistics in large datasets. Trends and Advances in Content-Based Image and Video Retrieval (To Appear), 2004.Google ScholarGoogle Scholar
  3. K. Barnard and D. Forsyth. Learning the semantics of words and pictures. In International Conference on Computer Vision, volume~2, pages 408--415, 2000.Google ScholarGoogle Scholar
  4. A. B. Benitez and S.-F. Chang. Image classification using multimedia knowledge networks. Proc. of the Int. Conf. on Image Processing, September 2003.Google ScholarGoogle ScholarCross RefCross Ref
  5. A. B. Benitez, J. R. Smith, and S.-F. Chang. Medianet: A multimedia information network for knowledge representation. Proceeding of the SPIE Conference on Internet Multimedia Management Systems, November 2000.Google ScholarGoogle ScholarCross RefCross Ref
  6. K. Brinker. Incorporating diversity in active learning with support vector machines. Proceedings of the Twentieth International Conference on Machine Learning, pages 59--66, August 2003.Google ScholarGoogle Scholar
  7. E. Chang and B. Li. Mega --- the maximizing expected generalization algorithm for learning complex query concepts. ACM Transaction on Information Systems, December 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. E. Y. Chang, K. Goh, and W.-C. Lai. On scalability of active learning for formulating query concepts. Workshop on Computer Vision Meets Databases (CVDB) in conjunction with ACM SIGMOD, June 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. R. Duda, P. Hart, and D. G. Stork. Pattern Classification. Wiley, New York, 2 edition, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. A. K. Jain, R. P. Duin, and J. Mao. Statistical pattern recognition: A review. IEEE Trans. on Pattern Analysis and Machine Intelligence, 22(1):4--37, January 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. J. Jeon, V. Lavrenko, and R. Manmatha. Automatic image annotation and retrieval using cross-media relevance models. Proceedings of the 26th Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, August 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. C. Li, E. Chang, H. Garcia-Molina, and G. Wilderhold. Clindex: Approximate similarity queries in high-dimensional spaces. IEEE Transactions on Knowledge and Data Engineering (TKDE), 14(4), July 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. J. Li and J. Z. Wang. Automatic linguistic indexing of pictures by a statistical modeling approach. IEEE Transactions on Pattern Analysis and Machine Intelligence, 25(20), 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Y. Lu, C. Hu, X. Zhu, H. Zhang, and Q. Yang. A unified framework for semantics and feature based relevance feedback in image retrieval systems. In Proc. of ACM Int. Conf. on Multimedia, pages 31--37, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Y. Mori, H. Takahashi, and R. Oka. Automatic words assignment to images based on image division and vector quantization. In Proc. of RIAO 2000: Content-Based Multimedia Information Access, Apr. 2000.Google ScholarGoogle Scholar
  16. S. Tong and E. Chang. Support vector machine active learning for image retrieval. Proc. of ACM Int. Conf. on Multimedia, pages 107--118, October 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. T. Westerveld. Image retrieval: Content versus context. Content-Based Multimedia Information Access, RIAO, pages 276--284, 2000.Google ScholarGoogle Scholar
  18. H. Zhang, Z. Chen, M. Li, and Z. Su. Relevance feedback and learning in content-based image search. WWW: Internet and Web Information Systems, 6(2):131--155, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. X. S. Zhou and T. S. Huang. Unifying keywords and visual contents in image retrieval. IEEE Multimedia, 9(2):23--33, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. L. Zhu, A. Rao, and A. Zhang. Theory of keyblock-based image retreival. ACM Trans. on Information Systems, 20(2):224--257, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Multimodal concept-dependent active learning for image retrieval

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      MULTIMEDIA '04: Proceedings of the 12th annual ACM international conference on Multimedia
      October 2004
      1028 pages
      ISBN:1581138938
      DOI:10.1145/1027527

      Copyright © 2004 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 10 October 2004

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Acceptance Rates

      Overall Acceptance Rate995of4,171submissions,24%

      Upcoming Conference

      MM '24
      MM '24: The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader