skip to main content
10.1145/860435.860460acmconferencesArticle/Chapter ViewAbstractPublication PagesirConference Proceedingsconference-collections
Article

Modeling annotated data

Published:28 July 2003Publication History

ABSTRACT

We consider the problem of modeling annotated data---data with multiple types where the instance of one type (such as a caption) serves as a description of the other type (such as an image). We describe three hierarchical probabilistic mixture models which aim to describe such data, culminating in correspondence latent Dirichlet allocation, a latent variable model that is effective at modeling the joint distribution of both types and the conditional distribution of the annotation given the primary type. We conduct experiments on the Corel database of images and captions, assessing performance in terms of held-out likelihood, automatic annotation, and text-based image retrieval.

References

  1. H. Attias. A variational Bayesian framework for graphical models. In Advances in Neural Information Processing Systems 12, 2000.Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. K. Barnard, P. Duygulu, N. de Freitas, D. Forsyth, D. Blei, and M. Jordan. Matching words and pictures. Journal of Machine Learning Research, 3:1107--1135, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. D. Blei, A. Ng, and M. Jordan. Latent Dirichlet allocation. Journal of Machine Learning Research, 3:993--1022, January 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. D. Cohn and T. Hofmann. The missing link-A probabilistic model of document content and hypertext connectivity. In Advances in Neural Information Processing Systems 13, 2001.Google ScholarGoogle Scholar
  5. A. Goodrum. Image information retrieval: An overview of current research. Informing Science, 3(2):63--67, 2000.Google ScholarGoogle ScholarCross RefCross Ref
  6. J. Jeon, V. Lavrenko, and R. Manmatha. Automatic image annotation and retrieval using cross-media relevance models. In ACM SIGIR 2003, July 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. M. Jordan, Z. Ghahramani, T. Jaakkola, and L. Saul. Introduction to variational methods for graphical models. Machine Learning, 37:183--233, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. C. Meghini, F. Sebastiani, and U. Straccia. A model of multimedia information retrieval. Journal of the ACM (JACM), 48(5):909--970, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. C. Morris. Parametric empirical Bayes inference: Theory and applications. Journal of the American Statistical Association, 78(381):47--65, 1983.Google ScholarGoogle ScholarCross RefCross Ref
  10. M. Naphade and T. Huang. A probabilistic framework for semantic video indexing, filtering and retrieval. IEEE Transactions on Multimedia, 3(1):141--151, March 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. J. Ponte and B. Croft. A language modeling approach to information retrieval. In ACM SIGIR 1998, pages 275--281. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. J. Shi and J. Malik. Normalized cuts and image segmentation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 22(9):888--905, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. B. Taskar, E. Segal, and D. Koller. Probabilistic clustering in relational data. In IJCAI-01, pages 870--876, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Modeling annotated data

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        SIGIR '03: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
        July 2003
        490 pages
        ISBN:1581136463
        DOI:10.1145/860435

        Copyright © 2003 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 28 July 2003

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • Article

        Acceptance Rates

        SIGIR '03 Paper Acceptance Rate46of266submissions,17%Overall Acceptance Rate792of3,983submissions,20%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader