DOI: 10.3115/1072228.1072389

Lexical query paraphrasing for document retrieval

Published: 24 August 2002

ABSTRACT

We describe a mechanism for the generation of lexical paraphrases of queries posed to an Internet resource. These paraphrases are generated using WordNet and part-of-speech information to propose synonyms for the content words in the queries. Statistical information, obtained from a corpus, is then used to rank the paraphrases. We evaluated our mechanism using 404 queries whose answers reside in the LA Times subset of the TREC-9 corpus. There was a 14% improvement in performance when paraphrases were used for document retrieval.
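
The generate-then-rank pipeline outlined in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration: the `SYNONYMS` table stands in for WordNet lookups keyed by word and part-of-speech tag, `CORPUS_COUNTS` stands in for corpus statistics, and the scoring function (a product of smoothed word counts) is a simple placeholder, not the ranking method used in the paper.

```python
from itertools import product

# Hypothetical stand-in for WordNet lookups, keyed by (word, part-of-speech tag).
SYNONYMS = {
    ("buy", "VB"): ["purchase"],
    ("car", "NN"): ["automobile", "auto"],
}

# Hypothetical corpus counts; a real system would collect these from a corpus.
CORPUS_COUNTS = {"buy": 50, "purchase": 20, "car": 80, "automobile": 10, "auto": 30}


def generate_paraphrases(tagged_query):
    """Yield every query formed by keeping or replacing each word with a synonym."""
    options = [[word] + SYNONYMS.get((word, tag), []) for word, tag in tagged_query]
    for combo in product(*options):
        yield list(combo)


def score(paraphrase):
    """Toy score (an assumption): product of add-one-smoothed corpus counts."""
    result = 1
    for word in paraphrase:
        result *= CORPUS_COUNTS.get(word, 0) + 1
    return result


def rank_paraphrases(tagged_query, top_k=3):
    """Return the top_k highest-scoring paraphrases, excluding the original query."""
    original = [word for word, _ in tagged_query]
    candidates = [p for p in generate_paraphrases(tagged_query) if p != original]
    return sorted(candidates, key=score, reverse=True)[:top_k]


query = [("where", "WRB"), ("buy", "VB"), ("car", "NN")]
for paraphrase in rank_paraphrases(query):
    print(" ".join(paraphrase))
```

In this sketch only words with an entry in the synonym table are varied, which mirrors the abstract's restriction to content words; the retrieval step that consumes the ranked paraphrases is left out.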

Published in: COLING '02: Proceedings of the 19th International Conference on Computational Linguistics - Volume 1, August 2002, 1184 pages

Publisher: Association for Computational Linguistics, United States
