ABSTRACT
We describe a mechanism for the generation of lexical paraphrases of queries posed to an Internet resource. These paraphrases are generated using WordNet and part-of-speech information to propose synonyms for the content words in the queries. Statistical information, obtained from a corpus, is then used to rank the paraphrases. We evaluated our mechanism using 404 queries whose answers reside in the LA Times subset of the TREC-9 corpus. There was a 14% improvement in performance when paraphrases were used for document retrieval.
- E. Brill. 1992. A simple rule-based part of speech tagger. In ANLP-92 - Proceedings of the 3rd Conference on Applied Natural Language Processing, pages 152--155, Trento, IT. Google Scholar
- C. Buckley, G. Salton, J. Allan, and A. Singhal. 1995. Automatic query expansion using SMART. In D. Harman, editor, The Third Text REtrieval Conference (TREC3). NIST Special Publication.Google Scholar
- J. Gonzalo, F. Verdejo, I. Chugur, and J. Cigarran. 1998. Indexing with WordNet synsets can improve text retrieval. In Proceedings of the COLING-ACL'98 Workshop on Usage of WordNet in Natural Language Processing Systems, pages 38--44, Montreal, Canada.Google Scholar
- S. Harabagiu, D. Moldovan, M. Pasca, R. Mihalcea, M. Surdeanu, R. Bunescu, R. Girju, V. Rus, and P. Morarescu. 2001. The role of lexico-semantic feedback in open domain textual question-answering. In ACL01 - Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics, pages 274--281, Toulouse, France. Google ScholarDigital Library
- D. Lin. 1998. Automatic retrieval and clustering of similar words. In COLING-ACL'98 - Proceedings of the International Conference on Computational Linguistics and the Annual Meeting of the Association for Computational Linguistics, pages 768--774, Montreal, Canada. Google Scholar
- S. Lytinen, N. Tomuro, and T. Repede. 2000. The use of WordNet sense tagging in FAQfinder. In Proceedings of the AAAI00 Workshop on AI and Web Search, Austin, Texas.Google Scholar
- R. Mihalcea and D. Moldovan. 1999. A method for word sense disambiguation of unrestricted text. In ACL99 -Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, Baltimore, Maryland. Google ScholarDigital Library
- G. Miller, R. Beckwith, C. Fellbaum, D. Gross, and K. Miller. 1990. Introduction to WordNet: An on-line lexical database. Journal of Lexicography, 3(4):235--244.Google ScholarCross Ref
- M. Mitra, A. Singhal, and C. Buckley. 1998. Improving automatic query expansion. In SIGIR'98-Proceedings of the 21th ACM International Conference on Research and Development in Information Retrieval, pages 206--214, Melbourne, Australia. Google ScholarDigital Library
- G. Salton and M. J. McGill. 1983. An Introduction to Modern Information Retrieval. McGraw Hill. Google ScholarDigital Library
- M. Sanderson. 1994. Word sense disambiguation and information retrieval. In SIGIR'94 - Proceedings of the 17th ACM International Conference on Research and Development in Information Retrieval, pages 142--151, Dublin, Ireland. Google ScholarDigital Library
- H. Schütze and J. O. Pedersen. 1995. Information retrieval based on word senses. In Proceedings of the Fourth Annual Symposium on Document Analysis and Information Retrieval, pages 161--175, Las Vegas, Nevada.Google Scholar
- Lexical query paraphrasing for document retrieval
Recommendations
Lexical paraphrasing for document retrieval and node identification
PARAPHRASE '03: Proceedings of the second international workshop on Paraphrasing - Volume 16We investigate lexical paraphrasing in the context of two distinct applications: document retrieval and node identification. Document retrieval --- the first step in question answering --- retrieves documents that contain answers to user queries. Node ...
Experiments in Query Paraphrasing for Information Retrieval
AI '02: Proceedings of the 15th Australian Joint Conference on Artificial Intelligence: Advances in Artificial IntelligenceWe investigate the effect of paraphrase generation on document retrieval performance. Specifically, we describe experiments where three information sources are used to generate lexical paraphrases of queries posed to the Internet. These information ...
Web mining for lexical context-specific paraphrasing
AIRS'06: Proceedings of the Third Asia conference on Information Retrieval TechnologyIn most applications of paraphrasing, contextual information should be considered since a word may have different paraphrases in different contexts. This paper presents a method that automatically acquires lexical context-specific paraphrases from the ...
Comments