skip to main content
10.3115/1220575.1220690dlproceedingsArticle/Chapter ViewAbstractPublication PageshltConference Proceedingsconference-collections
Article
Free Access

Using random walks for question-focused sentence retrieval

Published:06 October 2005Publication History

ABSTRACT

We consider the problem of question-focused sentence retrieval from complex news articles describing multi-event stories published over time. Annotators generated a list of questions central to understanding each story in our corpus. Because of the dynamic nature of the stories, many questions are time-sensitive (e.g. "How many victims have been found?") Judges found sentences providing an answer to each question. To address the sentence retrieval problem, we apply a stochastic, graph-based method for comparing the relative importance of the textual units, which was previously used successfully for generic summarization. Currently, we present a topic-sensitive version of our method and hypothesize that it can outperform a competitive baseline, which compares the similarity of each sentence to the input question via IDF-weighted word overlap. In our experiments, the method achieves a TRDR score that is significantly higher than that of the baseline.

References

  1. James Allan, Courtney Wade, and Alvaro Bolivar. 2003. Retrieval and novelty detection at the sentence level. In SIGIR '03: Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval, pages 314--321. ACM Press. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Enrique Amigo, Julio Gonzalo, Victor Peinado, Anselmo Peñas, and Felisa Verdejo. 2004. An Empirical Study of Information Synthesis Task. In Proceedings of the 42nd Meeting of the Association for Computational Linguistics (ACL'04), Main Volume, pages 207--214, Barcelona, Spain, July. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Sergey Brin and Lawrence Page. 1998. The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems, 30(1--7): 107--117. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. Jean Carletta. 1996. Assessing Agreement on Classification Tasks: The Kappa Statistic. CL, 22(2):249--254. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Gunes Erkan and Dragomir Radev. 2004. LexRank: Graph-based Lexical Centrality as Salience in Text. JAIR, 22:457--479. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Robert Gaizauskas, Mark Hepple, and Mark Greenwood. 2004. Information Retrieval for Question Answering: a SIGIR 2004 Workshop. In SIGIR 2004 Workshop on Information Retrieval for Question Answering. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Oren Kurland and Lillian Lee. 2005. PageRank without hyperlinks: Structural re-ranking using links induced by language models. In SIGIR 2005, Salvador, Brazil, August. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. L. Page, S. Brin, R. Motwani, and T. Winograd. 1998. The pagerank citation ranking: Bringing order to the web. Technical report, Stanford University, Stanford, CA.Google ScholarGoogle Scholar
  9. Bo Pang and Lillian Lee. 2004. A Sentimental Education: Sentiment Analysis Using Subjectivity Summarization Based on Minimum Cuts. In Association for Computational Linguistics. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Dragomir Radev, Weiguo Fan, Hong Qi, Harris Wu, and Amardeep Grewal. 2005. Probabilistic Question Answering on the Web. Journal of the American Society for Information Science and Technology, 56(3), March. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Stephen E. Robertson, Steve Walker, Micheline Hancock-Beaulieu, Aarron Gull, and Marianna Lau. 1992. Okapi at TREC. In Text REtrieval Conference, pages 21--30.Google ScholarGoogle Scholar
  12. G. Salton, J. Allan, and C. Buckley. 1993. Approaches to Passage REtrieval in Full Text Information Systems. In Proceedings of the 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 49--58. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. E. Seneta. 1981. Non-negative matrices and markov chains. Springer-Verlag, New York.Google ScholarGoogle Scholar
  14. Ellen Voorhees and Dawn Tice. 2000. The TREC-8 Question Answering Track Evaluation. In Text Retrieval Conference TREC-8, Gaithersburg, MD.Google ScholarGoogle ScholarCross RefCross Ref
  15. Harris Wu, Dragomir R. Radev, and Weiguo Fan. 2004. Towards Answer-focused Summarization Using Search Engines. New Directions in Question Answering.Google ScholarGoogle Scholar
  1. Using random walks for question-focused sentence retrieval

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image DL Hosted proceedings
        HLT '05: Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
        October 2005
        1054 pages

        Publisher

        Association for Computational Linguistics

        United States

        Publication History

        • Published: 6 October 2005

        Qualifiers

        • Article

        Acceptance Rates

        HLT '05 Paper Acceptance Rate127of402submissions,32%Overall Acceptance Rate240of768submissions,31%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader