ABSTRACT
A current application of automatic text summarization is to provide an overview of relevant documents coming from an information retrieval (IR) system. This paper examines how Centrifuser, one such summarization system, was designed with respect to methods used in the library community. We have reviewed these librarian expert techniques to assist information seekers and codified them into eight distinct strategies. We detail how we have operationalized six of these strategies in Centrifuser by computing an informative extract, indicative differences between documents, as well as navigational links to narrow or broaden a user's query. We conclude the paper with results from a preliminary evaluation.
- E. Amitay. What Lays in the Layout: Using anchor-paragraph arrangements to extract descriptions of Web documents. PhD thesis, Macquarie University, Sydney, Australia, UnpublishedGoogle Scholar
- ANSI. American national standard for describing books in advertisements, catalogs, promotional materials and book jackets. New York, USA, 1979Google Scholar
- R. Balay, editor. Guide to Reference Books. American Library Association, Chicago, USA, 11th edition edition, 1996Google Scholar
- M. J. Bates. Information search tactics. Journal of the American Society for Information Science, 30(7):205--214, 79Google Scholar
- C. Borgman. Why are online catalogs hard to use? lessons learned from information-retrieval studies. Journal of the American Society for Information Science, 37(6):387--400, 1986Google ScholarCross Ref
- M. Chamlers and P. Chitson. BEAD: Exploration in information visualization. In Proc. of 15th SIGIR '92, 1992 Google ScholarDigital Library
- J. Ding, L. Gravano, and N. Shivakumar. Computing geographic scopes of web resources. In Proc. of the 26th Intl. Conf. on Very Large Data Bases, 2000 Google ScholarDigital Library
- R. Fidel. Writing abstracts for free-text searching. Journal of Documentation, 42(1):11--21, March 1986Google ScholarCross Ref
- V. Hatzivassiliglou, J. L. Klavans, M. L. Holcombe, R. Barzilay, M.-Y. Kan, and K. R. McKeown. Simfinder: A flexible clustering tool for summarization. In Proc. of the Workshop on Automatic Summarization, NAACL 2001, 2001Google Scholar
- M. A. Hearst. Tilebars: Visualization of term distribution information in full text information access. In Proc. of CHI 1995, 1995 Google ScholarDigital Library
- E. Z. Jennerich and E. J. Jennerich. The Reference Interview as a Creative Art. Libraries Unlimited, Littleton, Colorado, 1987Google Scholar
- M.-Y. Kan, K. McKeown, and J. Klavans. Domain-specific informative and indicative summarization for information retrieval. In Proc. of the Document Understanding Conference (DUC), pages 19--26, New Orleans, USA, 2001Google Scholar
- W. A. Katz, editor. Introduction to Reference Work. McGraw-Hill, New York, USA, fifth edition edition, 1987Google Scholar
- J. Koenemann and N. Belkin. A case for interaction: a study of interactive information retrieval behavior and effectiveness. In Proceedings of CHI '96, pages 205--212, Vancouver, Canada, May 1996. ACM Press Google ScholarDigital Library
- Library of Congress. Marc 21 format for classification data : including guidelines for content designation. Washington, D.C., USA, 2000. ISN 0660179903Google Scholar
- E. Liddy. The discourse-level structure of empirical abstracts: An exploratory study. Information Processing and Management, 27(1):55--81, 1991 Google ScholarDigital Library
- S. Lok and S. K. Feiner. The AIL automated interface layout system. In Proc. of Intelligent User Interface, 2002 Google ScholarDigital Library
- I. Mani and E. Bloedorn. Summarizing similarities and differences among related documents. Information Retrieval, 1(1-2):35--67, 1999 Google ScholarDigital Library
- K. McKeown, S.-F. Chang, J. Cimino, S. Feiner, C. Friedman, L. Gravano, V. Hatzivassiloglou, S. Johnson, D. Jordan, J. Klavans, A. Kushniruk, V. Patel, and S. Teufel. PERSIVAL, a system for personalized search and summarization over multimedia healtcare information. In Proc. of the 1st JCDL, Roanoke, USA, 2001 Google ScholarDigital Library
- E. Morse, M. Lewis, and K. A. Olsen. Testing visual information retrieval methodologies case study: Comparative analysis of textual, icon, graphical and "spring" displays. Journal of the American Society for Information Science and Technology, 53(1):28--40, 2002 Google ScholarDigital Library
- B. Nicolas J. Interaction with texts: Information retrieval as information-seeking behavior. Information Retrieval, pages 55--66, 1993Google Scholar
- ODP. Open Directory Project guidelines. http://dmoz.org/guidelines.html, Last accessed November 2000Google Scholar
- C. D. Paice. Constructing literature abstracts by computer: techniques and prospects. Information Processing and Management, 26(1):171--186, 1990 Google ScholarDigital Library
- D. H. Sonnenwald. Developing a theory to guide the process of designing information retrieval systems. In Proc. of the 15th SIGIR, 1992 Google ScholarDigital Library
- C. Witcombe. Art history resources on the web. http://witcombe.sbc.edu/ARTHLinks.html, Last accessed January 2002Google Scholar
Index Terms
- Using librarian techniques in automatic text summarization for information retrieval
Recommendations
Fuzzy Logic based Hybrid Model for Automatic Extractive Text Summarization
ICIIT '20: Proceedings of the 2020 5th International Conference on Intelligent Information TechnologyIn the contemporary age of information, accessing data becomes easy, but finding knowledge is very difficult. The participation & publishing of information has consequently escalated the suffering of 'Information Glut.' Assisting users' informational ...
Automatic Text Summarization Methods: A Comprehensive Review
AbstractText summarization is the process of condensing a long text into a shorter version by maintaining the key information and its meaning. Automatic text summarization can save time and helps in selecting the important and relevant sentences from the ...
Automatic text summarization based on latent semantic indexing
Automatic summarization is a topic of common concern in computational linguistics and information science, since a computer system of text summarization is considered to be an effective means of processing information resources. A method of text ...
Comments