Abstract
In preparation for the widespread use of automatic scanners which will read documents and transmit their contents to other machines for analysis, this report presents a new concept in automatic analysis: the relative-frequency approach to measuring the significance of words, word groups, and sentences. The relative-frequency approach is discussed in detail, as is its application to problems of automatic indexing and automatic abstracting. Included in the report is a summary of automatic analysis studies published as of the date of writing. Conclusions are drawn that point toward more sophisticated mathematical and linguistic techniques for the solution of problems of automatic analysis.
- 1 BAXENDALE, P. B. Machine-made index for technical liter:qture-an experiment. IBM J. Res. Dev. 2, 4 (Oct. 1958), 354-361.Google ScholarDigital Library
- 2 LUHN, H.P. The automatic creation of literature abstracts. IBM J. Res. Dev. 2, 2 (Apr. 1958), 159-165.Google ScholarDigital Library
- 3 OSWALD, V. A., JR., ET AL. Automatic indexing and abstract. ing of the contents of documents. RADC-TR-59-208, 3L October 1959, prepared for the Rome Air Development Center, Air Research and Development Command, United States Air Force, pp. 5-34, 59-133.Google Scholar
- 4 OSWALD, V. A. JR.; AND LAWSON, R.H. An idioglossary for mechanical translation. Mod. Language Forum 38, 2 (Sept.- Dec. 1953), 1-11.Google Scholar
- 5 RATH, G. J.; RESNICK, A.; and SAVAGE, T.R. The formation of abstracts by the selection of sentences. Research Report RC-184, 29 June 1959, IBM Research Center, Yorktown Heights, N. Y.Google Scholar
- 6 RATH, G. J.; RESNICK, A.; and SAVAGE, W.R. Comparisons of four types of lexical indicators of contents. Research Report RC-187, 14 August 1959, IBM Research Center, Yorktown Heights, N. Y.Google Scholar
- 7 RESNICK, A.; and SAVAGE, T .n . A re-evaluation of machine generated abstracts. Research Report RC-230, 1 March 1960, IBM Research Center, Yorktown Heights, N. Y.Google ScholarCross Ref
Index Terms
- Automatic abstracting and indexing—survey and recommendations
Recommendations
Automatic indexing
ACM '81: Proceedings of the ACM '81 conferenceOne of the first projects in computer analysis of natural language was to devise procedures for representing the subject content of a document by a few text-derived terms, a process called automatic indexing. Although many developments have taken place ...
Thesaurus based automatic keyphrase indexing
JCDL '06: Proceedings of the 6th ACM/IEEE-CS joint conference on Digital librariesWe propose a new method that enhances automatic keyphrase extraction by using semantic information on terms and phrases gleaned from a domain-specific thesaurus. We evaluate the results against keyphrase sets assigned by a state-of-the-art keyphrase ...
Comments