
Another stemmer

Published: 01 November 1990

Abstract

In natural language processing, conflation is the process of merging or lumping together nonidentical words which refer to the same principal concept. This can relate both to words which are entirely different in form (e.g., "group" and "collection"), and to words which share some common root (e.g., "group", "grouping", "subgroups"). In the former case the words can only be mapped by referring to a dictionary or thesaurus, but in the latter case use can be made of the orthographic similarities between the forms. One popular approach is to remove affixes from the input words, thus reducing them to a stem; if this could be done correctly, all the variant forms of a word would be converted to the same standard form. Since the process is aimed at mapping for retrieval purposes, the stem need not be a linguistically correct lemma or root (see also Frakes 1982).
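The affix-removal idea can be illustrated with a minimal sketch. This is not the algorithm from the article or from any of the cited stemmers; the affix lists, the minimum stem length, and the names `stem` and `conflate` are all assumptions chosen only to cover the examples in the abstract.

```python
from collections import defaultdict

# Hypothetical affix lists (not taken from the article); suffixes are
# ordered longest first so that "ings" is tried before "ing" and "s".
SUFFIXES = ("ings", "ing", "s")
PREFIXES = ("sub",)
MIN_STEM = 3  # assumed lower bound on the length of what remains


def stem(word):
    """Strip at most one listed prefix and one listed suffix."""
    word = word.lower()
    for prefix in PREFIXES:
        if word.startswith(prefix) and len(word) - len(prefix) >= MIN_STEM:
            word = word[len(prefix):]
            break
    for suffix in SUFFIXES:
        if word.endswith(suffix) and len(word) - len(suffix) >= MIN_STEM:
            word = word[:-len(suffix)]
            break
    return word


def conflate(words):
    """Group words that reduce to the same stem."""
    groups = defaultdict(list)
    for w in words:
        groups[stem(w)].append(w)
    return dict(groups)


if __name__ == "__main__":
    print(conflate(["group", "grouping", "subgroups", "collection"]))
```

With these assumed lists, "group", "grouping" and "subgroups" fall into one group, while "collection" stays separate: conflating it with "group" would need a dictionary or thesaurus, as the abstract notes. The stem is only a retrieval key and need not be a linguistically correct root.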

References

  1. Dawson, J. L. 1974: "Suffix removal and word conflation", ALLC Bulletin, 2(3), 33--46 (1974).
  2. Frakes, W. B. 1982: Term Conflation for Information Retrieval, Ph.D. dissertation, Syracuse University, August 1982.
  3. Lennon, M., Pierce, D. S., Tarry, B. D. and Willett, P. 1981: "An evaluation of some conflation algorithms for information retrieval", Journal of Information Science, 3, 177--183 (1981).
  4. Lovins, J. B. 1968: "Development of a stemming algorithm", Mechanical Translation and Computational Linguistics, 11, 22--31 (1968).
  5. Paice, C. D. 1977: Information Retrieval and the Computer, London: MacDonald & Jane's, 1977; chapter 4.
  6. Porter, M. F. 1980: "An algorithm for suffix stripping", Program, 14, 130--137 (1980).
  7. Ulmschneider, J. and Doszkocs, T. 1983: "A practical stemming algorithm for online search assistance", Online Review, 7(4), (1983).


Published in

ACM SIGIR Forum, Volume 24, Issue 3
Fall 1990
106 pages
ISSN: 0163-5840
DOI: 10.1145/101306

Copyright © 1990 Author

Publisher: Association for Computing Machinery, New York, NY, United States
