DOI: 10.3115/981658.981684

Unsupervised word sense disambiguation rivaling supervised methods

Published: 26 June 1995

ABSTRACT

This paper presents an unsupervised learning algorithm for sense disambiguation that, when trained on unannotated English text, rivals the performance of supervised techniques that require time-consuming hand annotations. The algorithm is based on two powerful constraints (that words tend to have one sense per discourse and one sense per collocation), which are exploited in an iterative bootstrapping procedure. Tested accuracy exceeds 96%.
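
The abstract names the two constraints and the bootstrapping loop but does not spell out the procedure. The following is a toy sketch of how such a loop could look, not the paper's implementation. Everything beyond the two constraints is an assumption made for illustration: the function name `bootstrap`, the use of seed collocations to start the process, the smoothed log-ratio score, the `threshold` and `smoothing` values, and the per-document majority vote.

```python
import math
from collections import Counter, defaultdict


def bootstrap(contexts, seeds, threshold=1.0, smoothing=0.1, max_iters=10):
    """Toy bootstrapping loop for a two-sense target word (illustrative only).

    contexts: list of (doc_id, set_of_context_words), one per occurrence.
    seeds:    dict mapping a collocate to a sense label (assumed starting point).
    Returns a list of sense labels (or None) aligned with `contexts`.
    """
    sense_a, sense_b = sorted(set(seeds.values()))

    # Start by labelling only the occurrences that contain a seed collocate.
    labels = []
    for _, words in contexts:
        hits = {seeds[w] for w in words if w in seeds}
        labels.append(hits.pop() if len(hits) == 1 else None)

    for _ in range(max_iters):
        # Count how often each collocate appears with each current label.
        counts = {sense_a: Counter(), sense_b: Counter()}
        for (_, words), label in zip(contexts, labels):
            if label is not None:
                counts[label].update(words)

        def score(word):
            # Smoothed log ratio: > 0 favours sense_a, < 0 favours sense_b.
            return math.log((counts[sense_a][word] + smoothing)
                            / (counts[sense_b][word] + smoothing))

        # One sense per collocation: the single strongest collocate decides.
        new_labels = [None] * len(contexts)
        for i, (_, words) in enumerate(contexts):
            best = max(words, key=lambda w: abs(score(w)), default=None)
            if best is not None and abs(score(best)) >= threshold:
                new_labels[i] = sense_a if score(best) > 0 else sense_b

        # One sense per discourse: the majority label in a document wins.
        votes = defaultdict(Counter)
        for (doc, _), label in zip(contexts, new_labels):
            if label is not None:
                votes[doc][label] += 1
        for i, (doc, _) in enumerate(contexts):
            tally = votes[doc]
            if tally[sense_a] != tally[sense_b]:
                new_labels[i] = sense_a if tally[sense_a] > tally[sense_b] else sense_b

        if new_labels == labels:  # stop once the labelling no longer changes
            break
        labels = new_labels
    return labels


# Hypothetical data: seven occurrences of an ambiguous word in four documents.
contexts = [
    ("d1", {"life", "animal"}),
    ("d1", {"animal", "species"}),
    ("d2", {"manufacturing", "equipment"}),
    ("d2", {"equipment", "workers"}),
    ("d3", {"species", "growth"}),
    ("d3", {"soil"}),
    ("d4", {"workers", "union"}),
]
seeds = {"life": "living", "manufacturing": "factory"}
labels = bootstrap(contexts, seeds)
```

In this made-up data only two occurrences contain a seed word. The remaining ones are reached over later passes through newly learned collocates such as "animal" and "equipment", and the occurrence with only "soil" in its context is labelled purely by the other occurrence in the same document.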

      • Published in

        ACL '95: Proceedings of the 33rd annual meeting on Association for Computational Linguistics
        June 1995
        354 pages

        Publisher

        Association for Computational Linguistics

        United States


        Qualifiers

        • Article

        Acceptance Rates

        Overall Acceptance Rate: 85 of 443 submissions, 19%
