skip to main content
article

Measuring praise and criticism: Inference of semantic orientation from association

Published:01 October 2003Publication History
Skip Abstract Section

Abstract

The evaluative character of a word is called its semantic orientation. Positive semantic orientation indicates praise (e.g., "honest", "intrepid") and negative semantic orientation indicates criticism (e.g., "disturbing", "superfluous"). Semantic orientation varies in both direction (positive or negative) and degree (mild to strong). An automated system for measuring semantic orientation would have application in text classification, text filtering, tracking opinions in online discussions, analysis of survey responses, and automated chat systems (chatbots). This article introduces a method for inferring the semantic orientation of a word from its statistical association with a set of positive and negative paradigm words. Two instances of this approach are evaluated, based on two different statistical measures of word association: pointwise mutual information (PMI) and latent semantic analysis (LSA). The method is experimentally tested with 3,596 words (including adjectives, adverbs, nouns, and verbs) that have been manually labeled positive (1,614 words) and negative (1,982 words). The method attains an accuracy of 82.8% on the full test set, but the accuracy rises above 95% when the algorithm is allowed to abstain from classifying mild words.

References

  1. Agresti, A. 1996. An Introduction to Categorical Data Analysis. Wiley, New York.]]Google ScholarGoogle Scholar
  2. Bartell, B. T., Cottrell, G. W., and Belew, R. K. 1992. Latent semantic indexing is an optimal special case of multidimensional scaling. Proceedings of the 15th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM, New York, 161--167.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Budanitsky, A. and Hirst, G. 2001. Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures. In Workshop on WordNet and Other Lexical Resources, Second meeting of the North American Chapter of the Association for Computational Linguistics (Pittsburgh, Pa).]]Google ScholarGoogle Scholar
  4. Church, K. W. and Hanks, P. 1989. Word association norms, mutual information and lexicography. In Proceedings of the 27th Annual Conference of the Association of Computational Linguistics. Association for Computational Linguistics, New Brunswick, N.J., 76--83.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer, T. K., and Harshman, R. 1990. Indexing by latent semantic analysis. J. Amer. Soc. Inf. Sci. 41, 6, 391--407.]]Google ScholarGoogle ScholarCross RefCross Ref
  6. Dunning, T. 1993. Accurate methods for the statistics of surprise and coincidence. Comput. Ling. 19, 61--74.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Firth, J. R. 1957. A synopsis of linguistic theory 1930--1955. In Studies in Linguistic Analysis, Philological Society, Oxford, England, 1--32. (Reprinted in F. R. Palmer (ed.), Selected Papers of J. R. Firth 1952--1959, Longman, London, England, 1968).]]Google ScholarGoogle Scholar
  8. Golub, G. H. and Van Loan, C. F. 1996. Matrix Computations. Third edition. Johns Hopkins University Press, Baltimore, Md.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Hatzivassiloglou, V. and McKeown, K. R. 1997. Predicting the semantic orientation of adjectives. Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and the 8th Conference of the European Chapter of the ACL. Association for Computational Linguistics, New Brunswick, N.J., 174--181.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Hatzivassiloglou, V. and Wiebe, J. M. 2000. Effects of adjective orientation and gradability on sentence subjectivity. Proceedings of 18th International Conference on Computational Linguistics. Association for Computational Linguistics, New Brunswick, N.J.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. Hearst, M. A. 1992. Direction-based text interpretation as an information access refinement. In Text-Based Intelligent Systems: Current Research and Practice in Information Extraction and Retrieval. P. Jacobs, Ed. Lawrence Erlbaum Associates, Mahwah, N.J.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Kamps, J. and Marx, M. 2002. Words with attitude. In Proceedings of the 1st International Conference on Global WordNet. CIIL, Mysore, India, 332--341.]]Google ScholarGoogle Scholar
  13. Landauer, T. K. 2002. On the computational basis of learning and cognition: Arguments from LSA. In The Psychology of Learning and Motivation, B. H. Ross, Ed. To appear.]]Google ScholarGoogle Scholar
  14. Landauer, T. K. and Dumais, S. T. 1997. A solution to Plato's problem: The latent semantic analysis theory of the acquisition, induction, and representation of knowledge. Psych. Rev. 104, 211--240.]]Google ScholarGoogle ScholarCross RefCross Ref
  15. Littman, M. L. 2001. Language games and other meaningful pursuits. Presentation slides. (http://www.cs.rutgers.edu/∼mlittman/talks/CA-lang.ppt).]]Google ScholarGoogle Scholar
  16. Manning, C. D. and Schütze, H. 1999. Foundations of Statistical Natural Language Processing. MIT Press, Cambridge, Mass.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Miller, G. A. 1990. WordNet: An on-line lexical database. Int. J. Lexicog. 3, 4, 235--312.]]Google ScholarGoogle ScholarCross RefCross Ref
  18. Osgood, C. E., Suci, G. J., and Tannenbaum, P. H. 1957. The Measurement of Meaning. University of Illinois Press, Chicago, Ill.]]Google ScholarGoogle Scholar
  19. Pang, B., Lee, L., and Vaithyanathan, S. 2002. Thumbs up? Sentiment classification using machine learning techniques. In Proceedings of the 2002 Conference on Empirical Methods in Natural Language Processing. 79--86.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Resnik, P. 1995. Using information content to evaluate semantic similarity in a taxonomy. In Proceedings of the 14th International Joint Conference on Artificial Intelligence. Morgan-Kaufmann, San Mateo, Calif., 448--453.]]Google ScholarGoogle Scholar
  21. Schütze, H. 1993. Word space. In Advances in Neural Information Processing Systems 5. S. J. Hanson, J. D. Cowan, and C. L. Giles, Ed. Morgan-Kaufmann., San Mateo, Calif., 895--902.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Smadja, F. 1993. Retrieving collocations from Text: Xtract. Comput. Ling. 19, 143--177.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Spertus, E. 1997. Smokey: Automatic recognition of hostile messages. In Proceedings of the Conference on Innovative Applications of Artificial Intelligence. AAAI Press, Menlo Park, Calif., 1058--1065.]]Google ScholarGoogle Scholar
  24. Stone, P. J., Dunphy, D. C., Smith, M. S., and Ogilvie, D. M. 1966. The General Inquirer: A Computer Approach to Content Analysis. MIT Press, Cambridge, Mass.]]Google ScholarGoogle Scholar
  25. Tong, R. M. 2001. An operational system for detecting and tracking opinions in on-line discussions. Working Notes of the ACM SIGIR 2001 Workshop on Operational Text Classification. ACM, New York, 1--6.]]Google ScholarGoogle Scholar
  26. Turney, P. D. 2001. Mining the Web for synonyms: PMI-IR versus LSA on TOEFL. In Proceedings of the 12th European Conference on Machine Learning. Springer-Verlag, Berlin, Germany, 491--502.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Turney, P. D. 2002. Thumbs up or thumbs down? Semantic orientation applied to unsupervised classification of reviews. In Proceedings of the Association for Computational Linguistics 40th Anniversary Meeting. Association for Computational Linguistics, New Brunswick, N.J.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. van Rijsbergen, C. J. 1979. Information Retrieval (2nd edition), Butterworths, London, England.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  29. Wiebe, J. M. 2000. Learning subjective adjectives from corpora. In Proceedings of the 17th National Conference on Artificial Intelligence. AAAI Press, Menlo Park, Calif.]] Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. Wiebe, J. M., Bruce, R., Bell, M., Martin, M., & Wilson, T. 2001. A corpus study of evaluative and speculative language. In Proceedings of the 2nd ACL SIG on Dialogue Workshop on Discourse and Dialogue (Aalborg, Denmark).]] Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Measuring praise and criticism: Inference of semantic orientation from association

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in

            Full Access

            • Published in

              cover image ACM Transactions on Information Systems
              ACM Transactions on Information Systems  Volume 21, Issue 4
              October 2003
              179 pages
              ISSN:1046-8188
              EISSN:1558-2868
              DOI:10.1145/944012
              Issue’s Table of Contents

              Copyright © 2003 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 1 October 2003
              Published in tois Volume 21, Issue 4

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • article

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader