skip to main content
10.1145/1456223.1456269acmotherconferencesArticle/Chapter ViewAbstractPublication PagescststConference Proceedingsconference-collections
research-article

Web opinion mining: how to extract opinions from blogs?

Published:28 October 2008Publication History

ABSTRACT

The growing popularity of Web 2.0 provides with increasing numbers of documents expressing opinions on different topics. Recently, new research approaches have been defined in order to automatically extract such opinions from the Internet. They usually consider opinions to be expressed through adjectives, and make extensive use of either general dictionaries or experts to provide the relevant adjectives. Unfortunately, these approaches suffer from the following drawback: in a specific domain, a given adjective may either not exist or have a different meaning from another domain. In this paper, we propose a new approach focusing on two steps. First, we automatically extract a learning dataset for a specific domain from the Internet. Secondly, from this learning set we extract the set of positive and negative adjectives relevant to the domain. The usefulness of our approach was demonstrated by experiments performed on real data.

References

  1. R. Agrawal and R. Srikant. Fast algorithms for mining association rules in large databases. In VLDB'94, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. A. Andreevskaia and S. Bergler. Semantic tag extraction from wordnet glosses. 2007.Google ScholarGoogle Scholar
  3. K. Church and P. Hanks. Word association norms, mutual information, and lexicography. In Computational Linguistics, volume 16, pages 22--29, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. D. Downey, M. Broadhead, and O. Etzioni. Locating complex named entities in web text. In Proceedings of IJCAI'07, pages 2733--2739, 2007.Google ScholarGoogle Scholar
  5. V. Hatzivassiloglou and K. McKeown. Predicting the semantic orientation of adjectives. In Proceedings of 35th Meeting of the Association for Computational Linguistics, Madrid, Spain, 1997. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. M. Hu and B. Liu. Mining and summarizing customer reviews. In Proceedings of KDD'04, ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Seattle, WA, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. Kamps, M. Marx, R. J. Mokken, and M. Rijke. Using wordnet to measure semantic orientation of adjectives. In Proceedings of LREC 2004, the 4th International Conference on Language Resources and Evaluation, pages 174--181, Lisbon, Portugal, 2004.Google ScholarGoogle Scholar
  8. G. Miller. Wordnet: A lexical database for english. In Communications of the ACM, 1995. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. M. Plantié, M. Roche, G. Dray, and P. Poncelet. Is a voting approach accurate for opinion mining? In Proceedings of the 10th International Conference on Data Warehousing and Knowledge Discovery (DaWaK '08), Torino Italy, 2008. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. V. Risbergen. Information retrieval, 2nd edition. In Butterworths, London, 1979. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. M. Roche and V. Prince. AcroDef: A Quality Measure for Discriminating Expansions of Ambiguous Acronyms. In Proceedings of CONTEXT, Springer-Verlag, LNCS, pages 411--424, 2007.Google ScholarGoogle Scholar
  12. H. Schmid. Treetagger. In TC project at the Institute for Computational Linguistics of the University of Stuttgart, 1994.Google ScholarGoogle Scholar
  13. P. Stone, D. Dunphy, M. Smith, and D. Ogilvie. The general inquirer: A computer approach to content analysis. Cambridge, MA, 1966. MIT Press.Google ScholarGoogle Scholar
  14. M. Taboada, C. Anthony, and K. Voll. Creating semantic orientation dictionaries. 2006.Google ScholarGoogle Scholar
  15. P. Turney. Thumbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews. In Proceedings of 40th Meeting of the Association for Computational Linguistics, pages 417--424, Paris, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. K. Voll and M. Taboada. Not all words are created equal: Extracting semantic orientation as a function of adjective relevance. pages 337--346. Volume 4830/2007 AI 2007: Advances in Artificial Intelligence, 2007.Google ScholarGoogle Scholar

Index Terms

  1. Web opinion mining: how to extract opinions from blogs?

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      CSTST '08: Proceedings of the 5th international conference on Soft computing as transdisciplinary science and technology
      October 2008
      733 pages
      ISBN:9781605580463
      DOI:10.1145/1456223

      Copyright © 2008 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 28 October 2008

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader