ABSTRACT
The growing popularity of Web 2.0 provides with increasing numbers of documents expressing opinions on different topics. Recently, new research approaches have been defined in order to automatically extract such opinions from the Internet. They usually consider opinions to be expressed through adjectives, and make extensive use of either general dictionaries or experts to provide the relevant adjectives. Unfortunately, these approaches suffer from the following drawback: in a specific domain, a given adjective may either not exist or have a different meaning from another domain. In this paper, we propose a new approach focusing on two steps. First, we automatically extract a learning dataset for a specific domain from the Internet. Secondly, from this learning set we extract the set of positive and negative adjectives relevant to the domain. The usefulness of our approach was demonstrated by experiments performed on real data.
- R. Agrawal and R. Srikant. Fast algorithms for mining association rules in large databases. In VLDB'94, 1994. Google ScholarDigital Library
- A. Andreevskaia and S. Bergler. Semantic tag extraction from wordnet glosses. 2007.Google Scholar
- K. Church and P. Hanks. Word association norms, mutual information, and lexicography. In Computational Linguistics, volume 16, pages 22--29, 1990. Google ScholarDigital Library
- D. Downey, M. Broadhead, and O. Etzioni. Locating complex named entities in web text. In Proceedings of IJCAI'07, pages 2733--2739, 2007.Google Scholar
- V. Hatzivassiloglou and K. McKeown. Predicting the semantic orientation of adjectives. In Proceedings of 35th Meeting of the Association for Computational Linguistics, Madrid, Spain, 1997. Google ScholarDigital Library
- M. Hu and B. Liu. Mining and summarizing customer reviews. In Proceedings of KDD'04, ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Seattle, WA, 2004. Google ScholarDigital Library
- J. Kamps, M. Marx, R. J. Mokken, and M. Rijke. Using wordnet to measure semantic orientation of adjectives. In Proceedings of LREC 2004, the 4th International Conference on Language Resources and Evaluation, pages 174--181, Lisbon, Portugal, 2004.Google Scholar
- G. Miller. Wordnet: A lexical database for english. In Communications of the ACM, 1995. Google ScholarDigital Library
- M. Plantié, M. Roche, G. Dray, and P. Poncelet. Is a voting approach accurate for opinion mining? In Proceedings of the 10th International Conference on Data Warehousing and Knowledge Discovery (DaWaK '08), Torino Italy, 2008. Google ScholarDigital Library
- V. Risbergen. Information retrieval, 2nd edition. In Butterworths, London, 1979. Google ScholarDigital Library
- M. Roche and V. Prince. AcroDef: A Quality Measure for Discriminating Expansions of Ambiguous Acronyms. In Proceedings of CONTEXT, Springer-Verlag, LNCS, pages 411--424, 2007.Google Scholar
- H. Schmid. Treetagger. In TC project at the Institute for Computational Linguistics of the University of Stuttgart, 1994.Google Scholar
- P. Stone, D. Dunphy, M. Smith, and D. Ogilvie. The general inquirer: A computer approach to content analysis. Cambridge, MA, 1966. MIT Press.Google Scholar
- M. Taboada, C. Anthony, and K. Voll. Creating semantic orientation dictionaries. 2006.Google Scholar
- P. Turney. Thumbs up or thumbs down? semantic orientation applied to unsupervised classification of reviews. In Proceedings of 40th Meeting of the Association for Computational Linguistics, pages 417--424, Paris, 2002. Google ScholarDigital Library
- K. Voll and M. Taboada. Not all words are created equal: Extracting semantic orientation as a function of adjective relevance. pages 337--346. Volume 4830/2007 AI 2007: Advances in Artificial Intelligence, 2007.Google Scholar
Index Terms
- Web opinion mining: how to extract opinions from blogs?
Recommendations
Finer Granularity Clustering for Opinion Mining
ISCID '09: Proceedings of the 2009 Second International Symposium on Computational Intelligence and Design - Volume 01The boom of opinion-rich resources such as online review websites, discussion groups, personal blogs and forums on the web has attracted many research efforts on opinion mining. Positive and negative opinions represented in review documents are helpful ...
Opinion Mining and Summarization of Hotel Reviews
CICN '14: Proceedings of the 2014 International Conference on Computational Intelligence and Communication NetworksEveryday many users purchases product, book travel tickets, buy goods and services through web. Users also share their views about product, hotel, news, and topic on web in the form of reviews, blogs, comments etc. Many users read review information ...
Feature and Opinion Mining for Customer Review Summarization
PReMI '09: Proceedings of the 3rd International Conference on Pattern Recognition and Machine IntelligenceIn this paper, we present an opinion mining system to identify product features and opinions from review documents. The features and opinions are extracted using semantic and linguistic analysis of text documents. The polarity of opinion sentences is ...
Comments