ABSTRACT
Many messages express opinions about events, products, and services, political views, or even their authors' emotional state and mood. Sentiment analysis has been used in several applications, including analysis of the repercussions of events in social networks, analysis of opinions about products and services, and simply to better understand aspects of social communication in Online Social Networks (OSNs). There are multiple methods for measuring sentiments, including lexical-based approaches and supervised machine learning methods. Despite the wide use and popularity of some methods, it is unclear which method is best for identifying the polarity (i.e., positive or negative) of a message, because the current literature does not provide a comparison among existing methods. Such a comparison is crucial for understanding the potential limitations, advantages, and disadvantages of popular methods in analyzing the content of OSN messages. Our study aims to fill this gap by comparing eight popular sentiment analysis methods in terms of coverage (i.e., the fraction of messages whose sentiment is identified) and agreement (i.e., the fraction of identified sentiments that agree with the ground truth). We develop a new method that combines existing approaches, providing the best coverage results and competitive agreement. We also present a free Web service called iFeel, which provides an open API for accessing and comparing results across different sentiment methods for a given text.
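The two evaluation measures defined in the abstract can be illustrated with a minimal sketch. It assumes a method returns `None` when it cannot identify a sentiment and a polarity label otherwise. The function names, the label strings, and the toy data are hypothetical and are not taken from the paper or from iFeel.

```python
from typing import Optional, Sequence


def coverage(predictions: Sequence[Optional[str]]) -> float:
    """Fraction of messages for which a sentiment was identified."""
    if not predictions:
        return 0.0
    identified = sum(1 for p in predictions if p is not None)
    return identified / len(predictions)


def agreement(predictions: Sequence[Optional[str]],
              ground_truth: Sequence[str]) -> float:
    """Fraction of identified sentiments that match the ground truth."""
    # Only messages with an identified sentiment count toward agreement.
    pairs = [(p, t) for p, t in zip(predictions, ground_truth) if p is not None]
    if not pairs:
        return 0.0
    matches = sum(1 for p, t in pairs if p == t)
    return matches / len(pairs)


# Toy data (hypothetical): None means "no sentiment identified".
predictions = ["positive", None, "negative", "positive", None]
ground_truth = ["positive", "negative", "negative", "negative", "positive"]

print(f"coverage  = {coverage(predictions):.2f}")                  # 3 of 5 messages
print(f"agreement = {agreement(predictions, ground_truth):.2f}")   # 2 of 3 identified
```

In this toy run the method identifies 3 of 5 messages and is right on 2 of those 3. A method can therefore score well on one measure and poorly on the other, which is why the abstract reports both.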
Index Terms
- Comparing and combining sentiment analysis methods