skip to main content
10.1145/2488388.2488442acmotherconferencesArticle/Chapter ViewAbstractPublication PageswwwConference Proceedingsconference-collections
research-article

Unsupervised sentiment analysis with emotional signals

Published:13 May 2013Publication History

ABSTRACT

The explosion of social media services presents a great opportunity to understand the sentiment of the public via analyzing its large-scale and opinion-rich data. In social media, it is easy to amass vast quantities of unlabeled data, but very costly to obtain sentiment labels, which makes unsupervised sentiment analysis essential for various applications. It is challenging for traditional lexicon-based unsupervised methods due to the fact that expressions in social media are unstructured, informal, and fast-evolving. Emoticons and product ratings are examples of emotional signals that are associated with sentiments expressed in posts or words. Inspired by the wide availability of emotional signals in social media, we propose to study the problem of unsupervised sentiment analysis with emotional signals. In particular, we investigate whether the signals can potentially help sentiment analysis by providing a unified way to model two main categories of emotional signals, i.e., emotion indication and emotion correlation. We further incorporate the signals into an unsupervised learning framework for sentiment analysis. In the experiment, we compare the proposed framework with the state-of-the-art methods on two Twitter datasets and empirically evaluate our proposed framework to gain a deep understanding of the effects of emotional signals.

References

  1. R. Abelson. Whatever became of consistency theory? Personality and Social Psychology Bulletin, 1983.Google ScholarGoogle ScholarCross RefCross Ref
  2. A. Andreevskaia and S. Bergler. Mining wordnet for fuzzy sentiment: Sentiment tag extraction from wordnet glosses. In Proceedings of EACL, 2006.Google ScholarGoogle Scholar
  3. S. Asur and B. Huberman. Predicting the future with social media. In WI-IAT, pages 492--499, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. J. Bollen, H. Mao, and X. Zeng. Twitter mood predicts the stock market. Journal of Computational Science, 2011.Google ScholarGoogle ScholarCross RefCross Ref
  5. S. Boyd and L. Vandenberghe. Convex optimization. Cambridge university press, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Brody and N. Diakopoulos. Cooooooooooooooollllllllllllll!!!!!!!!!!!!!!: using word lengthening to detect sentiment in microblogs. In Proceedings of EMNLP, pages 562--570, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. C. Ding, T. Li, and M. Jordan. Convex and semi-nonnegative matrix factorizations. IEEE TPAMI, 32:45--55, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. C. Ding, T. Li, W. Peng, and H. Park. Orthogonal nonnegative matrix t-factorizations for clustering. In Proceedings of SIGKDD, pages 126--135, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. T. Egener, J. Granado, and M. Guitton. High frequency of phenotypic deviations in physcomitrella patens plants transformed with a gene-disruption library. BMC Plant Biology, 2:6, 2002.Google ScholarGoogle ScholarCross RefCross Ref
  10. A. Go, R. Bhayani, and L. Huang. Twitter sentiment classification using distant supervision. Technical Report, Stanford, pages 1--12, 2009.Google ScholarGoogle Scholar
  11. Q. Gu and J. Zhou. Co-clustering on manifolds. In Proceedings of SIGKDD, pages 359--368, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. T. Hofmann. Probabilistic latent semantic indexing. In Proceedings of SIGIR, pages 50--57, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. M. Hu and B. Liu. Mining and summarizing customer reviews. In Proceedings of SIGKDD, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. X. Hu, N. Sun, C. Zhang, and T.-S. Chua. Exploiting internal and external semantics for the clustering of short texts using world knowledge. In Proceedings of CIKM, pages 919--928, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. X. Hu, L. Tang, J. Tang, and H. Liu. Exploiting social relations for sentiment analysis in microblogging. In Proceedings of WSDM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. Y. Hu, S. D. Farnham, and A. Monroy-Hernández. Whoo. ly: Facilitating information seeking for hyperlocal communities using social media. In Proceedings of CHI, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. Y. Hu, A. John, D. Seligmann, and F. Wang. What were the tweets about? topical associations between public events and twitter feeds. ICWSM, 2012.Google ScholarGoogle Scholar
  18. E. Kim, S. Gilbert, M. Edwards, and E. Graeff. Detecting sadness in 140 characters: Sentiment analysis of mourning michael jackson on twitter. 2009.Google ScholarGoogle Scholar
  19. T. Li, V. Sindhwani, C. Ding, and Y. Zhang. Bridging domains with words: Opinion analysis with matrix tri-factorizations. In Proceedings of SDM, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  20. B. Liu. Sentiment analysis and subjectivity. Handbook of Natural Language Processing, 2010.Google ScholarGoogle Scholar
  21. B. Liu and L. Zhang. A survey of opinion mining and sentiment analysis. Mining Text Data, 2012.Google ScholarGoogle ScholarCross RefCross Ref
  22. K.-L. Liu, W.-J. Li, and M. Guo. Emoticon smoothed language models for twitter sentiment analysis. In Proceedings of AAAI, 2012.Google ScholarGoogle Scholar
  23. Y. Lu, M. Castellanos, U. Dayal, and C. Zhai. Automatic construction of a context-aware sentiment lexicon: an optimization approach. In Proceedings of WWW, pages 347--356, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. B. O Connor, R. Balasubramanyan, B. Routledge, and N. Smith. From tweets to polls: Linking text sentiment to public opinion time series. In Proceedings of ICWSM, 2010.Google ScholarGoogle Scholar
  25. B. Pang, L. Lee, and S. Vaithyanathan. Thumbs up?: sentiment classification using machine learning techniques. In Proceedings of ACL, pages 79--86, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. W. Peng and D. H. Park. Generate adjective sentiment dictionary for social media sentiment analysis using constrained nonnegative matrix factorization. In ICWSM, 2011.Google ScholarGoogle Scholar
  27. S. Prentice and E. Huffman. Social medias new role in emergency management. Idaho National Laboratory, pages 1--5, 2008.Google ScholarGoogle Scholar
  28. D. Seung and L. Lee. Algorithms for non-negative matrix factorization. NIPS, pages 556--562, 2001.Google ScholarGoogle Scholar
  29. D. Shamma, L. Kennedy, and E. Churchill. Tweet the debates: understanding community annotation of uncollected sources. In Proceedings of WSM, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. M. Speriosu, N. Sudan, S. Upadhyay, and J. Baldridge. Twitter polarity classification with label propagation over lexical links and the follower graph. In Proceedings of the First Workshop on Unsupervised Learning in NLP, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. P. Stone, D. Dunphy, and M. Smith. The general inquirer: A computer approach to content analysis. 1966.Google ScholarGoogle Scholar
  32. M. Taboada, J. Brooke, M. Tofiloski, K. Voll, and M. Stede. Lexicon-based methods for sentiment analysis. Computational Linguistics, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. J. Tang, H. Gao, X. Hu, and H. Liu. Exploiting homophily effect for trust prediction. In WSDM, 2013. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. P. Turney. Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews. In Proceedings of ACL, pages 417--424, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. H. Wang, Y. Lu, and C. Zhai. Latent aspect rating analysis on review text data: a rating regression approach. In Proceedings of SIGKDD, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. J. Wiebe, T. Wilson, and C. Cardie. Annotating expressions of opinions and emotions in language. Language Resources and Evaluation, 39:165--210, 2005.Google ScholarGoogle ScholarCross RefCross Ref
  37. T. Wilson, J. Wiebe, and P. Hoffmann. Recognizing contextual polarity in phrase-level sentiment analysis. In Proceedings of HLT and EMNLP, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. Y. Xie, Z. Chen, K. Zhang, M. M. A. Patwary, Y. Cheng, H. Liu, A. Agrawal, and A. Choudhary. Graphical modeling of macro behavioral targeting in social networks. In Proceedings of SDM, 2013.Google ScholarGoogle ScholarCross RefCross Ref
  39. L. Zhang and B. Liu. Identifying noun product features that imply opinions. In Proceedings of ACL:HLT, pages 575--580, 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. J. Zhao, L. Dong, J. Wu, and K. Xu. Moodlens: an emoticon-based sentiment analysis system for chinese tweets. In Proceedings of SIGKDD, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Unsupervised sentiment analysis with emotional signals

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader