Skip to main content

2017 | OriginalPaper | Buchkapitel

LingoSent — A Platform for Linguistic Aware Sentiment Analysis for Social Media Messages

verfasst von : Yuting Su, Huijing Wang

Erschienen in: MultiMedia Modeling

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Sentiment analysis is an important natural language processing (NLP) task and applied to a wide range of scenarios. Social media messages such as tweets often differ from formal writing, exhibiting unorthodox capitalization, expressive lengthenings, Internet slang, etc. While such characteristics are inherently beneficial for the task of sentiment analysis, they also pose new challenges for existing NLP platforms. In this article, we present a new approach to improve lexicon-based sentiment analysis by extracting and utilizing linguistic features in a comprehensive manner. In contrast to existing solutions, we design our sentiment analysis approach as a framework with data preprocessing, linguistic feature extraction and sentiment calculation being separate components. This allows for easy modification and extension of each component. More importantly, we can easily configure the sentiment calculation with respect to the extracted features to optimize sentiment analysis for different application contexts. In a comprehensive evaluation, we show that our system outperforms existing state-of-the-art lexicon-based sentiment analysis solutions.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Saif, H., He, Y., Alani, H.: Alleviating data sparsity for Twitter sentiment analysis. In: Proceedings of the WWW 2012 Workshop on ‘Making Sense of Microposts’, vol. 838, pp. 2–9 (2012) Saif, H., He, Y., Alani, H.: Alleviating data sparsity for Twitter sentiment analysis. In: Proceedings of the WWW 2012 Workshop on ‘Making Sense of Microposts’, vol. 838, pp. 2–9 (2012)
2.
Zurück zum Zitat Khan, A.Z.H., Atique, M., Thakare, V.M.: Combining lexicon-based and learning-based methods for Twitter sentiment analysis. Int. J. Electron. Commun. Soft Comput. Sci. Eng. (IJECSCSE) 89 (2015) Khan, A.Z.H., Atique, M., Thakare, V.M.: Combining lexicon-based and learning-based methods for Twitter sentiment analysis. Int. J. Electron. Commun. Soft Comput. Sci. Eng. (IJECSCSE) 89 (2015)
3.
Zurück zum Zitat Liu, A.A., Nie, W.Z., Gao, Y., et al.: Multi-modal clique-graph matching for view-based 3D model retrieval. IEEE Trans. Image Process. 25(5), 2103–2116 (2016)MathSciNetCrossRef Liu, A.A., Nie, W.Z., Gao, Y., et al.: Multi-modal clique-graph matching for view-based 3D model retrieval. IEEE Trans. Image Process. 25(5), 2103–2116 (2016)MathSciNetCrossRef
4.
Zurück zum Zitat Liu, A.A., Su, Y.T., Nie, W.Z., Kankanhalli, M.: Hierarchical clustering multi-task learning for joint human action grouping and recognition. IEEE Trans. Pattern Anal. Mach. Intell. 39(1), 102–114 (2017). doi:10.1109/TPAMI.2016.2537337 CrossRef Liu, A.A., Su, Y.T., Nie, W.Z., Kankanhalli, M.: Hierarchical clustering multi-task learning for joint human action grouping and recognition. IEEE Trans. Pattern Anal. Mach. Intell. 39(1), 102–114 (2017). doi:10.​1109/​TPAMI.​2016.​2537337 CrossRef
5.
Zurück zum Zitat Pak, A., Paroubek, P.: Twitter as a corpus for sentiment analysis and opinion mining. In: Proceedings of the Seventh International Conference on Language Resources and Evaluation. European Language Resources Association (2010) Pak, A., Paroubek, P.: Twitter as a corpus for sentiment analysis and opinion mining. In: Proceedings of the Seventh International Conference on Language Resources and Evaluation. European Language Resources Association (2010)
6.
Zurück zum Zitat Agarwal, A., Xie, B., Vovsha, I., et al.: Sentiment analysis of Twitter data. In: Proceedings of the Workshop on Languages in Social Media, pp. 30–38 (2011) Agarwal, A., Xie, B., Vovsha, I., et al.: Sentiment analysis of Twitter data. In: Proceedings of the Workshop on Languages in Social Media, pp. 30–38 (2011)
7.
Zurück zum Zitat Kouloumpis, E., Wilson, T., Moore, J.D.: Twitter sentiment analysis: the good the bad and the OMG!. In: Proceedings of the Fifth International Conference on Weblogs and Social Media, vol. 11, pp. 538–541 (2011) Kouloumpis, E., Wilson, T., Moore, J.D.: Twitter sentiment analysis: the good the bad and the OMG!. In: Proceedings of the Fifth International Conference on Weblogs and Social Media, vol. 11, pp. 538–541 (2011)
8.
Zurück zum Zitat Chikersal, P., Poria, S., Cambria, E.: SeNTU: sentiment analysis of tweets by combining a rule-based classifier with supervised learning. In: Proceedings of the International Workshop on Semantic Evaluation, SemEval, pp. 647–651 (2015) Chikersal, P., Poria, S., Cambria, E.: SeNTU: sentiment analysis of tweets by combining a rule-based classifier with supervised learning. In: Proceedings of the International Workshop on Semantic Evaluation, SemEval, pp. 647–651 (2015)
9.
Zurück zum Zitat Bakliwal, A., Arora, P., Madhappan, S., et al.: Mining sentiments from tweets. In: Proceedings of the 3rd Workshop in Computational Approaches to Subjectivity and Sentiment Analysis, vol. 12, pp. 11–18. Association for Computational Linguistics, Stroudsburg (2012) Bakliwal, A., Arora, P., Madhappan, S., et al.: Mining sentiments from tweets. In: Proceedings of the 3rd Workshop in Computational Approaches to Subjectivity and Sentiment Analysis, vol. 12, pp. 11–18. Association for Computational Linguistics, Stroudsburg (2012)
10.
Zurück zum Zitat Mudinas, A., Zhang, D., Levene, M.: Combining lexicon and learning based approaches for concept-level sentiment analysis. In: Proceedings of the First International Workshop on Issues of Sentiment Discovery and Opinion Mining, pp. 5:1–5:8. ACM, New York (2012) Mudinas, A., Zhang, D., Levene, M.: Combining lexicon and learning based approaches for concept-level sentiment analysis. In: Proceedings of the First International Workshop on Issues of Sentiment Discovery and Opinion Mining, pp. 5:1–5:8. ACM, New York (2012)
11.
Zurück zum Zitat Nielsen, Å.: A new ANEW: evaluation of a word list for sentiment analysis in microblogs (2011). CoRR, arXiv preprint: arXiv:1103.2903 Nielsen, Å.: A new ANEW: evaluation of a word list for sentiment analysis in microblogs (2011). CoRR, arXiv preprint: arXiv:​1103.​2903
12.
Zurück zum Zitat Hutto, C.J., Vader, G.E.: VADER: a parsimonious rule-based model for sentiment analysis of social media text. In: Proceedings of the Eighth International Conference on Weblogs and Social Media. The AAAI Press (2014) Hutto, C.J., Vader, G.E.: VADER: a parsimonious rule-based model for sentiment analysis of social media text. In: Proceedings of the Eighth International Conference on Weblogs and Social Media. The AAAI Press (2014)
13.
Zurück zum Zitat Liu, B., Hu, M., Cheng, J.: Opinion observer: analyzing and comparing opinions on the web. In: Proceedings of the 14th International Conference on World Wide Web, pp. 342–351. ACM, New York (2005) Liu, B., Hu, M., Cheng, J.: Opinion observer: analyzing and comparing opinions on the web. In: Proceedings of the 14th International Conference on World Wide Web, pp. 342–351. ACM, New York (2005)
14.
Zurück zum Zitat Thelwall, M., Buckley, K., Paltoglou, G.: Sentiment strength detection for the social web. J. Am. Soc. Inf. Sci. Technol. 63, 163–173 (2012)CrossRef Thelwall, M., Buckley, K., Paltoglou, G.: Sentiment strength detection for the social web. J. Am. Soc. Inf. Sci. Technol. 63, 163–173 (2012)CrossRef
15.
Zurück zum Zitat Baccianella, S., Esuli, A., Sebastiani, F.: SentiWordNet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: Proceedings of the International Conference on Language Resources and Evaluation, vol. 10, pp. 2200–2204. European Language Resources Association (2010) Baccianella, S., Esuli, A., Sebastiani, F.: SentiWordNet 3.0: an enhanced lexical resource for sentiment analysis and opinion mining. In: Proceedings of the International Conference on Language Resources and Evaluation, vol. 10, pp. 2200–2204. European Language Resources Association (2010)
16.
Zurück zum Zitat Ding, X., Liu, B., Yu, P.S.: A holistic lexicon-based approach to opinion mining. In: Proceedings of the 2008 International Conference on Web Search and Data Mining, pp. 231–240. ACM, New York (2008) Ding, X., Liu, B., Yu, P.S.: A holistic lexicon-based approach to opinion mining. In: Proceedings of the 2008 International Conference on Web Search and Data Mining, pp. 231–240. ACM, New York (2008)
17.
Zurück zum Zitat Taboada, M., Brooke, J., Tofiloski, M., et al.: Lexicon-based methods for sentiment analysis. Comput. Linguist. 37, 267–307 (2011)CrossRef Taboada, M., Brooke, J., Tofiloski, M., et al.: Lexicon-based methods for sentiment analysis. Comput. Linguist. 37, 267–307 (2011)CrossRef
18.
Zurück zum Zitat Owoputi, O., O’Connor, B., Dyer, C., et al.: Improved part-of-speech tagging for online conversational text with word clusters. In: Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, pp. 380–390. Association for Computational Linguistics, Stroudsburg (2013) Owoputi, O., O’Connor, B., Dyer, C., et al.: Improved part-of-speech tagging for online conversational text with word clusters. In: Human Language Technologies: Conference of the North American Chapter of the Association of Computational Linguistics, pp. 380–390. Association for Computational Linguistics, Stroudsburg (2013)
19.
Zurück zum Zitat Berardi, G., Esuli, A., Marcheggiani, D., et al.: ISTI@ TREC Microblog Track 2011: exploring the use of hashtag segmentation and text quality ranking. In: Text REtrieval and Evaluation Conference (2011) Berardi, G., Esuli, A., Marcheggiani, D., et al.: ISTI@ TREC Microblog Track 2011: exploring the use of hashtag segmentation and text quality ranking. In: Text REtrieval and Evaluation Conference (2011)
Metadaten
Titel
LingoSent — A Platform for Linguistic Aware Sentiment Analysis for Social Media Messages
verfasst von
Yuting Su
Huijing Wang
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-51811-4_37

Neuer Inhalt