Skip to main content
Erschienen in:

18.07.2022

Automatically Constructing a Fine-Grained Sentiment Lexicon for Sentiment Analysis

verfasst von: Yabing Wang, Guimin Huang, Maolin Li, Yiqun Li, Xiaowei Zhang, Hui Li

Erschienen in: Cognitive Computation | Ausgabe 1/2023

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Sentiment analysis is an important research area in natural language processing (NLP), and the performance of sentiment analysis models is largely influenced by the quality of sentiment lexicons. Existing sentiment lexicons contain only the sentiment information of words. In this paper, we propose an approach for automatically constructing a fine-grained sentiment lexicon that contains both emotion information and sentiment information to solve the problem that the emotion and sentiment of texts cannot be jointly analyzed. We design an emotion-sentiment transfer method and construct a fine-grained sentiment seed lexicon, and we then expand the sentiment seed lexicon by applying the graph dissemination method to the synonym set. Subsequently, we propose a multi-information fusion method based on neural network to expand the sentiment lexicon based on a corpus. Finally, we generate the Fine-Grained Sentiment Lexicon (FGSL), which contains 40,554 words. FGSL achieves F1 values of 61.97%, 69.58%, and 66.99% on three emotion datasets and 88.19%, 89.31%, and 86.88% on three sentiment datasets. Experimental results on multiple public benchmark datasets illustrate that FGSL achieves significantly better performance in both emotion analysis and sentiment analysis tasks.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Cambria E, Kumar A, Al-Ayyoub M, Howard N. Guest Editorial: explainable artificial intelligence for sentiment analysis. Elsevier; 2021. Cambria E, Kumar A, Al-Ayyoub M, Howard N. Guest Editorial: explainable artificial intelligence for sentiment analysis. Elsevier; 2021.
2.
Zurück zum Zitat Liang B, Su H, Gui L, Cambria E, Xu R. Aspect-based sentiment analysis via affective knowledge enhanced graph convolutional networks. Knowl-Based Syst. 2022;235:107643. Liang B, Su H, Gui L, Cambria E, Xu R. Aspect-based sentiment analysis via affective knowledge enhanced graph convolutional networks. Knowl-Based Syst. 2022;235:107643.
3.
Zurück zum Zitat Mohammad SM, Turney PD. Crowdsourcing a word-emotion association lexicon. Comput Intell. 2013;29(3):436–65.MathSciNetCrossRef Mohammad SM, Turney PD. Crowdsourcing a word-emotion association lexicon. Comput Intell. 2013;29(3):436–65.MathSciNetCrossRef
5.
Zurück zum Zitat Wilson T, Wiebe J, Hoffmann P. Recognizing contextual polarity in phrase-level sentiment analysis. In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing. 2005. p. 347–54. Wilson T, Wiebe J, Hoffmann P. Recognizing contextual polarity in phrase-level sentiment analysis. In: Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing. 2005. p. 347–54.
6.
Zurück zum Zitat Stone PJ, Dunphy DC, Smith MS. The general inquirer: a computer approach to content analysis. 1966. Stone PJ, Dunphy DC, Smith MS. The general inquirer: a computer approach to content analysis. 1966.
7.
Zurück zum Zitat Hu M, Liu B. Mining and summarizing customer reviews. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2004. p. 168–77. Hu M, Liu B. Mining and summarizing customer reviews. In: Proceedings of the Tenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. 2004. p. 168–77.
8.
Zurück zum Zitat Bravo-Marquez F, Khanchandani A, Pfahringer B. Incremental word vectors for time-evolving sentiment lexicon induction. Cogn Comput. 2022;14(1):425–41.CrossRef Bravo-Marquez F, Khanchandani A, Pfahringer B. Incremental word vectors for time-evolving sentiment lexicon induction. Cogn Comput. 2022;14(1):425–41.CrossRef
9.
Zurück zum Zitat Sharma SS, Dutta G. Sentidraw: using star ratings of reviews to develop domain specific sentiment lexicon for polarity determination. Inf Process Manag. 2021;58(1):102412. Sharma SS, Dutta G. Sentidraw: using star ratings of reviews to develop domain specific sentiment lexicon for polarity determination. Inf Process Manag. 2021;58(1):102412.
10.
Zurück zum Zitat Huang M, Xie H, Rao Y, Feng J, Wang FL. Sentiment strength detection with a context-dependent lexicon-based convolutional neural network. Inform Sci. 2020;520:389–99.CrossRef Huang M, Xie H, Rao Y, Feng J, Wang FL. Sentiment strength detection with a context-dependent lexicon-based convolutional neural network. Inform Sci. 2020;520:389–99.CrossRef
11.
Zurück zum Zitat Viegas F, Alvim MS, Canuto S, Rosa T, Gonçalves MA, Rocha L. Exploiting semantic relationships for unsupervised expansion of sentiment lexicons. Inf Syst. 2020;94:101606. Viegas F, Alvim MS, Canuto S, Rosa T, Gonçalves MA, Rocha L. Exploiting semantic relationships for unsupervised expansion of sentiment lexicons. Inf Syst. 2020;94:101606.
12.
Zurück zum Zitat Hutto C, Gilbert E. Vader: a parsimonious rule-based model for sentiment analysis of social media text. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 8. 2014. Hutto C, Gilbert E. Vader: a parsimonious rule-based model for sentiment analysis of social media text. In: Proceedings of the International AAAI Conference on Web and Social Media, vol. 8. 2014.
13.
Zurück zum Zitat De Bruyne L, Atanasova P, Augenstein I. Joint emotion label space modeling for affect lexica. Comput Speech Lang. 2022;71:101257. De Bruyne L, Atanasova P, Augenstein I. Joint emotion label space modeling for affect lexica. Comput Speech Lang. 2022;71:101257.
14.
Zurück zum Zitat Bandhakavi A, Wiratunga N, Massie S. Emotion-aware polarity lexicons for twitter sentiment analysis. Expert Syst. 2021;38(7):12332. Bandhakavi A, Wiratunga N, Massie S. Emotion-aware polarity lexicons for twitter sentiment analysis. Expert Syst. 2021;38(7):12332.
15.
Zurück zum Zitat Yin F, Wang Y, Liu J, Lin L. The construction of sentiment lexicon based on context-dependent part-of-speech chunks for semantic disambiguation. IEEE Access. 2020;8:63359–67.CrossRef Yin F, Wang Y, Liu J, Lin L. The construction of sentiment lexicon based on context-dependent part-of-speech chunks for semantic disambiguation. IEEE Access. 2020;8:63359–67.CrossRef
16.
Zurück zum Zitat Du M, Li X, Luo L. A training-optimization-based method for constructing domain-specific sentiment lexicon. Complexity. 2021;2021. Du M, Li X, Luo L. A training-optimization-based method for constructing domain-specific sentiment lexicon. Complexity. 2021;2021.
17.
Zurück zum Zitat Ekman P. An argument for basic emotions. Cognit Emot. 1992;6(3–4):169–200.CrossRef Ekman P. An argument for basic emotions. Cognit Emot. 1992;6(3–4):169–200.CrossRef
18.
19.
Zurück zum Zitat Kilgarriff A. Wordnet: an electronic lexical database. JSTOR; 2000. Kilgarriff A. Wordnet: an electronic lexical database. JSTOR; 2000.
20.
Zurück zum Zitat Mohammad SM, Kiritchenko S. Using hashtags to capture fine emotion categories from tweets. Comput Intell. 2015;31(2):301–26.MathSciNetCrossRef Mohammad SM, Kiritchenko S. Using hashtags to capture fine emotion categories from tweets. Comput Intell. 2015;31(2):301–26.MathSciNetCrossRef
21.
Zurück zum Zitat Mikolov T, Chen K, Corrado G, Dean J. Efficient estimation of word representations in vector space. arXiv preprint arXiv:1301.3781. 2013. Mikolov T, Chen K, Corrado G, Dean J. Efficient estimation of word representations in vector space. arXiv preprint arXiv:​1301.​3781. 2013.
22.
Zurück zum Zitat Pennington J, Socher R, Manning CD. Glove: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). 2014. p. 1532–43. Pennington J, Socher R, Manning CD. Glove: global vectors for word representation. In: Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). 2014. p. 1532–43.
23.
Zurück zum Zitat Cilibrasi RL, Vitanyi PM. The Google similarity distance. IEEE Trans Knowl Data Eng. 2007;19(3):370–83.CrossRef Cilibrasi RL, Vitanyi PM. The Google similarity distance. IEEE Trans Knowl Data Eng. 2007;19(3):370–83.CrossRef
24.
Zurück zum Zitat Deerwester S, Dumais ST, Furnas GW, Landauer TK, Harshman R. Indexing by latent semantic analysis. J Am Soc Inf Sci. 1990;41(6):391–407.CrossRef Deerwester S, Dumais ST, Furnas GW, Landauer TK, Harshman R. Indexing by latent semantic analysis. J Am Soc Inf Sci. 1990;41(6):391–407.CrossRef
25.
Zurück zum Zitat Strapparava C, Mihalcea R. Semeval-2007 task 14: affective text. In: Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007). 2007. p. 70–4. Strapparava C, Mihalcea R. Semeval-2007 task 14: affective text. In: Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007). 2007. p. 70–4.
26.
Zurück zum Zitat Wang W, Chen L, Thirunarayan K, Sheth AP. Harnessing twitter “big data” for automatic emotion identification. In: 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing. IEEE; 2012. p. 587–92. Wang W, Chen L, Thirunarayan K, Sheth AP. Harnessing twitter “big data” for automatic emotion identification. In: 2012 International Conference on Privacy, Security, Risk and Trust and 2012 International Confernece on Social Computing. IEEE; 2012. p. 587–92.
27.
Zurück zum Zitat Bandhakavi A, Wiratunga N, Massie S, Deepak P. Emotion-corpus guided lexicons for sentiment analysis on twitter. In: International Conference on Innovative Techniques and Applications of Artificial Intelligence. Springer; 2016. p. 71–85. Bandhakavi A, Wiratunga N, Massie S, Deepak P. Emotion-corpus guided lexicons for sentiment analysis on twitter. In: International Conference on Innovative Techniques and Applications of Artificial Intelligence. Springer; 2016. p. 71–85.
28.
Zurück zum Zitat Aman S, Szpakowicz S. Identifying expressions of emotion in text. In: International Conference on Text, Speech and Dialogue. Springer; 2007. p. 196–205. Aman S, Szpakowicz S. Identifying expressions of emotion in text. In: International Conference on Text, Speech and Dialogue. Springer; 2007. p. 196–205.
29.
Zurück zum Zitat Pang B, Lee L. Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. arXiv preprint cs/0506075. 2005. Pang B, Lee L. Seeing stars: exploiting class relationships for sentiment categorization with respect to rating scales. arXiv preprint cs/​0506075. 2005.
30.
Zurück zum Zitat Potts C. On the negativity of negation. In: Semantics and Linguistic Theory, vol. 20. 2010. p. 636–59. Potts C. On the negativity of negation. In: Semantics and Linguistic Theory, vol. 20. 2010. p. 636–59.
31.
Zurück zum Zitat Nakov P, Kozareva Z, Ritter A, Rosenthal S, Stoyanov V, Wilson T. Semeval-2013 task 2: sentiment analysis in twitter. 2013. Nakov P, Kozareva Z, Ritter A, Rosenthal S, Stoyanov V, Wilson T. Semeval-2013 task 2: sentiment analysis in twitter. 2013.
32.
Zurück zum Zitat Staiano J, Guerini M. Depechemood: a lexicon for emotion analysis from crowd-annotated news. arXiv preprint arXiv:1405.1605. 2014. Staiano J, Guerini M. Depechemood: a lexicon for emotion analysis from crowd-annotated news. arXiv preprint arXiv:​1405.​1605. 2014.
33.
Zurück zum Zitat Badaro G, Jundi H, Hajj H, El-Hajj W. Emowordnet: automatic expansion of emotion lexicon using English wordnet. In: Proceedings of the Seventh Joint Conference on Lexical and Computational Semantics. 2018. p. 86–93. Badaro G, Jundi H, Hajj H, El-Hajj W. Emowordnet: automatic expansion of emotion lexicon using English wordnet. In: Proceedings of the Seventh Joint Conference on Lexical and Computational Semantics. 2018. p. 86–93.
34.
Zurück zum Zitat Wang L, Xia R. Sentiment lexicon construction with representation learning based on hierarchical sentiment supervision. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. 2017. p. 502–10. Wang L, Xia R. Sentiment lexicon construction with representation learning based on hierarchical sentiment supervision. In: Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing. 2017. p. 502–10.
35.
Zurück zum Zitat Tang D, Wei F, Qin B, Zhou M, Liu T. Building large-scale twitter-specific sentiment lexicon: a representation learning approach. In: Proceedings of Coling 2014, the 25th International Conference on Computational Linguistics: Technical Papers. 2014. p. 172–82. Tang D, Wei F, Qin B, Zhou M, Liu T. Building large-scale twitter-specific sentiment lexicon: a representation learning approach. In: Proceedings of Coling 2014, the 25th International Conference on Computational Linguistics: Technical Papers. 2014. p. 172–82.
36.
Zurück zum Zitat Vo DT, Zhang Y. Don’t count, predict! an automatic approach to learning sentiment lexicons for short text. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol. 2. 2016. p. 219–24. Vo DT, Zhang Y. Don’t count, predict! an automatic approach to learning sentiment lexicons for short text. In: Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, vol. 2. 2016. p. 219–24.
37.
Zurück zum Zitat Mohammad SM, Kiritchenko S, Zhu X. NRC-Canada: building the state-of-the-art in sentiment analysis of tweets. arXiv preprint arXiv:1308.6242. 2013. Mohammad SM, Kiritchenko S, Zhu X. NRC-Canada: building the state-of-the-art in sentiment analysis of tweets. arXiv preprint arXiv:​1308.​6242. 2013.
38.
Zurück zum Zitat Suttles J, Ide N. Distant supervision for emotion classification with discrete binary values. In: International Conference on Intelligent Text Processing and Computational Linguistics. Springer; 2013. p. 121–36. Suttles J, Ide N. Distant supervision for emotion classification with discrete binary values. In: International Conference on Intelligent Text Processing and Computational Linguistics. Springer; 2013. p. 121–36.
39.
Zurück zum Zitat Thelwall M, Buckley K, Paltoglou G. Sentiment strength detection for the social web. J Am Soc Inf Sci Technol. 2012;63(1):163–73.CrossRef Thelwall M, Buckley K, Paltoglou G. Sentiment strength detection for the social web. J Am Soc Inf Sci Technol. 2012;63(1):163–73.CrossRef
Metadaten
Titel
Automatically Constructing a Fine-Grained Sentiment Lexicon for Sentiment Analysis
verfasst von
Yabing Wang
Guimin Huang
Maolin Li
Yiqun Li
Xiaowei Zhang
Hui Li
Publikationsdatum
18.07.2022
Verlag
Springer US
Erschienen in
Cognitive Computation / Ausgabe 1/2023
Print ISSN: 1866-9956
Elektronische ISSN: 1866-9964
DOI
https://doi.org/10.1007/s12559-022-10043-1