Skip to main content
Erschienen in: International Journal of Machine Learning and Cybernetics 2/2021

11.08.2020 | Original Article

Cross-domain sentiment aware word embeddings for review sentiment analysis

verfasst von: Jun Liu, Shuang Zheng, Guangxia Xu, Mingwei Lin

Erschienen in: International Journal of Machine Learning and Cybernetics | Ausgabe 2/2021

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Learning low-dimensional vector representations of words from a large corpus is one of the basic tasks in natural language processing (NLP). The existing universal word embedding model learns word vectors mainly through grammar and semantic information from the context, while ignoring the sentiment information contained in the words. Some approaches, although they model sentiment information in the reviews, do not consider certain words in different domains. In a case where the emotion changes, if the general word vector is directly applied to the review sentiment analysis task, then this will inevitably affect the performance of the sentiment classification. To solve this problem, this paper extends the CBoW (continuous bag-of-words) word vector model and proposes a cross-domain sentiment aware word embedding learning model, which can capture the sentiment information and domain relevance of a word at the same time. This paper conducts several experiments on Amazon user review data in different domains to evaluate the performance of the model. The experimental results show that the proposed model can obtain a nearly 2% accuracy improvement compared with the general word vector when modeling only the sentiment information of the context. At the same time, when the domain information and the sentiment information are both included, the accuracy and Macro-F1 value of the sentiment classification tasks are significantly improved compared with existing sentiment word embeddings.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Weitere Produktempfehlungen anzeigen
Literatur
1.
Zurück zum Zitat Hu S, Zou L, Yu J, Wang H (2018) Answering natural language questions by subgraph matching over knowledge graphs. IEEE Trans Knowl Data Eng 30(5):824–837CrossRef Hu S, Zou L, Yu J, Wang H (2018) Answering natural language questions by subgraph matching over knowledge graphs. IEEE Trans Knowl Data Eng 30(5):824–837CrossRef
2.
Zurück zum Zitat Mikolov T, Sutskever I, Chen K et al (2013) Distributed representations of words and phrases and their compositionality. Adv Neural Inf Process Syst Stateline Curran Assoc 26:3111–3119 Mikolov T, Sutskever I, Chen K et al (2013) Distributed representations of words and phrases and their compositionality. Adv Neural Inf Process Syst Stateline Curran Assoc 26:3111–3119
3.
Zurück zum Zitat Moreno E, Gonzalez R (2016) Automatic algorithm to classify and locate research papers using natural language. IEEE Latin Am Trans 14(3):1367–1371CrossRef Moreno E, Gonzalez R (2016) Automatic algorithm to classify and locate research papers using natural language. IEEE Latin Am Trans 14(3):1367–1371CrossRef
4.
Zurück zum Zitat Almuhareb A, Alsanie W (2019) Arabic word segmentation with long short-term memory neural networks and word embedding. IEEE Access 7:12879–12887CrossRef Almuhareb A, Alsanie W (2019) Arabic word segmentation with long short-term memory neural networks and word embedding. IEEE Access 7:12879–12887CrossRef
5.
Zurück zum Zitat Mills M, Bourbakis N (2014) Graph-based methods for natural language processing and understanding—a survey and analysis. IEEE Trans Syst Man Cybern Syst 44(1):59–71CrossRef Mills M, Bourbakis N (2014) Graph-based methods for natural language processing and understanding—a survey and analysis. IEEE Trans Syst Man Cybern Syst 44(1):59–71CrossRef
6.
Zurück zum Zitat Bollegala D, Mu T, Goulermas JY (2016) Cross-domain sentiment classification using sentiment sensitive embeddings. IEEE Trans Knowl Data Eng 28(2):398–410CrossRef Bollegala D, Mu T, Goulermas JY (2016) Cross-domain sentiment classification using sentiment sensitive embeddings. IEEE Trans Knowl Data Eng 28(2):398–410CrossRef
7.
Zurück zum Zitat LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444CrossRef LeCun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444CrossRef
8.
Zurück zum Zitat Le A, Clanuwat T, Kitamoto A (2019) A human-inspired recognition system for pre-modern japanese historical documents. IEEE Access 7:84163–84169CrossRef Le A, Clanuwat T, Kitamoto A (2019) A human-inspired recognition system for pre-modern japanese historical documents. IEEE Access 7:84163–84169CrossRef
9.
Zurück zum Zitat Dong L, Wei F, Xu K, Liu S, Zhou M (2016) Adaptive multi-compositionality for recursive neural network models. IEEE Trans Audio Speech Lang Process 24(3):422–431CrossRef Dong L, Wei F, Xu K, Liu S, Zhou M (2016) Adaptive multi-compositionality for recursive neural network models. IEEE Trans Audio Speech Lang Process 24(3):422–431CrossRef
10.
Zurück zum Zitat Hassan A, Mahmood A (2018) Convolutional recurrent deep learning model for sentence classification. IEEE Access 6:13949–13957CrossRef Hassan A, Mahmood A (2018) Convolutional recurrent deep learning model for sentence classification. IEEE Access 6:13949–13957CrossRef
11.
Zurück zum Zitat Schouten K, Frasincar F (2016) Survey on aspect-level sentiment analysis. IEEE Trans Knowl Data Eng 28(3):813–830CrossRef Schouten K, Frasincar F (2016) Survey on aspect-level sentiment analysis. IEEE Trans Knowl Data Eng 28(3):813–830CrossRef
12.
Zurück zum Zitat Er MJ, Zhang Y, Wang N et al (2016) Attention pooling-based convolutional neural network for sentence modelling. Inf Sci 373:388–403CrossRef Er MJ, Zhang Y, Wang N et al (2016) Attention pooling-based convolutional neural network for sentence modelling. Inf Sci 373:388–403CrossRef
13.
Zurück zum Zitat Tang D, Wei F, Qin B, Yang N (2016) Sentiment embeddings with applications to sentiment analysis. IEEE Trans Knowl Data Eng 28(2):496–509CrossRef Tang D, Wei F, Qin B, Yang N (2016) Sentiment embeddings with applications to sentiment analysis. IEEE Trans Knowl Data Eng 28(2):496–509CrossRef
14.
Zurück zum Zitat Bengio Y, Courville A, Vincent P (2013) Representation learning: a review and new perspectives. IEEE Trans Pattern Anal Mach Intell 35(8):1798–1828CrossRef Bengio Y, Courville A, Vincent P (2013) Representation learning: a review and new perspectives. IEEE Trans Pattern Anal Mach Intell 35(8):1798–1828CrossRef
15.
Zurück zum Zitat Bengio Y, Ducharme R, Vincent P et al (2003) A neural probabilistic language model. J Mach Learn Res 3(Feb):1137–1155 Bengio Y, Ducharme R, Vincent P et al (2003) A neural probabilistic language model. J Mach Learn Res 3(Feb):1137–1155
17.
Zurück zum Zitat Dong X, Dong J (2018) The visual word booster: a spatial layout of words descriptor exploiting contour cues. IEEE Trans Image Process 27(8):3904–3917MathSciNetCrossRef Dong X, Dong J (2018) The visual word booster: a spatial layout of words descriptor exploiting contour cues. IEEE Trans Image Process 27(8):3904–3917MathSciNetCrossRef
18.
Zurück zum Zitat Duyu T, Furu W, Bing Q et al (2016) Sentiment embeddings with applications to sentiment analysis. IEEE Trans Knowl Data Eng 28(2):496–509CrossRef Duyu T, Furu W, Bing Q et al (2016) Sentiment embeddings with applications to sentiment analysis. IEEE Trans Knowl Data Eng 28(2):496–509CrossRef
19.
Zurück zum Zitat Deng D, Jing L, Yu J, Sun S (2019) Sparse self-attention LSTM for sentiment lexicon construction. IEEE/ACM Trans Audio Speech Lang Process 27(11):704–718CrossRef Deng D, Jing L, Yu J, Sun S (2019) Sparse self-attention LSTM for sentiment lexicon construction. IEEE/ACM Trans Audio Speech Lang Process 27(11):704–718CrossRef
20.
Zurück zum Zitat Rida-E-Fatima S, Javed A, Banjar A (2019) A multi-layer dual attention deep learning model with refined word embeddings for aspect-based sentiment analysis. IEEE Access 7:114795–114807CrossRef Rida-E-Fatima S, Javed A, Banjar A (2019) A multi-layer dual attention deep learning model with refined word embeddings for aspect-based sentiment analysis. IEEE Access 7:114795–114807CrossRef
21.
Zurück zum Zitat Sarma PK, Liang Y, Sethares WA (2018) Domain adapted word embeddings for improved sentiment classification. In: Proceedings of the 56th Annual Meeting of the Association for computational linguistics (short Papers). ACL Press, Melbourne, pp 534–539 Sarma PK, Liang Y, Sethares WA (2018) Domain adapted word embeddings for improved sentiment classification. In: Proceedings of the 56th Annual Meeting of the Association for computational linguistics (short Papers). ACL Press, Melbourne, pp 534–539
22.
Zurück zum Zitat Y. Hao, T. Mu, R. Hong, M. Wang (2019) Cross-domain sentiment encoding through stochastic word embedding. IEEE Trans Knowl Data Eng Y. Hao, T. Mu, R. Hong, M. Wang (2019) Cross-domain sentiment encoding through stochastic word embedding. IEEE Trans Knowl Data Eng
24.
Zurück zum Zitat Lu W, Hai LC, Lofgren J (2016) A general regularization framework for domain adaptation. In: Proceedings of the 2016 Conference on empirical methods in natural language processing. ACL Press, Austin, pp 950–954 Lu W, Hai LC, Lofgren J (2016) A general regularization framework for domain adaptation. In: Proceedings of the 2016 Conference on empirical methods in natural language processing. ACL Press, Austin, pp 950–954
25.
Zurück zum Zitat McAuley J, Targett C, Shi Q et al (2015) Image-based recommendations on styles and substitutes. In: Proceedings of the 38th International ACM SIGIR conference on research and development in information retrieval. ACM Press, Shanghai, pp 43–52 McAuley J, Targett C, Shi Q et al (2015) Image-based recommendations on styles and substitutes. In: Proceedings of the 38th International ACM SIGIR conference on research and development in information retrieval. ACM Press, Shanghai, pp 43–52
26.
Zurück zum Zitat Maaten L, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Res 9:2579–2605MATH Maaten L, Hinton G (2008) Visualizing data using t-SNE. J Mach Learn Res 9:2579–2605MATH
27.
Zurück zum Zitat Xiong S, Lv H, Zhao W et al (2018) Towards Twitter sentiment classification by multi-level sentiment-enriched word embeddings. Neurocomputing 278:2459–2466CrossRef Xiong S, Lv H, Zhao W et al (2018) Towards Twitter sentiment classification by multi-level sentiment-enriched word embeddings. Neurocomputing 278:2459–2466CrossRef
28.
Zurück zum Zitat Lin M, Xu Z, Yao Z (2018) Multi-attribute group decision-making under probabilistic uncertain linguistic environment. J Oper Res Soc 69(2):157–170CrossRef Lin M, Xu Z, Yao Z (2018) Multi-attribute group decision-making under probabilistic uncertain linguistic environment. J Oper Res Soc 69(2):157–170CrossRef
29.
Zurück zum Zitat Lin M, Chen Z, Liao H, Xu Z (2019) ELECTRE II method to deal with probabilistic linguistic term sets and its application to edge computing. Nonlinear Dyn 96(3):2125–2143CrossRef Lin M, Chen Z, Liao H, Xu Z (2019) ELECTRE II method to deal with probabilistic linguistic term sets and its application to edge computing. Nonlinear Dyn 96(3):2125–2143CrossRef
30.
Zurück zum Zitat Garg H, Kumar K (2019) Prioritized aggregation operators based on linguistic connection number for multiple attribute group decision-making under linguistic intuitionistic fuzzy environment. ICSES Trans Neural Fuzzy Comput 2(3):1–15 Garg H, Kumar K (2019) Prioritized aggregation operators based on linguistic connection number for multiple attribute group decision-making under linguistic intuitionistic fuzzy environment. ICSES Trans Neural Fuzzy Comput 2(3):1–15
31.
Zurück zum Zitat Wu XL, Liao HC (2019) Comparison analysis between DNMA method and other MCDM methods. ICSES Trans Neural Fuzzy Comput 2(1):4–10 Wu XL, Liao HC (2019) Comparison analysis between DNMA method and other MCDM methods. ICSES Trans Neural Fuzzy Comput 2(1):4–10
Metadaten
Titel
Cross-domain sentiment aware word embeddings for review sentiment analysis
verfasst von
Jun Liu
Shuang Zheng
Guangxia Xu
Mingwei Lin
Publikationsdatum
11.08.2020
Verlag
Springer Berlin Heidelberg
Erschienen in
International Journal of Machine Learning and Cybernetics / Ausgabe 2/2021
Print ISSN: 1868-8071
Elektronische ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-020-01175-7

Weitere Artikel der Ausgabe 2/2021

International Journal of Machine Learning and Cybernetics 2/2021 Zur Ausgabe

Neuer Inhalt