Published in: Soft Computing 2/2019

1 November 2017 | Methodologies and Application

An improved algorithm for sentiment analysis based on maximum entropy

Authors: Xin Xie, Songlin Ge, Fengping Hu, Mingye Xie, Nan Jiang


Abstract

Sentiment analysis is an important field of study in natural language processing. Achieving high classification accuracy on massive, irregular data is a major challenge in the field. To address this problem, a novel maximum entropy-PLSA model is proposed. In this model, probabilistic latent semantic analysis (PLSA) is first used to extract seed emotion words from Wikipedia and the training corpus. Features are then extracted from these seed emotion words and used as input for training the maximum entropy model. The test set is processed in the same way and passed to the trained maximum entropy model for emotion classification. The training and test sets are divided by the K-fold method. The PLSA-based maximum entropy classifier uses important emotion classification features to classify words, such as the relevance of words and parts of speech in the context, the relevance to degree adverbs, and the similarity to benchmark emotion words. The experiments show that the proposed classification method outperforms the compared methods.
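The abstract gives no implementation details, so the following is only a minimal sketch of the two generic building blocks it names: a maximum entropy classifier (equivalent to multinomial logistic regression, where p(y|x) is proportional to exp(w_y · x)) and K-fold splitting. It is not the authors' method. The PLSA seed-word extraction step is omitted, and the three-column feature vectors (a bias term plus hypothetical similarity scores to positive and negative seed words), the function names, and the hyperparameters are all assumptions made for illustration.

```python
import math
import random

def k_fold_indices(n, k, seed=0):
    """Shuffle range(n) and split it into k folds of near-equal size."""
    idx = list(range(n))
    random.Random(seed).shuffle(idx)
    return [idx[i::k] for i in range(k)]

def predict_proba(W, x):
    """Softmax over per-class linear scores: p(c|x) proportional to exp(W[c] . x)."""
    scores = [sum(w * v for w, v in zip(row, x)) for row in W]
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]

def predict(W, x):
    """Return the index of the most probable class."""
    p = predict_proba(W, x)
    return max(range(len(p)), key=p.__getitem__)

def train_maxent(X, y, n_classes, epochs=200, lr=0.5):
    """Gradient ascent on the conditional log-likelihood (illustrative settings)."""
    n_feat = len(X[0])
    W = [[0.0] * n_feat for _ in range(n_classes)]
    for _ in range(epochs):
        grad = [[0.0] * n_feat for _ in range(n_classes)]
        for x, label in zip(X, y):
            p = predict_proba(W, x)
            for c in range(n_classes):
                # observed feature value minus its expectation under the model
                coef = (1.0 if c == label else 0.0) - p[c]
                for j, v in enumerate(x):
                    grad[c][j] += coef * v
        for c in range(n_classes):
            for j in range(n_feat):
                W[c][j] += lr * grad[c][j] / len(X)
    return W

def cross_validate(X, y, n_classes, k=5):
    """Train on k-1 folds, test on the held-out fold, and average the accuracy."""
    accuracies = []
    for held_out in k_fold_indices(len(X), k):
        test = set(held_out)
        train = [i for i in range(len(X)) if i not in test]
        W = train_maxent([X[i] for i in train], [y[i] for i in train], n_classes)
        correct = sum(predict(W, X[i]) == y[i] for i in held_out)
        accuracies.append(correct / len(held_out))
    return sum(accuracies) / k

# Hypothetical toy features: [bias, similarity to positive seeds, similarity to negative seeds]
X = [[1.0, 0.9 - 0.01 * i, 0.1 + 0.01 * i] for i in range(10)] + \
    [[1.0, 0.1 + 0.01 * i, 0.8 - 0.01 * i] for i in range(10)]
y = [1] * 10 + [0] * 10  # 1 = positive, 0 = negative

print("mean 5-fold accuracy:", cross_validate(X, y, n_classes=2, k=5))
```

In a real pipeline the toy vectors would be replaced by features derived from the seed emotion words, and a tested library implementation of logistic regression would usually be preferable to this hand-rolled trainer.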


Metadata
Title
An improved algorithm for sentiment analysis based on maximum entropy
Authors
Xin Xie
Songlin Ge
Fengping Hu
Mingye Xie
Nan Jiang
Publication date
1 November 2017
Publisher
Springer Berlin Heidelberg
Published in
Soft Computing / Issue 2/2019
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-017-2904-0
