Skip to main content
Erschienen in: Wireless Personal Communications 2/2022

13.03.2022

Information Gain Based Feature Selection for Improved Textual Sentiment Analysis

verfasst von: Madhumathi Ramasamy, A. Meena Kowshalya

Erschienen in: Wireless Personal Communications | Ausgabe 2/2022

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Sentiment analysis or opinion mining is the process of mining the emotion from a given text. It is a text mining technique that effectively measures the inclination of public opinions and aids in analysing the subjective information from the given context. Sentiment analysis evaluates the opinion of a sentiment as either positive or negative or neutral. Sentiments are very specific and with respect to the underlying content, it plays a very crucial role in depicting the real-world scenario. Sentiment analysis can be performed at three levels namely document level, sentence level and feature level. This paper proposes a novel Information Gain based Feature Selection algorithm that selects highly correlated features by removing inappropriate content. Using this algorithm, extensive sentimental analysis is performed at the document level, sentence level and feature level. Datasets from Cornell and Kaggle are exploited for experimental purposes. Compared to other baseline classifiers experimental results show that the proposed Information Gain based classifier resulted in an accuracy of 95, 96.3 and 97.4% for document, sentence and feature levels respectively. The proposed method is also tested with higher dimensional datasets namely Movielens 1M, 10M and 25M datasets. Experimental results proved that the proposed method works better even for high dimensional datasets.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
6.
Zurück zum Zitat Vamsi, B., Suneetha, N., Sudhakar, Ch., & Amaravati, K. (2017). Sentiment analysis on online reviews using supervised learning: A survey. International Journal of Control Theory and Applications, 10(30), 143–152. Vamsi, B., Suneetha, N., Sudhakar, Ch., & Amaravati, K. (2017). Sentiment analysis on online reviews using supervised learning: A survey. International Journal of Control Theory and Applications, 10(30), 143–152.
13.
Zurück zum Zitat Shirsat, V., Jagdale, R., Shende, K., Deshmukh, S. N., & Kawale, S. (2019). Sentence level sentiment analysis from news articles and blogs using machine learning techniques. International Journal of Computer Sciences and Engineering, 7(5), 1–6.CrossRef Shirsat, V., Jagdale, R., Shende, K., Deshmukh, S. N., & Kawale, S. (2019). Sentence level sentiment analysis from news articles and blogs using machine learning techniques. International Journal of Computer Sciences and Engineering, 7(5), 1–6.CrossRef
14.
Zurück zum Zitat Rintyarna, B. S., Sarno, R., & Fatichah, C. (2019). Evaluating the performance of sentence level features and domain sensitive features of product reviews on supervised sentiment analysis tasks. Journal of Big Data, 6, 1–19.CrossRef Rintyarna, B. S., Sarno, R., & Fatichah, C. (2019). Evaluating the performance of sentence level features and domain sensitive features of product reviews on supervised sentiment analysis tasks. Journal of Big Data, 6, 1–19.CrossRef
15.
Zurück zum Zitat Schouten, K., Frasincar, F., & R. Dekker, R., (2016). An information gain-driven feature study for aspect-based sentiment analysis. In: Proceedings of International Conference on Applications of Natural Language to Information Systems, pp. 48–59, 2016. https://doi.org/10.1007/978-3-319-41754-7_5. Schouten, K., Frasincar, F., & R. Dekker, R., (2016). An information gain-driven feature study for aspect-based sentiment analysis. In: Proceedings of International Conference on Applications of Natural Language to Information Systems, pp. 48–59, 2016. https://​doi.​org/​10.​1007/​978-3-319-41754-7_​5.
16.
Zurück zum Zitat Franky, & Manurung, R. (2008). “Machine Learning-based Sentiment Analysis of Automatic Indonesian Translations of English Movie Reviews. In: Proceedings of International Conference on Advanced Computational Intelligence and Its Applications (ICACIA), Depok, Indonesia, Jan, 2008. Franky, & Manurung, R. (2008). “Machine Learning-based Sentiment Analysis of Automatic Indonesian Translations of English Movie Reviews. In: Proceedings of International Conference on Advanced Computational Intelligence and Its Applications (ICACIA), Depok, Indonesia, Jan, 2008.
17.
Zurück zum Zitat Singh, M., & Gupta, S. (2020). Sentiment analysis using Naive Bayes classifier and information gain feature selection over twitter. International Journal of Computer Trends and Technology, 68(5), 84–91.CrossRef Singh, M., & Gupta, S. (2020). Sentiment analysis using Naive Bayes classifier and information gain feature selection over twitter. International Journal of Computer Trends and Technology, 68(5), 84–91.CrossRef
19.
Zurück zum Zitat Ikonomakis, M., Kotsiantis, S., & Tampakas, V. (2005). Text classification using machine learning techniques. SEAS Transactions on Computers, 4(8), 966–974. Ikonomakis, M., Kotsiantis, S., & Tampakas, V. (2005). Text classification using machine learning techniques. SEAS Transactions on Computers, 4(8), 966–974.
20.
Zurück zum Zitat Raza, H., Faizan, M., Hamza, A., Mushtaq, A., & Akhtar, N. (2019). Scientific text sentiment analysis using machine learning techniques. International Journal of Advanced Computer Science and Applications, 10(12), 157–165.CrossRef Raza, H., Faizan, M., Hamza, A., Mushtaq, A., & Akhtar, N. (2019). Scientific text sentiment analysis using machine learning techniques. International Journal of Advanced Computer Science and Applications, 10(12), 157–165.CrossRef
21.
Zurück zum Zitat Yu, H., & Hatzivassiloglou, V. (2003). “Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences. In: Proceedings of Conference on Empirical Methods in Natural Language Processing, Jul, 2003. https://doi.org/10.3115/1119355.1119372. Yu, H., & Hatzivassiloglou, V. (2003). “Towards Answering Opinion Questions: Separating Facts from Opinions and Identifying the Polarity of Opinion Sentences. In: Proceedings of Conference on Empirical Methods in Natural Language Processing, Jul, 2003. https://​doi.​org/​10.​3115/​1119355.​1119372.
23.
Zurück zum Zitat dos Santos, C.N., & M. Gatti, M. (2014). “Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts”. In: Proceedings of 25th International Conference on Computational Linguistics: Technical Papers, COLING, pp. 69-78, Aug, 2014. http://creativecommons.org/licenses/by/4.0. dos Santos, C.N., & M. Gatti, M. (2014). “Deep Convolutional Neural Networks for Sentiment Analysis of Short Texts”. In: Proceedings of 25th International Conference on Computational Linguistics: Technical Papers, COLING, pp. 69-78, Aug, 2014. http://​creativecommons.​org/​licenses/​by/​4.​0.
24.
Zurück zum Zitat Kouloumpis, E., Wilson, T. & Moore, J. (2011). “Twitter Sentiment Analysis: The Good the Bad and the OMG!”. In: Proceedings of International Conference on Weblogs and Social Media, vol. 2, Jul, 2011. Kouloumpis, E., Wilson, T. & Moore, J. (2011). “Twitter Sentiment Analysis: The Good the Bad and the OMG!”. In: Proceedings of International Conference on Weblogs and Social Media, vol. 2, Jul, 2011.
25.
Zurück zum Zitat Alshamsi, A., Bayari, R., & Salloum, S. (2020). Sentiment Analysis in English Texts. Advances in Science, Technology and Engineering Systems Journal, 5(6), 1683–1689.CrossRef Alshamsi, A., Bayari, R., & Salloum, S. (2020). Sentiment Analysis in English Texts. Advances in Science, Technology and Engineering Systems Journal, 5(6), 1683–1689.CrossRef
26.
Zurück zum Zitat Agarwal, A., Biadsy, F., & Mckeown, K.R. (2009). Contextual phrase-level polarity analysis using lexical affect scoring and syntactic N-grams. In: Proceedings of the 12th Conference of the European Chapter of the ACL, pp. 24–32, Mar, 2009. https://doi.org/10.3115/1609067.1609069. Agarwal, A., Biadsy, F., & Mckeown, K.R. (2009). Contextual phrase-level polarity analysis using lexical affect scoring and syntactic N-grams. In: Proceedings of the 12th Conference of the European Chapter of the ACL, pp. 24–32, Mar, 2009. https://​doi.​org/​10.​3115/​1609067.​1609069.
28.
Zurück zum Zitat Singh, V.K., Piryani, R., Uddin, A., & Waila, P. (2013). “Sentiment analysis of movie reviews: A new feature-based heuristic for aspect-level sentiment classification”. In: Proceedings of the International Multi-Conference on Automation, Computing, Communication, Control and Compressed Sensing (iMac4s), Mar, 2013. https://doi.org/10.1109/iMac4s.2013.6526500. Singh, V.K., Piryani, R., Uddin, A., & Waila, P. (2013). “Sentiment analysis of movie reviews: A new feature-based heuristic for aspect-level sentiment classification”. In: Proceedings of the International Multi-Conference on Automation, Computing, Communication, Control and Compressed Sensing (iMac4s), Mar, 2013. https://​doi.​org/​10.​1109/​iMac4s.​2013.​6526500.
29.
Zurück zum Zitat Zhang, Y., Lai, G., Zhang, M., Zhang, Y., Liu, Y., Ma, S., (2014). “Explicit factor models for explainable recommendation based on phrase-level sentiment analysis”. In: Proceedings of the. 37th International ACM SIGIR conference on Research & development in information retrieval, pp. 83–92, Jul, 2014. https://doi.org/10.1145/2600428.2609579. Zhang, Y., Lai, G., Zhang, M., Zhang, Y., Liu, Y., Ma, S., (2014). “Explicit factor models for explainable recommendation based on phrase-level sentiment analysis”. In: Proceedings of the. 37th International ACM SIGIR conference on Research & development in information retrieval, pp. 83–92, Jul, 2014. https://​doi.​org/​10.​1145/​2600428.​2609579.
32.
Zurück zum Zitat Stone, P. J., Dunphy, D., Smith, M. S., & Ogilvie, D. M. (1966). The general inquirer: A computer approach to content analysis. The MIT Press. Stone, P. J., Dunphy, D., Smith, M. S., & Ogilvie, D. M. (1966). The general inquirer: A computer approach to content analysis. The MIT Press.
Metadaten
Titel
Information Gain Based Feature Selection for Improved Textual Sentiment Analysis
verfasst von
Madhumathi Ramasamy
A. Meena Kowshalya
Publikationsdatum
13.03.2022
Verlag
Springer US
Erschienen in
Wireless Personal Communications / Ausgabe 2/2022
Print ISSN: 0929-6212
Elektronische ISSN: 1572-834X
DOI
https://doi.org/10.1007/s11277-022-09597-y

Weitere Artikel der Ausgabe 2/2022

Wireless Personal Communications 2/2022 Zur Ausgabe

Neuer Inhalt