Skip to main content
Top

2018 | OriginalPaper | Chapter

Term Co-occurrence Based Feature Selection for Sentiment Classification

Authors : Sudarshan S. Sonawane, Satish R. Kolhe

Published in: Smart and Innovative Trends in Next Generation Computing Technologies

Publisher: Springer Singapore

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

In this paper, the strategy of feature selection for sentiment classification explored and compared to other significant feature selection strategies found in contemporary literature. The feature selection models performed using the statistical measure of t-score and z-score. SVM, NB and AdaBoost classifiers used for classification and compared. The objective of the paper is to explore and evaluate the scope of statistical measures for identifying the optimal features and its significance to classify the opinion using divergent classifiers. Performance analysis carried out on varied datasets with diverse range like the movie reviews, product reviews and tweets, the experiments carried out on feature selection strategies proposed and other strategies found in literature. From the results of the experimental studies, it is evident that optimal features selected using t-score and z-score are robust and outperformed the other feature selection strategies. In order to assess the significance of the feature selection models proposed, the classification process carried out using three classifiers called SVM, NB and AdaBoost. The classification accuracy about the features obtain by proposed models is much higher that compared to the classification accuracy obtained for the features selected by other contemporary models. Among the three classifiers that used to assess classification accuracy, AdaBoost has outperformed the other two models of SVM and NB.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Liu, B.: Sentiment analysis and subjectivity. In: Handbook of Natural Language Processing, 2nd edn, pp. 627–666. CRC Press, Taylor and Francis Group (2010) Liu, B.: Sentiment analysis and subjectivity. In: Handbook of Natural Language Processing, 2nd edn, pp. 627–666. CRC Press, Taylor and Francis Group (2010)
2.
go back to reference Verma, S., Bhattacharyya, P.: Incorporating semantic knowledge for sentiment analysis. In: Proceedings of 6th International Conference on Natural Language Processing (2009) Verma, S., Bhattacharyya, P.: Incorporating semantic knowledge for sentiment analysis. In: Proceedings of 6th International Conference on Natural Language Processing (2009)
4.
go back to reference Vinodhini, G., Chandrasekaran, R.M.: Performance evaluation of machine learning classifiers in sentiment mining. Int. J. Comput. Trends Technol. 4(6), 1783–1786 (2013) Vinodhini, G., Chandrasekaran, R.M.: Performance evaluation of machine learning classifiers in sentiment mining. Int. J. Comput. Trends Technol. 4(6), 1783–1786 (2013)
7.
go back to reference Ghosh, M., Kar, A.: Unsupervised linguistic approach for sentiment classification from online reviews using SentiWordNet 3.0. Int. J. Eng. Res. Technol. 2(9), 55–60 (2013) Ghosh, M., Kar, A.: Unsupervised linguistic approach for sentiment classification from online reviews using SentiWordNet 3.0. Int. J. Eng. Res. Technol. 2(9), 55–60 (2013)
10.
go back to reference Kim, Y., Zhang, O.: Credibility adjusted term frequency: a supervised term weighting scheme for sentiment analysis and text classification. In: Proceedings of 5th Workshop of Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pp. 79–83 (2014) Kim, Y., Zhang, O.: Credibility adjusted term frequency: a supervised term weighting scheme for sentiment analysis and text classification. In: Proceedings of 5th Workshop of Computational Approaches to Subjectivity, Sentiment and Social Media Analysis, pp. 79–83 (2014)
13.
go back to reference Nie, F., Huang, H., Cai, X., Ding, C.: Efficient and robust feature selection via joint ℓ2, 1-norms minimization. In: Proceedings of the 23rd International Conference on Neural Information Processing Systems, pp. 1813–1821 (2010) Nie, F., Huang, H., Cai, X., Ding, C.: Efficient and robust feature selection via joint ℓ2, 1-norms minimization. In: Proceedings of the 23rd International Conference on Neural Information Processing Systems, pp. 1813–1821 (2010)
18.
go back to reference Murphy, K.P.: Naive Bayes Classifiers. University of British Columbia, Vancouver (2006) Murphy, K.P.: Naive Bayes Classifiers. University of British Columbia, Vancouver (2006)
21.
go back to reference Kummer, O., Savoy, J.: Feature selection in sentiment analysis. In: CORIA, Bordeaux, France, pp. 273–284 (2012) Kummer, O., Savoy, J.: Feature selection in sentiment analysis. In: CORIA, Bordeaux, France, pp. 273–284 (2012)
22.
go back to reference Sahoo, P.K., Riedel, T.: Mean Value Theorems and Functional Equations. World Scientific, Singapore (1998)CrossRef Sahoo, P.K., Riedel, T.: Mean Value Theorems and Functional Equations. World Scientific, Singapore (1998)CrossRef
26.
go back to reference Ihaka, R., Gentleman, R.: R: a language for data analysis and graphics. J. Comput. Graph. Stat. 5(3), 299–314 (1996) Ihaka, R., Gentleman, R.: R: a language for data analysis and graphics. J. Comput. Graph. Stat. 5(3), 299–314 (1996)
Metadata
Title
Term Co-occurrence Based Feature Selection for Sentiment Classification
Authors
Sudarshan S. Sonawane
Satish R. Kolhe
Copyright Year
2018
Publisher
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-8657-1_31

Premium Partner