
2016 | Original Paper | Book Chapter

Twitter Feature Selection and Classification Using Support Vector Machine for Aspect-Based Sentiment Analysis

Authors: Nurulhuda Zainuddin, Ali Selamat, Roliana Ibrahim

Published in: Trends in Applied Knowledge-Based Systems and Data Science

Publisher: Springer International Publishing


Abstract

To address the accuracy problem in aspect-based sentiment classification, we propose a feature selection method based on Principal Component Analysis (PCA) that determines the most relevant set of features for aspect-based sentiment classification on Twitter. Feature selection reduces redundant features and removes irrelevant ones, both of which degrade classifier accuracy. PCA is combined with a SentiWordNet lexicon-based method, which is incorporated into a Support Vector Machine (SVM) learning framework to perform the classification. Experiments on our own Hate Crime Twitter Sentiment (HCTS) dataset and the benchmark Stanford Twitter Sentiment (STS) dataset yield accuracies of 94.53% and 97.93%, respectively. Comparisons with other statistical feature selection methods show that the proposed approach gives promising results in improving aspect-based sentiment classification performance.
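The general idea of reducing features with PCA before SVM classification can be illustrated with a minimal sketch. This is not the authors' implementation: the toy tweets, the plain term-count features, the power-iteration PCA, the subgradient-trained linear SVM, and all function names and hyperparameters are assumptions made for illustration, and the SentiWordNet lexicon step described in the abstract is omitted entirely.

```python
import math
import random

def bag_of_words(docs):
    """Turn whitespace-tokenised documents into term-count vectors."""
    vocab = sorted({w for d in docs for w in d.lower().split()})
    index = {w: i for i, w in enumerate(vocab)}
    rows = []
    for d in docs:
        row = [0.0] * len(vocab)
        for w in d.lower().split():
            row[index[w]] += 1.0
        rows.append(row)
    return rows

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def fit_pca(X, k, iters=100, seed=0):
    """Estimate the top-k principal directions by power iteration with deflation."""
    n = len(X)
    mean = [sum(col) / n for col in zip(*X)]
    Xc = [[x - m for x, m in zip(row, mean)] for row in X]
    rng = random.Random(seed)
    comps = []
    for _ in range(k):
        v = [rng.gauss(0.0, 1.0) for _ in mean]
        for _ in range(iters):
            # w = Xc^T (Xc v), i.e. the (unnormalised) covariance matrix times v
            scores = [dot(row, v) for row in Xc]
            w = [dot(col, scores) for col in zip(*Xc)]
            # Remove the directions that were already extracted
            for c in comps:
                proj = dot(w, c)
                w = [a - proj * b for a, b in zip(w, c)]
            norm = math.sqrt(dot(w, w)) or 1.0
            v = [a / norm for a in w]
        comps.append(v)
    return mean, comps

def project(X, mean, comps):
    """Center each row and express it in the reduced component space."""
    return [[dot([x - m for x, m in zip(row, mean)], c) for c in comps]
            for row in X]

def train_linear_svm(Z, y, lam=0.01, lr=0.1, epochs=100):
    """Linear SVM trained by subgradient descent on the regularised hinge loss."""
    w, b = [0.0] * len(Z[0]), 0.0
    for _ in range(epochs):
        for z, t in zip(Z, y):
            if t * (dot(w, z) + b) < 1:   # inside the margin: hinge term is active
                w = [wi - lr * (lam * wi - t * zi) for wi, zi in zip(w, z)]
                b += lr * t
            else:                         # outside the margin: only weight decay
                w = [wi - lr * lam * wi for wi in w]
    return w, b

def predict(z, w, b):
    return 1 if dot(w, z) + b >= 0 else -1

# Hypothetical toy corpus: +1 = positive sentiment, -1 = negative sentiment
docs = ["good great service", "great good staff", "good great",
        "bad awful service", "awful bad staff", "bad awful"]
labels = [1, 1, 1, -1, -1, -1]

X = bag_of_words(docs)            # 6 term-count features per tweet
mean, comps = fit_pca(X, k=2)     # keep only 2 principal components
Z = project(X, mean, comps)
w, b = train_linear_svm(Z, labels)
predictions = [predict(z, w, b) for z in Z]
print(predictions)
```

In this toy corpus the first principal component aligns with the contrast between positive and negative words, so two components are enough for the linear SVM to separate the training tweets. On real data the number of retained components would have to be chosen empirically, and accuracy should be measured on held-out tweets rather than on the training set.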


Metadata
Title
Twitter Feature Selection and Classification Using Support Vector Machine for Aspect-Based Sentiment Analysis
Authors
Nurulhuda Zainuddin
Ali Selamat
Roliana Ibrahim
Copyright Year
2016
DOI
https://doi.org/10.1007/978-3-319-42007-3_23