Skip to main content
Top

2018 | OriginalPaper | Chapter

SVM Accuracy and Training Speed Trade-Off in Sentiment Analysis Tasks

Authors : Konstantinas Korovkinas, Paulius Danėnas, Gintautas Garšva

Published in: Information and Software Technologies

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

SVM technique is one of the best techniques to classify data, but it has a slow performance in the big data arrays. This paper introduces the method to improve the speed of SVM classification in sentiment analysis by reducing the training set. The method was tested on the Stanford Twitter sentiment corpus dataset and Amazon customer reviews dataset. The results show that the execution time of the introduced method outperforms the standard SVM classification method.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Boser, B.E., Guyon, I.M., Vapnik, V.N.: A training algorithm for optimal margin classifiers. In: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 144–152 (1992) Boser, B.E., Guyon, I.M., Vapnik, V.N.: A training algorithm for optimal margin classifiers. In: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 144–152 (1992)
2.
go back to reference Chorowski, J., Wang, J., Zurada, J.M.: Review and performance comparison of SVM-and ELM-based classifiers. Neurocomputing 128, 507–516 (2014)CrossRef Chorowski, J., Wang, J., Zurada, J.M.: Review and performance comparison of SVM-and ELM-based classifiers. Neurocomputing 128, 507–516 (2014)CrossRef
3.
go back to reference Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)MATH Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)MATH
4.
go back to reference Damaševičius, R.: Optimization of SVM parameters for recognition of regulatory DNA sequences. Top 18(2), 339–353 (2010)MathSciNetCrossRef Damaševičius, R.: Optimization of SVM parameters for recognition of regulatory DNA sequences. Top 18(2), 339–353 (2010)MathSciNetCrossRef
5.
go back to reference Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res. 9(Aug), 1871–1874 (2008)MATH Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res. 9(Aug), 1871–1874 (2008)MATH
6.
go back to reference Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford, 1, 12 (2009) Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford, 1, 12 (2009)
7.
go back to reference Guo, L., Boukir, S.: Fast data selection for SVM training using ensemble margin. Pattern Recogn. Lett. 51, 112–119 (2015)CrossRef Guo, L., Boukir, S.: Fast data selection for SVM training using ensemble margin. Pattern Recogn. Lett. 51, 112–119 (2015)CrossRef
8.
go back to reference Graf, H.P., Cosatto, E., Bottou, L., Dourdanovic, I., Vapnik, V.: Parallel support vector machines: the cascade SVM. In: Advances in Neural Information Processing Systems, pp. 521–528 (2005) Graf, H.P., Cosatto, E., Bottou, L., Dourdanovic, I., Vapnik, V.: Parallel support vector machines: the cascade SVM. In: Advances in Neural Information Processing Systems, pp. 521–528 (2005)
9.
go back to reference Khairnar, J., Kinikar, M.: Machine learning algorithms for opinion mining and sentiment classification. Int. J. Sci. Res. Publ. 3(6), 1–6 (2013) Khairnar, J., Kinikar, M.: Machine learning algorithms for opinion mining and sentiment classification. Int. J. Sci. Res. Publ. 3(6), 1–6 (2013)
10.
11.
go back to reference Korovkinas, K., Danėnas, P., Garšva, G.: SVM and Naïve Bayes classification ensemble method for sentiment analysis. Baltic J. Modern Comput. 5(4), 398–409 (2017)CrossRef Korovkinas, K., Danėnas, P., Garšva, G.: SVM and Naïve Bayes classification ensemble method for sentiment analysis. Baltic J. Modern Comput. 5(4), 398–409 (2017)CrossRef
13.
go back to reference Lee, Y.J., Mangasarian, O.L.: RSVM: reduced support vector machines. In: Proceedings of the 2001 SIAM International Conference on Data Mining, pp. 1–17 (2001) Lee, Y.J., Mangasarian, O.L.: RSVM: reduced support vector machines. In: Proceedings of the 2001 SIAM International Conference on Data Mining, pp. 1–17 (2001)
14.
go back to reference Lei, H., Govindaraju, V.: Speeding up multi-class SVM evaluation by PCA and feature selection. In: Feature Selection for Data Mining, 72 (2005) Lei, H., Govindaraju, V.: Speeding up multi-class SVM evaluation by PCA and feature selection. In: Feature Selection for Data Mining, 72 (2005)
16.
go back to reference Liu, J., Zio, E.: SVM hyperparameters tuning for recursive multi-step-ahead prediction. Neural Comput. Appl. 28(12), 3749–3763 (2017)CrossRef Liu, J., Zio, E.: SVM hyperparameters tuning for recursive multi-step-ahead prediction. Neural Comput. Appl. 28(12), 3749–3763 (2017)CrossRef
17.
go back to reference Manek, A.S., Shenoy, P.D., Mohan, M.C., Venugopal, K.R.: Aspect term extraction for sentiment analysis in large movie reviews using Gini Index feature selection method and SVM classifier. World Wide Web 20(2), 135–154 (2017)CrossRef Manek, A.S., Shenoy, P.D., Mohan, M.C., Venugopal, K.R.: Aspect term extraction for sentiment analysis in large movie reviews using Gini Index feature selection method and SVM classifier. World Wide Web 20(2), 135–154 (2017)CrossRef
18.
go back to reference Mao, X., Fu, Z., Wu, O., Hu, W.: Fast kernel SVM training via support vector identification. In: 2016 23rd International Conference Pattern Recognition (ICPR), pp. 1554–1559 (2016) Mao, X., Fu, Z., Wu, O., Hu, W.: Fast kernel SVM training via support vector identification. In: 2016 23rd International Conference Pattern Recognition (ICPR), pp. 1554–1559 (2016)
20.
go back to reference Mourad, S., Tewfik, A., Vikalo, H.: Data subset selection for efficient SVM training. In: 2017 25th European Signal Processing Conference (EUSIPCO), pp. 833–837 (2017) Mourad, S., Tewfik, A., Vikalo, H.: Data subset selection for efficient SVM training. In: 2017 25th European Signal Processing Conference (EUSIPCO), pp. 833–837 (2017)
21.
go back to reference Nandan, M., Khargonekar, P.P., Talathi, S.S.: Fast SVM training using approximate extreme points. J. Mach. Learning Res. 15(1), 59–98 (2014)MathSciNetMATH Nandan, M., Khargonekar, P.P., Talathi, S.S.: Fast SVM training using approximate extreme points. J. Mach. Learning Res. 15(1), 59–98 (2014)MathSciNetMATH
22.
go back to reference Osman, H., Ghafari, M., Nierstrasz, O.: Hyperparameter optimization to improve bug prediction accuracy. In: IEEE Workshop on Machine Learning Techniques for Software Quality Evaluation (MaLTeSQuE), pp. 33–38 (2017) Osman, H., Ghafari, M., Nierstrasz, O.: Hyperparameter optimization to improve bug prediction accuracy. In: IEEE Workshop on Machine Learning Techniques for Software Quality Evaluation (MaLTeSQuE), pp. 33–38 (2017)
23.
go back to reference Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL 2002 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 79–86 (2002) Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL 2002 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 79–86 (2002)
24.
go back to reference Pedregosa, F., et al.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011)MathSciNetMATH Pedregosa, F., et al.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011)MathSciNetMATH
25.
go back to reference Sammut, C., Webb, G.I. (eds.): Encyclopedia of Machine Learning. Springer, New York (2011)MATH Sammut, C., Webb, G.I. (eds.): Encyclopedia of Machine Learning. Springer, New York (2011)MATH
26.
go back to reference Sunkad, Z.A.: Feature selection and hyperparameter optimization of SVM for human activity recognition. In: 2016 3rd International Conference Soft Computing & Machine Intelligence (ISCMI), pp. 104–109 (2016) Sunkad, Z.A.: Feature selection and hyperparameter optimization of SVM for human activity recognition. In: 2016 3rd International Conference Soft Computing & Machine Intelligence (ISCMI), pp. 104–109 (2016)
28.
go back to reference Wang, S., Li, Z., Liu, C., Zhang, X., Zhang, H.: Training data reduction to speed up SVM training. Appl. Intell. 41(2), 405–420 (2014)CrossRef Wang, S., Li, Z., Liu, C., Zhang, X., Zhang, H.: Training data reduction to speed up SVM training. Appl. Intell. 41(2), 405–420 (2014)CrossRef
Metadata
Title
SVM Accuracy and Training Speed Trade-Off in Sentiment Analysis Tasks
Authors
Konstantinas Korovkinas
Paulius Danėnas
Gintautas Garšva
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-99972-2_18

Premium Partner