Skip to main content

2018 | OriginalPaper | Buchkapitel

SVM Accuracy and Training Speed Trade-Off in Sentiment Analysis Tasks

verfasst von : Konstantinas Korovkinas, Paulius Danėnas, Gintautas Garšva

Erschienen in: Information and Software Technologies

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

SVM technique is one of the best techniques to classify data, but it has a slow performance in the big data arrays. This paper introduces the method to improve the speed of SVM classification in sentiment analysis by reducing the training set. The method was tested on the Stanford Twitter sentiment corpus dataset and Amazon customer reviews dataset. The results show that the execution time of the introduced method outperforms the standard SVM classification method.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Boser, B.E., Guyon, I.M., Vapnik, V.N.: A training algorithm for optimal margin classifiers. In: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 144–152 (1992) Boser, B.E., Guyon, I.M., Vapnik, V.N.: A training algorithm for optimal margin classifiers. In: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 144–152 (1992)
2.
Zurück zum Zitat Chorowski, J., Wang, J., Zurada, J.M.: Review and performance comparison of SVM-and ELM-based classifiers. Neurocomputing 128, 507–516 (2014)CrossRef Chorowski, J., Wang, J., Zurada, J.M.: Review and performance comparison of SVM-and ELM-based classifiers. Neurocomputing 128, 507–516 (2014)CrossRef
3.
Zurück zum Zitat Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)MATH Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)MATH
4.
Zurück zum Zitat Damaševičius, R.: Optimization of SVM parameters for recognition of regulatory DNA sequences. Top 18(2), 339–353 (2010)MathSciNetCrossRef Damaševičius, R.: Optimization of SVM parameters for recognition of regulatory DNA sequences. Top 18(2), 339–353 (2010)MathSciNetCrossRef
5.
Zurück zum Zitat Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res. 9(Aug), 1871–1874 (2008)MATH Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res. 9(Aug), 1871–1874 (2008)MATH
6.
Zurück zum Zitat Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford, 1, 12 (2009) Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford, 1, 12 (2009)
7.
Zurück zum Zitat Guo, L., Boukir, S.: Fast data selection for SVM training using ensemble margin. Pattern Recogn. Lett. 51, 112–119 (2015)CrossRef Guo, L., Boukir, S.: Fast data selection for SVM training using ensemble margin. Pattern Recogn. Lett. 51, 112–119 (2015)CrossRef
8.
Zurück zum Zitat Graf, H.P., Cosatto, E., Bottou, L., Dourdanovic, I., Vapnik, V.: Parallel support vector machines: the cascade SVM. In: Advances in Neural Information Processing Systems, pp. 521–528 (2005) Graf, H.P., Cosatto, E., Bottou, L., Dourdanovic, I., Vapnik, V.: Parallel support vector machines: the cascade SVM. In: Advances in Neural Information Processing Systems, pp. 521–528 (2005)
9.
Zurück zum Zitat Khairnar, J., Kinikar, M.: Machine learning algorithms for opinion mining and sentiment classification. Int. J. Sci. Res. Publ. 3(6), 1–6 (2013) Khairnar, J., Kinikar, M.: Machine learning algorithms for opinion mining and sentiment classification. Int. J. Sci. Res. Publ. 3(6), 1–6 (2013)
10.
11.
Zurück zum Zitat Korovkinas, K., Danėnas, P., Garšva, G.: SVM and Naïve Bayes classification ensemble method for sentiment analysis. Baltic J. Modern Comput. 5(4), 398–409 (2017)CrossRef Korovkinas, K., Danėnas, P., Garšva, G.: SVM and Naïve Bayes classification ensemble method for sentiment analysis. Baltic J. Modern Comput. 5(4), 398–409 (2017)CrossRef
13.
Zurück zum Zitat Lee, Y.J., Mangasarian, O.L.: RSVM: reduced support vector machines. In: Proceedings of the 2001 SIAM International Conference on Data Mining, pp. 1–17 (2001) Lee, Y.J., Mangasarian, O.L.: RSVM: reduced support vector machines. In: Proceedings of the 2001 SIAM International Conference on Data Mining, pp. 1–17 (2001)
14.
Zurück zum Zitat Lei, H., Govindaraju, V.: Speeding up multi-class SVM evaluation by PCA and feature selection. In: Feature Selection for Data Mining, 72 (2005) Lei, H., Govindaraju, V.: Speeding up multi-class SVM evaluation by PCA and feature selection. In: Feature Selection for Data Mining, 72 (2005)
16.
Zurück zum Zitat Liu, J., Zio, E.: SVM hyperparameters tuning for recursive multi-step-ahead prediction. Neural Comput. Appl. 28(12), 3749–3763 (2017)CrossRef Liu, J., Zio, E.: SVM hyperparameters tuning for recursive multi-step-ahead prediction. Neural Comput. Appl. 28(12), 3749–3763 (2017)CrossRef
17.
Zurück zum Zitat Manek, A.S., Shenoy, P.D., Mohan, M.C., Venugopal, K.R.: Aspect term extraction for sentiment analysis in large movie reviews using Gini Index feature selection method and SVM classifier. World Wide Web 20(2), 135–154 (2017)CrossRef Manek, A.S., Shenoy, P.D., Mohan, M.C., Venugopal, K.R.: Aspect term extraction for sentiment analysis in large movie reviews using Gini Index feature selection method and SVM classifier. World Wide Web 20(2), 135–154 (2017)CrossRef
18.
Zurück zum Zitat Mao, X., Fu, Z., Wu, O., Hu, W.: Fast kernel SVM training via support vector identification. In: 2016 23rd International Conference Pattern Recognition (ICPR), pp. 1554–1559 (2016) Mao, X., Fu, Z., Wu, O., Hu, W.: Fast kernel SVM training via support vector identification. In: 2016 23rd International Conference Pattern Recognition (ICPR), pp. 1554–1559 (2016)
20.
Zurück zum Zitat Mourad, S., Tewfik, A., Vikalo, H.: Data subset selection for efficient SVM training. In: 2017 25th European Signal Processing Conference (EUSIPCO), pp. 833–837 (2017) Mourad, S., Tewfik, A., Vikalo, H.: Data subset selection for efficient SVM training. In: 2017 25th European Signal Processing Conference (EUSIPCO), pp. 833–837 (2017)
21.
Zurück zum Zitat Nandan, M., Khargonekar, P.P., Talathi, S.S.: Fast SVM training using approximate extreme points. J. Mach. Learning Res. 15(1), 59–98 (2014)MathSciNetMATH Nandan, M., Khargonekar, P.P., Talathi, S.S.: Fast SVM training using approximate extreme points. J. Mach. Learning Res. 15(1), 59–98 (2014)MathSciNetMATH
22.
Zurück zum Zitat Osman, H., Ghafari, M., Nierstrasz, O.: Hyperparameter optimization to improve bug prediction accuracy. In: IEEE Workshop on Machine Learning Techniques for Software Quality Evaluation (MaLTeSQuE), pp. 33–38 (2017) Osman, H., Ghafari, M., Nierstrasz, O.: Hyperparameter optimization to improve bug prediction accuracy. In: IEEE Workshop on Machine Learning Techniques for Software Quality Evaluation (MaLTeSQuE), pp. 33–38 (2017)
23.
Zurück zum Zitat Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL 2002 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 79–86 (2002) Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL 2002 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 79–86 (2002)
24.
Zurück zum Zitat Pedregosa, F., et al.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011)MathSciNetMATH Pedregosa, F., et al.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011)MathSciNetMATH
25.
Zurück zum Zitat Sammut, C., Webb, G.I. (eds.): Encyclopedia of Machine Learning. Springer, New York (2011)MATH Sammut, C., Webb, G.I. (eds.): Encyclopedia of Machine Learning. Springer, New York (2011)MATH
26.
Zurück zum Zitat Sunkad, Z.A.: Feature selection and hyperparameter optimization of SVM for human activity recognition. In: 2016 3rd International Conference Soft Computing & Machine Intelligence (ISCMI), pp. 104–109 (2016) Sunkad, Z.A.: Feature selection and hyperparameter optimization of SVM for human activity recognition. In: 2016 3rd International Conference Soft Computing & Machine Intelligence (ISCMI), pp. 104–109 (2016)
28.
Zurück zum Zitat Wang, S., Li, Z., Liu, C., Zhang, X., Zhang, H.: Training data reduction to speed up SVM training. Appl. Intell. 41(2), 405–420 (2014)CrossRef Wang, S., Li, Z., Liu, C., Zhang, X., Zhang, H.: Training data reduction to speed up SVM training. Appl. Intell. 41(2), 405–420 (2014)CrossRef
Metadaten
Titel
SVM Accuracy and Training Speed Trade-Off in Sentiment Analysis Tasks
verfasst von
Konstantinas Korovkinas
Paulius Danėnas
Gintautas Garšva
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-99972-2_18