2018 | OriginalPaper | Chapter

SVM Accuracy and Training Speed Trade-Off in Sentiment Analysis Tasks

Authors : Konstantinas Korovkinas, Paulius Danėnas, Gintautas Garšva

Published in: Information and Software Technologies

Publisher: Springer International Publishing

Abstract

The support vector machine (SVM) is one of the best-performing techniques for classifying data, but it is slow on large datasets. This paper introduces a method that improves the speed of SVM classification in sentiment analysis by reducing the training set. The method was tested on the Stanford Twitter sentiment corpus and the Amazon customer reviews dataset. The results show that the introduced method has a shorter execution time than the standard SVM classification method.
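The abstract does not say how the training set is reduced, so the following is only a minimal sketch of the general trade-off, not the authors' method. It assumes uniform random subsampling as a stand-in reduction step, a Pegasos-style linear SVM written in plain Python, and synthetic data in place of the two corpora. The names `make_data`, `train_linear_svm`, `accuracy`, and `compare` are hypothetical.

```python
import random
import time


def make_data(n, dim=20, seed=0):
    """Synthetic two-class data: the label is the sign of a noisy linear score."""
    rng = random.Random(seed)
    true_w = [rng.uniform(-1, 1) for _ in range(dim)]
    data = []
    for _ in range(n):
        x = [rng.gauss(0, 1) for _ in range(dim)]
        score = sum(w * v for w, v in zip(true_w, x)) + rng.gauss(0, 0.3)
        data.append((x, 1 if score >= 0 else -1))
    return data


def train_linear_svm(data, lam=0.01, epochs=5, seed=0):
    """Pegasos-style stochastic subgradient descent on the regularised hinge loss."""
    rng = random.Random(seed)
    w = [0.0] * len(data[0][0])
    t = 0
    for _ in range(epochs):
        order = list(range(len(data)))
        rng.shuffle(order)
        for i in order:
            x, y = data[i]
            t += 1
            eta = 1.0 / (lam * t)  # decaying step size
            margin = y * sum(wj * xj for wj, xj in zip(w, x))
            # Shrink the weights (regularisation term) ...
            w = [(1 - eta * lam) * wj for wj in w]
            # ... and step towards the example if it violates the margin.
            if margin < 1:
                w = [wj + eta * y * xj for wj, xj in zip(w, x)]
    return w


def accuracy(w, data):
    """Fraction of examples whose predicted sign matches the label."""
    hits = 0
    for x, y in data:
        pred = 1 if sum(wj * xj for wj, xj in zip(w, x)) >= 0 else -1
        hits += pred == y
    return hits / len(data)


def compare(train, test, fraction=0.2, seed=0):
    """Train on the full set and on a random subset; report time and accuracy."""
    # Assumption: uniform random sampling stands in for the reduction step.
    subset = random.Random(seed).sample(train, int(len(train) * fraction))
    results = {}
    for name, part in (("full", train), ("reduced", subset)):
        start = time.perf_counter()
        w = train_linear_svm(part)
        elapsed = time.perf_counter() - start
        results[name] = {
            "n": len(part),
            "seconds": elapsed,
            "accuracy": accuracy(w, test),
        }
    return results


if __name__ == "__main__":
    all_data = make_data(2500)
    for name, row in compare(all_data[:2000], all_data[2000:]).items():
        print(name, row)
```

Running the script prints the training-set size, wall-clock training time, and held-out accuracy for both runs, which is the kind of comparison the abstract reports. The actual numbers depend on the machine and on the data, and a real experiment would use a tuned SVM library and the sentiment datasets rather than this toy setup.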

http://help.sentiment140.com/.

https://www.kaggle.com/bittlingmayer/amazonreviews/.

https://www.csie.ntu.edu.tw/~cjlin/liblinear/.

https://www.csie.ntu.edu.tw/~cjlin/libsvm/.

Title: SVM Accuracy and Training Speed Trade-Off in Sentiment Analysis Tasks
Authors: Konstantinas Korovkinas, Paulius Danėnas, Gintautas Garšva
Publisher: Springer International Publishing
Book: Information and Software Technologies
Print ISBN: 978-3-319-99971-5

Electronic ISBN: 978-3-319-99972-2

Copyright Year: 2018
DOI: https://doi.org/10.1007/978-3-319-99972-2_18
