nach oben

Erschienen in:

2018 | OriginalPaper | Buchkapitel

SVM Accuracy and Training Speed Trade-Off in Sentiment Analysis Tasks

verfasst von : Konstantinas Korovkinas, Paulius Danėnas, Gintautas Garšva

Erschienen in: Information and Software Technologies

Verlag: Springer International Publishing

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

SVM technique is one of the best techniques to classify data, but it has a slow performance in the big data arrays. This paper introduces the method to improve the speed of SVM classification in sentiment analysis by reducing the training set. The method was tested on the Stanford Twitter sentiment corpus dataset and Amazon customer reviews dataset. The results show that the execution time of the introduced method outperforms the standard SVM classification method.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Vorheriges Kapitel A Workflow-Based Large-Scale Patent Mining and Analytics Framework

Nächstes Kapitel J48S: A Sequence Classification Approach to Text Analysis Based on Decision Trees

http://help.sentiment140.com/.

https://www.kaggle.com/bittlingmayer/amazonreviews/.

https://www.csie.ntu.edu.tw/~cjlin/liblinear/.

https://www.csie.ntu.edu.tw/~cjlin/libsvm/.

Boser, B.E., Guyon, I.M., Vapnik, V.N.: A training algorithm for optimal margin classifiers. In: Proceedings of the Fifth Annual Workshop on Computational Learning Theory, pp. 144–152 (1992)

Chorowski, J., Wang, J., Zurada, J.M.: Review and performance comparison of SVM-and ELM-based classifiers. Neurocomputing 128, 507–516 (2014)CrossRef

Cortes, C., Vapnik, V.: Support-vector networks. Mach. Learn. 20(3), 273–297 (1995)MATH

Damaševičius, R.: Optimization of SVM parameters for recognition of regulatory DNA sequences. Top 18(2), 339–353 (2010)MathSciNetCrossRef

Fan, R.E., Chang, K.W., Hsieh, C.J., Wang, X.R., Lin, C.J.: LIBLINEAR: a library for large linear classification. J. Mach. Learn. Res. 9(Aug), 1871–1874 (2008)MATH

Go, A., Bhayani, R., Huang, L.: Twitter sentiment classification using distant supervision. CS224N Project Report, Stanford, 1, 12 (2009)

Guo, L., Boukir, S.: Fast data selection for SVM training using ensemble margin. Pattern Recogn. Lett. 51, 112–119 (2015)CrossRef

Graf, H.P., Cosatto, E., Bottou, L., Dourdanovic, I., Vapnik, V.: Parallel support vector machines: the cascade SVM. In: Advances in Neural Information Processing Systems, pp. 521–528 (2005)

Khairnar, J., Kinikar, M.: Machine learning algorithms for opinion mining and sentiment classification. Int. J. Sci. Res. Publ. 3(6), 1–6 (2013)

10.

Kharde, V., Sonawane, P.: Sentiment analysis of twitter data: a survey of techniques. arXiv preprint arXiv:1601.06971 (2016)

11.

Korovkinas, K., Danėnas, P., Garšva, G.: SVM and Naïve Bayes classification ensemble method for sentiment analysis. Baltic J. Modern Comput. 5(4), 398–409 (2017)CrossRef

12.

Le, B., Nguyen, H.: Twitter sentiment analysis using machine learning techniques. In: Le Thi, H.A., Nguyen, N.T., Do, T.V. (eds.) Advanced Computational Methods for Knowledge Engineering. AISC, vol. 358, pp. 279–289. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-17996-4_25CrossRef

13.

Lee, Y.J., Mangasarian, O.L.: RSVM: reduced support vector machines. In: Proceedings of the 2001 SIAM International Conference on Data Mining, pp. 1–17 (2001)

14.

Lei, H., Govindaraju, V.: Speeding up multi-class SVM evaluation by PCA and feature selection. In: Feature Selection for Data Mining, 72 (2005)

15.

Liu, M., Xu, C., Xu, C., Tao, D.: Fast SVM Trained by Divide-and-Conquer Anchors, pp. 2322–2328 (2017). https://doi.org/10.24963/ijcai.2017/323

16.

Liu, J., Zio, E.: SVM hyperparameters tuning for recursive multi-step-ahead prediction. Neural Comput. Appl. 28(12), 3749–3763 (2017)CrossRef

17.

Manek, A.S., Shenoy, P.D., Mohan, M.C., Venugopal, K.R.: Aspect term extraction for sentiment analysis in large movie reviews using Gini Index feature selection method and SVM classifier. World Wide Web 20(2), 135–154 (2017)CrossRef

18.

Mao, X., Fu, Z., Wu, O., Hu, W.: Fast kernel SVM training via support vector identification. In: 2016 23rd International Conference Pattern Recognition (ICPR), pp. 1554–1559 (2016)

19.

Meyer, O., Bischl, B., Weihs, C.: Support vector machines on large data sets: simple parallel approaches. In: Spiliopoulou, M., Schmidt-Thieme, L., Janning, R. (eds.) Data Analysis, Machine Learning and Knowledge Discovery. SCDAKO, pp. 87–95. Springer, Cham (2014). https://doi.org/10.1007/978-3-319-01595-8_10CrossRef

20.

Mourad, S., Tewfik, A., Vikalo, H.: Data subset selection for efficient SVM training. In: 2017 25th European Signal Processing Conference (EUSIPCO), pp. 833–837 (2017)

21.

Nandan, M., Khargonekar, P.P., Talathi, S.S.: Fast SVM training using approximate extreme points. J. Mach. Learning Res. 15(1), 59–98 (2014)MathSciNetMATH

22.

Osman, H., Ghafari, M., Nierstrasz, O.: Hyperparameter optimization to improve bug prediction accuracy. In: IEEE Workshop on Machine Learning Techniques for Software Quality Evaluation (MaLTeSQuE), pp. 33–38 (2017)

23.

Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: sentiment classification using machine learning techniques. In: Proceedings of the ACL 2002 Conference on Empirical Methods in Natural Language Processing, vol. 10, pp. 79–86 (2002)

24.

Pedregosa, F., et al.: Scikit-learn: machine learning in python. J. Mach. Learn. Res. 12, 2825–2830 (2011)MathSciNetMATH

25.

Sammut, C., Webb, G.I. (eds.): Encyclopedia of Machine Learning. Springer, New York (2011)MATH

26.

Sunkad, Z.A.: Feature selection and hyperparameter optimization of SVM for human activity recognition. In: 2016 3rd International Conference Soft Computing & Machine Intelligence (ISCMI), pp. 104–109 (2016)

27.

R Development Core Team: R: A language and environment for statistical computing, R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org

28.

Wang, S., Li, Z., Liu, C., Zhang, X., Zhang, H.: Training data reduction to speed up SVM training. Appl. Intell. 41(2), 405–420 (2014)CrossRef

Titel: SVM Accuracy and Training Speed Trade-Off in Sentiment Analysis Tasks
verfasst von: Konstantinas Korovkinas
Paulius Danėnas
Gintautas Garšva
Verlag: Springer International Publishing
Buch: Information and Software Technologies
Print ISBN: 978-3-319-99971-5

Electronic ISBN: 978-3-319-99972-2

Copyright-Jahr: 2018
DOI: https://doi.org/10.1007/978-3-319-99972-2_18

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"