Published in: International Journal of Machine Learning and Cybernetics 3/2016

01-06-2016 | Original Article

Fusing sequential minimal optimization and Newton’s method for support vector training

Author: Shigeo Abe

Abstract

Sequential minimal optimization (SMO) is widely used for training support vector machines (SVMs) because of its fast training. But training slows down when a large margin parameter value is used. Training by Newton's method (NM) accelerates training in such a situation, but it slows down for a small margin parameter value. To solve this problem, in this paper we fuse SMO with NM and call the fused method SMO-NM. Because slow training is caused by repetitive corrections of the same variables, we modify the working set selection when such repetitions are detected. We call the variables selected by SMO the SMO variables. If a variable selected as an SMO variable at the current step was also selected at a previous step, we consider that a loop is detected. In that case, in addition to the SMO variables, we add to the working set the unbounded variables that were previously selected as SMO variables, and we correct the variables in the working set by NM. If no loop is detected, the training procedure is the same as that of SMO. As a variant of this working set strategy, we further add violating variables to the working set. We clarify that, if the classification problem is not linearly separable in the feature space, the solutions of the L1 and L2 SVMs (with the linear sum and the square sum of slack variables, respectively) are unbounded as the margin parameter value approaches infinity, and that, if the mapped training data are not linearly independent in the feature space, the solution of the least squares SVM is unbounded as the margin parameter approaches infinity. We also clarify the condition under which the increment of the objective function value by SMO-NM is larger than that by SMO. We evaluate SMO-NM on several benchmark data sets and confirm its effectiveness over SMO, especially for large margin parameter values.
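To make the loop-handling strategy concrete, the sketch below illustrates the idea in Python under stated assumptions; it is not the paper's implementation. The dual minimized is the standard L1 SVM dual \(\frac{1}{2}\alpha^\top Q \alpha - \sum_i \alpha_i\) with \(Q_{ij} = y_i y_j K(x_i, x_j)\), subject to \(\sum_i y_i \alpha_i = 0\) and \(0 \le \alpha_i \le C\), where \(C\) is the margin parameter. In this sketch the SMO pair is chosen by maximal-violating-pair selection, re-selection of a previously chosen variable counts as a loop, and the NM correction is a single equality-constrained Newton step over the SMO pair plus the previously selected unbounded variables, clipped to the box \([0, C]\). All function names, tolerances, and the toy data are hypothetical, and the fallback to a plain SMO step when the Newton step is blocked at the box boundary is a safeguard of this sketch, not a feature of the paper.

```python
# A minimal sketch of the SMO-NM idea described in the abstract.
# Assumptions are noted in the text above; NOT the author's implementation.
import numpy as np

def rbf_kernel(X, gamma=1.0):
    """RBF kernel matrix of the rows of X."""
    sq = np.sum(X ** 2, axis=1)
    return np.exp(-gamma * (sq[:, None] + sq[None, :] - 2.0 * X @ X.T))

def select_smo_pair(alpha, y, grad, C, tol):
    """Maximal-violating-pair selection; (None, None) signals optimality."""
    yg = -y * grad
    up = ((y > 0) & (alpha < C)) | ((y < 0) & (alpha > 0))
    low = ((y < 0) & (alpha < C)) | ((y > 0) & (alpha > 0))
    if not up.any() or not low.any():
        return None, None
    i = np.flatnonzero(up)[np.argmax(yg[up])]
    j = np.flatnonzero(low)[np.argmin(yg[low])]
    return (int(i), int(j)) if yg[i] - yg[j] > tol else (None, None)

def smo_step(alpha, Q, y, grad, i, j, C):
    """Ordinary two-variable SMO update along the feasible direction."""
    a = Q[i, i] + Q[j, j] - 2.0 * y[i] * y[j] * Q[i, j]
    t = (y[j] * grad[j] - y[i] * grad[i]) / max(a, 1e-12)  # unclipped step
    t = min(t,
            C - alpha[i] if y[i] > 0 else alpha[i],        # box limit, alpha_i
            alpha[j] if y[j] > 0 else C - alpha[j])        # box limit, alpha_j
    alpha[i] += y[i] * t
    alpha[j] -= y[j] * t

def newton_step(alpha, Q, y, ws, C):
    """One equality-constrained Newton step on the working set, box-clipped.
    Returns False when the step is blocked so the caller can fall back."""
    S, m = np.asarray(ws), len(ws)
    grad = Q @ alpha - 1.0
    KKT = np.zeros((m + 1, m + 1))
    KKT[:m, :m] = Q[np.ix_(S, S)] + 1e-10 * np.eye(m)  # regularized Hessian
    KKT[:m, m] = KKT[m, :m] = y[S]                     # equality-constraint row
    d = np.linalg.solve(KKT, np.append(-grad[S], 0.0))[:m]
    theta = 1.0                     # largest feasible fraction of the step
    for k, dk in zip(S, d):
        if dk > 1e-12:
            theta = min(theta, (C - alpha[k]) / dk)
        elif dk < -1e-12:
            theta = min(theta, -alpha[k] / dk)
    alpha[S] += theta * d
    return theta * np.abs(d).max() > 1e-12

def train_smo_nm(X, y, C=1e4, gamma=1.0, tol=1e-3, max_iter=10000):
    """Dual SVM training: SMO steps, plus a Newton step when a loop is found."""
    Q = (y[:, None] * y[None, :]) * rbf_kernel(X, gamma)
    alpha = np.zeros(len(y))
    history = set()                 # variables previously chosen by SMO
    for _ in range(max_iter):
        grad = Q @ alpha - 1.0      # gradient of (1/2)a'Qa - sum(a)
        i, j = select_smo_pair(alpha, y, grad, C, tol)
        if i is None:
            break                   # KKT conditions satisfied within tol
        if i in history or j in history:   # loop detected: revisited variable
            ws = sorted({i, j} | {k for k in history if 0.0 < alpha[k] < C})
            if not newton_step(alpha, Q, y, ws, C):
                smo_step(alpha, Q, y, grad, i, j, C)  # blocked: plain SMO step
        else:
            smo_step(alpha, Q, y, grad, i, j, C)
        history.update((i, j))
    return alpha

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.normal(size=(40, 2))
    y = np.where(X[:, 0] + X[:, 1] > 0.0, 1.0, -1.0)
    alpha = train_smo_nm(X, y)
    print("support vectors:", int(np.sum(alpha > 1e-8)))
```

Because both step types preserve the equality constraint \(\sum_i y_i \alpha_i = 0\) and the box constraints, every iterate stays dual-feasible; the intended benefit, per the abstract, is that the multi-variable Newton correction replaces the long runs of repeated two-variable corrections that slow SMO down for large \(C\).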

Metadata

Title: Fusing sequential minimal optimization and Newton’s method for support vector training
Author: Shigeo Abe
Publication date: 01-06-2016
Publisher: Springer Berlin Heidelberg
Published in: International Journal of Machine Learning and Cybernetics, Issue 3/2016
Print ISSN: 1868-8071
Electronic ISSN: 1868-808X
DOI: https://doi.org/10.1007/s13042-014-0265-x
