Skip to main content
Erschienen in: Neural Computing and Applications 7/2009

01.10.2009 | Original Article

Pattern classification with mixtures of weighted least-squares support vector machine experts

verfasst von: Clodoaldo A. M. Lima, André L. V. Coelho, Fernando J. Von Zuben

Erschienen in: Neural Computing and Applications | Ausgabe 7/2009

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Support Vector Machine (SVM) classifiers are high-performance classification models devised to comply with the structural risk minimization principle and to properly exploit the kernel artifice of nonlinearly mapping input data into high-dimensional feature spaces toward the automatic construction of better discriminating linear decision boundaries. Among several SVM variants, Least-Squares SVMs (LS-SVMs) have gained increased attention recently due mainly to their computationally attractive properties coming as the direct result of applying a modified formulation that makes use of a sum-squared-error cost function jointly with equality, instead of inequality, constraints. In this work, we present a flexible hybrid approach aimed at augmenting the proficiency of LS-SVM classifiers with regard to accuracy/generalization as well as to hyperparameter calibration issues. Such approach, named as Mixtures of Weighted Least-Squares Support Vector Machine Experts, centers around the fusion of the weighted variant of LS-SVMs with Mixtures of Experts models. After the formal characterization of the novel learning framework, simulation results obtained with respect to both binary and multiclass pattern classification problems are reported, ratifying the suitability of the novel hybrid approach in improving the performance issues considered.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
3.
Zurück zum Zitat Andrzejak RG, Lehnertz K, Mormann F, Rieke C, David P, Elger CE (2001) Indications of nonlinear deterministic and finite dimensional structures in time series of brain electrical activity: Dependence on recording region and brain state. Phys Rev E Stat Nonlin Soft Matter Phys 64(6):061907. doi:10.1103/PhysRevE.64.061907 Andrzejak RG, Lehnertz K, Mormann F, Rieke C, David P, Elger CE (2001) Indications of nonlinear deterministic and finite dimensional structures in time series of brain electrical activity: Dependence on recording region and brain state. Phys Rev E Stat Nonlin Soft Matter Phys 64(6):061907. doi:10.​1103/​PhysRevE.​64.​061907
5.
Zurück zum Zitat Cawley GC (2001) Model selection for support vector machines via adaptive step-size tabu search. In: Proceedings of international conference on artificial neural networks and genetic algorithms, Prague, pp 434–437 Cawley GC (2001) Model selection for support vector machines via adaptive step-size tabu search. In: Proceedings of international conference on artificial neural networks and genetic algorithms, Prague, pp 434–437
6.
Zurück zum Zitat Cawley GC (2006) Leave-one-out cross-validation based model selection criteria for weighted LS-SVMs. In: Proceedings of the international joint conference on neural networks. IEEE Press, Vancouver, pp 1661–1668 Cawley GC (2006) Leave-one-out cross-validation based model selection criteria for weighted LS-SVMs. In: Proceedings of the international joint conference on neural networks. IEEE Press, Vancouver, pp 1661–1668
8.
Zurück zum Zitat Cawley GC, Talbot NLC (2007) Preventing over-fitting during model selection via Bayesian regularisation of the hyper-parameters. J Mach Learn Res 8:841–861 Cawley GC, Talbot NLC (2007) Preventing over-fitting during model selection via Bayesian regularisation of the hyper-parameters. J Mach Learn Res 8:841–861
12.
Zurück zum Zitat Cristianini N, Shawe-Taylor J (2000) An Introduction to support vector machines. Cambridge University Press, London Cristianini N, Shawe-Taylor J (2000) An Introduction to support vector machines. Cambridge University Press, London
13.
Zurück zum Zitat de Diego IM, Moguerza JM, Muñoz A (2004) Combining kernel information for support vector classification. In: Proceedings of the international workshop on multiple classifier systems. Lecture notes in computer science, vol 3077. Springer, Berlin, pp 102–111 de Diego IM, Moguerza JM, Muñoz A (2004) Combining kernel information for support vector classification. In: Proceedings of the international workshop on multiple classifier systems. Lecture notes in computer science, vol 3077. Springer, Berlin, pp 102–111
14.
Zurück zum Zitat Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc B 39:1–38MATHMathSciNet Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc B 39:1–38MATHMathSciNet
16.
17.
Zurück zum Zitat Hastie T, Tibshirani R, Friedman J (2001) The elements of statistical learning. Springer, HeidelbergMATH Hastie T, Tibshirani R, Friedman J (2001) The elements of statistical learning. Springer, HeidelbergMATH
18.
Zurück zum Zitat Haykin S (1999) Neural networks––a comprehensive foundation. Prentice Hall, New YorkMATH Haykin S (1999) Neural networks––a comprehensive foundation. Prentice Hall, New YorkMATH
21.
Zurück zum Zitat Joachims T (2000) Estimating the generalization performance of an SVM efficiently. In: Proceedings of 17th international conference on machine learning. Morgan Kaufmann Publishers, San Francisco, pp 431–438 Joachims T (2000) Estimating the generalization performance of an SVM efficiently. In: Proceedings of 17th international conference on machine learning. Morgan Kaufmann Publishers, San Francisco, pp 431–438
23.
Zurück zum Zitat Kwok JT-Y (1998) Support vector mixture for classification and regression problems. In: Proceedins of the 14th international conference on pattern recognition, Brisbane, pp 255–258 Kwok JT-Y (1998) Support vector mixture for classification and regression problems. In: Proceedins of the 14th international conference on pattern recognition, Brisbane, pp 255–258
24.
Zurück zum Zitat Lima CAM, Coelho ALV, Von Zuben FJ (2002) Ensembles of support vector machines for regression problems. In: Proceedings of the international joint conference on neural networks. IEEE Press, Hawaii, pp 2381–2386 Lima CAM, Coelho ALV, Von Zuben FJ (2002) Ensembles of support vector machines for regression problems. In: Proceedings of the international joint conference on neural networks. IEEE Press, Hawaii, pp 2381–2386
25.
Zurück zum Zitat Lima CAM, Coelho ALV, Von Zuben FJ (2002) Model selection based on VC-dimension for heterogeneous ensembles of support vector machines. In: Proceedings of the 4th international conference on recent advances in soft computing. Nottingham University Press, Nottingham, pp 459–464 Lima CAM, Coelho ALV, Von Zuben FJ (2002) Model selection based on VC-dimension for heterogeneous ensembles of support vector machines. In: Proceedings of the 4th international conference on recent advances in soft computing. Nottingham University Press, Nottingham, pp 459–464
26.
Zurück zum Zitat Lima CAM, Coelho ALV, Von Zuben FJ (2007) Hybridizing mixtures of experts with support vector machines: investigation into nonlinear dynamic systems identification. Inf Sci 177:2049–2074. doi:10.1016/j.ins.2007.01.009 Lima CAM, Coelho ALV, Von Zuben FJ (2007) Hybridizing mixtures of experts with support vector machines: investigation into nonlinear dynamic systems identification. Inf Sci 177:2049–2074. doi:10.​1016/​j.​ins.​2007.​01.​009
27.
Zurück zum Zitat McLachlan GJ, Basford KE (1988) Mixture models: inference and applications to clustering. Marcel Deckker, Inc., New YorkMATH McLachlan GJ, Basford KE (1988) Mixture models: inference and applications to clustering. Marcel Deckker, Inc., New YorkMATH
28.
Zurück zum Zitat Moerland P (1999) Classification using localized mixture of experts. In: Proceedings of ninth international conference on artificial neural networks, vol 2, Edinburgh, pp 838–843 Moerland P (1999) Classification using localized mixture of experts. In: Proceedings of ninth international conference on artificial neural networks, vol 2, Edinburgh, pp 838–843
31.
Zurück zum Zitat Schölkopf B, Smola A (2002) Learning with kernels. The MIT Press, Cambridge Schölkopf B, Smola A (2002) Learning with kernels. The MIT Press, Cambridge
34.
Zurück zum Zitat Suykens JAK, Lukas L, Van Dooren P, De Moor B, Vandewalle J (1999) Least squares support vector machine classifiers: a large scale algorithm. In: Proceedings of European conference on circuit theory and design, Italy, pp 839–842 Suykens JAK, Lukas L, Van Dooren P, De Moor B, Vandewalle J (1999) Least squares support vector machine classifiers: a large scale algorithm. In: Proceedings of European conference on circuit theory and design, Italy, pp 839–842
36.
Zurück zum Zitat Suykens JAK, Van Gestel T, De Brabanter J, De Moor B, Vandewalle J (2002) Least squares support vector machines. World Scientific Pub, SingaporeMATH Suykens JAK, Van Gestel T, De Brabanter J, De Moor B, Vandewalle J (2002) Least squares support vector machines. World Scientific Pub, SingaporeMATH
37.
Zurück zum Zitat Tikhonov AN, Arsenim VY (1977) Solutions of Ill-posed problems. W. H. Winston, WashingtonMATH Tikhonov AN, Arsenim VY (1977) Solutions of Ill-posed problems. W. H. Winston, WashingtonMATH
39.
Zurück zum Zitat Vapnik VN (1998) Statistical learning theory. Wiley, New YorkMATH Vapnik VN (1998) Statistical learning theory. Wiley, New YorkMATH
40.
Zurück zum Zitat Wahba G (1998) Support vector machines, reproducing kernel Hilbert spaces and the randomized GACV. In: Schölkopf B, Burges C, Smola A (eds) Advances in kernel methods: support vector machines. The MIT Press, Cambridge, pp 69–88 Wahba G (1998) Support vector machines, reproducing kernel Hilbert spaces and the randomized GACV. In: Schölkopf B, Burges C, Smola A (eds) Advances in kernel methods: support vector machines. The MIT Press, Cambridge, pp 69–88
41.
Zurück zum Zitat Webb A (1999) Statistical pattern recognition. Wiley, New YorkMATH Webb A (1999) Statistical pattern recognition. Wiley, New YorkMATH
Metadaten
Titel
Pattern classification with mixtures of weighted least-squares support vector machine experts
verfasst von
Clodoaldo A. M. Lima
André L. V. Coelho
Fernando J. Von Zuben
Publikationsdatum
01.10.2009
Verlag
Springer-Verlag
Erschienen in
Neural Computing and Applications / Ausgabe 7/2009
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-008-0210-6

Weitere Artikel der Ausgabe 7/2009

Neural Computing and Applications 7/2009 Zur Ausgabe

Premium Partner