nach oben

Neural Computing and Applications

Erschienen in:

01.10.2009 | Original Article

Pattern classification with mixtures of weighted least-squares support vector machine experts

verfasst von: Clodoaldo A. M. Lima, André L. V. Coelho, Fernando J. Von Zuben

Erschienen in: Neural Computing and Applications | Ausgabe 7/2009

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Aus

Abstract

Support Vector Machine (SVM) classifiers are high-performance classification models devised to comply with the structural risk minimization principle and to properly exploit the kernel artifice of nonlinearly mapping input data into high-dimensional feature spaces toward the automatic construction of better discriminating linear decision boundaries. Among several SVM variants, Least-Squares SVMs (LS-SVMs) have gained increased attention recently due mainly to their computationally attractive properties coming as the direct result of applying a modified formulation that makes use of a sum-squared-error cost function jointly with equality, instead of inequality, constraints. In this work, we present a flexible hybrid approach aimed at augmenting the proficiency of LS-SVM classifiers with regard to accuracy/generalization as well as to hyperparameter calibration issues. Such approach, named as Mixtures of Weighted Least-Squares Support Vector Machine Experts, centers around the fusion of the weighted variant of LS-SVMs with Mixtures of Experts models. After the formal characterization of the novel learning framework, simulation results obtained with respect to both binary and multiclass pattern classification problems are reported, ratifying the suitability of the novel hybrid approach in improving the performance issues considered.

Vorheriger Artikel Combining seasonal time series ARIMA method and neural networks with genetic algorithms for predicting the production value of the mechanical industry in Taiwan

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Jetzt informieren

Adankon MM, Cherieta M (2007) Optimizing resources in model selection for support vector machine. Pattern Recognit 40:953–963. doi:10.1016/j.patcog.2006.06.012 MATHCrossRef

An S, Liua W, Venkatesha S (2007) Fast cross-validation algorithms for least squares support vector machine and kernel ridge regression. Pattern Recognit 40:2154–2162. doi:10.1016/j.patcog.2006.12.015 MATHCrossRef

Andrzejak RG, Lehnertz K, Mormann F, Rieke C, David P, Elger CE (2001) Indications of nonlinear deterministic and finite dimensional structures in time series of brain electrical activity: Dependence on recording region and brain state. Phys Rev E Stat Nonlin Soft Matter Phys 64(6):061907. doi:10.1103/PhysRevE.64.061907

Burges CJC (1998) A tutorial on support vector machines for pattern recognition. Data Min Knowl Discov 2:121–167. doi:10.1023/A:1009715923555 CrossRef

Cawley GC (2001) Model selection for support vector machines via adaptive step-size tabu search. In: Proceedings of international conference on artificial neural networks and genetic algorithms, Prague, pp 434–437

Cawley GC (2006) Leave-one-out cross-validation based model selection criteria for weighted LS-SVMs. In: Proceedings of the international joint conference on neural networks. IEEE Press, Vancouver, pp 1661–1668

Cawley GC, Talbot NLC (2002) Improved sparse least-squares support vector machines. Neurocomputing 48:1025–1031. doi:10.1016/S0925-2312(02)00606-9 MATHCrossRef

Cawley GC, Talbot NLC (2007) Preventing over-fitting during model selection via Bayesian regularisation of the hyper-parameters. J Mach Learn Res 8:841–861

Chapelle O, Vapnik V, Bousquet O, Mukherjee S (2002) Choosing multiple parameters for support vector machines. Mach Learn 46:131–159. doi:10.1023/A:1012450327387 MATHCrossRef

10.

Cherkassky V, Ma Y (2004) Practical selection of SVM parameters and noise estimation for SVM regression. Neural Netw 17:113–126. doi:10.1016/S0893-6080(03)00169-2 MATHCrossRef

11.

Collobert R, Bengio S, Bengio Y (2002) A parallel mixture of SVMs for very large scale problems. Neural Comput 14:1105–1114. doi:10.1162/089976602753633402 MATHCrossRef

12.

Cristianini N, Shawe-Taylor J (2000) An Introduction to support vector machines. Cambridge University Press, London

13.

de Diego IM, Moguerza JM, Muñoz A (2004) Combining kernel information for support vector classification. In: Proceedings of the international workshop on multiple classifier systems. Lecture notes in computer science, vol 3077. Springer, Berlin, pp 102–111

14.

Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc B 39:1–38MATHMathSciNet

15.

Friedrichs F, Igel C (2005) Evolutionary tuning of multiple SVM parameters. Neurocomputing 64:107–117. doi:10.1016/j.neucom.2004.11.022 CrossRef

16.

Furey TS, Duffy N, Cristianini N, Bednarski D, Schummer M, Haussler D (2000) Support vector machine classification and validation of cancer tissue samples using microarray expression data. Bioinformatics 16:906–914. doi:10.1093/bioinformatics/16.10.906 CrossRef

17.

Hastie T, Tibshirani R, Friedman J (2001) The elements of statistical learning. Springer, HeidelbergMATH

18.

Haykin S (1999) Neural networks––a comprehensive foundation. Prentice Hall, New YorkMATH

19.

Hsu C-W, Lin C-J (2002) A comparison of methods for multi-class support vector machines. IEEE Trans Neural Netw 13:415–425. doi:10.1109/72.991427 CrossRef

20.

Jacobs R, Jordan M, Nowlan S, Hinton G (1991) Adaptive mixtures of local experts. Neural Comput 3:79–87. doi:10.1162/neco.1991.3.1.79 CrossRef

21.

Joachims T (2000) Estimating the generalization performance of an SVM efficiently. In: Proceedings of 17th international conference on machine learning. Morgan Kaufmann Publishers, San Francisco, pp 431–438

22.

Jordan M, Jacobs R (1994) Hierarchical mixtures of experts and the EM algorithm. Neural Comput 6:181–214. doi:10.1162/neco.1994.6.2.181 CrossRef

23.

Kwok JT-Y (1998) Support vector mixture for classification and regression problems. In: Proceedins of the 14th international conference on pattern recognition, Brisbane, pp 255–258

24.

Lima CAM, Coelho ALV, Von Zuben FJ (2002) Ensembles of support vector machines for regression problems. In: Proceedings of the international joint conference on neural networks. IEEE Press, Hawaii, pp 2381–2386

25.

Lima CAM, Coelho ALV, Von Zuben FJ (2002) Model selection based on VC-dimension for heterogeneous ensembles of support vector machines. In: Proceedings of the 4th international conference on recent advances in soft computing. Nottingham University Press, Nottingham, pp 459–464

26.

Lima CAM, Coelho ALV, Von Zuben FJ (2007) Hybridizing mixtures of experts with support vector machines: investigation into nonlinear dynamic systems identification. Inf Sci 177:2049–2074. doi:10.1016/j.ins.2007.01.009

27.

McLachlan GJ, Basford KE (1988) Mixture models: inference and applications to clustering. Marcel Deckker, Inc., New YorkMATH

28.

Moerland P (1999) Classification using localized mixture of experts. In: Proceedings of ninth international conference on artificial neural networks, vol 2, Edinburgh, pp 838–843

29.

Pelckmans K, Suykens JAK, De Moor B (2005) Building sparse representations and structure determination on LS-SVM substrates. Neurocomputing 64:137–159. doi:10.1016/j.neucom.2004.11.029 CrossRef

30.

Schölkopf B, Platt J, Shawe-Taylor J, Smola AJ, Williamson RC (2001) Estimating the support of a high-dimensional distribution. Neural Comput 13:1443–1471. doi:10.1162/089976601750264965 MATHCrossRef

31.

Schölkopf B, Smola A (2002) Learning with kernels. The MIT Press, Cambridge

32.

Subasi A (2007) EEG signal classification using wavelet feature extraction and a mixture of expert model. Expert Syst Appl 32:1084–1093. doi:10.1016/j.eswa.2006.02.005 CrossRef

33.

Suykens JAK, Vandewalle J (1999) Least squares support machine classifiers. Neural Process Lett 9:293–300. doi:10.1023/A:1018628609742 CrossRefMathSciNet

34.

Suykens JAK, Lukas L, Van Dooren P, De Moor B, Vandewalle J (1999) Least squares support vector machine classifiers: a large scale algorithm. In: Proceedings of European conference on circuit theory and design, Italy, pp 839–842

35.

Suykens JAK, De Brabanter J, Lukas L, Vandewalle J (2002) Weighted least squares support vector machines: robustness and sparse approximation. Neurocomputing 48:85–105. doi:10.1016/S0925-2312(01)00644-0 MATHCrossRef

36.

Suykens JAK, Van Gestel T, De Brabanter J, De Moor B, Vandewalle J (2002) Least squares support vector machines. World Scientific Pub, SingaporeMATH

37.

Tikhonov AN, Arsenim VY (1977) Solutions of Ill-posed problems. W. H. Winston, WashingtonMATH

38.

Van Gestel T, Suykens JAK, Baesens B, Viaene S, Vanthienen J, Dedene G, De Moor B, Vandewalle J (2004) Benchmarking least squares support vector machine classifiers. Mach Learn 54:5–32. doi:10.1023/B:MACH.0000008082.80494.e0 MATHCrossRef

39.

Vapnik VN (1998) Statistical learning theory. Wiley, New YorkMATH

40.

Wahba G (1998) Support vector machines, reproducing kernel Hilbert spaces and the randomized GACV. In: Schölkopf B, Burges C, Smola A (eds) Advances in kernel methods: support vector machines. The MIT Press, Cambridge, pp 69–88

41.

Webb A (1999) Statistical pattern recognition. Wiley, New YorkMATH

Titel: Pattern classification with mixtures of weighted least-squares support vector machine experts
verfasst von: Clodoaldo A. M. Lima
André L. V. Coelho
Fernando J. Von Zuben
Publikationsdatum: 01.10.2009
Verlag: Springer-Verlag
Erschienen in: Neural Computing and Applications / Ausgabe 7/2009
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI: https://doi.org/10.1007/s00521-008-0210-6

Springer Professional

Abstract

Bitte loggen Sie sich ein, um Zugang zu Ihrer Lizenz zu erhalten.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Springer Professional "Technik"

Springer Professional "Wirtschaft+Technik"

Weitere Artikel der Ausgabe 7/2009

Combining seasonal time series ARIMA method and neural networks with genetic algorithms for predicting the production value of the mechanical industry in Taiwan

Speech nonfluency detection using Kohonen networks

Overhead cranes fuzzy control design with deadzone compensation

A novel pruning approach for robust data clustering

Improved hybrid wavelet neural network methodology for time-varying behavior prediction of engineering structures

A joint investigation of misclassification treatments and imbalanced datasets on neural network performance

Premium Partner