Skip to main content
Erschienen in: Neural Computing and Applications 2/2009

01.02.2009 | Original Article

Machine learning multi-classifiers for peptide classification

verfasst von: Loris Nanni, Alessandra Lumini

Erschienen in: Neural Computing and Applications | Ausgabe 2/2009

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

In this paper, we study the performance improvement that it is possible to obtain combining classifiers based on different notions (each trained using a different physicochemical property of amino-acids). This multi-classifier has been tested in three problems: HIV-protease; recognition of T-cell epitopes; predictive vaccinology. We propose a multi-classifier that combines a classifier that approaches the problem as a two-class pattern recognition problem and a method based on a one-class classifier. Several classifiers combined with the “sum rule” enables us to obtain an improvement performance over the best results previously published in the literature.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Fußnoten
1
With Gamma (parameter of the radial based kernel) = 0.25 and C (Cost of the constrain violation) = 1.5.
 
2
Implemented as PRTools 3.17 Matlab Toolbox.
 
Literatur
1.
Zurück zum Zitat Altıncay H, Demirekler M (2000) An information theoretic framework for weight estimation in the combination of probabilistic classifiers for speaker identification. Speech Commun 30(4):255–272CrossRef Altıncay H, Demirekler M (2000) An information theoretic framework for weight estimation in the combination of probabilistic classifiers for speaker identification. Speech Commun 30(4):255–272CrossRef
2.
Zurück zum Zitat Altıncay H (2006) Ensembling evidential k-nearest neighbor classifiers through multi-modal perturbation. Appl Soft Comput (in press) Altıncay H (2006) Ensembling evidential k-nearest neighbor classifiers through multi-modal perturbation. Appl Soft Comput (in press)
4.
Zurück zum Zitat Brusic V et al (2002) Prediction of promiscuous peptides that bind HLA class I molecules. Immunol Cell Biol 80:280–285CrossRef Brusic V et al (2002) Prediction of promiscuous peptides that bind HLA class I molecules. Immunol Cell Biol 80:280–285CrossRef
5.
Zurück zum Zitat Bozic I, Zhang GL, Brusic V (2005) Predictive vaccinology: optimization of predictions using support vector machine classifiers, IDEAL 2005, pp 375–381 Bozic I, Zhang GL, Brusic V (2005) Predictive vaccinology: optimization of predictions using support vector machine classifiers, IDEAL 2005, pp 375–381
6.
Zurück zum Zitat Cristianini N, Shawe-Taylor J (2000) An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press, Cambridge Cristianini N, Shawe-Taylor J (2000) An introduction to support vector machines and other kernel-based learning methods. Cambridge University Press, Cambridge
7.
Zurück zum Zitat Duda R, Hart P, Stork D (2001) Pattern classification. Wiley, New YorkMATH Duda R, Hart P, Stork D (2001) Pattern classification. Wiley, New YorkMATH
8.
Zurück zum Zitat Franco A, Lumini A, Maio D, Nanni L (2006) An enhanced subspace method for face recognition. Pattern Recognit Lett 27(1):76–84CrossRef Franco A, Lumini A, Maio D, Nanni L (2006) An enhanced subspace method for face recognition. Pattern Recognit Lett 27(1):76–84CrossRef
9.
Zurück zum Zitat Guo J, Lin Y, Sun Z (2005) A novel method for protein subcellular localization: combining residue-couple model and SVM. In: Proceedings of 3rd Asia-Pacific Bioinformatics conference, Singapore, pp 117–129 Guo J, Lin Y, Sun Z (2005) A novel method for protein subcellular localization: combining residue-couple model and SVM. In: Proceedings of 3rd Asia-Pacific Bioinformatics conference, Singapore, pp 117–129
10.
Zurück zum Zitat Ho TK (1998) The random subspace method for constructing decision forests. IEEE Trans Pattern Anal Mach Intell 20(8):832–844CrossRef Ho TK (1998) The random subspace method for constructing decision forests. IEEE Trans Pattern Anal Mach Intell 20(8):832–844CrossRef
11.
Zurück zum Zitat Huang L, Dai Y (2005) A support vector machine approach for prediction of T cell epitopes. In: Proceedings of the third Asia–Pacific Bioinformatics conference (APBC2005), Singapore, pp 312–328 Huang L, Dai Y (2005) A support vector machine approach for prediction of T cell epitopes. In: Proceedings of the third Asia–Pacific Bioinformatics conference (APBC2005), Singapore, pp 312–328
12.
Zurück zum Zitat Jain A, Nandakumar K, Ross A (2005) Score normalization in multimodal biometric systems. Pattern Recognit 38(12):2270–2285CrossRef Jain A, Nandakumar K, Ross A (2005) Score normalization in multimodal biometric systems. Pattern Recognit 38(12):2270–2285CrossRef
13.
Zurück zum Zitat Kawashima S, Kanehisa M (2000) A index: amino acid index database. Nucleic Acids Res 28:374CrossRef Kawashima S, Kanehisa M (2000) A index: amino acid index database. Nucleic Acids Res 28:374CrossRef
14.
Zurück zum Zitat Kuncheva LI, Whitaker CJ (2003) Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy. Mach Learn 51:181–207MATHCrossRef Kuncheva LI, Whitaker CJ (2003) Measures of diversity in classifier ensembles and their relationship with the ensemble accuracy. Mach Learn 51:181–207MATHCrossRef
15.
Zurück zum Zitat Kittler J (1998) On combining classifiers. IEEE Trans Pattern Anal Mach Intell 20(3):226–239CrossRef Kittler J (1998) On combining classifiers. IEEE Trans Pattern Anal Mach Intell 20(3):226–239CrossRef
16.
Zurück zum Zitat Kittler J, Hatef M, Duin R, Matas J (1998) On combining classifiers. IEEE Trans Pattern Anal Mach Intell 20(3):226–239CrossRef Kittler J, Hatef M, Duin R, Matas J (1998) On combining classifiers. IEEE Trans Pattern Anal Mach Intell 20(3):226–239CrossRef
17.
Zurück zum Zitat Lumini A, Nanni L (2006a) Machine learning for HIV-1 protease cleavage site prediction. Pattern Recognit Lett, available online 2 May 2006 Lumini A, Nanni L (2006a) Machine learning for HIV-1 protease cleavage site prediction. Pattern Recognit Lett, available online 2 May 2006
18.
Zurück zum Zitat Lumini A, Nanni L (2006b) MppS: an ensemble of support vector machine based on multiple physicochemical properties of amino-acids. NeuroComput 69(13–15):1688–1690 Lumini A, Nanni L (2006b) MppS: an ensemble of support vector machine based on multiple physicochemical properties of amino-acids. NeuroComput 69(13–15):1688–1690
19.
Zurück zum Zitat Melville P, Mooney RJ (2003) Constructing diverse classifier ensembles using artificial training examples. In: Proceedings of the IJCAI, pp 505–510 Melville P, Mooney RJ (2003) Constructing diverse classifier ensembles using artificial training examples. In: Proceedings of the IJCAI, pp 505–510
20.
Zurück zum Zitat Nakai K, Kidera A, Kanehisa M (1988) Cluster analysis of amino acid indices for prediction of protein structure and function. Protein Eng 2:93–100CrossRef Nakai K, Kidera A, Kanehisa M (1988) Cluster analysis of amino acid indices for prediction of protein structure and function. Protein Eng 2:93–100CrossRef
21.
Zurück zum Zitat Nanni L (2006a) Experimental comparison of one-class classifiers for on-line signature verification. Neurocomputing 69(7–9):869–873CrossRef Nanni L (2006a) Experimental comparison of one-class classifiers for on-line signature verification. Neurocomputing 69(7–9):869–873CrossRef
22.
Zurück zum Zitat Nanni L (2006b) Comparison among feature extraction methods for HIV-1 protease cleavage site prediction. Pattern Recognit 39(4):711–713MATHCrossRef Nanni L (2006b) Comparison among feature extraction methods for HIV-1 protease cleavage site prediction. Pattern Recognit 39(4):711–713MATHCrossRef
23.
Zurück zum Zitat Nanni L (2006c) Machine learning algorithms for T-cell epitopes prediction. Neurocomputing 69(7–9):866–868CrossRef Nanni L (2006c) Machine learning algorithms for T-cell epitopes prediction. Neurocomputing 69(7–9):866–868CrossRef
24.
Zurück zum Zitat Nanni L, Lumini A (2005) Ensemble of Parzen Window Classifiers for on-line signature verification. NeuroComput 68(6):217–224 Nanni L, Lumini A (2005) Ensemble of Parzen Window Classifiers for on-line signature verification. NeuroComput 68(6):217–224
25.
Zurück zum Zitat Nanni L, Lumini A (2006a) A reliable method for HIV-1 protease cleavage site prediction. Neurocomputing 69(7–9):838–841CrossRef Nanni L, Lumini A (2006a) A reliable method for HIV-1 protease cleavage site prediction. Neurocomputing 69(7–9):838–841CrossRef
26.
Zurück zum Zitat Nanni L, Lumini A (2006b) An ensemble of K-local hyperplanes for predicting protein–protein interactions, Bioinformatics (22) (10):1207–1210 Nanni L, Lumini A (2006b) An ensemble of K-local hyperplanes for predicting protein–protein interactions, Bioinformatics (22) (10):1207–1210
27.
Zurück zum Zitat Nanni L, Lumini A (2007) Region Boost Learning for 2D + 3D based Face Recognition. Pattern Recognit Lett (in press) Nanni L, Lumini A (2007) Region Boost Learning for 2D + 3D based Face Recognition. Pattern Recognit Lett (in press)
28.
Zurück zum Zitat Oja E (1983) Subspace methods of pattern recognition. Research Studies Press Ltd, Letchworth Oja E (1983) Subspace methods of pattern recognition. Research Studies Press Ltd, Letchworth
29.
Zurück zum Zitat Opitz D, Maclin R (1999) Popular ensemble methods: an empirical study. JArtif Intell Res 11:169–198MATH Opitz D, Maclin R (1999) Popular ensemble methods: an empirical study. JArtif Intell Res 11:169–198MATH
30.
Zurück zum Zitat Pudil P, Novovicova J, Kittler J (1994) Floating search methods in feature selection. Pattern Recognit Lett 15(11):1119–1125CrossRef Pudil P, Novovicova J, Kittler J (1994) Floating search methods in feature selection. Pattern Recognit Lett 15(11):1119–1125CrossRef
31.
Zurück zum Zitat Rögnvaldsson T, You L (2003) Why neural networks should not be used for HIV-1 protease cleavage site prediction. Bioinformatics 1702–1709 Rögnvaldsson T, You L (2003) Why neural networks should not be used for HIV-1 protease cleavage site prediction. Bioinformatics 1702–1709
32.
Zurück zum Zitat Ruan J, Wang K, Yang J, Kurgan LA, Cios K (2005) Highly accurate and consistent method for prediction of elix and strand content from primary protein sequences. Artif Intell Med, online version 2005 Ruan J, Wang K, Yang J, Kurgan LA, Cios K (2005) Highly accurate and consistent method for prediction of elix and strand content from primary protein sequences. Artif Intell Med, online version 2005
33.
Zurück zum Zitat Schapire RE (2002) The boosting approach to machine learning: an overview. In: MSRI workshop on nonlinear estimation and classification, Berkeley Schapire RE (2002) The boosting approach to machine learning: an overview. In: MSRI workshop on nonlinear estimation and classification, Berkeley
34.
Zurück zum Zitat Tax DMJ (2001) One-class classification; concept-learning in the absence of counter-examples, Delft University of Technology, June 2001, ISBN: 90-75691-05-x Tax DMJ (2001) One-class classification; concept-learning in the absence of counter-examples, Delft University of Technology, June 2001, ISBN: 90-75691-05-x
35.
Zurück zum Zitat Tomii K, Kanehisa M (1996) Analysis of amino acid indices and mutation matrices for sequence comparison and structure prediction of proteins. Protein Eng 9:27–36CrossRef Tomii K, Kanehisa M (1996) Analysis of amino acid indices and mutation matrices for sequence comparison and structure prediction of proteins. Protein Eng 9:27–36CrossRef
36.
Zurück zum Zitat Whitaker CJ, Kuncheva LI (2003) Examining the relationship between majority vote accuracy and diversity in bagging and boosting, Technical Report, School of Informatics, University of Wales, Bangor Whitaker CJ, Kuncheva LI (2003) Examining the relationship between majority vote accuracy and diversity in bagging and boosting, Technical Report, School of Informatics, University of Wales, Bangor
37.
Zurück zum Zitat Zenobi G, Cunningham P (2001) Using diversity in preparing ensembles of classifiers based on different feature subsets to minimize generalization error, In: Raedt LD, Flach PA (eds) Proceedings of the 12th conference on machine learning, Lecture Notes in Computer Science 2167, pp 576–587 Zenobi G, Cunningham P (2001) Using diversity in preparing ensembles of classifiers based on different feature subsets to minimize generalization error, In: Raedt LD, Flach PA (eds) Proceedings of the 12th conference on machine learning, Lecture Notes in Computer Science 2167, pp 576–587
38.
Zurück zum Zitat Zhang GL et al (2005) Neural models for predicting viral vaccine targets. J Bioinform Comput Biol 3:1207–1225CrossRef Zhang GL et al (2005) Neural models for predicting viral vaccine targets. J Bioinform Comput Biol 3:1207–1225CrossRef
39.
Zurück zum Zitat Zhao Y, Pinilla C, Valmori D, Roland Martin R, Simon R (2003) Application of support vector machines for T-cell epitopes prediction. Bioinformatics, 19(15):1978–1984CrossRef Zhao Y, Pinilla C, Valmori D, Roland Martin R, Simon R (2003) Application of support vector machines for T-cell epitopes prediction. Bioinformatics, 19(15):1978–1984CrossRef
40.
Zurück zum Zitat Zhou Z,Yu Y (2005) Ensembling local learners through multimodal perturbation. IEEE Trans Syst Man Cyber B Cyber 35(4):725–735CrossRef Zhou Z,Yu Y (2005) Ensembling local learners through multimodal perturbation. IEEE Trans Syst Man Cyber B Cyber 35(4):725–735CrossRef
Metadaten
Titel
Machine learning multi-classifiers for peptide classification
verfasst von
Loris Nanni
Alessandra Lumini
Publikationsdatum
01.02.2009
Verlag
Springer-Verlag
Erschienen in
Neural Computing and Applications / Ausgabe 2/2009
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-007-0170-2

Weitere Artikel der Ausgabe 2/2009

Neural Computing and Applications 2/2009 Zur Ausgabe