Skip to main content
Erschienen in: Soft Computing 7/2010

01.05.2010 | Original Paper

Multi-objective genetic fuzzy classifiers for imbalanced and cost-sensitive datasets

verfasst von: Pietro Ducange, Beatrice Lazzerini, Francesco Marcelloni

Erschienen in: Soft Computing | Ausgabe 7/2010

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

We exploit an evolutionary three-objective optimization algorithm to produce a Pareto front approximation composed of fuzzy rule-based classifiers (FRBCs) with different trade-offs between accuracy (expressed in terms of sensitivity and specificity) and complexity (computed as sum of the conditions in the antecedents of the classifier rules). Then, we use the ROC convex hull method to select the potentially optimal classifiers in the projection of the Pareto front approximation onto the ROC plane. Our method was tested on 13 highly imbalanced datasets and compared with 2 two-objective evolutionary approaches and one heuristic approach to FRBC generation, and with three well-known classifiers. We show by the Wilcoxon signed-rank test that our three-objective optimization approach outperforms all the other techniques, except for one classifier, in terms of the area under the ROC convex hull, an accuracy measure used to globally compare different classification approaches. Further, all the FRBCs in the ROC convex hull are characterized by a low value of complexity. Finally, we discuss how, the misclassification costs and the class distributions are fixed, we can select the most suitable classifier for the specific application. We show that the FRBC selected from the convex hull produced by our three-objective optimization approach achieves the lowest classification cost among the techniques used as comparison in two specific medical applications.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literatur
Zurück zum Zitat Alcalá R, Gacto MJ, Herrera F, Alcalá-Fdez J (2007) A multi-objective genetic algorithm for tuning and rule selection to obtain accurate and compact linguistic fuzzy rule-based systems. Int J Uncertain Fuzziness Knowl Based Syst 15(5):521–537. doi:10.1142/S0218488507004856 CrossRef Alcalá R, Gacto MJ, Herrera F, Alcalá-Fdez J (2007) A multi-objective genetic algorithm for tuning and rule selection to obtain accurate and compact linguistic fuzzy rule-based systems. Int J Uncertain Fuzziness Knowl Based Syst 15(5):521–537. doi:10.​1142/​S021848850700485​6 CrossRef
Zurück zum Zitat Alcalá-Fdez J, Sánchez L, García S, del Jesus MJ, Ventura S, Garrell JM, Otero J, Romero C, Bacardit J, Rivas VM, Fernández JC, Herrera F (2009) KEEL: a software tool to assess evolutionary algorithms to data mining problems. Soft Comput 13(3):307–318. doi:10.1007/s00500-008-0323-y CrossRef Alcalá-Fdez J, Sánchez L, García S, del Jesus MJ, Ventura S, Garrell JM, Otero J, Romero C, Bacardit J, Rivas VM, Fernández JC, Herrera F (2009) KEEL: a software tool to assess evolutionary algorithms to data mining problems. Soft Comput 13(3):307–318. doi:10.​1007/​s00500-008-0323-y CrossRef
Zurück zum Zitat Anastasio M, Kupinski M, Nishikawa R (1998) Optimization and FROC analysis of rule-based detection schemes using a multiobjective approach. IEEE Trans Med Imaging 17(10):1089–1093. doi:10.1109/42.746726 CrossRef Anastasio M, Kupinski M, Nishikawa R (1998) Optimization and FROC analysis of rule-based detection schemes using a multiobjective approach. IEEE Trans Med Imaging 17(10):1089–1093. doi:10.​1109/​42.​746726 CrossRef
Zurück zum Zitat Antonelli M, Frosini G, Lazzerini B, Marcelloni F (2006) A CAD system for lung nodule detection based on an anatomical model and a fuzzy neural network. In: Proceedings of NAFIPS, Montreal, Canada, 3–6 June, pp 448–453 Antonelli M, Frosini G, Lazzerini B, Marcelloni F (2006) A CAD system for lung nodule detection based on an anatomical model and a fuzzy neural network. In: Proceedings of NAFIPS, Montreal, Canada, 3–6 June, pp 448–453
Zurück zum Zitat Awai K, Murao K, Ozawa A, Komi M, Hayakawa H, Hori S, Nishimura Y (2004) Pulmonary nodules at chest CT: effect of computer-aided diagnosis on radiologists’ detection performance. Radiology 230(2):347–352. doi:10.1148/radiol.2302030049 CrossRef Awai K, Murao K, Ozawa A, Komi M, Hayakawa H, Hori S, Nishimura Y (2004) Pulmonary nodules at chest CT: effect of computer-aided diagnosis on radiologists’ detection performance. Radiology 230(2):347–352. doi:10.​1148/​radiol.​2302030049 CrossRef
Zurück zum Zitat Casillas J, Cordon O, Herrera F, Magdalena L (eds) (2003a) Accuracy improvements in linguistic fuzzy modeling. Springer, Berlin Casillas J, Cordon O, Herrera F, Magdalena L (eds) (2003a) Accuracy improvements in linguistic fuzzy modeling. Springer, Berlin
Zurück zum Zitat Casillas J, Cordon O, Herrera F, Magdalena L (eds) (2003b) Interpretability issues in fuzzy modeling. Springer, Berlin Casillas J, Cordon O, Herrera F, Magdalena L (eds) (2003b) Interpretability issues in fuzzy modeling. Springer, Berlin
Zurück zum Zitat Casillas J, Cordon O, Del Jesus MJ, Herrera F (2005) Genetic tuning of fuzzy rule deep structures preserving interpretability and its interaction with fuzzy rule set reduction. IEEE Trans Fuzzy Syst 13(1):13–29. doi:10.1109/TFUZZ.2004.839670 CrossRef Casillas J, Cordon O, Del Jesus MJ, Herrera F (2005) Genetic tuning of fuzzy rule deep structures preserving interpretability and its interaction with fuzzy rule set reduction. IEEE Trans Fuzzy Syst 13(1):13–29. doi:10.​1109/​TFUZZ.​2004.​839670 CrossRef
Zurück zum Zitat Chawla N, Bowyer K, Hall L, Kegelmeyer W (2002) Smote: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357MATH Chawla N, Bowyer K, Hall L, Kegelmeyer W (2002) Smote: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357MATH
Zurück zum Zitat Chi Z, Yan H, Pham T (1996) Fuzzy algorithms with applications to image processing and pattern recognition. World Scientific, Singapore Chi Z, Yan H, Pham T (1996) Fuzzy algorithms with applications to image processing and pattern recognition. World Scientific, Singapore
Zurück zum Zitat Cococcioni M, Ducange P, Lazzerini B, Marcelloni F (2007) A Pareto-based multi-objective evolutionary approach to the identification of Mamdani fuzzy systems. Soft Comput 11(11):1013–1031. doi:10.1007/s00500-007-0150-6 CrossRef Cococcioni M, Ducange P, Lazzerini B, Marcelloni F (2007) A Pareto-based multi-objective evolutionary approach to the identification of Mamdani fuzzy systems. Soft Comput 11(11):1013–1031. doi:10.​1007/​s00500-007-0150-6 CrossRef
Zurück zum Zitat Coello Coello CA, Lamont GB (2004) Applications of multi-objective evolutionary algorithms. World Scientific, Singapore Coello Coello CA, Lamont GB (2004) Applications of multi-objective evolutionary algorithms. World Scientific, Singapore
Zurück zum Zitat Cordon O, Del Jesus MJ, Herrera F (1999) A proposal on reasoning methods in fuzzy rule-based classification systems. Int J Approx Reason 20(1):21–45 Cordon O, Del Jesus MJ, Herrera F (1999) A proposal on reasoning methods in fuzzy rule-based classification systems. Int J Approx Reason 20(1):21–45
Zurück zum Zitat Cordon O, Herrera F, Hoffmann F, Magdalena L (2001) Genetic fuzzy systems. World Scientific, Singapore Cordon O, Herrera F, Hoffmann F, Magdalena L (2001) Genetic fuzzy systems. World Scientific, Singapore
Zurück zum Zitat Cordon O, Del Jesus MJ, Herrera F, Magdalena L, Villar P (2003) A multiobjective genetic learning process for joint feature selection and granularity and contexts learning in fuzzy rule-based classification systems. In: Casillas J, Cordon O, Herrera F, Magdalena L (eds) Accuracy improvements in linguistic fuzzy modeling. Springer, Berlin, pp 79–99 Cordon O, Del Jesus MJ, Herrera F, Magdalena L, Villar P (2003) A multiobjective genetic learning process for joint feature selection and granularity and contexts learning in fuzzy rule-based classification systems. In: Casillas J, Cordon O, Herrera F, Magdalena L (eds) Accuracy improvements in linguistic fuzzy modeling. Springer, Berlin, pp 79–99
Zurück zum Zitat Deb K (2001) Multi-objective optimization using evolutionary algorithms. Wiley, London Deb K (2001) Multi-objective optimization using evolutionary algorithms. Wiley, London
Zurück zum Zitat Fawcett T (2003) ROC graphs: Notes and practical considerations for researchers. Tech. Rep. HPL-2003-4, HP Labs Fawcett T (2003) ROC graphs: Notes and practical considerations for researchers. Tech. Rep. HPL-2003-4, HP Labs
Zurück zum Zitat Fernandez A, García S, Del Jesus MJ, Herrera F (2008) A study of the behaviour of linguistic fuzzy rule based classification systems in the framework of imbalanced data sets. Fuzzy Sets Syst 159(18):2378–2398. doi:10.1016/j.fss.2007.12.023 CrossRef Fernandez A, García S, Del Jesus MJ, Herrera F (2008) A study of the behaviour of linguistic fuzzy rule based classification systems in the framework of imbalanced data sets. Fuzzy Sets Syst 159(18):2378–2398. doi:10.​1016/​j.​fss.​2007.​12.​023 CrossRef
Zurück zum Zitat Ho SY, Chen HM, Ho SJ, Chen TK (2004) Design of accurate classifiers with a compact fuzzy-rule base using an evolutionary scatter partition of feature space. IEEE Trans Syst Man Cybern 34(2):1031–1044. doi:10.1109/TSMCB.2003.819160 CrossRef Ho SY, Chen HM, Ho SJ, Chen TK (2004) Design of accurate classifiers with a compact fuzzy-rule base using an evolutionary scatter partition of feature space. IEEE Trans Syst Man Cybern 34(2):1031–1044. doi:10.​1109/​TSMCB.​2003.​819160 CrossRef
Zurück zum Zitat Horn J, Nafpliotis N, Goldberg DE (1999) A niched Pareto genetic algorithm for multiobjective optimization. In: Proceedings of the first IEEE conference on evolutionary computation, Orlando, Florida, 27–29 June, pp 82–87 Horn J, Nafpliotis N, Goldberg DE (1999) A niched Pareto genetic algorithm for multiobjective optimization. In: Proceedings of the first IEEE conference on evolutionary computation, Orlando, Florida, 27–29 June, pp 82–87
Zurück zum Zitat Ishibuchi H (2007) Multiobjective genetic fuzzy systems: review and future research directions. In: Proceedings of the 2007 international conference on fuzzy systems, London, 23–26 July, pp 1-6 Ishibuchi H (2007) Multiobjective genetic fuzzy systems: review and future research directions. In: Proceedings of the 2007 international conference on fuzzy systems, London, 23–26 July, pp 1-6
Zurück zum Zitat Ishibuchi H, Murata T, Turksen IB (1997) Single-objective and two-objective genetic algorithms for selecting linguistic rules for pattern classification problems. Fuzzy Sets Syst 89(2):135–150. doi:10.1016/S0165-0114(96)00098-X CrossRef Ishibuchi H, Murata T, Turksen IB (1997) Single-objective and two-objective genetic algorithms for selecting linguistic rules for pattern classification problems. Fuzzy Sets Syst 89(2):135–150. doi:10.​1016/​S0165-0114(96)00098-X CrossRef
Zurück zum Zitat Ishibuchi H, Nakashima T, Nii M (2005a) Classification and modeling with linguistic information granules: advanced approaches to linguistic data Mining. Springer, Berlin Ishibuchi H, Nakashima T, Nii M (2005a) Classification and modeling with linguistic information granules: advanced approaches to linguistic data Mining. Springer, Berlin
Zurück zum Zitat Ishibuchi H, Nozaki K, Yamamoto N, Tanaka H (2005b) Selecting fuzzy if-then rules for classification problems using genetic algorithms. IEEE Trans Fuzzy Syst 3(3):260–270. doi:10.1109/91.413232 CrossRef Ishibuchi H, Nozaki K, Yamamoto N, Tanaka H (2005b) Selecting fuzzy if-then rules for classification problems using genetic algorithms. IEEE Trans Fuzzy Syst 3(3):260–270. doi:10.​1109/​91.​413232 CrossRef
Zurück zum Zitat Kupinski M, Anastasio M (1999) Multiobjective genetic optimization of diagnostic classifiers with implications for generating receiver operating characteristic curves. IEEE Trans Med Imaging 18(8):675–685. doi:10.1109/42.796281 CrossRef Kupinski M, Anastasio M (1999) Multiobjective genetic optimization of diagnostic classifiers with implications for generating receiver operating characteristic curves. IEEE Trans Med Imaging 18(8):675–685. doi:10.​1109/​42.​796281 CrossRef
Zurück zum Zitat Quinlan JR (1993) C4.5: Programs for Machine Learning. Morgan Kauffman, San Mateo Quinlan JR (1993) C4.5: Programs for Machine Learning. Morgan Kauffman, San Mateo
Zurück zum Zitat Sheskin D (2003) Handbook of parametric and nonparametric statistical procedures. Chapman & Hall/CRC, London/Boca Raton Sheskin D (2003) Handbook of parametric and nonparametric statistical procedures. Chapman & Hall/CRC, London/Boca Raton
Zurück zum Zitat Woods K, Doss C, Bowyer K, Solka J, Priebe J, Kegelmeyer P (1993) Comparative evaluation of pattern recognition techniques for detection of microcalcifications in mammography. Int J Pattern Recognit Artif Intell 7(6):1417–1436. doi:10.1142/S0218001493000698 CrossRef Woods K, Doss C, Bowyer K, Solka J, Priebe J, Kegelmeyer P (1993) Comparative evaluation of pattern recognition techniques for detection of microcalcifications in mammography. Int J Pattern Recognit Artif Intell 7(6):1417–1436. doi:10.​1142/​S021800149300069​8 CrossRef
Zurück zum Zitat Yen J, Wang L, Gillespie GW (1998) Improving the interpretability of TSK fuzzy models by combining global learning and local learning. IEEE Trans Fuzzy Syst 6(4):530–537. doi:10.1109/91.728447 CrossRef Yen J, Wang L, Gillespie GW (1998) Improving the interpretability of TSK fuzzy models by combining global learning and local learning. IEEE Trans Fuzzy Syst 6(4):530–537. doi:10.​1109/​91.​728447 CrossRef
Zurück zum Zitat Zitzler E, Thiele L (1999) Multiobjective evolutionary algorithms: a comparative case study and the strength Pareto approach. IEEE Trans Evol Comput 3:257–271. doi:10.1109/4235.797969 CrossRef Zitzler E, Thiele L (1999) Multiobjective evolutionary algorithms: a comparative case study and the strength Pareto approach. IEEE Trans Evol Comput 3:257–271. doi:10.​1109/​4235.​797969 CrossRef
Zurück zum Zitat Zitzler E, Laumanns M, Thiele L (2001) SPEA2: Improving the strength Pareto evolutionary algorithm for multiobjective optimization. In: Proceedings of EUROGEN2001 evolutionary methods for design, opt. and control with applications to industrial problems, Athens, pp 95–100 Zitzler E, Laumanns M, Thiele L (2001) SPEA2: Improving the strength Pareto evolutionary algorithm for multiobjective optimization. In: Proceedings of EUROGEN2001 evolutionary methods for design, opt. and control with applications to industrial problems, Athens, pp 95–100
Metadaten
Titel
Multi-objective genetic fuzzy classifiers for imbalanced and cost-sensitive datasets
verfasst von
Pietro Ducange
Beatrice Lazzerini
Francesco Marcelloni
Publikationsdatum
01.05.2010
Verlag
Springer-Verlag
Erschienen in
Soft Computing / Ausgabe 7/2010
Print ISSN: 1432-7643
Elektronische ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-009-0460-y

Weitere Artikel der Ausgabe 7/2010

Soft Computing 7/2010 Zur Ausgabe