Skip to main content
Erschienen in: Neural Computing and Applications 2/2015

01.02.2015 | Original Article

Hybrid intelligent system-based rough set and ensemble classifier for breast cancer diagnosis

verfasst von: A. H. El-Baz

Erschienen in: Neural Computing and Applications | Ausgabe 2/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

The effectiveness of classification and recognition systems has improved in a great deal to help medical experts in diagnosing diseases. Breast cancer is becoming a leading cause of death among women in the whole world; meanwhile, it is confirmed that the early detection and accurate diagnosis of this disease can ensure a long survival of the patients. This paper presents a hybrid intelligent system for recognition of breast cancer tumors. The proposed system includes two main modules: the feature extraction module and the predictor module. In the feature extraction module, rough set theory is used to preprocess the attributes on condition that the important information is not lost, deletes redundant attributes and conflicting objects from decision table. In the predictor module, a combined classifier is proposed based on K-nearest neighbor classifier. Experiments have been conducted on a widely used Wisconsin breast cancer dataset taken from University of California Irvine. Experimental results show that the proposed hybrid system can improve the rate of correct diagnosis of cases. The proposed combined classifier with rough set-based feature selection achieves 99.41 % classification accuracy and uses only 4 features which is the best shown to date. Different performance metrics are used to show the effectiveness of the proposed hybrid system. With these results, the proposed method is very promising compared to the previously reported results and can be used confidently for other breast cancer diagnosis problems.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Ghosh J (2002) Multiclassifier systems: back to the future. In: Roli F, Kittler J (eds) Multiple classifier systems. Lect Notes Comput Sci 2364:1–15 Ghosh J (2002) Multiclassifier systems: back to the future. In: Roli F, Kittler J (eds) Multiple classifier systems. Lect Notes Comput Sci 2364:1–15
3.
Zurück zum Zitat Zhang C, Ma Y (2012) Ensemble machine learning: methods and applications. Springer, BerlinCrossRef Zhang C, Ma Y (2012) Ensemble machine learning: methods and applications. Springer, BerlinCrossRef
4.
Zurück zum Zitat Etemad SA, Arya A (2014) Classification and translation of style and affect in human motion using RBF neural networks. Neurocomputing 129:585–595CrossRef Etemad SA, Arya A (2014) Classification and translation of style and affect in human motion using RBF neural networks. Neurocomputing 129:585–595CrossRef
5.
Zurück zum Zitat Meynet J, Thiran JP (2010) Information theoretic combination of pattern classifiers. Pattern Recogn 43(10):3412–3421CrossRefMATH Meynet J, Thiran JP (2010) Information theoretic combination of pattern classifiers. Pattern Recogn 43(10):3412–3421CrossRefMATH
6.
Zurück zum Zitat Wolberg WH, Mangasarian OL (1990) Multisurface method of pattern separation for medical diagnosis applied to breast cytology. Proc Natl Acad Sci USA 87(23):9193–9196CrossRefMATH Wolberg WH, Mangasarian OL (1990) Multisurface method of pattern separation for medical diagnosis applied to breast cytology. Proc Natl Acad Sci USA 87(23):9193–9196CrossRefMATH
7.
Zurück zum Zitat Quinlan JR (1996) Improved use of continuous attributes in C4.5. J Artif Intell Res 4:77–90MATH Quinlan JR (1996) Improved use of continuous attributes in C4.5. J Artif Intell Res 4:77–90MATH
8.
Zurück zum Zitat Hamilton HJ, Shan N, Cercone N (1996) RIAC: a rule induction algorithm based on approximate classification. Technical Report CS 96-06, University of Regina Hamilton HJ, Shan N, Cercone N (1996) RIAC: a rule induction algorithm based on approximate classification. Technical Report CS 96-06, University of Regina
9.
Zurück zum Zitat Ster B, Dobnikar A (1996) Neural networks in medical diagnosis: comparison with other methods. In: Proceedings of the international conference on engineering applications of neural networks, pp 427–430 Ster B, Dobnikar A (1996) Neural networks in medical diagnosis: comparison with other methods. In: Proceedings of the international conference on engineering applications of neural networks, pp 427–430
10.
Zurück zum Zitat Bennet KP, Blue JA (1997) A support vector machine approach to decision trees, Math Report, vols. 97–100, Rensselaer Polytechnic Institute Bennet KP, Blue JA (1997) A support vector machine approach to decision trees, Math Report, vols. 97–100, Rensselaer Polytechnic Institute
11.
Zurück zum Zitat Nauck D, Kruse R (1999) Obtaining interpretable fuzzy classification rules from medical data. Artif Intell Med 16:149–169CrossRef Nauck D, Kruse R (1999) Obtaining interpretable fuzzy classification rules from medical data. Artif Intell Med 16:149–169CrossRef
12.
Zurück zum Zitat Pena-Reyes CA, Sipper M (1999) A fuzzy-genetic approach to breast cancer diagnosis. Artif Intell Med 17:131–155CrossRef Pena-Reyes CA, Sipper M (1999) A fuzzy-genetic approach to breast cancer diagnosis. Artif Intell Med 17:131–155CrossRef
13.
Zurück zum Zitat Setiono R (2000) Generating concise and accurate classification rules for breast cancer diagnosis. Artif Intell Med 18:205–219CrossRef Setiono R (2000) Generating concise and accurate classification rules for breast cancer diagnosis. Artif Intell Med 18:205–219CrossRef
14.
Zurück zum Zitat Goodman DE, Boggess L, Watkins A (2002) Artificial immune system classification of multiple-class problems. In: Proceedings of the artificial neural networks in engineering ANNIE, pp 179–183 Goodman DE, Boggess L, Watkins A (2002) Artificial immune system classification of multiple-class problems. In: Proceedings of the artificial neural networks in engineering ANNIE, pp 179–183
15.
Zurück zum Zitat Abonyi J, Szeifert F (2003) Supervised fuzzy clustering for the identification of fuzzy classifiers. Pattern Recogn Lett 24:2195–2207CrossRefMATH Abonyi J, Szeifert F (2003) Supervised fuzzy clustering for the identification of fuzzy classifiers. Pattern Recogn Lett 24:2195–2207CrossRefMATH
16.
Zurück zum Zitat Polat K, Günes S (2007) Breast cancer diagnosis using least square support vector machine. Digit Signal Proc 17(4):694–701CrossRef Polat K, Günes S (2007) Breast cancer diagnosis using least square support vector machine. Digit Signal Proc 17(4):694–701CrossRef
17.
Zurück zum Zitat Guijarro-Berdias B, Fontenla-Romero O, Perez-Sanchez B, Fraguela P (2007) A linear learning method for multilayer perceptrons using leastsquares. Lect Notes Comput Sci 365–374 Guijarro-Berdias B, Fontenla-Romero O, Perez-Sanchez B, Fraguela P (2007) A linear learning method for multilayer perceptrons using leastsquares. Lect Notes Comput Sci 365–374
18.
Zurück zum Zitat Yang B, Wang L, Chen Z, Chen Y, Sun R (2010) A novel classification method using the combination of FDPS and flexible neural tree. Neurocomputing 73:690–699CrossRef Yang B, Wang L, Chen Z, Chen Y, Sun R (2010) A novel classification method using the combination of FDPS and flexible neural tree. Neurocomputing 73:690–699CrossRef
19.
Zurück zum Zitat Shafigh P, Yazdi Hadi S, Sohrab E (2013) Gravitation based classification. Inf Sci 220:319–330CrossRef Shafigh P, Yazdi Hadi S, Sohrab E (2013) Gravitation based classification. Inf Sci 220:319–330CrossRef
20.
Zurück zum Zitat Cateni S, Colla V, Vannucc M (2014) A method for resampling imbalanced data sets in binary classification tasks for real-world problems. Neurocomputing 135:32–41CrossRef Cateni S, Colla V, Vannucc M (2014) A method for resampling imbalanced data sets in binary classification tasks for real-world problems. Neurocomputing 135:32–41CrossRef
22.
Zurück zum Zitat Chen HL, Yang B, Liu J, Liu DY (2011) A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis. Expert Syst Appl 38:9014–9022CrossRef Chen HL, Yang B, Liu J, Liu DY (2011) A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis. Expert Syst Appl 38:9014–9022CrossRef
23.
Zurück zum Zitat Pawlak Z (1996) Why rough sets. In: Proceedings of the fifth IEEE international conference on fuzzy systems, vol 2, 8–11 September 1996, New Orleans, LA, USA, pp 738–743 Pawlak Z (1996) Why rough sets. In: Proceedings of the fifth IEEE international conference on fuzzy systems, vol 2, 8–11 September 1996, New Orleans, LA, USA, pp 738–743
24.
Zurück zum Zitat Rami N, Khushaba N, Al-Ani A, Al-Jumaily A (2011) Feature subset selection using differential evolution and a statistical repair mechanism. In: Expert systems with applications. Elsevier, pp 11515–11526 Rami N, Khushaba N, Al-Ani A, Al-Jumaily A (2011) Feature subset selection using differential evolution and a statistical repair mechanism. In: Expert systems with applications. Elsevier, pp 11515–11526
25.
Zurück zum Zitat Pawlak Z (1997) Rough set approach to knowledge-based decision support. Eur J Oper Res 99(1):48–57CrossRefMATH Pawlak Z (1997) Rough set approach to knowledge-based decision support. Eur J Oper Res 99(1):48–57CrossRefMATH
26.
Zurück zum Zitat Johnson DS (1974) Approximation algorithms for combinatorial problems. J Comput Syst Sci 9:256–278CrossRefMATH Johnson DS (1974) Approximation algorithms for combinatorial problems. J Comput Syst Sci 9:256–278CrossRefMATH
27.
Zurück zum Zitat Jensen R, Shen Q (2008) Computational intelligence and feature selection: rough and fuzzy approaches. Wiley Jensen R, Shen Q (2008) Computational intelligence and feature selection: rough and fuzzy approaches. Wiley
28.
Zurück zum Zitat Mitchell TM (1997) Machine learning. The McGraw-Hill Mitchell TM (1997) Machine learning. The McGraw-Hill
29.
Zurück zum Zitat Kittler J, Hatef M, Duin RPW, Matas J (1998) On combining classifiers. IEEE Trans Pattern Anal Mach Intell 20(3):226–239CrossRef Kittler J, Hatef M, Duin RPW, Matas J (1998) On combining classifiers. IEEE Trans Pattern Anal Mach Intell 20(3):226–239CrossRef
30.
Zurück zum Zitat Xu L, Krzyzak A, Suen CY (1992) Methods of combining multiple classifiers and their application to handwriting recognition. IEEE Trans SMC 22:418–435 Xu L, Krzyzak A, Suen CY (1992) Methods of combining multiple classifiers and their application to handwriting recognition. IEEE Trans SMC 22:418–435
31.
Zurück zum Zitat Schapire RE (1990) The strenght of weak learnability. Mach Learn 5:197–227 Schapire RE (1990) The strenght of weak learnability. Mach Learn 5:197–227
32.
Zurück zum Zitat Sokolova M, Japkowicz N, Szpakowicz S (2006) Beyond accuracy, F-score and ROC: a family of discriminant measures for performance evaluation. Adv Artif Intell 1015–1021 Sokolova M, Japkowicz N, Szpakowicz S (2006) Beyond accuracy, F-score and ROC: a family of discriminant measures for performance evaluation. Adv Artif Intell 1015–1021
33.
Zurück zum Zitat Kohavi R, Provost F (1998) Glossary of terms. Editorial for the Special Issue on Appl Mach Learn the Knowl Discov Process 30(2–3) Kohavi R, Provost F (1998) Glossary of terms. Editorial for the Special Issue on Appl Mach Learn the Knowl Discov Process 30(2–3)
34.
Zurück zum Zitat Tom F (2004) ROC graphs: notes and practical considerations for researchers. Mach Learn 31:1–38 Tom F (2004) ROC graphs: notes and practical considerations for researchers. Mach Learn 31:1–38
Metadaten
Titel
Hybrid intelligent system-based rough set and ensemble classifier for breast cancer diagnosis
verfasst von
A. H. El-Baz
Publikationsdatum
01.02.2015
Verlag
Springer London
Erschienen in
Neural Computing and Applications / Ausgabe 2/2015
Print ISSN: 0941-0643
Elektronische ISSN: 1433-3058
DOI
https://doi.org/10.1007/s00521-014-1731-9

Weitere Artikel der Ausgabe 2/2015

Neural Computing and Applications 2/2015 Zur Ausgabe