Skip to main content
Erschienen in: Pattern Analysis and Applications 4/2015

01.11.2015 | Theoretical Advances

A GA-based feature selection and parameter optimization of an ANN in diagnosing breast cancer

verfasst von: Fadzil Ahmad, Nor Ashidi Mat Isa, Zakaria Hussain, Muhammad Khusairi Osman, Siti Noraini Sulaiman

Erschienen in: Pattern Analysis and Applications | Ausgabe 4/2015

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Breast cancer is the most common cancer diagnosed and cause of death among women worldwide. There is evidence that early detection and treatment can increase the survival rate of breast cancer patients. The traditional method for diagnosing the disease relies on human experiences to identify the presence of certain pattern from the database. It is prone to human error, time consuming and labour intensive. Therefore, this work proposes an automatic breast cancer diagnosis technique using a genetic algorithm (GA) for simultaneous feature selection and parameter optimization of an artificial neural network (ANN). The proposed algorithm is implemented with three different variations of the backpropagation technique namely the resilient back-propagation (GAANN_RP), Levenberg–Marquardt (GAANN_LM) and gradient descent with momentum (GAANN_GD) for fine tuning of the weight of ANN, and their performances are compared. Besides, the effect of the feature selection and manual determination of the hidden node size has also been investigated. Interestingly, one of the proposed algorithms called GAANN_RP produces the best and on average, 99.24 and 98.29 % correct classification, respectively, on the Wisconsin breast cancer dataset, which is comparable with the results gathered from other works found in the literature.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Übeyli ED (2007) Implementing automated diagnostic systems for breast cancer detection. Expert Syst Appl 33(4):1054–1062CrossRef Übeyli ED (2007) Implementing automated diagnostic systems for breast cancer detection. Expert Syst Appl 33(4):1054–1062CrossRef
3.
Zurück zum Zitat Furundzic D, Djordjevic M, Jovicevic Bekic A (1998) Neural networks approach to early breast cancer detection. J Syst Archit 44(8):617–633CrossRef Furundzic D, Djordjevic M, Jovicevic Bekic A (1998) Neural networks approach to early breast cancer detection. J Syst Archit 44(8):617–633CrossRef
4.
Zurück zum Zitat Karabatak M, Ince MC (2009) An expert system for detection of breast cancer based on association rules and neural network. Expert Syst Appl 36(2, Part 2):3465–3469 Karabatak M, Ince MC (2009) An expert system for detection of breast cancer based on association rules and neural network. Expert Syst Appl 36(2, Part 2):3465–3469
5.
Zurück zum Zitat Rogers SK, Ruck DW, Kabrisky M (1994) Artificial neural networks for early detection and diagnosis of cancer. Cancer Lett 77(2–3):79–83CrossRef Rogers SK, Ruck DW, Kabrisky M (1994) Artificial neural networks for early detection and diagnosis of cancer. Cancer Lett 77(2–3):79–83CrossRef
6.
Zurück zum Zitat Paliwal M, Kumar UA (2009) Neural networks and statistical techniques: a review of applications. Expert Syst Appl 36(1):2–17CrossRef Paliwal M, Kumar UA (2009) Neural networks and statistical techniques: a review of applications. Expert Syst Appl 36(1):2–17CrossRef
7.
Zurück zum Zitat Walczak S, Cerpa N (1999) Heuristic principles for the design of artificial neural networks. Inf Softw Technol 41(2):107–117CrossRef Walczak S, Cerpa N (1999) Heuristic principles for the design of artificial neural networks. Inf Softw Technol 41(2):107–117CrossRef
8.
Zurück zum Zitat Rudy S, Huan L (1997) Neural-network feature selector. IEEE Trans Neural Netw 8(3):654–662CrossRef Rudy S, Huan L (1997) Neural-network feature selector. IEEE Trans Neural Netw 8(3):654–662CrossRef
9.
Zurück zum Zitat Verikas A, Bacauskiene M (2002) Feature selection with neural networks. Pattern Recognit Lett 23(11):1323–1335MATHCrossRef Verikas A, Bacauskiene M (2002) Feature selection with neural networks. Pattern Recognit Lett 23(11):1323–1335MATHCrossRef
10.
Zurück zum Zitat Kabir MM, Islam MM, Murase K (2010) A new wrapper feature selection approach using neural network. Neurocomputing 73(16–18):3273–3283CrossRef Kabir MM, Islam MM, Murase K (2010) A new wrapper feature selection approach using neural network. Neurocomputing 73(16–18):3273–3283CrossRef
11.
Zurück zum Zitat Tian J, Li M, Chen F (2010) Dual-population based coevolutionary algorithm for designing RBFNN with feature selection. Expert Syst Appl 37(10):6904–6918CrossRef Tian J, Li M, Chen F (2010) Dual-population based coevolutionary algorithm for designing RBFNN with feature selection. Expert Syst Appl 37(10):6904–6918CrossRef
12.
Zurück zum Zitat Huang C-L, Wang C-J (2006) A GA-based feature selection and parameters optimization for support vector machines. Expert Syst Appl 31(2):231–240CrossRef Huang C-L, Wang C-J (2006) A GA-based feature selection and parameters optimization for support vector machines. Expert Syst Appl 31(2):231–240CrossRef
13.
Zurück zum Zitat Castillo PA, Merelo JJ, Prieto A, Rivas V, Romero G (2000) G-Prop: global optimization of multilayer perceptrons using GAs. Neurocomputing 35(1–4):149–163MATHCrossRef Castillo PA, Merelo JJ, Prieto A, Rivas V, Romero G (2000) G-Prop: global optimization of multilayer perceptrons using GAs. Neurocomputing 35(1–4):149–163MATHCrossRef
14.
Zurück zum Zitat Kuo RJ (2001) A sales forecasting system based on fuzzy neural network with initial weights generated by genetic algorithm. Eur J Oper Res 129(3):496–517MATHCrossRef Kuo RJ (2001) A sales forecasting system based on fuzzy neural network with initial weights generated by genetic algorithm. Eur J Oper Res 129(3):496–517MATHCrossRef
15.
Zurück zum Zitat Kermani BG, White MW, Nagle HT (1995) Feature extraction by genetic algorithms for neural networks in breast cancer classification. In: Proceedings of the 17th annual conference on IEEE engineering in medicine and biology society, vol 831, pp 831–832 Kermani BG, White MW, Nagle HT (1995) Feature extraction by genetic algorithms for neural networks in breast cancer classification. In: Proceedings of the 17th annual conference on IEEE engineering in medicine and biology society, vol 831, pp 831–832
16.
Zurück zum Zitat Verma B, Zhang P (2007) A novel neural-genetic algorithm to find the most significant combination of features in digital mammograms. Appl Soft Comput 7(2):612–625CrossRef Verma B, Zhang P (2007) A novel neural-genetic algorithm to find the most significant combination of features in digital mammograms. Appl Soft Comput 7(2):612–625CrossRef
17.
Zurück zum Zitat Palaniappan R, Eswaran C (2009) Using genetic algorithm to select the presentation order of training patterns that improves simplified fuzzy ARTMAP classification performance. Appl Soft Comput 9(1):100–106CrossRef Palaniappan R, Eswaran C (2009) Using genetic algorithm to select the presentation order of training patterns that improves simplified fuzzy ARTMAP classification performance. Appl Soft Comput 9(1):100–106CrossRef
18.
Zurück zum Zitat Ferentinos KP (2005) Biological engineering applications of feedforward neural networks designed and parameterized by genetic algorithms. Neural Netw 18(7):934–950CrossRef Ferentinos KP (2005) Biological engineering applications of feedforward neural networks designed and parameterized by genetic algorithms. Neural Netw 18(7):934–950CrossRef
19.
Zurück zum Zitat Almeida LM, Ludermir TB (2010) A multi-objective memetic and hybrid methodology for optimizing the parameters and performance of artificial neural networks. Neurocomputing 73(7–9):1438–1450CrossRef Almeida LM, Ludermir TB (2010) A multi-objective memetic and hybrid methodology for optimizing the parameters and performance of artificial neural networks. Neurocomputing 73(7–9):1438–1450CrossRef
20.
Zurück zum Zitat Xin Y (1999) Evolving artificial neural networks. Proc IEEE 87(9):1423–1447CrossRef Xin Y (1999) Evolving artificial neural networks. Proc IEEE 87(9):1423–1447CrossRef
21.
Zurück zum Zitat Hall MA (1999) Correlation-based feature selection for machine learning. Ph.D. thesis, Department of Computer Science, University of Waikato, Hamilton Hall MA (1999) Correlation-based feature selection for machine learning. Ph.D. thesis, Department of Computer Science, University of Waikato, Hamilton
22.
Zurück zum Zitat Sun Y (2007) Iterative RELIEF for feature weighting: algorithms, theories, and applications. IEEE Trans Pattern Anal Mach Intell 29(6):1035–1051CrossRef Sun Y (2007) Iterative RELIEF for feature weighting: algorithms, theories, and applications. IEEE Trans Pattern Anal Mach Intell 29(6):1035–1051CrossRef
23.
Zurück zum Zitat Peng H, Fulmi L, Ding C (2005) Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans Pattern Anal Mach Intell 27(8):1226–1238CrossRef Peng H, Fulmi L, Ding C (2005) Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy. IEEE Trans Pattern Anal Mach Intell 27(8):1226–1238CrossRef
24.
Zurück zum Zitat Kim G, Kim Y, Lim H, Kim H (2010) An MLP-based feature subset selection for HIV-1 protease cleavage site analysis. Artif Intell Med 48(2):83–89MathSciNetCrossRef Kim G, Kim Y, Lim H, Kim H (2010) An MLP-based feature subset selection for HIV-1 protease cleavage site analysis. Artif Intell Med 48(2):83–89MathSciNetCrossRef
25.
Zurück zum Zitat Rumelhart D, Hinton G, Williams R (1986) Learning representations by back-propagating errors. Nature 323:533–536CrossRef Rumelhart D, Hinton G, Williams R (1986) Learning representations by back-propagating errors. Nature 323:533–536CrossRef
26.
Zurück zum Zitat Kolen JF, Pollack JB (1991) Back propagation is sensitive to initial conditions. Adv Neural Inf Process Syst 3:860–867 Kolen JF, Pollack JB (1991) Back propagation is sensitive to initial conditions. Adv Neural Inf Process Syst 3:860–867
27.
Zurück zum Zitat Gupta JND, Sexton RS (1999) Comparing backpropagation with a genetic algorithm for neural network training. Omega 27(6):679–684CrossRef Gupta JND, Sexton RS (1999) Comparing backpropagation with a genetic algorithm for neural network training. Omega 27(6):679–684CrossRef
28.
Zurück zum Zitat K-j Kim (2006) Artificial neural networks with evolutionary instance selection for financial forecasting. Expert Syst Appl 30(3):519–526CrossRef K-j Kim (2006) Artificial neural networks with evolutionary instance selection for financial forecasting. Expert Syst Appl 30(3):519–526CrossRef
29.
Zurück zum Zitat Sexton RS, Dorsey RE, Johnson JD (1998) Toward global optimization of neural networks: a comparison of the genetic algorithm and backpropagation. Decis Support Syst 22(2):171–185CrossRef Sexton RS, Dorsey RE, Johnson JD (1998) Toward global optimization of neural networks: a comparison of the genetic algorithm and backpropagation. Decis Support Syst 22(2):171–185CrossRef
30.
Zurück zum Zitat Fogel D, Wasson E III, Boughton E (1995) Evolving neural networks for detecting breast cancer. Cancer Lett 96(1):49–54CrossRef Fogel D, Wasson E III, Boughton E (1995) Evolving neural networks for detecting breast cancer. Cancer Lett 96(1):49–54CrossRef
32.
Zurück zum Zitat Arauzo-Azofra A, Aznarte JL, Benítez JM (2011) Empirical study of feature selection methods based on individual feature evaluation for classification problems. Expert Syst Appl 38(7):8170–8177CrossRef Arauzo-Azofra A, Aznarte JL, Benítez JM (2011) Empirical study of feature selection methods based on individual feature evaluation for classification problems. Expert Syst Appl 38(7):8170–8177CrossRef
33.
Zurück zum Zitat Leung F, Lam H, Ling S, Tam P (2003) Tuning of the structure and parameters of a neural network using an improved genetic algorithm. IEEE Trans Neural Netw 14(1):79–88CrossRef Leung F, Lam H, Ling S, Tam P (2003) Tuning of the structure and parameters of a neural network using an improved genetic algorithm. IEEE Trans Neural Netw 14(1):79–88CrossRef
35.
Zurück zum Zitat Hsu H–H, Hsieh C-W, Lu M-D (2011) Hybrid feature selection by combining filters and wrappers. Expert Syst Appl 38(7):8144–8150CrossRef Hsu H–H, Hsieh C-W, Lu M-D (2011) Hybrid feature selection by combining filters and wrappers. Expert Syst Appl 38(7):8144–8150CrossRef
36.
Zurück zum Zitat Zhao M, Fu C, Ji L, Tang K, Zhou M (2011) Feature selection and parameter optimization for support vector machines: a new approach based on genetic algorithm with feature chromosomes. Expert Syst Appl 38(5):5197–5204CrossRef Zhao M, Fu C, Ji L, Tang K, Zhou M (2011) Feature selection and parameter optimization for support vector machines: a new approach based on genetic algorithm with feature chromosomes. Expert Syst Appl 38(5):5197–5204CrossRef
37.
Zurück zum Zitat Goldberg D (1989) Genetic algorithms in search and optimization, 1st edn. Addison-Wesley, BostonMATH Goldberg D (1989) Genetic algorithms in search and optimization, 1st edn. Addison-Wesley, BostonMATH
38.
Zurück zum Zitat Prechelt L (1994) Proben1: a set of neural network benchmark problems and benchmarking rules. Technical Report, University of Karlsruhe, Karlsruhe, Germany Prechelt L (1994) Proben1: a set of neural network benchmark problems and benchmarking rules. Technical Report, University of Karlsruhe, Karlsruhe, Germany
39.
Zurück zum Zitat Esugasini S, Mashor M, Isa N, Othman N (2005) Performance comparison for MLP networks using various back propagation algorithms for breast cancer diagnosis. In: Knowledge-based intelligent information and engineering systems. Lecture notes in computer science, vol 3682. Springer, Berlin, pp 166–166. doi:10.1007/11552451_17 Esugasini S, Mashor M, Isa N, Othman N (2005) Performance comparison for MLP networks using various back propagation algorithms for breast cancer diagnosis. In: Knowledge-based intelligent information and engineering systems. Lecture notes in computer science, vol 3682. Springer, Berlin, pp 166–166. doi:10.​1007/​11552451_​17
40.
Zurück zum Zitat Riedmiller M, Braun H (1992) RPROP-A fast adaptive learning algorithm. In: Proceedings of the international symposium computer information sciences, Antalya, pp 279–285 Riedmiller M, Braun H (1992) RPROP-A fast adaptive learning algorithm. In: Proceedings of the international symposium computer information sciences, Antalya, pp 279–285
41.
Zurück zum Zitat Quinlan JR (1996) Improved use of continuous attributes in C4.5. J Artif Int Res 4(1):77–90 Quinlan JR (1996) Improved use of continuous attributes in C4.5. J Artif Int Res 4(1):77–90
42.
Zurück zum Zitat Hamilton HJ, Shan N, Cercone N (1996) RIAC: a rule induction algorithm based on approximate classification. Paper presented at the international conference on engineering applications of neural networks, University of Regina Hamilton HJ, Shan N, Cercone N (1996) RIAC: a rule induction algorithm based on approximate classification. Paper presented at the international conference on engineering applications of neural networks, University of Regina
43.
Zurück zum Zitat Nauck D, Kruse R (1999) Obtaining interpretable fuzzy classification rules from medical data. Artif Intell Med 16(2):149–169MathSciNetCrossRef Nauck D, Kruse R (1999) Obtaining interpretable fuzzy classification rules from medical data. Artif Intell Med 16(2):149–169MathSciNetCrossRef
44.
Zurück zum Zitat Peña-Reyes CA, Sipper M (1999) A fuzzy-genetic approach to breast cancer diagnosis. Artif Intell Med 17(2):131–155CrossRef Peña-Reyes CA, Sipper M (1999) A fuzzy-genetic approach to breast cancer diagnosis. Artif Intell Med 17(2):131–155CrossRef
45.
Zurück zum Zitat Setiono R (2000) Generating concise and accurate classification rules for breast cancer diagnosis. Artif Intell Med 18(3):205–219CrossRef Setiono R (2000) Generating concise and accurate classification rules for breast cancer diagnosis. Artif Intell Med 18(3):205–219CrossRef
46.
Zurück zum Zitat Albrecht AA, Lappas G, Vinterbo SA, Wong C, Ohno-Machado L (2002) Two applications of the LSA machine. In: Proceedings of the 9th international conference on neural information processing (ICONIP’02), Singapore, pp 184–189 Albrecht AA, Lappas G, Vinterbo SA, Wong C, Ohno-Machado L (2002) Two applications of the LSA machine. In: Proceedings of the 9th international conference on neural information processing (ICONIP’02), Singapore, pp 184–189
47.
Zurück zum Zitat Abonyi J, Szeifert F (2003) Supervised fuzzy clustering for the identification of fuzzy classifiers. Pattern Recognit Lett 24(14):2195–2207MATHCrossRef Abonyi J, Szeifert F (2003) Supervised fuzzy clustering for the identification of fuzzy classifiers. Pattern Recognit Lett 24(14):2195–2207MATHCrossRef
48.
Zurück zum Zitat Polat K, Günes S (2007) Breast cancer diagnosis using least square support vector machine. Digit Signal Proc 17(4):694–701CrossRef Polat K, Günes S (2007) Breast cancer diagnosis using least square support vector machine. Digit Signal Proc 17(4):694–701CrossRef
49.
Zurück zum Zitat Guijarro-Berdiñas B, Fontenla-Romero O, Pérez-Sánchez B, Fraguela P (2007) A linear learning method for multilayer perceptrons using least-squares. In: Intelligent data engineering and automated learning (IDEAL’07). Lecture notes in computer science, vol 4881. Springer, Berlin, pp 365–374. doi:10.1007/978-3-540-77226-2_38 Guijarro-Berdiñas B, Fontenla-Romero O, Pérez-Sánchez B, Fraguela P (2007) A linear learning method for multilayer perceptrons using least-squares. In: Intelligent data engineering and automated learning (IDEAL’07). Lecture notes in computer science, vol 4881. Springer, Berlin, pp 365–374. doi:10.​1007/​978-3-540-77226-2_​38
50.
Zurück zum Zitat Akay MF (2009) Support vector machines combined with feature selection for breast cancer diagnosis. Expert Syst Appl 36(2, Part 2):3240–3247 Akay MF (2009) Support vector machines combined with feature selection for breast cancer diagnosis. Expert Syst Appl 36(2, Part 2):3240–3247
51.
Zurück zum Zitat Peng Y, Wu Z, Jiang J (2010) A novel feature selection approach for biomedical data classification. J Biomed Inform 43(1):15–23CrossRef Peng Y, Wu Z, Jiang J (2010) A novel feature selection approach for biomedical data classification. J Biomed Inform 43(1):15–23CrossRef
52.
Zurück zum Zitat Marcano-Cedeño A, Quintanilla-Domínguez J, Andina D (2011) WBCD breast cancer database classification applying artificial metaplasticity neural network. Expert Syst Appl 38(8):9573–9579CrossRef Marcano-Cedeño A, Quintanilla-Domínguez J, Andina D (2011) WBCD breast cancer database classification applying artificial metaplasticity neural network. Expert Syst Appl 38(8):9573–9579CrossRef
53.
Zurück zum Zitat Stoean R, Stoean C (2013) Modeling medical decision making by support vector machines, explaining by rules of evolutionary algorithms with feature selection. Expert Syst Appl 40(7):2677–2686. doi:10.1016/j.eswa.2012.11.007 Stoean R, Stoean C (2013) Modeling medical decision making by support vector machines, explaining by rules of evolutionary algorithms with feature selection. Expert Syst Appl 40(7):2677–2686. doi:10.​1016/​j.​eswa.​2012.​11.​007
Metadaten
Titel
A GA-based feature selection and parameter optimization of an ANN in diagnosing breast cancer
verfasst von
Fadzil Ahmad
Nor Ashidi Mat Isa
Zakaria Hussain
Muhammad Khusairi Osman
Siti Noraini Sulaiman
Publikationsdatum
01.11.2015
Verlag
Springer London
Erschienen in
Pattern Analysis and Applications / Ausgabe 4/2015
Print ISSN: 1433-7541
Elektronische ISSN: 1433-755X
DOI
https://doi.org/10.1007/s10044-014-0375-9

Weitere Artikel der Ausgabe 4/2015

Pattern Analysis and Applications 4/2015 Zur Ausgabe