Skip to main content

2021 | OriginalPaper | Buchkapitel

Feature Selection Optimization for Breast Cancer Diagnosis

verfasst von : Ana Rita Antunes, Marina A. Matos, Lino A. Costa, Ana Maria A. C. Rocha, Ana Cristina Braga

Erschienen in: Optimization, Learning Algorithms and Applications

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Cancer is one of the leading causes of death in the world, which has increased over the past few years. This disease can be classified as benign or malignant. One of the first and most common cancers that appear in the human body is breast cancer, which, as the name implies, appears in the breast regardless of the person’s gender. Machine learning has been widely used to assist in the diagnosis of breast cancer.
In this work, feature selection and multi-objective optimization are applied to the Breast Cancer Wisconsin Diagnostic dataset. It is intended to identify the most relevant characteristics to classify whether the diagnosis is benign or malignant. Two classifiers will be used in the feature selection task, one based on neural networks and the other on support vector machine. The objective functions to be used in the optimization process are to maximize sensitivity and specificity, simultaneously. A comparison was made between the techniques used and there was a better performance by neural networks.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Aalaei, S., Shahraki, H., Rowhanimanesh, A., Eslami, S.: Feature selection using genetic algorithm for breast cancer diagnosis: experiment on three different datasets. Iran. J. Basic Med. Sci. 19(5), 476 (2016) Aalaei, S., Shahraki, H., Rowhanimanesh, A., Eslami, S.: Feature selection using genetic algorithm for breast cancer diagnosis: experiment on three different datasets. Iran. J. Basic Med. Sci. 19(5), 476 (2016)
2.
Zurück zum Zitat Agarap, A.F.M.: On breast cancer detection: an application of machine learning algorithms on the Wisconsin diagnostic dataset. In: Proceedings of the 2nd International Conference on Machine Learning and Soft Computing, pp. 5–9 (2018) Agarap, A.F.M.: On breast cancer detection: an application of machine learning algorithms on the Wisconsin diagnostic dataset. In: Proceedings of the 2nd International Conference on Machine Learning and Soft Computing, pp. 5–9 (2018)
4.
Zurück zum Zitat Akay, M.F.: Support vector machines combined with feature selection for breast cancer diagnosis. Expert Syst. Appl. 36(2), 3240–3247 (2009)CrossRef Akay, M.F.: Support vector machines combined with feature selection for breast cancer diagnosis. Expert Syst. Appl. 36(2), 3240–3247 (2009)CrossRef
5.
Zurück zum Zitat Akram, M., Iqbal, M., Daniyal, M., Khan, A.U.: Awareness and current knowledge of breast cancer. Biol. Res. 50(1), 1–23 (2017)CrossRef Akram, M., Iqbal, M., Daniyal, M., Khan, A.U.: Awareness and current knowledge of breast cancer. Biol. Res. 50(1), 1–23 (2017)CrossRef
7.
Zurück zum Zitat Bhardwaj, A., Tiwari, A.: Breast cancer diagnosis using genetically optimized neural network model. Expert Syst. Appl. 42(10), 4611–4620 (2015)CrossRef Bhardwaj, A., Tiwari, A.: Breast cancer diagnosis using genetically optimized neural network model. Expert Syst. Appl. 42(10), 4611–4620 (2015)CrossRef
8.
Zurück zum Zitat Bonaccorso, G.: Machine Learning Algorithms: Popular Algorithms for Data Science and Machine Learning. Packt Publishing Ltd, Birmingham (2018) Bonaccorso, G.: Machine Learning Algorithms: Popular Algorithms for Data Science and Machine Learning. Packt Publishing Ltd, Birmingham (2018)
9.
Zurück zum Zitat Cai, J., Luo, J., Wang, S., Yang, S.: Feature selection in machine learning: a new perspective. Neurocomputing 300, 70–79 (2018)CrossRef Cai, J., Luo, J., Wang, S., Yang, S.: Feature selection in machine learning: a new perspective. Neurocomputing 300, 70–79 (2018)CrossRef
10.
Zurück zum Zitat Chen, H.L., Yang, B., Liu, J., Liu, D.Y.: A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis. Expert Syst. Appl. 38(7), 9014–9022 (2011)CrossRef Chen, H.L., Yang, B., Liu, J., Liu, D.Y.: A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis. Expert Syst. Appl. 38(7), 9014–9022 (2011)CrossRef
11.
Zurück zum Zitat Chougrad, H., Zouaki, H., Alheyane, O.: Deep convolutional neural networks for breast cancer screening. Comput. Methods Programs Biomed. 157, 19–30 (2018)CrossRef Chougrad, H., Zouaki, H., Alheyane, O.: Deep convolutional neural networks for breast cancer screening. Comput. Methods Programs Biomed. 157, 19–30 (2018)CrossRef
13.
Zurück zum Zitat Dumitru, C., Maria, V.: Advantages and disadvantages of using neural networks for predictions. Ovidius Univ. Ann. Ser. Econ. Sci. 13(1), 470–475 (2013) Dumitru, C., Maria, V.: Advantages and disadvantages of using neural networks for predictions. Ovidius Univ. Ann. Ser. Econ. Sci. 13(1), 470–475 (2013)
15.
Zurück zum Zitat Gupta, M., Gupta, B.: A comparative study of breast cancer diagnosis using supervised machine learning techniques. In: 2018 Second International Conference on Computing Methodologies and Communication (ICCMC), pp. 997–1002. IEEE (2018) Gupta, M., Gupta, B.: A comparative study of breast cancer diagnosis using supervised machine learning techniques. In: 2018 Second International Conference on Computing Methodologies and Communication (ICCMC), pp. 997–1002. IEEE (2018)
16.
Zurück zum Zitat Huang, S., Cai, N., Pacheco, P.P., Narrandes, S., Wang, Y., Xu, W.: Applications of support vector machine (SVM) learning in cancer genomics. Cancer Genomics-Proteomics 15(1), 41–51 (2018) Huang, S., Cai, N., Pacheco, P.P., Narrandes, S., Wang, Y., Xu, W.: Applications of support vector machine (SVM) learning in cancer genomics. Cancer Genomics-Proteomics 15(1), 41–51 (2018)
17.
Zurück zum Zitat Janssens, A.C.J., Martens, F.K.: Reflection on modern methods: revisiting the area under the ROC curve. Int. J. Epidemiol. 49(4), 1397–1403 (2020)CrossRef Janssens, A.C.J., Martens, F.K.: Reflection on modern methods: revisiting the area under the ROC curve. Int. J. Epidemiol. 49(4), 1397–1403 (2020)CrossRef
18.
Zurück zum Zitat Maldonado, S., Weber, R.: A wrapper method for feature selection using support vector machines. Inf. Sci. 179(13), 2208–2217 (2009)CrossRef Maldonado, S., Weber, R.: A wrapper method for feature selection using support vector machines. Inf. Sci. 179(13), 2208–2217 (2009)CrossRef
19.
Zurück zum Zitat MATLAB: version 9.9.0.1495850 (R2020b). The MathWorks Inc., Natick, Massachusetts (2020) MATLAB: version 9.9.0.1495850 (R2020b). The MathWorks Inc., Natick, Massachusetts (2020)
20.
Zurück zum Zitat Memon, M.H., Li, J.P., Haq, A.U., Memon, M.H., Zhou, W.: Breast cancer detection in the IoT health environment using modified recursive feature selection. Wireless Commun. Mob. Comput. 2019, 1–19 (2019) Memon, M.H., Li, J.P., Haq, A.U., Memon, M.H., Zhou, W.: Breast cancer detection in the IoT health environment using modified recursive feature selection. Wireless Commun. Mob. Comput. 2019, 1–19 (2019)
21.
Zurück zum Zitat Pisner, D.A., Schnyer, D.M.: Support vector machine. In: Machine Learning, pp. 101–121. Elsevier (2020) Pisner, D.A., Schnyer, D.M.: Support vector machine. In: Machine Learning, pp. 101–121. Elsevier (2020)
22.
Zurück zum Zitat Rejani, Y., Selvi, S.T.: Early detection of breast cancer using SVM classifier technique. arXiv preprint arXiv:0912.2314 (2009) Rejani, Y., Selvi, S.T.: Early detection of breast cancer using SVM classifier technique. arXiv preprint arXiv:​0912.​2314 (2009)
23.
Zurück zum Zitat Salama, G.I., Abdelhalim, M., Zeid, M.A.e.: Breast cancer diagnosis on three different datasets using multi-classifiers. Breast Cancer (WDBC) 32(569), 2 (2012) Salama, G.I., Abdelhalim, M., Zeid, M.A.e.: Breast cancer diagnosis on three different datasets using multi-classifiers. Breast Cancer (WDBC) 32(569), 2 (2012)
24.
Zurück zum Zitat Sharma, A., Kulshrestha, S., Daniel, S.: Machine learning approaches for breast cancer diagnosis and prognosis. In: 2017 International Conference on Soft Computing and Its Engineering Applications (icSoftComp), pp. 1–5. IEEE (2017) Sharma, A., Kulshrestha, S., Daniel, S.: Machine learning approaches for breast cancer diagnosis and prognosis. In: 2017 International Conference on Soft Computing and Its Engineering Applications (icSoftComp), pp. 1–5. IEEE (2017)
25.
Zurück zum Zitat Sokolova, M., Lapalme, G.: A systematic analysis of performance measures for classification tasks. Inf. Process. Manag. 45(4), 427–437 (2009)CrossRef Sokolova, M., Lapalme, G.: A systematic analysis of performance measures for classification tasks. Inf. Process. Manag. 45(4), 427–437 (2009)CrossRef
26.
Zurück zum Zitat Sun, Y.S., et al.: Risk factors and preventions of breast cancer. Int. J. Biol. Sci. 13(11), 1387 (2017)CrossRef Sun, Y.S., et al.: Risk factors and preventions of breast cancer. Int. J. Biol. Sci. 13(11), 1387 (2017)CrossRef
27.
Zurück zum Zitat Sung, H., et al.: Global cancer statistics 2020: Globocan estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 71(3), 209–249 (2021)MathSciNetCrossRef Sung, H., et al.: Global cancer statistics 2020: Globocan estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin. 71(3), 209–249 (2021)MathSciNetCrossRef
28.
Zurück zum Zitat Tu, J.V.: Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. J. Clin. Epidemiol. 49(11), 1225–1231 (1996)CrossRef Tu, J.V.: Advantages and disadvantages of using artificial neural networks versus logistic regression for predicting medical outcomes. J. Clin. Epidemiol. 49(11), 1225–1231 (1996)CrossRef
29.
Zurück zum Zitat Wolberg, W.H., Mangasarian, O.L.: Multisurface method of pattern separation for medical diagnosis applied to breast cytology. Proc. Natl. Acad. Sci. 87(23), 9193–9196 (1990)CrossRef Wolberg, W.H., Mangasarian, O.L.: Multisurface method of pattern separation for medical diagnosis applied to breast cytology. Proc. Natl. Acad. Sci. 87(23), 9193–9196 (1990)CrossRef
Metadaten
Titel
Feature Selection Optimization for Breast Cancer Diagnosis
verfasst von
Ana Rita Antunes
Marina A. Matos
Lino A. Costa
Ana Maria A. C. Rocha
Ana Cristina Braga
Copyright-Jahr
2021
DOI
https://doi.org/10.1007/978-3-030-91885-9_36

Premium Partner