Skip to main content
Log in

A Comparative Analysis of Deep Learning Approaches for Predicting Breast Cancer Survivability

  • Survey article
  • Published:
Archives of Computational Methods in Engineering Aims and scope Submit manuscript

Abstract

Breast cancer is the second largest cause of mortality among women. Breast cancer patients in developed nations have a relative survival rate of more than 5-years due to early detection and treatment. Deep learning approaches can help enhance the identification of breast cancer cells, lower the risk of detection mistakes, and minimize the time it takes to diagnose breast cancer using human methods. This paper examines the accuracy of artificial neural networks, Restricted Boltzmann Machine, Deep Autoencoders, and Convolutional Neural Networks (CNN) for post-operative survival analysis of breast cancer patients. A thorough examination of each network's operation and design is carried out to determine which network outperforms the other, followed by an analysis based on the network's prediction accurateness. The experimental results assert that all the deep learning techniques can predict the survival of breast cancer patients. The accuracy score achieved by Restricted Boltzmann Machine performed is the highest (0.97), followed by deep Autoencoders that attained an accuracy score of 0.96. CNN achieved a 92% accuracy score, while artificial neural networks attained the least accuracy score (0.89). The prediction performance of models has been evaluated using distinct parameters like accuracy, the area under the curve, F1 Score, Matthew’s correlation coefficient, sensitivity, and specificity. Also, the models have been validated using fivefold cross-validation techniques. However, there is still a need for complete analysis and research using deep learning methods to determine the design that provides superior accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Data Availability

The datasets and Python code for data preprocessing and constructing deep learning models is publicly available online on Github (on request): https://github.com/surbhigupta24/Breast-Cancer-Survival-Prediction

References

  1. Sung H et al (2020) Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 71(3):209–249

    Article  MathSciNet  Google Scholar 

  2. Islami F et al (2018) Proportion and number of cancer cases and deaths attributable to potentially modifiable risk factors in the United States. CA Cancer J Clin 68(1):31–54

    Article  Google Scholar 

  3. Kumar Y, Gupta S, Singla R, & Hu YC (2021) A systematic review of artificial intelligence techniques in cancer prediction and diagnosis. Arch Comput Methods Eng 1–28

  4. Gupta S, Gupta MK (2021) Computational model for prediction of malignant mesothelioma diagnosis. Comput J

  5. Chang CH, Sibala JL, Fritz SL, Dwyer 3rd SJ, Templeton AW, Lin F, Jewell WR (1980) Computed tomography in detection and diagnosis of breast cancer. Cancer 46(4 Suppl):939–946

  6. Tapak L, Shirmohammadi-khorram N, Amini P, Alafchi B (2019) Prediction of survival and metastasis in breast cancer patients using machine learning classifiers. Clin Epidemiol Glob Heal 7(3):293–299

    Article  Google Scholar 

  7. Afshar HL, Ahmadi M, Roudbari M, Sadoughi F (2015) Prediction of breast cancer survival through knowledge discovery in databases. Glob J Health Sci 7(4):392–398

    Google Scholar 

  8. Kononenko I (2001) Machine learning for medical diagnosis: history, state of the art and perspective. Artif Intell Med 23(1):89–109

    Article  Google Scholar 

  9. Zhu W, Xie L, Han J, Guo X (2020) The application of deep learning in cancer prognosis prediction. Cancers 12(3):603

  10. Kourou K, Exarchos TP, Exarchos KP, Karamouzis MV, Fotiadis DI (2015) Machine learning applications in cancer prognosis and prediction. Comput Struct Biotechnol J 13:8–17

    Article  Google Scholar 

  11. Kim W et al (2012) Development of novel breast cancer recurrence prediction model using support vector machine. J Breast Cancer 15(2):230–238

    Article  Google Scholar 

  12. Gupta S, Gupta MK (2021) Computational prediction of cervical cancer diagnosis using ensemble-based classification algorithm. Comput J

  13. Gupta S, Gupta MK (2021) A comprehensive data-level investigation of cancer diagnosis on imbalanced data. Comput Intell

  14. Chen Y, Ke W, Chiu H (2014) Risk classi fi cation of cancer survival using ANN with gene expression data from multiple laboratories. Comput Biol Med 48:1–7

    Article  Google Scholar 

  15. Zhu W, Fang K, He J, Cui R, Zhang Y, Le H (2019) Research article a prediction rule for overall survival in non-small-cell lung cancer patients with a pathological tumor size less than 30 mm. Dis Markers 2019:1–9

    Google Scholar 

  16. Duggan MA, Anderson WF, Altekruse S, Penberthy L, Sherman ME (2016) The surveillance, epidemiology and end results (SEER) program and pathology: towards strengthening the critical relationship. Am J Surg Pathol 40(12):e94

  17. Shimizu H, Nakayama KI (2020) Artificial intelligence in oncology. Cancer Sci 111(5):1–9

    Article  Google Scholar 

  18. Arihito E, Shibata T, Hiroshi T (2008) Comparison of seven algorithms to predict breast cancer survival. Biomed Soft Comput Human Sci 13(2):11–16

    Google Scholar 

  19. Khan MU, Choi JP, Shin H, Kim M (2008) Predicting breast cancer survivability using fuzzy decision trees for personalized healthcare. In: 2008 30th annual international conference of the IEEE engineering in medicine and biology society. IEEE, pp 5148–5151

  20. Choi JP, Han TH, Park RW (2009). A hybrid Bayesian network model for predicting breast cancer prognosis. J Korean Soc Med Inform 15(1):49–57

    Article  Google Scholar 

  21. Fan C, Chang P, Lin J, Hsieh JC (2011) A hybrid model combining case-based reasoning and fuzzy decision tree for medical data classification. Appl Soft Comput 11:632–644

    Article  Google Scholar 

  22. Wang KJ, Makond B, Wang KM (2013) An improved survivability prognosis of breast cancer by using sampling and feature selection technique to solve imbalanced patient classification data. BMC Med Inform Decis Mak 13(1):1–14

  23. Kim J, Shin H (2013) Breast cancer survivability prediction using labeled, unlabeled, and pseudo-labeled patient data. J Am Med Inform Assoc 20(4):613–618

  24. Park K, Ali A, Kim D, An Y, Kim M, Shin H (2013) Engineering Applications of Arti fi cial Intelligence Robust predictive model for evaluating breast cancer survivability. Eng Appl Artif Intell 26(9):2194–2205

    Article  Google Scholar 

  25. Shin H, Nam Y (2014) A coupling approach of a predictor and a descriptor for breast cancer prognosis. BMC Med Genomics 7(Suppl 1):1–12

    Article  Google Scholar 

  26. Wang K, Makond B, Chen K, Wang K (2014) A hybrid classifier combining SMOTE with PSO to estimate 5-year survivability of breast cancer patients. Appl Soft Comput J 20:15–24

    Article  Google Scholar 

  27. Shawky DM, Seddik AF (2017) On the temporal effects of features on the prediction of breast cancer survivability. Curr Bioinform 12(4):378–384

    Article  Google Scholar 

  28. Li Y, Ge D, Gu J, Xu F, Zhu Q, Lu C (2019) A large cohort study identifying a novel prognosis prediction model for lung adenocarcinoma through machine learning strategies. BMC Cancer 19(1):1–14

    Article  Google Scholar 

  29. Abdikenov B, Iklassov Z, Sharipov A, Hussain S, Jamwal PK (2019) Analytics of heterogeneous breast cancer data using neuroevolution. IEEE Access 7:18050–18060

    Article  Google Scholar 

  30. Fotouhi S, Asadi S, Kattan MW (2019) A comprehensive data level analysis for cancer diagnosis on imbalanced data. J Biomed Inform 90:103089

    Article  Google Scholar 

  31. Simsek S, Kursuncu U, Kibis E, Anisabdellatif M (2020) A dag a hybrid data mining approach for identifying the temporal effects of variables associated with breast cancer survival. Exp Syst Appl 139:112863

    Article  Google Scholar 

  32. British T, Society C (2020) A novel data mining on breast cancer survivability using MLP ensemble learners. Compu J Vol 63(3): pp. 435–447

  33. Delen D, Walker G, Kadam A (2005) Predicting breast cancer survivability : a comparison of three data mining methods. Artif Intell Med 343(2):113–127

    Article  Google Scholar 

  34. Id JL et al (2021) Predicting breast cancer 5-year survival using machine learning: a systematic review. PLoS ONE 16:1–23

    Google Scholar 

  35. Fine TL, Hassoun MH (1996) Fundamentals of artificial neural networks. IEEE Trans Inf Theory 42(4):1322–1324

    Article  Google Scholar 

  36. Larochelle H (2012) Learning algorithms for the classification restricted Boltzmann machine. J Mach Learn Res 13:643–669

    MathSciNet  MATH  Google Scholar 

  37. Chicco D, Sadowski P, Baldi P (2014) Deep autoencoder neural networks for gene ontology annotation predictions. In: ACM BCB 2014 - 5th ACM Conference on Bioinformatics, Computational Biology and Health. Informatics pp. 533–540

  38. Kim P (2012) Convolutional neural network. MATLAB deep learning. Apress, Berkeley, pp 121–147

    Google Scholar 

  39. Celisse A (2010) A survey of cross-validation procedures for model selection. Statistics Surv 4:40–79

    MathSciNet  MATH  Google Scholar 

  40. Aziz R, Verma CK, Jha M, Srivastava N (2017) Artificial neural network classification of microarray data using new hybrid gene selection method. Int J Data Min Bioinform 17(1):42–65

    Article  Google Scholar 

  41. Lecun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444

    Article  Google Scholar 

  42. Polat K, Güneş S (2007) Breast cancer diagnosis using least square support vector machine. Digit Signal Process 17(4):694–701

    Article  Google Scholar 

  43. Er O, Tanrikulu AC, Abakay A, Temurtas F (2012) An approach based on probabilistic neural network for diagnosis of Mesothelioma’s disease. Comput Electr Eng 38(1):75–81

    Article  Google Scholar 

  44. Bradley AE (1997) The use of the area under the roc curve in the evaluation of machine learning algorithms. Pattern Recogn 30(7):1145–1159

    Article  Google Scholar 

  45. Chicco D, Jurman G (2020) The advantages of the Matthews correlation coefficient ( MCC ) over F1 score and accuracy in binary classification evaluation. BMC Genomics 21(1):1–13

    Article  Google Scholar 

  46. Chicco D (2017) Ten quick tips for machine learning in computational biology. BioData Mining 10(1):1–17

    Article  Google Scholar 

  47. Halimu C, Kasem A, Shah Newaz SH (2019) Empirical comparison of area under ROC curve (AUC) and Mathew correlation coefficient (MCC) for evaluating machine learning algorithms on imbalanced datasets for binary classification. In: Proceedings of the 3rd international conference on machine learning and soft computing, pp. 1–6

Download references

Acknowledgements

We express our gratitude to Dr. Rajeev Saini, MD, DNB (Medical Oncology), Narayana Multispecialty Clinic, Jammu & Kashmir (India), to provide us with consistent guidance on breast cancer. Dr. Saini has also assisted in confirming that the prediction indicators used in the study are extremely important from the medical point of view.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Surbhi Gupta.

Ethics declarations

Conflict of interests

There is no conflict to be declared.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gupta, S., Gupta, M.K. A Comparative Analysis of Deep Learning Approaches for Predicting Breast Cancer Survivability. Arch Computat Methods Eng 29, 2959–2975 (2022). https://doi.org/10.1007/s11831-021-09679-3

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11831-021-09679-3

Navigation