Abstract
Breast cancer is the second largest cause of mortality among women. Breast cancer patients in developed nations have a relative survival rate of more than 5-years due to early detection and treatment. Deep learning approaches can help enhance the identification of breast cancer cells, lower the risk of detection mistakes, and minimize the time it takes to diagnose breast cancer using human methods. This paper examines the accuracy of artificial neural networks, Restricted Boltzmann Machine, Deep Autoencoders, and Convolutional Neural Networks (CNN) for post-operative survival analysis of breast cancer patients. A thorough examination of each network's operation and design is carried out to determine which network outperforms the other, followed by an analysis based on the network's prediction accurateness. The experimental results assert that all the deep learning techniques can predict the survival of breast cancer patients. The accuracy score achieved by Restricted Boltzmann Machine performed is the highest (0.97), followed by deep Autoencoders that attained an accuracy score of 0.96. CNN achieved a 92% accuracy score, while artificial neural networks attained the least accuracy score (0.89). The prediction performance of models has been evaluated using distinct parameters like accuracy, the area under the curve, F1 Score, Matthew’s correlation coefficient, sensitivity, and specificity. Also, the models have been validated using fivefold cross-validation techniques. However, there is still a need for complete analysis and research using deep learning methods to determine the design that provides superior accuracy.
Similar content being viewed by others
Data Availability
The datasets and Python code for data preprocessing and constructing deep learning models is publicly available online on Github (on request): https://github.com/surbhigupta24/Breast-Cancer-Survival-Prediction
References
Sung H et al (2020) Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin 71(3):209–249
Islami F et al (2018) Proportion and number of cancer cases and deaths attributable to potentially modifiable risk factors in the United States. CA Cancer J Clin 68(1):31–54
Kumar Y, Gupta S, Singla R, & Hu YC (2021) A systematic review of artificial intelligence techniques in cancer prediction and diagnosis. Arch Comput Methods Eng 1–28
Gupta S, Gupta MK (2021) Computational model for prediction of malignant mesothelioma diagnosis. Comput J
Chang CH, Sibala JL, Fritz SL, Dwyer 3rd SJ, Templeton AW, Lin F, Jewell WR (1980) Computed tomography in detection and diagnosis of breast cancer. Cancer 46(4 Suppl):939–946
Tapak L, Shirmohammadi-khorram N, Amini P, Alafchi B (2019) Prediction of survival and metastasis in breast cancer patients using machine learning classifiers. Clin Epidemiol Glob Heal 7(3):293–299
Afshar HL, Ahmadi M, Roudbari M, Sadoughi F (2015) Prediction of breast cancer survival through knowledge discovery in databases. Glob J Health Sci 7(4):392–398
Kononenko I (2001) Machine learning for medical diagnosis: history, state of the art and perspective. Artif Intell Med 23(1):89–109
Zhu W, Xie L, Han J, Guo X (2020) The application of deep learning in cancer prognosis prediction. Cancers 12(3):603
Kourou K, Exarchos TP, Exarchos KP, Karamouzis MV, Fotiadis DI (2015) Machine learning applications in cancer prognosis and prediction. Comput Struct Biotechnol J 13:8–17
Kim W et al (2012) Development of novel breast cancer recurrence prediction model using support vector machine. J Breast Cancer 15(2):230–238
Gupta S, Gupta MK (2021) Computational prediction of cervical cancer diagnosis using ensemble-based classification algorithm. Comput J
Gupta S, Gupta MK (2021) A comprehensive data-level investigation of cancer diagnosis on imbalanced data. Comput Intell
Chen Y, Ke W, Chiu H (2014) Risk classi fi cation of cancer survival using ANN with gene expression data from multiple laboratories. Comput Biol Med 48:1–7
Zhu W, Fang K, He J, Cui R, Zhang Y, Le H (2019) Research article a prediction rule for overall survival in non-small-cell lung cancer patients with a pathological tumor size less than 30 mm. Dis Markers 2019:1–9
Duggan MA, Anderson WF, Altekruse S, Penberthy L, Sherman ME (2016) The surveillance, epidemiology and end results (SEER) program and pathology: towards strengthening the critical relationship. Am J Surg Pathol 40(12):e94
Shimizu H, Nakayama KI (2020) Artificial intelligence in oncology. Cancer Sci 111(5):1–9
Arihito E, Shibata T, Hiroshi T (2008) Comparison of seven algorithms to predict breast cancer survival. Biomed Soft Comput Human Sci 13(2):11–16
Khan MU, Choi JP, Shin H, Kim M (2008) Predicting breast cancer survivability using fuzzy decision trees for personalized healthcare. In: 2008 30th annual international conference of the IEEE engineering in medicine and biology society. IEEE, pp 5148–5151
Choi JP, Han TH, Park RW (2009). A hybrid Bayesian network model for predicting breast cancer prognosis. J Korean Soc Med Inform 15(1):49–57
Fan C, Chang P, Lin J, Hsieh JC (2011) A hybrid model combining case-based reasoning and fuzzy decision tree for medical data classification. Appl Soft Comput 11:632–644
Wang KJ, Makond B, Wang KM (2013) An improved survivability prognosis of breast cancer by using sampling and feature selection technique to solve imbalanced patient classification data. BMC Med Inform Decis Mak 13(1):1–14
Kim J, Shin H (2013) Breast cancer survivability prediction using labeled, unlabeled, and pseudo-labeled patient data. J Am Med Inform Assoc 20(4):613–618
Park K, Ali A, Kim D, An Y, Kim M, Shin H (2013) Engineering Applications of Arti fi cial Intelligence Robust predictive model for evaluating breast cancer survivability. Eng Appl Artif Intell 26(9):2194–2205
Shin H, Nam Y (2014) A coupling approach of a predictor and a descriptor for breast cancer prognosis. BMC Med Genomics 7(Suppl 1):1–12
Wang K, Makond B, Chen K, Wang K (2014) A hybrid classifier combining SMOTE with PSO to estimate 5-year survivability of breast cancer patients. Appl Soft Comput J 20:15–24
Shawky DM, Seddik AF (2017) On the temporal effects of features on the prediction of breast cancer survivability. Curr Bioinform 12(4):378–384
Li Y, Ge D, Gu J, Xu F, Zhu Q, Lu C (2019) A large cohort study identifying a novel prognosis prediction model for lung adenocarcinoma through machine learning strategies. BMC Cancer 19(1):1–14
Abdikenov B, Iklassov Z, Sharipov A, Hussain S, Jamwal PK (2019) Analytics of heterogeneous breast cancer data using neuroevolution. IEEE Access 7:18050–18060
Fotouhi S, Asadi S, Kattan MW (2019) A comprehensive data level analysis for cancer diagnosis on imbalanced data. J Biomed Inform 90:103089
Simsek S, Kursuncu U, Kibis E, Anisabdellatif M (2020) A dag a hybrid data mining approach for identifying the temporal effects of variables associated with breast cancer survival. Exp Syst Appl 139:112863
British T, Society C (2020) A novel data mining on breast cancer survivability using MLP ensemble learners. Compu J Vol 63(3): pp. 435–447
Delen D, Walker G, Kadam A (2005) Predicting breast cancer survivability : a comparison of three data mining methods. Artif Intell Med 343(2):113–127
Id JL et al (2021) Predicting breast cancer 5-year survival using machine learning: a systematic review. PLoS ONE 16:1–23
Fine TL, Hassoun MH (1996) Fundamentals of artificial neural networks. IEEE Trans Inf Theory 42(4):1322–1324
Larochelle H (2012) Learning algorithms for the classification restricted Boltzmann machine. J Mach Learn Res 13:643–669
Chicco D, Sadowski P, Baldi P (2014) Deep autoencoder neural networks for gene ontology annotation predictions. In: ACM BCB 2014 - 5th ACM Conference on Bioinformatics, Computational Biology and Health. Informatics pp. 533–540
Kim P (2012) Convolutional neural network. MATLAB deep learning. Apress, Berkeley, pp 121–147
Celisse A (2010) A survey of cross-validation procedures for model selection. Statistics Surv 4:40–79
Aziz R, Verma CK, Jha M, Srivastava N (2017) Artificial neural network classification of microarray data using new hybrid gene selection method. Int J Data Min Bioinform 17(1):42–65
Lecun Y, Bengio Y, Hinton G (2015) Deep learning. Nature 521(7553):436–444
Polat K, Güneş S (2007) Breast cancer diagnosis using least square support vector machine. Digit Signal Process 17(4):694–701
Er O, Tanrikulu AC, Abakay A, Temurtas F (2012) An approach based on probabilistic neural network for diagnosis of Mesothelioma’s disease. Comput Electr Eng 38(1):75–81
Bradley AE (1997) The use of the area under the roc curve in the evaluation of machine learning algorithms. Pattern Recogn 30(7):1145–1159
Chicco D, Jurman G (2020) The advantages of the Matthews correlation coefficient ( MCC ) over F1 score and accuracy in binary classification evaluation. BMC Genomics 21(1):1–13
Chicco D (2017) Ten quick tips for machine learning in computational biology. BioData Mining 10(1):1–17
Halimu C, Kasem A, Shah Newaz SH (2019) Empirical comparison of area under ROC curve (AUC) and Mathew correlation coefficient (MCC) for evaluating machine learning algorithms on imbalanced datasets for binary classification. In: Proceedings of the 3rd international conference on machine learning and soft computing, pp. 1–6
Acknowledgements
We express our gratitude to Dr. Rajeev Saini, MD, DNB (Medical Oncology), Narayana Multispecialty Clinic, Jammu & Kashmir (India), to provide us with consistent guidance on breast cancer. Dr. Saini has also assisted in confirming that the prediction indicators used in the study are extremely important from the medical point of view.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interests
There is no conflict to be declared.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Gupta, S., Gupta, M.K. A Comparative Analysis of Deep Learning Approaches for Predicting Breast Cancer Survivability. Arch Computat Methods Eng 29, 2959–2975 (2022). https://doi.org/10.1007/s11831-021-09679-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11831-021-09679-3