Skip to main content
Log in

Breast Cancer Diagnosis Using Feature Ensemble Learning Based on Stacked Sparse Autoencoders and Softmax Regression

  • Image & Signal Processing
  • Published:
Journal of Medical Systems Aims and scope Submit manuscript

Abstract

Nowadays, the most frequent cancer in women is breast cancer (malignant tumor). If breast cancer is detected at the beginning stage, it can often be cured. Many researchers proposed numerous methods for early prediction of this Cancer. In this paper, we proposed feature ensemble learning based on Sparse Autoencoders and Softmax Regression for classification of Breast Cancer into benign (non-cancerous) and malignant (cancerous). We used Breast Cancer Wisconsin (Diagnostic) medical data sets from the UCI machine learning repository. The proposed method is assessed using various performance indices like true classification accuracy, specificity, sensitivity, recall, precision, f measure, and MCC. Simulation and result proved that the proposed approach gives better results in terms of different parameters. The prediction results obtained by the proposed approach were very promising (98.60% true accuracy). In addition, the proposed method outperforms the Stacked Sparse Autoencoders and Softmax Regression based (SSAE-SM) model and other State-of-the-art classifiers in terms of various performance indices. Experimental simulations, empirical results, and statistical analyses are also showing that the proposed model is an efficient and beneficial model for classification of Breast Cancer. It is also comparable with the existing machine learning and soft computing approaches present in the related literature.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Breast Cancer Awareness Month. National Health Portal Of India, 10 Apr. 2015, (www.nhp.gov.in/breast-cancer-awareness-month_pg)

  2. Hashemi, S.H.B., Karimi, S., and Mahboobi, H., Lifestyle changes for prevention of breast cancer. Electron. Physician 6(3):894–905, 2014.

    PubMed  PubMed Central  Google Scholar 

  3. Yu, Y.H., Wei, W., and Liu, J.L., Diagnostic value of fine-needle aspiration biopsy for breast mass: A systematic review and meta-analysis. BMC Cancer 12:41, 2012. https://doi.org/10.1186/1471-2407-12-41.

    Article  Google Scholar 

  4. Madubogwu, C.I., Ukah, C.O., Onyiaorah, I.V., Anyiam, D.C.D., Anyanwu, S.N.C., and Chianakwana, G.U.: Cost effectiveness of fine needle aspiration cytology for breast masses. Orient J. Med. 27(1–2), 2015

  5. Iranpour, M., Almassi, S., and Analoui, M.: Breast cancer detection from FNA using SVM and RBF classifier. In: First Joint Congress on Fuzzy and Intelligent Systems, Ferdowsi University of Mashhad, Iran, 29–31 Aug 2007, 2007.

  6. Mert, A., et al.: Breast cancer detection with reduced feature set computational and mathematical methods in medicine Volume 2015. https://doi.org/10.1155/2015/265138

    Article  Google Scholar 

  7. Luo, Z., Wu, X., Guo, S., Ye, B., Guo, S., and Ye, B.: Diagnosis of breast cancer tumor based on manifold learning and support vector machine. In: IEEE International Conference on Information and Automation June 20–23, 2008, Zhangjiajie, China, 2008.

  8. Muthu Rama Krishnan, M., Banerjee, S., Chakraborty, C., Chakraborty, C., and Ray, A.K., Statistical analysis of mammographic features and its classification using support vector machine. Expert Syst. Appl. 37:470–478, 2010.

    Article  Google Scholar 

  9. Zheng, B., Yoon, S.W., and Lam, S.S., Breast cancer diagnosis based on feature extraction using a hybrid of K-means and support vector machine algorithms. Expert Syst. Appl. 41(4):1476–1482, 2014. https://doi.org/10.1016/j.eswa.2013.08.044.

    Article  Google Scholar 

  10. Bamakan, S.M.H., and Gholami, P., A novel feature selection method based on an integrated data envelopment analysis and entropy model. Prog. Comput. Sci. 31:632–638, 2014.

    Article  Google Scholar 

  11. Guo, H., and Nandi, A.K.: Breast cancer diagnosis using genetic programming generated feature. In: 2005 IEEE Workshop on Machine Learning for Signal Processing, Mystic, CT. https://doi.org/10.1109/MLSP.2005.1532902, pp. 215–220, 2005.

  12. Prasad, Y., Biswas, K.K., and Jain, C.K.: SVM classifier based feature selection using GA, ACO and PSO for siRNA design. In: Tan, Y., Shi, Y., and Tan, K.C (Eds.) , Advances in Swarm Intelligence. ICSI 2010. Lecture Notes in Computer Science, Vol. 6146. Springer, Berlin, 2010.

    Google Scholar 

  13. Maldonado, S., Weber, R., and Basak, J., Simultaneous feature selection and classification using kernel-penalized support vector machines. Inf. Sci. 181:115–128, 2011.

    Article  Google Scholar 

  14. Jafari-Marandi, R., Davarzani, S., Gharibdousti, M.S., and Smith, B.K., An optimum ANN-based breast cancer diagnosis: Bridging gaps between ANN learning and decision-making goals. Appl. Soft Comput. 72:108–120, 2018.

    Article  Google Scholar 

  15. Naga RamaDevi, G., Usha Rani, K., and Lavanya, D.: Ensemble-based hybrid approach for breast cancer data. In: International Conference on Communications and Cyber Physical Engineering 2018. Springer, Singapore, 2018.

  16. Wang, H., Zheng, B., Yoon, S.W., and Ko, H.S., A support vector machine-based ensemble algorithm for breast cancer diagnosis. Eur. J. Oper. Res. 267(2):687–699, 2018.

    Article  Google Scholar 

  17. Salama, G.I., Abdelhalim, M.B., and Abd-elghany Zeid, M., Breast cancer diagnosis on three different datasets using multi-classifiers. Int. J. Comput. Inf. Technol. 1(Issue 01):2277–0764, 2012.

    Google Scholar 

  18. Luukka, P., and Leppalampi, T., Similarity classifier with generalized mean applied to medical data. Comput. Biol. Med. 36:1026–1040, 2006.

    Article  Google Scholar 

  19. Lavanya, D., and Usha Rani, K.: Analysis of feature selection with classification: Breast cancer datasets. Indian J. Comput. Sci. Eng. 2(5), 2011. ISSN : 0976-5166

  20. Zhao, J.Y., and Zhang, Z.L.: Fuzzy rough neural network and its application to feature selection. In: The Fourth International Workshop on Advanced Computational Intelligence, Wuhan. pp 684–687, 2011. https://doi.org/10.1109/IWACI.2011.6160094

  21. Mert, A., Kılıç, N., and Akan, A., An improved hybrid feature reduction for increased breast cancer diagnostic performance. Biomed. Eng. Lett. 4(3):285–291, 2014.

    Article  Google Scholar 

  22. Lim, C.K., and Chan, C.S., A weighted inference engine based on interval-valued fuzzy relational theory. Expert Syst. Appl. 42:3410–3419, 2015.

    Article  Google Scholar 

  23. Emami, N., and Pakzad, A., A new knowledge-based system for diagnosis of breast cancer by a combination of affinity propagation clustering and firefly algorithm. J. AI Data Min. 7:59–68, 2018.

    Google Scholar 

  24. Sheikhpour, R., Sarram, M.A., and Sheikhpour, R., Particle swarm optimization for bandwidth determination and feature selection of kernel density estimation based classifiers in diagnosis of breast cancer. Appl. Soft Comput. 40:113–131, 2016.

    Article  Google Scholar 

  25. Liu, N., Qi, E.-S., Xu, M., Gao, B., and Liu, G.-Q., A novel intelligent classification model for breast cancer diagnosis. Inf. Process. Manag. 56(3):609–623, 2019.

    Article  Google Scholar 

  26. Xue, B., Zhang, M., and Browne, W.N.: New fitness functions in binary particle swarm optimisation for feature selection. In: WCCI 2012 IEEE World Congress on Computational Intelligence June, 10–15, 2012 - Brisbane, Australia, 2012.

  27. Xue, B., Zhang, M., and Browne, W.N., Particle swarm optimisation for feature selection in classification: Novel initialisation and updating mechanisms. Appl. Soft Comput. 18:261–276, 2014.

    Article  Google Scholar 

  28. Kim, S., Kavuri, S., and Lee, M.: Deep network with support vector machines. In: Lee, M., Hirose, A., Hou, Z. G., and Kil, R. M. (Eds.) , Neural Information Processing. ICONIP 2013. Lecture Notes in Computer Science, Vol. 8226. Springer, Berlin, 2013.

    Chapter  Google Scholar 

  29. Abdel-Zaher, A.M., and Eldeib, A.M.: Breast cancer classification using deep belief networks. Expert Syst. Appl., 2015. https://doi.org/10.1016/j.eswa.2015.10.015

    Article  Google Scholar 

  30. Xu, J., et al., Stacked Sparse Autoencoder (SSAE) for nuclei detection on breast cancer histopathology images. IEEE Trans. Med. Imaging 35(1):119–130, 2016. https://doi.org/10.1109/TMI.2015.2458702.

    Article  Google Scholar 

  31. Zhang, Q., et al., Deep learning based classification of breast tumors with shear-wave elastography. Ultrasonics 72:150–157, 2016.

    Article  Google Scholar 

  32. Cano, F., Madabhushi, A., and Cruz-Roa, A.: A comparative analysis of sensitivity of convolutional neural networks for histopathology image classification in breast cancer. In: Proceedings Volume 10975, 14th International Symposium on Medical Information Processing and Analysis; 109750W. Mazatlán, Mexico. https://doi.org/10.1117/12.2511647, 2018.

  33. Ragab, D.A., et al., Breast cancer detection using deep convolutional neural networks and support vector machines. PeerJ 7:e6201, 2018.

    Article  Google Scholar 

  34. Kooi, T., et al., Large scale deep learning for computer aided detection of mammographic lesions. Med. Image Anal. 35:303–312, 2017.

    Article  Google Scholar 

  35. Chougrad, H., Zouaki, H., and Alheyane, O., Deep convolutional neural networks for breast cancer screening. Comput Methods Programs Biomed. 157:19–30, 2018. https://doi.org/10.1016/j.cmpb.2018.01.011. Epub 2018 Jan 11.

    Article  Google Scholar 

  36. Xiao, T., Liu, L., Li, K., Qin, W., Yu, S., and Li, Z., Comparison of transferred deep neural networks in ultrasonic breast masses discrimination. Biomed. Res. Int. 2018:Article ID 4605191, 2018. https://doi.org/10.1155/2018/4605191.

    Google Scholar 

  37. Cruz-Roa, A., et al., Accurate and reproducible invasive breast cancer detection in whole-slide images: A deep learning approach for quantifying tumor extent. Sci. Rep. 7:46450, 2017.

    Article  CAS  Google Scholar 

  38. Jadoon, M.M., Zhang, Q., Ul Haq, I., Butt, S., and Jadoon, A.: Three-class mammogram classification based on descriptive CNN features. Biomed. Res. Int. 2017, Article ID 3640901, 11 pages, 2017. https://doi.org/10.1155/2017/3640901

    Article  Google Scholar 

  39. Agrawal, S., Rangnekar, R., Gala, D., Paul, S., and Kalbande, D: Detection of breast cancer from mammograms using a hybrid approach of deep learning and linear classification. In: 2018 International Conference on Smart City and Emerging Technology (ICSCET), Mumbai. https://doi.org/10.1109/ICSCET.2018.8537250, pp. 1–6, 2018.

  40. Liu, K., Kang, G., Zhang, N., and Hou, B., Breast cancer classification based on fully-connected layer first convolutional neural networks. IEEE Access 6:23722–23732, 2018. https://doi.org/10.1109/ACCESS.2018.2817593.

    Article  Google Scholar 

  41. Xiao, Y., Wu, J., Lin, Z., and Zhao, X.: Breast cancer diagnosis using an unsupervised feature extraction algorithm based on deep learning. In: Proceedings of the 37th Chinese Control Conference July 25–27. Wuhan, 2018.

  42. Vijayakumar, K., and Arun, C.: Automated risk identification using NLP in cloud based development environments. J. Ambient Intell. Human Comput., 2017. https://doi.org/10.1007/s12652-017-0503-7

  43. Kadam, V.J., Yadav, S.S., and Jadhav, S.M.: Soft-margin SVM incorporating feature selection using improved elitist GA for arrhythmia classification. In: Abraham, A., Cherukuri, A., Melin, P., and Gandhi, N. (Eds.) , Intelligent Systems Design and Applications. ISDA 2018. Advances in Intelligent Systems and Computing, Vol. 941. Springer, Cham, 2018.

  44. Lu, Y., Zhang, L., Wang, B., and Yang, J.: Feature ensemble learning based on sparse autoencoders for image classification. In: 2014 International Joint Conference on Neural Networks (IJCNN), Beijing, pp. 1739–1745, 2014. https://doi.org/10.1109/IJCNN.2014.6889415.

  45. Kadam, V.J., and Jadhav, S.M.: Feature Ensemble Learning Based on Sparse Autoencoders for Diagnosis of Parkinson’s Disease. In: Iyer, B., Nalbalwar, S., and Pathak, N. (Eds.) , Computing, Communication and Signal Processing. Advances in Intelligent Systems and Computing, Vol. 810. Springer, Singapore, 2019.

  46. Hinton, G.E., and Salakhutdinov, R.R., Reducing the dimensionality of data with neural networks. Science 28(5786):504–507, 2006.

    Article  Google Scholar 

  47. Bengio, Y., and LeCun, Y.: Scaling learning algorithms towards AI. In: Bottou, L., Chapelle, O., DeCoste, D., and Weston, J. (Eds.) Large-Scale Kernel Machines. MIT Press, 2007.

  48. Ranzato, M.A., Poultney, C., Chopra, S., LeCun, Y., Chopra, S., and LeCun, Y.: Efficient learning of sparse representations with an energy-based model. In: Advances in Neural Information Processing Systems 19 (NIPS’06), pp. 1137–1144, MIT Press, 2007.

  49. Baldi, P.: Autoencoders, unsupervised learning, and deep architectures. In: Proceedings of ICML Workshop on Unsupervised and Transfer Learning, 2012.

  50. Ng, A.: CS294A Lecture notes. Sparse autoencoder. pp. 72, 2011. (https://web.stanford.edu/class/cs294a/sparseAutoencoder.pdf)

  51. Hinton, G.E., Osindero, S., and Teh, Y.-W., A fast learning algorithm for deep belief nets. Neural Comput. 18(7):1527–54, 2006.

    Article  Google Scholar 

  52. Hinton, G.E., Learning multiple layers of representation. Trends Cogn. Sci. 11(10):428–34, 2007.

    Article  Google Scholar 

  53. Erhan, D., Bengio, Y., Courville, A., Manzagol, P.-A., Vincent, P., and Bengio, S., Why does unsupervised pre-training help deep learning? J. Mach. Learn. Res. 11:625–660, 2010.

    Google Scholar 

  54. Kuncheva, L.I., Combining Pattern Classifiers, Methods and Algorithms, p. 544. New York: Wiley, 2004.

    Book  Google Scholar 

  55. Kittler, J., Hatef, M., Duin, R.P.W., and Matas, J., On combining classifiers. IEEE Trans. Pattern Anal. Mach. Intell. 20(3):226–239, 1998. https://doi.org/10.1109/34.667881.

    Article  Google Scholar 

  56. Dua, D., and Graff, C., UCI Machine Learning Repository (http://archive.ics.uci.edu/ml). Irvine: University of California, School of Information and Computer Science, 2019.

    Google Scholar 

  57. Wolberg, W.H., Street, W.N., and Mangasarian, O.L., Machine learning techniques to diagnose breast cancer from image-processed nuclear features of fine needle aspirates. Cancer Lett. 77(2–3):163–171, 1994.

    Article  CAS  Google Scholar 

  58. Nick Street, W., Wolberg, W.H., and Mangasarian, O.L., Nuclear feature extraction for breast tumor diagnosis. Proc. SPIE 1905:861–871, 1993.

    Article  Google Scholar 

  59. Street, W.N., and Wolberg, W.H., Breast cancer diagnosis and prognosis via linear programming. Oper. Res. 43(4):570–577, 1995.

    Article  Google Scholar 

  60. Pradeep Mohan Kumar, K., Saravanan, M., Thenmozhi, M., and Vijayakumar, K.: Intrusion detection system based on GA-fuzzy classifier for detecting malicious attacks. Concurr. Comput. Pract. Exp. e5242, 2019. https://doi.org/10.1002/cpe.5242

  61. Joseph Manoj, R., Anto Praveena, M.D., and Vijayakumar, K.: An ACO–ANN based feature selection algorithm for big data. Cluster Comput., 2018. https://doi.org/10.1007/s10586-018-2550-z

  62. Miao, D., Gao, C., Zhang, N., and Zhang, Z., Diverse reduct subspaces based co-training for partially labeled data. Int. J. Approx. Reason. 52:1103–1117, 2011.

    Article  Google Scholar 

  63. Peng, L., Chen, W., Zhou, W., Li, F., Yang, J., and Zhang, J: An immune-inspired semi-supervised algorithm for breast cancer diagnosis. Comput. Methods Prog. Biomed., 2016. https://doi.org/10.1016/j.cmpb.2016.07.020

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vinod Jagannath Kadam.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest, financial or otherwise. No funding was received for this study.

Research involving human participants and/or animals

This research paper does not contain any studies with human participants or animals performed by any of the authors.

Informed consent

No humans are involved in this research paper.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the Topical Collection on Image & Signal Processing

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kadam, V.J., Jadhav, S.M. & Vijayakumar, K. Breast Cancer Diagnosis Using Feature Ensemble Learning Based on Stacked Sparse Autoencoders and Softmax Regression. J Med Syst 43, 263 (2019). https://doi.org/10.1007/s10916-019-1397-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10916-019-1397-z

Keywords

Navigation