Skip to main content
Top

2018 | OriginalPaper | Chapter

Breast Cancer Diagnosis and Prognosis Using Machine Learning Techniques

Authors : Sunil Suresh Shastri, Priyanka C. Nair, Deepa Gupta, Ravi C. Nayar, Raghavendra Rao, Amritanshu Ram

Published in: Intelligent Systems Technologies and Applications

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Breast cancer is one of the major type of cancer which is the leading cause of death in women. The research work is carried out on the real data of patient records obtained from HealthCare Global Enterprises Ltd (HCG) hospitals. The work analyzes the four major class variables in the dataset, namely death, progression, recurrence and metastasis. The influence of the same 11 predictor variables is explored for each of the class. Various machine algorithms namely Support Vector Machine, Decision Tree, Multi-layer Perceptron and Naive Bayes have been explored for classification of the patient data into various classes. The imbalance in the data is handled using an over sampling technique. The contribution of various attributes in classifying the instances into different classes is also being explored. The model helps in predicting various factors and thus helps in early diagnosis in the breast cancer.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Jothi, N., Wahidah, H.: Data mining in healthcare – a review. Proc. Comput. Sci. 72, 306–313 (2015)CrossRef Jothi, N., Wahidah, H.: Data mining in healthcare – a review. Proc. Comput. Sci. 72, 306–313 (2015)CrossRef
4.
go back to reference Khare, S., Gupta, D.: Association rule analysis in cardiovascular disease. In: Second International Conference on Cognitive Computing and Information Processing (CCIP), SJCE, Mysuru, India, pp. 1–6. IEEE (2016) Khare, S., Gupta, D.: Association rule analysis in cardiovascular disease. In: Second International Conference on Cognitive Computing and Information Processing (CCIP), SJCE, Mysuru, India, pp. 1–6. IEEE (2016)
5.
go back to reference Fan, Q., et al.: An application of apriori algorithm in SEER breast cancer data. In: 2010 International Conference on Artificial Intelligence and Computational Intelligence (AICI), vol. 3, pp. 114–116. IEEE (2010) Fan, Q., et al.: An application of apriori algorithm in SEER breast cancer data. In: 2010 International Conference on Artificial Intelligence and Computational Intelligence (AICI), vol. 3, pp. 114–116. IEEE (2010)
6.
go back to reference Gupta, D., Aggarwal, A., Khare, S.: A method to predict diagnostic codes for chronic diseases using machine learning techniques. In: Fifth IEEE International Conference on Computing Communication and Automation (ICCA), pp. 281–287 (2016) Gupta, D., Aggarwal, A., Khare, S.: A method to predict diagnostic codes for chronic diseases using machine learning techniques. In: Fifth IEEE International Conference on Computing Communication and Automation (ICCA), pp. 281–287 (2016)
7.
go back to reference Dominic, V., Aggarwal, A., Gupta, D., Khare, S.: Investigation of chronic disease correlation using data mining techniques. In: 2nd International Conference on Recent Advances in Engineering and Computational Sciences (RAECS), pp. 1–6. University Institute of Engineering and Technology, Panjab University, Chandigarh (2015) Dominic, V., Aggarwal, A., Gupta, D., Khare, S.: Investigation of chronic disease correlation using data mining techniques. In: 2nd International Conference on Recent Advances in Engineering and Computational Sciences (RAECS), pp. 1–6. University Institute of Engineering and Technology, Panjab University, Chandigarh (2015)
8.
go back to reference Dominic, V., Gupta, D., Khare, S.: Exploration of machine learning techniques for cardiovascular disease. Appl. Med. Inf. Index Scopus 36(1), 23–32 (2015) Dominic, V., Gupta, D., Khare, S.: Exploration of machine learning techniques for cardiovascular disease. Appl. Med. Inf. Index Scopus 36(1), 23–32 (2015)
9.
go back to reference Kourou, K., Exarchos, T.P., Exarchos, K.P., Karamouzis, M.V., Fotiadis, D.I.: Machine learning applications in cancer prognosis and prediction. In: International Conférence Science Direct, pp. 8–17 (2014) Kourou, K., Exarchos, T.P., Exarchos, K.P., Karamouzis, M.V., Fotiadis, D.I.: Machine learning applications in cancer prognosis and prediction. In: International Conférence Science Direct, pp. 8–17 (2014)
10.
go back to reference Sharma, N., Om, H.: Data mining models for predicting oral cancer survivability. Netw. Model. Anal. Health Inf. Bioinform. 2(4), 285–295 (2013)CrossRef Sharma, N., Om, H.: Data mining models for predicting oral cancer survivability. Netw. Model. Anal. Health Inf. Bioinform. 2(4), 285–295 (2013)CrossRef
11.
go back to reference Yang, H., Chen, Y.P.P.: Data mining in lung cancer pathologic staging diagnosis: correlation between clinical and pathology information. Expert Syst. Appl. 42(15), 6168–6176 (2015)CrossRef Yang, H., Chen, Y.P.P.: Data mining in lung cancer pathologic staging diagnosis: correlation between clinical and pathology information. Expert Syst. Appl. 42(15), 6168–6176 (2015)CrossRef
12.
go back to reference Abreu, P.H., et al.: Predicting breast cancer recurrence using machine learning techniques: a systematic review. ACM Comput. Surv. (CSUR) 49(3), 52 (2016)MathSciNetCrossRef Abreu, P.H., et al.: Predicting breast cancer recurrence using machine learning techniques: a systematic review. ACM Comput. Surv. (CSUR) 49(3), 52 (2016)MathSciNetCrossRef
13.
go back to reference Kim, W., et al.: Development of novel breast cancer recurrence prediction model using support vector machine. J. Breast Cancer 15(2), 230–238 (2012)CrossRef Kim, W., et al.: Development of novel breast cancer recurrence prediction model using support vector machine. J. Breast Cancer 15(2), 230–238 (2012)CrossRef
14.
go back to reference Ahmad, L.G., Eshlaghy, A.T., Poorebrahimi, A., Ebrahimi, M., Razavi, A.R.: Using three machine learning techniques for predicting breast cancer recurrence. J. Health Med. Inf. 4(124), 3 (2013) Ahmad, L.G., Eshlaghy, A.T., Poorebrahimi, A., Ebrahimi, M., Razavi, A.R.: Using three machine learning techniques for predicting breast cancer recurrence. J. Health Med. Inf. 4(124), 3 (2013)
15.
go back to reference Park, K., et al.: Robust predictive model for evaluating breast cancer survivability. Eng. Appl. Artif. Intell. 26(9), 2194–2205 (2013)CrossRef Park, K., et al.: Robust predictive model for evaluating breast cancer survivability. Eng. Appl. Artif. Intell. 26(9), 2194–2205 (2013)CrossRef
16.
go back to reference Sain, H., Purnami, S.W.: Combine sampling support vector machine for imbalanced data classification. Procedia Comput. Sci. 72, 59–66 (2015)CrossRef Sain, H., Purnami, S.W.: Combine sampling support vector machine for imbalanced data classification. Procedia Comput. Sci. 72, 59–66 (2015)CrossRef
17.
go back to reference Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)MATH Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)MATH
18.
go back to reference Roozbahani, Z., Katanforoush, A.: Classification of gene expression data using multiple ranker evaluators and neural network. In: CICIS, pp. 29–31 (2012) Roozbahani, Z., Katanforoush, A.: Classification of gene expression data using multiple ranker evaluators and neural network. In: CICIS, pp. 29–31 (2012)
19.
go back to reference Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993) Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)
20.
go back to reference Pal, S.K., Mitra, S.: Multilayer perceptron, fuzzy sets, and classification. IEEE Trans. Neural Netw. 3(5), 683–697 (1992)CrossRef Pal, S.K., Mitra, S.: Multilayer perceptron, fuzzy sets, and classification. IEEE Trans. Neural Netw. 3(5), 683–697 (1992)CrossRef
21.
go back to reference John, G.H., Langley, P.: Estimating continuous distributions in Bayesian classifiers. In: Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence, pp. 338–345. Morgan Kaufmann Publishers Inc. (1995) John, G.H., Langley, P.: Estimating continuous distributions in Bayesian classifiers. In: Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence, pp. 338–345. Morgan Kaufmann Publishers Inc. (1995)
22.
go back to reference Platt, J.C.: 12 fast training of support vector machines using sequential minimal optimization. Adv. Kernel Methods 1, 185–208 (1999) Platt, J.C.: 12 fast training of support vector machines using sequential minimal optimization. Adv. Kernel Methods 1, 185–208 (1999)
23.
go back to reference Hanley, J.A., McNeil, B.J.: The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143(1), 29–36 (1982)CrossRef Hanley, J.A., McNeil, B.J.: The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143(1), 29–36 (1982)CrossRef
Metadata
Title
Breast Cancer Diagnosis and Prognosis Using Machine Learning Techniques
Authors
Sunil Suresh Shastri
Priyanka C. Nair
Deepa Gupta
Ravi C. Nayar
Raghavendra Rao
Amritanshu Ram
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-319-68385-0_28

Premium Partner