Skip to main content

2018 | OriginalPaper | Buchkapitel

Breast Cancer Diagnosis and Prognosis Using Machine Learning Techniques

verfasst von : Sunil Suresh Shastri, Priyanka C. Nair, Deepa Gupta, Ravi C. Nayar, Raghavendra Rao, Amritanshu Ram

Erschienen in: Intelligent Systems Technologies and Applications

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Breast cancer is one of the major type of cancer which is the leading cause of death in women. The research work is carried out on the real data of patient records obtained from HealthCare Global Enterprises Ltd (HCG) hospitals. The work analyzes the four major class variables in the dataset, namely death, progression, recurrence and metastasis. The influence of the same 11 predictor variables is explored for each of the class. Various machine algorithms namely Support Vector Machine, Decision Tree, Multi-layer Perceptron and Naive Bayes have been explored for classification of the patient data into various classes. The imbalance in the data is handled using an over sampling technique. The contribution of various attributes in classifying the instances into different classes is also being explored. The model helps in predicting various factors and thus helps in early diagnosis in the breast cancer.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Jothi, N., Wahidah, H.: Data mining in healthcare – a review. Proc. Comput. Sci. 72, 306–313 (2015)CrossRef Jothi, N., Wahidah, H.: Data mining in healthcare – a review. Proc. Comput. Sci. 72, 306–313 (2015)CrossRef
4.
Zurück zum Zitat Khare, S., Gupta, D.: Association rule analysis in cardiovascular disease. In: Second International Conference on Cognitive Computing and Information Processing (CCIP), SJCE, Mysuru, India, pp. 1–6. IEEE (2016) Khare, S., Gupta, D.: Association rule analysis in cardiovascular disease. In: Second International Conference on Cognitive Computing and Information Processing (CCIP), SJCE, Mysuru, India, pp. 1–6. IEEE (2016)
5.
Zurück zum Zitat Fan, Q., et al.: An application of apriori algorithm in SEER breast cancer data. In: 2010 International Conference on Artificial Intelligence and Computational Intelligence (AICI), vol. 3, pp. 114–116. IEEE (2010) Fan, Q., et al.: An application of apriori algorithm in SEER breast cancer data. In: 2010 International Conference on Artificial Intelligence and Computational Intelligence (AICI), vol. 3, pp. 114–116. IEEE (2010)
6.
Zurück zum Zitat Gupta, D., Aggarwal, A., Khare, S.: A method to predict diagnostic codes for chronic diseases using machine learning techniques. In: Fifth IEEE International Conference on Computing Communication and Automation (ICCA), pp. 281–287 (2016) Gupta, D., Aggarwal, A., Khare, S.: A method to predict diagnostic codes for chronic diseases using machine learning techniques. In: Fifth IEEE International Conference on Computing Communication and Automation (ICCA), pp. 281–287 (2016)
7.
Zurück zum Zitat Dominic, V., Aggarwal, A., Gupta, D., Khare, S.: Investigation of chronic disease correlation using data mining techniques. In: 2nd International Conference on Recent Advances in Engineering and Computational Sciences (RAECS), pp. 1–6. University Institute of Engineering and Technology, Panjab University, Chandigarh (2015) Dominic, V., Aggarwal, A., Gupta, D., Khare, S.: Investigation of chronic disease correlation using data mining techniques. In: 2nd International Conference on Recent Advances in Engineering and Computational Sciences (RAECS), pp. 1–6. University Institute of Engineering and Technology, Panjab University, Chandigarh (2015)
8.
Zurück zum Zitat Dominic, V., Gupta, D., Khare, S.: Exploration of machine learning techniques for cardiovascular disease. Appl. Med. Inf. Index Scopus 36(1), 23–32 (2015) Dominic, V., Gupta, D., Khare, S.: Exploration of machine learning techniques for cardiovascular disease. Appl. Med. Inf. Index Scopus 36(1), 23–32 (2015)
9.
Zurück zum Zitat Kourou, K., Exarchos, T.P., Exarchos, K.P., Karamouzis, M.V., Fotiadis, D.I.: Machine learning applications in cancer prognosis and prediction. In: International Conférence Science Direct, pp. 8–17 (2014) Kourou, K., Exarchos, T.P., Exarchos, K.P., Karamouzis, M.V., Fotiadis, D.I.: Machine learning applications in cancer prognosis and prediction. In: International Conférence Science Direct, pp. 8–17 (2014)
10.
Zurück zum Zitat Sharma, N., Om, H.: Data mining models for predicting oral cancer survivability. Netw. Model. Anal. Health Inf. Bioinform. 2(4), 285–295 (2013)CrossRef Sharma, N., Om, H.: Data mining models for predicting oral cancer survivability. Netw. Model. Anal. Health Inf. Bioinform. 2(4), 285–295 (2013)CrossRef
11.
Zurück zum Zitat Yang, H., Chen, Y.P.P.: Data mining in lung cancer pathologic staging diagnosis: correlation between clinical and pathology information. Expert Syst. Appl. 42(15), 6168–6176 (2015)CrossRef Yang, H., Chen, Y.P.P.: Data mining in lung cancer pathologic staging diagnosis: correlation between clinical and pathology information. Expert Syst. Appl. 42(15), 6168–6176 (2015)CrossRef
12.
Zurück zum Zitat Abreu, P.H., et al.: Predicting breast cancer recurrence using machine learning techniques: a systematic review. ACM Comput. Surv. (CSUR) 49(3), 52 (2016)MathSciNetCrossRef Abreu, P.H., et al.: Predicting breast cancer recurrence using machine learning techniques: a systematic review. ACM Comput. Surv. (CSUR) 49(3), 52 (2016)MathSciNetCrossRef
13.
Zurück zum Zitat Kim, W., et al.: Development of novel breast cancer recurrence prediction model using support vector machine. J. Breast Cancer 15(2), 230–238 (2012)CrossRef Kim, W., et al.: Development of novel breast cancer recurrence prediction model using support vector machine. J. Breast Cancer 15(2), 230–238 (2012)CrossRef
14.
Zurück zum Zitat Ahmad, L.G., Eshlaghy, A.T., Poorebrahimi, A., Ebrahimi, M., Razavi, A.R.: Using three machine learning techniques for predicting breast cancer recurrence. J. Health Med. Inf. 4(124), 3 (2013) Ahmad, L.G., Eshlaghy, A.T., Poorebrahimi, A., Ebrahimi, M., Razavi, A.R.: Using three machine learning techniques for predicting breast cancer recurrence. J. Health Med. Inf. 4(124), 3 (2013)
15.
Zurück zum Zitat Park, K., et al.: Robust predictive model for evaluating breast cancer survivability. Eng. Appl. Artif. Intell. 26(9), 2194–2205 (2013)CrossRef Park, K., et al.: Robust predictive model for evaluating breast cancer survivability. Eng. Appl. Artif. Intell. 26(9), 2194–2205 (2013)CrossRef
16.
Zurück zum Zitat Sain, H., Purnami, S.W.: Combine sampling support vector machine for imbalanced data classification. Procedia Comput. Sci. 72, 59–66 (2015)CrossRef Sain, H., Purnami, S.W.: Combine sampling support vector machine for imbalanced data classification. Procedia Comput. Sci. 72, 59–66 (2015)CrossRef
17.
Zurück zum Zitat Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)MATH Chawla, N.V., Bowyer, K.W., Hall, L.O., Kegelmeyer, W.P.: SMOTE: synthetic minority over-sampling technique. J. Artif. Intell. Res. 16, 321–357 (2002)MATH
18.
Zurück zum Zitat Roozbahani, Z., Katanforoush, A.: Classification of gene expression data using multiple ranker evaluators and neural network. In: CICIS, pp. 29–31 (2012) Roozbahani, Z., Katanforoush, A.: Classification of gene expression data using multiple ranker evaluators and neural network. In: CICIS, pp. 29–31 (2012)
19.
Zurück zum Zitat Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993) Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)
20.
Zurück zum Zitat Pal, S.K., Mitra, S.: Multilayer perceptron, fuzzy sets, and classification. IEEE Trans. Neural Netw. 3(5), 683–697 (1992)CrossRef Pal, S.K., Mitra, S.: Multilayer perceptron, fuzzy sets, and classification. IEEE Trans. Neural Netw. 3(5), 683–697 (1992)CrossRef
21.
Zurück zum Zitat John, G.H., Langley, P.: Estimating continuous distributions in Bayesian classifiers. In: Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence, pp. 338–345. Morgan Kaufmann Publishers Inc. (1995) John, G.H., Langley, P.: Estimating continuous distributions in Bayesian classifiers. In: Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence, pp. 338–345. Morgan Kaufmann Publishers Inc. (1995)
22.
Zurück zum Zitat Platt, J.C.: 12 fast training of support vector machines using sequential minimal optimization. Adv. Kernel Methods 1, 185–208 (1999) Platt, J.C.: 12 fast training of support vector machines using sequential minimal optimization. Adv. Kernel Methods 1, 185–208 (1999)
23.
Zurück zum Zitat Hanley, J.A., McNeil, B.J.: The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143(1), 29–36 (1982)CrossRef Hanley, J.A., McNeil, B.J.: The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143(1), 29–36 (1982)CrossRef
Metadaten
Titel
Breast Cancer Diagnosis and Prognosis Using Machine Learning Techniques
verfasst von
Sunil Suresh Shastri
Priyanka C. Nair
Deepa Gupta
Ravi C. Nayar
Raghavendra Rao
Amritanshu Ram
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-319-68385-0_28