Skip to main content
Top
Published in: Health and Technology 5/2020

18-05-2020 | Original Paper

Heart disease classification using data mining tools and machine learning techniques

Authors: Ilias Tougui, Abdelilah Jilbab, Jamal El Mhamdi

Published in: Health and Technology | Issue 5/2020

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Nowadays, in healthcare industry, data analysis can save lives by improving the medical diagnosis. And with the huge development in software engineering, different data mining tools are available for researchers, and used to conduct studies and experiments. For this, we have decided to compare six common data mining tools: Orange, Weka, RapidMiner, Knime, Matlab, and Scikit-Learn, using six machine learning techniques: Logistic Regression, Support Vector Machine, K Nearest Neighbors, Artificial Neural Network, Naïve Bayes, and Random Forest by classifying heart disease. The dataset used in this study has 13 features, one target variable, and 303 instances in which 139 suffers from cardiovascular disease and 164 are healthy subjects. Three performance measures were used to compare the performance of the techniques in each tool: the accuracy, the sensitivity, and the specificity. The results showed that Matlab was the best performing tool, and Matlab’s Artificial Neural Network model was the best performing technique. We concluded this research by plotting the Receiver operating characteristic curve of Matlab and by giving several recommendations on which tool to choose taking into account the users experience in the field of data mining.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Literature
4.
go back to reference Mohtadi K, Msaad R, Essadik R, Lebrazi H, Kettani A. Current risk factors of ischemic cardiovascular diseases estimated in a representative population of Casablanca. Endocrinol Metab Syndr. 2018;7(284):2161–1017.10002. Mohtadi K, Msaad R, Essadik R, Lebrazi H, Kettani A. Current risk factors of ischemic cardiovascular diseases estimated in a representative population of Casablanca. Endocrinol Metab Syndr. 2018;7(284):2161–1017.10002.
5.
go back to reference Wang S, Summers RM. Machine learning and radiology. Med Image Anal. 2012;16(5):933–51.CrossRef Wang S, Summers RM. Machine learning and radiology. Med Image Anal. 2012;16(5):933–51.CrossRef
6.
go back to reference Kourou K, Exarchos TP, Exarchos KP, Karamouzis MV, Fotiadis DI. Machine learning applications in cancer prognosis and prediction. Comput Struct Biotechnol J. 2015;13:8–17.CrossRef Kourou K, Exarchos TP, Exarchos KP, Karamouzis MV, Fotiadis DI. Machine learning applications in cancer prognosis and prediction. Comput Struct Biotechnol J. 2015;13:8–17.CrossRef
7.
go back to reference Aggarwal CC. Data mining: the textbook. Springer; 2015. Aggarwal CC. Data mining: the textbook. Springer; 2015.
8.
go back to reference Panesar A. Machine learning and AI for healthcare: Springer; 2019. Panesar A. Machine learning and AI for healthcare: Springer; 2019.
9.
go back to reference Haraty RA, Dimishkieh M, Masud M. An enhanced k-means clustering algorithm for pattern discovery in healthcare data. Int J Distrib Sensor Netw. 2015;11(6):615740.CrossRef Haraty RA, Dimishkieh M, Masud M. An enhanced k-means clustering algorithm for pattern discovery in healthcare data. Int J Distrib Sensor Netw. 2015;11(6):615740.CrossRef
10.
go back to reference Kavakiotis I, Tsave O, Salifoglou A, Maglaveras N, Vlahavas I, Chouvarda I. Machine learning and data mining methods in diabetes research. Comput Struct Biotechnol J. 2017;15:104–16.CrossRef Kavakiotis I, Tsave O, Salifoglou A, Maglaveras N, Vlahavas I, Chouvarda I. Machine learning and data mining methods in diabetes research. Comput Struct Biotechnol J. 2017;15:104–16.CrossRef
11.
go back to reference Shameer K, Johnson KW, Glicksberg BS, Dudley JT, Sengupta PP. Machine learning in cardiovascular medicine: are we there yet? Heart. 2018;104(14):1156–64.CrossRef Shameer K, Johnson KW, Glicksberg BS, Dudley JT, Sengupta PP. Machine learning in cardiovascular medicine: are we there yet? Heart. 2018;104(14):1156–64.CrossRef
12.
go back to reference Benba A, Jilbab A, Hammouch A. Discriminating between patients with Parkinson’s and neurological diseases using cepstral analysis. IEEE Trans Neural Syst Rehabil Eng. 2016;24(10):1100–8.CrossRef Benba A, Jilbab A, Hammouch A. Discriminating between patients with Parkinson’s and neurological diseases using cepstral analysis. IEEE Trans Neural Syst Rehabil Eng. 2016;24(10):1100–8.CrossRef
13.
go back to reference Dwivedi AK. Performance evaluation of different machine learning techniques for prediction of heart disease. Neural Comput & Applic. 2018;29(10):685–93.CrossRef Dwivedi AK. Performance evaluation of different machine learning techniques for prediction of heart disease. Neural Comput & Applic. 2018;29(10):685–93.CrossRef
14.
go back to reference Dua D, Graff C. UCI machine learning repository. School of Information and Computer Science, University of California, Irvine, CA. 2019. Dua D, Graff C. UCI machine learning repository. School of Information and Computer Science, University of California, Irvine, CA. 2019.
15.
go back to reference Bhatt A, Dubey SK, Bhatt AK, Joshi M, editors. Data Mining Approach to Predict and Analyze the Cardiovascular Disease. Proceedings of the 5th International Conference on Frontiers in Intelligent Computing: Theory and Applications; 2017: Springer. Bhatt A, Dubey SK, Bhatt AK, Joshi M, editors. Data Mining Approach to Predict and Analyze the Cardiovascular Disease. Proceedings of the 5th International Conference on Frontiers in Intelligent Computing: Theory and Applications; 2017: Springer.
16.
go back to reference Sarangam Kodati DRV. Analysis of heart disease using in data mining tools Orange and Weka. Global J Comput Sci Technol. 2018. Sarangam Kodati DRV. Analysis of heart disease using in data mining tools Orange and Weka. Global J Comput Sci Technol. 2018.
17.
go back to reference Escamilla AKG, El Hassani AH, Andres E. A Comparison of Machine Learning Techniques to Predict the Risk of Heart Failure. Machine Learning Paradigms. Springer; 2019. p. 9–26. Escamilla AKG, El Hassani AH, Andres E. A Comparison of Machine Learning Techniques to Predict the Risk of Heart Failure. Machine Learning Paradigms. Springer; 2019. p. 9–26.
18.
go back to reference Latha CBC, Jeeva SC. Improving the accuracy of prediction of heart disease risk based on ensemble classification techniques. Inf Med Unlocked. 2019;16:100203.CrossRef Latha CBC, Jeeva SC. Improving the accuracy of prediction of heart disease risk based on ensemble classification techniques. Inf Med Unlocked. 2019;16:100203.CrossRef
19.
go back to reference Amin MS, Chiam YK, Varathan KD. Identification of significant features and data mining techniques in predicting heart disease. Telematics Inform. 2019;36:82–93.CrossRef Amin MS, Chiam YK, Varathan KD. Identification of significant features and data mining techniques in predicting heart disease. Telematics Inform. 2019;36:82–93.CrossRef
Metadata
Title
Heart disease classification using data mining tools and machine learning techniques
Authors
Ilias Tougui
Abdelilah Jilbab
Jamal El Mhamdi
Publication date
18-05-2020
Publisher
Springer Berlin Heidelberg
Published in
Health and Technology / Issue 5/2020
Print ISSN: 2190-7188
Electronic ISSN: 2190-7196
DOI
https://doi.org/10.1007/s12553-020-00438-1

Other articles of this Issue 5/2020

Health and Technology 5/2020 Go to the issue

Premium Partner