Skip to main content
Top
Published in: Artificial Intelligence Review 4/2021

01-10-2020

A systematic mapping study for ensemble classification methods in cardiovascular disease

Authors: Mohamed Hosni, Juan M. Carrillo de Gea, Ali Idri, Manal El Bajta, José Luis Fernández Alemán, Ginés García-Mateos, Ibtissam Abnane

Published in: Artificial Intelligence Review | Issue 4/2021

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Ensemble methods overcome the limitations of single machine learning techniques by combining different techniques, and are employed in the quest to achieve a high level of accuracy. This approach has been investigated in various fields, one of them being that of bioinformatics. One of the most frequent applications of ensemble techniques involves research into cardiovascular diseases, which are considered the leading cause of death worldwide. The purpose of this research work is to identify the papers that investigate ensemble classification techniques applied to cardiology diseases, and to analyse them according to nine aspects: their publication venues, the medical tasks tackled, the empirical and research types adopted, the types of ensembles proposed, the single techniques used to construct the ensembles, the validation frameworks adopted to evaluate the proposed ensembles, the tools used to build the ensembles, and the optimization methods employed for the single techniques. This paper reports the carrying out of a systematic mapping study. An extensive automatic search in four digital libraries: IEEE Xplore, ACM Digital Library, PubMed, and Scopus, followed by a study selection process, resulted in the identification of 351 papers that were used to address our mapping questions. This study found that the papers selected had been published in a large number of different resources. The medical task addressed most frequently by the selected studies was diagnosis. In addition, the experiment-based empirical type and evaluation-based research type were the most dominant approaches adopted by the selected studies. Homogeneous ensembles were the ensemble type that was developed most often in literature, while decision trees, artificial neural networks and Bayesian classifiers were the single techniques used most frequently to develop ensemble classification methods. The weighted majority and majority voting rules were adopted to obtain the final decision of the ensembles developed. With regard to evaluation frameworks, the datasets obtained from the UCI and PhysioBank repositories were those used most often to evaluate the ensemble methods, while the k-fold cross-validation method was the most frequently-employed validation technique. Several tools with which to build ensemble classifiers were identified, and the type of software adopted with the greatest frequency was open source. Finally, only a few researchers took into account the optimization of the parameter settings of either single or meta ensemble classifiers. This mapping study attempts to provide a greater insight into the application of ensemble classification methods in cardiovascular diseases. The majority of the selected papers reported positive feedback as regards the ability of ensemble methods to perform better than single methods. Further analysis is required to aggregate the evidence reported in literature.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Literature
go back to reference Ani R, Augustine A, Akhil NC, Deepa OS (2016) Random forest ensemble classifier to predict the coronary heart disease using risk factors. In: Suresh LP, Panigrahi BK (eds) Advances in intelligent systems and computing, vol 397. Springer, New Delhi, pp 701–710 Ani R, Augustine A, Akhil NC, Deepa OS (2016) Random forest ensemble classifier to predict the coronary heart disease using risk factors. In: Suresh LP, Panigrahi BK (eds) Advances in intelligent systems and computing, vol 397. Springer, New Delhi, pp 701–710
go back to reference Aruna S, Nandakishore LV (2015) Ensemble neural network algorithm for detecting cardiac arrhythmia. In: Suresh LP, Dash SS, Panigrahi BK (eds) Advances in intelligent systems and computing, vol 324. Springer, New Delhi, pp 27–35 Aruna S, Nandakishore LV (2015) Ensemble neural network algorithm for detecting cardiac arrhythmia. In: Suresh LP, Dash SS, Panigrahi BK (eds) Advances in intelligent systems and computing, vol 324. Springer, New Delhi, pp 27–35
go back to reference Booba B, Gopal TV (2013) Comparison of Ant Colony Optimization & Particle Swarm Optimization in Grid Environment. Int J Adv Res Comput Sci Appl 1(5):27–33 Booba B, Gopal TV (2013) Comparison of Ant Colony Optimization & Particle Swarm Optimization in Grid Environment. Int J Adv Res Comput Sci Appl 1(5):27–33
go back to reference Chen X, Ji J, Loparo K, Li P (2017) Real-time personalized cardiac arrhythmia detection and diagnosis: a cloud computing architecture. In: 2017 IEEE EMBS international conference on biomedical & health informatics (BHI), pp 201–204. 10.1109/BHI.2017.7897240 Chen X, Ji J, Loparo K, Li P (2017) Real-time personalized cardiac arrhythmia detection and diagnosis: a cloud computing architecture. In: 2017 IEEE EMBS international conference on biomedical & health informatics (BHI), pp 201–204. 10.1109/BHI.2017.7897240
go back to reference Choudhury AD, Banerjee R, Pal A, Mandana KM (2017) A fusion approach for non-invasive detection of coronary artery disease. In: Proceedings of the 11th EAI international conference on pervasive computing technologies for healthcare—PervasiveHealth ’17, pp 217–220. https://doi.org/10.1145/3154862.3154871 Choudhury AD, Banerjee R, Pal A, Mandana KM (2017) A fusion approach for non-invasive detection of coronary artery disease. In: Proceedings of the 11th EAI international conference on pervasive computing technologies for healthcare—PervasiveHealth ’17, pp 217–220. https://​doi.​org/​10.​1145/​3154862.​3154871
go back to reference Davis J, Goadrich M (2006) The relationship between precision-recall and ROC curves. In: Proceedings of the 23rd international conference on Machine learning Davis J, Goadrich M (2006) The relationship between precision-recall and ROC curves. In: Proceedings of the 23rd international conference on Machine learning
go back to reference Durlak F, Wels M, Schwemmer C, Sühling M, Steidl S, Maier A (2017) Growing a random forest with fuzzy spatial features for fully automatic artery-specific coronary calcium scoring. Lect Notes Comput Sci 15(3):27–35CrossRef Durlak F, Wels M, Schwemmer C, Sühling M, Steidl S, Maier A (2017) Growing a random forest with fuzzy spatial features for fully automatic artery-specific coronary calcium scoring. Lect Notes Comput Sci 15(3):27–35CrossRef
go back to reference El Bialy R, Salama MA, Karam O (2016) An ensemble model for Heart disease data sets: a generalized model. In: Proceedings of the 10th international conference on informatics and systems—INFOS’16, 2016, vol 9–11-May, pp 191–196. https://doi.org/https://doi.org/10.1145/2908446.2908482. El Bialy R, Salama MA, Karam O (2016) An ensemble model for Heart disease data sets: a generalized model. In: Proceedings of the 10th international conference on informatics and systems—INFOS’16, 2016, vol 9–11-May, pp 191–196. https://​doi.​org/​https://​doi.​org/​10.​1145/​2908446.​2908482.
go back to reference Gayathri P, Jaisankar N (2013) Comprehensive study of heart disease diagnosis using data mining and soft computing techniques. Int J Eng Technol 5(3):2947–2958 Gayathri P, Jaisankar N (2013) Comprehensive study of heart disease diagnosis using data mining and soft computing techniques. Int J Eng Technol 5(3):2947–2958
go back to reference Gomes EF, Jorge AM, Azevedo PJ (2014) Classifying heart sounds using SAX motifs, random forests and text mining techniques. In: Proceedings of the 18th international database engineering & applications symposium on—IDEAS ’14, pp 334–337. https://doi.org/10.1145/2628194.2628240 Gomes EF, Jorge AM, Azevedo PJ (2014) Classifying heart sounds using SAX motifs, random forests and text mining techniques. In: Proceedings of the 18th international database engineering & applications symposium on—IDEAS ’14, pp 334–337. https://​doi.​org/​10.​1145/​2628194.​2628240
go back to reference Gupta D, Khare S, Aggarwal A (2017) A method to predict diagnostic codes for chronic diseases using machine learning techniques. In: Proceeding of the IEEE international conference on computing, communication and automation, ICCCA 2016, pp 281–287. https://doi.org/10.1109/CCAA.2016.7813730 Gupta D, Khare S, Aggarwal A (2017) A method to predict diagnostic codes for chronic diseases using machine learning techniques. In: Proceeding of the IEEE international conference on computing, communication and automation, ICCCA 2016, pp 281–287. https://​doi.​org/​10.​1109/​CCAA.​2016.​7813730
go back to reference Hasan SMM, Mamun MA, Uddin MP, Hossain MA (2018) Comparative analysis of classification approaches for heart disease prediction. In: 2018 international conference on computing communication chemistry electronic & engineering materials, pp 1–4 Hasan SMM, Mamun MA, Uddin MP, Hossain MA (2018) Comparative analysis of classification approaches for heart disease prediction. In: 2018 international conference on computing communication chemistry electronic & engineering materials, pp 1–4
go back to reference Hosni M, Idri A, Abran A (2017) Investigating heterogeneous ensembles with filter feature selection for software effort estimation. In: Proceedings of the 27th international workshop on software measurement and 12th international conference on software process and product measurement, pp 207–220. https://doi.org/10.1145/3143434.3143456 Hosni M, Idri A, Abran A (2017) Investigating heterogeneous ensembles with filter feature selection for software effort estimation. In: Proceedings of the 27th international workshop on software measurement and 12th international conference on software process and product measurement, pp 207–220. https://​doi.​org/​10.​1145/​3143434.​3143456
go back to reference Idri A, Hosni M, Abran A (2016) Systematic mapping study of ensemble effort estimation. In: Proceedings of the 11th international conference on evaluation of novel software approaches to software engineering, 2016, no Enase, pp 132–139. https://doi.org/10.5220/0005822701320139. Idri A, Hosni M, Abran A (2016) Systematic mapping study of ensemble effort estimation. In: Proceedings of the 11th international conference on evaluation of novel software approaches to software engineering, 2016, no Enase, pp 132–139. https://​doi.​org/​10.​5220/​0005822701320139​.
go back to reference Jabbar MA, Deekshatulu BL, Chandra P (2016) Prediction of heart disease using random forest and feature subset selection. Adv Intell Syst Comput 424:187–196 Jabbar MA, Deekshatulu BL, Chandra P (2016) Prediction of heart disease using random forest and feature subset selection. Adv Intell Syst Comput 424:187–196
go back to reference Kuan MM, Lim CP, Morad N, Harrison RF (2000) An experimental study of original and ordered fuzzy ARTMAP neural networks in pattern classification tasks. In 2000 TENCON proceedings of the intelligent systems and technologies for the new millennium (Cat. No. 00CH37119), vol 2, pp 392–397. https://doi.org/10.1109/TENCON.2000.888769 Kuan MM, Lim CP, Morad N, Harrison RF (2000) An experimental study of original and ordered fuzzy ARTMAP neural networks in pattern classification tasks. In 2000 TENCON proceedings of the intelligent systems and technologies for the new millennium (Cat. No. 00CH37119), vol 2, pp 392–397. https://​doi.​org/​10.​1109/​TENCON.​2000.​888769
go back to reference Lafta R, Zhang J, Tao X, Li Y, Abbas W (2017) A fast Fourier transform-coupled machine learning-based ensemble model for disease risk prediction using a real-life dataset. In: Lecture notes in computer science, vol 2, pp 654–670 Lafta R, Zhang J, Tao X, Li Y, Abbas W (2017) A fast Fourier transform-coupled machine learning-based ensemble model for disease risk prediction using a real-life dataset. In: Lecture notes in computer science, vol 2, pp 654–670
go back to reference Meesri S, Phimoltares S (2017) Diagnosis of heart disease using a mixed classifier. In: 2017 21st international computing science engineering conference, vol 6, pp 1–5 Meesri S, Phimoltares S (2017) Diagnosis of heart disease using a mixed classifier. In: 2017 21st international computing science engineering conference, vol 6, pp 1–5
go back to reference Nguyen TT, Liew AWC, Tran MT, Pham XC, Nguyen MP (2014) A novel genetic algorithm approach for simultaneous feature and classifier selection in multi classifier system. In: Proceedings of the 2014 IEEE congress on evolutionary computation, CEC 2014, pp 1698–1705. https://doi.org/10.1109/CEC.2014.6900377 Nguyen TT, Liew AWC, Tran MT, Pham XC, Nguyen MP (2014) A novel genetic algorithm approach for simultaneous feature and classifier selection in multi classifier system. In: Proceedings of the 2014 IEEE congress on evolutionary computation, CEC 2014, pp 1698–1705. https://​doi.​org/​10.​1109/​CEC.​2014.​6900377
go back to reference Nita S, Bitam S, Mellouk A (2018) An enhanced random forest for cardiac diseases identification based on ECG signal. In: 2018 14th international wireless communications & mobile computing conference, pp 1339–1344 Nita S, Bitam S, Mellouk A (2018) An enhanced random forest for cardiac diseases identification based on ECG signal. In: 2018 14th international wireless communications & mobile computing conference, pp 1339–1344
go back to reference Pandit D, Zhang L, Aslam N, Liu C, Hossain A, Chattopadhyay S (2014) An efficient abnormal beat detection scheme from ECG signals using neural network and ensemble classifiers. In: The 8th international conference on software, knowledge, information management and applications (SKIMA 2014), pp 1–6. https://doi.org/10.1109/SKIMA.2014.7083561 Pandit D, Zhang L, Aslam N, Liu C, Hossain A, Chattopadhyay S (2014) An efficient abnormal beat detection scheme from ECG signals using neural network and ensemble classifiers. In: The 8th international conference on software, knowledge, information management and applications (SKIMA 2014), pp 1–6. https://​doi.​org/​10.​1109/​SKIMA.​2014.​7083561
go back to reference Ruta D, Gabrys B (2000) An overview of classifier fusion methods. Comput Inf Syst 7:1–10 Ruta D, Gabrys B (2000) An overview of classifier fusion methods. Comput Inf Syst 7:1–10
go back to reference Sakellarios A et al (2019) A novel concept of the management of coronary artery disease patients based on machine learning risk stratification and computational biomechanics: preliminary results of SMARTool project antonis. In: World congress on medical physics & biomedical engineering (IUPESM), Prague, Czech Republic, 2019, vol 68/1, no May, pp 731–735. https://doi.org/10.1007/978-981-10-9035-6 Sakellarios A et al (2019) A novel concept of the management of coronary artery disease patients based on machine learning risk stratification and computational biomechanics: preliminary results of SMARTool project antonis. In: World congress on medical physics & biomedical engineering (IUPESM), Prague, Czech Republic, 2019, vol 68/1, no May, pp 731–735. https://​doi.​org/​10.​1007/​978-981-10-9035-6
go back to reference Sasikala S, Appavu Alias Balamurugan S, Geetha S (2013) An efficient feature selection paradigm using PCA-CFS-Shapley values ensemble applied to small medical data sets. In: 2013 fourth international conference on computing, communications and networking technologies (ICCCNT), pp 1–5. https://doi.org/10.1109/ICCCNT.2013.6726773 Sasikala S, Appavu Alias Balamurugan S, Geetha S (2013) An efficient feature selection paradigm using PCA-CFS-Shapley values ensemble applied to small medical data sets. In: 2013 fourth international conference on computing, communications and networking technologies (ICCCNT), pp 1–5. https://​doi.​org/​10.​1109/​ICCCNT.​2013.​6726773
go back to reference Schapire RE (1999) A brief introduction to boosting. In: Proceedings of the sixth international joint conference artificial intelligence Schapire RE (1999) A brief introduction to boosting. In: Proceedings of the sixth international joint conference artificial intelligence
go back to reference Schlemmer A, Zwirnmann H, Zabel M, Parlitz U, Luther S (2014) Evaluation of machine learning methods for the long-term prediction of cardiac diseases. In: 2014 8th conference of the European study group on cardiovascular oscillations (ESGCO), no Esgco, pp 157–158. https://doi.org/10.1109/ESGCO.2014.6847567 Schlemmer A, Zwirnmann H, Zabel M, Parlitz U, Luther S (2014) Evaluation of machine learning methods for the long-term prediction of cardiac diseases. In: 2014 8th conference of the European study group on cardiovascular oscillations (ESGCO), no Esgco, pp 157–158. https://​doi.​org/​10.​1109/​ESGCO.​2014.​6847567
go back to reference Seni G, Elder JF (2010) Ensemble methods in data mining: improving accuracy through combining predictions, vol 2. Morgan & Claypool Publishers, New York Seni G, Elder JF (2010) Ensemble methods in data mining: improving accuracy through combining predictions, vol 2. Morgan & Claypool Publishers, New York
go back to reference Soria-Olivas E, Martin-Guerrero JD, Redon J, Tellez-Plaza M, Vila-Frances J (2015) Improving mortality prediction in cardiovascular risk patients by balancing classes. In: 2015 IEEE international conference on data mining workshop (ICDMW), pp 480–484. 10.1109/ICDMW.2015.76 Soria-Olivas E, Martin-Guerrero JD, Redon J, Tellez-Plaza M, Vila-Frances J (2015) Improving mortality prediction in cardiovascular risk patients by balancing classes. In: 2015 IEEE international conference on data mining workshop (ICDMW), pp 480–484. 10.1109/ICDMW.2015.76
go back to reference Tulu B, Djamasbi S, Leroy G (2019) Designing a machine learning model to predict cardiovascular disease without any blood test. In: Extending the boundaries of design science theory and practice, vol 11491. Springer, p 324 Tulu B, Djamasbi S, Leroy G (2019) Designing a machine learning model to predict cardiovascular disease without any blood test. In: Extending the boundaries of design science theory and practice, vol 11491. Springer, p 324
go back to reference Vapnik VN (1998) Statistical learning theory. Wiley, New YorkMATH Vapnik VN (1998) Statistical learning theory. Wiley, New YorkMATH
go back to reference Xiao Y, Fang R (2017) RFMiner: risk factors discovery and mining for preventive cardiovascular health. In: 2017 IEEE/ACM international conference on connected health: applications, systems and engineering technologies (CHASE), pp 278–279. https://doi.org/10.1109/CHASE.2017.101 Xiao Y, Fang R (2017) RFMiner: risk factors discovery and mining for preventive cardiovascular health. In: 2017 IEEE/ACM international conference on connected health: applications, systems and engineering technologies (CHASE), pp 278–279. https://​doi.​org/​10.​1109/​CHASE.​2017.​101
go back to reference Yıldız OT, İrsoy O, Alpaydın E (2016) Bagging soft decision trees. In: Machine learning for health informatics, vol 1, pp 25–36 Yıldız OT, İrsoy O, Alpaydın E (2016) Bagging soft decision trees. In: Machine learning for health informatics, vol 1, pp 25–36
go back to reference Zhang Y, Zhao Z (2018) Fetal state assessment based on cardiotocography parameters using PCA and AdaBoost. In: Proceedings of the 2017 10th international congress on image and signal processing, biomedical engineering and informatics, CISP-BMEI 2017, vol 2018-Jan, pp 1–6. https://doi.org/10.1109/CISP-BMEI.2017.8302314 Zhang Y, Zhao Z (2018) Fetal state assessment based on cardiotocography parameters using PCA and AdaBoost. In: Proceedings of the 2017 10th international congress on image and signal processing, biomedical engineering and informatics, CISP-BMEI 2017, vol 2018-Jan, pp 1–6. https://​doi.​org/​10.​1109/​CISP-BMEI.​2017.​8302314
Metadata
Title
A systematic mapping study for ensemble classification methods in cardiovascular disease
Authors
Mohamed Hosni
Juan M. Carrillo de Gea
Ali Idri
Manal El Bajta
José Luis Fernández Alemán
Ginés García-Mateos
Ibtissam Abnane
Publication date
01-10-2020
Publisher
Springer Netherlands
Published in
Artificial Intelligence Review / Issue 4/2021
Print ISSN: 0269-2821
Electronic ISSN: 1573-7462
DOI
https://doi.org/10.1007/s10462-020-09914-6

Other articles of this Issue 4/2021

Artificial Intelligence Review 4/2021 Go to the issue

Premium Partner