Skip to main content
Erschienen in: Soft Computing 3/2020

15.05.2019 | Methodologies and Application

Hybrid model for prediction of heart disease

verfasst von: Bikash Kanti Sarkar

Erschienen in: Soft Computing | Ausgabe 3/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Heart disease is a leading cause of death in the world. In order to drop its rate, effective and timely diagnosis of the disease is very essential. Numerous automated decision support systems have been developed for this purpose. In the present research, a predictive model consisting of two-level optimization is introduced, to save lives and cost via effective diagnosis of the disease. Level-1 optimization of the model first identifies parallelly an optimal proportion (Popt) for training and test sets for each dataset on parallel machine. Next, the best training set (Tbest) for Popt is again searched parallelly. On the other hand, level-2 optimization refines the rule set (R) generated by the Perfect Rule Induction by Sequential Method (PRISM) learner on Tbest employing parallel genetic algorithm. The experimental results obtained by the model over the heart disease datasets (collected from https://​archive.​ics.​uci.​edu/​ml) are compared and analysed with its base learner and four state-of-the-art learners, namely C4.5 (decision tree-based classifier), Naïve Bayes, neural network and support vector machine. The empirical outcomes (based on the top performance metrics—prediction accuracy, precision, recall, area under curve values, true positive and false positive rates) positively demonstrate that the new model is proficient in undertaking heart disease treatment. Importantly, the prediction accuracy of the presented hybrid model exceeds around 6% than that of the sequential GA-based hybrid model over almost all the chosen datasets. After all, the proposed system may work as an e-doctor to predict heart attack and assist clinicians to take precautionary steps.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Anhänge
Nur mit Berechtigung zugänglich
Literatur
Zurück zum Zitat Alba E, Troya JM (2001) Analyzing synchronous and asynchronous parallel distributed genetic algorithms. Gener Comput Syst 17(4):451–465CrossRefMATH Alba E, Troya JM (2001) Analyzing synchronous and asynchronous parallel distributed genetic algorithms. Gener Comput Syst 17(4):451–465CrossRefMATH
Zurück zum Zitat Alba E, Troya JM (2002) Improving flexibility and efficiency by adding parallelism to genetic algorithms. Stat Comput 12(2):91–114CrossRefMathSciNet Alba E, Troya JM (2002) Improving flexibility and efficiency by adding parallelism to genetic algorithms. Stat Comput 12(2):91–114CrossRefMathSciNet
Zurück zum Zitat Ali SH (2012) A novel tool (FP-KC) for handle the three main dimensions reduction and association rule mining. In: The 6th international conference on sciences of electronics, technologies of information and telecommunications (IEEE), pp 951–961. https://doi.org/10.1109/setit.2012.6482042 Ali SH (2012) A novel tool (FP-KC) for handle the three main dimensions reduction and association rule mining. In: The 6th international conference on sciences of electronics, technologies of information and telecommunications (IEEE), pp 951–961. https://​doi.​org/​10.​1109/​setit.​2012.​6482042
Zurück zum Zitat Al-Janabi S, Patel A, Fatlawi H, Kalajdzic K, Al Shourbaji I (2014) Empirical rapid and accurate prediction model for data mining tasks in cloud computing environments. In: The international congress on technology, communication and knowledge (IEEE), Mashhad, pp 1–8 Al-Janabi S, Patel A, Fatlawi H, Kalajdzic K, Al Shourbaji I (2014) Empirical rapid and accurate prediction model for data mining tasks in cloud computing environments. In: The international congress on technology, communication and knowledge (IEEE), Mashhad, pp 1–8
Zurück zum Zitat Al-Janabi S, Al-Shourbaji I, Shojafar M, Shamshirband S (2017) Survey of main challenges (security and privacy) in wireless body area networks for healthcare applications. Egypt Inf J 18(2):113–122CrossRef Al-Janabi S, Al-Shourbaji I, Shojafar M, Shamshirband S (2017) Survey of main challenges (security and privacy) in wireless body area networks for healthcare applications. Egypt Inf J 18(2):113–122CrossRef
Zurück zum Zitat Arabasadi Z, Alizadehsani R, Roshanzamir M, Moosaei H, Yarifard AA (2017) Computer aided decision making for heart disease detection using hybrid neural network-genetic algorithm. Comput Methods Programs Biomed 141:19–26CrossRef Arabasadi Z, Alizadehsani R, Roshanzamir M, Moosaei H, Yarifard AA (2017) Computer aided decision making for heart disease detection using hybrid neural network-genetic algorithm. Comput Methods Programs Biomed 141:19–26CrossRef
Zurück zum Zitat Brown RG (1957) Exponential smoothing for predicting demand. Oper Res 5(1):145CrossRef Brown RG (1957) Exponential smoothing for predicting demand. Oper Res 5(1):145CrossRef
Zurück zum Zitat Catlett J (1991) On changing continuous attributes into ordered discrete attributes. In: Proceedings of European working session on learning, pp 164–178 Catlett J (1991) On changing continuous attributes into ordered discrete attributes. In: Proceedings of European working session on learning, pp 164–178
Zurück zum Zitat Cendrowska J (1987) PRISM: an algorithm for inducing modular rules. Int J Man-Mach Stud 27:349–370CrossRefMATH Cendrowska J (1987) PRISM: an algorithm for inducing modular rules. Int J Man-Mach Stud 27:349–370CrossRefMATH
Zurück zum Zitat Cho BH, Yu H, Lee J, Chee YJ, Kim IY, Kim SI (2008) Nonlinear support vector machine visualization for risk factor analysis using nomograms and localized radial basis function kernels. IEEE Trans Inf Technol Biomed 12(2):247–256CrossRef Cho BH, Yu H, Lee J, Chee YJ, Kim IY, Kim SI (2008) Nonlinear support vector machine visualization for risk factor analysis using nomograms and localized radial basis function kernels. IEEE Trans Inf Technol Biomed 12(2):247–256CrossRef
Zurück zum Zitat Choi E, Schuetz A, Stewart WF, Sun J (2016) Using recurrent neural network models for early detection of heart failure onset. J Am Med Inf Assoc 24(2):361–370 Choi E, Schuetz A, Stewart WF, Sun J (2016) Using recurrent neural network models for early detection of heart failure onset. J Am Med Inf Assoc 24(2):361–370
Zurück zum Zitat Clark P, Niblett T (1989) The CN2 algorithm. Mach Learn 3:261–283 Clark P, Niblett T (1989) The CN2 algorithm. Mach Learn 3:261–283
Zurück zum Zitat Cohen WW (1995) Fast effective rule induction. In: Proceeding of twelfth international conference on machine learning, pp 115–123 Cohen WW (1995) Fast effective rule induction. In: Proceeding of twelfth international conference on machine learning, pp 115–123
Zurück zum Zitat Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20:273–297MATH Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20:273–297MATH
Zurück zum Zitat Das R, Sengur A (2010) Evaluation of integrated methods for diagnosing of valvular heart disease. Expert Syst Appl 37(7):5110–5115CrossRef Das R, Sengur A (2010) Evaluation of integrated methods for diagnosing of valvular heart disease. Expert Syst Appl 37(7):5110–5115CrossRef
Zurück zum Zitat Das R, Turkoglu I, Sengur A (2009) Effective diagnosis of heart disease through neural networks ensembles. Expert Syst Appl 36:7675–7680CrossRef Das R, Turkoglu I, Sengur A (2009) Effective diagnosis of heart disease through neural networks ensembles. Expert Syst Appl 36:7675–7680CrossRef
Zurück zum Zitat Duda RO, Hurt PE (1973) Pattern classification and scene analysis. Wiley, New York Duda RO, Hurt PE (1973) Pattern classification and scene analysis. Wiley, New York
Zurück zum Zitat El-Bialy R, Salamay MA, Karam OH, Khalifa ME (2015) Feature analysis of coronary artery heart disease datasets. Procedia Comput Sci 65:459–468CrossRef El-Bialy R, Salamay MA, Karam OH, Khalifa ME (2015) Feature analysis of coronary artery heart disease datasets. Procedia Comput Sci 65:459–468CrossRef
Zurück zum Zitat Fernández A, García S, del Jesus MJ, Herrera F (2008) A study of the behaviour of linguistic fuzzy rule based classification systems in the framework of imbalanced data-sets. Fuzzy Sets Syst 159(18):2378–2398CrossRefMathSciNet Fernández A, García S, del Jesus MJ, Herrera F (2008) A study of the behaviour of linguistic fuzzy rule based classification systems in the framework of imbalanced data-sets. Fuzzy Sets Syst 159(18):2378–2398CrossRefMathSciNet
Zurück zum Zitat Fernández A, del Jesus MJ, Herrera F (2009) Hierarchical fuzzy rule based classification systems with genetic rule selection for imbalanced data-sets. Int J Approx Reason 50:561–577CrossRefMATH Fernández A, del Jesus MJ, Herrera F (2009) Hierarchical fuzzy rule based classification systems with genetic rule selection for imbalanced data-sets. Int J Approx Reason 50:561–577CrossRefMATH
Zurück zum Zitat Fix E, Hodges J (1951) Discriminatory analysis. Nonparametric discrimination: consistency properties. Technical Report 4, USAF School of Aviation Medicine, Randolph Field, Texas Fix E, Hodges J (1951) Discriminatory analysis. Nonparametric discrimination: consistency properties. Technical Report 4, USAF School of Aviation Medicine, Randolph Field, Texas
Zurück zum Zitat Fu LM (1999) Knowledge discovery based on neural networks. Commun ACM 42(11):47–50CrossRef Fu LM (1999) Knowledge discovery based on neural networks. Commun ACM 42(11):47–50CrossRef
Zurück zum Zitat Grefenstette JJ (1981) Parallel adaptive algorithms for function optimization. Report No. CS-81-19, Vanderbilt University, TN Grefenstette JJ (1981) Parallel adaptive algorithms for function optimization. Report No. CS-81-19, Vanderbilt University, TN
Zurück zum Zitat Hidayet T (2018) Improvement of heart attack prediction by the feature selection methods. Turk J Electr Eng Comput Sci 26:1–10CrossRef Hidayet T (2018) Improvement of heart attack prediction by the feature selection methods. Turk J Electr Eng Comput Sci 26:1–10CrossRef
Zurück zum Zitat Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780CrossRef Hochreiter S, Schmidhuber J (1997) Long short-term memory. Neural Comput 9(8):1735–1780CrossRef
Zurück zum Zitat Jabbar MA, Deekshatulu BL, Chandra P (2013) Classification of heart disease using K-nearest neighbor and genetic algorithm. Procedia Technol 10:85–94CrossRef Jabbar MA, Deekshatulu BL, Chandra P (2013) Classification of heart disease using K-nearest neighbor and genetic algorithm. Procedia Technol 10:85–94CrossRef
Zurück zum Zitat Japkowicz N, Stephen S (2002) The class imbalance problem: a systematic study. Intell Data Anal 6(5):429–450CrossRefMATH Japkowicz N, Stephen S (2002) The class imbalance problem: a systematic study. Intell Data Anal 6(5):429–450CrossRefMATH
Zurück zum Zitat Kahramanli H, Allahverdi N (2009) Extracting rules for classification problems: AIS based approach. Expert Syst Appl 36(7):10494–10502CrossRef Kahramanli H, Allahverdi N (2009) Extracting rules for classification problems: AIS based approach. Expert Syst Appl 36(7):10494–10502CrossRef
Zurück zum Zitat Kira K, Reddell L (1992) The feature selection problem: traditional methods and a new algorithm. In: Proceedings AAAI-95. AAAI Press/The MIT Press, Cambridge, pp 129–134 Kira K, Reddell L (1992) The feature selection problem: traditional methods and a new algorithm. In: Proceedings AAAI-95. AAAI Press/The MIT Press, Cambridge, pp 129–134
Zurück zum Zitat Klosgen W, Z’ytkow JM (2002) Handbook of data mining and knowledge discovery. Oxford University Press, OxfordMATH Klosgen W, Z’ytkow JM (2002) Handbook of data mining and knowledge discovery. Oxford University Press, OxfordMATH
Zurück zum Zitat Kurgan L, Cios KJ (2004) CAIM discretization algorithm. IEEE Trans Knowl Data Eng 16(2):145–153CrossRef Kurgan L, Cios KJ (2004) CAIM discretization algorithm. IEEE Trans Knowl Data Eng 16(2):145–153CrossRef
Zurück zum Zitat Lipton ZC, Kale DC, Elkan C, Wetzel R (2016) Learning to diagnose with LSTM recurrent neural networks. In: ICLR 2016, Caribe Hilton, San Juan, Puerto Rico Lipton ZC, Kale DC, Elkan C, Wetzel R (2016) Learning to diagnose with LSTM recurrent neural networks. In: ICLR 2016, Caribe Hilton, San Juan, Puerto Rico
Zurück zum Zitat Luukka P (2011) Feature selection using fuzzy entropy measures with similarity classifier. Expert Syst Appl 38(4):4600–4607CrossRef Luukka P (2011) Feature selection using fuzzy entropy measures with similarity classifier. Expert Syst Appl 38(4):4600–4607CrossRef
Zurück zum Zitat Mitchell TM (1997) Machine learning. McGraw-Hill, New YorkMATH Mitchell TM (1997) Machine learning. McGraw-Hill, New YorkMATH
Zurück zum Zitat Montalbano M (1974) Decision tables. SRA, Chicago Montalbano M (1974) Decision tables. SRA, Chicago
Zurück zum Zitat Nahar J, Imam T, Tickle KS, Chen YPP (2013) Computational intelligence for heart disease diagnosis: a medical knowledge driven approach. Expert Syst Appl 40(1):96–104CrossRef Nahar J, Imam T, Tickle KS, Chen YPP (2013) Computational intelligence for heart disease diagnosis: a medical knowledge driven approach. Expert Syst Appl 40(1):96–104CrossRef
Zurück zum Zitat Nikolaiev S, Timoshenko Y (2015) Reinvention of the cardiovascular diseases prevention and prediction due to ubiquitous convergence of mobile apps and machine learning. In: IEEE explorer proceedings titled—information technologies in innovation business (ITIB), 7–9 October, 2015, Kharkiv, Ukraine Nikolaiev S, Timoshenko Y (2015) Reinvention of the cardiovascular diseases prevention and prediction due to ubiquitous convergence of mobile apps and machine learning. In: IEEE explorer proceedings titled—information technologies in innovation business (ITIB), 7–9 October, 2015, Kharkiv, Ukraine
Zurück zum Zitat Ozcift A, Gulten A (2011) Classifier ensemble construction with rotation forest to improve medical diagnosis performance of machine learning algorithms. Comput Methods Programs Biomed 104(3):443–451CrossRef Ozcift A, Gulten A (2011) Classifier ensemble construction with rotation forest to improve medical diagnosis performance of machine learning algorithms. Comput Methods Programs Biomed 104(3):443–451CrossRef
Zurück zum Zitat Park YJ, Chun SH, Kim BC (2011) Cost-sensitive case-based reasoning using a genetic algorithm: application to medical diagnosis. Artif Intell Med 51(2):133–145CrossRef Park YJ, Chun SH, Kim BC (2011) Cost-sensitive case-based reasoning using a genetic algorithm: application to medical diagnosis. Artif Intell Med 51(2):133–145CrossRef
Zurück zum Zitat Pawlak Z, Slowinski R (1994) Rough set approach to multi-attribute decision analysis. Eur J Oper Res 472:43–459MATH Pawlak Z, Slowinski R (1994) Rough set approach to multi-attribute decision analysis. Eur J Oper Res 472:43–459MATH
Zurück zum Zitat Pfahringer B (1995) Supervised and unsupervised discretization of continuous features. In: Proceedings of 12th international conference on machine learning, pp 456–463 Pfahringer B (1995) Supervised and unsupervised discretization of continuous features. In: Proceedings of 12th international conference on machine learning, pp 456–463
Zurück zum Zitat Purushottam, Saxena K, Sharma R (2016) Efficient heart disease prediction system. Procedia Comput Sci 85:962–969CrossRef Purushottam, Saxena K, Sharma R (2016) Efficient heart disease prediction system. Procedia Comput Sci 85:962–969CrossRef
Zurück zum Zitat Quinlan JR (1986) Induction of decision trees. Mach Learn 1:81–106 Quinlan JR (1986) Induction of decision trees. Mach Learn 1:81–106
Zurück zum Zitat Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufman, San Mateo Quinlan JR (1993) C4.5: programs for machine learning. Morgan Kaufman, San Mateo
Zurück zum Zitat Quinn MJ (1994) Parallel computing: theory and practice, 2nd edn. McGraw-Hill Inc, New York. ISBN 0-07-051294-9 Quinn MJ (1994) Parallel computing: theory and practice, 2nd edn. McGraw-Hill Inc, New York. ISBN 0-07-051294-9
Zurück zum Zitat Sarkar BK (2016) A case study on partitioning data for classification. Int J Inf Decis Sci 8(1):73–91 Sarkar BK (2016) A case study on partitioning data for classification. Int J Inf Decis Sci 8(1):73–91
Zurück zum Zitat Sarkar BK, Sachdev K, Bharati S, Bhaskar A (2005) An interface for converting rules generated by C4.5 to the most suitable format for Genetic Algorithm. In: Proceedings of the eighth international conference on IT (CIT-2005), Bhubaneswar, India, pp 113–115 Sarkar BK, Sachdev K, Bharati S, Bhaskar A (2005) An interface for converting rules generated by C4.5 to the most suitable format for Genetic Algorithm. In: Proceedings of the eighth international conference on IT (CIT-2005), Bhubaneswar, India, pp 113–115
Zurück zum Zitat Sarkar BK, Sana SS, Chaudhuri KS (2011a) MIL: a data discretization approach. Int J Data Min Model Manag (IJDMMM) 3(3):303–318 Sarkar BK, Sana SS, Chaudhuri KS (2011a) MIL: a data discretization approach. Int J Data Min Model Manag (IJDMMM) 3(3):303–318
Zurück zum Zitat Sarkar BK, Sana SS, Chaudhuri KS (2011b) Selecting informative rules with parallel genetic algorithm in classification problem. Appl Math Comput 218:3247–3264MATH Sarkar BK, Sana SS, Chaudhuri KS (2011b) Selecting informative rules with parallel genetic algorithm in classification problem. Appl Math Comput 218:3247–3264MATH
Zurück zum Zitat Simon D (2013) Evolutionary optimization algorithms: biologically-inspired and population-based approaches to computer intelligence. Wiley, HobokenMATH Simon D (2013) Evolutionary optimization algorithms: biologically-inspired and population-based approaches to computer intelligence. Wiley, HobokenMATH
Zurück zum Zitat Skurichina M, Duin RPW (2002) Bagging, boosting and the random subspace method for linear classifiers. Pattern Anal Appl 5(2):121–135CrossRefMathSciNetMATH Skurichina M, Duin RPW (2002) Bagging, boosting and the random subspace method for linear classifiers. Pattern Anal Appl 5(2):121–135CrossRefMathSciNetMATH
Zurück zum Zitat Stasis AC, Loukis EN, Pavlopoulos SA, Koutsouris D (2003) Using decision tree algorithms as a basis for a heart sound diagnosis decision support system. In: Proceedings of the 4th annual IEEE conference on information technology applications in biomedicine, UK, pp 354–357 Stasis AC, Loukis EN, Pavlopoulos SA, Koutsouris D (2003) Using decision tree algorithms as a basis for a heart sound diagnosis decision support system. In: Proceedings of the 4th annual IEEE conference on information technology applications in biomedicine, UK, pp 354–357
Zurück zum Zitat Suresh P, Ananda Raj MD (2018) Study and analysis of prediction model for heart disease: an optimization approach using genetic algorithm. Int J Pure Appl Math 119(16):5323–5336 Suresh P, Ananda Raj MD (2018) Study and analysis of prediction model for heart disease: an optimization approach using genetic algorithm. Int J Pure Appl Math 119(16):5323–5336
Zurück zum Zitat Tsai C-J, Lee C-I, Yang W-P (2008) A discretization algorithm based on class-attribute contingency coefficient. Inf Sci 178:714–731CrossRef Tsai C-J, Lee C-I, Yang W-P (2008) A discretization algorithm based on class-attribute contingency coefficient. Inf Sci 178:714–731CrossRef
Zurück zum Zitat Vijayarani S, Divya M (2011) An efficient algorithm for generating classification rules. Int J Comput Sci Technol 2(4):512–515 Vijayarani S, Divya M (2011) An efficient algorithm for generating classification rules. Int J Comput Sci Technol 2(4):512–515
Metadaten
Titel
Hybrid model for prediction of heart disease
verfasst von
Bikash Kanti Sarkar
Publikationsdatum
15.05.2019
Verlag
Springer Berlin Heidelberg
Erschienen in
Soft Computing / Ausgabe 3/2020
Print ISSN: 1432-7643
Elektronische ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-019-04022-2

Weitere Artikel der Ausgabe 3/2020

Soft Computing 3/2020 Zur Ausgabe