Skip to main content
Erschienen in: The Journal of Supercomputing 6/2020

23.03.2018

Health care data analysis using evolutionary algorithm

verfasst von: A. Suresh, R. Kumar, R. Varatharajan

Erschienen in: The Journal of Supercomputing | Ausgabe 6/2020

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Assessment of huge amount of data is the difficult task in the health care industry. Hence, it here brings the important need of the data mining in identifying the relationship between the data attributes. In this research work, an assessment model for the health care analysis is developed with the preprocessing steps of performing data cleaning by applying normalization with outlier detection by applying the k-means clustering. Then, the preprocessed data are subjected to the dimensionality reduction process by performing the Feature Selection task. Then, the selected features are analyzed by the wrapper model named SVM-based improved recursive feature selection, and its accuracy is evaluated and compared with the other traditional classifiers such as Naïve Bayes. The analysis demonstrates that the planned perfect has accomplished a regular correctness of 98.79% of health care dataset such as Pima Indians diabetes. It demonstrates that the planned technique has achieved improved consequences.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Zhou J (2007) Feature selection in data mining—approaches based on information theory. VDM Verlag, Saarbrücken Zhou J (2007) Feature selection in data mining—approaches based on information theory. VDM Verlag, Saarbrücken
2.
Zurück zum Zitat Bu F, Chen Z, Zhang Q, Yang LT (2016) Incomplete high-dimensional data imputation algorithm using feature selection and clustering analysis on cloud. J Supercomput 72(8):2977 Bu F, Chen Z, Zhang Q, Yang LT (2016) Incomplete high-dimensional data imputation algorithm using feature selection and clustering analysis on cloud. J Supercomput 72(8):2977
3.
Zurück zum Zitat Han J, Kamber M (2000) Data mining: concepts and techniques, 1st edn. Morgan Kaufmann Publishers, BurlingtonMATH Han J, Kamber M (2000) Data mining: concepts and techniques, 1st edn. Morgan Kaufmann Publishers, BurlingtonMATH
4.
Zurück zum Zitat Rahm E, Do HH (2000) Data cleaning: problems and current approaches. IEEE Bull Tech Comm Data Eng 23(4):3–13 Rahm E, Do HH (2000) Data cleaning: problems and current approaches. IEEE Bull Tech Comm Data Eng 23(4):3–13
5.
Zurück zum Zitat Lemke F, Mueller J-A (2003) Medical data analysis using self-organizing data mining technologies. Syst Anal Model Simul 43(10):1399–1408 Lemke F, Mueller J-A (2003) Medical data analysis using self-organizing data mining technologies. Syst Anal Model Simul 43(10):1399–1408
6.
Zurück zum Zitat Matheny ME, Ohno-Machado L, Resnic FS (2005) Discrimination and calibration of mortality risk prediction models in interventional cardiology. J Biomed Inform 38(5):367–375 Matheny ME, Ohno-Machado L, Resnic FS (2005) Discrimination and calibration of mortality risk prediction models in interventional cardiology. J Biomed Inform 38(5):367–375
7.
Zurück zum Zitat Quinlan J (1993) C4.5: programs for machine learning. Morgan Kaufmann, San Mateo Quinlan J (1993) C4.5: programs for machine learning. Morgan Kaufmann, San Mateo
8.
Zurück zum Zitat Pang-Ning T, Steinbach M, Kumar V (2006) Introduction to data mining. Library of Congress, WashingtonMATH Pang-Ning T, Steinbach M, Kumar V (2006) Introduction to data mining. Library of Congress, WashingtonMATH
9.
Zurück zum Zitat Koh HC, Tan G (2005) Data mining applications in healthcare. J Health Care Inf Manag 19(2):64–72 Koh HC, Tan G (2005) Data mining applications in healthcare. J Health Care Inf Manag 19(2):64–72
10.
Zurück zum Zitat Ordonez C (2004) Improving Heart Disease Prediction Using Constrained Association Rules. Seminar presentation at University of Tokyo Ordonez C (2004) Improving Heart Disease Prediction Using Constrained Association Rules. Seminar presentation at University of Tokyo
11.
Zurück zum Zitat Leskovec J, Rajaraman A, Ullman JD (2014) Mining massive datasets. Cambridge University Press, Cambridge Leskovec J, Rajaraman A, Ullman JD (2014) Mining massive datasets. Cambridge University Press, Cambridge
12.
Zurück zum Zitat Lin K-C, Zhang K-Y, Huang Y-H, Hung JC, Yen N (2016) Feature selection based on an improved cat swarm optimization algorithm for big data classification. J Supercomput 72(8):3210 Lin K-C, Zhang K-Y, Huang Y-H, Hung JC, Yen N (2016) Feature selection based on an improved cat swarm optimization algorithm for big data classification. J Supercomput 72(8):3210
13.
Zurück zum Zitat Carlsson G, Mémoli F (2010) Characterization, stability and convergence of hierarchical clustering methods. J Mach Learn Res 11:1425–1470MathSciNetMATH Carlsson G, Mémoli F (2010) Characterization, stability and convergence of hierarchical clustering methods. J Mach Learn Res 11:1425–1470MathSciNetMATH
14.
Zurück zum Zitat Osl M, Dreiseit S, Cerqueira F, Netzer M, Pfeifer B, Baumgartner C (2009) Demoting redundant features to improve the discriminatory ability in cancer data. J Biomed Inform 42(4):721–725 Osl M, Dreiseit S, Cerqueira F, Netzer M, Pfeifer B, Baumgartner C (2009) Demoting redundant features to improve the discriminatory ability in cancer data. J Biomed Inform 42(4):721–725
15.
Zurück zum Zitat Cios KJ, William Moore G (2002) Uniqueness of medical data mining. Artif Intell Med 26(1):1–24 Cios KJ, William Moore G (2002) Uniqueness of medical data mining. Artif Intell Med 26(1):1–24
16.
Zurück zum Zitat Sufi F (2011) Diagnosis of cardiovascular abnormalities from compressed ECG: a data mining-based approach. IEEE Trans Inf Technol Biomed 15(1):3–39 Sufi F (2011) Diagnosis of cardiovascular abnormalities from compressed ECG: a data mining-based approach. IEEE Trans Inf Technol Biomed 15(1):3–39
17.
Zurück zum Zitat Cristianini N, Shawe-Taylor J (2000) An introduction to support vector machines. Cambridge University Press, CambridgeMATH Cristianini N, Shawe-Taylor J (2000) An introduction to support vector machines. Cambridge University Press, CambridgeMATH
Metadaten
Titel
Health care data analysis using evolutionary algorithm
verfasst von
A. Suresh
R. Kumar
R. Varatharajan
Publikationsdatum
23.03.2018
Verlag
Springer US
Erschienen in
The Journal of Supercomputing / Ausgabe 6/2020
Print ISSN: 0920-8542
Elektronische ISSN: 1573-0484
DOI
https://doi.org/10.1007/s11227-018-2302-0

Weitere Artikel der Ausgabe 6/2020

The Journal of Supercomputing 6/2020 Zur Ausgabe

Premium Partner