Using Feature Selection as Accuracy Benchmarking in Clinical Data Mining

Jafreen Hossain; Nor Fazlida Mohd Sani; Aida Mustapha; Lilly Suriani Affendey

doi:10.3844/jcssp.2013.883.888

Research Article Open Access

Jafreen Hossain¹, Nor Fazlida Mohd Sani¹, Aida Mustapha¹ and Lilly Suriani Affendey¹

¹ Universiti Putra Malaysia, Malaysia

Abstract

Automated prediction of a new patient's diagnosis from data mining analysis of historical data has proven to be an extremely useful tool in medical innovation, and several studies have focused on this aspect. The objective of this study is two-fold. First, we examine three classifiers, namely Naive Bayes, Multilayer Perceptron (MLP) and the J48 decision tree, for predicting diagnosis results. Second, we investigate the effect of feature selection in these experiments. We also compare our experimental results with those of the Comparative Disease Profile (CDP) study, which used the same dataset. The results show that Naive Bayes gives the best accuracy, both within our experiments and in comparison with CDP. However, we suggest using the Multilayer Perceptron, because the variables used in our experiments are interdependent, whereas Naive Bayes assumes that features are conditionally independent given the class. In addition, MLP achieved better accuracy than CDP.
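As an illustration of the benchmarking idea in the abstract, the following is a minimal sketch that compares a classifier's accuracy with and without feature selection. Several things here are assumptions rather than the authors' setup: the abstract does not name the feature selection method, so information gain ranking is used as a stand-in; the Naive Bayes classifier is a small hand-written categorical version; the records are hypothetical toy data, not the dataset used in the study; and MLP and J48 are omitted for brevity.

```python
import math
from collections import Counter, defaultdict


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def info_gain(rows, labels, j):
    """Information gain of categorical feature j with respect to the class."""
    groups = defaultdict(list)
    for row, y in zip(rows, labels):
        groups[row[j]].append(y)
    remainder = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def select_top_k(rows, labels, k):
    """Indices of the k features with the highest information gain."""
    ranked = sorted(range(len(rows[0])),
                    key=lambda j: info_gain(rows, labels, j), reverse=True)
    return sorted(ranked[:k])


def project(rows, cols):
    """Keep only the selected feature columns."""
    return [tuple(row[j] for j in cols) for row in rows]


class CategoricalNB:
    """Naive Bayes for categorical features with Laplace smoothing."""

    def fit(self, rows, labels):
        self.n = len(labels)
        self.classes = Counter(labels)
        self.counts = defaultdict(Counter)  # (class, feature) -> value counts
        self.values = defaultdict(set)      # feature -> values seen in training
        for row, y in zip(rows, labels):
            for j, v in enumerate(row):
                self.counts[(y, j)][v] += 1
                self.values[j].add(v)
        return self

    def predict(self, row):
        def log_posterior(y):
            score = math.log(self.classes[y] / self.n)
            for j, v in enumerate(row):
                num = self.counts[(y, j)][v] + 1
                den = self.classes[y] + len(self.values[j])
                score += math.log(num / den)
            return score
        return max(self.classes, key=log_posterior)


def accuracy(model, rows, labels):
    hits = sum(model.predict(r) == y for r, y in zip(rows, labels))
    return hits / len(labels)


# Hypothetical toy records: feature 0 tracks the label, features 1 and 2 do not.
rows = [("high", "a", "x"), ("high", "b", "x"), ("high", "a", "x"), ("high", "b", "x"),
        ("low", "a", "x"), ("low", "b", "x"), ("low", "a", "x"), ("low", "b", "x")]
labels = ["sick"] * 4 + ["healthy"] * 4

# Simple holdout: even-indexed records for training, odd-indexed for testing.
train_x, test_x = rows[::2], rows[1::2]
train_y, test_y = labels[::2], labels[1::2]

selected = select_top_k(train_x, train_y, k=1)
acc_all = accuracy(CategoricalNB().fit(train_x, train_y), test_x, test_y)
acc_selected = accuracy(
    CategoricalNB().fit(project(train_x, selected), train_y),
    project(test_x, selected), test_y)

print("selected features:", selected)
print("accuracy, all features:", acc_all)
print("accuracy, selected features:", acc_selected)
```

On real clinical data the two accuracies would generally differ, and the same comparison would be repeated for each classifier under study, ideally with cross-validation rather than a single holdout split.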

Journal of Computer Science

Volume 9 No. 7, 2013, 883-888

DOI: https://doi.org/10.3844/jcssp.2013.883.888

Submitted On: 27 February 2013
Published On: 20 June 2013

How to Cite: Hossain, J., Fazlida Mohd Sani, N., Mustapha, A. & Suriani Affendey, L. (2013). Using Feature Selection as Accuracy Benchmarking in Clinical Data Mining. Journal of Computer Science, 9(7), 883-888. https://doi.org/10.3844/jcssp.2013.883.888

Copyright: © 2013 Jafreen Hossain, Nor Fazlida Mohd Sani, Aida Mustapha and Lilly Suriani Affendey. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Keywords

Data Mining
Healthcare
Heart Disease
Multilayer Perceptron
Naive Bayes
J48