Abstract
In this paper, we present a new method for dealing with feature subset selection based on fuzzy entropy measures for handling classification problems. First, we discretize numeric features to construct the membership function of each fuzzy set of a feature. Then, we select the feature subset based on the proposed fuzzy entropy measure focusing on boundary samples. The proposed method can select relevant features to get higher average classification accuracy rates than the ones selected by the MIFS method (Battiti, R. in IEEE Trans. Neural Netw. 5(4):537–550, 1994), the FQI method (De, R.K., et al. in Neural Netw. 12(10):1429–1455, 1999), the OFEI method, Dong-and-Kothari’s method (Dong, M., Kothari, R. in Pattern Recognit. Lett. 24(9):1215–1225, 2003) and the OFFSS method (Tsang, E.C.C., et al. in IEEE Trans. Fuzzy Syst. 11(2):202–213, 2003).
Similar content being viewed by others
References
Baim PW (1988) A method for attribute selection in inductive learning systems. IEEE Trans Pattern Anal Mach Intell 10(6):888–896
Battiti R (1994) Using mutual information for selecting features in supervised neural net learning. IEEE Trans Neural Netw 5(4):537–550
Caruana R, Freitag D (1994) Greedy attribute selection. In: Proceedings of international conference on machine learning, New Brunswick, NJ, pp 28–36
Chaikla N, Qi Y (1999) Genetic algorithms in feature selection. In: Proceedings of the 1999 IEEE international conference on systems, man, and cybernetics, Tokyo, Japan, vol 5, pp 538–540
Chen SM, Chang CH (2005) A new method to construct membership functions and generate weighted fuzzy rules from training instances. Cybern Syst 36(4):397–414
Chen SM, Chen YC (2002) Automatically constructing membership functions and generating fuzzy rules using genetic algorithms. Cybern Syst 33(8):841–862
Chen SM, Kao CH, Yu CH (2002) Generating fuzzy rules from training data containing noise for handling classification problems. Cybern Syst 33(7):723–748
Chen SM, Shie JD (2005) A new method for feature subset selection for handling classification problems. In: Proceedings of the 2005 IEEE international conference on fuzzy systems, Reno, NV, pp 183-188
Chen SM (1988) A new approach to handling fuzzy decision-making problems. IEEE Trans Syst Man Cybern 18(6):1012–1016
De RK, Basak J, Pal SK (1999) Neuro-fuzzy feature evaluation with theoretical analysis. Neural Netw 12(10):1429–1455
De RK, Pal NR, Pal SK (1997) Feature analysis: neural network and fuzzy set theoretic approaches. Pattern Recognit 30(10):1579–1590
Dong M, Kothari R (2003) Feature subset selection using a new definition of classifiability. Pattern Recognit Lett 24(9):1215–1225
Fodor IK, Kamath C (2002) Dimension reduction techniques and the classification of bent double galaxies. Comput Stat Data Anal 41(1):91–122
Hartigan JA, Wong MA (1979) A k-means clustering algorithm. J Roy Stat Soc Ser C 28(1):100–108
John GH, Kohavi R, Pfleger K (1994) Irrelevant features and the subset selection problem. In: Proceedings of the eleventh international conference on machine learning, San Francisco, CA, pp 121–129
Kosko B (1986) Fuzzy entropy and conditioning. Inf Scie 40(2):165–174
Landwehr N, Hall M, Frank E (2003) Logistic model trees. In: Proceedings of the 14th European conference on machine learning, Cavtat-Dubrovnik, Croatia, pp 241–252
Lee HM, Chen CM, Chen JM, Jou YL (2001) An efficient fuzzy classifier with feature selection based on fuzzy entropy. IEEE Trans Syst Man Cybern Part B Cybern 31(3):426–432
Luca AD, Termini S (1972) A definition of a non-probabilistic entropy in the setting of fuzzy set theory. Inf Control 20(4):301–312
Platt JC (1999) Using analytic QP and sparseness to speed training of support vector machines. In: Proceedings of the thirteenth annual conference on neural information processing systems, Denver, CO, pp 557–563
Pal SK, De RK, Bsask J (2000) Unsupervised feature selection: a neuro-fuzzy approach. IEEE Trans Neural Netw 11(2):366–376
Quinlan JR (1993) C4.5: programs for machine learning. Kaufmann, San Francisco
Schaffer C (1993) Overfitting avoidance as bias. Mach Learn 10(2):153–178
Shannon CE (1948) A mathematical theory of communication. Bell Syst Techn J 27(3):379–423
Shie JD, Chen SM (2006) A new approach for handling classification problems based on fuzzy information gain measures. In: Proceedings of the 2006 IEEE international conference on fuzzy systems, Vancouver, BC, Canada, pp 5427–5434
Tsang ECC, Yeung DS, Wang XZ (2003) OFFSS: optimal fuzzy-valued feature subset selection. IEEE Trans Fuzzy Syst 11(2):202–213
Zadeh LA (1965) Fuzzy sets. Inf Control 8:338–353
Zadeh LA (1965) Probability measures of fuzzy events. J Math Anal Appl 23(2):421–427
Zadeh LA (1975) The concept of linguistic variable and its application to approximate reasoning, I. Inf Sci 8(3):199–249
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Shie, JD., Chen, SM. Feature subset selection based on fuzzy entropy measures for handling classification problems. Appl Intell 28, 69–82 (2008). https://doi.org/10.1007/s10489-007-0042-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-007-0042-6