Skip to main content
Top
Published in: Memetic Computing 2/2019

23-07-2018 | Regular Research Paper

A Multi-objective hybrid filter-wrapper evolutionary approach for feature selection

Authors: Marwa Hammami, Slim Bechikh, Chih-Cheng Hung, Lamjed Ben Said

Published in: Memetic Computing | Issue 2/2019

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Feature selection is an important pre-processing data mining task, which can reduce the data dimensionality and improve not only the classification accuracy but also the classifier efficiency. Filters use statistical characteristics of the data as the evaluation measure rather than using a classification algorithm. On the contrary, the wrapper process is computationally expensive because the evaluation of every feature subset requires running the classifier on the datasets and computing the accuracy from the obtained confusion matrix. In order to solve this problem, we propose a hybrid tri-objective evolutionary algorithm that optimizes two filter objectives, namely the number of features and the mutual information, and one wrapper objective corresponding to the accuracy. Once the population is classified into different non-dominated fronts, only feature subsets belonging to the first (best) one are improved using the indicator-based multi-objective local search. Our proposed hybrid algorithm, named Filter-Wrapper-based Nondominated Sorting Genetic Algorithm-II, is compared against several multi-objective and single-objective feature selection algorithms on eighteen benchmark datasets having different dimensionalities. Experimental results show that our proposed algorithm gives competitive and better results with respect to existing algorithms.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Appendix
Available only for authorised users
Literature
1.
go back to reference Gheyas IA, Smith LS (2010) Feature subset selection in large dimensionality domains. Pattern Recognit 43:5–13CrossRefMATH Gheyas IA, Smith LS (2010) Feature subset selection in large dimensionality domains. Pattern Recognit 43:5–13CrossRefMATH
2.
3.
go back to reference Xue B, Zhang M, Browne WN, Yao X (2016) A Survey on evolutionary computation approaches to feature selection. IEEE Trans Evol Comput 20:606–626CrossRef Xue B, Zhang M, Browne WN, Yao X (2016) A Survey on evolutionary computation approaches to feature selection. IEEE Trans Evol Comput 20:606–626CrossRef
4.
go back to reference Mukhopadhyay A, Maulik U (2013) An SVM-wrapped multiobjective evolutionary feature selection approach for identifying cancer-microRNA markers. IEEE Trans Nanobiosci 12:275–281CrossRef Mukhopadhyay A, Maulik U (2013) An SVM-wrapped multiobjective evolutionary feature selection approach for identifying cancer-microRNA markers. IEEE Trans Nanobiosci 12:275–281CrossRef
5.
go back to reference Xue B, Zhang M, Browne WN (2013) Particle swarm optimization for feature selection in classification: a multi-objective approach. IEEE Trans Cybern 43:1656–1671CrossRef Xue B, Zhang M, Browne WN (2013) Particle swarm optimization for feature selection in classification: a multi-objective approach. IEEE Trans Cybern 43:1656–1671CrossRef
6.
go back to reference Hart WE, Krasnogor N, Smith JE (2005) Recent advances in memetic algorithms. Springer, Berlin, p 410. ISBN: 978-3-540-32363-1 Hart WE, Krasnogor N, Smith JE (2005) Recent advances in memetic algorithms. Springer, Berlin, p 410. ISBN: 978-3-540-32363-1
7.
go back to reference Canuto AMP, Nascimento DSC (2012) A genetic-based approach to features selection for ensembles using a hybrid and adaptive fitness function. In: Proceedings of the international joint conference on neural networks (IJCNN), pp 1–8 Canuto AMP, Nascimento DSC (2012) A genetic-based approach to features selection for ensembles using a hybrid and adaptive fitness function. In: Proceedings of the international joint conference on neural networks (IJCNN), pp 1–8
8.
go back to reference Liu H, Zhao Z (2015) Manipulating data and dimension reduction methods: feature selection. In: Meyers R (eds) Encyclopedia of complexity and systems science. Springer, Berlin, Heidelberg Liu H, Zhao Z (2015) Manipulating data and dimension reduction methods: feature selection. In: Meyers R (eds) Encyclopedia of complexity and systems science. Springer, Berlin, Heidelberg
9.
go back to reference Hamdani TM, Won JM, Alimi AM, Karray F (2007) Multi-objective feature selection with NSGA II. In: Beliczynski B, Dzielinski A, Iwanowski M, Ribeiro B (eds) Adaptive and natural computing algorithms. ICANNGA 2007. Lecture Notes in Computer Science, vol 4431. Springer, Berlin, Heidelberg Hamdani TM, Won JM, Alimi AM, Karray F (2007) Multi-objective feature selection with NSGA II. In: Beliczynski B, Dzielinski A, Iwanowski M, Ribeiro B (eds) Adaptive and natural computing algorithms. ICANNGA 2007. Lecture Notes in Computer Science, vol 4431. Springer, Berlin, Heidelberg
10.
go back to reference Huang J, Cai Y, Xu X (2007) A hybrid genetic algorithm for feature selection wrapper based on mutual information. Pattern Recognit Lett 28:1825–1844CrossRef Huang J, Cai Y, Xu X (2007) A hybrid genetic algorithm for feature selection wrapper based on mutual information. Pattern Recognit Lett 28:1825–1844CrossRef
11.
go back to reference Zhu Z, Jia S, Ji Z (2010) Towards a memetic feature selection paradigm [application notes]. IEEE Comput Intell Mag 5:41–53CrossRef Zhu Z, Jia S, Ji Z (2010) Towards a memetic feature selection paradigm [application notes]. IEEE Comput Intell Mag 5:41–53CrossRef
12.
go back to reference Yang C-S, Chuang L-Y, Chen Y-J, Yang C-H (2008) Feature selection using memetic algorithms. In: Proceedings of the third international conference on convergence and hybrid information technology (ICCIT ’08), pp 416–423 Yang C-S, Chuang L-Y, Chen Y-J, Yang C-H (2008) Feature selection using memetic algorithms. In: Proceedings of the third international conference on convergence and hybrid information technology (ICCIT ’08), pp 416–423
13.
go back to reference Butler-Yeoman T, Xue B, Zhang M (2015) Particle swarm optimisation for feature selection: a hybrid filter–wrapper approach. In: Proceedings of the IEEE congress on evolutionary computation (CEC 2015), pp 2428–2435 Butler-Yeoman T, Xue B, Zhang M (2015) Particle swarm optimisation for feature selection: a hybrid filter–wrapper approach. In: Proceedings of the IEEE congress on evolutionary computation (CEC 2015), pp 2428–2435
14.
go back to reference Deb K, Pratap A, Agarwal S, Meyarivan T (2002) A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans Evol Comput 6:182–197CrossRef Deb K, Pratap A, Agarwal S, Meyarivan T (2002) A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans Evol Comput 6:182–197CrossRef
15.
go back to reference Cover T, Hart P (1967) Nearest neighbor pattern classification. IEEE Trans Inf Theory 13:21–27CrossRefMATH Cover T, Hart P (1967) Nearest neighbor pattern classification. IEEE Trans Inf Theory 13:21–27CrossRefMATH
16.
go back to reference Basseur M, Burke EK (2007) Indicator-based multi-objective local search. In: Proceedings of IEEE congress on evolutionary computation, pp 3100–3107 Basseur M, Burke EK (2007) Indicator-based multi-objective local search. In: Proceedings of IEEE congress on evolutionary computation, pp 3100–3107
17.
go back to reference Derrac J, García S, Molina D, Herrera F (2011) A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms. Swarm Evol Comput 1:3–18CrossRef Derrac J, García S, Molina D, Herrera F (2011) A practical tutorial on the use of nonparametric statistical tests as a methodology for comparing evolutionary and swarm intelligence algorithms. Swarm Evol Comput 1:3–18CrossRef
19.
go back to reference Gütlein M, Frank E, Hall M, Karwath A (2009) Large-scale attribute selection using wrappers. In: Proceedings of the IEEE symposium on computational intelligence and data mining (CIDM ’09), pp 332–339 Gütlein M, Frank E, Hall M, Karwath A (2009) Large-scale attribute selection using wrappers. In: Proceedings of the IEEE symposium on computational intelligence and data mining (CIDM ’09), pp 332–339
20.
go back to reference Abeel T, de Peer YV, Saeys Y (2009) Java-ml: a machine learning library. J Mach Learn Res 10:931–934MathSciNetMATH Abeel T, de Peer YV, Saeys Y (2009) Java-ml: a machine learning library. J Mach Learn Res 10:931–934MathSciNetMATH
21.
go back to reference Shannon C, Weaver W (1948) The mathematical theory of communication. The University of Illinois Press, Urbana, p 144. ISBN: 978-0252725487 Shannon C, Weaver W (1948) The mathematical theory of communication. The University of Illinois Press, Urbana, p 144. ISBN: 978-0252725487
22.
go back to reference Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: data mining, inference and prediction. Springer, New York, p 745. ISBN: 978-0-387-84858-7 Hastie T, Tibshirani R, Friedman J (2009) The elements of statistical learning: data mining, inference and prediction. Springer, New York, p 745. ISBN: 978-0-387-84858-7
23.
go back to reference Zitzler E, Laumanns M, Thiele L (2001) SPEA2: improving the strength pareto evolutionary algorithm for multiobjective optimization. In: Proceedings of the evolutionary methods for design, optimization and control with applications to industrial problems, pp 95–100 Zitzler E, Laumanns M, Thiele L (2001) SPEA2: improving the strength pareto evolutionary algorithm for multiobjective optimization. In: Proceedings of the evolutionary methods for design, optimization and control with applications to industrial problems, pp 95–100
24.
go back to reference Xue B, Cervante L, Shang L, Browne WN, Zhang M (2013) Multi-objective evolutionary algorithms for filter based feature selection in classification. Int J Artif Intell Tools 22:1350024–31CrossRef Xue B, Cervante L, Shang L, Browne WN, Zhang M (2013) Multi-objective evolutionary algorithms for filter based feature selection in classification. Int J Artif Intell Tools 22:1350024–31CrossRef
25.
go back to reference Bermejo P, Gámez JA, Puerta JM (2011) A GRASP algorithm for fast hybrid (filter–wrapper) feature subset selection in high-dimensional datasets. Pattern Recognit Lett 32:701–711CrossRef Bermejo P, Gámez JA, Puerta JM (2011) A GRASP algorithm for fast hybrid (filter–wrapper) feature subset selection in high-dimensional datasets. Pattern Recognit Lett 32:701–711CrossRef
Metadata
Title
A Multi-objective hybrid filter-wrapper evolutionary approach for feature selection
Authors
Marwa Hammami
Slim Bechikh
Chih-Cheng Hung
Lamjed Ben Said
Publication date
23-07-2018
Publisher
Springer Berlin Heidelberg
Published in
Memetic Computing / Issue 2/2019
Print ISSN: 1865-9284
Electronic ISSN: 1865-9292
DOI
https://doi.org/10.1007/s12293-018-0269-2

Other articles of this Issue 2/2019

Memetic Computing 2/2019 Go to the issue

Editorial

Editorial

Premium Partner