Published in: Memetic Computing 2/2019

23 July 2018 | Regular Research Paper

A Multi-objective hybrid filter-wrapper evolutionary approach for feature selection

Authors: Marwa Hammami, Slim Bechikh, Chih-Cheng Hung, Lamjed Ben Said

Abstract

Feature selection is an important pre-processing task in data mining that can reduce data dimensionality and improve both classification accuracy and classifier efficiency. Filters use statistical characteristics of the data as the evaluation measure rather than a classification algorithm. In contrast, the wrapper process is computationally expensive because evaluating each feature subset requires running the classifier on the dataset and computing the accuracy from the resulting confusion matrix. To address this problem, we propose a hybrid tri-objective evolutionary algorithm that optimizes two filter objectives, namely the number of features and the mutual information, and one wrapper objective corresponding to the accuracy. Once the population is sorted into non-dominated fronts, only the feature subsets belonging to the first (best) front are improved using indicator-based multi-objective local search. Our proposed hybrid algorithm, named Filter-Wrapper-based Nondominated Sorting Genetic Algorithm-II, is compared against several multi-objective and single-objective feature selection algorithms on eighteen benchmark datasets of different dimensionalities. Experimental results show that the proposed algorithm gives competitive or better results than existing algorithms.
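
To make the tri-objective formulation more concrete, the following is a minimal Python sketch, not the authors' implementation. It scores a feature subset on three minimized objectives (subset size, negated mutual information, classification error) and extracts the first non-dominated front. Several details are assumptions that the abstract does not specify: mutual information is aggregated as the sum of per-feature values with the class label, the wrapper classifier is a leave-one-out 1-nearest-neighbour, and all function names are hypothetical. The genetic operators and the indicator-based local search step are omitted.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Mutual information (in bits) between two discrete sequences."""
    n = len(xs)
    px, py, pxy = Counter(xs), Counter(ys), Counter(zip(xs, ys))
    return sum(
        (c / n) * math.log2(c * n / (px[x] * py[y]))
        for (x, y), c in pxy.items()
    )


def loo_1nn_accuracy(rows, labels, subset):
    """Leave-one-out accuracy of 1-NN on the selected columns (assumed classifier)."""
    if not subset:
        return 0.0
    correct = 0
    for i, row in enumerate(rows):
        nearest = min(
            (j for j in range(len(rows)) if j != i),
            key=lambda j: sum((row[f] - rows[j][f]) ** 2 for f in subset),
        )
        correct += labels[nearest] == labels[i]
    return correct / len(rows)


def objectives(rows, labels, mask):
    """Objective vector to minimize: (number of features, -relevance, error)."""
    subset = [f for f, bit in enumerate(mask) if bit]
    # Assumption: relevance is the sum of per-feature MI with the class label.
    relevance = sum(
        mutual_information([r[f] for r in rows], labels) for f in subset
    )
    error = 1.0 - loo_1nn_accuracy(rows, labels, subset)
    return (len(subset), -relevance, error)


def dominates(a, b):
    """True if a is no worse than b everywhere and strictly better somewhere."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def first_front(points):
    """Indices of the points that no other point dominates."""
    return [
        i for i, p in enumerate(points)
        if not any(dominates(q, p) for q in points)
    ]


# Toy data: feature 0 determines the label, feature 1 is noise, feature 2 is constant.
rows = [(0, 0, 1), (0, 1, 1), (1, 0, 1), (1, 1, 1)]
labels = [0, 0, 1, 1]
masks = [(1, 0, 0), (0, 1, 0), (1, 1, 0), (1, 1, 1)]
scores = [objectives(rows, labels, m) for m in masks]
front = first_front(scores)
print([masks[i] for i in front])
```

In a full algorithm of the kind described, only the subsets indexed by `front` would be passed on to the local search, which keeps the number of expensive wrapper evaluations spent on refinement small.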

Metadata
Title
A Multi-objective hybrid filter-wrapper evolutionary approach for feature selection
Authors
Marwa Hammami
Slim Bechikh
Chih-Cheng Hung
Lamjed Ben Said
Publication date
23 July 2018
Publisher
Springer Berlin Heidelberg
Published in
Memetic Computing, Issue 2/2019
Print ISSN: 1865-9284
Electronic ISSN: 1865-9292
DOI
https://doi.org/10.1007/s12293-018-0269-2
