Skip to main content
Erschienen in: Evolutionary Intelligence 4/2019

27.08.2019 | Research Paper

A histogram based fuzzy ensemble technique for feature selection

verfasst von: Manosij Ghosh, Ritam Guha, Pawan Kumar Singh, Vikrant Bhateja, Ram Sarkar

Erschienen in: Evolutionary Intelligence | Ausgabe 4/2019

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Feature selection (FS) is an integral part of many machine learning problems in providing a better and time-efficient classification model. In recent times, many new FS algorithms have been proposed which combine well-established algorithms to overcome drawbacks of the constituent algorithms. The general process of combination is to allow them to operate consecutively or simultaneously. These rudimentary combinations in many cases do not allow for proper inclusion of the advantages of the specific algorithms and this necessitates an alternative approach for combining. Initially without interrupting the flow of the algorithms, we allow them to generate their results. After selection of the most dominant features, the rest of the combination is done using the concept of histogram and assigning a weightage to the fuzzy features based on the quality of the candidate solution in which they appear. In the proposed method, the outcome of the three popularly used algorithms with complementary exploitation–exploration trade-off namely genetic algorithm (GA), binary particle swarm optimisation (BPSO) and ant colony optimisation (ACO) are combined together. Then, 14 popular UCI datasets have been used to evaluate the proposed FS method. Results obtained by our proposed ensemble are compared with some popular FS models like gravitational search algorithm, histogram based multi objective GA, GA, BPSO and ACO, and it shows that our algorithm outperforms the others.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
2.
Zurück zum Zitat Yang J, Honavar V (1998) Feature subset selection using a genetic algorithm. IEEE Intell Syst Appl 13:44–49CrossRef Yang J, Honavar V (1998) Feature subset selection using a genetic algorithm. IEEE Intell Syst Appl 13:44–49CrossRef
4.
Zurück zum Zitat Ghosh M, Begum S, Sarkar R, Chakraborty D, Maulik U (2019) Recursive memetic algorithm for gene selection in microarray data. Expert Syst Appl 116:172–185CrossRef Ghosh M, Begum S, Sarkar R, Chakraborty D, Maulik U (2019) Recursive memetic algorithm for gene selection in microarray data. Expert Syst Appl 116:172–185CrossRef
5.
Zurück zum Zitat Ghosh M, Adhikary S, Ghosh KK, Sardar A, Begum S, Sarkar R (2019) Genetic algorithm based cancerous gene identification from microarray data using ensemble of filter methods. Med Biol Eng Comput 57:159–176CrossRef Ghosh M, Adhikary S, Ghosh KK, Sardar A, Begum S, Sarkar R (2019) Genetic algorithm based cancerous gene identification from microarray data using ensemble of filter methods. Med Biol Eng Comput 57:159–176CrossRef
6.
Zurück zum Zitat Liu H, Motoda H (2007) Computational methods of feature selection. CRC Press, Boca RatonCrossRef Liu H, Motoda H (2007) Computational methods of feature selection. CRC Press, Boca RatonCrossRef
7.
Zurück zum Zitat Mitra P, Murthy CA, Pal SK (2002) Unsupervised feature selection using feature similarity. IEEE Trans Pattern Anal Mach Intell 24:301–312CrossRef Mitra P, Murthy CA, Pal SK (2002) Unsupervised feature selection using feature similarity. IEEE Trans Pattern Anal Mach Intell 24:301–312CrossRef
8.
Zurück zum Zitat Shang W-Q, Qu Y-L, Huang H-K, Zhu H-B, Lin Y-M, Dong H-B (2006) Fuzzy knn text classifier based on Gini index. J Guangxi Norm Univ 24:87–90MATH Shang W-Q, Qu Y-L, Huang H-K, Zhu H-B, Lin Y-M, Dong H-B (2006) Fuzzy knn text classifier based on Gini index. J Guangxi Norm Univ 24:87–90MATH
9.
Zurück zum Zitat Dorigo M, Birattari M (2011) Ant colony optimization. In: Sammut C, Webb GI (eds) Encyclopedia machine learning. Springer, Berlin, pp 36–39 Dorigo M, Birattari M (2011) Ant colony optimization. In: Sammut C, Webb GI (eds) Encyclopedia machine learning. Springer, Berlin, pp 36–39
10.
Zurück zum Zitat Eberhart R, Kennedy J (1995) A new optimizer using particle swarm theory. In: MHS’95. Proceedings of the sixth international symposium on micro machine and human science. IEEE, pp 39–43 Eberhart R, Kennedy J (1995) A new optimizer using particle swarm theory. In: MHS’95. Proceedings of the sixth international symposium on micro machine and human science. IEEE, pp 39–43
13.
Zurück zum Zitat Duval B, Hao J-K, Hernandez Hernandez JC (2009) A memetic algorithm for gene selection and molecular classification of cancer. In: Proceedings of the 11th annual conference on genetic and evolutionary computation—GECCO’09, p 201. https://doi.org/10.1145/1569901.1569930 Duval B, Hao J-K, Hernandez Hernandez JC (2009) A memetic algorithm for gene selection and molecular classification of cancer. In: Proceedings of the 11th annual conference on genetic and evolutionary computation—GECCO’09, p 201. https://​doi.​org/​10.​1145/​1569901.​1569930
16.
Zurück zum Zitat Kennedy J, Eberhart RC (1997) A discrete binary version of the particle swarm algorithm, systems, man, cybernetics. In: IEEE international conference on computational cybernetics and simulation, vol 5, pp 4104–4108 Kennedy J, Eberhart RC (1997) A discrete binary version of the particle swarm algorithm, systems, man, cybernetics. In: IEEE international conference on computational cybernetics and simulation, vol 5, pp 4104–4108
19.
Zurück zum Zitat Sarkar R, Ghosh M, Chatterjee A, Malakar S (2018) An advanced particle swarm optimization based feature selection method for tri-script handwritten digit recognition. In: International conference on computational intelligence, communications, and business analytics, pp 978–981 Sarkar R, Ghosh M, Chatterjee A, Malakar S (2018) An advanced particle swarm optimization based feature selection method for tri-script handwritten digit recognition. In: International conference on computational intelligence, communications, and business analytics, pp 978–981
20.
21.
Zurück zum Zitat Leardi R (2000) Application of genetic algorithm—PLS for feature selection in spectral data sets. J Chemom 14(5–6):643–655CrossRef Leardi R (2000) Application of genetic algorithm—PLS for feature selection in spectral data sets. J Chemom 14(5–6):643–655CrossRef
22.
Zurück zum Zitat Ghosh M, Guha R, Mondal R, Singh PK, Sarkar R (2018) Feature selection using histogram based multi-objective GA for handwritten devanagari numeral recognition. Intell Eng Inform AISC 695:471–479CrossRef Ghosh M, Guha R, Mondal R, Singh PK, Sarkar R (2018) Feature selection using histogram based multi-objective GA for handwritten devanagari numeral recognition. Intell Eng Inform AISC 695:471–479CrossRef
24.
Zurück zum Zitat Prasad Y, Biswas KK, Jain CK (2010) SVM classifier based feature selection using GA, ACO and PSO for siRNA design. In: International conference in swarm intelligence, pp 307–314. Springer, Berlin Prasad Y, Biswas KK, Jain CK (2010) SVM classifier based feature selection using GA, ACO and PSO for siRNA design. In: International conference in swarm intelligence, pp 307–314. Springer, Berlin
28.
Zurück zum Zitat Basiri ME, Nemati S (2009) A novel hybrid ACO–GA algorithm for text feature selection. In: Proceedings of 11th IEEE conference on congress on evolutionary computation, pp 2561–2568 Basiri ME, Nemati S (2009) A novel hybrid ACO–GA algorithm for text feature selection. In: Proceedings of 11th IEEE conference on congress on evolutionary computation, pp 2561–2568
30.
Zurück zum Zitat Alba E, Garcia-Nieto J, Jourdan L, Talbi E-G (2007) Gene selection in cancer classification using PSO/SVM and GA/SVM hybrid algorithms. In: 2007 IEEE congress on evolutionary computation, pp 284–290 Alba E, Garcia-Nieto J, Jourdan L, Talbi E-G (2007) Gene selection in cancer classification using PSO/SVM and GA/SVM hybrid algorithms. In: 2007 IEEE congress on evolutionary computation, pp 284–290
31.
Zurück zum Zitat Cadenas JM, Garrido MC, MartíNez R (2013) Feature subset selection filter–wrapper based on low quality data. Expert Syst Appl 40:6241–6252CrossRef Cadenas JM, Garrido MC, MartíNez R (2013) Feature subset selection filter–wrapper based on low quality data. Expert Syst Appl 40:6241–6252CrossRef
32.
Zurück zum Zitat Tran CT, Zhang M, Andreae P, Xue B (2016) Improving performance for classification with incomplete data using wrapper-based feature selection. Evol Intell 9:81–94CrossRef Tran CT, Zhang M, Andreae P, Xue B (2016) Improving performance for classification with incomplete data using wrapper-based feature selection. Evol Intell 9:81–94CrossRef
33.
Zurück zum Zitat Harifi S, Khalilian M, Mohammadzadeh J, Ebrahimnejad S (2019) Emperor penguins colony: a new metaheuristic algorithm for optimization. Evol Intell 12(2):211–226CrossRef Harifi S, Khalilian M, Mohammadzadeh J, Ebrahimnejad S (2019) Emperor penguins colony: a new metaheuristic algorithm for optimization. Evol Intell 12(2):211–226CrossRef
35.
Zurück zum Zitat Singh H, Kumar Y, Kumar S (2019) A new meta-heuristic algorithm based on chemical reactions for partitional clustering problems. Evol Intell 12(2):241–252CrossRef Singh H, Kumar Y, Kumar S (2019) A new meta-heuristic algorithm based on chemical reactions for partitional clustering problems. Evol Intell 12(2):241–252CrossRef
36.
Zurück zum Zitat Cruz DPF, Maia RD, De Castro LN (2019) A critical discussion into the core of swarm intelligence algorithms. Evol Intell 12(2):189–200CrossRef Cruz DPF, Maia RD, De Castro LN (2019) A critical discussion into the core of swarm intelligence algorithms. Evol Intell 12(2):189–200CrossRef
37.
Zurück zum Zitat Elbes M, Alzubi S, Kanan T, Al-Fuqaha A, Hawashin B (2019) A survey on particle swarm optimization with emphasis on engineering and network applications. Evol Intell 12(2):113–129CrossRef Elbes M, Alzubi S, Kanan T, Al-Fuqaha A, Hawashin B (2019) A survey on particle swarm optimization with emphasis on engineering and network applications. Evol Intell 12(2):113–129CrossRef
38.
Zurück zum Zitat Wolpert DH, Macready WG (1997) No free lunch theorems for optimization. IEEE Trans Evol Comput 1:67–82CrossRef Wolpert DH, Macready WG (1997) No free lunch theorems for optimization. IEEE Trans Evol Comput 1:67–82CrossRef
41.
Zurück zum Zitat Singh PK, Sarkar R, Nasipuri M (2016) Significance of non-parametric statistical tests for comparison of classifiers over multiple datasets. Int J Comput Sci Math 7(5):410–422MathSciNetCrossRef Singh PK, Sarkar R, Nasipuri M (2016) Significance of non-parametric statistical tests for comparison of classifiers over multiple datasets. Int J Comput Sci Math 7(5):410–422MathSciNetCrossRef
Metadaten
Titel
A histogram based fuzzy ensemble technique for feature selection
verfasst von
Manosij Ghosh
Ritam Guha
Pawan Kumar Singh
Vikrant Bhateja
Ram Sarkar
Publikationsdatum
27.08.2019
Verlag
Springer Berlin Heidelberg
Erschienen in
Evolutionary Intelligence / Ausgabe 4/2019
Print ISSN: 1864-5909
Elektronische ISSN: 1864-5917
DOI
https://doi.org/10.1007/s12065-019-00279-6

Weitere Artikel der Ausgabe 4/2019

Evolutionary Intelligence 4/2019 Zur Ausgabe

Premium Partner