skip to main content
10.1145/3520304.3528800acmconferencesArticle/Chapter ViewAbstractPublication PagesgeccoConference Proceedingsconference-collections
poster

Boosting the convergence of a GA-based wrapper for feature selection problems on high-dimensional data

Published:19 July 2022Publication History

ABSTRACT

High-dimensional data often need techniques, such as Feature Selection (FS), in order to solve the curse of dimensionality problem. One of the most popular approaches to FS is wrapper methods, which are based on a search algorithm and a clustering technique. NSGA-II and k-NN are applied in this paper. Since NSGA-II is intended to converge to a small subset of features, the generation of the initial population is crucial to speed up the search process. The lower number of features selected by the individuals belonging to the initial population, the lower number of generations needed to achieve convergence. This work presents a novel technique to reduce the average size (number of features selected) of the individuals forming the initial population. This technique is based on a hyper-parameter, p, which controls the probability of any feature being selected by any individual of the initial population. An analysis of both convergence time and classification accuracy is performed for different values of p, concluding that p can be set to quite low values, accelerating markedly the convergence of the algorithm without affecting the quality of the solutions.

References

  1. J. Asensio-Cubero, J. Q. Gan, and R. Palaniappan. 2013. Multiresolution Analysis over Simple Graphs for Brain Computer Interfaces. Journal of Neural Engineering 10, 4 (2013), 21--26. Google ScholarGoogle ScholarCross RefCross Ref
  2. R. E. Bellman. 1961. Adaptive Control Processes: A Guided Tour. Princeton University Press.Google ScholarGoogle Scholar
  3. J. J. Escobar, J. Ortega, J. González, M. Damas, and A. F. Díaz. 2017. Parallel high-dimensional multi-objective feature selection for EEG classification with dynamic workload balancing on CPU-GPU. Cluster Computing 20, 3 (2017), 1881--1897. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. K. Fukunaga. 2013. Introduction to Statistical Pattern Recognition. Elsevier.Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. González, J. Ortega, M. Damas, P. Martín-Smith, and J. Q. Gan. 2019. A New Multi-objective Wrapper Method for Feature Selection - Accuracy and Stability Analysis for BCI. Neurocomputing (2019), 407--418. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. J. González, J. Ortega, J. J. Escobar, and M. Damas. 2021. A Lexicographic Cooperative Co-evolutionary Approach for Feature Selection. Neurocomputing (2021), 59--76. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. González, I. Rojas, J. Ortega, H. Pomares, F. J. Fernandez, and A. F. Diaz. 2003. Multiobjective Evolutionary Optimization of the Size, Shape, and Position Parameters of Radial Basis Function Networks for Function Approximation. IEEE Transactions on Neural Networks (2003), 1478--1495. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. M. Leon, C. Parkkila, J. Tidare, N. Xiong, and E. Astrand. 2020. Impact of NSGA-II objectives on EEG feature selection related to motor imagery. In Proceedings of the 2020 Genetic and Evolutionary Computation Conference (GECCO'20). ACM, New York, NY, USA, 1134--1142. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. L. D. Vignolo, D. H. Milone, and J. Scharcanski. 2013. Feature Selection for Face Recognition Based on Multi-objective Evolutionary Wrappers. Expert Systems with Applications (2013), 5077--5084. Google ScholarGoogle ScholarCross RefCross Ref
  10. Z. Yin, H. Lan, G. Tan, M. Lu, A. V. Vasilakos, and W. Liu. 2017. Computing Platforms for Big Biological Data Analytics: Perspectives and Challenges. Computational and Structural Biotechnology Journal 15 (2017), 403--411. Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Boosting the convergence of a GA-based wrapper for feature selection problems on high-dimensional data

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Conferences
            GECCO '22: Proceedings of the Genetic and Evolutionary Computation Conference Companion
            July 2022
            2395 pages
            ISBN:9781450392686
            DOI:10.1145/3520304

            Copyright © 2022 Owner/Author

            Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 19 July 2022

            Check for updates

            Qualifiers

            • poster

            Acceptance Rates

            Overall Acceptance Rate1,669of4,410submissions,38%

            Upcoming Conference

            GECCO '24
            Genetic and Evolutionary Computation Conference
            July 14 - 18, 2024
            Melbourne , VIC , Australia

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader