2012 | OriginalPaper | Buchkapitel
A Comparative Analysis of Balancing Techniques and Attribute Reduction Algorithms
verfasst von : R. Romero, E. L. Iglesias, L. Borrajo
Erschienen in: 6th International Conference on Practical Applications of Computational Biology & Bioinformatics
Verlag: Springer Berlin Heidelberg
Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.
Wählen Sie Textabschnitte aus um mit Künstlicher Intelligenz passenden Patente zu finden. powered by
Markieren Sie Textabschnitte, um KI-gestützt weitere passende Inhalte zu finden. powered by
In this study we analyze several data balancing techniques and attribute reduction algorithms and their impact over the information retrieval process. Specifically, we study its performance when used in biomedical text classification using Support Vector Machines (SVMs) based on Linear, Radial, Polynomial and Sigmoid kernels. From experiments on the TREC Genomics 2005 biomedical text public corpus we conclude that these techniques are necessary to improve the classification process. Kernels get some improvements about their results when attribute reduction algorithms were used.Moreover, if balancing techniques and attribute reduction algorithms are applied, results obtained with oversampling are better than subsampling.