2016 | OriginalPaper | Book chapter

A Weighted Feature Selection Method for Instance-Based Classification

Authors: Gennady Agre, Anton Dzhondzhorov

Published in: Artificial Intelligence: Methodology, Systems, and Applications

Publisher: Springer International Publishing

Abstract

The paper presents a new feature selection method suited to instance-based classification. The selection is based on ReliefF estimates of feature quality in the orthogonal feature space obtained after a PCA transformation, and on the interpretation of these weights as values proportional to the amount of explained concept changes. The user sets a threshold defining what percentage of the whole concept variability the selected features should explain, and only the first “stronger” features, whose combined weights exceed this threshold, are selected. During the classification phase, the selected features are used along with their weights. Experimental results on 12 benchmark databases have shown the advantages of the proposed method in comparison with traditional ReliefF.
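
The thresholded selection step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the ReliefF weights in the PCA space have already been computed and are passed in as a plain list, it clips negative weights to zero before normalising, and it applies the weights to squared differences in a Euclidean distance. All three choices, and the names `select_by_threshold` and `weighted_distance`, are assumptions made for illustration only.

```python
from math import sqrt


def select_by_threshold(weights, threshold):
    """Pick the strongest features until their combined share of the
    total weight reaches `threshold` (a fraction in (0, 1]).

    Assumption: negative ReliefF weights are clipped to zero, so each
    remaining weight can be read as a share of the explained concept
    variability. Returns (feature indices, normalised shares).
    """
    clipped = [(max(w, 0.0), i) for i, w in enumerate(weights)]
    total = sum(w for w, _ in clipped)
    if total == 0:
        return [], []

    # Strongest first; Python's sort is stable, so ties keep input order.
    ranked = sorted(clipped, key=lambda pair: pair[0], reverse=True)

    indices, shares, cumulative = [], [], 0.0
    for w, i in ranked:
        if w == 0:
            break  # the remaining features explain nothing
        share = w / total
        indices.append(i)
        shares.append(share)
        cumulative += share
        # Assumption: ">=" is used as the stopping rule.
        if cumulative >= threshold:
            break
    return indices, shares


def weighted_distance(a, b, indices, shares):
    """Weighted Euclidean distance over the selected features only.

    Assumption: each weight multiplies the squared difference.
    """
    return sqrt(sum(s * (a[i] - b[i]) ** 2 for i, s in zip(indices, shares)))


# Hypothetical weights for five PCA components (illustrative values only).
example_weights = [4.0, 1.0, 3.0, -2.0, 2.0]
selected, selected_shares = select_by_threshold(example_weights, 0.6)
```

With these illustrative weights, components 0 and 2 together account for 70% of the clipped total, so they are the only ones kept at a 60% threshold; a nearest-neighbour classifier would then compare instances with `weighted_distance` restricted to those two components.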

Metadata
Title
A Weighted Feature Selection Method for Instance-Based Classification
Authors
Gennady Agre
Anton Dzhondzhorov
Copyright year
2016
DOI
https://doi.org/10.1007/978-3-319-44748-3_2