2018 | OriginalPaper | Book chapter

Feature Group Selection Using MKL Penalized with \(\ell _1\)-norm and SVM as Base Learner

Written by: Henry Jhoán Areiza-Laverde, Gloria M. Díaz, Andrés Eduardo Castro-Ospina

Published in: Applied Computer Sciences in Engineering

Publisher: Springer International Publishing

Abstract

Objective feature selection is an important component of the machine learning framework, addressing problems such as increased computational burden and unnecessarily high-dimensional representations. Most feature selection techniques evaluate features only individually and ignore the structural relationships between features of the same nature, which breaks those relationships and harms algorithm performance. In this paper, a feature group selection technique is proposed with the aim of objectively identifying the relevance that a feature group has in a classification task. The proposed method uses Multiple Kernel Learning (MKL) with a penalization rule based on the \(\ell _1\)-norm and a Support Vector Machine (SVM) as base learner. Performance is evaluated using two binarized configurations of the freely available MFEAT dataset, which provides six different feature groups and thus allows multiple feature group analyses. The experimental results show that the implemented methodology is stable in identifying the relevance of each feature group across all experiments, which allows it to outperform the classification accuracy of state-of-the-art methods.
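
The abstract does not give the optimization details, so the following is only a rough sketch of the general idea of weighting one kernel per feature group under an \(\ell _1\)-norm constraint. It is not the authors' method: the weights here come from a simple kernel-target alignment heuristic rather than from an MKL solver, no SVM is trained, and the data are synthetic. All function names (`linear_kernel`, `alignment`, `group_weights`, `combine`, `make_toy_groups`) and the `cutoff` parameter are hypothetical and chosen for illustration.

```python
import math
import random


def linear_kernel(rows):
    """Gram matrix of a linear kernel over one feature group."""
    return [[sum(a * b for a, b in zip(u, v)) for v in rows] for u in rows]


def frobenius(a, b):
    """Frobenius inner product of two equally sized square matrices."""
    return sum(x * y for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def alignment(kernel, labels):
    """Kernel-target alignment with the ideal kernel y y^T (labels in {-1, +1})."""
    ideal = [[yi * yj for yj in labels] for yi in labels]
    denom = math.sqrt(frobenius(kernel, kernel) * frobenius(ideal, ideal))
    return frobenius(kernel, ideal) / denom if denom > 0 else 0.0


def group_weights(kernels, labels, cutoff=0.0):
    """Assumed heuristic: non-negative group weights with unit l1-norm.

    Groups whose alignment does not exceed `cutoff` get a weight of exactly 0.
    """
    scores = [max(alignment(k, labels) - cutoff, 0.0) for k in kernels]
    total = sum(scores)
    if total == 0.0:
        # No group stands out: fall back to uniform weights.
        return [1.0 / len(kernels)] * len(kernels)
    return [s / total for s in scores]


def combine(kernels, weights):
    """Weighted sum of the per-group Gram matrices."""
    n = len(kernels[0])
    return [[sum(w * k[i][j] for w, k in zip(weights, kernels)) for j in range(n)]
            for i in range(n)]


def make_toy_groups(n=40, seed=0):
    """Synthetic data: group 0 carries the class signal, group 1 is pure noise."""
    rng = random.Random(seed)
    labels = [1 if i % 2 == 0 else -1 for i in range(n)]
    informative = [[2.0 * y + rng.gauss(0, 1), 2.0 * y + rng.gauss(0, 1)]
                   for y in labels]
    noise = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in labels]
    return [informative, noise], labels


groups, y = make_toy_groups()
kernels = [linear_kernel(g) for g in groups]
eta = group_weights(kernels, y)      # one relevance weight per feature group
combined = combine(kernels, eta)     # single Gram matrix for a kernel classifier
print([round(w, 3) for w in eta])
```

In a fuller pipeline, the combined Gram matrix could be handed to an SVM that accepts precomputed kernels (for example scikit-learn's `SVC(kernel="precomputed")`), and the weights read as a relevance ranking of the feature groups. A real \(\ell _1\)-penalized MKL solver would learn the weights jointly with the classifier instead of fixing them beforehand.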

Metadata
Title
Feature Group Selection Using MKL Penalized with \(\ell _1\)-norm and SVM as Base Learner
Written by
Henry Jhoán Areiza-Laverde
Gloria M. Díaz
Andrés Eduardo Castro-Ospina
Copyright year
2018
DOI
https://doi.org/10.1007/978-3-030-00350-0_12