Skip to main content
Top

2018 | OriginalPaper | Chapter

Genetic Programming Based on Granular Computing for Classification with High-Dimensional Data

Authors : Wenbin Pei, Bing Xue, Lin Shang, Mengjie Zhang

Published in: AI 2018: Advances in Artificial Intelligence

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

Classification tasks become more challenging when having the curse of dimensionality issue. Recently, there has been an increasing number of datasets with thousands of features. Some classification algorithms often need feature selection to avoid the curse of dimensionality. Genetic programming (GP) has shown success in classification tasks. GP does not require to do feature selection because of its built-in capability to automatically select informative features. However, GP-based methods are often computationally intensive to achieve a good classification accuracy. Based on perspectives from granular computing (GrC), this paper proposes a new approach to linking features hierarchically for GP-based classification. Experiments on seven high-dimensional datasets show the effectiveness of the proposed algorithm in terms of saving training time and enhancing the classification accuracy, compared to baseline methods.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Bargiela, A., Pedrycz, W.: Granular computing. In: Handbook in Computational Intelligence. Fuzzy Logic, Systems, Artificial Neural Networks, and Learning Systems, vol. 1, pp. 43–66. World Scientific (2016) Bargiela, A., Pedrycz, W.: Granular computing. In: Handbook in Computational Intelligence. Fuzzy Logic, Systems, Artificial Neural Networks, and Learning Systems, vol. 1, pp. 43–66. World Scientific (2016)
2.
4.
go back to reference Deng, Z., Zhu, X., Cheng, D., Zong, M., Zhang, S.: Efficient knn classification algorithm for big data. Neurocomputing 195, 143–148 (2016)CrossRef Deng, Z., Zhu, X., Cheng, D., Zong, M., Zhang, S.: Efficient knn classification algorithm for big data. Neurocomputing 195, 143–148 (2016)CrossRef
5.
go back to reference Espejo, P.G., Ventura, S., Herrera, F.: A survey on the application of genetic programming to classification. IEEE Trans. Syst. Man Cybern. Part C (Appl. Rev.) 40(2), 121–144 (2010)CrossRef Espejo, P.G., Ventura, S., Herrera, F.: A survey on the application of genetic programming to classification. IEEE Trans. Syst. Man Cybern. Part C (Appl. Rev.) 40(2), 121–144 (2010)CrossRef
6.
go back to reference Fleury, A., Vacher, M., Noury, N.: SVM-based multimodal classification of activities of daily living in health smart homes: sensors, algorithms, and first experimental results. IEEE Trans. Inf. Technol. Biomed. 14(2), 274–283 (2010)CrossRef Fleury, A., Vacher, M., Noury, N.: SVM-based multimodal classification of activities of daily living in health smart homes: sensors, algorithms, and first experimental results. IEEE Trans. Inf. Technol. Biomed. 14(2), 274–283 (2010)CrossRef
7.
go back to reference Joshi, A., Dangra, J., Rawat, M.: A decision tree based classification technique for accurate heart disease classification and prediction. Int. J. Technol. Res. Manag. 3, 1–4 (2016) Joshi, A., Dangra, J., Rawat, M.: A decision tree based classification technique for accurate heart disease classification and prediction. Int. J. Technol. Res. Manag. 3, 1–4 (2016)
8.
go back to reference Luna, J.M., Pechenizkiy, M., del Jesus, M.J., Ventura, S.: Mining context-aware association rules using grammar-based genetic programming. IEEE Trans. Cybern. (2017) Luna, J.M., Pechenizkiy, M., del Jesus, M.J., Ventura, S.: Mining context-aware association rules using grammar-based genetic programming. IEEE Trans. Cybern. (2017)
9.
go back to reference Murphy, K.P.: Naive Bayes Classifiers. University of British Columbia (2006) Murphy, K.P.: Naive Bayes Classifiers. University of British Columbia (2006)
11.
go back to reference Poli, R., Langdon, W., McPhee, N.: A Field Guide to Genetic Programming (2008) Poli, R., Langdon, W., McPhee, N.: A Field Guide to Genetic Programming (2008)
12.
go back to reference Sáez, J.A., Galar, M., Luengo, J., Herrera, F.: Tackling the problem of classification with noisy data using multiple classifier systems: analysis of the performance and robustness. Inf. Sci. 247, 1–20 (2013)CrossRef Sáez, J.A., Galar, M., Luengo, J., Herrera, F.: Tackling the problem of classification with noisy data using multiple classifier systems: analysis of the performance and robustness. Inf. Sci. 247, 1–20 (2013)CrossRef
13.
go back to reference Sokolova, M., Lapalme, G.: A systematic analysis of performance measures for classification tasks. Inf. Process. Manag. 45(4), 427–437 (2009)CrossRef Sokolova, M., Lapalme, G.: A systematic analysis of performance measures for classification tasks. Inf. Process. Manag. 45(4), 427–437 (2009)CrossRef
14.
go back to reference Svetnik, V., Liaw, A., Tong, C., Culberson, J.C., Sheridan, R.P., Feuston, B.P.: Random forest: a classification and regression tool for compound classification and QSAR modeling. J. Chem. Inf. Comput. Sci. 43(6), 1947–1958 (2003)CrossRef Svetnik, V., Liaw, A., Tong, C., Culberson, J.C., Sheridan, R.P., Feuston, B.P.: Random forest: a classification and regression tool for compound classification and QSAR modeling. J. Chem. Inf. Comput. Sci. 43(6), 1947–1958 (2003)CrossRef
15.
go back to reference Thearling, K.: An Introduction to Data Mining (2017) Thearling, K.: An Introduction to Data Mining (2017)
16.
go back to reference Tran, B., Xue, B., Zhang, M.: Genetic programming for feature construction and selection in classification on high-dimensional data. Memetic Comput. 8(1), 3–15 (2016)CrossRef Tran, B., Xue, B., Zhang, M.: Genetic programming for feature construction and selection in classification on high-dimensional data. Memetic Comput. 8(1), 3–15 (2016)CrossRef
18.
go back to reference Wang, G., Yang, J., Xu, J.: Granular computing: from granularity optimization to multi-granularity joint problem solving. Granul. Comput. 2(3), 105–120 (2017)CrossRef Wang, G., Yang, J., Xu, J.: Granular computing: from granularity optimization to multi-granularity joint problem solving. Granul. Comput. 2(3), 105–120 (2017)CrossRef
19.
go back to reference Yang, H.J., Roe, B.P., Zhu, J.: Studies of stability and robustness for artificial neural networks and boosted decision trees. Nucl. Instrum. Methods Phys. Res. Sect. A: Accel. Spectrometers Detect. Assoc. Equip. 574(2), 342–349 (2007)CrossRef Yang, H.J., Roe, B.P., Zhu, J.: Studies of stability and robustness for artificial neural networks and boosted decision trees. Nucl. Instrum. Methods Phys. Res. Sect. A: Accel. Spectrometers Detect. Assoc. Equip. 574(2), 342–349 (2007)CrossRef
20.
go back to reference Yao, J.: Novel Developments in Granular Computing: Applications for Advanced Human Reasoning and Soft Computation. IGI Global (2010) Yao, J.: Novel Developments in Granular Computing: Applications for Advanced Human Reasoning and Soft Computation. IGI Global (2010)
21.
go back to reference Yao, Y.: A triarchic theory of granular computing. Granul. Comput. 1(2), 145–157 (2016)CrossRef Yao, Y.: A triarchic theory of granular computing. Granul. Comput. 1(2), 145–157 (2016)CrossRef
22.
go back to reference Zhang, G.P.: Neural networks for classification: a survey. IEEE Trans. Syst. Man Cybern. Part C (Appl. Rev.) 30(4), 451–462 (2000)CrossRef Zhang, G.P.: Neural networks for classification: a survey. IEEE Trans. Syst. Man Cybern. Part C (Appl. Rev.) 30(4), 451–462 (2000)CrossRef
23.
go back to reference Zhu, Z., Ong, Y.S., Dash, M.: Markov blanket-embedded genetic algorithm for gene selection. Pattern Recognit. 40(11), 3236–3248 (2007)CrossRef Zhu, Z., Ong, Y.S., Dash, M.: Markov blanket-embedded genetic algorithm for gene selection. Pattern Recognit. 40(11), 3236–3248 (2007)CrossRef
Metadata
Title
Genetic Programming Based on Granular Computing for Classification with High-Dimensional Data
Authors
Wenbin Pei
Bing Xue
Lin Shang
Mengjie Zhang
Copyright Year
2018
DOI
https://doi.org/10.1007/978-3-030-03991-2_58

Premium Partner