Skip to main content
Top

2019 | OriginalPaper | Chapter

Neural Network-Based Deep Encoding for Mixed-Attribute Data Classification

Authors : Tinglin Huang, Yulin He, Dexin Dai, Wenting Wang, Joshua Zhexue Huang

Published in: Trends and Applications in Knowledge Discovery and Data Mining

Publisher: Springer International Publishing

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

This paper proposes a neural network-based deep encoding (DE) method for the mixed-attribute data classification. DE method first uses the existing one-hot encoding (OE) method to encode the discrete-attribute data. Second, DE method trains an improved neural network to classify the OE-attribute data corresponding to the discrete-attribute data. The loss function of improved neural network not only includes the training error but also considers the uncertainty of hidden-layer output matrix (i.e., DE-attribute data), where the uncertainty is calculated with the re-substitution entropy. Third, the classification task is conducted based on the combination of previous continuous-attribute data and transformed DE-attribute data. Finally, we compare DE method with OE method by training support vector machine (SVM) and deep neural network (DNN) on 4 KEEL mixed-attribute data sets. The experimental results demonstrate the feasibility and effectiveness of DE method and show that DE method can help SVM and DNN obtain the better classification accuracies than the traditional OE method.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
1.
go back to reference Babich, G.A., Camps, O.I.: Weighted Parzen windows for pattern classification. IEEE Trans. Pattern Anal. Mach. Intell. 18(5), 567–570 (1996)CrossRef Babich, G.A., Camps, O.I.: Weighted Parzen windows for pattern classification. IEEE Trans. Pattern Anal. Mach. Intell. 18(5), 567–570 (1996)CrossRef
2.
go back to reference Boullé, M.: MODL: a Bayes optimal discretization method for continuous attributes. Mach. Learn. 65(1), 131–165 (2006)CrossRef Boullé, M.: MODL: a Bayes optimal discretization method for continuous attributes. Mach. Learn. 65(1), 131–165 (2006)CrossRef
3.
go back to reference Chen, L., Li, X., Yang, Y., et al.: Personal health indexing based on medical examinations: a data mining approach. Decis. Support Syst. 81, 54–65 (2016)CrossRef Chen, L., Li, X., Yang, Y., et al.: Personal health indexing based on medical examinations: a data mining approach. Decis. Support Syst. 81, 54–65 (2016)CrossRef
4.
go back to reference Ching, J.Y., Wong, A.K.C., Chan, K.C.C.: Class-dependent discretization for inductive learning from continuous and mixed-mode data. IEEE Trans. Pattern Anal. Mach. Intell. 17(7), 641–651 (1995)CrossRef Ching, J.Y., Wong, A.K.C., Chan, K.C.C.: Class-dependent discretization for inductive learning from continuous and mixed-mode data. IEEE Trans. Pattern Anal. Mach. Intell. 17(7), 641–651 (1995)CrossRef
5.
go back to reference He, Y.L., Liu, J.N., Wang, X.Z., et al.: Optimal bandwidth selection for re-substitution entropy estimation. Appl. Math. Comput. 219(8), 3425–3460 (2012)MathSciNetMATH He, Y.L., Liu, J.N., Wang, X.Z., et al.: Optimal bandwidth selection for re-substitution entropy estimation. Appl. Math. Comput. 219(8), 3425–3460 (2012)MathSciNetMATH
6.
go back to reference Hu, H.W., Chen, Y.L., Tang, K.: A dynamic discretization approach for constructing decision trees with a continuous label. IEEE Trans. Knowl. Data Eng. 21(11), 1505–1514 (2009)CrossRef Hu, H.W., Chen, Y.L., Tang, K.: A dynamic discretization approach for constructing decision trees with a continuous label. IEEE Trans. Knowl. Data Eng. 21(11), 1505–1514 (2009)CrossRef
7.
go back to reference Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: theory and applications. Neurocomputing 70(1–3), 489–501 (2006)CrossRef Huang, G.B., Zhu, Q.Y., Siew, C.K.: Extreme learning machine: theory and applications. Neurocomputing 70(1–3), 489–501 (2006)CrossRef
8.
go back to reference Khan, I., Huang, J.Z., Luo, Z., et al.: CPLP: an algorithm for tracking the changes of power consumption patterns in load profile data over time. Inf. Sci. 429, 332–348 (2018)CrossRef Khan, I., Huang, J.Z., Luo, Z., et al.: CPLP: an algorithm for tracking the changes of power consumption patterns in load profile data over time. Inf. Sci. 429, 332–348 (2018)CrossRef
9.
go back to reference Nair, V., Hinton, G.E.: Rectified linear units improve restricted Boltzmann machines. In: Proceedings of the 27th International Conference on Machine Learning, pp. 807–814 (2010) Nair, V., Hinton, G.E.: Rectified linear units improve restricted Boltzmann machines. In: Proceedings of the 27th International Conference on Machine Learning, pp. 807–814 (2010)
10.
11.
go back to reference Passerini, A., Pontil, M., Frasconi, P.: New results on error correcting output codes of kernel machines. IEEE Trans. Neural Netw. 15(1), 45–54 (2004)CrossRef Passerini, A., Pontil, M., Frasconi, P.: New results on error correcting output codes of kernel machines. IEEE Trans. Neural Netw. 15(1), 45–54 (2004)CrossRef
12.
go back to reference Rowley, H.A., Baluja, S., Kanade, T.: Neural network-based face detection. IEEE Trans. Pattern Anal. Mach. Intell. 20(1), 23–38 (1998)CrossRef Rowley, H.A., Baluja, S., Kanade, T.: Neural network-based face detection. IEEE Trans. Pattern Anal. Mach. Intell. 20(1), 23–38 (1998)CrossRef
13.
go back to reference Svozil, D., Kvasnicka, V., Pospichal, J.: Introduction to multi-layer feed-forward neural networks. Chemometr. Intell. Lab. Syst. 39(1), 43–62 (1997)CrossRef Svozil, D., Kvasnicka, V., Pospichal, J.: Introduction to multi-layer feed-forward neural networks. Chemometr. Intell. Lab. Syst. 39(1), 43–62 (1997)CrossRef
14.
go back to reference Triguero, T., González, S., Moyano, J.M., et al.: KEEL 3.0: an open source software for multi-stage analysis in data mining. Int. J. Comput. Intell. Syst. 10, 1238–1249 (2017)CrossRef Triguero, T., González, S., Moyano, J.M., et al.: KEEL 3.0: an open source software for multi-stage analysis in data mining. Int. J. Comput. Intell. Syst. 10, 1238–1249 (2017)CrossRef
15.
go back to reference Wand, M.P., Jones, M.C.: Kernel Smoothing. Chapman and Hall/CRC, Boca Raton (1994)CrossRef Wand, M.P., Jones, M.C.: Kernel Smoothing. Chapman and Hall/CRC, Boca Raton (1994)CrossRef
16.
go back to reference Wang, R., He, Y.L., Chow, C.Y., et al.: Learning ELM-tree from big data based on uncertainty reduction. Fuzzy Sets Syst. 258, 79–100 (2015)MathSciNetCrossRef Wang, R., He, Y.L., Chow, C.Y., et al.: Learning ELM-tree from big data based on uncertainty reduction. Fuzzy Sets Syst. 258, 79–100 (2015)MathSciNetCrossRef
17.
go back to reference Wang, X.Z., He, Y.L., Wang, D.D.: Non-naive Bayesian classifiers for classification problems with continuous attributes. IEEE Trans. Cybern. 44(1), 21–39 (2014)CrossRef Wang, X.Z., He, Y.L., Wang, D.D.: Non-naive Bayesian classifiers for classification problems with continuous attributes. IEEE Trans. Cybern. 44(1), 21–39 (2014)CrossRef
18.
go back to reference Wang, X.Z., He, Y.L.: Learning from uncertainty for big data: future analytical challenges and strategies. IEEE Syst. Man Cybern. Mag. 2(2), 26–31 (2016)CrossRef Wang, X.Z., He, Y.L.: Learning from uncertainty for big data: future analytical challenges and strategies. IEEE Syst. Man Cybern. Mag. 2(2), 26–31 (2016)CrossRef
19.
go back to reference Wang, C.M., Huang, Y.F.: Evolutionary-based feature selection approaches with new criteria for data mining: a case study of credit approval data. Expert Syst. Appl. 36(3), 5900–5908 (2009)CrossRef Wang, C.M., Huang, Y.F.: Evolutionary-based feature selection approaches with new criteria for data mining: a case study of credit approval data. Expert Syst. Appl. 36(3), 5900–5908 (2009)CrossRef
20.
go back to reference Wang, X.Z., Wang, R., Xu, C.: Discovering the relationship between generalization and uncertainty by incorporating complexity of classification. IEEE Trans. Cybern. 48(2), 703–715 (2018)MathSciNetCrossRef Wang, X.Z., Wang, R., Xu, C.: Discovering the relationship between generalization and uncertainty by incorporating complexity of classification. IEEE Trans. Cybern. 48(2), 703–715 (2018)MathSciNetCrossRef
21.
go back to reference Zhou, Z.H., Chen, Z.Q.: Hybrid decision tree. Knowl.-Based Syst. 15(8), 515–528 (2002)CrossRef Zhou, Z.H., Chen, Z.Q.: Hybrid decision tree. Knowl.-Based Syst. 15(8), 515–528 (2002)CrossRef
Metadata
Title
Neural Network-Based Deep Encoding for Mixed-Attribute Data Classification
Authors
Tinglin Huang
Yulin He
Dexin Dai
Wenting Wang
Joshua Zhexue Huang
Copyright Year
2019
DOI
https://doi.org/10.1007/978-3-030-26142-9_14

Premium Partner