Published in: Soft Computing 3/2020

23.04.2019 | Methodologies and Application

Supervised Kohonen network with heterogeneous value difference metric for both numeric and categorical inputs

Written by: Yuxian Zhang, Mohammed Altayeb Awad Gendeel, Huideng Peng, Xiaoyi Qian, Hongqing Xu


Abstract

Real-world data are often multi-attribute, mixing numeric and categorical attributes. Existing classification algorithms for such mixed data, however, have limitations in how they handle the categorical attributes. This paper proposes a supervised Kohonen network with a heterogeneous value difference metric (HVDM) for both numeric and categorical inputs. The method builds on the framework of supervised Kohonen networks, adopts HVDM to measure dissimilarity over numeric and categorical attributes, uses the frequency of each categorical item in the Voronoi set to update the categorical components of the reference vectors on the competitive layer, and applies different competitive learning rules to numeric and categorical data. The effectiveness of the proposed algorithm is verified on data sets from the UCI Machine Learning Repository. Its classification accuracy is compared with that of BP, k-NN, naive Bayes network, C4.5 and SVM, and the dissimilarity metric is analyzed. The algorithm is also applied to operating-mode classification for wind turbines, and its effectiveness is illustrated in condition monitoring of the wind turbine pitch system.
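The abstract names the heterogeneous value difference metric but does not spell it out. The sketch below illustrates the commonly cited form of that metric, in which numeric attributes contribute |x - y| / (4σ) and categorical attributes contribute a value difference computed from class-conditional probabilities. Whether the paper uses exactly this normalization is an assumption, as is the categorical update shown at the end (taking the most frequent item in a unit's Voronoi set). The names `fit_hvdm`, `hvdm` and `most_frequent_category` are hypothetical helpers and not taken from the paper.

```python
import math
from collections import Counter, defaultdict


def fit_hvdm(rows, labels, categorical):
    """Collect the statistics the metric needs from a labelled training set.

    `categorical` is the set of column indices holding categorical values;
    all other columns are treated as numeric.
    """
    classes = sorted(set(labels))
    sigma = {}      # numeric column index -> standard deviation
    cond_prob = {}  # categorical column index -> {value: {class: P(class | value)}}
    for a in range(len(rows[0])):
        column = [row[a] for row in rows]
        if a in categorical:
            counts = defaultdict(Counter)
            for value, label in zip(column, labels):
                counts[value][label] += 1
            cond_prob[a] = {
                value: {c: cnt[c] / sum(cnt.values()) for c in classes}
                for value, cnt in counts.items()
            }
        else:
            mean = sum(column) / len(column)
            var = sum((v - mean) ** 2 for v in column) / len(column)
            sigma[a] = math.sqrt(var) or 1.0  # guard against constant columns
    return sigma, cond_prob, classes


def hvdm(x, y, sigma, cond_prob, classes):
    """Distance between two mixed-type vectors (assumed standard HVDM form)."""
    total = 0.0
    for a, (xa, ya) in enumerate(zip(x, y)):
        if a in cond_prob:
            px = cond_prob[a].get(xa)
            py = cond_prob[a].get(ya)
            if px is None or py is None:
                # Value never seen in training: fall back to simple overlap.
                d2 = 0.0 if xa == ya else 1.0
            else:
                d2 = sum((px[c] - py[c]) ** 2 for c in classes)
        else:
            d2 = (abs(xa - ya) / (4.0 * sigma[a])) ** 2
        total += d2
    return math.sqrt(total)


def most_frequent_category(voronoi_values):
    """Assumed categorical update: pick the most frequent item in the Voronoi set."""
    return Counter(voronoi_values).most_common(1)[0][0]


# Toy data: column 0 is numeric, column 1 is categorical.
rows = [[1.0, "a"], [3.0, "a"], [1.0, "b"], [3.0, "b"]]
labels = [0, 0, 1, 1]
sigma, cond_prob, classes = fit_hvdm(rows, labels, categorical={1})
distance = hvdm([1.0, "a"], [3.0, "b"], sigma, cond_prob, classes)
```

In a Kohonen-style network, a distance of this kind would select the winning unit, after which numeric components move toward the input while categorical components are replaced according to item frequencies. The exact learning rules are given in the paper and are not reproduced here.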


Metadata
Title
Supervised Kohonen network with heterogeneous value difference metric for both numeric and categorical inputs
Written by
Yuxian Zhang
Mohammed Altayeb Awad Gendeel
Huideng Peng
Xiaoyi Qian
Hongqing Xu
Publication date
23.04.2019
Publisher
Springer Berlin Heidelberg
Published in
Soft Computing / Issue 3/2020
Print ISSN: 1432-7643
Electronic ISSN: 1433-7479
DOI
https://doi.org/10.1007/s00500-019-04001-7
