
21.10.2023 | Original Article

An interpretable neural network for robustly determining the location and number of cluster centers

Authors: Xuetao Xie, Yi-Fei Pu, Huaqing Zhang, Jacek Mańdziuk, El-Sayed M. El-Alfy, Jian Wang

Published in: International Journal of Machine Learning and Cybernetics | Issue 4/2024


Abstract

K-means is a clustering method with an interpretable mechanism, but its results depend heavily on the location of the initial cluster centers. More importantly, for K-means and its improved variants it is extremely hard to determine the number of cluster centers adaptively. Ordinary neural networks, in contrast, have powerful information representation ability but lack interpretability, and, to the best of our knowledge, interpretable neural networks have not yet been used to determine the number of cluster centers for K-means. This paper proposes K-meaNet, which combines the interpretable mechanism of K-means with the representation ability of neural networks. In the neural network of K-meaNet, the inputs, the weights, and the mathematical expression of each layer all have clear meanings. During training, if a cluster center is critical, the weight associated with it, called a gate, increases and the center moves toward the ideal cluster center; otherwise, the location of the cluster center and the value of its gate do not change significantly. These properties endow K-meaNet with the ability to determine both the location and the number of cluster centers adaptively, in contrast to K-means and its improved variants. Moreover, this adaptive ability is robust to the location of the initial cluster centers, the number of initial cluster centers, and the number of features. Numerical experiments on six synthetic datasets and three real datasets verify that K-meaNet adaptively determines the number of cluster centers and is robust to the location of the initial cluster centers, the number of initial cluster centers, and the number of features.
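To make the gating idea concrete, the following minimal Python sketch shows one way a gated, K-means-like update could look: every candidate center carries a trainable gate, centers and gates are updated jointly, and centers whose gates stay small are treated as redundant. This is only an illustration of the general mechanism described in the abstract, not the authors' K-meaNet architecture or loss; the function name gated_kmeans_sketch and all parameters (learning rate, temperature, gate_threshold) are assumptions introduced here for the example.

```python
# Minimal illustration only (not the authors' K-meaNet): candidate centers
# carry per-center gates, both are updated jointly, and centers whose gates
# stay small are pruned at the end.
import numpy as np


def gated_kmeans_sketch(X, k_init=10, lr=0.05, epochs=200,
                        temperature=1.0, gate_threshold=0.1, seed=0):
    """Jointly adapt candidate cluster centers and per-center gates on X."""
    rng = np.random.default_rng(seed)
    n, _ = X.shape
    centers = X[rng.choice(n, k_init, replace=False)].astype(float)
    gates = np.full(k_init, 0.5)  # one gate per candidate center

    for _ in range(epochs):
        # Squared distance from every sample to every candidate center.
        dist2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
        # Gated soft assignment: a larger gate makes its center more attractive.
        logits = np.log(np.clip(gates, 1e-8, None)) - dist2 / temperature
        resp = np.exp(logits - logits.max(axis=1, keepdims=True))
        resp /= resp.sum(axis=1, keepdims=True)

        # Move each center toward the samples it is responsible for.
        mass = resp.sum(axis=0) + 1e-8
        targets = (resp.T @ X) / mass[:, None]
        centers += lr * (targets - centers)

        # Gates drift toward each center's share of the data: heavily used
        # centers grow, redundant ones shrink (values kept in [0, 1]).
        gates = np.clip(gates + lr * (mass / n - gates), 0.0, 1.0)

    keep = gates > gate_threshold  # prune centers with small gates
    return centers[keep], gates[keep]


if __name__ == "__main__":
    # Toy data: three Gaussian blobs, started from ten candidate centers.
    rng = np.random.default_rng(1)
    means = [(0.0, 0.0), (4.0, 0.0), (2.0, 3.0)]
    X = np.vstack([rng.normal(m, 0.3, size=(100, 2)) for m in means])
    centers, gates = gated_kmeans_sketch(X)
    print(f"{len(centers)} centers kept; gates: {np.round(gates, 2)}")
```

On well-separated toy data such as the three blobs above, the responsibility mass tends to concentrate on a few centers whose gates grow, while the remaining gates shrink toward the pruning threshold; the surviving centers then approximate both the number and the location of the clusters, which is the behaviour the abstract attributes to the trained gates in K-meaNet.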

Metadata
Title
An interpretable neural network for robustly determining the location and number of cluster centers
Authors
Xuetao Xie
Yi-Fei Pu
Huaqing Zhang
Jacek Mańdziuk
El-Sayed M. El-Alfy
Jian Wang
Publication date
21.10.2023
Publisher
Springer Berlin Heidelberg
Published in
International Journal of Machine Learning and Cybernetics / Issue 4/2024
Print ISSN: 1868-8071
Electronic ISSN: 1868-808X
DOI
https://doi.org/10.1007/s13042-023-01978-4
