Skip to main content

2018 | OriginalPaper | Buchkapitel

Mass-Based Density Peaks Clustering Algorithm

verfasst von : Ding Ling, Xu Xiao

Erschienen in: Intelligent Information Processing IX

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Density peaks clustering algorithm (DPC) relies on local-density and relative-distance of dataset to find cluster centers. However, the calculation of these attributes is based on Euclidean distance simply, and DPC is not satisfactory when dataset’s density is uneven or dimension is higher. In addition, parameter \( d_{\text{c}} \) only considers the global distribution of the dataset, a little change of \( d_{\text{c}} \) has a great influence on small-scale dataset clustering. Aiming at these drawbacks, this paper proposes a mass-based density peaks clustering algorithm (MDPC). MDPC introduces a mass-based similarity measure method to calculate the new similarity matrix. After that, K-nearest neighbour information of the data is obtained according to the new similarity matrix, and then MDPC redefines the local density based on the K-nearest neighbour information. Experimental results show that MDPC is superior to DPC, and satisfied on datasets with uneven density and higher dimensions, which also avoids the influence of \( d_{\text{c}} \) on the small-scale datasets.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Morris, K., Mcnicholas, P.: Clustering, classification, discriminant analysis, and dimension reduction via generalized hyperbolic mixtures. Comput. Stat. Data Anal. 97, 133–150 (2016)MathSciNetCrossRef Morris, K., Mcnicholas, P.: Clustering, classification, discriminant analysis, and dimension reduction via generalized hyperbolic mixtures. Comput. Stat. Data Anal. 97, 133–150 (2016)MathSciNetCrossRef
2.
Zurück zum Zitat Ivannikova, E., Park, H., Hämäläinen, T., et al.: Revealing community structures by ensemble clustering using group diffusion. Inf. Fusion 42, 24–36 (2018)CrossRef Ivannikova, E., Park, H., Hämäläinen, T., et al.: Revealing community structures by ensemble clustering using group diffusion. Inf. Fusion 42, 24–36 (2018)CrossRef
3.
Zurück zum Zitat Slimen, Y., Allio, S., Jacques, J.: Model-based co-clustering for functional data. Neurocomputing 291, 97–108 (2018)CrossRef Slimen, Y., Allio, S., Jacques, J.: Model-based co-clustering for functional data. Neurocomputing 291, 97–108 (2018)CrossRef
4.
Zurück zum Zitat Fraley, C., Raftery, A.: Model-based clustering, discriminant analysis, and density estimation. J. Am. Stat. Assoc. 97, 611–631 (2011)MathSciNetCrossRef Fraley, C., Raftery, A.: Model-based clustering, discriminant analysis, and density estimation. J. Am. Stat. Assoc. 97, 611–631 (2011)MathSciNetCrossRef
5.
Zurück zum Zitat Rodríguez, A., Laio, A.: Clustering by fast search and find of density peaks. Science 344, 1492–1496 (2014)CrossRef Rodríguez, A., Laio, A.: Clustering by fast search and find of density peaks. Science 344, 1492–1496 (2014)CrossRef
6.
Zurück zum Zitat Xu, X., Ding, S., Du, M., et al.: DPCG: an efficient density peaks clustering algorithm based on grid. Int. J. Mach. Learn. Cybern. 9, 743–754 (2018)CrossRef Xu, X., Ding, S., Du, M., et al.: DPCG: an efficient density peaks clustering algorithm based on grid. Int. J. Mach. Learn. Cybern. 9, 743–754 (2018)CrossRef
7.
Zurück zum Zitat Ding, S., Du, M., Sun, T., et al.: An entropy-based density peaks clustering algorithm for mixed type data employing fuzzy neighborhood. Knowl. Based Syst. 133, 294–313 (2017)CrossRef Ding, S., Du, M., Sun, T., et al.: An entropy-based density peaks clustering algorithm for mixed type data employing fuzzy neighborhood. Knowl. Based Syst. 133, 294–313 (2017)CrossRef
8.
Zurück zum Zitat Liu, R., Wang, H., Yu, X.: Shared-nearest-neighbor-based clustering by fast search and find of density peaks. Inf. Sci. 450, 200–226 (2018)MathSciNetCrossRef Liu, R., Wang, H., Yu, X.: Shared-nearest-neighbor-based clustering by fast search and find of density peaks. Inf. Sci. 450, 200–226 (2018)MathSciNetCrossRef
9.
Zurück zum Zitat Du, M., Ding, S., Jia, H.: Study on density peaks clustering based on K-nearest neighbors and principal component analysis. Knowl. Based Syst. 99, 135–145 (2016)CrossRef Du, M., Ding, S., Jia, H.: Study on density peaks clustering based on K-nearest neighbors and principal component analysis. Knowl. Based Syst. 99, 135–145 (2016)CrossRef
10.
Zurück zum Zitat Xie, J., Gao, H., Xie, W., et al.: Robust clustering by detecting density peaks and assigning points based on fuzzy weighted K-nearest neighbors. Inf. Sci. 354, 19–40 (2016)CrossRef Xie, J., Gao, H., Xie, W., et al.: Robust clustering by detecting density peaks and assigning points based on fuzzy weighted K-nearest neighbors. Inf. Sci. 354, 19–40 (2016)CrossRef
11.
Zurück zum Zitat Shi, Y., Chen, Z., Qi, Z., et al.: A novel clustering-based image segmentation via density peaks algorithm with mid-level feature. Neural Comput. Appl. 28, 29–39 (2017)CrossRef Shi, Y., Chen, Z., Qi, Z., et al.: A novel clustering-based image segmentation via density peaks algorithm with mid-level feature. Neural Comput. Appl. 28, 29–39 (2017)CrossRef
12.
Zurück zum Zitat Bai, L., Cheng, X., Liang, J., et al.: Fast density clustering strategies based on the k-means algorithm. Pattern Recogn. 71, 375–386 (2017)CrossRef Bai, L., Cheng, X., Liang, J., et al.: Fast density clustering strategies based on the k-means algorithm. Pattern Recogn. 71, 375–386 (2017)CrossRef
13.
Zurück zum Zitat Wang, M., Min, F., Zhang, Z., et al.: Active learning through density clustering. Expert Syst. Appl. 85, 305–317 (2017)CrossRef Wang, M., Min, F., Zhang, Z., et al.: Active learning through density clustering. Expert Syst. Appl. 85, 305–317 (2017)CrossRef
14.
Zurück zum Zitat Zhou, L., Pei, C.: Delta-distance based clustering with a divide-and-conquer strategy: 3DC clustering. Pattern Recogn. Lett. 73, 52–59 (2016)CrossRef Zhou, L., Pei, C.: Delta-distance based clustering with a divide-and-conquer strategy: 3DC clustering. Pattern Recogn. Lett. 73, 52–59 (2016)CrossRef
15.
Zurück zum Zitat Krumhansl, C.: Concerning the applicability of geometric models to similarity data: the interrelationship between similarity and spatial density. Psychol. Rev. 85, 445–463 (1987)CrossRef Krumhansl, C.: Concerning the applicability of geometric models to similarity data: the interrelationship between similarity and spatial density. Psychol. Rev. 85, 445–463 (1987)CrossRef
16.
Zurück zum Zitat Kai, M., Zhu, Y., Carman, M., et al.: Overcoming key weaknesses of distance-based neighbourhood methods using a data dependent dissimilarity measure. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2016, San Francisco, California, USA, pp. 1205–1214, 13–17 August 2016 Kai, M., Zhu, Y., Carman, M., et al.: Overcoming key weaknesses of distance-based neighbourhood methods using a data dependent dissimilarity measure. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2016, San Francisco, California, USA, pp. 1205–1214, 13–17 August 2016
17.
Zurück zum Zitat Aryal, S., Kai, M., Haffari, G., et al.: Mp-dissimilarity: a data dependent dissimilarity measure. In: 2014 IEEE International Conference on Data Mining, Shenzhen, China, pp. 707–712, 14–17 December 2014 Aryal, S., Kai, M., Haffari, G., et al.: Mp-dissimilarity: a data dependent dissimilarity measure. In: 2014 IEEE International Conference on Data Mining, Shenzhen, China, pp. 707–712, 14–17 December 2014
18.
Zurück zum Zitat Chen, B., Ting, K., Washio, T., et al.: Half-space mass: a maximally robust and efficient data depth method. Mach. Learn. 100, 677–699 (2015)MathSciNetCrossRef Chen, B., Ting, K., Washio, T., et al.: Half-space mass: a maximally robust and efficient data depth method. Mach. Learn. 100, 677–699 (2015)MathSciNetCrossRef
Metadaten
Titel
Mass-Based Density Peaks Clustering Algorithm
verfasst von
Ding Ling
Xu Xiao
Copyright-Jahr
2018
DOI
https://doi.org/10.1007/978-3-030-00828-4_5