Skip to main content

2017 | OriginalPaper | Buchkapitel

A Potential-Based Density Estimation Method for Clustering Using Decision Graph

verfasst von : Huanqian Yan, Yonggang Lu, Li Li

Erschienen in: Intelligent Data Engineering and Automated Learning – IDEAL 2017

Verlag: Springer International Publishing

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Clustering is an important unsupervised machine learning method which has played an important role in various fields. As suggested by Alex Rodriguez et al. in a paper published in Science in 2014, the 2D decision graph of the estimated density value versus the minimum distance from the points with higher density values for all the data points can be used to identify the cluster centroids. However, the traditional kernel density estimation methods may be affected by the setting of the parameters and cannot work well for some complex datasets. In this work, a novel potential-based method is designed to estimate density values, which is not sensitive to the parameters and is more effective than the traditional kernel density estimation methods. Experiments on several synthetic and real-world datasets show the superiority of the proposed method in clustering the datasets with various distributions and dimensionalities.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Satyanarayana, A., Acquaviva, V.: Enhanced cobweb clustering for identifying analog galaxies in astrophysics. In: IEEE Electrical and Computer Engineering, pp. 1–4 (2014) Satyanarayana, A., Acquaviva, V.: Enhanced cobweb clustering for identifying analog galaxies in astrophysics. In: IEEE Electrical and Computer Engineering, pp. 1–4 (2014)
2.
Zurück zum Zitat Menéndez, H.D., Plaza, L., Camacho, D.: Combining graph connectivity and genetic clustering to improve biomedical summarization. In: IEEE Congress on Evolutionary Computation, pp. 2740–2747 (2014) Menéndez, H.D., Plaza, L., Camacho, D.: Combining graph connectivity and genetic clustering to improve biomedical summarization. In: IEEE Congress on Evolutionary Computation, pp. 2740–2747 (2014)
3.
Zurück zum Zitat Han, E., et al.: Clustering of 770,000 genomes reveals post-colonial population structure of North America. Nat. Commun. 8, 14238 (2017)CrossRef Han, E., et al.: Clustering of 770,000 genomes reveals post-colonial population structure of North America. Nat. Commun. 8, 14238 (2017)CrossRef
4.
Zurück zum Zitat Lu, Y., Wan, Y.: Clustering by sorting potential values (CSPV): a novel potential-based clustering method. Pattern Recogn. 45(9), 3512–3522 (2012)CrossRef Lu, Y., Wan, Y.: Clustering by sorting potential values (CSPV): a novel potential-based clustering method. Pattern Recogn. 45(9), 3512–3522 (2012)CrossRef
5.
Zurück zum Zitat Kleinberg, J.: An impossibility theorem for clustering. In: NIPS, pp. 463–470 (2002) Kleinberg, J.: An impossibility theorem for clustering. In: NIPS, pp. 463–470 (2002)
6.
Zurück zum Zitat Gan, G., Ng, M.K.-P.: k-means clustering with outlier removal. Pattern Recogn. Lett. 8–14 (2017) Gan, G., Ng, M.K.-P.: k-means clustering with outlier removal. Pattern Recogn. Lett. 8–14 (2017)
7.
Zurück zum Zitat Song, H., Yan, H.: Novel K-medoids clustering algorithm based on dynamic search of microparticles under optimized granular computing. Intell. Comput. Appl. (2016) Song, H., Yan, H.: Novel K-medoids clustering algorithm based on dynamic search of microparticles under optimized granular computing. Intell. Comput. Appl. (2016)
9.
Zurück zum Zitat Serdah, A.M., Ashour, W.M.: Clustering large-scale data based on modified affinity propagation algorithm. J. Artif. Intell. Soft Comput. Res. 6(1), 23–33 (2016)CrossRef Serdah, A.M., Ashour, W.M.: Clustering large-scale data based on modified affinity propagation algorithm. J. Artif. Intell. Soft Comput. Res. 6(1), 23–33 (2016)CrossRef
10.
Zurück zum Zitat Ward Jr., J.H.: Hierarchical grouping to optimize an objective function. J. Am. Stat. Assoc. 58(301), 236–244 (1963)MathSciNetCrossRef Ward Jr., J.H.: Hierarchical grouping to optimize an objective function. J. Am. Stat. Assoc. 58(301), 236–244 (1963)MathSciNetCrossRef
11.
Zurück zum Zitat Höppner, F.: Fuzzy Cluster Analysis: Methods for Classification, Data Analysis and Image Recognition. Wiley, New York (1999)MATH Höppner, F.: Fuzzy Cluster Analysis: Methods for Classification, Data Analysis and Image Recognition. Wiley, New York (1999)MATH
12.
Zurück zum Zitat Jain, A.K.: Data clustering: 50 years beyond K-means. Pattern Recogn. Lett. 31(8), 651–666 (2010)CrossRef Jain, A.K.: Data clustering: 50 years beyond K-means. Pattern Recogn. Lett. 31(8), 651–666 (2010)CrossRef
13.
Zurück zum Zitat Ester, M., et al.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: KDD, vol. 96(34), pp. 226–231 (1996) Ester, M., et al.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: KDD, vol. 96(34), pp. 226–231 (1996)
14.
Zurück zum Zitat Ghassabeh, Y.A., Rudzicz, F.: The mean shift algorithm and its relation to Kernel regression. Inf. Sci. 348, 198–208 (2016)MathSciNetCrossRef Ghassabeh, Y.A., Rudzicz, F.: The mean shift algorithm and its relation to Kernel regression. Inf. Sci. 348, 198–208 (2016)MathSciNetCrossRef
15.
Zurück zum Zitat Campello, R.J.G.B., et al.: Hierarchical density estimates for data clustering, visualization, and outlier detection. ACM Trans. Knowl. Discov. Data (TKDD) 10(1), 5 (2015) Campello, R.J.G.B., et al.: Hierarchical density estimates for data clustering, visualization, and outlier detection. ACM Trans. Knowl. Discov. Data (TKDD) 10(1), 5 (2015)
16.
Zurück zum Zitat Rodriguez, A., Laio, A.: Clustering by fast search and find of density peaks. Science 344(6191), 1492–1496 (2014)CrossRef Rodriguez, A., Laio, A.: Clustering by fast search and find of density peaks. Science 344(6191), 1492–1496 (2014)CrossRef
17.
Zurück zum Zitat Lu, Y., Hou, X., Chen, X.: A novel travel-time based similarity measure for hierarchical clustering. Neurocomputing 173, 3–8 (2016)CrossRef Lu, Y., Hou, X., Chen, X.: A novel travel-time based similarity measure for hierarchical clustering. Neurocomputing 173, 3–8 (2016)CrossRef
19.
Zurück zum Zitat Fowlkes, E.B., Mallows, C.L.: A method for comparing two hierarchical clusterings. J. Am. Stat. Assoc. 78(383), 553–569 (1983)CrossRefMATH Fowlkes, E.B., Mallows, C.L.: A method for comparing two hierarchical clusterings. J. Am. Stat. Assoc. 78(383), 553–569 (1983)CrossRefMATH
Metadaten
Titel
A Potential-Based Density Estimation Method for Clustering Using Decision Graph
verfasst von
Huanqian Yan
Yonggang Lu
Li Li
Copyright-Jahr
2017
DOI
https://doi.org/10.1007/978-3-319-68935-7_9

Premium Partner