Skip to main content
Erschienen in: Neural Processing Letters 2/2018

30.11.2017

Density Based Cluster Growing via Dominant Sets

verfasst von: Jian Hou, Xu E, Weixue Liu

Erschienen in: Neural Processing Letters | Ausgabe 2/2018

Einloggen

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Although there are a lot of clustering algorithms available in the literature, existing algorithms are usually afflicted by practical problems of one form or another, including parameter dependence and the inability to generate clusters of arbitrary shapes. In this paper we aim to solve these two problems by merging the merits of dominant sets and density based clustering algorithms. We firstly apply histogram equalization to eliminate the parameter dependence problem of the dominant sets algorithm. Noticing that the obtained clusters are usually smaller than the real ones, a density threshold based cluster growing step is then used to improve the clustering results, where the involved parameters are determined based on the initial clusters. This is followed by the second cluster growing step which makes use of the density relationship between neighboring data. Data clustering experiments and comparison with other algorithms validate the effectiveness of the proposed algorithm.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Achtert E, Bohm C, Kroger P (2006) Deli-clu: boosting robustness, completeness, usability, and efficiency of hierarchical clustering by a closest pair ranking. In: International conference on knowledge discovery and data mining, pp 119–128CrossRef Achtert E, Bohm C, Kroger P (2006) Deli-clu: boosting robustness, completeness, usability, and efficiency of hierarchical clustering by a closest pair ranking. In: International conference on knowledge discovery and data mining, pp 119–128CrossRef
2.
Zurück zum Zitat Ankerst M, Breunig MM, Kriegel HP, Sander J (1999) Optics: ordering points to identify the clustering structure. In: ACM SIGMOD international conference on management of data, pp 49–60 Ankerst M, Breunig MM, Kriegel HP, Sander J (1999) Optics: ordering points to identify the clustering structure. In: ACM SIGMOD international conference on management of data, pp 49–60
3.
4.
Zurück zum Zitat Bulo SR, Pelillo M, Bomze IM (2011) Graph-based quadratic optimization: a fast evolutionary approach. Comput Vis Image Underst 115(7):984–995CrossRef Bulo SR, Pelillo M, Bomze IM (2011) Graph-based quadratic optimization: a fast evolutionary approach. Comput Vis Image Underst 115(7):984–995CrossRef
5.
Zurück zum Zitat Chang H, Yeung DY (2008) Robust path-based spectral clustering. Pattern Recognit 41(1):191–203CrossRef Chang H, Yeung DY (2008) Robust path-based spectral clustering. Pattern Recognit 41(1):191–203CrossRef
6.
Zurück zum Zitat Cheng Y (1995) Mean shift, mode seeking, and clustering. IEEE Trans Pattern Anal Mach Intell 17(8):790–799CrossRef Cheng Y (1995) Mean shift, mode seeking, and clustering. IEEE Trans Pattern Anal Mach Intell 17(8):790–799CrossRef
7.
Zurück zum Zitat Comaniciu D, Peter M (2002) Mean shift: a robust approach toward feature space analysis. IEEE Trans Pattern Anal Mach Intell 24(5):603–619CrossRef Comaniciu D, Peter M (2002) Mean shift: a robust approach toward feature space analysis. IEEE Trans Pattern Anal Mach Intell 24(5):603–619CrossRef
8.
Zurück zum Zitat Daszykowski M, Walczak B, Massart DL (2001) Looking for natural patterns in data: part 1. Density-based approach. Chemometr Intell Lab Syst 56(2):83–92CrossRef Daszykowski M, Walczak B, Massart DL (2001) Looking for natural patterns in data: part 1. Density-based approach. Chemometr Intell Lab Syst 56(2):83–92CrossRef
9.
Zurück zum Zitat Ding J, Chen Z, He X, Zhan Y (2016) Clustering by finding density peaks based on Chebyshev’s inequality. In: Chinese control conference, pp 7169–7172 Ding J, Chen Z, He X, Zhan Y (2016) Clustering by finding density peaks based on Chebyshev’s inequality. In: Chinese control conference, pp 7169–7172
10.
Zurück zum Zitat Ester M, Kriegel HP, Sander J, Xu XW (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: International conference on knowledge discovery and data mining, pp 226–231 Ester M, Kriegel HP, Sander J, Xu XW (1996) A density-based algorithm for discovering clusters in large spatial databases with noise. In: International conference on knowledge discovery and data mining, pp 226–231
11.
Zurück zum Zitat Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of individuals using the software structure: a simulation study. Mol Ecol 14(8):2611–2620CrossRef Evanno G, Regnaut S, Goudet J (2005) Detecting the number of clusters of individuals using the software structure: a simulation study. Mol Ecol 14(8):2611–2620CrossRef
12.
Zurück zum Zitat Fraley C, Raftery AE (1998) How many clusters? which clustering method? Answers via model-based cluster analysis. Comput J 41(8):578–588CrossRef Fraley C, Raftery AE (1998) How many clusters? which clustering method? Answers via model-based cluster analysis. Comput J 41(8):578–588CrossRef
13.
Zurück zum Zitat Fränti P, Virmajoki O, Hautamäki V (2006) Fast agglomerative clustering using a k-nearest neighbor graph. IEEE Trans Pattern Anal Mach Intell 28(11):1875–1881CrossRef Fränti P, Virmajoki O, Hautamäki V (2006) Fast agglomerative clustering using a k-nearest neighbor graph. IEEE Trans Pattern Anal Mach Intell 28(11):1875–1881CrossRef
14.
Zurück zum Zitat Fu L, Medico E (2007) Flame, a novel fuzzy clustering method for the analysis of DNA microarray data. BMC Bioinformatics 8(1):1–17CrossRef Fu L, Medico E (2007) Flame, a novel fuzzy clustering method for the analysis of DNA microarray data. BMC Bioinformatics 8(1):1–17CrossRef
15.
Zurück zum Zitat Gionis A, Mannila H, Tsaparas P (2007) Clustering aggregation. ACM Trans Knowl Discov Data 1(1):1–30CrossRef Gionis A, Mannila H, Tsaparas P (2007) Clustering aggregation. ACM Trans Knowl Discov Data 1(1):1–30CrossRef
16.
Zurück zum Zitat Hinnerberg A, Keim D (1998) An efficient approach to clustering large multimedia databases with noise. In: International conference on knowledge discovery and data mining pp 58–65 Hinnerberg A, Keim D (1998) An efficient approach to clustering large multimedia databases with noise. In: International conference on knowledge discovery and data mining pp 58–65
17.
Zurück zum Zitat Hou J, Gao H, Li X (2016) DSets-DBSCAN: a parameter-free clustering algorithm. IEEE Trans Image Process 25(7):3182–3193MathSciNetCrossRef Hou J, Gao H, Li X (2016) DSets-DBSCAN: a parameter-free clustering algorithm. IEEE Trans Image Process 25(7):3182–3193MathSciNetCrossRef
18.
Zurück zum Zitat Hou J, Gao H, Xia Q, Qi N (2016) Feature combination and the knn framework in object classification. IEEE Trans Neural Netw Learn Syst 27(6):1368–1378CrossRef Hou J, Gao H, Xia Q, Qi N (2016) Feature combination and the knn framework in object classification. IEEE Trans Neural Netw Learn Syst 27(6):1368–1378CrossRef
19.
Zurück zum Zitat Hou J, Liu W, Xu E (2016) Density based clustering via dominant sets. In: IAPR-TC3 workshop on artificial neural networks in pattern recognition, pp 80–91CrossRef Hou J, Liu W, Xu E (2016) Density based clustering via dominant sets. In: IAPR-TC3 workshop on artificial neural networks in pattern recognition, pp 80–91CrossRef
20.
Zurück zum Zitat Hou J, Liu W, Xu E, Cui H (2016) Towards parameter-independent data clustering and image segmentation. Pattern Recogn 60:25–36CrossRef Hou J, Liu W, Xu E, Cui H (2016) Towards parameter-independent data clustering and image segmentation. Pattern Recogn 60:25–36CrossRef
21.
Zurück zum Zitat Hou J, Pelillo M (2013) A simple feature combination method based on dominant sets. Pattern Recogn 46(11):3129–3139CrossRef Hou J, Pelillo M (2013) A simple feature combination method based on dominant sets. Pattern Recogn 46(11):3129–3139CrossRef
22.
Zurück zum Zitat Jain AK, Law MHC (2005) Data clustering: a user’s dilemma. In: International conference on pattern recognition and machine intelligence, pp 1–10 Jain AK, Law MHC (2005) Data clustering: a user’s dilemma. In: International conference on pattern recognition and machine intelligence, pp 1–10
23.
Zurück zum Zitat Kärkkäinen I, Fränti P (2002) Dynamic local search algorithm for the clustering problem. Research report A-2002-6, University of Joensuu Kärkkäinen I, Fränti P (2002) Dynamic local search algorithm for the clustering problem. Research report A-2002-6, University of Joensuu
24.
Zurück zum Zitat Pavan M, Pelillo M (2003) A graph-theoretic approach to clustering and segmentation. In: IEEE international conference on computer vision and pattern recognition, pp 145–152 Pavan M, Pelillo M (2003) A graph-theoretic approach to clustering and segmentation. In: IEEE international conference on computer vision and pattern recognition, pp 145–152
25.
Zurück zum Zitat Pavan M, Pelillo M (2005) Efficient out-of-sample extension of dominant-set clusters. In: Advances in neural information processing systems, pp 1057–1064 Pavan M, Pelillo M (2005) Efficient out-of-sample extension of dominant-set clusters. In: Advances in neural information processing systems, pp 1057–1064
26.
Zurück zum Zitat Pavan M, Pelillo M (2007) Dominant sets and pairwise clustering. IEEE Trans Pattern Anal Mach Intell 29(1):167–172CrossRef Pavan M, Pelillo M (2007) Dominant sets and pairwise clustering. IEEE Trans Pattern Anal Mach Intell 29(1):167–172CrossRef
27.
Zurück zum Zitat Rodriguez A, Laio A (2014) Clustering by fast search and find of density peaks. Science 344:1492–1496CrossRef Rodriguez A, Laio A (2014) Clustering by fast search and find of density peaks. Science 344:1492–1496CrossRef
28.
Zurück zum Zitat Rosenberg A, Hirschberg J (2007) V-measure: a conditional entropy-based external cluster evaluation measure. In: Joint conference on empirical methods in natural language processing and computational natural language learning, pp 410–420 Rosenberg A, Hirschberg J (2007) V-measure: a conditional entropy-based external cluster evaluation measure. In: Joint conference on empirical methods in natural language processing and computational natural language learning, pp 410–420
29.
Zurück zum Zitat Roy S, Bhattacharyya DK (2005) An approach to find embedded clusters using density based techniques. LNCS 3816:523–535 Roy S, Bhattacharyya DK (2005) An approach to find embedded clusters using density based techniques. LNCS 3816:523–535
30.
Zurück zum Zitat Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22(8):167–172 Shi J, Malik J (2000) Normalized cuts and image segmentation. IEEE Trans Pattern Anal Mach Intell 22(8):167–172
31.
Zurück zum Zitat Torsello A, Bulo SR, Pelillo M (2006) Grouping with asymmetric affinities: a game-theoretic perspective. IEEE Int Conf Comput Vis Pattern Recognit 1:292–299 Torsello A, Bulo SR, Pelillo M (2006) Grouping with asymmetric affinities: a game-theoretic perspective. IEEE Int Conf Comput Vis Pattern Recognit 1:292–299
32.
Zurück zum Zitat Veenman CJ, Reinders M, Backer E (2002) A maximum variance cluster algorithm. IEEE Trans Pattern Anal Mach Intell 24(9):1273–1280CrossRef Veenman CJ, Reinders M, Backer E (2002) A maximum variance cluster algorithm. IEEE Trans Pattern Anal Mach Intell 24(9):1273–1280CrossRef
34.
Zurück zum Zitat Yang N, Liu Q, Li Y, Xiao L, Liu X (2016) Star-scan: a stable clustering by statistically finding centers and noises. In: Asia-Pacific web conference on web technologies and applications, pp 456–467CrossRef Yang N, Liu Q, Li Y, Xiao L, Liu X (2016) Star-scan: a stable clustering by statistically finding centers and noises. In: Asia-Pacific web conference on web technologies and applications, pp 456–467CrossRef
35.
Zurück zum Zitat Zahn CT (1971) Graph-theoretical methods for detecting and describing gestalt clusters. IEEE Trans Comput 20(1):68–86CrossRef Zahn CT (1971) Graph-theoretical methods for detecting and describing gestalt clusters. IEEE Trans Comput 20(1):68–86CrossRef
36.
Zurück zum Zitat Zhu X, Loy CC, Gong S (2014) Constructing robust affinity graphs for spectral clustering. In: IEEE International conference on computer vision and pattern recognition, pp 1450–1457 Zhu X, Loy CC, Gong S (2014) Constructing robust affinity graphs for spectral clustering. In: IEEE International conference on computer vision and pattern recognition, pp 1450–1457
Metadaten
Titel
Density Based Cluster Growing via Dominant Sets
verfasst von
Jian Hou
Xu E
Weixue Liu
Publikationsdatum
30.11.2017
Verlag
Springer US
Erschienen in
Neural Processing Letters / Ausgabe 2/2018
Print ISSN: 1370-4621
Elektronische ISSN: 1573-773X
DOI
https://doi.org/10.1007/s11063-017-9767-3

Weitere Artikel der Ausgabe 2/2018

Neural Processing Letters 2/2018 Zur Ausgabe

Neuer Inhalt