Skip to main content
Top
Published in: Natural Computing 2/2023

22-08-2022

What’s in a distance? Exploring the interplay between distance measures and internal cluster validity in multi-objective clustering

Authors: Adán José-García, Julia Handl

Published in: Natural Computing | Issue 2/2023

Log in

Activate our intelligent search to find suitable subject content or patents.

search-config
loading …

Abstract

The problem of cluster analysis eludes a unique mathematical definition. Instead, a variety of different instantiations of the problem can be defined using specific measures of internal cluster validity. In turn, such internal cluster validity measures rely on quantifying dissimilarity between entities. This article explores the interaction between dissimilarity measures and internal cluster validity techniques in the context of multi-objective clustering. It does so by contrasting two conceptually different approaches to multi-objective clustering: the multi-criterion clustering algorithm \(\Delta\)-MOCK, designed to optimise different measures of internal cluster validity over a single dissimilarity space, and the multi-view clustering algorithm MVMC, designed to optimise a single measure of internal cluster validity over distinct dissimilarity spaces. Our comparison highlights the interchangeable roles of distance functions and measures of internal cluster validity, which paves the way for the future design of a flexible, dual-purpose approach to multi-objective clustering.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literature
go back to reference Bayá AE, Granitto PM (2013) How many clusters: a validation index for arbitrary-shaped clusters. IEEE/ACM Trans Comput Biol Bioinf 10(2):401–14CrossRef Bayá AE, Granitto PM (2013) How many clusters: a validation index for arbitrary-shaped clusters. IEEE/ACM Trans Comput Biol Bioinf 10(2):401–14CrossRef
go back to reference de Carvalho F, Lechevallier Y, de Melo FM (2012) Partitioning hard clustering algorithms based on multiple dissimilarity matrices. Pattern Recogn 45(1):447–464CrossRefMATH de Carvalho F, Lechevallier Y, de Melo FM (2012) Partitioning hard clustering algorithms based on multiple dissimilarity matrices. Pattern Recogn 45(1):447–464CrossRefMATH
go back to reference de Carvalho F, Lechevallier Y, Despeyroux T et al (2014) Multi-view clustering on relational data. In: Zighed F, Abdelkader G, Gilles P et al (eds) Advances in knowledge discovery and management. Springer, Heidelberg, pp 37–51CrossRef de Carvalho F, Lechevallier Y, Despeyroux T et al (2014) Multi-view clustering on relational data. In: Zighed F, Abdelkader G, Gilles P et al (eds) Advances in knowledge discovery and management. Springer, Heidelberg, pp 37–51CrossRef
go back to reference Delattre M, Hansen P (1980) Bicriterion cluster analysis. IEEE Trans Pattern Anal Mach Intell 2(4):277–291CrossRefMATH Delattre M, Hansen P (1980) Bicriterion cluster analysis. IEEE Trans Pattern Anal Mach Intell 2(4):277–291CrossRefMATH
go back to reference Garza-Fabre M, Handl J, Knowles J (2018) An improved and more scalable evolutionary approach to multiobjective clustering. IEEE Trans Evol Comput 22(4):515–535CrossRef Garza-Fabre M, Handl J, Knowles J (2018) An improved and more scalable evolutionary approach to multiobjective clustering. IEEE Trans Evol Comput 22(4):515–535CrossRef
go back to reference Handl J, Knowles J (2007) An evolutionary approach to multiobjective clustering. IEEE Trans Evol Comput 11(1):56–76CrossRef Handl J, Knowles J (2007) An evolutionary approach to multiobjective clustering. IEEE Trans Evol Comput 11(1):56–76CrossRef
go back to reference José-García A, Gómez-Flores W (2016) Automatic clustering using nature-Inspired metaheuristics: a survey. Appl Soft Comput 41:192–213CrossRef José-García A, Gómez-Flores W (2016) Automatic clustering using nature-Inspired metaheuristics: a survey. Appl Soft Comput 41:192–213CrossRef
go back to reference José-García A, Handl J (2021) On the interaction between distance functions and clustering criteria in multi-objective clustering. In: International conference on evolutionary multi-criterion optimization, Springer, pp 504–515 José-García A, Handl J (2021) On the interaction between distance functions and clustering criteria in multi-objective clustering. In: International conference on evolutionary multi-criterion optimization, Springer, pp 504–515
go back to reference José-García A, Handl J, Gómez-Flores W et al (2019) Many-view clustering: An illustration using multiple dissimilarity measures. In: Press ACM (ed) Genetic and Evolutionary Computation Conference - GECCO ’19. Republic Prague, Czech, pp 213–214 José-García A, Handl J, Gómez-Flores W et al (2019) Many-view clustering: An illustration using multiple dissimilarity measures. In: Press ACM (ed) Genetic and Evolutionary Computation Conference - GECCO ’19. Republic Prague, Czech, pp 213–214
go back to reference José-García A, Handl J, Gómez-Flores W et al (2021) An evolutionary many-objective approach to multiview clustering using feature and relational data. Appl Soft Comput 108:1–15CrossRef José-García A, Handl J, Gómez-Flores W et al (2021) An evolutionary many-objective approach to multiview clustering using feature and relational data. Appl Soft Comput 108:1–15CrossRef
go back to reference Kanaan-Izquierdo S, Ziyatdinov A, Perera-Lluna A (2018) Multiview and multifeature spectral clustering using common eigenvectors. Pattern Recogn Lett 102:30–36CrossRef Kanaan-Izquierdo S, Ziyatdinov A, Perera-Lluna A (2018) Multiview and multifeature spectral clustering using common eigenvectors. Pattern Recogn Lett 102:30–36CrossRef
go back to reference MacQueen J (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley symposium on mathematical statistics and probability. University of California Press, pp 281–297 MacQueen J (1967) Some methods for classification and analysis of multivariate observations. In: Proceedings of the Fifth Berkeley symposium on mathematical statistics and probability. University of California Press, pp 281–297
go back to reference Mukhopadhyay A, Maulik U, Bandyopadhyay S (2015) A survey of multiobjective evolutionary clustering. ACM Comput Surv (CSUR) 47(4):1–46CrossRef Mukhopadhyay A, Maulik U, Bandyopadhyay S (2015) A survey of multiobjective evolutionary clustering. ACM Comput Surv (CSUR) 47(4):1–46CrossRef
go back to reference Park Y, Song M (1998) A genetic algorithm for clustering problems. In: Proceedings of the Third Annual Conference on Genetic Programming, pp 568–575 Park Y, Song M (1998) A genetic algorithm for clustering problems. In: Proceedings of the Third Annual Conference on Genetic Programming, pp 568–575
go back to reference Rousseeuw PJ (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20:53–65CrossRefMATH Rousseeuw PJ (1987) Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Comput Appl Math 20:53–65CrossRefMATH
go back to reference Santos JM, de Sá JM (2005) Human clustering on bi-dimensional data: an assessment. Tech. rep, INEB -Instituto de Engenharia Biomedica Santos JM, de Sá JM (2005) Human clustering on bi-dimensional data: an assessment. Tech. rep, INEB -Instituto de Engenharia Biomedica
go back to reference Sun S (2013) A survey of multi-view machine learning. Neural Comput Appl 23(7–8):2031–2038CrossRef Sun S (2013) A survey of multi-view machine learning. Neural Comput Appl 23(7–8):2031–2038CrossRef
go back to reference Theodoridis S, Koutrumbas K (2009) Pattern recognition, 4th edn. Elsevier Inc, Amsterdam Theodoridis S, Koutrumbas K (2009) Pattern recognition, 4th edn. Elsevier Inc, Amsterdam
go back to reference Tibshirani R, Walther G, Hastie T (2001) Estimating the number of clusters in a data set via the gap statistic. J R Statist Soc Ser B (Statist Methodol) 63(2):411–423MathSciNetCrossRefMATH Tibshirani R, Walther G, Hastie T (2001) Estimating the number of clusters in a data set via the gap statistic. J R Statist Soc Ser B (Statist Methodol) 63(2):411–423MathSciNetCrossRefMATH
go back to reference Zhang Q, Li H (2007) MOEA/D: a multiobjective evolutionary algorithm based on decomposition. IEEE Trans Evol Comput 11(6):712–731CrossRef Zhang Q, Li H (2007) MOEA/D: a multiobjective evolutionary algorithm based on decomposition. IEEE Trans Evol Comput 11(6):712–731CrossRef
Metadata
Title
What’s in a distance? Exploring the interplay between distance measures and internal cluster validity in multi-objective clustering
Authors
Adán José-García
Julia Handl
Publication date
22-08-2022
Publisher
Springer Netherlands
Published in
Natural Computing / Issue 2/2023
Print ISSN: 1567-7818
Electronic ISSN: 1572-9796
DOI
https://doi.org/10.1007/s11047-022-09909-y

Other articles of this Issue 2/2023

Natural Computing 2/2023 Go to the issue

Premium Partner