2011 | OriginalPaper | Buchkapitel
Evaluation of Categorical Data Clustering
verfasst von : Hana Rezankova, Tomas Loster, Dusan Husek
Erschienen in: Advances in Intelligent Web Mastering – 3
Verlag: Springer Berlin Heidelberg
Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.
Wählen Sie Textabschnitte aus um mit Künstlicher Intelligenz passenden Patente zu finden. powered by
Markieren Sie Textabschnitte, um KI-gestützt weitere passende Inhalte zu finden. powered by
Methods of cluster analysis are well known techniques of multivariate analysis used for many years. Their main applications concern clustering objects characterized by quantitative variables. For this case various coefficients for clustering evaluation and determination of cluster numbers have been proposed. However, in some areas, i.e., for segmentation of Internet users, the variables are often nominal or ordinal as their origin in questionnaire responses. That is why we are dealing with the evaluation criteria for the case of categorical variables here. The criteria based on variability measures are proposed. Instead of variance as a measure for quantitative variables, three measures for nominal variables are considered: the variability measure based on a modal frequency, Gini’s coefficient of mutability, and the entropy. The proposed evaluation criteria are applied to a real-dataset.