Top

Journal of Classification

Published in:

11-07-2019

Note: t for Two (Clusters)

Author: Stanley L. Sclove

Published in: Journal of Classification | Issue 3/2019

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Off

Abstract

The computation for cluster analysis is done by iterative algorithms. But here, a straightforward, non-iterative procedure is presented for clustering in the special case of one variable and two groups. The method is univariate but may reasonably be applied to multivariate datasets when the first principal component or a single factor explains much of the variation in the data. The t method is motivated by the fact that minimizing the within-groups sum of squares is equivalent to maximizing the between-groups sum of squares, and that Student’s t statistic measures the between-groups difference in means relative to within-groups variation. That is, the t statistic is the ratio of the difference in sample means, divided by the standard error of this difference. So, maximizing the t statistic is developed as a method for clustering univariate data into two clusters. In this situation, the t method gives the same results as the K-means algorithm. K-means tacitly assumes equality of variances; here, however, with t, equality of variances need not be assumed because separate variances may be used in computing t. The t method is applied to some datasets; the results are compared with those obtained by fitting mixtures of distributions.

previous article MCC: a Multiple Consensus Clustering Framework

next article The δ-Machine: Classification Based on Distances Towards Prototypes

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

über 102.000 Bücher
über 537 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Finance + Banking
Management + Führung
Marketing + Vertrieb
Maschinenbau + Werkstoffe
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 390 Zeitschriften

aus folgenden Fachgebieten:

Automobil + Motoren
Bauwesen + Immobilien
Business IT + Informatik
Elektrotechnik + Elektronik
Energie + Nachhaltigkeit
Maschinenbau + Werkstoffe

Jetzt Wissensvorsprung sichern!

inform now

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

über 67.000 Bücher
über 340 Zeitschriften

aus folgenden Fachgebieten:

Bauwesen + Immobilien
Business IT + Informatik
Finance + Banking
Management + Führung
Marketing + Vertrieb
Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

inform now

Connor, L.R., & Morrell, A.J.H. (1977). Statistics in theory and practice, 7th Edn. London: Pitman.

Kenkel, J.L. (1984). Introductory statistics for management and economics, (p. 31). Boston: Duxbury Press. Exercise 4.

MacQueen, J.B. (1967). Some methods for classification and analysis of multivariate observations. In Proc. Fifth Berkeley symp. on math. statist. and prob., (Vol. 1 pp. 281–297).

Steinhaus, H. (1956). Sur la division des corps materiels en parties. Bulletin l’Académie Polonaise des Science (Bulletin of the Polish Academy of Science) (in French), 4(12), 801–804.MathSciNetMATH

Title: Note: t for Two (Clusters)
Author: Stanley L. Sclove
Publication date: 11-07-2019
Publisher: Springer US
Published in: Journal of Classification / Issue 3/2019
Print ISSN: 0176-4268
Electronic ISSN: 1432-1343
DOI: https://doi.org/10.1007/s00357-019-09335-3

Springer Professional

Abstract

Please log in to get access to your license.

Dont have a licence yet? Then find out more about our products and how to get one now:

Springer Professional "Wirtschaft+Technik"

Springer Professional "Technik"

Springer Professional "Wirtschaft"

Other articles of this Issue 3/2019

MDCGen: Multidimensional Dataset Generator for Clustering

A Note on Applying the BCH Method Under Linear Equality and Inequality Constraints

Three-Way Symbolic Tree-Maps and Ultrametrics

A New Relationship Between Intuitionistic Fuzzy Sets and Genetics

Erratum to: A Framework for Quantifying Qualitative Responses in Pairwise Experiments

Quantum-Behaved Particle Swarm Optimization for Parameter Optimization of Support Vector Machine

Premium Partner