Background
Methods
Time series normalization
Distance measures
Euclidean distance
City block distance
Minkowski distance
Chebyshev distance
Cosine distance
Correlation distance
Spearman distance
Cophenetic correlation coefficient (CPCC)
Hierarchical clustering
Data set description
Results and discussion
Distance metric selection
Distance metric | Cophenetic correlation coefficient (CPCC) | ||||||
---|---|---|---|---|---|---|---|
Single | Complete | Average | Ward | Weighted | Median | Centroid | |
Euclidean | 0.784227 | 0.791608 |
0.826298
| 0.63921 | 0.760709 | 0.739566 | 0.812102 |
Cityblock | 0.795472 | 0.720603 | 0.822869 | 0.617237 | 0.814186 | 0.757599 | 0.800933 |
Minkowski | 0.784227 | 0.791608 |
0.826298
| 0.63921 | 0.760709 | 0.739566 | 0.812102 |
Chebychev | 0.747701 | 0.621421 | 0.78132 | 0.620067 | 0.756454 | 0.684826 | 0.730986 |
Cosine | 0.73324 | 0.73852 | 0.779876 | 0.63823 | 0.692975 | 0.70216 | 0.760326 |
Correlation | 0.733247 | 0.738506 | 0.779844 | 0.638203 | 0.692964 | 0.702147 | 0.760308 |
Spearman | 0.750498 | 0.528649 | 0.773012 | 0.576422 | 0.725262 | 0.742434 | 0.757063 |
Clustering result
Cluster Id | Name of districts |
---|---|
1 | Ahmedabad, Kutch, Surat |
2 | Amreli, Anand, Bharuch, Bhavnagar, Gandhinagar, Junagadh, Kheda, Mahesana, Patan, Rajkot, Surendranagar, Vadodara |
3 | BanasKantha, Dahod, Narmada, PanchMahals, Sabarkantha, Tapi |
4 | Jamnagar, Porbandar |
5 | Navsari, Valsad |
6 | The Dangs |