Skip to main content

2017 | OriginalPaper | Buchkapitel

Choice of Cumulative Percentage in Principal Component Analysis for Regionalization of Peninsular Malaysia Based on the Rainfall Amount

verfasst von : Shazlyn Milleana Shaharudin, Norhaiza Ahmad

Erschienen in: Modeling, Design and Simulation of Systems

Verlag: Springer Singapore

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config
loading …

Abstract

Principal Component Analysis (PCA) is a popular method used for reduction of large scale data sets in hydrological applications. Typically, PCA scores are applied to hierarchical cluster analysis to redefine region. However, the choice of cumulative percentage of variance for PCA scores and identifying the best number of clusters can be difficult. In this paper, we investigate the effect of determining the number of clusters by comparing (i) standardized and unstandardized PCA scores on different cumulative percentages of variance (ii) to determine number of clusters using Calinski and Harabasz Index. We have found that Calinski and Harabasz Index is most appropriate to determine the best number of clusters and that standardized PCA scores within the range of 65% to 70% cumulative percentage of variance give the most reasonable number of clusters.

Sie haben noch keine Lizenz? Dann Informieren Sie sich jetzt über unsere Produkte:

Springer Professional "Wirtschaft+Technik"

Online-Abonnement

Mit Springer Professional "Wirtschaft+Technik" erhalten Sie Zugriff auf:

  • über 102.000 Bücher
  • über 537 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Maschinenbau + Werkstoffe
  • Versicherung + Risiko

Jetzt Wissensvorsprung sichern!

Springer Professional "Technik"

Online-Abonnement

Mit Springer Professional "Technik" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 390 Zeitschriften

aus folgenden Fachgebieten:

  • Automobil + Motoren
  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Elektrotechnik + Elektronik
  • Energie + Nachhaltigkeit
  • Maschinenbau + Werkstoffe




 

Jetzt Wissensvorsprung sichern!

Springer Professional "Wirtschaft"

Online-Abonnement

Mit Springer Professional "Wirtschaft" erhalten Sie Zugriff auf:

  • über 67.000 Bücher
  • über 340 Zeitschriften

aus folgenden Fachgebieten:

  • Bauwesen + Immobilien
  • Business IT + Informatik
  • Finance + Banking
  • Management + Führung
  • Marketing + Vertrieb
  • Versicherung + Risiko




Jetzt Wissensvorsprung sichern!

Literatur
1.
Zurück zum Zitat Lattin, J.: Analyzing Multivariate Data. Curt Hinrichs, Canada (2003) Lattin, J.: Analyzing Multivariate Data. Curt Hinrichs, Canada (2003)
2.
Zurück zum Zitat Romero, R., Ramis, C., Guijarro, J.A.: Daily rainfall patterns in the spanish mediterranean area: an objective classification. Int. J. Climatol. 19, 95–112 (1999)CrossRef Romero, R., Ramis, C., Guijarro, J.A.: Daily rainfall patterns in the spanish mediterranean area: an objective classification. Int. J. Climatol. 19, 95–112 (1999)CrossRef
3.
Zurück zum Zitat Alvin, C.R.: Methods of Multivariate Analysis. Wiley, Hoboken (2002)MATH Alvin, C.R.: Methods of Multivariate Analysis. Wiley, Hoboken (2002)MATH
4.
Zurück zum Zitat Cliff, N.: Analyzing Multivariate Data. Harcourt Brace, San Diego (1987) Cliff, N.: Analyzing Multivariate Data. Harcourt Brace, San Diego (1987)
5.
Zurück zum Zitat Cattell, R.B.: The scree test for the number of factors. Multivar. Behav. Res. 1, 245–276 (1966)CrossRef Cattell, R.B.: The scree test for the number of factors. Multivar. Behav. Res. 1, 245–276 (1966)CrossRef
6.
Zurück zum Zitat Jollieffe, I.T.: Discarding variables in principal component analysis. I: artifical data. Appl. Stat. 21, 160–173 (1972)CrossRef Jollieffe, I.T.: Discarding variables in principal component analysis. I: artifical data. Appl. Stat. 21, 160–173 (1972)CrossRef
7.
Zurück zum Zitat Mimmack, G.M., Mason, S.J., Galpin, J.S.: Choice of distance matrices in cluster analysis: defining region. J. Clim. 14, 2790–2797 (2000)CrossRef Mimmack, G.M., Mason, S.J., Galpin, J.S.: Choice of distance matrices in cluster analysis: defining region. J. Clim. 14, 2790–2797 (2000)CrossRef
8.
Zurück zum Zitat Suhaila, J., Deni, S.M., Wan Zin, W.Z., Jemain, A.A.: Sains Malays. 39(4), 533–542 (2010) Suhaila, J., Deni, S.M., Wan Zin, W.Z., Jemain, A.A.: Sains Malays. 39(4), 533–542 (2010)
9.
Zurück zum Zitat Wilks, D.S.: Statistical Methods in the Atmospheric Sciences, p. 467. Academic Press, Cambridge (1995) Wilks, D.S.: Statistical Methods in the Atmospheric Sciences, p. 467. Academic Press, Cambridge (1995)
10.
Zurück zum Zitat Aldenderfer, M.S., Blashfield, R.K.: Cluster Analysis. Sage Publications, Inc., Beverly Hills (1984)CrossRefMATH Aldenderfer, M.S., Blashfield, R.K.: Cluster Analysis. Sage Publications, Inc., Beverly Hills (1984)CrossRefMATH
12.
Zurück zum Zitat Bunkers, M.J., Miller, J.R., DeGaaetano, A.T.: J. Clim. 9, 130–146 (1996)CrossRef Bunkers, M.J., Miller, J.R., DeGaaetano, A.T.: J. Clim. 9, 130–146 (1996)CrossRef
13.
Zurück zum Zitat Jolliffe, I.T.: Principal Component Analysis. Springer Series in Statistics, p. 271. Springer, Heidelberg (1986)CrossRef Jolliffe, I.T.: Principal Component Analysis. Springer Series in Statistics, p. 271. Springer, Heidelberg (1986)CrossRef
14.
Zurück zum Zitat Chang, W.C.: On using principal components before separating a mixture of two multivariate normal populations. J. Appl. Stat. 32, 267–275 (1983)CrossRefMATH Chang, W.C.: On using principal components before separating a mixture of two multivariate normal populations. J. Appl. Stat. 32, 267–275 (1983)CrossRefMATH
15.
Zurück zum Zitat Pelczer, I.J., Cisnerous-Iturbe, H.L.: Identification of rainfall patterns over the valley Mexico. In: 11th International Conference on Urban Drainage, pp. 1–9 (2008) Pelczer, I.J., Cisnerous-Iturbe, H.L.: Identification of rainfall patterns over the valley Mexico. In: 11th International Conference on Urban Drainage, pp. 1–9 (2008)
16.
17.
Zurück zum Zitat Donald, A.J.: Stopping rules in principal components analysis: a comparison of heuristical and statistical approaches. Ecology 74(8), 2204–2214 (1993)CrossRef Donald, A.J.: Stopping rules in principal components analysis: a comparison of heuristical and statistical approaches. Ecology 74(8), 2204–2214 (1993)CrossRef
18.
Zurück zum Zitat Grossman, G.D., Nickerson, D.M., Freeman, M.C.: Principal component analyses of assemblage structure data: utility of tests based on eigenvalues. Ecology 72, 341–347 (1991)CrossRef Grossman, G.D., Nickerson, D.M., Freeman, M.C.: Principal component analyses of assemblage structure data: utility of tests based on eigenvalues. Ecology 72, 341–347 (1991)CrossRef
19.
Zurück zum Zitat Rexstad, E.A., Miller, D.D., Flather, C.H., Anderson, E.M., Hupp, J.W., Anderson, D.R.: Questionable multivariate statistical inference in wildlife habitat and community studies. J. Wildl. Manag. 52, 794–798 (1988)CrossRef Rexstad, E.A., Miller, D.D., Flather, C.H., Anderson, E.M., Hupp, J.W., Anderson, D.R.: Questionable multivariate statistical inference in wildlife habitat and community studies. J. Wildl. Manag. 52, 794–798 (1988)CrossRef
20.
Zurück zum Zitat Mercedesm, D., Michael, E.P, Scott, S.: Defining Clusters of Related Industries (2013) Mercedesm, D., Michael, E.P, Scott, S.: Defining Clusters of Related Industries (2013)
21.
Zurück zum Zitat Baeriswyl, P.A., Rebetez, M.: Regionalization of precipitation in switzerland by means of principal component analysis. Theor. Appl. Climatol. 58, 31–41 (1997)CrossRef Baeriswyl, P.A., Rebetez, M.: Regionalization of precipitation in switzerland by means of principal component analysis. Theor. Appl. Climatol. 58, 31–41 (1997)CrossRef
22.
Zurück zum Zitat Bunkers, M.J., Miller, J.R., DeGaetano, A.T.: Definition of climate regions in the northern plains using an objective cluster modification technique. J. Clim. 9, 130–146 (1996)CrossRef Bunkers, M.J., Miller, J.R., DeGaetano, A.T.: Definition of climate regions in the northern plains using an objective cluster modification technique. J. Clim. 9, 130–146 (1996)CrossRef
23.
Zurück zum Zitat DeGaetano, A.T.: Delineation of mesoscale climate zones in the Northeastern United States using a novel approach to cluster analysis. J. Clim. 9, 1765–1782 (1996)CrossRef DeGaetano, A.T.: Delineation of mesoscale climate zones in the Northeastern United States using a novel approach to cluster analysis. J. Clim. 9, 1765–1782 (1996)CrossRef
Metadaten
Titel
Choice of Cumulative Percentage in Principal Component Analysis for Regionalization of Peninsular Malaysia Based on the Rainfall Amount
verfasst von
Shazlyn Milleana Shaharudin
Norhaiza Ahmad
Copyright-Jahr
2017
Verlag
Springer Singapore
DOI
https://doi.org/10.1007/978-981-10-6502-6_19

Premium Partner