Application of principal component analysis in grouping geomorphic parameters of a watershed for hydrological modeling

Sharma, S. K.; Gajbhiye, S.; Tignath, S.

doi:10.1007/s13201-014-0170-1


Original Article
Open access
Published: 20 March 2014

Volume 5, pages 89–96, (2015)

Applied Water Science


Abstract

Principal component analysis was applied to 13 dimensionless geomorphic parameters of 8 sub-watersheds of the Kanhiya Nala watershed, a tributary of the Tons River located in parts of Panna and Satna districts of Madhya Pradesh, India, to group the parameters under different components based on significant correlations. The results show that some of these parameters are strongly correlated with the components, but texture ratio and hypsometric integral do not correlate with any of the components and were therefore screened out of the analysis. The principal component loading matrix obtained from the correlation matrix of the remaining eleven parameters reveals that the first three components together account for 93.71 % of the total explained variance. Therefore, principal component loading is applied to get better correlation and to group the parameters clearly into physically significant components. Based on the properties of the geomorphic parameters, the three principal components were defined as drainage, slope or steepness, and shape components. One parameter from each of the significant components may form a set of independent parameters at a time in modeling hydrologic responses such as runoff and sediment yield from small watersheds.


Introduction

A watershed is an ideal unit for planning and management of land and water resources (Gajbhiye et al. 2013). It is a natural hydrological entity that drains surface runoff to a defined channel, drain, stream or river at a particular point (Chopra et al. 2005). Physiography, drainage, geomorphology, soil and land use/land cover are some of the parameters that play a significant role in watershed planning (Javed et al. 2011). Watershed management involves proper utilization of land, water, forest and soil resources. Therefore, a realistic assessment of the hydrological behavior of a watershed is important for developing an effective management plan. Various considerations may limit the implementation of management programs to only a few sub-watersheds, and it is always better to start management measures from the most critical sub-watershed. Sediment yield from a catchment is one of the main criteria for identifying the sub-watershed most critical with respect to soil erosion. However, assessing this criterion requires continuous monitoring of sediment samples at the catchment outlet, and such data are hardly available in India for small watersheds. Although the sediment yield from large basins can be obtained from such observations, it is not possible to ascertain the vulnerability to soil erosion of small watersheds within a basin. In the absence of sediment yield data, morphometric parameters may be helpful in identifying the most critical sub-watershed.

Morphometry is the measurement and mathematical analysis of the configuration of the earth's surface and of the shape and dimensions of its landforms (Clarke 1966). This analysis can be achieved through measurement of the linear, areal and relief aspects of a basin and its slope contributions. Morphometric analysis of a basin is better carried out with modern technologies such as Remote Sensing (RS) and Geographical Information Systems (GIS), as conventional measurement of these parameters is laborious and cumbersome. Many researchers have demonstrated the potential of RS and GIS techniques for morphometric analysis of watersheds (Shrimali et al. 2001; Thakker and Dhiman 2007; Sharma et al. 2010).

The method of quantitative analysis of watersheds was developed by Horton (1945) and further modified by Strahler (1964). Considerable work on the quantitative analysis of geomorphological parameters of watersheds has been done in India and abroad (Ghose et al. 1969). However, very little work has been carried out on the interrelationship of morphological parameters. Determining the interrelationship of these geomorphological parameters is very important for developing sediment yield regression models (hydrological modeling). Statistical methods are applied in a variety of fields in hydrological research. Factor analysis is useful for interpreting morphometric parameters and relating them to specific hydrological processes. Multivariate analysis is simply a collection of procedures for analyzing the associations between two or more sets of data that have been collected on each object in one or more samples of objects. Synder (1962) introduced some solutions and possibilities of multivariate statistics in hydrological modeling. Wong (1979) used a multivariate statistical technique, component analysis, to analyze the effects of twelve basin and climatological parameters. Wallis (1965), in a discussion of multivariate statistical methods in hydrology, recommends the use of principal component analysis with varimax rotation of the factor weight matrix for multifactor hydrological problems. Haan and Allen (1972) and Decoursy and Deal (1974) have also demonstrated the use of multiple regression analysis for development of hydrological prediction equations involving geomorphic parameters. Mishra and Satyanarayana (1988) carried out principal component analysis with varimax rotation on ten geomorphic parameters of the Damodar Valley catchment of India and concluded that nine parameters could be significantly grouped into three components. Singh et al. (2009) applied principal component analysis to 13 geomorphic parameters collected for sixteen watersheds of the Chambal catchment of Rajasthan and grouped the parameters into three components. Therefore, in this study an attempt has been made to determine the geomorphological parameters, to study the intercorrelation (multicollinearity) among the variables so that the less significant variables can be screened out of the analysis, and to arrange the remaining variables into physically significant groups by applying principal component analysis for better interpretability.

Materials and methods

The study area, the Kanhiya Nala watershed, lies within the Tons River catchment between 80°32′24″ and 80°34′12″E longitude and 24°06′ and 24°10′48″N latitude (Fig. 1). Its elevation ranges from 480 to 600 m above mean sea level, and it covers a total area of 25.58 km². The average annual rainfall is 1,300 mm. The watershed is situated in Satna and Panna districts of Madhya Pradesh.

The Survey of India (SOI) toposheet number 63 D/12 on 1:50,000 scale was used to prepare a base map. Satellite data from the IRS-P6 LISS-III sensor with 23.5 m resolution were used in the present study to prepare an updated drainage map. A False Color Composite (FCC) of the study area is presented in Fig. 2.

ArcGIS 9.1 and ERDAS Imagine 9.1 were used in the present study for generation of digital input maps, image processing and digital analysis of data. SPSS 14.0 was used for statistical analysis.

Watershed delineation from the topological data

The topological information of the study area was digitized from georeferenced SOI toposheets 63 D/12 using the ArcGIS tools. The watershed boundary, sub-watershed boundary, stream network and contours were digitized in vector form to generate digital input maps. The drainage was initially digitized from SOI toposheets and later updated using IRS-P6 LISS-III Satellite data. The digitized sub-watershed boundary, updated stream network and contour lines were used for further geomorphological analysis (Figs. 3, 4).

Geomorphic parameters

Geomorphologic characteristics play a vital role in the hydrological response of a watershed, and therefore a number of parameters that signify the watershed characteristics were evaluated from toposheets in a GIS environment. For the present study, the entire catchment of the Kanhiya Nala, which is a tributary of the Tons River of Madhya Pradesh, India, was discretized into eight sub-watersheds. The input parameters for the present study, such as area, perimeter, stream order, number of streams, stream length, elevation and basin length, were derived from the digitized stream network and contour map in a GIS environment. The geomorphic parameters for the discretized watershed area were calculated using the formulae suggested by Horton (1945), Strahler (1964), Schumm (1956) and Miller (1953), given in Table 1.

Table 1 Formula for computation of geomorphic parameters
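
As an illustration of how such parameters are derived from the GIS inputs, the sketch below computes several of them from area, perimeter, basin length, stream length, stream count and relief. The body of Table 1 is not reproduced here, so the expressions are the commonly cited textbook forms attributed to Horton, Miller, Schumm and Strahler and are an assumption about what the table contains; some authors scale relative relief by 100, for example. The function name, argument names and sample values are hypothetical.

```python
import math


def geomorphic_parameters(area, perimeter, basin_length,
                          total_stream_length, n_streams, relief):
    """Commonly used geomorphic parameters (assumed textbook definitions).

    Lengths are in km, area in km^2 and relief (maximum minus minimum
    elevation) in km, so that the ratios below are unit-consistent.
    """
    drainage_density = total_stream_length / area
    return {
        "drainage_density": drainage_density,
        "stream_frequency": n_streams / area,
        "form_factor": area / basin_length ** 2,
        "circulatory_ratio": 4 * math.pi * area / perimeter ** 2,
        "elongation_ratio": 2 * math.sqrt(area / math.pi) / basin_length,
        "compactness_coefficient": perimeter / (2 * math.sqrt(math.pi * area)),
        "length_of_overland_flow": 1 / (2 * drainage_density),
        "relief_ratio": relief / basin_length,
        "relative_relief": relief / perimeter,
        "ruggedness_number": relief * drainage_density,
    }


# Hypothetical sub-watershed; these numbers are NOT taken from Table 2.
params = geomorphic_parameters(area=3.2, perimeter=8.5, basin_length=3.0,
                               total_stream_length=9.6, n_streams=24,
                               relief=0.06)
for name, value in params.items():
    print(f"{name}: {value:.3f}")
```

A quick sanity check on the shape parameters is that a circular basin gives a circulatory ratio and a compactness coefficient of exactly 1.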


Hypsometric analysis of a drainage basin is carried out to develop the relationship between the horizontal cross-sectional area of the basin and elevation. In this analysis, a curve is derived by plotting the relative height (h/H) against the relative area (a/A); the resulting curve is called the hypsometric curve (Suresh 1997). The shape of the hypsometric curve varies in the early geologic stages of development of the drainage basin, but once a steady state is attained it tends to vary little despite lowering relief (Kumar 1991; Suresh 1997).
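
The hypsometric integral used as one of the 13 parameters is conventionally the area under this curve. The following is a minimal sketch of that calculation using the trapezoidal rule; the function name and the sample curve are hypothetical, and the paper does not state which numerical method the authors used.

```python
import numpy as np


def hypsometric_integral(relative_area, relative_height):
    """Area under the hypsometric curve (h/H against a/A), trapezoidal rule."""
    a = np.asarray(relative_area, dtype=float)
    h = np.asarray(relative_height, dtype=float)
    order = np.argsort(a)          # integrate along increasing a/A
    a, h = a[order], h[order]
    widths = a[1:] - a[:-1]
    mean_heights = (h[1:] + h[:-1]) / 2.0
    return float(np.sum(widths * mean_heights))


# Hypothetical straight-line curve from (0, 1) to (1, 0), not measured data.
a_over_A = [0.0, 0.25, 0.5, 0.75, 1.0]
h_over_H = [1.0, 0.75, 0.5, 0.25, 0.0]
print(hypsometric_integral(a_over_A, h_over_H))
```

For the straight-line curve in the example the integral is 0.5; curves that bulge above or sag below this line give larger or smaller values respectively.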

Principal component analysis

The method of principal components, or component analysis, is based upon the early work of Pearson, with the specific adaptation to principal component analysis suggested by Hotelling (1933). Geomorphometric parameters are often mutually correlated. Such correlation indicates that some of the information contained in one variable is also contained in some of the other variables. Principal component analysis replaces the correlated variables with uncorrelated linear combinations of them. More specifically, the first principal component is the linear combination of the original variables that contributes a maximum to their total variance; the second principal component, uncorrelated with the first, contributes a maximum to the residual variance, and so on until the total variance is analyzed. Since the method depends on the total variance of the original variables, it is most suitable when all the variables are measured in the same units. Hence, it is customary to express the variables in standard form, i.e., to select the unit of measurement for each variable so that its sample variance is one. The analysis is then made on the correlation matrix, with the total variance equal to n, the number of variables. The objectives are achieved in three steps:

Step 1 Calculate the correlation matrix, R.
Step 2 Calculate the principal component loading matrix by principal component analysis.
Step 3 In the principal component (PC) loading matrix, components with eigenvalues greater than 1 are taken as significant.

The eigenvalue of a component indicates how much of the total variance of all the geomorphic parameters that component accounts for.
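
The eigenvalue rule in Step 3 can be sketched as follows, assuming the correlation matrix R is already available. The function name and the 3 × 3 example matrix are hypothetical; the study itself used SPSS 14.0, so this is an illustration of the rule and not the authors' procedure.

```python
import numpy as np


def significant_components(R):
    """Eigenvalues of R in descending order, percent and cumulative percent
    of total variance, and a mask for components with eigenvalue > 1."""
    eigenvalues = np.linalg.eigvalsh(np.asarray(R, dtype=float))[::-1]
    percent = 100.0 * eigenvalues / eigenvalues.sum()
    cumulative = np.cumsum(percent)
    keep = eigenvalues > 1.0
    return eigenvalues, percent, cumulative, keep


# Hypothetical correlation matrix of three equally inter-correlated parameters.
R_example = np.array([[1.0, 0.5, 0.5],
                      [0.5, 1.0, 0.5],
                      [0.5, 0.5, 1.0]])
eigenvalues, percent, cumulative, keep = significant_components(R_example)
print("eigenvalues:", np.round(eigenvalues, 3))
print("percent of variance:", np.round(percent, 2))
print("retained:", keep)
```

Because the eigenvalues of a correlation matrix sum to the number of variables, an eigenvalue above 1 means the component explains more variance than any single standardized parameter does on its own.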

1. Correlation matrix

The inter-correlation matrix of the geomorphic parameters is obtained using the following procedure:

(a) The parameters are standardized:
$$X = \left( {x_{ij} - \bar{x}_{j} } \right)/S_{j}$$
(1)
where X denotes the matrix of standardized parameters, x_ij the ith observation on the jth parameter, i = 1…N (number of observations), j = 1…P (number of parameters), x̄_j the mean of the jth parameter and S_j the standard deviation of the jth parameter.

(b) The correlation matrix of the parameters is the minor product moment of the standardized predictor measures divided by N and is given by
$$R = \left( {X^{\prime} \times X} \right)/N$$
(2)
where X′ denotes the transpose of the standardized matrix of predictor parameters.

2. Principal component loading matrix

The principal component loading matrix, which reflects how strongly each parameter is correlated with each component, is obtained by multiplying the matrix of characteristic vectors (eigenvectors) of the correlation matrix by the square root of the diagonal matrix of its characteristic values (eigenvalues).

Thus,
$$A = Q \times D^{0.5}$$
(3)
where A is the principal component loading matrix, Q the matrix of characteristic vectors of the correlation matrix and D the diagonal matrix of characteristic values of the correlation matrix.
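
Equations (1) to (3) can be traced numerically with a short NumPy sketch. It assumes the standard deviation in Eq. (1) is the population form (division by N), which is what makes the diagonal of R in Eq. (2) equal to one. The function names and the random data matrix are hypothetical stand-ins for the sub-watershed data of Table 3.

```python
import numpy as np


def correlation_matrix(x):
    """Eq. (1) and Eq. (2): standardize the columns, then R = X'X / N."""
    x = np.asarray(x, dtype=float)
    n_obs = x.shape[0]
    # ddof=0 (division by N) is assumed so that Eq. (2) yields a unit diagonal.
    X = (x - x.mean(axis=0)) / x.std(axis=0, ddof=0)
    R = X.T @ X / n_obs
    return X, R


def loading_matrix(R):
    """Eq. (3): A = Q D^0.5, with components sorted by decreasing eigenvalue."""
    eigenvalues, Q = np.linalg.eigh(R)          # R is symmetric
    order = np.argsort(eigenvalues)[::-1]
    eigenvalues, Q = eigenvalues[order], Q[:, order]
    eigenvalues = np.clip(eigenvalues, 0.0, None)  # guard against -1e-16 round-off
    A = Q * np.sqrt(eigenvalues)                # scales column k by sqrt(eigenvalue k)
    return eigenvalues, A


# Hypothetical data: 8 observations (sub-watersheds) of 4 parameters.
rng = np.random.default_rng(0)
x_demo = rng.normal(size=(8, 4))
X_demo, R_demo = correlation_matrix(x_demo)
eigenvalues_demo, A_demo = loading_matrix(R_demo)

print(np.allclose(R_demo, np.corrcoef(x_demo, rowvar=False)))  # agrees with NumPy
print(np.allclose(A_demo @ A_demo.T, R_demo))                  # loadings reproduce R
```

Each entry of A is the correlation between a parameter and a component, which is why the loadings are read against thresholds such as 0.90 and 0.60. The sign of each column is arbitrary, so only the magnitude and the relative signs within a column are meaningful.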

Result and discussion

Morphometric parameters of sub-watersheds were calculated in GIS environment and are presented in Table 2 and computed geomorphometric parameters are presented in Table 3.

Table 2 Sub-watershed wise input geomorphic parameters


Table 3 Sub-watershed wise computed geomorphic parameters


The correlation matrix (Table 4) of the 13 geomorphic parameters reveals that strong correlations (correlation coefficient more than 0.9) exist between bifurcation ratio, form factor and elongation ratio; between drainage density and length of overland flow; between circulatory ratio and compactness coefficient; between form factor and elongation ratio; and between relative relief and relief ratio. Good correlations (correlation coefficient more than 0.75) exist between bifurcation ratio, stream frequency and ruggedness number; between texture ratio, relief ratio and hypsometric integral; between stream frequency, form factor and elongation ratio; and between circulatory ratio and ruggedness number. Moderate correlations (correlation coefficient more than 0.60) exist between bifurcation ratio and ruggedness number; between drainage density and stream frequency; between texture ratio, form factor and elongation ratio; between stream frequency and length of overland flow; between form factor and circulatory ratio; between circulatory ratio and elongation ratio; between relative relief and hypsometric integral; and between relief ratio and hypsometric integral.
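
Reading these three classes off a correlation matrix by eye is error-prone, so a small helper can list the pairs per class. The sketch below uses the thresholds quoted above (0.90, 0.75 and 0.60) and compares them with the absolute value of the coefficient; treating negative correlations by magnitude is an assumption, since the text does not say how signs were handled. The function name and the example matrix are hypothetical and do not reproduce Table 4.

```python
def classify_pairs(R, names):
    """Group parameter pairs by the strength of their correlation."""
    pairs = {"strong": [], "good": [], "moderate": []}
    n = len(names)
    for i in range(n):
        for j in range(i + 1, n):       # upper triangle only, no self-pairs
            r = abs(R[i][j])            # assumption: classify by magnitude
            if r > 0.90:
                label = "strong"
            elif r > 0.75:
                label = "good"
            elif r > 0.60:
                label = "moderate"
            else:
                continue
            pairs[label].append((names[i], names[j], R[i][j]))
    return pairs


# Hypothetical 3 x 3 correlation matrix, not values from Table 4.
names_demo = ["drainage density", "length of overland flow", "stream frequency"]
R_demo_pairs = [[1.00, -0.95, 0.70],
                [-0.95, 1.00, -0.65],
                [0.70, -0.65, 1.00]]
for label, found in classify_pairs(R_demo_pairs, names_demo).items():
    print(label, found)
```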

Table 4 Inter-correlation matrix of 13 geomorphic parameters


It is very difficult at this stage to group the parameters into components and attach any physical significance to them. Hence, in the next step, the correlation matrix is subjected to principal component analysis.

The principal component loading matrix obtained from the correlation matrix (Table 5) reveals that the first three components, whose eigenvalues are greater than 1, together account for about 91.458 % of the total explained variance. The first component is strongly correlated (loading of more than 0.90) with circulatory ratio and compactness coefficient and moderately correlated (loading of more than 0.60) with form factor and elongation ratio, and may be termed the shape component. The second component is strongly correlated with relief ratio and ruggedness number, with a further good correlation (loading of more than 0.80), and may be termed the slope or steepness component. The third component is strongly correlated with drainage density and length of overland flow, shows good correlation with bifurcation ratio and is moderately correlated with stream frequency. It is evident from these results that some of the parameters are highly correlated with some of the components, but texture ratio and hypsometric integral could not be grouped with any of the components because of their poor correlation with them.

Table 5 Principal component loading matrix of 13 geomorphic parameters


Because texture ratio and hypsometric integral contribute little to explaining the component variance, they were screened out of the analysis. The correlation matrix and the principal component loading matrix were then obtained for the remaining eleven parameters.

The principal component loading matrix obtained using the correlation matrix of the eleven parameters (Table 6) reveals that the first three components now together account for 94.491 % of the total explained variance, an increase of about 3.033 percentage points over the 91.458 % obtained with 13 parameters.

Table 6 Principal component loading matrix of the eleven geomorphic parameters retained after screening


The principal component loadings also improved considerably for almost all significant parameters. The circulatory ratio and compactness coefficient have strong correlation (loadings of more than 0.90) with the first component. The elongation ratio and form factor have moderate correlation (loadings of more than 0.60) with the first component. The relative relief, relief ratio and ruggedness number have strong correlation with the second component. The bifurcation ratio, drainage density and length of overland flow have strong correlation (loadings of more than 0.90) with the third component. The stream frequency has moderate correlation (loadings of more than 0.60) with the third component.

It is observed that the first component is strongly correlated with circulatory ratio and compactness coefficient and shows good correlation with ruggedness number; it is grouped as the shape component. The second component has strong correlation with relative relief, relief ratio and ruggedness number and is termed the slope or steepness component. The third component has strong correlation with bifurcation ratio, drainage density and length of overland flow and moderate correlation with stream frequency, and hence is called the drainage component.

These results show how useful principal component analysis is in screening out the parameters or variables of least significance and regrouping the remaining variables into physically significant factors. Multiple regression techniques can then be applied in modeling hydrological responses such as surface runoff and sediment yield from the watersheds. One parameter from each of the significant components may form a set of independent parameters at a time in modeling these hydrologic responses.
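
The selection rule in this paragraph can be sketched as follows: each parameter is assigned to the component on which it loads most heavily, provided the loading exceeds a threshold, and every combination of one parameter per component is a candidate set of regression predictors. The 0.60 threshold follows the "moderate" cut-off used above, the assignment rule is an assumption about how the grouping would be automated, and the loading values are illustrative numbers that are not taken from Table 6.

```python
from itertools import product

import numpy as np


def group_parameters(loadings, names, threshold=0.60):
    """Assign each parameter to the component with its largest |loading|.

    Parameters whose largest |loading| does not exceed the threshold are
    left ungrouped, which corresponds to screening them out.
    """
    A = np.abs(np.asarray(loadings, dtype=float))
    groups = {k: [] for k in range(A.shape[1])}
    for i, name in enumerate(names):
        k = int(np.argmax(A[i]))
        if A[i, k] > threshold:
            groups[k].append(name)
    return groups


def candidate_predictor_sets(groups):
    """All sets formed by taking one parameter from each component."""
    return list(product(*groups.values()))


# Illustrative loadings (rows: parameters, columns: components 1 to 3).
names_sel = ["circulatory ratio", "compactness coefficient", "relief ratio",
             "ruggedness number", "drainage density", "texture ratio"]
loadings_sel = [[0.95, 0.10, 0.05],
                [-0.93, 0.12, 0.08],
                [0.10, 0.92, 0.15],
                [0.20, 0.91, 0.10],
                [0.05, 0.12, 0.94],
                [0.40, 0.35, 0.30]]

groups_sel = group_parameters(loadings_sel, names_sel)
for predictors in candidate_predictor_sets(groups_sel):
    print(predictors)
```

In this illustration the last parameter exceeds the threshold on no component and is left out, and the remaining five parameters yield four candidate predictor sets of three parameters each.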

Conclusion

In the present study, 13 geomorphic parameters were evaluated for eight discretized sub-watersheds of the Kanhiya Nala watershed, located in parts of Panna and Satna districts of Madhya Pradesh, India, and subjected to principal component analysis. The correlation matrix of the 13 geomorphic parameters revealed that strong correlations (correlation coefficient more than 0.9) exist between bifurcation ratio, form factor and elongation ratio; between drainage density and length of overland flow; between circulatory ratio and compactness coefficient; between form factor and elongation ratio; and between relative relief and relief ratio. The principal component loading matrix obtained from the correlation matrix reveals that the first three components, whose eigenvalues are greater than 1, together account for about 91.458 % of the total explained variance. Based on the results of the principal component analysis, the first component is strongly correlated with circulatory ratio and compactness coefficient, the second component is strongly correlated with relief ratio and ruggedness number, and the third component is strongly correlated with drainage density and length of overland flow. The texture ratio and hypsometric integral could not be grouped with any of the components because of their poor correlation with them. After screening out these parameters, the principal component loading matrix of eleven parameters indicates that the first three components together account for 94.491 % of the total explained variance. Based on the properties of the geomorphic parameters, the three principal components were defined as drainage, slope or steepness, and shape components. One parameter from each of the significant components may form a set of independent parameters at a time in modeling hydrologic responses such as runoff and sediment yield from small watersheds. Principal component analysis is a good tool for screening out the insignificant parameters from the analysis.

References

Chopra R, Dhiman RD, Sharma PK (2005) Morphometric analysis of sub-watersheds in Gurudaspur district, Punjab using remote sensing and GIS techniques. J Indian Soc Remote Sens 33(4):531–539
Clarke JI (1966) Morphometry from maps. Essays in geomorphology. Elsevier Publishing Co, New York, pp 235–274
Decoursy D, Deal RB (1974) General aspect of multivariate analysis with application to some problems in hydrology. In: Proceedings of symposium on statistical hydrology, USDA, miscellaneous publication No. 1275. Washington DC, pp 47–68
Gajbhiye S, Mishra SK, Pandey A (2013) Prioritizing erosion-prone area through morphometric analysis: an RS and GIS perspective. Appl Water Sci (Springer). doi:10.1007/s13201-013-0129-7
Ghose B, Pandey S, Singh S (1969) Quantitative geomorphology of the drainage basin in semi arid environment. Ann Arid Zone 1:37–44
Haan CT, Allen DM (1972) Comparison of multiple regression and principal component regression for predicting water yields in Kentucky. Water Resour Res 8(6):1593–1596
Horton RE (1945) Erosional development of streams and their drainage basins: a hydrophysical approach to quantitative morphology. Geol Soc Am Bull 56:275–370
Hotelling H (1933) Analysis of a complex of statistical variables into principal components. J Educ Psychol 24:417–441, 498–520
Javed A, Khamday AY, Rais S (2011) Watershed prioritization using morphometric and land use/land cover parameters: a remote sensing and GIS based approach. J Geo Soc India 78:63–75
Kumar V (1991) Hydrologic response models for prediction of runoff and sediment yield from small watersheds. Unpublished Ph.D Thesis. Indian Institute of Technology, Kharagpur, India, p 350
Miller VC (1953) A quantitative geomorphic study of drainage basin characteristics in the Clinch mountain area, Virginia and Tennessee. Department of Navy, Office of Naval Research, Technical Report 3, Project NR 389-042, Washington DC
Mishra N, Satyanarayana T (1988) Parameter grouping—a prelude to hydrologic modeling. Indian J Power River Val Dev 256–260
Schumm SA (1956) Evolution of drainage systems and slopes in badlands at Perth Amboy, New Jersey. Geol Soc Am Bull 67:597–646
Sharma SK, Rajput GS, Tignath S, Pandey RP (2010) Morphometric analysis of a watershed using GIS. J Indian Water Res Soc 30(2):33–39
Shrimali SS, Aggarwal SP, Samra JS (2001) Prioritizing erosion-prone areas in hills using remote sensing and GIS—a case study of the Sukhna Lake catchment, Northern India. Int J Appl Earth Obs and Geoinf 2(1):54–60
Singh PK, Kumar V, Purohit RC, Kothari M, Dashora PK (2009) Application of principal component analysis in grouping geomorphic parameters for hydrologic modeling. Water Resour Manage 23:325–339
Strahler AN (1964) Quantitative geomorphology of drainage basins and channel networks. Section 4-II. In: Chow VT (ed) Handbook of applied hydrology. McGraw-Hill, USA, pp 4–39
Suresh R (1997) Soil and water conservation engineering. Standard Publishers Distributors, New Delhi, p 973
Synder WM (1962) Some possibilities for multivariate analysis in hydrologic studies. J Geophys Res 62(2):721–729
Thakker AK, Dhiman SP (2007) Morphometric analysis and prioritization of miniwatersheds in Mohr watershed, Gujarat using Remote sensing and GIS techniques. J Indian Soc Remote Sens 35(4):313–321
Wallis RJ (1965) Multivariate statistical methods in hydrology—a comparison using data of known functional relationship. Water Resour Res 1:447–467
Wong ST (1979) A multivariate statistical model for predicting mean annual flood in New England. Ann Assoc Am Geogr 53:293–311

Author information

Authors and Affiliations

Department of Soil and Water Engineering, College of Agricultural Engineering, J.N.K.V.V, Jabalpur (M.P.), India
S. K. Sharma
Department of Water Resources Development and Management, Indian Institute of Technology, Roorkee (U.K.), India
S. Gajbhiye
Department of Geology, Government Science College, Jabalpur (M.P.), India
S. Tignath


Corresponding author

Correspondence to S. K. Sharma.

Rights and permissions

This article is published under license to BioMed Central Ltd. Open Access: This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

About this article

Cite this article

Sharma, S.K., Gajbhiye, S. & Tignath, S. Application of principal component analysis in grouping geomorphic parameters of a watershed for hydrological modeling. Appl Water Sci 5, 89–96 (2015). https://doi.org/10.1007/s13201-014-0170-1


Received: 06 October 2013
Accepted: 23 February 2014
Published: 20 March 2014
Issue Date: March 2015
DOI: https://doi.org/10.1007/s13201-014-0170-1
