Skip to main content

Advertisement

Log in

Comprehensive assessment of flood risk using the classification and regression tree method

  • Original Paper
  • Published:
Stochastic Environmental Research and Risk Assessment Aims and scope Submit manuscript

Abstract

The Evaluation of flood risk is a difficult task due to its numerous and complex impact factors. This article built a classification and regression tree (CART) model for the flood risk assessment with the available data of Hunan Province. This model is able to extract the major impact factors from many complex variables, determine the factors’ thresholds, and evaluate the levels of flood risk objectively. To construct the model, 18 explanatory variables were selected as the influential factors, including meteorological conditions, surface conditions and social vulnerability. Economic loss density from flood was chosen as the response variable for the quantitative and comprehensive evaluation of flood risk. The final model showed that meteorological conditions have the most significant influence on flood risk. Additionally, the relationship between meteorological factors and flood risk is rather complex. The variability of rainstorm days during the seasonal alternate period from the end of spring (May) to the early summer (June) is the source of the highest flood risk. In addition, the regional embankment density and population density as social vulnerability indicators and the relief degree of land surface as a surface condition indicator were also included in the flood risk assessment for Hunan. A region with dense dams appeared at a relatively higher risk. Densely inhabited areas with greater topographical relief also demonstrated a higher flood risk in the study area. The conditions obtained from the final tree for different levels of risk demonstrate the objectivity of selecting impact factors and a reduction of complexity for the risk evaluation process. Furthermore, the evaluation of high-level risk using the proposed method requires fewer conditions, which allows for a rapid risk assessment of serious floods. The CART method shows a decreased root mean squared error compared with that of a multiple linear regression model. In addition, the cross-validation error was improved for the high-risk levels that represent the most important classes in risk management. The verification with the available historical records showed that the output of the model is reliable. In summary, the CART method is feasible for extracting the main impact factors and their associated thresholds for the comprehensive assessment of regional flood risk.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  • Alkema D (2003) Flood risk assessment for EIA; an example of a motorway near Trento, Italy. Stud Trent Sci Nat Acta Geol 78:147–153

    Google Scholar 

  • Apel H, Thieken AH, Merz B, Bloschl G (2006) A probabilistic modelling system for assessing flood risks. Nat Hazards 38:79–100

    Article  Google Scholar 

  • Arlot S (2010) A survey of cross-validation procedures for model selection. Stat Surv 4:40–90

    Article  Google Scholar 

  • Atmospheric Science Dictionary Editorial Board (2006) China’s meteorological disaster dictionary, volume of Hunan. China Meteorological Press, Beijing (in Chinese)

    Google Scholar 

  • Bakeev KA (2010) Process analytical technology: spectroscopic tools and implementation strategies for the chemical and pharmaceutical industries. CPI Antony Rowe, Chippenham

    Book  Google Scholar 

  • Barthès B, Roose E (2002) Aggregate stability as an indicator of soil susceptibility to runoff and erosion; validation at several levels. Catena 47:133–149

    Article  Google Scholar 

  • Bouman BAM, Humphreys E, Tuong TP, Barker R (2007) Rice and water. Adv Agron 92:187–237

    Article  CAS  Google Scholar 

  • Breiman L, Friedman JH, Olshen RA, Stone CJ (1984) Classification and regression trees. The Wadsworth statistics/probability series. Chapman and Hall, New York

    Google Scholar 

  • Camarasa AM, López-García MJ, Soriano-García J (2011) Mapping temporally-variable exposure to flooding in small Mediterranean basins using land-use indicators. Appl Geogr 31:136–145

    Article  Google Scholar 

  • Cutter SL, Finch C (2008) Temporal and spatial changes in social vulnerability to natural hazards. PNAS 105:2301–2306

    Article  CAS  Google Scholar 

  • Cutter SL, Boruff BJ, Shirley WL (2003) Social vulnerability to environmental hazards. Soc Sci Q 84:242–261

    Article  Google Scholar 

  • Dawson R, Hall J, Sayers P, Bates P, Rosu C (2005) Sampling-based flood risk analysis for fluvial dike systems. Stoch Environ Res Risk Assess 19:388–402

    Article  Google Scholar 

  • de Moel H, Aerts JCJH, Koomenb E (2010) Development of flood exposure in the Netherlands during the 20th and 21st century. Glob Environ Change 21:620–627

    Google Scholar 

  • Deconinck E, Hancock T, Coomans D, Massart DL, Heyden YV (2005) Classification of drugs in absorption classes using the classification and regression trees (CART) methodology. J Pharm Biomed 39:91–103

    Article  CAS  Google Scholar 

  • Erpicum S, Dewals B, Archambeau P, Detrembleur S, Pirotton M (2010) Detailed inundation modelling using high resolution DEMs. Eng Appl Comput Fluid Mech 4:196–208

    Google Scholar 

  • Federal Emergency Management Agency (FEMA) (2003) Hazus-MH MR5 multi-hazard loss estimation model methodology—flood model, user manual. Department of Homeland, FEMA, Washington, DC

  • Feng ZM, Tang Y, Yang YZ, Zhang D (2008) Relief degree of land surface and its influence on population distribution in China. J Geogr Sci 18:237–246

    Article  Google Scholar 

  • Gilard O, Givone P (1997) Flood risk management: new concepts and methods for objective negotiations. In: Proceedings of the destructive water: water-caused natural disasters, their abatement and control, IAHS Pulb 239, IAHS Press, Wallingford, pp 145–155

  • Hastie T, Tibshirani R, Friedman J (2008) The elements of statistical learning: data mining, inference, and prediction. Springer Series in Statistics. Springer, Stanford, pp 305–308

    Google Scholar 

  • Horritt MS, Bates PD (2002) Evaluation of 1D and 2D numerical models for predicting river flood inundation. J Hydrol 268:87–99

    Article  Google Scholar 

  • Huang YM, Zhang XP, Liu F, Zhang JM, Huang YB (2007) Precipitation change in Hunan and its relationship to that in India. Trop Geogr 27(5):416–419

    Google Scholar 

  • Huang X, Tan HZ, Zhou J, Yang TB, Benjamin A, Wen SW, Li SQ, Liu AZ, Li XH, Fen SD, Li XL (2008) Flood hazard in Hunan province of China: an economic loss analysis. Nat Hazards 47:65–73

    Article  Google Scholar 

  • Hunan Bureau of Statistics (2012) 2011 Hunan statistical yearbook. China Statistics Press, Beijing

    Google Scholar 

  • Ian CF (2008) Geomorphic impacts of a 100-year flood: Kiwitea Stream, Manawatu catchment, New Zealand. Geomorphology 98:84–95

    Article  Google Scholar 

  • Ito A, Hayashi M, Hamasaki T, Ebisu S (2011) Risk assessment of dental caries by using classification and regression trees. J Dent 39:457–463

    Article  Google Scholar 

  • Jenks GF (1967) The data model concept in statistical mapping. Int Yearb Cartogr 7:186–190

    Google Scholar 

  • Jonkman SN, Kok M, Vrijling JK (2008) Flood risk assessment in the Netherlands: a case study for dike ring South Holland. Risk Anal 28:1357–1373

    Article  Google Scholar 

  • Kandilioti G, Makropoulos C (2012) Preliminary flood risk assessment: the case of Athens. Nat Hazards 61:441–468

    Article  Google Scholar 

  • Kim ES, Choi H (2011) Assessment of vulnerability to extreme flash floods in design storms. Int J Environ 8:2907–2922

    Google Scholar 

  • Knotters M, BRUS DJ, Voshaar JHO (1995) A comparison of Kring, Co-Kring and Kring combined with regression for spatial interpolation of horizon depth with censored observations. Geoderma 67:227–246

    Article  Google Scholar 

  • Kohavi R (1995) A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Proceedings of the fourteenth international joint conference on artificial intelligence, vol 12, Montreal, pp 1137–1143

  • Leinster P (2009) Flooding in England: a national assessment of flood risk. Environment Agency, Almondsbury

    Google Scholar 

  • Li K, Li GS (2011) Vulnerability assessment of storm surges in the coastal area of Guangdong Province. Nat Hazards Earth Syst Sci 11:2003–2010

    Article  CAS  Google Scholar 

  • Li LF, Wang JF, Leung H, Jiang CS (2010) Assessment of catastrophic risk using Bayesian network constructed from domain knowledge and spatial data. Risk Anal 30:1157–1175

    Article  Google Scholar 

  • Livingstone DJ, Salt DW (2005) Judging the significance of multiple linear regression models. J Med Chem 48:661–663

    Article  CAS  Google Scholar 

  • McCarthy JJ, Canziani OF, Leary NA, Dokken DJ, White KS (2001) Climate change 2001: impacts, adaptation and vulnerability. Cambridge University Press, Cambridge

    Google Scholar 

  • Mendes BVD, Pericchi LR (2009) Assessing conditional extremal risk of flooding in Puerto Rico. Stoch Environ Res Risk Assess 23:399–410

    Article  Google Scholar 

  • Meyer V, Scheuer S, Haase D (2009) A multicriteria approach for flood risk mapping exemplified at the Mulde river, Germany. Nat Hazards 48:17–39

    Article  Google Scholar 

  • Miao HJ (1988) Land and water resources in the Pearl River Delta. Sun Yat-sen University Press, Guangzhou (in Chinese)

    Google Scholar 

  • Nirel R, Dayan U (2001) On the ratio of sulfur dioxide to nitrogen oxides as an indicator of air pollution sources. J Appl Meteorol 40:1209–1222

    Article  Google Scholar 

  • Okazawa Y, Yeh PJF, Kanae S, Oki T (2011) Development of a global flood risk index based on natural and socio-economic factors. Hydrol Sci J 56:789–804

    Article  Google Scholar 

  • Parry ML, Canziani OF, Palutikof JP, van der Linden PJ, Hanson CE (2007) IPCC fourth assessment report: climate change 2007 (AR4): working Group II report, impacts, adaptation and vulnerability. Cambridge University Press, Cambridge

    Google Scholar 

  • Put R, Perrin C, Questier F, Coomans D, Massart DL, Heyden YV (2003) Classification and regression tree analysis for molecular descriptor selection and retention prediction in chromatographic quantitative structure-retention relationship studies. J Chromatogr A 988:261–276

    Article  CAS  Google Scholar 

  • Rovlias A, Kotsou S (2004) Classification and regression tree for prediction of outcome after severe head injury using simple clinical and laboratory variables. J Neurotraum 21:886–893

    Article  CAS  Google Scholar 

  • Schmidtlein MC, Deutsch RC, Piegorsch WW, Cutter SL (2008) A sensitivity analysis of the social vulnerability index. Risk Anal 28:1099–1114

    Article  Google Scholar 

  • Seifert I, Thieken AH, Merz M, Borst D, Werner U (2010) Estimation of industrial and commercial asset values for hazard risk assessment. Nat Hazards 52:453–479

    Article  Google Scholar 

  • Tachiiri K, Shinoda M (2011) Quantitative risk assessment for future meteorological disasters reduced livestock mortality in Mongolia. Clim Change 113(3−4):867−882

    Google Scholar 

  • Tachiiri K, Shinoda M, Klinkenberg B, Morinaga Y (2008) Assessing Mongolian snow disaster risk using livestock and satellite data. J Arid Environ 72:2251–2263

    Article  Google Scholar 

  • Takar AA, Dobrowolski JP, Thurow TL (1990) Influence of grazing, vegetation life-form, and soil type on infiltration rates and interrill erosion on a Somalion rangeland. J Range Manage 43:486–490

    Article  Google Scholar 

  • Thieken AH, Müller M, Kleist L, Seifert I, Borst D, Werner U (2006) Regionalisation of asset values for risk analyses. Nat Hazards Earth Syst Sci 6:167–178

    Article  Google Scholar 

  • Thuring F, Gustafsson J (2008) A suitable parametric model for operational risk applications. Working paper, SSRN

  • Unami K, Abagale FK, Yangyuoru M, Alam AMB, Kranjac-Berisavljevic G (2010) A stochastic differential equation model for assessing drought and flood risks. Stoch Environ Res Risk Assess 24:725–733

    Article  Google Scholar 

  • Vandeginste BGM, Massart DL, Buydens LMC, Jong SD, Lewi PJ, Smeyers-Verbeke J (1997) Handbook of chemometrics and qualimetrics—Part B. Elsevier, Amsterdam

    Google Scholar 

  • Vayssiéres MP, Plant RE, Allen-Diaz BH (2000) Classification trees: an alternative non-parametric approach for predicting species distributions. J Veg Sci 11:679–694

    Article  Google Scholar 

  • Venables WN, Ripley BD (2002) Tree based methods. In: modern applied statistics with S, 4th edn. Springer, New York, pp 251–270

  • Vorogushyn S, Merz B, Lindenschmidt KE, Apel H (2010) A new methodology for flood hazard assessment considering dike breaches. Water Resour Res 46:1–17

    Article  Google Scholar 

  • Wang YM, Li ZW, Tang ZH, Zeng GM (2011) A GIS-based spatial multi-criteria approach for flood risk assessment in the Dongting Lake region, Hunan, Central China. Water Resour Manag 25:3465–3484

    Article  Google Scholar 

  • Wheater HS, Chandler RE, Onof CJ, Isham VS, Bellone E, Yang C, Lekkas D, Lourmas G, Segond ML (2005) Spatial–temporal rainfall modelling for flood risk estimation. Stoch Environ Res Risk Assess 19:403–416

    Article  Google Scholar 

  • Xia JQ, Falconer RA, Lin BL, Tan GM (2011) Numerical assessment of flood hazard risk to people and vehicles in flash floods. Environ Model Softw 26:987–998

    Article  Google Scholar 

  • Xu YP, Booij MJ, Tong YB (2010) Uncertainty analysis in statistical modeling of extreme hydrological events. Stoch Environ Res Risk Assess 24:567–578

    Article  Google Scholar 

  • Zahran S, Brody SD, Peacock WG, Vedlitz A, Grover H (2008) Social vulnerability and the natural and built environment: a model of flood casualties in Texas. Disasters 32:537–560

    Article  Google Scholar 

  • Zhou Q, Mikkelsen PS, Halsnæs K, Arnbjerg-Nielsen K (2012) Framework for economic pluvial flood risk assessment considering climate change effects and adaptation benefits. J Hydrol 414:539–549

    Article  Google Scholar 

Download references

Acknowledgments

This work was supported by the National Basic Research Program of China (973) (2012CB955402), the National Natural Science Foundation of China (41101506, 41171401), and the International Cooperation Project funded by Ministry of Science and Technology of China (S2012GR0231).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ning Li.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ji, Z., Li, N., Xie, W. et al. Comprehensive assessment of flood risk using the classification and regression tree method. Stoch Environ Res Risk Assess 27, 1815–1828 (2013). https://doi.org/10.1007/s00477-013-0716-z

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00477-013-0716-z

Keywords

Navigation