Methods to improve the neural network performance in suspended sediment estimation

doi:10.1016/j.jhydrol.2005.05.019

Journal of Hydrology

Volume 317, Issues 3–4, 20 February 2006, Pages 221-238


Abstract

This study examined how different methods affect suspended sediment estimation by artificial neural networks (ANNs). An initial statistical analysis of the flow and sediment data provided valuable information about the appropriate number of input nodes of the neural network, thereby avoiding redundant nodes. The k-fold partitioning of the training data set showed that similar or even superior sediment estimation performance can be obtained with quite limited data, provided that the statistics of the training subset are close to those of the testing data. The range-dependent neural network (RDNN) was found to be superior to conventional ANN applications, in which only a single network is trained on the entire training data set. Both low and high observed sediment values were closely approximated by the RDNN.

Introduction

Estimates of sediment yield are required in a wide spectrum of problems such as design of reservoirs and dams, transport of sediment and pollutants in rivers, lakes and estuaries, design of stable channels, dams and debris basins, undertaking cleanup following floods, protection of fish and wildlife habitats, determination of the effects of watershed management, and environmental impact assessment. Fine sediment has long been identified as an important vector for the transport of nutrients and contaminants such as heavy metals and micro-organics. Suspended sediment is important in its own right, since its presence or absence exerts an important control on geomorphological and biological processes in rivers and estuaries.

Sedimentation in rivers, reservoirs and estuaries is a serious problem. The prediction of river sediment load constitutes an important issue in hydraulic and sanitary engineering. Reservoirs are designed to contain a volume, known as the dead storage, that accommodates the incoming sediment accumulating over a specified period. Underestimation of sediment yield results in insufficient reservoir capacity, while overestimation leads to over-capacity reservoirs. Appropriate reservoir design alone justifies every effort to determine sediment yield accurately, but in sanitary engineering the prediction of river sediment load has an additional significance, especially if the particles transport pollutants. In this case the real-time distribution of the sediment concentration is needed, and a sediment concentration forecast is necessary for controlling the pollution level in rivers and reservoirs.

Several inter-related factors determine whether soil is detached or moved, and these processes are difficult to predict, as is evident from the number of reservoirs where actual sedimentation rates outstrip predicted estimates, which can be off by orders of magnitude. The traditional calculation of sediment transport rates, and hence sediment yield, relates sediment concentration to river flow values. Limited sediment data can thus be extrapolated to the length of the discharge record, although such relationships demonstrate a wide spread of points, as would be expected from a consideration of sediment transport mechanics. The classical approach of hydromechanics has not yet succeeded in modeling the complete process of sediment transport in rivers, because particle movements in turbulent flow, as well as the properties of the particles, are all random. The properties of the riverbed are irregular and hence can also be considered as random. Moreover, all the processes affect each other: the flow causes erosion and transportation of particles, while the transported particles in turn affect the flow as well as the rate of erosion. In the majority of rivers the total sediment load consists mainly of suspended sediment (Morris and Fan, 1997). The bed load contributes significantly to the total sediment load only in mountainous regions. The bed load is difficult to measure and time series for this parameter are not available in the literature. Therefore the estimation of suspended sediment is considered the key information for future sediment accumulation in water reservoirs.

Sediment yield Y(t) at a given point in space (say, watershed outlet) can be represented as

$$Y(t) = \overline{Y}(t) + \varepsilon(t) \tag{1}$$

in which $\overline{Y}(t)$ is the mean value or deterministic component of Y(t), and ε(t) is the error from or fluctuation around the mean value or stochastic component of Y(t). The relative contribution of $\overline{Y}(t)$ and ε(t) to Y(t) depends on the watershed and space-time scales. Clearly, Y(t) encompasses the full range of variability from being entirely deterministic to being entirely stochastic (Singh et al., 1988). All sediment models are special cases of Eq. (1).

The deterministic models can be distinguished as being empirical or conceptual. Most of the empirical models are related to the Universal Soil Loss Equation (USLE) and its latter modifications. These models usually require long data records, so that average annual sediment yield can be determined. The conceptual models combine the mechanics of sediment transport with empirical relationships. Both the empirical and conceptual models approximate the physical processes controlling sediment yield.

Another way to represent the complex sediment behavior is to interpret a sequence of sediment yield measurements as being random. If the processes governing sediment yield, such as soil particle detachment, entrainment, transport, and deposition, are assumed to be stochastic and thus governed by the laws of probability, the sediment yield can be described by a stochastic process and the associated probability density functions (pdf).

Some sediment yield models contain both deterministic and stochastic elements. A classical example is the relationship between sediment yield and runoff, represented by a line in a logarithmic plot. This is the deterministic part $\overline{Y} (t)$ of the model. When the measurements are plotted, they scatter around this line and most often will not lie directly on it. Thus the line represents only the mean trend of the sediment yield-runoff relationship, and the fluctuations ε(t) above and below it may be considered stochastic. A successful model will have to include both a deterministic component and the fluctuations around it.
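The sediment yield-runoff line described above can be illustrated with a short sketch. It is not taken from the study: the data are synthetic, and the function name `fit_rating_curve`, the coefficients and the noise level are all assumptions chosen for illustration. The fitted line plays the role of the deterministic part and the residuals play the role of ε(t), both in log space.

```python
import numpy as np

def fit_rating_curve(q, y):
    """Fit log10(y) = a + b * log10(q) by ordinary least squares.

    Returns the intercept a, the slope b, and the residuals in log space,
    which stand in for the stochastic fluctuations around the mean trend.
    """
    log_q = np.log10(q)
    log_y = np.log10(y)
    # np.polyfit returns coefficients from highest degree to lowest.
    b, a = np.polyfit(log_q, log_y, 1)
    residuals = log_y - (a + b * log_q)
    return a, b, residuals

# Synthetic runoff and sediment yield (assumed values, not observed data).
rng = np.random.default_rng(0)
q = rng.uniform(10.0, 1000.0, size=500)
noise = rng.normal(0.0, 0.2, size=500)           # scatter around the line
y = 10.0 ** (-2.0 + 1.5 * np.log10(q) + noise)   # assumed "true" trend

a, b, residuals = fit_rating_curve(q, y)
print(f"intercept a = {a:.3f}, slope b = {b:.3f}")
print(f"residual standard deviation = {residuals.std(ddof=2):.3f}")
```

Fitting in log space keeps the scatter roughly symmetric around the line, which is why the relationship is usually drawn on a logarithmic plot.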

Stochastic models of sediment yield can be grouped into regression models, time series models, entropy models and probability models. The regression models relate Y(t) empirically to rainfall R(t) and runoff Q(t). Spatial variability is not considered in these models. Stochasticity is represented by variations around the mean trend. In time series models a watershed is considered as a spatially lumped system. Deterministic relationships between R(t), Q(t) and Y(t) are represented by a transfer function and stochasticity is modelled as an autoregressive process. In entropy models, the pdf of Y(t) is obtained using constraints based on observed values of Y(t) and/or Q(t). Spatial variability of the variables is not accounted for. Probability models consider sediment yield Y(t) as a stochastic process, and the rainfall R(x,y,z,t) and runoff Q(t) may be treated as stochastic processes as well. The behaviour of Y(t) is described by its pdf or its joint probability density function with other stochastic sequences.

The application of physics-based distributed process computer simulation offers one possible method of prediction to assess the outcome of different management actions and long-term management strategies. But the application of these complex software programs is often problematic, due to the use of idealized sedimentation components, or the need for massive amounts of detailed spatial and temporal environmental data, which are often not available. Simpler approaches are therefore required in the form of 'conceptual' solutions or 'black-box' modeling techniques.

The artificial neural network (ANN) approach, which is a non-linear black box model, would seem to be a useful alternative for modeling the complex suspended sediment series. The ANN applications in water resources are in river flow prediction (Tokar and Johnson, 1999, Khalil et al., 2001, Brikundavyi et al., 2002, Elshorbagy et al., 2002, Cigizoglu, 2003a, Cigizoglu, 2003b, Cigizoglu and Kisi, Kisi, 2004), in the rainfall-runoff relationship (Hsu et al., 1995, Minns and Hall, 1996, Fernando and Jayawardena, 1998, Dawson and Wilby, 2001), in rainfall estimation (Silverman and Dracup, 2000, Cigizoglu and Alp, 2004, Freiwan and Cigizoglu, 2005) and in the various groundwater problems (Ranjithan et al., 1993). Neural network applications in hydrology were summarized by the ASCE Task Committee (2000b) and by Govindaraju and Rao (2000).

In the majority of these studies, the feed-forward error back-propagation method (FFBP) was employed to train the neural networks. The performance of the FFBP was found to be superior to conventional statistical and stochastic methods in continuous flow series forecasting (Brikundavyi et al., 2002, Cigizoglu, 2003a). However, the FFBP algorithm has some drawbacks such as the local minima problem. Maier and Dandy (2000) summarized the methods used in the literature to overcome this problem: training a number of networks starting with different initial weights, using the on-line training mode to help the network escape local minima, adding random noise, and employing second order methods (Newton's algorithm, Levenberg–Marquardt algorithm) or global methods (stochastic gradient algorithms, simulated annealing). In the review study of the ASCE Task Committee (2000a), other ANN methods such as conjugate gradient algorithms, the radial basis function, the cascade correlation algorithm and recurrent neural networks were briefly explained. Thirumalaiah and Deo (1998, 2000) used conjugate gradient and cascade correlation algorithms together with FFBPs for different hydrological applications. The Levenberg–Marquardt algorithm was employed in the FFBP applications included in the present study.

The ANN applications in suspended sediment modeling are relatively new compared with other water resources domains (Abrahart and White, 2001, Cigizoglu, 2004a). Cigizoglu (2004b) showed that the FFBP could provide negative suspended sediment estimations for some of the observed low sediment values. As the extrapolation potential of the FFBP was demonstrated by Cigizoglu (2003a), such a result can be expected. The suspended sediment records contain periods with succeeding extremely low and high sediment values (or vice versa). The ratio between overall record maximum and overall record mean $(x_{max} / \bar{x})$ is quite high and some underestimations in low values (even negative values) arise as the network faces confusion in the transition between these two extreme zones (Cigizoglu, 2004a).

Considering this difficulty in suspended sediment modeling, two methods described in the literature, previously used for river flow estimation, are employed for suspended sediment estimation in this study. The first of these is the k-fold partitioning method. Using this statistical method, as explained by Ali and Pazzani (1996), the record is divided into smaller data sets and handled separately. Thus, statistical work is carried out for each sub-set independently and the sub-set which provides the most information (even more than the whole data set) is selected. This is important because the scarcity in rainfall data is a problem faced frequently by water resources engineers. In works like water reservoir planning, the length of the observed precipitation record might be quite short making the rainfall forecasting studies difficult. Therefore, methods helping to obtain more information from the available limited data are valuable. In a recent study, it was shown that extending the ANN training sets with synthetically generated flow data noticeably increased the flow prediction performance of ANNs (Cigizoglu, 2003a). Cigizoglu and Kisi (2005) successfully applied k-fold partitioning to flow data for neural network training.
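A minimal sketch of the subset-selection idea follows. It assumes contiguous, equal-length folds and a simple relative-difference score on the mean and standard deviation; the study's actual number of folds and comparison statistics are not given in this excerpt, so `k`, the score and the synthetic series are all hypothetical.

```python
import numpy as np

def fold_statistics(series, k):
    """Split a series into k contiguous folds; return (mean, std) per fold."""
    folds = np.array_split(series, k)
    return np.array([(f.mean(), f.std(ddof=1)) for f in folds])

def closest_fold(train, test, k):
    """Pick the training fold whose statistics are closest to the test data.

    The score is the summed relative difference in mean and standard
    deviation (an assumed criterion, chosen only for illustration).
    """
    target = np.array([test.mean(), test.std(ddof=1)])
    stats = fold_statistics(train, k)
    scores = (np.abs(stats - target) / np.abs(target)).sum(axis=1)
    return int(np.argmin(scores)), scores

# Synthetic skewed series standing in for daily sediment data (assumption).
rng = np.random.default_rng(1)
train = rng.lognormal(mean=3.0, sigma=1.0, size=8760)
test = rng.lognormal(mean=3.0, sigma=1.0, size=1825)

k = 5  # hypothetical number of folds
best, scores = closest_fold(train, test, k)
print("fold scores:", np.round(scores, 3))
print("fold closest to the test statistics:", best)
```

A network trained only on the selected fold would then be compared against one trained on the whole record.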

The second method employed in this study for suspended sediment estimation is the range-dependent neural network (RDNN). This method was applied to river flow time series by Hu et al. (2001). Based on a proposed clustering algorithm for the training pairs, the RDNN was developed for better accuracy in hydrologic time series prediction. In this method, the training data are clustered using different ranges, such as different proportions of $\bar{x}$ (the mean of the whole series). Flow data falling within each range are trained by a separate neural network. Hence, each of the networks has its own training pairs obtained by the clustering algorithm and serves a different magnitude of flow prediction.
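The range-based clustering can be sketched as follows, assuming three ranges bounded by two multiples of the series mean. The coefficients `a` and `b`, the data, and the use of a per-range straight-line fit in place of a per-range neural network are all assumptions made to keep the example short; this is not the authors' implementation.

```python
import numpy as np

def assign_ranges(x, a, b, x_mean):
    """Label each value 0 (x < a*mean), 1 (a*mean <= x < b*mean) or 2 (else)."""
    return np.digitize(x, [a * x_mean, b * x_mean])

def fit_range_models(x, y, a, b):
    """Fit one model per range. A line stands in for each range's network."""
    x_mean = x.mean()
    labels = assign_ranges(x, a, b, x_mean)
    models = {}
    for r in range(3):
        mask = labels == r
        if mask.sum() >= 2:  # need at least two points to fit a line
            models[r] = np.polyfit(x[mask], y[mask], 1)
    return models, x_mean

def predict(x_new, models, a, b, x_mean):
    """Route each input to the model of its range; NaN if no model exists."""
    labels = assign_ranges(x_new, a, b, x_mean)
    out = np.full(x_new.shape, np.nan)
    for r, coeffs in models.items():
        mask = labels == r
        out[mask] = np.polyval(coeffs, x_new[mask])
    return out

# Synthetic data whose behavior changes between ranges (assumed values).
a, b = 0.5, 1.5                      # hypothetical range coefficients
x = np.linspace(1.0, 100.0, 200)
low, high = a * x.mean(), b * x.mean()
y = np.where(x < low, 0.1 * x, np.where(x < high, 2.0 * x - 10.0, 5.0 * x))

models, x_mean = fit_range_models(x, y, a, b)
pred = predict(x, models, a, b, x_mean)
print("max absolute error:", np.max(np.abs(pred - y)))
```

Because every range has its own model, a regime change between low and high values does not have to be absorbed by a single set of weights.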

Section snippets

Feed-forward back-propagation (FFBP)

The FFBP is the most popular ANN training method in water resources literature. A typical feed forward structure is presented in Fig. 1. A FFBP distinguishes itself by the presence of one or more hidden layers, whose computation nodes are correspondingly called hidden neurons or hidden units. The function of hidden neurons is to intervene between the external input and the network output in some useful manner. By adding one or more hidden layers, the network is enabled to extract higher order
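For orientation, a feed-forward network with one hidden layer and error back-propagation can be sketched as below. The sketch assumes sigmoid hidden units, a linear output node, a mean-squared-error loss and plain gradient descent; the study itself reports using the Levenberg–Marquardt algorithm, which is not implemented here. The layer sizes, learning rate and data are hypothetical.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def init_network(n_in, n_hidden, n_out, rng):
    """Random small weights and zero biases for a one-hidden-layer network."""
    return {
        "W1": rng.normal(0.0, 0.5, size=(n_in, n_hidden)),
        "b1": np.zeros(n_hidden),
        "W2": rng.normal(0.0, 0.5, size=(n_hidden, n_out)),
        "b2": np.zeros(n_out),
    }

def forward(params, x):
    """Propagate inputs through the hidden layer to a linear output."""
    hidden = sigmoid(x @ params["W1"] + params["b1"])
    output = hidden @ params["W2"] + params["b2"]
    return hidden, output

def backprop_step(params, x, target, lr):
    """One gradient-descent update; returns the loss before the update."""
    n = x.shape[0]
    hidden, output = forward(params, x)
    err = output - target
    # Output-layer gradients of the loss 0.5 * sum(err**2) / n.
    grad_W2 = hidden.T @ err / n
    grad_b2 = err.mean(axis=0)
    # Propagate the error back through the sigmoid hidden layer.
    delta_h = (err @ params["W2"].T) * hidden * (1.0 - hidden)
    grad_W1 = x.T @ delta_h / n
    grad_b1 = delta_h.mean(axis=0)
    params["W1"] -= lr * grad_W1
    params["b1"] -= lr * grad_b1
    params["W2"] -= lr * grad_W2
    params["b2"] -= lr * grad_b2
    return 0.5 * np.sum(err ** 2) / n

# Synthetic two-input regression problem (assumed, scaled to [0, 1]).
rng = np.random.default_rng(2)
x = rng.uniform(0.0, 1.0, size=(200, 2))
target = 0.7 * x[:, :1] + 0.3 * x[:, 1:] ** 2

params = init_network(2, 5, 1, rng)   # 2 input, 5 hidden, 1 output node
losses = [backprop_step(params, x, target, lr=0.2) for _ in range(1000)]
print(f"loss: first step {losses[0]:.5f}, last step {losses[-1]:.5f}")
```

Restarting this loop from several random initial weights and keeping the best run is one of the simple remedies for local minima.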

The data and k-fold partitioning

In this study, daily mean river flow and daily total suspended sediment data collected at the Manayunk Station (USGS station no: 1473800) on the Schuylkill River in the United States were used. These data, which were downloaded from the website of USGS, were divided into two groups, one for training and the other for testing. The training period covered the first 8760 daily flow and suspended sediment values and the testing period consisted of the last 1825 daily values. The k-fold partitioning

Range-dependent neural network (RDNN)

In the context of suspended sediment modeling, a single neural network with complex structure was found to be unable to adapt to the complexity of the suspended sediment process although it performed better than the conventional methods (Cigizoglu, 2004a). This result is consistent with the applications of such networks to other hydrologic processes such as intermittent river flow forecasting and rainfall-runoff transformation (Cigizoglu, Cigizoglu and Alp, 2004). Cigizoglu (2004a) found that a

Method of application

A MATLAB code was written for the range-dependent neural network (RDNN) using FFBP. The code demands range coefficients a and b as input and trains three different neural networks (for the corresponding three ranges) simultaneously. The RDNN ranges are shown in Fig. 2. The RDNN training simulations were carried out for each subset and for the whole training data and the resulting trained network was then used for testing. Analogous to the RDNN concept, the simpler multi-linear model (for which

Results

The ANN configurations providing the best performance criteria values for the four input combinations are presented in Table 5. For the single input node $Q_t$, the most suitable ANN structure was ANN (1,3,1), representing 1, 3 and 1 input, hidden and output nodes, respectively. For the two-input ($Q_t$, $S_{t-1}$) case, on the other hand, ANN (2,5,1) gave the best results. Network configurations ANN (3,4,1) and ANN (4,3,1) showed the best performance for three ($Q_{t-1}$, $Q_t$ and $S_{t-1}$) and four ($Q_{t-1}$, $Q_t$, $S_{t-1}$
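The lagged input combinations named above can be assembled from the two daily series as in the following sketch. It assumes that Q denotes flow, that S denotes suspended sediment, and that the network target is the current-day sediment value; the helper `build_inputs` and the toy series are hypothetical. Only the three fully specified combinations are shown.

```python
import numpy as np

def build_inputs(q, s, combo):
    """Build an input matrix and a target vector from lagged series.

    combo is a list of (name, lag) pairs, e.g. [("Q", 0), ("S", 1)] for
    the inputs (Q_t, S_t-1). The target is assumed to be S_t.
    """
    series = {"Q": np.asarray(q), "S": np.asarray(s)}
    n = len(series["Q"])
    max_lag = max(lag for _, lag in combo)
    # Row i corresponds to day t = max_lag + i, so every lag is available.
    cols = [series[name][max_lag - lag : n - lag] for name, lag in combo]
    X = np.column_stack(cols)
    y = series["S"][max_lag:]
    return X, y

# Toy series (assumed values) to make the alignment easy to check.
q = np.arange(5.0)            # Q_0 .. Q_4
s = 10.0 * np.arange(5.0)     # S_0 .. S_4

combos = {
    "ANN(1,h,1)": [("Q", 0)],
    "ANN(2,h,1)": [("Q", 0), ("S", 1)],
    "ANN(3,h,1)": [("Q", 1), ("Q", 0), ("S", 1)],
}
for label, combo in combos.items():
    X, y = build_inputs(q, s, combo)
    print(label, "inputs shape", X.shape, "targets shape", y.shape)
```

The number of columns in each matrix is the number of input nodes in the corresponding ANN (input, hidden, output) notation.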

Conclusions

The present study covered the employment of different methods for river suspended sediment estimation by ANNs. The k-fold partitioning of the training data was quite fruitful, showing that similar or even superior sediment estimation performance can be obtained with quite limited data, provided that the sub-training-data statistics are close to those of the whole testing data set. The initial statistical analysis was found to be useful in the determination of the appropriate number of input nodes.

References (41)

R.J. Abrahart et al.
Modeling sediment transfer in Malawi: comparing backpropagation neural network solutions against a multiple linear regression benchmark using small data sets
Physics and Chemistry of the Earth (B)
(2001)

H.K. Cigizoglu
Estimation and forecasting of daily suspended sediment data by multi layer perceptrons
Advances in Water Resources
(2004)

A. Elshorbagy et al.
Estimation of missing streamflow data using principles of chaos theory
Journal of Hydrology
(2002)

R.K. Kachroo et al.
Non-linear modeling of the rainfall-runoff transformation
Journal of Hydrology
(1992)

M. Khalil et al.
Groups and neural networks based streamflow data infilling procedures
Journal of Hydrology
(2001)

H.R. Maier et al.
Neural network for the prediction and forecasting of water resources variables: a review of modeling issues and applications
Environmental Modeling and Software
(2000)

J.E. Nash et al.
River flow forecasting through conceptual models. Part I: a discussion of principles
Journal of Hydrology
(1970)

K. Ali et al.
Error reduction through learning multiple descriptions
Machine Learning
(1996)

Artificial neural networks in hydrology I
Journal of Hydrologic Engineering, ASCE
(2000)

Artificial neural networks in hydrology II
Journal of Hydrologic Engineering, ASCE
(2000)

A. Becker et al.
Non linear flood routing model with multi-linear model
Water Resources Research
(1987)

S. Brikundavyi et al.
Performance of neural networks in daily streamflow forecasting
Journal of Hydrologic Engineering
(2002)

H.K. Cigizoglu
Incorporation of ARMA models into flow forecasting by artificial neural networks
Environmetrics
(2003)

H.K. Cigizoglu
Estimation, forecasting and extrapolation of flow data by artificial neural networks
Hydrological Sciences Journal
(2003)

H.K. Cigizoglu
Discussion of performance of neural networks in daily streamflow forecasting

Cigizoglu, H.K., 2005. Application of the generalized regression neural networks in intermittent flow forecasting and...

H.K. Cigizoglu et al.
Rainfall-runoff modelling using three neural network methods, artificial intelligence and soft computing - ICAISC 2004
Lecture Notes in Artificial Intelligence
(2004)

Cigizoglu, H.K., Kisi, O., 2005. Flow prediction by three back propagation techniques using k-fold partitioning of...

C.W. Dawson et al.
Hydrological modeling using artificial neural networks
Progress in Physical Geography
(2001)

M.Y. El-Bakyr
Feed forward neural networks modeling for K-P interactions
Chaos, Solitons and Fractals
(2003)

