Next Article in Journal
Biomechanical Evaluation of the Effects of Implant Neck Wall Thickness and Abutment Screw Size: A 3D Nonlinear Finite Element Analysis
Previous Article in Journal
Organic and Conventional Herbs Quality Reflected by Their Antioxidant Compounds Concentration
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Monthly Reservoir Inflow Forecasting for Dry Period Using Teleconnection Indices: A Statistical Ensemble Approach

1
Department of Environmental Engineering, Korea University, Sejong 30019, Korea
2
Program in Environmental Technology and Policy, Korea University, Sejong 30019, Korea
3
Korea Infrastructure Safety and Technology Corporation, Jinju 52852, Korea
*
Author to whom correspondence should be addressed.
Appl. Sci. 2020, 10(10), 3470; https://doi.org/10.3390/app10103470
Submission received: 18 March 2020 / Revised: 21 April 2020 / Accepted: 13 May 2020 / Published: 18 May 2020
(This article belongs to the Section Civil Engineering)

Abstract

:

Featured Application

The work can be applied to reservoir operation for water supply by foreseeing the inflows to a reservoir for the coming months to take necessary action in advance to save water for an expected drought situation.

Abstract

Reliable long-range reservoir inflow forecast is essential to successfully manage water supply from reservoirs. This study aims to develop statistical reservoir inflow forecast models for a reservoir watershed, based on hydroclimatic teleconnection between monthly reservoir inflow and climatic variables. Predictability of such a direct relationship has not been assessed yet at the monthly time scale using the statistical ensemble approach that employs multiple data-driven models as an ensemble. For this purpose, three popular data-driven models, namely multiple linear regression (MLR), support vector machines (SVM) and artificial neural networks (ANN) were used to develop monthly reservoir inflow forecasting models. These models have been verified using leave-one-out cross-validation with expected error S as a measure of forecast skill. The S values of the MLR model ranged from 0.21 to 0.55, the ANN model ranged from 0.20 to 0.52 and the SVM from 0.21 to 0.56 for different months. When used as an ensemble, Bayesian model averaging was more accurate than simple model averaging and naïve forecast for four target years tested. These were considered to be decent prediction skills, indicating that teleconnection-based models have the potential to be used as a tool to make a decision for reservoir operation in preparing for droughts.

1. Introduction

Like many regions in developed countries, demands on water resources of the Southern Chungnam province, fourth largest in South Korea, are continuously growing, but there is only one water source—Boryeong multi-purpose Dam. The available reservoir storage of Boryeong Dam is 116.9 million tons and designed to supply 23,800 tons per day for agricultural, residential and industrial use serving 420,000 of people in Western Chungnam Province [1]. Boryeong Dam is also supplying water to four major steam power plants, which generate about 25% of South Korea’s electricity.
Currently, the water use of Southern Chungnam Province is about 21,000 tons per day, which is approaching 90% of design demand. Water demand from steam power plants is expected to increase because of population growth. Furthermore, because of the region’s adjacency to Seoul, the capital of South Korea, construction of an industrial complex has been planned in the near future, which will also increase water demand. These increasing water demands have raised several concerns and triggered discussions on additional dam building and water allocation issues. However, those structural measures need at least 5–10 years to solve the water shortage problem in Chungnam Province, which means, if the operational failure of the dam occurs in the meantime during drought, water availability may be threatened because of the storage shortage.
In 2015, Western Chungnam Province had to reduce water supply by 20% for 5 months because of serious drought. Thus, reliable long lead-time reservoir inflow forecasting is essential because water resources manager must decide when to reduce environmental flows to secure more water during water shortage or to increase water release to prevent dam overflows. To reduce the negative impacts of climatic variability on dam operation, dam inflow forecasting model with at least 3 months lead time is needed to prepare for potential threat such as floods and droughts. For the water supply purpose, it is important to know in advance to save some water if the shortage in water is on the horizon of the near future. Boryeong Dam is very vulnerable to drought as its effective storage to annual water supply is 1:1 and is related with many different types of stakeholders including government, municipalities, agriculture, environmental organizations, enterprises, etc. [1]. To address the complex need for water resources before any decision on the adjustment of dam releases to be reached, a month or two is not a sufficient amount of time to do so. Thus, at least 3-months lead time, thus a season-ahead forecast, is needed, and it is quite common in forecasting practices [2,3,4]. In summary, the time required for local stakeholders to take operational actions and also the medium time horizon for forecasting needs were factored into the selection of lead time to be 3 months.
There are two basic approaches for long-range hydroclimatic predictions: dynamic models and empirical statistical models. The dynamic approach is based on process models for climate and hydrology. The statistical approach is based on data driven models where the predictor–predictand relationship is the essence of the technology. Although it is anticipated that dynamic models may become superior to empirical models in the future, empirical forecasting models are still able to compete because they are less expensive to develop and use [5].
The hydroclimatic models used to forecast seasonal precipitation or streamflow are still far from perfect, and there are a lot of uncertainties to be considered. Especially, the models need to be in better compliance with climatic observations such as ocean-atmospheric indices, which are connected to climatic variability globally. These oceanic-atmospheric forcing predictors, called hydroclimatic teleconnections are widely used to forecast long range seasonal streamflow or precipitation. Since weather and climate are linked with atmospheric circulation over the continents, previous studies have suggested the relationships between the extreme phases of El Niño southern oscillation (ENSO) and the fluctuations of stream flow and precipitation [6,7]. There is growing evidence of the effects of atmospheric-oceanic features on the hydrology of the western basins [8,9,10]. Moss et al. [11] used the southern oscillation index (SOI) as a predictor of the probability of low flows in New Zealand. Chiew et al. [12] pointed out the effect of ENSO on Australian rainfall, streamflow and drought. More recently, Chen and lee [13] suggested variations in correlations and possible climate regime shifts.
There are also several past studies for streamflow prediction with long lead times using ocean-atmospheric oscillation indices. Chiew et al. [14] used the ENSO–streamflow relationship to forecast streamflow. Xu et al. [15] showed the possibility for seasonal streamflow forecasting using climate teleconnections at the Three Gorges Dam. Chandimala and Zubair [16] tried to forecast streamflow and rainfall, based on ENSO, for water resource management. Karla et al. [17] predicted annual streamflow volume with a 1-year lead time by a data-driven model, the support vector machine (SVM), using the North Atlantic oscillation (NAO), ENSO and sea surface temperature (SST). Ouachani et al. [18] also showed the impact of teleconnection pattern on precipitation and streamflow. More recently, Hidalgo-Munoz et al. [19] employed multiple linear regression using large-scale atmospheric and oceanic information for seasonal streamflow forecasting and found fair forecasting skill.
Although previous studies revealed a significant relationship between large-scale climate teleconnections and stream flow, most of those applications have investigated the correlation between teleconnection indices and streamflow in the large-scale basins on a yearly or seasonal basis. However, there have been few studies evaluating the applicability of climate indices in predicting monthly runoff in a small watershed such as Boryeong Dam watershed (163 km2) in South Korea [20,21]. At the same time, there were no attempts to use multiple data-driven models as an ensemble for a monthly reservoir inflow forecasting purpose in the context of simple model averaging (SMA) and Bayesian model averaging (BMA), which is an ensemble averaging technique based on multiple model results. BMA considers the uncertainty of each model’s forecasts explicitly and uses this uncertainty to calculate a predictive distribution. This method has two desirable advantages. One is that it provides deterministic forecast and the other is that it offers prediction interval of forecast distribution, which is very helpful to make a decision probabilistically.
The main objective of this research is identifying the monthly best combination of climate indices and developing a monthly reservoir inflow forecasting model based on them with a three-month lead time. For the efficient dam operation, monthly reservoir inflow prediction with a lead time of at least three months is needed, which means climate predictors measured at least three months in advance are used. Another objective is to evaluate whether the ensemble prediction by BMA with data-driven models for reservoir inflows in a small basin is reliable or useful in forecasting monthly reservoir inflows, which is a very important tool for the stable dam operation.

2. Materials and Methods

2.1. Study Site and Data Used

Boryeong Dam watershed located in South Chungcheong Province of South Korea was selected as the study site (Figure 1). It is near the coast of the West Sea and belongs to a monsoonal climatic region with seasonal cycle of precipitation. The dam’s height and crest length are 50 m and 291 m, respectively [22]. The dam has an effective storage of 108.7 × 106 m3 and supplies municipal and industrial waters to three cities (Boryeong, Seosan and Dangjin), five counties (Seocheon, Cheongyang, Hongseong, Yesan and Taean) and two electrical power plants (Dangjin and Seobu). The watershed drainage area upstream of the dam site is 163 km2 [1]. The average slope is 36.9%, and the average elevation is 233.4 m. The basin’s topography is mainly covered with forest (72%) and agricultural land (17%) and had undergone very little changes since early 1900s [23]. This watershed is a fluvial system, which is governed by rainfall-driven peak in wet season. The annual average precipitation of the watershed is about 1305 mm/year, and approximately two-thirds of the annual precipitation occurs during warm seasons from June to September. The annual mean discharge and runoff ratio are 4.19 m3/s and 0.58. The average annual temperature, monthly maximum temperature and monthly minimum temperature are 12.8, 36, and −13.6 °C, respectively.
In this study, monthly reservoir inflow data of Boryeong Dam are obtained from Water Resources Management Information System [24] of Korea (1998–2016) to be used as predictands. Our target seasons for forecasts are dry periods spanning from January to May and from November to December, and the wet seasons (June to October) are excluded because the prediction of reservoir inflow becomes more important for water supply during this season. In the literature, there are similar studies in spirit, which focused on forecasting specifically for dry season streamflow, which is the critical season that allows for the decision to be made on optimal allocation of precious water resources in the context of the drought condition [25,26,27]. Monthly long-term observed precipitation data, which are the areal precipitation from stations within Boryeong Dam watershed, were taken from WAMIS and K-water and were used for the analysis. The gauging stations are located in Figure 1. The monthly average precipitation, dam inflow and runoff coefficient are shown in Table 1 and Figure 2.
We also used time series of many climatic indices available from the Climate Prediction Center [28] for the same period, which are updated monthly (Table 2).

2.2. Methodology

2.2.1. Predictor Selection

A time-lagged (3–12 months backward) correlation analysis between the monthly climatic indices and reservoir inflow at Boryeong Dam was performed to select the best monthly candidate predictors associated with teleconnection impact on the monthly reservoir inflow. Correlation analysis of less than a 3 months lagged period was not considered in this study because 3-months lead time forecasts were needed for the study area to prepare for a drought, allowing for about a month for multi-stakeholders to reach a decision and 1 or 2 months to take managerial actions [1]. Significance tests were then performed based on a two-tailed t-test for correlation coefficients between predictors (observed climate indices) and predictand (observed reservoir inflow) values. A multiple linear regression analysis was used to select predictors using the stepwise regression method, which is an iterative process of adding or removing candidate predictors based on the F-statistics of their estimated coefficients and their p-values, which are significant for a value of less than 5% [19,29]. Multicollinearity check was also considered for the regression modeling. Multicollinearity results in difficulty in coming up with reliable estimates of parameters in regression models. Multicollinearity checks were run for the models, calculating variance inflation factor (VIF) as described in Equation (1):
V I F i = 1 ( 1 R i 2 )
where Ri2 is the coefficient of determination of the regression of predictor i. A VIF above 5 indicates high correlation [30].

2.2.2. Statistical Models

For the monthly dam inflow forecasts with a 3-month lead time, multiple linear regression model (MLR), support vector machines (SVM) and artificial neural networks (ANN) methods were applied, which are very popular approaches well identified as mainstream in a recent review [31] of streamflow forecasting using data-driven models and powerful statistical techniques that have found many applications in this field [17,29,32,33,34,35,36,37,38,39,40,41,42]. However, thorough the comparison of models was not the main focus of our work. The MLR method is a multivariate technique that determines a linear relationship between multiple independent variables (predictors) and a dependent variable [43]. The chosen climatic indices are the independent variables and monthly reservoir inflow is the dependent variable. The formula for the multiple linear model used is shown below in Equation (2):
y = C + D 1 x 1 + D 2 x 2 + + D n x n
where y is the output variable; xi represent the climatic indices; Di is the regression coefficients and C is the intercept.
SVM is a useful tool for data classification and pattern recognition in the field of data mining [44,45]. It has shown to provide excellent performances in regression and time series prediction. The general framework of SVM involves (1) separating data into training and test sets, (2) producing a model with training data set and (3) predicting the target value based on test data set. Support vector regression has the same concepts as the SVM for classification. The main goal of support vector regression is to find a function f(x) that has at most ε deviation from the target variable y as described in Equations (3)−(5).
f ( x ) = ω , x + b
M i n i m i z e   P ( f ) = 1 2 ω 2 + c i = 1 l ( ξ i + ξ i * )
S u b j e c t   t o { y i ω , χ i b ε + ξ i ω , χ i + b y i ε + ξ i * ξ i , ξ i * 0
where C determines the tradeoff between flatness of f and tolerance of the prediction error ε, ξ i and ξ i * are the slack variables to deal with optimization problems. A detailed description of statistical principles of SVM can be found in [46]. The kernel function is used to consider nonlinearities in the model. A radial basis kernel that has been known to perform better than other kernels was used.
ANN, in contrast to MLR, has a flexible mathematical configuration that can analyze nonlinear relationships between dependent and independent variables. ANN generally consists of input layers, one or more hidden layers and an output layer [47]. However, there are different schemes for ANN depending on the number of layers and nodes. The general definition of ANN is as shown in Equation (6):
y = g ( i = 1 l w i x i )
where y is a summation function of weighted (w) output, x is a neuron with inputs and g is an activation function used to generate an output. There are many different activation functions, we used the typical sigmoid logistic non-linear function, which varies from 0 to 1 as in Equation (7):
g ( t ) = 1 1 + e t .
The optimization process of finding an appropriate connection between the input layer and hidden layer by changing w and the bias was determined through learning or training the system. Therefore, training was the process to minimize the difference between the ANN output and the observed value (the target data).
The application of ANN algorithms in hydrology is well documented in the paper by Govindaraju [48]. Matlab was the software used for developing the ANN model. The scheme used in this study, shown in Figure 3, was the input–hidden–output layer, which means the network had input neurons (chosen climatic indices) in the input layer (4 at the most as determined in this study), ten neurons in the hidden layer and one neuron (monthly reservoir inflow) in the output layer. When we added to or subtracted from 10 hidden layer nodes, which is a default value in Matlab, the result was not significantly better. Therefore 10 hidden layer nodes were used as recommended by the software. A two-layer feedforward network, which is the most common training method in ANN called the multilayer perceptron, was used and trained using the Bayesian-regularization algorithm [49].

2.2.3. Model Validation

The traditional framework of hydrologic modeling includes preparing data sets, dividing a data set into training and test data sets, training the model and assessing the results from the trained model on a test data set. However, the general process described above can cause period-specific output for reservoir inflow forecasts using climate indices such as different combinations of predictors depending on how the data is split. This is because of different periodicity exhibited by predictors. Additionally, the second reason is the difference between the recent and past climate characteristics due to the climate change impact. For example, the average inflows in November and December in the period of 1998–2012 (training data) were 1.4–1.5 million tons, while the average inflows in the recent period of 2013–2016 (test data) were 5.0–6.3 million tons [1]. In this case, there is a lack of learning data for high inflow, which lowers the reliability of the machine learning method because training data did not reflect the recent increasing precipitation trend in winter season. These results indicate the necessity of considering the recent seasonal or monthly streamflow characteristics when we build a statistical model. In order to solve that issue, k-fold cross-validation was proposed instead of dividing the data sets into specific training and testing term. The target area had 19 observed data sets. Each sample data was removed, and the model was trained with the remaining (k − 1) sample data. Finally, the removed sample data was tested using the trained model, and this process is repeated for k times. For each month that is to be predicted in a given year, a separate model is built and trained using the data recorded in the same month for the remaining 18 years.

2.2.4. Model Evaluation

To evaluate the ability of above models’ forecast skill, correlation coefficient (r) and expected error, S [50] were used. Correlation coefficient describes the degree of collinearity between the predicted value and measured data. The coefficient S is computed as Equation (8):
S = 1 var ( Q Q s ) var ( Q s )
where Q′ and Qs are forecasted and observed values of reservoir inflow, respectively. The coefficient S can range from −∞ to 1. The coefficient of 1 means a perfect match of predicted value to the observed data. The coefficient above 0 indicates that the model has certain skill in prediction. Consequently, the closer the model’s efficiency coefficient is to 1, the more accurate the model’s prediction is.
The overall methodology adopted for monthly reservoir inflow prediction can be visualized in Figure 4.

3. Results and Discussion

3.1. Selection of Oceanic-Atmospheric Indices

Table 3 shows the best-fitting predictors having the highest correlations with p-values of less than 5% and VIF of less than 5. We described the correlation strength between predictors and reservoir inflow of 0.80 and higher as “very strong”, those from 0.60 to 0.79 as “strong” and those from 0.40 to 0.59 as moderate correlations [51]. In January, two predictors were selected, Tropical Northern Atlantic index (TNA; lag-10 months) and extended reconstructed sea surface temperature (ERSST; lag-3 months), and showed strong correlation (−0.67, 0.61) with p-values of less than 1%. Additionally, WP (lag-4 months) and the multivariate ENSO index (MEI; lag-8 months) have strong correlations (−0.67 and 0.68) with p-values of less than 1% in February. The East Atlantic pattern (EA) was well correlated during the month of the March (−0.71) and May (0.67). Four climate indices were selected in April with NINA3 (lag-10 months) showing a very strong correlation (0.83), and the remaining predictor’s correlation coefficients lie between the values 0.44 and 0.66. The ERSST (lag-3 months) has a strong correlation in November (0.75) and December (0.66).
A different set of climate factors was selected for each month as is found in Table 3. The lag month as little as 3 months to as long as 12 months were found for climate factors exhibiting the different levels of lags of climate indices that has influence on streamflow. The spring season (March-May) were found to be the months that take the most advantage of good number of climate indices (four) as predictors for streamflow. A similar pattern of results was presented in previous work [52], indicating the variability of climatic factors that have influences on each month for medium-range forecasting on the study area. Among the climate indices considered, TNA, Arctic oscillation (AO), EA and ERSST were found to be selected repeatedly more than once with different lagged months as major predictors in several months. The ERSST, which is the climate index representing the sea surface temperature, was identified to be most frequent predictor (selected in five different months) that has the influence on streamflow.
In many papers, ocean SST were identified to influence streamflow [20,53,54]. The correlation analysis between the ERSST value of the waters around the Korean Peninsula and the reservoir inflow was investigated in order to find the region highly correlated with the inflow with 3-month lead time. As a result, ERSST was identified to be useful for lead time forecasting of reservoir inflow except February and April. Table 4 lists the selected locations for ERSST for the months being considered in this study. Figure 5 shows the moving correlation coefficients with lags considered between Q′ and Qs for the entire period 1998–2016. Except the ERSST(Mar, Dec) and TNA(Nov), the remaining factors exceeded a significance level of 95%.

3.2. Individual Model Performance

The performance of each model in terms of r (correlation coefficient) and S are listed in Table 5. The r values of the three models ranged from 0.62 to 0.90, which shows that the pattern of the observations was well simulated. The r value of 0.7 or more means a strong relationship between the observed and predicted values based on the criteria given by Hinkle et al. [55]. In terms of skill score, the monthly prediction skill score was larger than 0 for all three models and showed some predictive skill. The S values of the MLR model ranged from 0.21 to 0.55, the ANN model ranged from 0.20 to 0.52, and the SVM from 0.21 to 0.56. Except for January and December, the maximum prediction score of the models was above 0.4. One of the reasons why the forecasting score was low in January and December compared with the other months is that the variation in monthly inflow was large, which made the model prediction more difficult for these two months. The MLR model has a relatively simpler method than the other two models, however the forecast performance shows similar results and is considered to be an applicable and useful method with short computation time. Among the three models, MLR model depicted best results in April (r = 0.86, S = 0.49), and ANN model showed the best result in February (r = 0.83, S = 0.44), May (r = 0.81, S = 0.41) and November (r = 0.88, S = 0.52). SVM model indicated the best skill score in the cases of January (r = 0.67, S = 0.26), March (r = 0.9, S = 0.56) and December (r = 0.76, S = 0.34). The comparison of the observed and predicted values is shown in Figure 6. It can be concluded that the statistical method using climatic indices to predict dam inflow for small basin scale gives reasonable forecasts, with a 3-months lead time.
With the increase of observed data, the accuracy of forecasts will be improved. This issue of the effect of the training set size on the forecast performance of data-driven models is well confirmed in the literature. Foody et al. [56] showed for the remotely sensed data set the classification accuracy was increased significantly as a result of increasing the number of training cases using ANN method. Sug [57] discussed the effect of training set size for the performance of neural networks of classification and showed the tendency that multilayer perceptrons have better performance in relatively larger training data sets. Sordo et al. [58] investigate the dependency between sample size and accuracy of SVM. As a number of cases increases, SVM showed a substantial improvement in performance. Raquel et al. [59] analyzed the influence of training set size on the prediction accuracy of SVM model and indicated that the predictions became stable with increasing size of training sets.

3.3. Application of Ensemble Prediction

3.3.1. Ensemble Prediction Method

Previous evaluation results of each model’s performance show that the predictive accuracy of the models varied for different months. That means ensemble prediction is needed to integrate the strengths of different candidate models. This also allows for the quantification of prediction uncertainty. This section shows whether ensemble forecasting results with 3-months lead time are applicable to the dam operation. To illustrate this point, the prediction accuracy of 3-months lead time forecasting was analyzed using the models trained first by historical climate indices and reservoir inflow data for the preceding years before a target year. For example, the forecast results for 2013 are predicted using the models trained by monthly climate indices and reservoir inflow of 1998–2012. Four years (2013–2016) were selected as the target years, which is considered to reflect recent climate change impacts.
Model averaging is frequently used to quantify or reduce the uncertainty of various models and to take the good characteristics of each model. The multiple model ensemble technique are known to provide more reliable prediction results than the individual models by combining the results of various outflow models [60]. This study compared the SMA with BMA methods used widely in the various field. The SMA method means averaging that each model’s result has an equal value of the weight as described in Equation (9):
α s m a = ( 1 k , , , 1 k )
where k is the total number of models.
Forecast density of the BMA model, g j is computed as follows in Equation (10):
g j = k = 1 k β k f k ( y ˜ j )
where β k denotes the weight vector, fk is unknown forecast distribution and y ˜ j is a measurement vector. If a normal distribution is used as a forecast distribution, fk can be described as Equation (11):
f k ( y ˜ j | D j k , σ k 2 ) = 1 2 π σ k 2 exp ( 1 2 σ k 2 ( y ˜ j D j k ) )
where D j k is the point forecasts, and σ k 2 is variance. The MODELAVG tool box was used for BMA analysis and the details are explained in the manual [61].

3.3.2. Application of Ensemble Prediction

In order to assess the utility of this approach, we compared each forecast with the naive forecast (NF) that is average of the streamflow recorded for that month over the previous years. The Nash–Sutcliffe efficiency (NSE, [62]) and the percent bias (PBIAS) were chosen as the metrics for performance evaluation.
NSE is a normalized value that describes the relative magnitude of the residual variance (“noise”) compared to the observed data variance [36] and is computed as follows in Equation (12):
N S E = 1 t = 1 T ( Q o t Q m t ) 2 t = 1 T ( Q o t Q ¯ o ) 2
where Q o t and Q m t is the observed and forecasted value of the dam inflow, respectively, at time t, and Q ¯ o is the mean observed value. NSE can range from −∞ to 1 and the coefficient of 1 means a perfect match of predicted value to the observed data. The coefficient of 0 indicate that the model predictions are same as the mean of the observed data, whereas the coefficients less than zero indicate that the observed mean is more accurate than the model’s predicted value. Consequently, the closer the model’s efficiency coefficient is to 1, the more accurate the model’s prediction is.
The PBIAS was calculated according to the Equation (13):
P B I A S   ( % ) = t = 1 T ( Q o t Q m t ) t = 1 T ( Q o t ) × 100
The PBIAS is the forecast error normalized by the observation and quantifies the tendency of predicted value being either greater (over-forecast: negative values) or smaller (under-forecast: positive values) than the observed data.
NSE values greater than 0.5 and PBIAS within 25% are considered “adequate” [63]. Table 6 shows the forecast results of SMA, BMA and NF with the value of NSE, PBIAS. BMA results showed that NSE was above 0.7 and PBIAS values were within 25% for all years. BMA and SMA methods showed more reliable prediction results than NF. Overall, the BMA method resulted in more accurate forecasts than the other two methods. In addition, the 95% prediction uncertainty ranges of the BMA method for the target year 2016, which is given as an example, envelop most observations in Figure 7. The dam watershed suffered severe meteorological droughts in 2015 with precipitation of 668.6 mm/year, approximately 60% of 40-year (1976–2015) average precipitation (1188 mm/year) [22]. The forecasting results show that the BMA method presented reliable predictability in drought year. These results showed the applicability that the proposed models could be used as a decision-making tool to manage dam operation for water supply in the context of BMA.

4. Conclusions

Water resources managers consider accurate lead time forecasting of streamflow as one of the most fundamental components for preparing water hazards such as droughts. For the streamflow prediction with the statistical method, the most basic method is a naive forecast. Additionally, an improved forecast skill has been shown by more advanced models ranging from simple regression method to the data-driven modeling approach such as ANN, SVM, etc. [31,64]. Many hydrologists focus on predicting hydrologic conditions using teleconnection between streamflow and climate indices [17,65].
This study was motivated by the recognition of worsening water shortage in Southern Chungnam Province in South Korea due to emerging development and increasing demand on water resources. We developed statistical reservoir inflow forecast models with a lead time of 3 months based on MLR, ANN and SVM methods using lagged teleconnection between streamflow and regularly updated climate indices. For the model building, a 19-year-long hydrologic data set for the Boryeong Dam basin and various climatic indices were used. In addition to individual model building, we expanded the scope to investigate their integrated predictive capability as an ensemble prediction method in the context of BMA for improving predictability. The important results obtained in this study are summarized as follows.
First, we found that monthly teleconnection variables can be useful predictors for forecasting the monthly reservoir inflow for a small basin. Although the forecast performance of three models in Boryeong Dam basin were not perfect, the results have shown the possibility that climatic indices based models could be applied to a small basin for sustainable water resources management. Though ANN and SVM were better than MLR, the MLR method still has the advantage of being more directly based on the relationship between climatic indices and reservoir inflow over the forecasts obtained from SVM and ANN model. Ensemble prediction results showed that BMA is more accurate and useful than SMA and NF. In addition, the prediction interval provided by BMA can be very helpful for early decision-making in response to drought situations.
Wind, temperature and geomorphologic characters are key factors affecting precipitation and reservoir inflow. Despite remarkable recent progress in atmospheric science, our understanding of climatic forcing and how they influence climate variability and predictability at different lead times and over different time scales remains far from being complete [66]. Further understanding of climatic forcing is needed to develop more accurate statistical and numerical streamflow forecasting models.
This study is considered a promising first step for stable dam operation using statistical model to forecast reservoir inflow at a reasonable lead time. Despite being the promising approach to forecast reservoir inflow from a small basin, the proposed models need to be tested for other basins for more general applicability. In the future study, climatic indices can be further combined with local variables such as precipitation and flows for current and previous months so that the direct cause of the inflow is supplemented by the remotely influential climate variables for better prediction performance.

Author Contributions

All the authors contributed to the conception and the development of this paper. D.L. and I.J. contributed to the calculations and the analyses. H.K. and J.Y. helped with the figure production and the writing of the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Korea University Grant.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. Jeong, W.; Rho, S.; Kim, Y. Assessment of Boryeong Multipurpose Dam’s Water Supply Capacity and Research on Drought Response; Chungnam Institute: Chungcheongnam-do, Korea, 2016. (In Korean) [Google Scholar]
  2. Lima, C.H.R.; Lall, U. Climate informed monthly streamflow forecasts for the Brazilian hydropower network using a periodic ridge regression model. J. Hydrol. 2010, 380, 438–449. [Google Scholar] [CrossRef]
  3. Sinha, A.; Sankarasubramanian, A.; Mazrooei, A. Decomposition of Sources of Errors in Monthly to Seasonal Streamflow Forecasts in a Rainfall–Runoff Regime. J. Hydrometeorol. 2014, 15, 2470–2483. [Google Scholar] [CrossRef] [Green Version]
  4. Harpold, A.A.; Sutcliffe, K.; Clayton, J.; Goodbody, A.; Vazquez, S. Does including soil moisture observations improve operational streamflow forecasts in snow-dominated watersheds? J. Am. Water Resour. Assoc. 2017, 53, 179–196. [Google Scholar] [CrossRef]
  5. Anderson, J.; van den Dool, H.; Barnston, A.; Chen, W.; Stern, W.; Ploshay, J. Present-day capabilities of numerical and statistical models for atmospheric extratropical seasonal simulation and prediction. Bull. Am. Meteorol. Soc. 1999, 80, 1349–1361. [Google Scholar] [CrossRef]
  6. Ropelewski, C.F.; Halpert, M.S. Global and Regional Scale Precipitation Patterns Associated with the El Niño/Southern Oscillation. Mon. Weather Rev. 1987, 115, 1606–1626. [Google Scholar] [CrossRef] [Green Version]
  7. Cayan, D.R.; Dettinger, M.D.; Diaz, H.F.; Graham, N.E. Decadal Variability of Precipitation over Western North America. J. Clim. 1998, 11, 3148–3166. [Google Scholar] [CrossRef]
  8. Piechota, T.C.; Dracup, J.A.; Fovell, R.G. Western US streamflow and atmospheric circulation patterns during El Nino Southern Oscillation. J. Hydrol. 1997, 201, 249–271. [Google Scholar] [CrossRef]
  9. McCabe, G.J.; Dettinger, M.D. Decadal variations in the strength of ENSO teleconnections with precipitation in the western United States. Int. J. Climatol. 1999, 19, 1399–1410. [Google Scholar] [CrossRef]
  10. Regonda, S.K.; Rajagopalan, B.; Clark, M. A new method to produce categorical streamflow forecasts. Water Resour. Res. 2006, 42. [Google Scholar] [CrossRef]
  11. Moss, M.E.; Pearson, C.P.; McKerchar, A.I. The Southern Oscillation index as a predictor of the probability of low streamflows in New Zealand. Water Resour. Res. 1994, 30, 2717–2723. [Google Scholar] [CrossRef]
  12. Chiew, F.H.S.; Piechota, T.C.; Dracup, J.A.; McMahon, T.A. El Nino/Southern Oscillation and Australian rainfall, streamflow and drought: Links and potential for forecasting. J. Hydrol. 1998, 204, 138–149. [Google Scholar] [CrossRef]
  13. Chen, C.-J.; Lee, T.-Y. Variations in the correlation between teleconnections and Taiwan’s streamflow. Hydrol. Earth Syst. Sci. 2017, 21, 3463–3481. [Google Scholar] [CrossRef] [Green Version]
  14. Chiew, F.H.S.; McMahon, T.A. Global ENSO-streamflow teleconnection, streamflow forecasting and interannual variability. Hydrol. Sci. J. 2002, 47, 505–522. [Google Scholar] [CrossRef] [Green Version]
  15. Xu, K.; Brown, C.; Kwon, H.-H.; Lall, U.; Zhang, J.; Hayashi, S.; Chen, Z. Climate teleconnections to Yangtze river seasonal streamflow at the Three Gorges Dam, China. Int. J. Climatol. 2007, 27, 771–780. [Google Scholar] [CrossRef] [Green Version]
  16. Chandimala, J.; Zubair, L. Predictability of stream flow and rainfall based on ENSO for water resources management in Sri Lanka. J. Hydrol. 2007, 335, 303–312. [Google Scholar] [CrossRef] [Green Version]
  17. Kalra, A.; Miller, W.P.; Lamb, K.W.; Ahmad, S.; Piechota, T. Using large-scale climatic patterns for improving long lead time streamflow forecasts for Gunnison and San Juan River Basins. Hydrol. Process. 2013, 27, 1543–1559. [Google Scholar] [CrossRef]
  18. Ouachani, R.; Bargaoui, Z.; Ouarda, T. Power of teleconnection patterns on precipitation and streamflow variability of upper Medjerda Basin. Int. J. Climatol. 2013, 33, 58–76. [Google Scholar] [CrossRef]
  19. Hidalgo-Munoz, J.M.; Gamiz, S.R.; Castro, Y.; Argueso, D.; Esteban, M.J. Long-range seasonal streamflow forecasting over the Iberian Peninsula using large-scale atmospheric and oceanic information. Water Resour. Res. 2015, 51, 3543–3567. [Google Scholar] [CrossRef] [Green Version]
  20. Schepen, A.; Zhao, T.; Wang, Q.J.; Zhou, S.; Feikema, P. Optimising seasonal streamflow forecast lead time for operational decision making in Australia. Hydrol. Earth Syst. Sci. 2016, 20, 4117–4128. [Google Scholar] [CrossRef] [Green Version]
  21. Zhao, T.; Schepen, A.; Wang, Q.J. Ensemble forecasting of sub-seasonal to seasonal streamflow by a Bayesian joint probability modelling approach. J. Hydrol. 2016, 541, 839–849. [Google Scholar] [CrossRef]
  22. Kim, W.; Lee, J.; Kim, J.; Kim, S. Assessment of Water Supply Stability for Drought-Vulnerable Boryeong Multipurpose Dam in South Korea Using Future Dry Climate Change Scenarios. Water 2019, 11, 2403. [Google Scholar] [CrossRef] [Green Version]
  23. EGIS: Environmental Geographic Information Services. Available online: https://egis.me.go.kr/main.do (accessed on 13 April 2020).
  24. WAMIS: Hydrologic Data of Dams. Available online: http://www.wamis.go.kr (accessed on 17 March 2020).
  25. Coley, D.M.; Waylen, P.R. Forecasting dry season streamflow on the Peace River at Arcadia, Florida, USA. J. Am. Water Resour. As. 2006, 42, 851–862. [Google Scholar] [CrossRef]
  26. Demirel, M.C.; Booij, M.J.; Hoekstra, A.Y. The skill of seasonal ensemble low-flow forecasts in the Moselle River for three different hydrological models. Hydrol. Earth Syst. Sci. 2015, 19, 275–291. [Google Scholar] [CrossRef] [Green Version]
  27. Risva, K.; Nikolopoulos, D.; Efstratiadis, A. A framework for dry period low flow forecasting in Mediterranean streams. Water Resour. Manag. 2018, 32, 4911–4932. [Google Scholar] [CrossRef]
  28. CPC: Climate Information. Available online: http://www.cpc.ncep.noaa.gov/ (accessed on 17 March 2020).
  29. Mathias, S.; Merz, B.; Apel, H. Seasonal forecasting of hydrological drought in the Limpo Basin: A comparison of statistical methods. Hydrol. Earth Syst. Sci. 2017, 21, 1611–1629. [Google Scholar]
  30. Hair, J.F., Jr.; Anderson, R.E.; Tatham, R.L.; Black, W.C. Multivariate Data Analysis; Prentice-Hall: Englewood Cliffs, NJ, USA, 1995; pp. 200–206. [Google Scholar]
  31. Zhang, Z.H.; Zhang, Q.; Singh, V.P. Univariate streamflow forecasting using commonly used data-driven models: Literature review and case study. Hydrolog. Sci. J. 2018, 63, 1091–1111. [Google Scholar] [CrossRef]
  32. Salas, J.; Fu, C.; Rajagopalan, B. Long-Range forecasting of Colorado streamflows based on hydrologic, atmospheric, and oceanic data. J. Hydrol. Eng. 2011, 16, 508–520. [Google Scholar] [CrossRef] [Green Version]
  33. Kabir, R.; Hsieh, W.; Cannon, A. Daily streamflow forecasting by machine learning methods with weather and climate inputs. J. Hydrol. 2012, 414, 284–293. [Google Scholar]
  34. Cheng, C.T.; Feng, Z.K.; Niu, W.J.; Liao, S.L. Heuristic methods for reservoir monthly inflow forecasting: A case study of Xinfengjiang Reservoir in Pearl River, China. Water 2015, 7, 4477–4495. [Google Scholar] [CrossRef]
  35. Sudheer, C.; Maheswaran, R.; Panigrahi, B.; Mathur, S. A hybrid SVM-PSO model for forecasting monthly streamflow. Neural Comput. Appl. 2014, 6, 1381–1389. [Google Scholar] [CrossRef]
  36. Guo, J.; Zhou, J.Z.; Qin, H.; Zou, Q.; Li, Q.Q. Monthly streamflow forecasting based on improved support vector machine model. Expert. Syst. Appl. 2011, 10, 13073–13081. [Google Scholar] [CrossRef]
  37. Zhang, X.L.; Wang, H.X.; Peng, A.B.; Wang, W.C.; Li, B.J.; Huang, X.D. Quantifying the uncertainties in data-driven models for reservoir inflow prediction. Water Resour. Res. 2020, 34, 1479–1493. [Google Scholar] [CrossRef]
  38. Babaei, M.; Moeini, R.; Ehsanzadeh, E. Artificial neural network and support vector machine models for inflow prediction of dam reservoir (case study: Zayandehroud Dam reservoir). Water Resour. Manag. 2019, 6, 2203–2218. [Google Scholar] [CrossRef]
  39. Patel, S.; Hardaha, M.K.; Seetpal, M.K.; Madankar, K.K. Multiple linear regression model for stream flow estimation of Wainganga River. Am. J. Water Sci. Eng. 2016, 2, 1–5. [Google Scholar]
  40. Moradi, A.M.; Dariane, A.B.; Yang, G.; Block, P. Long-range reservoir inflow forecasts using large-scale climate predictors. Int. J. Clinatol. 2020, 22. [Google Scholar] [CrossRef]
  41. Hossain, I.; Rasel, H.M.; Imteaz, M.A.; Mekanik, F. Long-term seasonal rainfall forecasting using linear and non-linear modelling approaches: A case study for Western Australia. Meteorol. Atmos. Phys. 2020, 132, 131–141. [Google Scholar] [CrossRef]
  42. Esha, R.I.; Imteaz, M.A. Assessing the predictability of MLR models for long-term streamflow using lagged climate indices as predictors: A case study of Australia. Hydrol. Res. 2019, 50, 262–281. [Google Scholar] [CrossRef]
  43. Tabachnick, B.G.; Fidell, L.S. Using Multivariate Statistics, 3rd ed.; HarperCollins College Publishers: New York, NY, USA, 1996. [Google Scholar]
  44. Guyon, I.; Matic, N.; Vapnik, V. Discovering informative patterns and data cleaning. In Advances in Knowledge Discovery and Data Mining; Usama, M.F., Gregory, P.S., Padhraic, S., Ramasamy, U., Eds.; American Association for Artificial Intelligence Press: Menlo Park, CA, USA, 1996; pp. 181–203. [Google Scholar]
  45. Müller, K.R.; Smola, A.J.; Rätsch, G.; Schölkopf, B.; Kohlmorgen, J.; Vapnik, V. Predicting time series with support vector machines. In Proceedings of the Artificial Neural Networks—ICANN’97, Berlin/Heidelberg, Germany, 8–10 October 1997; pp. 999–1004. [Google Scholar]
  46. Cortes, C.; Vapnik, V. Support-Vector Networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
  47. Zaefizadeh, M.; Khayatnezhad, M.; Gholamin, R. Comparison of Multiple Linear Regressions (MLR) and Artificial Neural Network (ANN) in Predicting the Yield Using its Components in the Hulless Barley. Am.-Eur. J. Agric. Environ. Sci. 2011, 11, 60–64. [Google Scholar]
  48. Govindaraju, R.S. Artificial Neural Networks in Hydrology. I: Preliminary Concepts. J. Hydrol. Eng. 2000, 5, 115–123. [Google Scholar]
  49. Melesse, A.M.; Ahmad, S.; McClain, M.E.; Lim, Y.H. Suspended sediment load prediction of river systems: An artificial neural network approach. Agric. Water Manag. 2011, 98, 855–866. [Google Scholar] [CrossRef]
  50. Wilks, D.S. Statistical Methods in the Atmospheric Sciences: An Introduction; Wilks, D.S., Ed.; International Geophysics Series, 59; Academic Press: San Diego, CA, USA, 1995. [Google Scholar]
  51. Evans, J.D. Straightforward Statistics for the Behavioral Sciences; Brooks/Cole Publishing: Pacific Grove, CA, USA, 1996. [Google Scholar]
  52. Kim, Y.; Kim, M.; Lee, W. An investigation of large-scale climate indices with the influence on temperature and precipitation variation in Korea. Atmosphere 2008, 18, 83–95. (In Korean) [Google Scholar]
  53. Kwon, H.H.; Brown, C.; Xu, K.Q.; Lall, U. Seasonal and annual maximum streamflow forecasting using climate information: Application to the Three Gorges Dam in the Yangtze River basin, China. Hydrol. Sci. J. 2009, 54, 582–595. [Google Scholar] [CrossRef]
  54. Sagarika, S.; Kalra, A.; Ahmad, S. Interconnections between oceanic-atmospheric indices and variability in the US streamflow. J. Hydrol. 2015, 525, 724–736. [Google Scholar] [CrossRef]
  55. Hinkle, D.E.; Wiersma, W.; Jurs, S.G. Applied Statistics for the Behavioral Sciences, 5th ed.; Houghton Mifflin: Boston, MA, USA, 2003. [Google Scholar]
  56. Foody, G.M.; McCulloch, M.B.; Yates, W.B. The effect of training set size and composition on artificial neural-network classification. Int. J. Remote Sens. 1995, 16, 1707–1723. [Google Scholar] [CrossRef]
  57. Sug, H. The effect of training set size for the performance of neural networks of classification. WSEAS Trans. Comp. 2010, 9, 1297–1306. [Google Scholar]
  58. Sordo, M.; Zeng, Q. On Sample Size and Classification Accuracy: A Performance Comparison; Springer: Berlin/Heidelberg, Germany, 2005. [Google Scholar]
  59. Rodriguez-Perez, R.; Vogt, M.; Bajorath, J. Influence of varying training set composition and size on support vector machine-based prediction of active compounds. J. Chem. Inf. Model. 2017, 57, 710–716. [Google Scholar] [CrossRef] [Green Version]
  60. Duan, Q.; Ajami, N.K.; Gao, X.; Sorooshian, S. Multi-model ensemble hydrologic prediction using Bayesian model averaging. Adv. Water Res. 2007, 30, 1371–1386. [Google Scholar] [CrossRef] [Green Version]
  61. Vrugt, J.A. MODELAVG: A MATLAB Toolbox for Postprocessing of Model Ensembles; Department of Civil and Environmental Engineering, University of California Irvine: Irvine, CA, USA, 2016; Available online: http://faculty.sites.uci.edu/jasper/files/2016/04/manual_Model_averaging.pdf (accessed on 15 August 2019).
  62. Nash, J.E.; Sutcliffe, J.V. River flow forecasting through conceptual models. I: A discussion of principles. J. Hydrol. 1970, 10, 282–290. [Google Scholar] [CrossRef]
  63. Moriasi, D.N.; Arnold, J.G.; Van Liew, M.W.; Bingner, R.L.; Harmel, R.D.; Veith, T.L. Model evaluation guidelines for systematic quantification of accuracy in watershed simulations. Trans. ASABE 2007, 50, 885–900. [Google Scholar] [CrossRef]
  64. Wood, A.W.; Kumar, A.; Lettenmaier, D.P. A retrospective assessment of national centers for environmental prediction climate model-based ensemble hydrologic forecasting in the western United States. J. Geophys. Res. Atmos. 2005, 110. [Google Scholar] [CrossRef]
  65. Kalra, A.; Ahmad, S. Using oceanic-atmospheric oscillations for long lead time streamflow forecasting. Water. Resour. Res. 2009, 45. [Google Scholar] [CrossRef]
  66. Qian, B.; Saunders, M.A.; Summer, U.K. Temperature and Its Links to Preceding Eurasian Snow Cover, North Atlantic SSTs, and the NAO. J. Clim. 2003, 16, 4108–4120. [Google Scholar] [CrossRef]
Figure 1. Location of the Boryeong Dam watershed and its elevation.
Figure 1. Location of the Boryeong Dam watershed and its elevation.
Applsci 10 03470 g001
Figure 2. Monthly average precipitation, inflow depth and runoff coefficient of study area.
Figure 2. Monthly average precipitation, inflow depth and runoff coefficient of study area.
Applsci 10 03470 g002
Figure 3. The designed network diagram for artificial neural network (ANN).
Figure 3. The designed network diagram for artificial neural network (ANN).
Applsci 10 03470 g003
Figure 4. Flowchart of small basin scale dam inflow prediction by multiple linear regression (MLR), a support vector machine (SVM) and ANN.
Figure 4. Flowchart of small basin scale dam inflow prediction by multiple linear regression (MLR), a support vector machine (SVM) and ANN.
Applsci 10 03470 g004
Figure 5. Moving correlations (years 1998–2016) between the monthly reservoir inflow and the climatic indices.
Figure 5. Moving correlations (years 1998–2016) between the monthly reservoir inflow and the climatic indices.
Applsci 10 03470 g005
Figure 6. Comparison of the observed (dot shape) and predicted (dashed line) reservoir inflow from three models for January (a), February (b), March (c), April (d), May (e), November (f) and December (g) of years 1998–2016.
Figure 6. Comparison of the observed (dot shape) and predicted (dashed line) reservoir inflow from three models for January (a), February (b), March (c), April (d), May (e), November (f) and December (g) of years 1998–2016.
Applsci 10 03470 g006aApplsci 10 03470 g006bApplsci 10 03470 g006c
Figure 7. Comparison of forecasts between SMA and BMA for January (a), February (b), March (c), April (d), May (e), November (f) and December (g) of target year 2016. The red line displays the mean point forecast of the BMA method, whereas the block dots are observations. Grey region indicates 95% prediction interval.
Figure 7. Comparison of forecasts between SMA and BMA for January (a), February (b), March (c), April (d), May (e), November (f) and December (g) of target year 2016. The red line displays the mean point forecast of the BMA method, whereas the block dots are observations. Grey region indicates 95% prediction interval.
Applsci 10 03470 g007aApplsci 10 03470 g007bApplsci 10 03470 g007c
Table 1. Monthly average precipitation, reservoir inflow depth and runoff coefficient (1998–2016).
Table 1. Monthly average precipitation, reservoir inflow depth and runoff coefficient (1998–2016).
DataJanFebMarAprMayJunJulAugSepOctNovDec
Precipitation (mm)28.738.650.396.3103.4163.4326.6274.8165.267.745.639.9
Inflow depth (mm)10.917.018.444.650.363.8237.9198.5119.133.613.515.0
Runoff coefficient0.380.440.370.460.480.390.720.720.720.500.300.38
Table 2. List of teleconnection indices used in this study.
Table 2. List of teleconnection indices used in this study.
AbbreviationDefinition
AAOAntarctic Oscillation is the first leading mode from the EOF analysis of monthly mean height anomalies at 700-hPa.
AMMAtlantic Meridional Mode is the atmosphere-ocean variability in the tropical Pacific and tropical Atlantic.
AMOAtlantic Multidecadal Oscillation is an index of the North Atlantic temperature.
AOArctic Oscillation is a pattern of atmospheric pressures of the Arctic and North Atlantic oceans.
EAEast Atlantic Pattern is associated with surface temperatures in Europe and US.
EAWREastern Asia/Western Russia index is the second prominent mode of low-frequency variability over the North Atlantic.
ERSSTExtended Reconstructed Sea Surface Temperature is a global grid monthly sea surface temperature.
MEIMultivariate ENSO Index is related to sea-level pressure, surface wind, sea surface temperature, surface air temperature and total cloudiness fraction of the sky.
NAONorth Atlantic Oscillation is a dominant teleconnection patterns ranging from North America to Europe and North Asia.
NINA3NINA3 is the average of sea surface temperature anomalies over the region from 5N to 5S and 150W to 90W.
PDOPacific Decadal Oscillation is associated with monthly SSTs across the North Pacific.
SOISouthern Oscillation Index is the development and intensity of El Nino or La Nina events in the Pacific Ocean.
TNATropical Northern Atlantic Index is the anomaly of the average of the monthly SST from 5.5N to 23.5N and 15W to 57.5W.
WPWestern Pacific Index is a primary mode of low-frequency variability over the North Pacific in all months.
Table 3. List of teleconnection indices used for each month (the number in the ( ) means the months lagged).
Table 3. List of teleconnection indices used for each month (the number in the ( ) means the months lagged).
MonthClimatic IndicesCorrelation Coefficientp-ValueVIF
JanTNA(10), ERSST(3)−0.67, 0.610.001, 0.005<1.0
FebWP(4), MEI(8)−0.67, 0.680.001, 0.001<1.3
MarEA(4), AO(5), AMM(3), ERSST(3)−0.71, −0.64, −0.59, 0.450.0007, 0.002, 0.007, 0.05<1.7
AprNINA3(10), SOI(5), EAWR(6), PDO(8)0.83, −0.57, 0.44, 0.660.0001, 0.009, 0.05, 0.001<4.1
MayEA(9), AO(9), AAO(12), ERSST(3)0.67, 0.45, −0.47, 0.470.001, 0.04, 0.03, 0.03<1.3
NovTNA(8), ERSST(3)−0.54, 0.750.01, 0.0002<1.1
DecNAO(8), NINO3(5), ERSST(3)0.46, 0.57, 0.660.04, 0.01, 0.002<1.3
Table 4. Selected locations from the global climate data for the extended reconstructed sea surface temperature (ERSST).
Table 4. Selected locations from the global climate data for the extended reconstructed sea surface temperature (ERSST).
MonthJanFebMarAprMayNovDec
Locations selected48° N, 210° E40° N, 150° E26° N, 244° E32° N, 198° E46° N, 146° E60° N, 152° E48° N, 210° E
Correlation coefficient0.610.400.450.640.470.750.66
Table 5. Monthly forecast skill scores (r and S) between the observed and predicted dam inflow with a 3-months lead time. The bold numbers represent the best prediction skill score of each month.
Table 5. Monthly forecast skill scores (r and S) between the observed and predicted dam inflow with a 3-months lead time. The bold numbers represent the best prediction skill score of each month.
MonthMLRANNSVM
rSrSrS
Jan0.650.210.670.200.670.26
Feb0.690.260.830.440.620.21
Mar0.890.550.860.490.900.56
Apr0.860.490.760.350.830.44
May0.780.330.810.410.710.30
Nov0.810.410.880.520.800.32
Dec0.750.310.750.270.760.34
Table 6. Performance statistics for Bayesian model averaging (BMA), simple model averaging (SMA) and naive forecast (NF).
Table 6. Performance statistics for Bayesian model averaging (BMA), simple model averaging (SMA) and naive forecast (NF).
YearBMASMANaive Forecast
NSEPBIAS (%)NSEPBIAS (%)NSEPBIAS (%)
20130.786.450.7714.41−0.0237.37
20140.773.080.5210.37−1.6814.92
20150.89−1.330.71−7.11−1.3436.41
20160.71−21.870.43−36.340.3144.19

Share and Cite

MDPI and ACS Style

Lee, D.; Kim, H.; Jung, I.; Yoon, J. Monthly Reservoir Inflow Forecasting for Dry Period Using Teleconnection Indices: A Statistical Ensemble Approach. Appl. Sci. 2020, 10, 3470. https://doi.org/10.3390/app10103470

AMA Style

Lee D, Kim H, Jung I, Yoon J. Monthly Reservoir Inflow Forecasting for Dry Period Using Teleconnection Indices: A Statistical Ensemble Approach. Applied Sciences. 2020; 10(10):3470. https://doi.org/10.3390/app10103470

Chicago/Turabian Style

Lee, Donghee, Hwansuk Kim, Ilwon Jung, and Jaeyoung Yoon. 2020. "Monthly Reservoir Inflow Forecasting for Dry Period Using Teleconnection Indices: A Statistical Ensemble Approach" Applied Sciences 10, no. 10: 3470. https://doi.org/10.3390/app10103470

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop