Abstract
Streamflow forecasting is used in river training and management, river restoration, reservoir operation, power generation, irrigation, and navigation. In hydrology, streamflow forecasting is often done using time series analysis. Although monthly streamflow time series are stochastic, they exhibit seasonal and periodic patterns. Therefore, streamflow forecasting entails modeling two main aspects: seasonality and correlation structure. Spectral analysis can be employed to characterize patterns of streamflow variation and identify the periodicity of streamflow. That is, it permits to extract significant information for understanding the streamflow process and prediction thereof. For forecasting streamflow, spectral analysis has, however, not yet been widely applied. Streamflow spectra can be determined using entropy theory. There are three ways to employ entropy theory: (1) Burg entropy, (2) configurational entropy, and (3) relative entropy. In either way, the methodology involves determination of spectral density, determination of parameters, and extension of autocorrelation function. This paper reviews the methods of spectral analysis using the entropy theory and tests them using streamflow data.
Similar content being viewed by others
1 Introduction
Streamflow forecasting is the process of estimating future streamflow in advance, based on available information. Timely streamflow may help the public to reduce the impacts of a flood or a drought. As streamflow is stochastic, time series analysis is a widely used method for streamflow forecasting. The most widely used models are autoregressive (AR) or autoregressive moving average (ARMA) models, which have been applied for forecasting streamflow under different conditions (Carlson et al. 1970; Haltiner and Salas 1988; Jones and Brelsfor 1967; Salas and Obeysekera 1982). However, the underlying linear assumption of these methods is the major drawback that sometimes limits their application (Elshorbagy et al. 2002). On the other hand, it has been shown that spectral analysis is capable of extracting significant information for understanding the streamflow process and prediction thereof (Fleming et al. 2002; Ghil et al. 2002; Labat 2005; Marques et al. 2006; Molenat et al. 1999). Thus, entropy theory is employed to combine spectral analysis and time series analysis for streamflow forecasting.
The development of Burg entropy spectral analysis (BESA) (Burg 1967, 1975) not only improved the resolution of the spectral density but also improved the reliability of prediction of streamflow. BESA has been applied to forecast hydrological series and it is recommended over classical methods (Krstanovic and Singh 1989, 1991a, b; Singh 2013). However, due to the weakness in determining multi-peak spectral density for non-stationary conditions (Boshnakov and Lambert-Lacroix 2012), it sometimes does not work well for monthly streamflow with strong seasonal and periodic characteristics.
The configurational entropy spectral analysis (CESA) was introduced by Frieden (1972) and Gull and Daniell (1978), which is sometimes also referred to as maximum entropy method 2 (MEM2) or spectral MESA (SMESA) (Katsakos-Mavromichalis et al. 1985; Tzannes et al. 1985; Tzannes and Avgeris 1981). Superior to BESA, CESA has been shown to be not restricted to the AR process (Liefhebber and Boekee 1987; Ortigueira et al. 1981). Configurational entropy has also been applied for spectral analysis and shown to have a better resolution than BESA for autoregressive moving average (ARMA) and moving average (MA) processes, and is comparable to BESA for the autoregressive (AR) process (Nadeu et al. 1981). It has been applied to forecast monthly streamflow and has shown better reliability than BESA for both high flow and low flow (Cui and Singh 2015).
Besides, there is relative entropy spectral analysis (RESA), which was developed by Shore (1979, 1981) as an extension of Burg’s maximum entropy spectral analysis, where the spectral power was considered as a random variable. RESA can be used for streamflow forecasting as well. Later, another version of RESA was developed by Tzannes et al. (1985), considering frequency as a random variable. The RESA spectra are reported to have higher resolution and are more accurate in detecting peak location than other methods for spectral computation (Papademetriou 1998). Besides, the RESA theory reduces the number of prediction coefficients by relying on the prior information (Schroeder 1982). However, RESA has not been applied to streamflow forecasting yet.
The objective of this paper is to review and compare three methods of entropy spectral analysis, which are Burg entropy spectral analysis (BESA), configurational entropy spectral analysis (CESA) and relative entropy spectral analysis (RESA).
2 Entropy Spectral Analyses
Let streamflow time series y(t) be denoted as y1,…, yT, where T is the total time period. Transferring to the frequency (f) domain, the information on streamflow is stored in the spectral density p(f). Considering frequency f as a random variable, the normalized spectral density p(f) can be taken as the probability density function. Thus, the Burg entropy can be defined as
where W = 1/(2Δt) is the Nyquist fold-over frequency and f is the frequency that varies from –W to W, Δt is the sampling period. It is observed that the Burg entropy is defined as the sum of log of the spectral density values.
Taking the expectation of the integral of the log of spectral density, the configurational entropy is defined as
On the other hand, with given prior spectral density q(f), the relative entropy of the spectral density p(f) can be defined as
The prior spectral density can be taken as a background noise with the peak assumed at the observed periodicity. It is noted that when a uniform prior is taken, the relative entropy reduces to the configurational entropy.
The development of entropy spectral analyses comprises the following steps: (1) derivation of entropy-based spectral density, (2) computation of the Lagrange multipliers, (3) extension of the autocorrelation function, and (4) forecasting of streamflow.
2.1 Derivation of Entropy-Based Spectral Density
To obtain the least biased spectral density, one needs to maximize the Burg and configurational entropy but minimize the relative entropy to the prior subject to specified constraints. The constraints can be formed from the relationship between the spectral density and autocorrelation, which can written as
where \( i=\sqrt{-1} \) and ρ n is the autocorrelation function of n-th lag. When n = 0, Eq. (4) reduces to
Thus, entropy can be maximized or minimized, subject to the constraints, using the Lagrange multipliers, in which the Lagrangian function can be formulated as
where λ n , n = 0, 1, 2, …, N, are the Lagrange multipliers, and H(f) is the entropy to be maximized [as H B (f) or H C (f)] or to be minimized [as H R (f)]. Taking the partial derivative of L(f) with respect to the spectral density and equating to zero, \( \frac{\partial L(f)}{\partial p(f)}=0 \), the least-biased spectral densities obtained from the maximization of the Burg entropy and configurational entropy and from the minimization of the relative entropy, respectively, are
It can be seen from the above three equations that the spectral density derived from the Burg entropy is in the form of inverse of polynomials, while the ones from the configurational entropy and relative entropy are in the exponential form. The form in Eq. (7) suggests that BESA is related to the linear prediction process.
2.2 Determination of Parameters
Due to the different forms of the spectral density, the ways of determining Lagrange multipliers are different. For the Burg entropy, the Lagrange multipliers and prediction coefficients can be computed from the Levinson-Burg algorithm developed by Burg (1967, 1975). The Levinson-Burg algorithm is a recursive algorithm for estimating prediction coefficients, and improves the original Levinson algorithm by computing forward and backward prediction error together to update the coefficient of next order (Collomb 2009; Lin and Wong 1990).
However, for the configurational entropy and relative entropy, the cepstrum analysis is incorporated. Taking the inverse Fourier transform of the log-magnitude of Eq. (9), one obtains
It can be seen from Eq. (10) that there are two terms relating to the spectral density that can turn to the cepstrum of autocorrelation, which is also called autocepstrum.
Let the prior cepstrum of autocorrelation be denoted as e q.(n), which is transferred from the prior spectral density as
Similarly, the posterior cepstrum of autocorrelation e p (n) transform from the posterior spectral density can be expressed as
Doing the integration of both sides of Eq. (10), one gets
where δ n is the delta function defined as:
Equation (13) can be expanded as a set of N linear equations:
Equation (15) enables to solve for the Lagrange multipliers in a straight-forward manner. Thus, the Lagrange multipliers can be estimated from the summation of the prior and posterior autocepstrums. The prior autocepstrum can be obtained from the observed periodicity of streamflow. The posterior autocepstrum can be estimated from the following recursive function introduced by Nadeu (1992) as
It is seen from Eq. (16) that the nth lag of cepstrum e p (n) is dependent on the previous n-1 lags of cepstrum and n-lags of autocorrelation. Thus, for given N lag autocorrelations, the cepstrum of autocorrelation can be computed up to lag N.
It can be noted that for solving parameters of the configurational entropy without prior, the cepstrum e q in Eq. (15) equals 0 and diminishes, and the Lagrange multipliers are solved from
where e p (n) is computed from Eq. (16) as well.
2.3 Extension of Autocorrelation
According to Burg’s (1967, 1975) derivation, maximization of the Burg entropy allows the autocorrelation to be extended as a linear combination of previous lags with the prediction coefficients as
On the other hand, for the configurational entropy and relative entropy, the autocorrelation is extended with the inverse relationship of Eq. (16) using the autocepstrum as
When no prior is given, Eq. (19) reduces to
2.4 Streamflow Forecasting
Streamflow is forecasted in the manner that autocorrelation function is extended. Thus using BESA, streamflow is forecasted by a linear combination of past series weighted by the coefficients in Eq. (18) to extend the autocorrelation, which becomes
When using CESA or RESA, streamflow is forecasted with cepstrum analysis which is the way autocorrelation is extended. Thus, streamflow is forecasted by
where c(j) is the cepstrum of the time series and equals to \( \frac{1}{2}e(n) \). Then Eq. (22) can be written as
When no prior is given, e p is 0, thus, streamflow forecasted by CESA becomes
The order of forecasting model m is identified by the Akaike information criterion (AIC) or the Bayesian information criterion (BIC) (Box and Jenkins 1970; Hipel and McLeod 1994).
3 Application
The proposed three entropy spectral analysis methods, BESA, CESA and RESA, are verified using observed streamflow data from Iowa River. Iowa River is a tributary of Mississippi River, which is about 520 km long and has a drainage area of 33,000 km2. Two stations are chosen for comparison, one from the upstream and the other from the downstream. The upstream station of Iowa River has a mean monthly streamflow of 9.3 m3/s, while the downstream station has a mean of 305 m3/s.
3.1 Estimation of Spectral Density
First of all, the spectral densities, estimated by BESA, CESA and RESA, are compared to the one estimated from the Fast Fourier transform (FFT), as plotted in Fig. 1. As seen from the figure, the spectral density of the upstream station is more likely to be multi-peaked than that of the downstream station. For the upstream station, spectral peaks at frequencies 1/12th, 1/6th and 1/4th are significant and even peaks at frequencies 1/3th, 5/12th and 1/2th are visible. However, for the downstream station, there are no more peaks after frequency 1/4th. The performance of the three entropy spectral analyses was evaluated by the Itakura-Saito distortion, which is a measure of the perceptual difference between an original spectrum and its estimate. The distortion is defined as
where p(f) represents the spectral density from FFT and \( \widehat{p}(f) \) is the estimated spectral density. The smaller value represents a better fit.
As seen in Fig. 1, BESA did not perform well in estimating the spectral density with multi-peaks. BESA estimated spectral peaks at frequencies 1/12th and 1/6th with the same significance for the upstream station and estimated spectral peaks at frequency 1/6th with higher significance than at 1/12th frequency. However, for monthly streamflow of Iowa River, the 12 month periodicity is the most important periodicity, and should always be most significant. On the contrary, the CESA and RESA methods correctly estimated the most significant peak at 1/12th frequency. One concern of CESA is that it ignores all less significant peaks to keep the peak at 1/12th frequency most significant. Though, using RESA, those less significant peaks were estimated, and were found to be consistent to the ones from FFT.
The Itakura-Saito distortion, listed in Table 1, had the largest values using BESA and smallest values using RESA for both upstream and downstream stations. Besides, the difference between BESA and CESA was larger than that between CESA and RESA. It suggests that for streamflow of Iowa River, the exponential form of CESA and RESA fits better than the polynomial form of BESA.
3.2 Forecasted Streamflow
Streamflow was forecasted by the three entropy spectral analysis methods with a 3 year lead time for an upstream station and a 1 year lead time for a downstream station on Iowa River, as shown in Fig. 2. The goodness of forecasting was examined by RMSE, r 2 and NSE, which are defined as
where Q o (i) is the i-th observed streamflow; Q f (i) is the i-th forecasted streamflow; and \( \overline{Q_o} \) and \( \overline{Q_f} \) are the average values of observed and computed discharges, respectively.
As shown in Fig. 2, observed monthly streamflow of Iowa River has the largest value in April every year. But it did not monotonically increase or decrease. After a small drop in the month of May, streamflow increases again in June then drops. During the low flow season from September to February, another small peak occurs in October or November. The entropy spectral analyses discussed in this paper, though not exactly, forecasted streamflow of the above characteristics and fitted the observations with r 2 of over 0.5. The forecasted streamflow by RESA was closest to the observations both for upstream and downstream stations, which led to r 2 higher than 0.8.
The peak streamflow of the upstream station in April was correctly forecasted by RESA with less than 2 % error, and by CESA and BESA with errors around 13–17 %. However, the peak time forecasted by BESA and CESA were different. The peak forecasted by BESA was earlier than by CESA, even earlier than the observed value in the 3rd lead year. The earlier peaks forecasted by BESA missed more streamflow volume during the high flow season than the other methods. For the downstream station of Iowa River, the forecast error in streamflow peak by RESA was still within 2 %, but BESA and CESA underestimated the peak by around 19 %.
During the low flow season, the advantage of RESA was significant than the others, where streamflow was forecasted close to the observation, as shown in Fig. 2. However, it is noted that BESA had the poorest forecasting during the low-flow season, compared to the other two methods. The forecasted streamflows in November of the second and third lead years were 4.8 and 3.2 m3/s over the observed value, which is around 1.5 times the observed value. CESA performs between RESA and BESA for forecasting flow in the low flow season.
All three measurements of goodness of fit in Table 2 show that RESA yielded forecasts closest to the observations, and CESA was slightly better than BESA. It seemed that the way of forecasting streamflow using the recursive function of cepstrum analysis had an advantage over linear forecasting used by BESA. The reason is that cepstrum analysis, especially when incorporating prior cepstrum by RESA, helps realize homomorphic characteristics of time series (Oppenheim and Schafer 2004), thus is more applicable than linear forecasting.
The relative errors in forecasts in versus lead time are shown in Fig. 3. As shown in Fig. 3a, RESA generated the smallest errors around 0 and does not change much during the 3 year lead time. However, the relative errors by BESA and CESA tended to get larger as the lead time increased. Larger errors were noticed for forecasting streamflow during the low flow season for BESA and CESA, which is consistent with the previous finding. For the downstream station on Iowa River in Fig. 3b, the relative errors of CESA and RESA were distributed similarly and were closer to 0 than for BESA.
4 Conclusions
Three entropy spectral analysis methods, developed from Burg entropy, configurational entropy and relative entropy, are reviewed in the paper. The relative entropy spectral analysis yields the highest resolution in estimating the spectral density of observed streamflow of Iowa River. It shows that the exponential form obtained from either CESA or RESA fit streamflow of Iowa River better than from BESA. The relative entropy spectral analysis also provides the highest reliability in streamflow forecasting. When forecasting lead time increases, RESA is more consistent than the other two methods.
References
Boshnakov GN, Lambert-Lacroix S (2012) A periodic Levinson-Durbin algorithm for entropy maximization. Comput Stat Data An 56(1):15–24
Box GEP, Jenkins GM (1970) Time series analysis; forecasting and control. Holden-Day series in time series analysis. Holden-Day, San Francisco
Burg JP (1967) Maximum entropy spectral analysis. In: Proceedings of 37th Meeting of Society Exploration Geophysics, pp 34–41
Burg JP (1975) Maximum entropy spectral analysis. Ph.D., Stanford University
Carlson RF, Maccormi AJ, Watts DG (1970) Application of linear random models to 4 annual streamflow series. Water Resour Res 6:1070–1078. doi:10.1029/Wr006i004p01070
Collomb C (2009) Burg’s method, alogrithm and recursion. http://www.emptyloop.com/technotes/A%20tutorial%20on%20Burg%27s%20method,%20algorithm%20and%20recursion.pdf
Cui H, Singh VP (2015) Configurational entropy theory for streamflow forecasting. J Hydrol 521:1–17. doi:10.1016/j.jhydrol.2014.11.065
Elshorbagy A, Simonovic SP, Panu US (2002) Estimation of missing streamflow data using principles of chaos theory. J Hydrol 255:123–133. doi:10.1016/S0022-1694(01)00513-3
Fleming SW, Lavenue AM, Aly AH, Adams A (2002) Practical applications of spectral analysis to hydrologic time series. Hydrol Process 16(2):565–574
Frieden BR (1972) Restoring with maximum likelihood and maximum entropy. J Opt Soc Am 62:511–518
Ghil M, Allen MR, Dettinger MD, Ide K, Kondrashov D, Mann ME, Robertson AW, Saunders A, Tian Y, Varadi F, Yiou P (2002) Advanced spectral methods for climatic time series. Rev Geophys 40(1):1-1–1-41
Gull SF, Daniell GJ (1978) Image-reconstruction from incomplete and noisy data. Nature 272:686–690
Haltiner JP, Salas JD (1988) Development and testing of a multivariate, seasonal ARMA(1,1) model. J Hydrol 104:247–272. doi:10.1016/0022-1694(88)90168-0
Hipel KW, McLeod AI (1994) Time series modelling of water resources and environmental systems developments in water science, v. 45. Elsevier, Burlington, p 1
Jones RH, Brelsfor WM (1967) Time series with periodic structure. Technometrics 9:403–408
Katsakos-Mavromichalis NA, Tzannes MA, Tzannes NS (1985) Frequency resolution: a comparative study of four entropy methods. Kybernetes 15:25–32
Krstanovic PF, Singh VP (1989) An entropy based method for flood forecasting. In: New Directions for Surcace Water Modeling (Proceedings of the Baltimore Symposium, May 1989). IAHS Publ
Krstanovic PF, Singh VP (1991a) A univariate model for long-term streamflow forecasting.1. Development. Stoch Hydrol Hydraul 5:173–188
Krstanovic PF, Singh VP (1991b) A univariate model for long-term streamflow forecasting.2. Application. Stoch Hydrol Hydraul 5:189–205
Labat D (2005) Recent advances in wavelet analyses: Part I. A review of concepts. J Hydrol 314(1–4):275–288
Liefhebber F, Boekee DE (1987) Minimum information spectral-analysis. Signal Process 12:243–255
Lin DM, Wong EK (1990) A survey on the maximum entropy method and parameter spectral estimation. Phys Rep 193:41–135. doi:10.1016/0370-1573(90)9
Marques CAF, Ferreira JA, Rocha A, Castanheira JM, Melo-Goncalves P, Vaz N, Dias JM (2006) Singular spectrum analysis and forecasting of hydrological time series. Phys Chem Earth 31(18):1172–1179
Molenat J, Davy P, Gascuel-Odoux C, Durand P (1999) Study of three subsurface hydrologic systems based on spectral and cross-spectral analysis of time series. J Hydrol 222(1–4):152–164
Nadeu C (1992) Finite length cepstrum modeling - a simple spectrum estimation technique. Signal Process 26:49–59
Nadeu C, Sanvicente E, Bertran M (1981) A new algorthm for spectral esimation. International Conference on Digital Signal Processing. Florence, Italy, pp 463–470
Oppenheim AV, Schafer RW (2004) From frequency to quefrency: a history of the cepstrum. IEEE Signal Process Mag 21:95–106
Ortigueira MD, Garcia-Gomes R, Tribolet JM (1981) An iterative algorithm for maximum flatness spectral analysis. In: International Conference on Digital Signal Processing, Florence, Italy, pp 810–818
Papademetriou RC (1998) Experimental comparison of two information-theoretic spectral estimators. In: Signal Processing Proceedings, 1998. ICSP ’98. 1998 Fourth International Conference 141:141–144. doi:10.1109/ICOSP.1998.770170
Salas JD, Obeysekera JTB (1982) ARMA model identification of hydrologic time-series. Water Resour Res 18:1011–1021. doi:10.1029/Wr018i004p01011
Schroeder MR (1982) Linear prediction, extermal entropy and prior information in speech signal analysis and synthesis. Speech Comm 1:9–20. doi:10.1016/0167-6393(82)90004-8
Shore JE (1979) Minimum cross-entropy spectral analysis. Naval Research laboratory, Washington, DC
Shore JE (1981) Minimum cross-entropy spectral-analysis. IEEE Trans Acoust Speech 29:230–237
Singh VP (2013) Entropy spectral analyses. In: Entropy Theory and its Application in Environmental and Water Engineering. Wiley, pp 436–491. doi:10.1002/9781118428306.ch11
Tzannes NS, Avgeris TG (1981) A new approach to the estimation of continuous spectra. Kybernetes 10:123–133
Tzannes MA, Politis D, Tzannes NS (1985) A general method of minimum cross-entropy spectral estimation. IEEE Trans Acoust Speech 33:748–752
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Singh, V.P., Cui, H. Entropy Theory for Streamflow Forecasting. Environ. Process. 2, 449–460 (2015). https://doi.org/10.1007/s40710-015-0080-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s40710-015-0080-8