Stochastics and Statistics
Analysis of stochastic dual dynamic programming method

https://doi.org/10.1016/j.ejor.2010.08.007

Abstract

In this paper we discuss statistical properties and convergence of the Stochastic Dual Dynamic Programming (SDDP) method applied to multistage linear stochastic programming problems. We assume that the underlying data process is stagewise independent and consider the framework in which first a random sample from the original (true) distribution is generated and subsequently the SDDP algorithm is applied to the constructed Sample Average Approximation (SAA) problem. We then analyze the SDDP solutions of the SAA problem and their relation to solutions of the "true" problem. Finally, we discuss an extension of the SDDP method to a risk averse formulation of multistage stochastic programs. We argue that the computational complexity of the corresponding SDDP algorithm is almost the same as in the risk neutral case.

Introduction

The goal of this paper is to analyze convergence properties of the Stochastic Dual Dynamic Programming (SDDP) approach to solving linear multistage stochastic programming problems of the form

$$
\min_{\substack{A_1 x_1 = b_1 \\ x_1 \ge 0}} c_1^T x_1 + \mathbb{E}\left[\min_{\substack{B_2 x_1 + A_2 x_2 = b_2 \\ x_2 \ge 0}} c_2^T x_2 + \mathbb{E}\left[\cdots + \mathbb{E}\left[\min_{\substack{B_T x_{T-1} + A_T x_T = b_T \\ x_T \ge 0}} c_T^T x_T\right]\right]\right]. \tag{1.1}
$$

Components of the vectors $c_t$, $b_t$ and matrices $A_t$, $B_t$ are modeled as random variables forming the stochastic data process $\xi_t = (c_t, A_t, B_t, b_t)$, $t = 2, \dots, T$, with $\xi_1 = (c_1, A_1, b_1)$ being deterministic (not random). By $\xi_{[t]} = (\xi_1, \dots, \xi_t)$ we denote the history of the data process up to time $t$. The SDDP method originated in Pereira and Pinto [11], and was extended and analyzed in several publications (e.g., [2], [4], [7], [12]). It was assumed in those publications that the number of realizations (scenarios) of the data process is finite, and this assumption was essential in the implementations and analysis of SDDP type algorithms. In many applications, however, this assumption is quite unrealistic. In forecasting models (such as ARIMA) the errors are typically modeled as having continuous (say, normal or log-normal) distributions. One relevant question is therefore what meaning to attach to the introduced discretizations of the corresponding stochastic process. Related questions concern convergence properties and error analysis of the method.

We make the basic assumption that the random data process is stagewise independent, i.e., the random vector $\xi_{t+1}$ is independent of $\xi_{[t]} = (\xi_1, \dots, \xi_t)$ for $t = 1, \dots, T-1$. In some cases dependence across stages can be dealt with by adding state variables to the model. For example, suppose that the parameters of the data process $\xi_t$ other than $b_t$ are stagewise independent (in particular, are deterministic) and the random vectors $b_t$, $t = 2, \dots, T$, form a first order autoregressive process, i.e., $b_t = \Phi b_{t-1} + \varepsilon_t$, with an appropriate matrix $\Phi$ and error vectors $\varepsilon_2, \dots, \varepsilon_T$ independent of each other. Then the feasibility equations of problem (1.1) can be written as

$$
b_t - \Phi b_{t-1} = \varepsilon_t, \quad B_t x_{t-1} - \Phi b_{t-1} + A_t x_t = \varepsilon_t, \quad x_t \ge 0, \quad t = 2, \dots, T.
$$

Therefore, by replacing $x_t$ with $(x_t, b_t)$ and the data process with $(c_t, A_t, B_t, \varepsilon_t)$, $t = 2, \dots, T$, we transform the problem to the stagewise independent case. Of course, in this new formulation we do not need to enforce nonnegativity of the state variables $b_t$.
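The state augmentation described above can be sketched numerically. The dimensions, the matrix `Phi`, and the error distribution below are made-up illustrations, not taken from the paper; the point is that, after augmentation, the only random element at stage t is the stagewise independent error εt, while bt is carried as a state variable satisfying bt − Φbt−1 = εt by construction.

```python
import numpy as np

rng = np.random.default_rng(0)
d, T = 3, 5                      # dimension of b_t and number of stages (illustrative)
Phi = 0.5 * np.eye(d)            # assumed autoregression matrix (hypothetical)
b = [np.ones(d)]                 # b_1 is deterministic
eps = [rng.normal(size=d) for _ in range(T - 1)]   # independent error vectors

# Generate the first order autoregressive process b_t = Phi b_{t-1} + eps_t.
for t in range(1, T):
    b.append(Phi @ b[t - 1] + eps[t - 1])

# In the augmented formulation, the feasibility equation b_t - Phi b_{t-1} = eps_t
# holds at every stage, so the remaining randomness (eps_t) is stagewise independent.
for t in range(1, T):
    assert np.allclose(b[t] - Phi @ b[t - 1], eps[t - 1])
```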

We also assume that the implementation is performed in two steps. First, a (finite) scenario tree is generated by randomly sampling from the original distribution, and then the constructed problem is solved by the SDDP algorithm. A common opinion is that the approach of random generation of scenarios (the so-called Sample Average Approximation (SAA) method) is computationally intractable for solving multistage stochastic programs because of the exponential growth of the number of scenarios with the number of stages (cf., [18], [19]). An interesting property of the SDDP method is that the computational complexity of one run of the involved backward and forward step procedures is proportional to the sum of the sample sizes at each stage, and not to the total number of scenarios given by their product. This makes it computationally feasible to run several such backward and forward steps. Of course, this still does not constitute a proof of computational tractability of the true multistage problem. It should also be remembered that this nice property holds because of the stagewise independence assumption.
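The sum-versus-product distinction above can be made concrete with a back-of-the-envelope count. The per-stage sample sizes below are made up for illustration: one backward pass of SDDP solves on the order of 1 + N2 + ⋯ + NT linear programs, while the SAA scenario tree contains N2 ⋯ NT scenarios.

```python
from math import prod

# Hypothetical per-stage sample sizes N_2, ..., N_5 for a 5-stage problem.
sample_sizes = [100, 100, 100, 100]

# Cost of one backward pass: one first-stage problem plus one subproblem
# per sampled data point at each later stage.
lps_per_backward_pass = 1 + sum(sample_sizes)

# Size of the SAA scenario tree: the product of the per-stage sample sizes.
total_scenarios = prod(sample_sizes)

print(lps_per_backward_pass)   # 401
print(total_scenarios)         # 100000000
```

With only 100 realizations per stage, a 5-stage tree already has 10^8 scenarios, yet a single backward pass touches only 401 subproblems; this is the sense in which running several backward and forward steps remains computationally feasible.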

We also discuss an extension of the SDDP method to a risk averse formulation of multistage stochastic programs. We argue that the computational complexity of the corresponding SDDP algorithm is almost the same as in the risk neutral case.

In order to present some basic ideas we start our analysis in the next section with two-stage linear stochastic programming problems. For a discussion of basic theoretical properties of two-stage and multistage stochastic programs we refer to [21]. In Section 3 we describe the SDDP approach, based on approximation of the dynamic programming equations, applied to the SAA problem. A risk averse extension of this approach is discussed in Section 4. Finally, Section 5 is devoted to a somewhat informal discussion of this methodology.

We use the following notation and terminology throughout the paper. The notation "≔" means "equal by definition". For $a \in \mathbb{R}$, $[a]_+ := \max\{0, a\}$. By $|J|$ we denote the cardinality of a finite set $J$. By $A^T$ we denote the transpose of a matrix (vector) $A$. For a random variable $Z$, $\mathbb{E}[Z]$ and $\mathrm{Var}[Z]$ denote its expectation and variance, respectively. $\Pr(\cdot)$ denotes the probability of the corresponding event. Given a convex function $Q(x)$, we denote by $\partial Q(x)$ its subdifferential, i.e., the set of all its subgradients, at a point $x \in \mathbb{R}^n$. It is said that an affine function $\ell(x) = a + b^T x$ is a cutting plane of $Q(x)$ if $Q(x) \ge \ell(x)$ for all $x \in \mathbb{R}^n$. Note that a cutting plane $\ell(x)$ can be strictly smaller than $Q(x)$ for all $x \in \mathbb{R}^n$. If, moreover, $Q(\bar{x}) = \ell(\bar{x})$ for some $\bar{x} \in \mathbb{R}^n$, it is said that $\ell(x)$ is a supporting plane of $Q(x)$. This supporting plane is given by $\ell(x) = Q(\bar{x}) + g^T(x - \bar{x})$ for some subgradient $g \in \partial Q(\bar{x})$.
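The distinction between cutting and supporting planes can be illustrated with a toy convex function. The choice Q(x) = x², with subgradient g = 2x̄ at a point x̄, is an example of ours and not from the paper; it shows a plane that both lies below Q everywhere (cutting) and touches it at x̄ (supporting).

```python
def Q(x):
    # A toy convex function chosen for illustration.
    return x * x

def supporting_plane(x_bar):
    # Q is differentiable here, so the only subgradient at x_bar is g = 2 * x_bar.
    g = 2.0 * x_bar
    return lambda x: Q(x_bar) + g * (x - x_bar)

ell = supporting_plane(1.0)   # touches Q at x_bar = 1

# Cutting plane property: Q(x) >= ell(x) for all x (checked on a few points).
assert all(Q(x) >= ell(x) - 1e-12 for x in [-2.0, 0.0, 0.5, 1.0, 3.0])

# Supporting plane property: equality at x_bar.
assert abs(Q(1.0) - ell(1.0)) < 1e-12
```

In the SDDP method such planes, accumulated at trial points, form piecewise linear lower approximations of the cost-to-go functions.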

Section snippets

Two-stage programs

In this section we discuss a setting of the SDDP method applied to the following two-stage linear stochastic programming problem:

$$
\min_{x \in X} \; c^T x + Q(x), \tag{2.1}
$$

where $X := \{x \in \mathbb{R}^{n_1} : Ax = b,\ x \ge 0\}$, $Q(x) := \mathbb{E}[Q(x, \xi)]$, and $Q(x, \xi)$ is the optimal value of the second stage problem

$$
\min_{y \in \mathbb{R}^{n_2}} \; q^T y \quad \text{s.t.} \quad Tx + Wy = h, \; y \ge 0. \tag{2.2}
$$

It is assumed that some/all elements of the vectors $q$, $h$ and matrices $T$, $W$ are random. The data vector $\xi$ is formed from the elements of $q$, $h$, $T$, $W$, and the expectation in (2.1) is taken with respect to a (known) probability distribution

Multistage programs

Consider the linear multistage stochastic programming problem (1.1). Recall that we make the assumption that the data process $\xi_1, \dots, \xi_T$ is stagewise independent. Then the dynamic programming equations for problem (1.1) take the form

$$
Q_t(x_{t-1}, \xi_t) = \inf_{x_t \in \mathbb{R}^{n_t}} \left\{ c_t^T x_t + \mathcal{Q}_{t+1}(x_t) : B_t x_{t-1} + A_t x_t = b_t, \; x_t \ge 0 \right\},
$$

where

$$
\mathcal{Q}_{t+1}(x_t) := \mathbb{E}\left[ Q_{t+1}(x_t, \xi_{t+1}) \right],
$$

$t = T, \dots, 2$ (with $\mathcal{Q}_{T+1}(\cdot) \equiv 0$ by definition). At the first stage the problem

$$
\min_{x_1 \in \mathbb{R}^{n_1}} \; c_1^T x_1 + \mathcal{Q}_2(x_1) \quad \text{s.t.} \quad A_1 x_1 = b_1, \; x_1 \ge 0,
$$

should be solved. We assume that the cost-to-go functions $\mathcal{Q}_t(\cdot)$ are finite valued, in

Risk averse approach

Let us look again at the two-stage problem (2.1), (2.2). At the first stage the value $Q(x, \xi)$ is minimized on average. Of course, for a particular realization of the random vector $\xi$ the second stage cost $Q(x, \xi)$ can be much larger than its mean (expected value) $Q(x)$. Therefore, in order to control this cost one can add the constraint $Q(x, \xi) \le \eta$ for some chosen constant $\eta$ and try to enforce it for all realizations of the random vector $\xi$. However, enforcing such a constraint for all realizations of

Discussion

One run of the backward step procedure requires solving $1 + N_2 + \cdots + N_T$ linear programming problems. Each of these problems has a fixed number of decision variables and constraints, with additional variables and constraints corresponding to cutting planes of the approximate functions $\mathcal{Q}_t(\cdot)$. That is, the complexity of one run of the backward step procedure is more or less proportional to the sum of the sample sizes, while the total number of scenarios is given by the product of the sample sizes. Therefore

Acknowledgment

This research was partly supported by the NSF award DMS-0914785 and ONR award N000140811104.
