

01-06-2020 | Research Article

Regularized stochastic dual dynamic programming for convex nonlinear optimization problems

Authors: Vincent Guigues, Miguel A. Lejeune, Wajdi Tekaya

Published in: Optimization and Engineering | Issue 3/2020


Abstract

We define a regularized variant of the dual dynamic programming algorithm, called DDP-REG, to solve nonlinear dynamic programming equations. We then extend the algorithm to nonlinear stochastic dynamic programming equations. The resulting algorithm, SDDP-REG, extends a recently introduced regularization of the stochastic dual dynamic programming (SDDP) algorithm, which had been studied for linear problems only and with less general prox-centers. We show the convergence of DDP-REG and SDDP-REG and assess their performance on portfolio models with direct transaction costs and market impact costs. In particular, we propose a risk-neutral portfolio selection model that can be cast as a multistage stochastic second-order cone program. The formulation is motivated by the effect of market impact costs on large portfolio rebalancing operations. Numerical simulations show that DDP-REG is much quicker than DDP on all problem instances considered (up to 184 times quicker) and that SDDP-REG is quicker than SDDP on the tested instances of portfolio selection problems with market impact costs, and much faster (8.2 times) on the implemented instance of a risk-neutral multistage stochastic linear program.
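
To make the role of the regularization and of the prox-centers more concrete, the following is a generic sketch of a forward-pass subproblem with and without a proximal term. It is an illustration under stated assumptions, not the authors' exact formulation: the stage cost \(f_t\), the feasible set \(X_t\), the cutting-plane approximation \(\mathcal{Q}_{t+1}^{k-1}\) of the cost-to-go function, the penalty \(\lambda_{t,k} \ge 0\), and the prox-center \(x_t^{P,k}\) are all symbols introduced here for the sketch.

```latex
% Assumed notation (not taken from this page): f_t stage cost, X_t feasible set,
% \mathcal{Q}_{t+1}^{k-1} cutting-plane model of the cost-to-go after k-1 iterations.

% Unregularized forward step at stage t, iteration k:
x_t^k \in \operatorname*{arg\,min}_{x_t \in X_t(x_{t-1}^k)}
  \; f_t(x_t, x_{t-1}^k) + \mathcal{Q}_{t+1}^{k-1}(x_t)

% Regularized forward step: a quadratic penalty pulls the trial point
% toward the prox-center x_t^{P,k}; \lambda_{t,k} \ge 0 is the penalty weight.
x_t^k \in \operatorname*{arg\,min}_{x_t \in X_t(x_{t-1}^k)}
  \; f_t(x_t, x_{t-1}^k) + \mathcal{Q}_{t+1}^{k-1}(x_t)
  + \lambda_{t,k} \, \bigl\| x_t - x_t^{P,k} \bigr\|_2^2
```

In this sketch, setting \(\lambda_{t,k} = 0\) recovers the unregularized step, so the penalty only changes where trial points are placed, not the model that is being refined.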


Note that, to simplify notation, the same symbol \(\xi_{\mathtt{Index}}\) denotes both the realization of the process at node Index of the scenario tree and the value of the process \((\xi_t)\) for stage Index. The context makes clear which of the two is meant. In particular, the letters n and m are used only for nodes, while t is used for stages.

Note that the proposition can be applied because Assumption (H2) holds and thus the assumptions of the proposition are satisfied for value function \({\underline{{\mathfrak {Q}}}}_t^k( \cdot , \xi _m )\).

In Guigues (2016) a forward, instead of a forward-backward algorithm, is considered. In this setting, finiteness of coefficients \(\theta _t^k\) and \(\beta _t^k\) is not guaranteed for the first iterations (for instance \((\theta _t^1)_t\) are \(-\infty\)) but the proof is similar.

It is indeed immediately seen that (4.3) and (4.4) are of the form (2.2)–(2.4), after writing the maximization problems as minimization problems and introducing the extended state \(s_t=(x_t, y_t, z_t)\).

Although we fixed \(k \in {\mathcal {S}}_n\) when deriving these relations in (i), the inequalities we reuse for (ii) are valid for any \(k \ge 1\).

The existence of an accumulation point comes from the fact that the decisions belong almost surely to a compact set.

Almgren R (2003) Optimal execution with nonlinear impact functions and trading-enhanced risk. Appl Math Finance 10:1–18

Almgren R, Thum C, Li H (2005) Equity market impact. Risk 18:57–62

Andersen ED, Dahl J, Friberg HA (2009) Markowitz portfolio optimization using MOSEK. MOSEK Technical report: TR-2009-2. Revised on March 4th, 2012. Available at: https://docs.mosek.com/whitepapers/portfolio.pdf

Asamov T, Powell W (2015) Regularized decomposition of high-dimensional multistage stochastic programs with Markov uncertainty. SIAM J Optim 28:575–595

Bandarra M, Guigues V (2019) Single cut and multicut SDDP with cut selection for multistage stochastic linear programs: convergence proof and numerical experiments. arXiv:1902.06757

Bouchaud J, Gefen Y, Potters M, Wyart M (2004) Fluctuations and response in financial markets: the subtle nature of random price changes. Quant Finance 4:176–190

Cadenillas A (2000) Consumption-investment problems with transaction costs: survey and open problems. Math Methods Oper Res 51:43–68

de Matos V, Philpott A, Finardi E (2015) Improving the performance of stochastic dual dynamic programming. J Comput Appl Math 290:196–208

Filomena T, Lejeune M (2012) Stochastic portfolio optimization with proportional transaction costs: convex reformulations and computational experiments. Oper Res Lett 40:212–217

Frino A, Bjursell J, Wang G, Lepone A (2008) Large trades and intraday futures price behavior. J Fut Mark 28:1117–1181

Gabaix X, Gopikrishnan P, Plerou V, Stanley H (2003) A theory of power-law distributions in financial market fluctuations. Nature 423:267–270

Gatheral J (2010) No-dynamic-arbitrage and market impact. Quant Finance 10:749–759

Girardeau P, Leclere V, Philpott A (2015) On the convergence of decomposition methods for multistage stochastic convex programs. Math Oper Res 40:130–145

Grinold R (2006) A dynamic model of portfolio management. J Invest Manag 4:5–22

Grinold R, Kahn R (2000) Active Portfolio Management, 2nd edn. McGraw-Hill, New York

Guigues V (2014) SDDP for some interstage dependent risk-averse problems and application to hydro-thermal planning. Comput Optim Appl 57:167–203

Guigues V (2016) Convergence analysis of sampling-based decomposition methods for risk-averse multistage stochastic convex programs. SIAM J Optim 26:2468–2494

Guigues V (2017) Dual dynamic programing with cut selection: convergence proof and numerical experiments. Eur J Oper Res 258:47–57

Guigues V (2020) Inexact cuts in stochastic dual dynamic programming. SIAM J Optim 30:407–438

Guigues V, Römisch W (2012a) Sampling-based decomposition methods for multistage stochastic programs based on extended polyhedral risk measures. SIAM J Optim 22:286–312

Guigues V, Römisch W (2012b) SDDP for multistage stochastic linear programs based on spectral risk measures. Oper Res Lett 40:313–318

Infanger G, Morton D (1996) Cut sharing for multistage stochastic linear programs with interstage dependency. Math Program 75:241–256

Kozmik V, Morton D (2015) Evaluating policies in risk-averse multi-stage stochastic programming. Math Program 152:275–300

Lemarechal C (1974) An algorithm for minimizing convex functions. In: Proceedings of the IFIP’74, Stockholm

Lillo F, Farmer J, Mantegna R (2003) Econophysics: master curve for price-impact function. Nature 421:129–130

Loeb T (1983) Trading costs: the critical link between investment information and results. Financ Anal J 39:39–44

Mitchell J, Braun S (2013) Rebalancing an investment portfolio in the presence of convex transaction costs, including market impact costs. Optim Methods Softw 28:523–542

Mo B, Gjelsvik A, Grundt A (2001) Integrated risk management of hydro power scheduling and contract management. IEEE Trans Power Syst 16:216–221

Moazeni S, Coleman T, Li Y (2010) Optimal portfolio execution strategies and sensitivity to price impact parameters. SIAM J Optim 20:1620–1654

Moro E, Vicente J, Moyano L, Gerig A, Farmer J, Vaglica G, Lillo F, Mantegna R (2009) Market impact and trading profile of hidden orders in stock markets. Phys Rev E 80:1–8

MOSEK (2017) MOSEK optimization suite. Release 8.0.0.52

Pereira M, Pinto L (1991) Multi-stage stochastic optimization applied to energy planning. Math Program 52:359–375

Pfeiffer L, Apparigliato R, Auchapt S (2012) Two methods of pruning Benders’ cuts and their application to the management of a gas portfolio. Research report RR-8133, hal-00753578

Philpott AB, de Matos V (2012) Dynamic sampling algorithms for multi-stage stochastic programs with risk aversion. Eur J Oper Res 218:470–483

Philpott AB, Guan Z (2008) On the convergence of stochastic dual dynamic programming and related methods. Oper Res Lett 36:450–455

Powell W (2011) Approximate Dynamic Programming, 2nd edn. Wiley, London

Rockafellar R, Uryasev S (2002) Conditional value-at-risk for general loss distributions. J Bank Finance 26:1443–1471

Sen S, Zhou Z (2014) Multistage stochastic decomposition: a bridge between stochastic programming and approximate dynamic programming. SIAM J Optim 24:127–153

Service WDR (2016) WRDS. http://wrds-web.wharton.upenn.edu

Shapiro A (2011) Analysis of stochastic dual dynamic programming method. Eur J Oper Res 209:63–72

Shapiro A, Dentcheva D, Ruszczyński A (2009) Lectures on stochastic programming: modeling and theory. SIAM, Philadelphia

Shapiro A, Tekaya W, da Costa J, Soares M (2013) Risk neutral and risk averse stochastic dual dynamic programming method. Eur J Oper Res 224:375–391

Tikhonov A (1943) On the stability of inverse problems. Dokl Akad Nauk SSSR 39:195–198

Torre N (1997) Market impact model handbook. BARRA Inc., Berkeley

Zagst R, Kalin D (2007) Portfolio optimization under liquidity costs. Int J Pure Appl Math 39:217–233

Title: Regularized stochastic dual dynamic programming for convex nonlinear optimization problems
Authors: Vincent Guigues, Miguel A. Lejeune, Wajdi Tekaya
Publication date: 01-06-2020
Publisher: Springer US
Published in: Optimization and Engineering / Issue 3/2020
Print ISSN: 1389-4420
Electronic ISSN: 1573-2924
DOI: https://doi.org/10.1007/s11081-020-09511-0
