Top

Financial Markets and Portfolio Management

Published in:

Open Access 17-11-2023

Hedging goals

Authors: Thomas Krabichler, Marcus Wunsch

Published in: Financial Markets and Portfolio Management | Issue 1/2024

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Patentsearch

Off

Abstract

Goal-based investing is concerned with reaching a monetary investment goal by a given finite deadline, which differs from mean-variance optimization in modern portfolio theory. In this article, we expand the close connection between goal-based investing and option hedging that was originally discovered in Browne (Adv Appl Probab 31(2):551–577, 1999) by allowing for varying degrees of investor risk aversion using lower partial moments of different orders. Moreover, we show that maximizing the probability of reaching the goal (quantile hedging, cf. Föllmer and Leukert in Finance Stoch 3:251–273, 1999) and minimizing the expected shortfall (efficient hedging, cf. Föllmer and Leukert in Finance Stoch 4:117–146, 2000) yield, in fact, the same optimal investment policy. We furthermore present an innovative and model-free approach to goal-based investing using methods of reinforcement learning. To the best of our knowledge, we offer the first algorithmic approach to goal-based investing that can find optimal solutions in the presence of transaction costs.

Thomas Krabichler and Marcus Wunsch have contributed equally to this work.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

1 Introduction

While modern portfolio theory (Markowitz 1952) posits that investors are risk averse and thus should seek to maximize their portfolios’ risk-adjusted returns, in reality, investors often find themselves in need of capital to finance future investment goals: a car, an apartment or their children’s college education. The importance of investment goals on a societal level can be appreciated in view of the exacerbating retirement problem in many Western countries, cf. Giron et al. (2018).

Goal-based investment strategies are not primarily concerned with risk preferences relating to portfolio volatility; instead, they are subject to the risk of falling short of reaching a goal by its maturity. Even exceeding an investment goal is not necessarily desirable; in this case, a strategy with less volatility could have led to an outcome matching the investment goal.

There are at least two ways to translate this practical problem into a mathematical optimization problem. Either, one attempts to maximize the probability of reaching an investment goal by a given maturity, or one tries to minimize the expected shortfall (or a function thereof).

This first approach was investigated in a series of papers by Browne (cf. Browne (1999b) and the references therein), who found the explicit portfolio allocation formula for the probability-maximizing strategy in the context of complete markets. In his articles, Browne used techniques from stochastic control theory as well as from Partial Differential Equations (PDEs). While highly appealing theoretically, the probability-maximizing paradigm suffers from the binary nature of its optimum: a goal missed by a hair’s breadth is still a goal missed, and any such strategy will be discarded. Rather, more and more leverage will be applied to attain the goal—even as the maturity draws closer—resulting in either success or bankruptcy. This indifference for the size of the shortfall constitutes a major drawback of probability-maximizing strategies for practical purposes.

Leukert (1999), Föllmer and Leukert (1999, 2000), and Föllmer and Schied (2016) treated the closely related problem of maximizing the probability of hedging contingent claims successfully when replication is attempted with less than the required initial capital (corresponding to the discounted value under the equivalent martingale measure). Their solution is based on a static optimization problem of Neyman–Pearson type. Another approach can be found in Spivak and Cvitanić (1999).

In practice, measuring and minimizing downward risk is arguably more significant than maximizing the probability of attaining a goal (in analogy with the dichotomy of Expected Shortfall versus Value-at-Risk, cf. Leukert 1999; Föllmer and Schied 2016). Downward risk can be quantified by the shortfall, i.e., the positive part of the distance between the profit a strategy has earned at maturity and the goal. Several authors have addressed this problem in the context of replicating contingent claims, cf. Leukert (1999), Föllmer and Leukert (1999, 2000), Pham (2002), Föllmer and Schied (2016), including Cvitanić and Karatzas (1999). The latter authors employ tools from convex duality to show that a solution exists and state explicit solutions for several special cases with a single risky asset. It is also interesting to note that quantile hedging (cf. Föllmer and Leukert 1999), i.e., the probability-maximizing paradigm, can be interpreted as the most risk-seeking limit of efficient hedging, cf. Föllmer and Leukert (2000). Nakano (2004) studied a similar problem, considering coherent risk measures instead of lower partial moments.

An intriguing and novel approach via optimal transport has recently been used to target prescribed terminal wealth distributions in Guo et al. (2021).

Bühler et al. (2019) introduced a flexible framework for hedging contingent claims by applying deep learning methods. This approach transcends the classical Black–Scholes model’s restrictions, e.g., the absence of transaction costs. Related reinforcement learning approaches can be found in Halperin (2020) and Szehr (2021). Ruf and Wang (2020) provide a comprehensive literature review regarding the application of neural networks for pricing and hedging purposes.

2 Main contributions

In our opinion, the potential that goal-based investing has for retirement saving and individual asset-liability management cannot be overestimated.

The theoretical foundations for the goal-based investment problem have been laid out in the—superficially unrelated—field of replicating contingent claims. Therefore, we regard adapting these results and making them accessible and palatable to practitioners as one of the main contributions of this paper. In particular, we show how risk preferences can be integrated into the original goal-based investment problem (cf., e.g., Proposition 7.1), drawing on results on efficient hedging derived by Föllmer and Leukert (2000).

Another important contribution is the adaptation of deep hedging techniques (cf. Bühler et al. 2019) to incorporate transaction costs into the optimization problems arising in goal-based investing.

3 Outline of this paper

The remainder of this article is organized as follows.

After introducing the basic model in Sect. 4, we state the optimal policy for risk-neutral and risk-taking goal-based investors in Sect. 5. The optimal policy for risk-averse goal-based investors, whose utility is determined by a lower partial moment of the shortfall relative to the goal, can be found in Sect. 7. We discuss the shortcomings of the probability-maximizing paradigm in Sect. 6, where we also provide an illustrative example. To mitigate the risk inherent in the quantile and efficient hedging approaches, we propose a policy allowing for downward protection in Sect. 8.

Finally, in Sect. 9, we show that an artificial neural network can be trained to minimize the expected shortfall as well as lower partial moments, thereby approximating the optimal policies from Sects. 5 and 7.

The proofs of this paper can be found in Section A of the Appendix.

Remark 3.1

The explicit analytical results in Sects. 5–8 build upon the work in Browne (1999b) and Föllmer and Leukert (1999, 2000). In particular, the validity of our analytical results is restricted to complete markets. Föllmer and Leukert (1999, 2000) also elaborate on the incomplete case using duality results. In Sect. 9, deep hedging, as adapted from Bühler et al. (2019), provides an appealing and highly flexible approach, as it is model free and allows for the inclusion of transaction costs.

4 Preliminaries

4.1 The model

We consider a complete market with $n \in \mathbb {N}$ correlated risky assets generated by n independent Brownian motions (cf. Browne 1999b), i.e.,

$$\begin{aligned} \,\textrm{d}X_t^{(i)}&= X_t^{(i)} \left[ \mu ^{(i)}\,\textrm{d}t+\sum _{j=1}^n \sigma ^{(i,j)} \,\textrm{d}W_t^{(j)} \right] , \qquad i=1, \dots , n, \end{aligned}$$

(1)

where the drift $\pmb \mu=\left( \mu ^{(i)}\right) _{i=1}^n$ and the full rank volatility matrix $\pmb \sigma=\left( \sigma ^{(i,j)}\right) _{i, j=1}^n$ are constant.

$$\begin{aligned} \pmb {W}_t:= \left( W_t^{(1)},\dots , W_t^{(n)}\right) ^\top \end{aligned}$$

shall denote a standard n-dimensional Brownian motion defined on the complete probability space $(\Omega , \mathcal {F}, \mathbb {P})$ satisfying the usual conditions (cf. Protter 2004).

We assume that there is, in addition, a risk-less bank account compounding at the risk-free rate $r>0$, i.e.,

$$\begin{aligned} \,\textrm{d}B_t=r\, B_t \,\textrm{d}t,\qquad B_0=1. \end{aligned}$$

(2)

The value of a zero-coupon bond at time t that pays 1 monetary unit at maturity $T > t \ge 0$ will be denoted as

$$\begin{aligned} R_{t, T}&:= e^{-r (T-t)}. \end{aligned}$$

We will only consider bonds without default risk. Monetary goals will be denoted by $H \in \mathbb {R}_+$ throughout. To ease notation, we shall write $H_{t, T}:= R_{t, T} H$ for any $t \in [0, T].$

We will make use of the diffusion matrix $\pmb \Sigma := \pmb \sigma \, {\pmb {\sigma }}^\top ;$ the market price of risk will be denoted by the vector $\pmb \vartheta $ defined as

$$\begin{aligned} \pmb \vartheta := {\pmb \sigma }^{-1} (\pmb \mu - r\pmb 1). \end{aligned}$$

(3)

We assume that all entries of $\pmb \vartheta $ are strictly positive. According to Girsanov’s theorem, the vector process defined via the market price of risk as

$$\begin{aligned} \pmb W^*_t&:= \pmb W_t+\pmb \vartheta \ t \end{aligned}$$

is an n-dimensional Brownian motion under the probability measure $\mathbb {P}^*$ given by its Radon–Nikodym derivative

$$\begin{aligned} \rho _* := \frac{\,\textrm{d}\mathbb {P}^*}{\,\textrm{d}\mathbb {P}}&= \exp \left\{ - {\pmb \vartheta }^\top \pmb W_T -\frac{1}{2} {\pmb \vartheta }^\top \pmb \vartheta \ T \right\}=\exp \left\{ - {\pmb \vartheta }^\top \pmb W^*_T +\frac{1}{2} {\pmb \vartheta }^\top \pmb \vartheta \ T \right\} , \end{aligned}$$

(4)

where $\mathbb {P}$ denotes the objective probability measure. The expectation under the risk-neutral measure $\mathbb {P}^*$ will be denoted as $\mathbb {E}^*$.

The optimal growth portfolio (Platen 2005) maximizes the growth rate of wealth (Browne 1999b, Sect. 4.2). Its weights $\pmb {\pi }_*$ and its volatility $\sigma _*$ are determined via

$$\begin{aligned} \pmb {\pi }_*:= \left( \pmb {\sigma }^{-1}\right) ^\top \, \pmb {\vartheta }_t,\qquad {\sigma _*}^2:= {\pmb {\pi }_*}^\top \, \pmb {\Sigma }\, \pmb {\pi }_*=\pmb {\vartheta }^\top \pmb {\vartheta }=\sum _{i=1}^n \left( \frac{\mu ^{(i)}-r}{\sigma ^{(i,i)}}\right) ^2. \end{aligned}$$

The optimal growth portfolio evolves as (Browne 1999b, Sect. 4.2)

$$\begin{aligned} \Pi _t&= \Pi _0 \exp \left\{ \left( r - \frac{1}{2} {\sigma _*}^2 \right) t+{\pmb \vartheta }^\top \ \pmb {W}_t^* \right\} . \end{aligned}$$

Remark 4.1

For ease of notation, we use constant coefficients throughout this article. It is, however, straightforward to generalize all our results to deterministic time-dependent coefficients.

4.2 Goal-based investing and hedging

The goal-based investment problem can be expressed in terms of replicating a contingent claim with a constant payoff at maturity $T>0$ given by $H>0$, starting from a prespecified initial endowment $V_0$, cf. Browne (1999b). It is thus equivalent to finding an admissible¹ strategy $(V_0, \,\pmb \xi )$, evolving for $t\in [0,T]$ according to

$$\begin{aligned} V_t = V_0 + \int _0^t {\pmb \xi _s}^\top \,\textrm{d}\pmb W_s, \end{aligned}$$

(5)

where $\pmb \xi $ is a predictable process with respect to the Brownian motion $\pmb W$ such that

$$\begin{aligned} \mathbb {E}[ \ell ( (H - V_T)_+ ) ] \end{aligned}$$

(6)

becomes minimal. Here, the expectation $\mathbb {E}$ is taken under the objective probability measure $\mathbb {P}$, and $\ell $ denotes a loss function that expresses the risk appetite of the investor. We will consider loss functions of the type

$$\begin{aligned} \ell _p(x) = x^p, \quad p \in \mathbb {R}_{\ge 0}. \end{aligned}$$

(7)

For these loss functions, the expression (6) is referred to as the lower partial moment of order p. Note that, as $p\rightarrow 0+$, the integrand in (6) tends to the indicator function $\mathbbm {1}_{(0, H)}(V_T)$. This situation is tantamount to quantile hedging as discussed in Föllmer and Leukert (1999). Conversely, risk aversion increases as $p\rightarrow \infty $.

Let us assume that the investor initially posts the amount $V_0 = z > 0$. If z is such that $z \ge H_{0, T}$, then the zero-coupon bond can be perfectly replicated at no risk, and the expected loss (6) vanishes. On the other hand, if $z < H_{0, T}$, then the investor faces the risk of falling short of her desired goal.

5 Risk neutrality and risk taking

The policy minimizing the expected shortfall for hedging a zero-coupon bond paying out $H\equiv 1$ at maturity was derived in Xu (2004). In what follows, we extend her approach to incorporate a constant risk-free rate $r>0$ and an arbitrary constant payoff $H\in \mathbb {R}_+$ subject to $z < H_{0, T}$. Moreover, we show that the result of (Xu 2004, Sect. 2.2.1) is, in fact, equivalent to the one of Browne (1999b) for $H \equiv 1$. In particular, the hedging strategy in the case of a single risky asset is indeed independent of its drift, which is not immediately obvious from the formulae stated in Xu (2004).

Remark 5.1

In the following discussion, we treat the entire spectrum of risk appetites ranging from risk neutrality ($p=1$, also referred to as efficient hedging) to extreme risk taking ($p=0$, also referred to as quantile hedging). The theoretical foundations can be found in Sect. 5.4 of Föllmer and Leukert (2000). The discussion in Sect. 7 will address higher degrees of risk aversion by considering lower partial moments of order $p > 1$.

Proposition 5.2

(Efficient hedging using several risky assets) Consider an investment with an initial capital endowment of z monetary units, whose objective is to minimize

$$\begin{aligned} \mathbb {E}\left[ {(H-V_T)_+}^p\right] , \qquad H \in \mathbb {R}_+, \qquad p \in [0, 1]. \end{aligned}$$

Then the optimal policy for this objective is equivalent to the replication of a European digital call option on the optimal growth portfolio $\Pi _t$ with payoff H and strike price $K^*$, where

$$\begin{aligned} K^*&= \Pi _0\; \exp \left\{ \left( r-\frac{1}{2}{\sigma _*}^2\right) \,T - \sigma _* \ \sqrt{T}\ \Phi ^{-1}\left( \frac{z}{H_{0, T}}\right) \right\} , \end{aligned}$$

(8)

$\Phi $ denotes the cumulative distribution function of the standard normal distribution, and $\Phi ^{-1}$ the corresponding quantile function.

In particular, the investor’s wealth process can be expressed by means of

$$\begin{aligned} V_t&= H_{t, T}\, \Phi \left( \frac{\log \frac{\Pi _t}{K^*} + \left( r-\frac{1}{2}{\sigma _*}^2\right) \,(T-t)}{\sigma _* \sqrt{T-t}}\right) . \end{aligned}$$

(9)

Remark 5.3

Note that, if $z = H_{0, T}$, then the strike $K^*$ given in (8) will vanish. As a consequence, the value of the standard normal distribution function $\Phi $ in (9) will be 1, so that the claim reduces to a risk-less bond, $V_t = H_{t, T}$. If z is even larger than the discounted goal, compounding will result in super-replication.

Corollary 5.4

(Efficient hedging using a single risky asset) In the case of a single risky asset, the contingent claim (9) can be simplified to

$$\begin{aligned} V_t&= H_{t, T}\, \Phi \left( \frac{\log \frac{X_t}{K^*} + \left( r-\frac{1}{2}{\sigma }^2\right) \,(T-t)}{\sigma \sqrt{T-t}}\right) , \end{aligned}$$

where

$$\begin{aligned} K^*&= x_0\, \exp \left\{ \left( r-\frac{1}{2}\sigma ^2\right) T - \Phi ^{-1}\left( \frac{z}{H_{0, T}} \right) \right\} . \end{aligned}$$

The corresponding delta-hedging strategy is obtained by differentiation:

$$\begin{aligned} \xi _1(t, X_t) = \frac{\partial }{\partial x}\bigg |_{x = X_t}V_t = \frac{H_{t, T}}{X_t\ \sigma \sqrt{T-t}}\ \phi \left( \frac{\log \frac{X_t}{K^*} + \left( r-\frac{\sigma ^2}{2}\right) (T-t)}{\sigma \sqrt{T-t}} \right) , \end{aligned}$$

where $\phi $ denotes the probability density function of the standard normal distribution.

Corollary 5.5

In the case of a constant claim $H \in \mathbb {R}_+$, the optimal policies for quantile hedging Föllmer and Leukert (1999) and efficient hedging Föllmer and Leukert (2000) coincide.

In particular, (Xu 2004, Corollary 2.8) concerning the efficient hedging of a bond with payoff $H\equiv 1$ yields the same optimal policy as (Browne 1999b, Proposition 4.1) with goal $b\equiv 1$ and vanishing risk-free rate.

6 Practical considerations when maximizing probabilities

Let us assume that the investment universe consists of a single risky company share $X=(X_t)_{t\in [0, T]}$ and a risk-less bank account $B=(B_t)_{t\in [0, T]}$, cf. (1), (2). A digital (or binary) European call option on the underlying X with strike $K>0$ is a financial derivative with payoff $\mathbbm {1}_{\{X_T\ge K\}}$ at maturity T. Its Black–Scholes price is given by

$$\begin{aligned} C(t;\,X_t,\,K)&=R_{t,T}\ \Phi (d_-(t;X_t,K)) \\ d_-(t;\,x,K)&:=\frac{\log {\frac{x}{K}}+\left( r-\frac{\sigma ^2}{2}\right) (T-t)}{\sigma \sqrt{T-t}}. \end{aligned}$$

(10)

According to Corollary 5.4 (cf. Browne 1999b, Sect. 4), continuous rebalancing with

$$\begin{aligned} \xi (t;\, X_t,\, K)=\frac{R_{t, T}}{X_t\, \sigma \, \sqrt{T-t}}\ \phi \left( d_-(t;\, X_t,K)\right) \end{aligned}$$

replicates this digital payoff starting from $V_0=C(0;\, X_0,K)$ monetary units. By inspection, the initial price $V_0=V_0(K)$ is monotonously decreasing with

$$\begin{aligned}\lim _{K\rightarrow 0+}V_0(K)=R_{0, T},\qquad \lim _{K\rightarrow \infty }V_0(K)=0. \end{aligned}$$

Let us assume that a financial investor owns $c_0>0$ monetary units at time $t=0$ and, by means of an admissible strategy in the investment universe, aims at owning $c_T > c_0$ monetary units at time T. For simplicity, let us exclude intermediate income and consumption. To ensure that the mathematical problem is well posed, one needs to establish in what sense a certain strategy becomes optimal. In Browne (1999b, Theorem 3.1), the author proved the intriguing fact that replicating $c_T$ digital call options with strike

$$\begin{aligned} K^*=X_0 \exp \left\{ \left( r-\frac{1}{2}\sigma ^2\right) T-\sigma \,\sqrt{T}\,\Phi ^{-1}\left( \frac{c_0}{R_{0, T}\,c_T}\right) \right\} \end{aligned}$$

maximizes the objective probability of reaching the goal. This result has an insightful economic interpretation; $K^*$ coincides with the break-even point with respect to the strike where a single digital call option costs $\frac{c_0}{c_T}$ at time 0. Notably, but also well known, the magnitude of the hardly ascertainable drift $\mu $ does not affect $K^*$. In fact, the above expression of $K^*$ is only well-defined provided that the argument of $\Phi ^{-1}$ is within (0, 1). In our setting, this prerequisite is only violated in the degenerate case $c_0 \ge R_{0, T}\,c_T$, i.e., the goal can be super-replicated in terms of the bank account at no risk anyway. The maximized real-world probability of reaching the goal is

$$\begin{aligned} \mathbb {P}\left[ X_T\ge K^*\right] =\Phi \left( \vartheta \sqrt{T}+\Phi ^{-1}\left( \frac{c_0}{R_{0, T}\,c_T}\right) \right) . \end{aligned}$$

For real-world applications, the financial investor has two alternatives; either she buys it over-the-counter or she replicates the digital payoff herself. In the former case, she runs the risk of not getting the promised payoff due to the bankruptcy of the issuer. In the latter case, without further stop-loss measures in place, discrete rebalancing schedules imply the risk of arbitrarily large losses way beyond $c_0$ due to the discontinuity of the payoff and, hence, the unbounded delta of the digital option. Notably, the strategy also requires an unlimited credit line at the bank which is collateralized only to an insufficient extent by the company share. Transaction costs exacerbate the situation. By approximating the digital payoff by a classical bull call spread and by diversifying the involved derivatives across several bona fide counterparties, the financial investor manages to deal with the mentioned impediments all the same. From a computational perspective, we lose analytical tractability with increasing degrees of complexity, e.g., additional constraints, more realistic price dynamics, transaction costs, etc. Despite all, and much more crucially, the all-or-nothing feature of the proposed optimal strategy is not feasible in many real-world applications such as traditional pension funds. For obvious reasons, retirement savings are not supposed to be a Bernoulli experiment. Therefore, we will consider further ways to control downward risk in Sect. 8.

Example 6.1

Let us consider a simple one-step financial market that hosts two financial assets over the time horizon $t\in \{0,1\}$. For some $0<\varepsilon \ll 1$, a risk-less bank account carries a deterministic log-return of $r-\varepsilon $ for some $r\in \mathbb {R}$. The other investment alternative is a start-up company whose success is dichotomous; the log-return $\widetilde{r}$ of the company share satisfies $\mathbb {P}[\widetilde{r}=r-1]=p$ and $\mathbb {P}[\widetilde{r}=r+1]=1-p$ for some $p\in (0,1)$. Let $\xi \in [0,1]$ denote the portion of the initial wealth that is kept in the risky asset. The log-return of any strategy $\xi $ is then given by $R(\xi )=\log \big (\xi e^{\widetilde{r}}+(1-\xi )e^{r-\varepsilon }\big )$. From a practitioner’s perspective, if the investor’s ultimate goal was to reach a continuously compounded yield of r, then it would not be advisable to invest in the risky asset at all. However, a strict application of maximizing the probability of reaching the goal would involve shortfall risk. Indeed, it holds $\mathbb {P}\left[ R(0)\ge r\right] =0$, whereas $\mathbb {P}\left[ R(\xi )\ge r\right] $ is maximal for any

$$\begin{aligned} \xi \ge \frac{e^\varepsilon -1}{e^{1+\varepsilon }-1}. \end{aligned}$$

This example shows that the probability-maximizing paradigm might be too rigid in the context of goal-based investing as it does not take into consideration the investor’s risk appetite. In the next section, we will discuss optimal policies for risk-averse investors.

Remark 6.2

The quantile hedging approach toward goal-based investing is a dynamic portfolio allocation strategy that shifts wealth between the optimal growth portfolio and the risk-free asset (Browne 1999b, Theorem 3.1). We analyze the goal-based investor’s wealth process using historical S&P 500 Index returns and compare it with the optimal growth portfolio process in Fig. 1.² Notice that in the top plot, the goal-based investor keeps all her funds in the optimal growth portfolio and misses the goal, while in the middle one, she narrowly reaches the goal by shifting wealth into the risk-free asset very late. Finally, in the bottom plot, the goal-based investor exits the optimal growth portfolio as soon as a wealth level is reached which equals the present value of the financial goal discounted at the risk-free rate. In this situation, she takes just enough risk to achieve her goal, while the optimal growth portfolio investor remains fully risk-on to maximize long-term growth, yet suffers from the drawdown of US Large Caps starting in 2022. A mean-variance optimal portfolio, on the other hand, reflects an investor’s risk preferences and thus usually bears less risk than the optimal growth portfolio; however, as the latter, the mean-variance optimal portfolio does not take into account any financial goals by its very design.

7 Risk aversion

We consider the case of $p>1$, so that $(\ell _p)_{p>1}$ (cf. (7)) denotes a series of convex loss functions corresponding to increasing levels of risk aversion as p grows. According to (Leukert 1999, Lemma 11) the optimal strategy to minimize (6) consists in hedging the modified claim

$$\begin{aligned} \varphi _p H&= H - \min \left( a_p\, \rho _*^{\frac{1}{p-1}}, H\right) , \end{aligned}$$

(11)

where the constant $a_p$ is implicitly determined by the capital requirement $\mathbb {E}^*[\varphi _p\, H] = z$.

Proposition 7.1

(Risk aversion with several risky assets) Consider an investor endowed with z monetary units at time $t = 0$. We assume that her objective is to minimize the lower partial moment

$$\begin{aligned} \mathbb {E}\left[ {(H-V_T)_+}^p\right] , \end{aligned}$$

for $p>1$, cf. (6). Then, the optimal strategy is equivalent to replicating the contingent claim on the optimal growth portfolio $\Pi _t$ with value process

$$\begin{aligned} \begin{aligned} V_t&= V(t, \Pi _t) \\ &= H_{t, T}\, \bigg \{\Phi (d_-(t; \Pi _t, L)) - \bigg (\frac{L}{\Pi _t}\bigg )^{p'} \exp \bigg \{ p' (p'+1) \bigg (\frac{1}{2}{\sigma _*}^2-r\bigg ) (T-t) \bigg \} \\&\quad \times \Phi \bigg ( d_-(t; \Pi _t, L)-p' \,\sigma _*\, \sqrt{T-t} \bigg )\bigg \}. \end{aligned} \end{aligned}$$

Here, $p'=1/(p-1)$, and the threshold L is implicitly determined by the capital requirement $V(0, \Pi _0) = V_0 = z$.

Corollary 7.2

(Risk aversion with a single risky asset) If there is only one risky asset $X=(X_t)_{t\in [0, T]}$ available to the investor, then the optimal strategy to minimize the lower partial moment (6) with exponent $p>1$ will be equivalent to replicating the value process $V_t=V(t,X_t)$ equal to

$$\begin{aligned} H_{t, T}\,\bigg \{&\Phi (d_-(t; X_t, L)) - \frac{L^{\alpha _p}}{X_t^{\alpha _p}} \exp \bigg \{ \alpha _p (\alpha _p+1) \bigg (\frac{1}{2}\sigma ^2-r\bigg ) (T-t) \bigg \}\, \\&\Phi \bigg ( d_-(t; x, L)-\alpha _p\, \sigma \, \sqrt{T-t} \bigg )\bigg \}, \end{aligned}$$

(12)

where $\alpha _p:= \alpha /(p-1)$ and $\alpha := (\mu -r)/\sigma ^2$. The hedging strategy $\xi _p$ is given by

$$\begin{aligned}&\xi _p(t, X_t) = H_{t, T} \Bigg (\frac{\phi (d_-(t;X_t,L))}{X_t \sigma \sqrt{T-t}}\\&\quad - \frac{L^{\alpha _p}}{X_t^{\alpha _p}}\exp \left\{ \alpha _p \left( \alpha _p+1\right) \bigg (\frac{1}{2}\sigma ^2-r\bigg ) (T-t) \right\} \frac{\phi (d_-(t;X_t,L)-\alpha _p\,\sigma \,\sqrt{T-t})}{X_t\, \sigma \, \sqrt{T-t}}\\&\quad + \frac{\alpha _p L^{\alpha _p}}{X_t^{\alpha _p+1}}\exp \left\{ \alpha _p \left( \alpha _p+1\right) \left( \frac{1}{2}\sigma ^2-r\right) (T-t) \right\} \Phi (d_-(t; X_t, L) - \alpha _p\,\sigma \,\sqrt{T-t}) \Bigg ). \end{aligned}$$

Remark 7.3

The first term in the expression for the modified claim $\varphi _p H$ in (12) constitutes a digital European call option with strike L and terminal payoff $H\mathbbm {1}_{\{ X_T \ge L \}}$.

Remark 7.4

From a practical viewpoint, plausible values for $\alpha $ would be around 1, assuming $\mu = 5\%$, $r=1\%$, and $\sigma = 20\%$. The exponent $\alpha _p$ would then be positive and decrease from 1 to 0 as $p \rightarrow \infty $ $(p>1)$.

Remark 7.5

If the term corresponding to a digital European call option in Eq. (12) matures in-the-money (i.e., $X_T > L)$, then the second term in this equation equals $(L/X_T)^{\alpha _p}$, which is less than 1 and decreasing in $X_T$ if $\alpha _p > 0$. Conversely, if the digital call expires at-the-money, the second term in Eq. (12) will be 1, so that the entire claim matures worthless. The same holds true if the digital call expires out-of-the-money.

Remark 7.6

What happens in the case of extreme risk aversion, i.e., as $p\rightarrow \infty $? By Eqs. (11) and (17),

$$\begin{aligned} \lim _{p\rightarrow \infty }a_p=H-R_{T,0}z,\qquad \left( \frac{L}{X_T}\right) ^{\alpha _p} =a_p\frac{k^{\frac{1}{p-1}}}{{X_T}^{\frac{\alpha }{p-1}}}. \end{aligned}$$

Hence,

$$\begin{aligned} \lim _{p\rightarrow \infty }\varphi _p\,H=\lim _{p\rightarrow \infty }(1-a_p)\mathbbm {1}_{\{X_T\ge 0\}}=R_{T,0}z\quad \Rightarrow \quad z=R_{0, T}\cdot \varphi _\infty H, \end{aligned}$$

i.e., the entire endowment is kept in the bank account. This observation is consistent with the concept of total risk aversion, and it is in line with (Leukert 1999, Lemma 14). There, it is demonstrated that $\varphi _p H \rightarrow (H-a_\infty )_+$ for $p\rightarrow \infty $ almost surely and in $L^1(\mathbb {P}^*)$, for general (not necessarily constant) payoff functions $H = H(X_T)$.

Remark 7.7

The knock-out feature of the digital European call that is present for risk-neutral/risk-taking investors ($p\in [0,1]$) makes hedging increasingly difficult if the underlying is close to the strike as maturity approaches, because the digital call’s delta becomes unbounded. Appealingly, however, this knockout feature disappears for risk-averse investors ($p>1$), as one can see in Fig. 2, and the delta of these modified claims becomes more and more well behaved as risk aversion increases ($p\rightarrow \infty $).

8 Downward protection

The probability of reaching the target for the probability-maximizing policy, given by (Browne 1999b, Theorem 3.1)

$$\begin{aligned} \sup _{f} \mathbb {P}_{(t, x)} [{X_T}^{(f)} \ge H ] = \Phi \left( \Phi ^{-1}\left( \frac{x}{H_{t, T}}\right) +\sqrt{\pmb {\vartheta }^\top \pmb \vartheta \, (T-t)}\right) , \end{aligned}$$

(13)

is the counter-probability of going bankrupt, which can be prohibitively high for practical purposes.

Remark 8.1

If we assume an initial investment of two-thirds of the desired goal and a single risky asset with a drift of $6\%$, a volatility of $20\%$, and a zero risk-free rate, then the “optimal” strategy entails a probability of losing everything of approximately $25\%$.

Clearly, this all-or-nothing strategy is too risky for most practical applications. Browne (Browne 1999b, Sect. 8.2) therefore proposed to control downside risk in the context of active portfolio management (cf. also Browne 1999a). We adapt his approach to goal-based investing as follows.

Proposition 8.2

Consider an investor whose objective is to minimize the expected shortfall of her terminal wealth $V_T$ versus the goal $H \in \mathbb {R}_+$, with the additional requirement that the expected shortfall versus the discounted goal $H_{0, T}$ never exceed a predefined percentage $\delta \in [0, 1]$ of the latter. Then

$$\begin{aligned}&\sup _f \mathbb {P}\left[ X_T^{(f)} \ge H,\ \inf _{0\le s \le T} X_s^{(f)} \ge (1-\delta ) H_{0,T} \,\bigg |\, X_t = x \right] \\ &\quad = \Phi \left( \Phi ^{-1}\left( \frac{x-(1-\delta ) H_{t, T}}{\delta \, H_{t, T}}\right) + \sqrt{\pmb {\vartheta }^\top \pmb {\vartheta }\,(T-t)}\right) . \end{aligned}$$

Corollary 8.3

(cf. Cvitanić and Karatzas (1999), Example 4.1) Let $\varepsilon $ be a given positive real number. It follows from Proposition 8.2 that the smallest initial endowment $x_\varepsilon > 0$ required so that the probability of violating the shortfall constraint is bounded from above by $\varepsilon $ is given by

$$\begin{aligned} x_\varepsilon = \left[ \Phi \left( \Phi ^{(-1)}(1-\varepsilon )-\sqrt{\pmb {\vartheta }^\top \pmb {\vartheta } \,T} \right) + 1-\delta \right] H_{0, T}. \end{aligned}$$

Note that, as $\varepsilon \rightarrow 1$, the initial endowment $x_\varepsilon $ tends to the discounted goal $H_{0, T}$ minus the shortfall allowance $\delta \,H_{0, T}$.

8.1 The nature of the claim with downward protection

If maximizing the probability of reaching an investment goal is equivalent to replicating a digital European call option (cf. Browne 1999b, Proposition 4.1), what interpretation can be given to the situation in this section?

First, let us rephrase the optimal policy, given for the general case in (Browne 1999b, Theorem 3.1) for constant coefficients and in the presence of a downward risk limit:

$$\begin{aligned} f^*_t(x-(1-\delta )\,H_{t, T}; \delta \,H) = \frac{\delta \,H_{t, T}}{\sigma \sqrt{T-t}}\phi \left( \Phi ^{-1}\left( \frac{x-(1-\delta ) \,H_{t, T}}{\delta \,H_{t, T}}\right) \right) . \end{aligned}$$

Now, if we evaluate $f^*_t$ at $x = C(t, X_t;\delta )$, where

$$\begin{aligned} C(t, X_t;\delta ) = \delta \,H_{t, T}\,\Phi \left( \frac{\log \frac{X_t}{K^*} + (r-\frac{\sigma ^2}{2})(T-t)}{\sigma \,\sqrt{T-t}} \right) + (1-\delta ) \,H_{t, T} \end{aligned}$$

then

$$\begin{aligned}&f_t^*(C(t, X_t;\delta )-(1-\delta ) \,H_{t, T}; \delta \,H)\\ &\quad = \frac{\delta \,H_{t, T}}{\sigma \,\sqrt{T-t}}\, \phi \left( \Phi ^{-1}\left( \frac{C(t, X_t;\delta )-(1-\delta ) \,H_{r, T}}{\delta \,H_{t, T}}\right) \right) \\&\quad = \frac{\delta \,H_{t, T}}{\sigma \,\sqrt{T-t}}\,\phi \left( \frac{\log \frac{X_t}{K^*}+\left( r-\frac{\sigma ^2}{2}\right) (T-t))}{\sigma \,\sqrt{T-t}} \right) \\&\quad = \Delta _t \cdot X_t \end{aligned}$$

where $\Delta _t$ is the delta of the digital European call option paying $\delta \,H$ at maturity if $X_T \ge K^*$, and nothing otherwise. The optimal policy thus consists of initially investing $(1-\delta ) \,H_{0, T}$ into a bond, and the remainder into a digital European call option with said characteristics. As before, the strike $K^*$ of this contingent claim depends implicitly on the initial endowment z.

9 Deep hedging

The investment strategies derived in the previous sections cannot be transferred to more realistic settings without further ado. The optimality fundamentally relies on the completeness of the financial market model as well as the simplistic distributional assumption on the price dynamics. More sophisticated price dynamics, for instance involving rough volatility, inevitably lead to incomplete market models. Furthermore, minimizing lower partial moments in such intricate environments may hardly be analytically tractable. It remains unclear whether the duality principle between the optimization problem and the hedging of a qualitatively similar payoff prevails. In contrast, simply applying the proposed delta hedging strategies for different price dynamics can be arbitrarily bad. Another impediment for applications in the real world are discrete hedging schedules and transaction cost. Therefore, we investigate whether we manage to circumvent these delicate issues by applying the striking approach of deep hedging as proposed in Bühler et al. (2019). Subsequently, we present our findings for the one-dimensional case.

For any $t\in \{0,1,2,\ldots ,N\}$ in some discrete time grid with horizon $N\in \mathbb {N}$, we consider a feedforward neural network

$$\begin{aligned} F_t=\left( \phi \circ A_t^{(2)}\right) \circ \left( \phi \circ A_t^{(1)}\right) \circ \left( \phi \circ A_t^{(0)}\right) \end{aligned}$$

with some affine functions

$$\begin{aligned} A_t^{(0)}: \mathbb {R}^2\longrightarrow \mathbb {R}^{10}, \ A_t^{(1)}: \mathbb {R}^{10} \longrightarrow \mathbb {R}^{10},\ A_t^{(2)}: \mathbb {R}^{10} \longrightarrow \mathbb {R}\end{aligned}$$

and the sigmoid activation function $\phi (x)=(1+e^{-x})^{-1}$. The input layer consists of the current holding $\xi _{t\text {--}}$ before rehedging and the moneyness $X_t/X_0$, where $X_t$ is the marginal distribution of a geometric Brownian motion as considered above. The output layer reveals the outcome $\xi _t$ of the rehedging at the time instance t, i.e.,

$$\begin{aligned} \xi _t&=F_t\big (\xi _{t\text {--}},X_t/X_0\big ). \end{aligned}$$

Similarly as above, we aim at optimizing a function of the terminal wealth $V_T$ that can be derived iteratively. Let $b_{0\text {--}}$ denote the initial holdings in the bank account bearing the risk-free rate $r\in \mathbb {R}$, $\xi _{0\text {--}}$ denote the initial holdings in the underlying, $\kappa \ge 0$ the coefficient for proportional transaction cost, and $\tau >0$ the year fraction of a time step. Hence, the value of the portfolio before and after rehedging at time 0 is given by

$$\begin{aligned} V_{0\text {--}}&=b_{0\text {--}}+\xi _{0\text {--}}X_0,\\ V_0&=b_0+\xi _0X_0, \end{aligned}$$

where $b_0:=b_{0\text {--}}-\left( \xi _0-\xi _{0\text {--}}\right) X_0-\kappa \left| \xi _0-\xi _{0\text {--}}\right| X_0$ satisfies the self-financing principle. Then, we proceed consistently in terms of the iteration

$$\begin{aligned} V_{t\text {--}}&=b_{t\text {--}}+\xi _{t\text {--}}X_t\\ V_t&=b_t+\xi _tX_t, \end{aligned}$$

where $b_{t\text {--}}=b_{t-1}e^{r\tau }$, $\xi _{t\text {--}}=\xi _{t-1}$ and $b_t=b_{t\text {--}}-\left( \xi _t-\xi _{t\text {--}}\right) X_t-\kappa \left| \xi _t-\xi _{t\text {--}}\right| X_t$ for $t\in \{1,2,\ldots ,N-1\}$. At maturity, we have to bear the unwinding cost additionally. Hence,

$$\begin{aligned} V_T&=b_{T-1}e^{r\tau }+\xi _{T-1}X_T-\kappa \left| \xi _{T-1}\right| X_T. \end{aligned}$$

For experimental purposes, we chose similar parameters as in Fig. 2; a maturity $T=10$, a discretization $N=52T$ (i.e., weekly rehedging with $\tau =1/52$), a risk-free rate $r=1\%$, a drift $\mu =8\%$, and a volatility $\sigma =30\%$. The initial state of the market and the wealth are standardized to $X_0=100$, $b_{0\text {--}}=70$ and $\xi _{0\text {--}}=0$. The ultimate goal is to reach the deterministic payoff $H=100$; this refers to as a continuously compounded return of $h\approx 3.6\%$. Let $J\in \mathbb {N}$ be a sufficiently large number³ of simulated paths $X^{(j)}=(X_t^{(j)})_{t=0,1,2,\ldots ,N}$, e.g., $J=10^4$. Given this parameter set, we seek to find optimal rehedging strategies. This can be achieved by applying a suitable backpropagation algorithm on the deep neural network architecture that consolidates the above feedforward neural network instances together with the intermediary accounting routines. A direct translation of the above concept is the minimization of the loss

$$\begin{aligned} \frac{1}{J}\sum _{j=1}^J\max \left\{ H-V_T^{(j)},0\right\} ^p. \end{aligned}$$

We modify the loss function for two crucial reasons. Firstly, the function $x\mapsto \max \{H-x,0\}$ is nondifferentiable at the point H and ignores any points beyond H. This raises concerns on the stability of the learning algorithm. Therefore, we replace the maximum with the softplus function $\log {(1+e^{x})}$. Secondly, the natural extension of the loss function apparently has an undesirable local minimum for strategies with a deterministic equity portion $\xi _t\equiv \xi \in [0,1]$; see Fig. 3 above.

Without further interventions, the learning algorithms often gets stuck in the suboptimal neighborhood of static strategies. Therefore, we also penalize deviations beyond H in terms of

$$\begin{aligned} \frac{1}{J}\sum _{j=1}^J\left( \log \left\{ 1+\exp \left\{ H-V_T^{(j)}\right\} \right\} \right) ^p+\lambda \log \left\{ 1+\exp \left\{ V_T^{(j)}-H\right\} \right\} . \end{aligned}$$

for a regularization parameter $\lambda =0.1$. It needs to be noted that the introduction of the positive second summand does not alter the global optimum. The following charts exhibit the out-of-sample performance of a trained artificial financial agent for $p\in \{1,1.5,5\}$ and $\kappa \in \{0,0.005\}$. For the training, we relied on the default configuration of the Adam algorithm of TensorFlow Keras with a batch size of 64 over 500 epochs. All charts are generated with the same sample data. The training phase of the Jupyter notebook takes in each case approximately 2.5h on Google Colab. As a benchmark, we also show the performance of naively applying the continuous-time optimal hedging strategy on the same weekly time grid.

For $p\in \{1,1.5\}$, deep hedging mitigates the risk of large losses. In the absence of transaction costs, our simulations suggest that deep hedging does not surpass the benchmark consistently, at least not for the selected parameters and without further measures. However, in the presence of transaction costs, the strength of deep hedging is particularly evident, cf. Table 1. Moreover, it could be extended to more realistic dynamics of the underlying for which analytical solutions are typically not available. The empirically derived expected terminal wealth, the value-at-risk to a significance of $5\%$ as well as the success rates and the success ratios for the different investment strategies are lined up in Table 1. Remarkably, due to accounting for offsetting effects of an adjusted hedge and borne transaction cost, deep hedging leads to an improved value-at-risk in the presence of transaction cost (Table 1; cf. also Figs. 4, 5, 6, 7).

Table 1

Selected empirically derived characteristics of the terminal wealth distribution for $p\in \{1,1.5,5\}$ and $\kappa \in \{0,0.005\}$

Mean	Theoretical	Deep hedging		Discrete delta hedging
Mean	$\kappa =0$	$\kappa =0$	$\kappa =0.005$	$\kappa =0$	$\kappa =0.005$
$p=1$	93.18	91.07	89.10	93.19	88.39
$p=1.5$	88.52	88.53	87.44	91.55	87.97
$p=5$	80.17	80.50	79.89	80.28	79.81

$5\%$-quantile	Theoretical	Deep hedging		Discrete delta hedging
$5\%$-quantile	$\kappa =0$	$\kappa =0$	$\kappa =0.005$	$\kappa =0$	$\kappa =0.005$
$p=1$	0	48.71	48.98	4.05	$-2.98$
$p=1.5$	49.63	54.63	57.17	54.96	48.84
$p=5$	73.59	71.96	74.00	73.64	73.11

Success rate	Theoretical	Deep hedging		Discrete delta hedging
Success rate	$\kappa =0$	$\kappa =0$	$\kappa =0.005$	$\kappa =0$	$\kappa =0.005$
$p=1$	0.93	0.41	0.36	0.47	0.01
$p=1.5$	0	0.21	0.12	0.29	0.02
$p=5$	0	0.00	0.00	0.00	0.00

Success ratio	Theoretical	Deep hedging		Discrete delta hedging
Success ratio	$\kappa =0$	$\kappa =0$	$\kappa =0.005$	$\kappa =0$	$\kappa =0.005$
$p=1$	0.93	0.89	0.88	0.92	0.88
$p=1.5$	0.89	0.87	0.87	0.91	0.88
$p=5$	0.80	0.80	0.80	0.80	0.80

The success rate $\mathbb {P}\left[ V_T\ge H\right] $ is the counter probability of the shortfall risk. The success ratio is the generalized success rate $\mathbb {E}\left[ \mathbbm {1}_{\{V_T\ge H\}}+\frac{V_T}{H}\mathbbm {1}_{\{V_T<H\}}\right] $ as defined in (Föllmer and Leukert 2000, Definition (2.32)). Not only does deep hedging yield a flatter right tail in the presence of transaction costs—as can be deduced from the figures for the $5\%$-quantile—deep hedging moreover provides a superior success rate, and can keep up with the success ratio of discrete delta hedging

10 Conclusions and outlook

We have discussed two approaches to goal-based investing in this article. The first—analytical—approach yields several explicit continuous dynamic trading strategies that risk-taking, risk-neutral, and risk-averse investors need to implement to maximize their goal-based utilities.

In the real world, however, continuous-time trading is not feasible. We show that this drawback can be addressed with a more flexible deep hedging approach. Not only is this approach well-suited for discrete rebalancing, it also allows for the inclusion of transaction costs. Curiously, goal-based investing provides a use case for deep hedging with a probability-maximizing objective function, due to the problem’s equivalence with efficient hedging.

There are many ramifications of our work on hedging goals that we will investigate elsewhere. In particular, open research questions that we will address include:

hedging goals under general market dynamics, e.g., GARCH Ghalanos (2019), or scenarios generated with Generative Adversarial Networks (GAN, cf. Ni et al. (2020));
hedging goals with downward protection in the spirit of Sect. 8;
hedging goals with exogenous income (Browne 1999b, Sect. 7) and liabilities Browne (1997);
beating stochastic benchmarks (as in Sect. 8.1 of Browne 1999b) using deep learning.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

previous article Does analysts’ industrial concentration affect the quality of their forecasts?

next article Evaluating the influence of financial technology (FinTech) on sustainable finance: a comprehensive global analysis

Appendix A: Proofs

Appendix A.1: Proofs of the results with risk neutrality and risk taking ($p\in [0, 1]$)

Proof of Proposition 5.2

(Leukert 1999, Theorem 9) states the test function

$$\begin{aligned} \varphi _p&= \mathbbm {1}_{\left\{ \frac{\,\textrm{d}\mathbb {P}}{\,\textrm{d}\mathbb {P}^*} \ge a_p \,H^{1-p} \right\} }, \end{aligned}$$

(14)

needs to be used to modify the claim H. Here, $a_p$ is determined implicitly by the capital requirement $z = \mathbb {E}^*[\varphi _p\, H]$. To avoid trivial cases, let us assume that z lies within the open interval $\left( 0, H_{0, T} \right) $. It is straightforward to show that the constant $a_p$ is given by

$$\begin{aligned} a_p&= H^{p-1} \exp \left\{ \sigma _*\, \sqrt{T}\, \Phi ^{-1}\left( 1-\frac{z}{H_{0, T}} \right) - \frac{1}{2}{\sigma _*}^2 \ T \right\} . \end{aligned}$$

Let us introduce the density process $(Z_t)_{t\ge 0}$ as

$$\begin{aligned} Z_t&= \exp \left\{ - {\pmb {\vartheta }}^\top \, \pmb {W}_t^* + \frac{1}{2} {\sigma _*}^2 \, t \right\} . \end{aligned}$$

(15)

Note that $Z_T = \rho _*$, cf. (4). The density process and the optimal growth portfolio are related via

$$\begin{aligned} \log Z_t&= - \left( \log \frac{\Pi _t}{\Pi _0} - \left( r - \frac{1}{2} {\sigma _*}^2\right) t \right) + \frac{1}{2} {\sigma _*}^2 \ t. \end{aligned}$$

With these notations, we can show that the value process corresponds to a digital European call option, namely,

$$\begin{aligned} \mathbb {E}^*[\varphi _p H \, \vert \, \mathcal {F}_t]&= \mathbb {P}^*\left[ Z_T \le \frac{H^{p-1}}{a_p} \, \bigg |\, \mathcal {F}_t \right] \\&= \mathbb {P}^*\left[ \frac{Z_T}{Z_t} \le \frac{H^{p-1}}{a_p\ Z_t} \, \bigg |\, \mathcal {F}_t \right] \\&= \mathbb {P}^*\left[ {\pmb {\vartheta }}^\top \left( \pmb {W}_T^*-\pmb {W}_t^*\right) \ge - \left( \log \frac{H^{p-1}}{a_p \, Z_t}-\frac{1}{2}{\sigma _*}^2\, (T-t)\right) \right] \\&= 1-\Phi \left( \frac{\log a_p - (p-1) \log H + \frac{1}{2}{\sigma _*}^2 \,(T-t) - \left( \log \frac{\Pi _t}{\Pi _0} -\left( r-\frac{1}{2} {\sigma _*}^2 \right) t \right) + \frac{1}{2}{\sigma _*}^2 \,t}{\sigma _* \,\sqrt{T-t}} \right) \\&= 1-\Phi \left( \frac{ \Phi ^{-1}\left( 1-\frac{ z}{H_{0, T}}\right) \sigma _* \,\sqrt{T} - \left( \log \frac{\Pi _t}{\Pi _0} -\left( r-\frac{1}{2} {\sigma _*}^2\right) t \right) }{\sigma _* \,\sqrt{T-t}} \right) \\&= \Phi \left( \frac{\log \frac{\Pi _t}{\Pi _0} - \left( r-\frac{1}{2} {\sigma _*}^2\right) t - \Phi ^{-1}\left( 1-\frac{z}{H_{0, T}}\right) \sigma _*\,\sqrt{T}}{\sigma _* \, \sqrt{T-t}} \right) \\&= \Phi \left( \frac{\log \frac{\Pi _t}{K^*} + \left( r-\frac{1}{2}{\sigma _*}^2\right) (T-t)}{\sigma _*\, \sqrt{T-t}} \right) , \end{aligned}$$

where the strike $K^*$ is given by

$$\begin{aligned} \log K^*&= \log \Pi _0 + \left( r-\frac{1}{2}{\sigma _*}^2\right) \,T-\sigma _*\,\sqrt{T}\,\Phi ^{-1}\left( \frac{z}{H_{0, T}} \right) . \end{aligned}$$

This furthermore shows that the solution for the multivariate problem of minimizing the expected shortfall is identical to the one derived by Browne in the case of maximizing the probability of reaching an investment goal Browne (1999b). $\square $

Proof of Corollary 5.4

By virtue of (4), we can express the test function $\varphi _p$ as the indicator function

$$\begin{aligned} \varphi _p&= \mathbbm {1}{\left\{ \rho _* \le \frac{H^{p-1}}{a_p} \right\} } = \mathbbm {1}{\left\{ \exp \left\{ \frac{1}{2}\vartheta ^2\, T-\vartheta \,W_T^* \right\} \le \frac{H^{p-1}}{a_p} \right\} } \\&= \mathbbm {1}{\left\{ W_T^* \ge \frac{1}{\vartheta }\left( \frac{1}{2}\vartheta ^2\, T + \log a_p + (1-p) \log H \right) \right\} }. \end{aligned}$$

Hence, for a standard normal random variate Y,

$$\begin{aligned} R_{T, 0}\, z&= R_{T, 0}\, \mathbb {E}^*[\varphi _p H] = H \, \mathbb {P}^*\left[ \sqrt{T}\, Y \ge \frac{1}{\vartheta }\left( \frac{1}{2}\vartheta ^2\, T + \log a_p + (1-p) \log H\right) \right] \\&= H \left( 1 - \Phi \left( \frac{\frac{1}{2}\vartheta ^2\, T+\log a_p + (1-p) \log H }{\vartheta \, \sqrt{T}}\right) \right) . \end{aligned}$$

Thus,

$$\begin{aligned} a_p&= H^{p-1} \exp \left\{ \vartheta \,\sqrt{T}\Phi ^{-1}\left( 1-\frac{z}{H_{0, T}}\right) - \frac{1}{2}\vartheta ^2\, T \right\} . \end{aligned}$$

It can be shown that $\rho _* = k \ {X_T}^{-\alpha }$, for a real constant k. In fact,

$$\begin{aligned} {X_T}^{-\alpha }&= {x_0}^{-\alpha }\ \exp \left\{ - \alpha \left( \mu - \frac{\sigma ^2}{2}\right) T -\alpha \sigma W_T \right\} \\&= {x_0}^{-\alpha } \exp \left\{ - \alpha \left( \frac{\mu - r}{2} + \frac{ \mu + r - \sigma ^2}{2}\right) T -\vartheta \, W_T \right\} \\&= {x_0}^{-\alpha } \exp \left\{ - \frac{\alpha (\mu + r - \sigma ^2)T}{2} \right\} \underbrace{\exp \left\{ - \frac{1}{2}\vartheta ^2\, T -\vartheta W_T \right\} }_{=\rho ^*}, \end{aligned}$$

and hence

$$\begin{aligned} k&= {x_0}^\alpha \exp \left\{ \frac{\alpha (\mu + r - \sigma ^2)T}{2} \right\} . \end{aligned}$$

The test function $\varphi _p$ in (14) can therefore be rewritten as

$$\begin{aligned} \varphi _p&= \mathbbm {1} \left\{ k\, {X_T}^{-\alpha } \le \frac{H^{p-1}}{a_p} \right\} = \mathbbm {1}\left\{ X_T \ge \root \alpha \of {k\, a_p\, H^{1-p}} \right\} \\&= \mathbbm {1}\left\{ X_T \ge x_0 \exp \left\{ \frac{\mu +r-\sigma ^2}{2}\,T + \Phi ^{-1}\left( 1-\frac{ z}{H_{0, T}}\right) \sigma \,\sqrt{T} - \frac{\mu - r}{2}\,T \right\} \right\} \\&= \mathbbm {1}\left\{ X_T \ge x_0 \exp \left\{ \left( r - \frac{\sigma ^2}{2}\right) T + \Phi ^{-1}\left( 1-\frac{ z}{H_{0, T}}\right) \sigma \,\sqrt{T} \right\} \right\} \\&= \mathbbm {1}\left\{ X_T \ge x_0 \exp \left\{ \left( r - \frac{\sigma ^2}{2}\right) T - \Phi ^{-1}\left( \frac{ z}{H_{0, T}}\right) \sigma \,\sqrt{T} \right\} \right\} . \end{aligned}$$

Let $Z_T:= \rho _*$, with the density process $Z=(Z_t)_{t\in [0, T]}$ defined as

$$\begin{aligned} \log Z_t&= -\frac{\vartheta }{\sigma } \left( \log \frac{X_t}{x_0} -\left( r- \frac{1}{2}\sigma ^2\right) t\right) + \frac{1}{2}\vartheta ^2 t. \end{aligned}$$

Then we have that (cf. Xu 2004, Corollary 2.8)

$$\begin{aligned}&\mathbb {E}^*[\varphi _p H \, \vert \, \mathcal {F}_t] = \mathbb {P}^*\left[ Z_T \le \frac{H^{p-1}}{a_p} \, \bigg |\, \mathcal {F}_t \right] \\&\quad = \mathbb {P}^*\left[ \frac{Z_T}{Z_t}\, Z_t \le \frac{H^{p-1}}{a_p} \, \bigg |\, \mathcal {F}_t \right] \\&\quad = \mathbb {P}^*\left[ \frac{Z_T}{Z_t} \le \frac{H^{p-1}}{a_p\ Z_t} \, \bigg |\, \mathcal {F}_t \right] \\&\quad = \mathbb {P}^*\left[ {W_T}^*-{W_t}^*\ge -\frac{1}{\vartheta }\left( \log \left( \frac{H^{p-1}}{a_pZ_t}\right) -\frac{1}{2}\vartheta ^2(T-t)\right) \right] \\&\quad = 1-\Phi \left( \frac{\log a_p + (1-p)\log H + \frac{1}{2}\vartheta ^2 (T-t) - \frac{\vartheta }{\sigma }\left( \log \frac{X_t}{x_0} -\left( r- \frac{\sigma ^2}{2}\right) t \right) + \frac{1}{2}\vartheta ^2 t}{\vartheta \sqrt{T-t}} \right) \\&\quad = 1-\Phi \left( \frac{ \Phi ^{-1}\left( 1-\frac{ z}{H_{0, T}} \right) \vartheta \sqrt{T} - \frac{\vartheta }{\sigma }\left( \log \frac{X_t}{x_0} -\left( r- \frac{\sigma ^2}{2}\right) t \right) }{\vartheta \,\sqrt{T-t}} \right) \\&\quad = \Phi \left( \frac{\log \frac{X_t}{x_0} - \left( r- \frac{\sigma ^2}{2}\right) t - \Phi ^{-1}\left( 1-\frac{ z}{R_{0, T}\,H}\right) \sigma \sqrt{T}}{\sigma \sqrt{T-t}} \right) \\&\quad = \Phi \left( \frac{\log \frac{X_t}{K^*} + \left( r-\frac{\sigma ^2}{2}\right) (T-t)}{\sigma \sqrt{T-t}} \right) , \end{aligned}$$

where $K^*$ is given by

$$\begin{aligned} \log K^* = \log x_0 + \Phi ^{-1}\left( 1-\frac{z}{H_{0, T}}\right) \sigma \,\sqrt{T} + \left( r-\frac{\sigma ^2}{2}\right) T. \end{aligned}$$

The modified claim $\varphi _p H$ thus corresponds to a digital call option with strike $K^*$; cf. (10). $\square $

Remark A.1

Note that, as z approaches 0, the inverse cumulative distribution function diverges to $+\infty $, so that $K^*$ tends to $\infty $ and, as a consequence, the (initial) value of the modified claim $V_t$ vanishes.

Conversely, as z approaches $H_{0, T}$ from below, $K^*$ diverges to $-\infty $, so that $\varphi _p \rightarrow \mathbbm {1}_{\mathbb {R}_+}$: in the limit, the modified claim coincides with the original one.

Appendix A.2: Proofs of the results with risk aversion

Proof of Proposition 7.1

Recall that, in the case of increasing risk aversion, we need to consider the problem (11). For this purpose, we note that the density process $(Z_t)_{t\in [0, T]}$ (cf. (15)) relates to the optimal-growth portfolio via

$$\begin{aligned} Z_T&= \rho _* = \frac{\Pi _0}{R_{0, T}\,\Pi _T}. \end{aligned}$$

The modified claim of (11) thus takes the form

$$\begin{aligned} \varphi _p&= \left( 1-a_p\left( \frac{\Pi _0}{R_{0, T}\,\Pi _T}\right) ^{p'}\right) _+, \end{aligned}$$

where we have used the shorthand $p'=1/(p-1)$. This equation in turn can be rewritten as

$$\begin{aligned} \varphi _p&= \left( 1-\left( \frac{L}{\Pi _T}\right) ^{p'}\right) \, \mathbbm {1}_{\{ \Pi _T \ge L \}}, \end{aligned}$$

where the threshold is given by $L:=\root p' \of {a_p}\, R_{T, 0}\, \Pi _0$. This claim consists of a European digital option that is modified by a factor. The difference now, however, is that the digital option is a contingent claim on the optimal growth portfolio, whose wealth at time t is given by $\Pi _t$.

Calculations analogous to those in the case of a single risky asset (cf. the proof of Corollary 7.2 below) show that the modified claim on the optimal-growth portfolio takes the form specified in Proposition 7.1. $\square $

Proof of Corollary 7.2

The modified claim (11) in this case reads as

$$\begin{aligned} \varphi _p = \left( 1-a_p\, {\rho _*}^{p'}\right) _+. \end{aligned}$$

Recall from the proof of Corollary 5.4 that

$$\begin{aligned} \rho _* = \frac{k}{{X_T}^{\alpha }}, \end{aligned}$$

where

$$\begin{aligned} k = {x_0}^\alpha \exp \left\{ \frac{\alpha (\mu + r - \sigma ^2)T}{2}\right\} . \end{aligned}$$

Therefore,

$$\begin{aligned} \varphi _p = \left( 1 - a_p\frac{k^{p'}}{ {X_T}^{\alpha p'}}\right) \, \mathbbm {1}_{\left\{ X_T \ge {a_p}^\frac{p-1}{\alpha }k^{\frac{1}{\alpha }}\right\} }. \end{aligned}$$

(16)

Let us denote the threshold by $L:= {a_p}^\frac{p-1}{\alpha }k^{\frac{1}{\alpha }}$. Thus Eq. (16) can be rewritten as

$$\begin{aligned} \varphi _p = \left( 1-\left( \frac{L}{X_T}\right) ^{\alpha _p} \right) \, \mathbbm {1}_{\{ X_T \ge L \}}. \end{aligned}$$

(17)

Defining the function $f_p(y):= \left( 1-\frac{L^{\alpha _p}}{y^{\alpha _p}} \right) \, \mathbbm {1}_{\{ y \ge L \}}$ for $y \in \mathbb {R}$, we set

$$\begin{aligned} V_t&= \mathbb {E}^*[\varphi _p H\,\vert \, \mathcal {F}_t]\\&= H_{T,t} \, \mathbb {E}^*\left[ f_p(X_t\, \exp \left( \sigma \, (W_T^*-W_t^*) + (r-\sigma ^2/2)\,(T-t)\right) \,\vert \, \mathcal {F}_t\right] \\&=: H_{T, t}\, F_p(t, X_t), \end{aligned}$$

so that, for $\tau :=T-t$,

$$\begin{aligned}&H_{T, t}\,F_p(t, x) = \int _\mathbb {R}f_p(x\exp [\sigma \sqrt{\tau }\, y + (r-\sigma ^2/2)\, \tau ])\, \exp (-y^2/2)\frac{\,\textrm{d}y}{\sqrt{2\pi }} \\&\quad = \Phi (d_-(t; x, L)) - \frac{L^{\alpha _p}}{x^{\alpha _p}} \int _{-d_-(t; x, L)}^\infty e^{ -\alpha _p(\sigma \sqrt{\tau } y + (r-\sigma ^2/2)\tau ) } e^\frac{-y^2}{2}\frac{\,\textrm{d}y}{\sqrt{2\pi }} \\&\quad = \Phi (d_-(t; x, L)) - \frac{L^{\alpha _p}}{x^{\alpha _p}} e^{ \alpha _p \left( \alpha _p+1\right) (\sigma ^2/2-r) \tau } \Phi \left( d_-(t; x, L)-\alpha _p \sigma \sqrt{\tau } \right) . \end{aligned}$$

(18)

The threshold L is determined implicitly by the initial endowment z via

$$\begin{aligned} z&= \mathbb {E}^*[\varphi _p H] = H_{0, T}\,F_p(0, x_0)\\&= H_{0,T} \left( \Phi (d_-(0; x_0, L)) - \frac{L^{\alpha _p}}{x_0^{\alpha _p}} e^{\alpha _p\left( \alpha _p+1\right) (\sigma ^2/2-r) T } \Phi \left( d_-(0; x, L)-\alpha _p \sigma \sqrt{T} \right) \right) . \end{aligned}$$

$\square $

Appendix A.3: Proofs of the results with downward protection

Proof of Proposition 8.2

This follows from applying the results in Browne (1999b, Sect. 8). $\square $

See, e.g., definition 8.1.1 in Delbaen and Schachermayer (2006).

We constrain leverage to 100% in both cases to avoid excessively high exposures. This entails that the optimal growth portfolio in our example coincides with a buy-and-hold strategy.

Whereas the trade-off between hedging, bearing transaction cost, and leaving a position open is involved, the mathematical complexity of the solution is low. Experiments demonstrate that $10^4$ paths are sufficient to learn the desired behavior.

Browne, S.: Survival and growth with a liability: optimal portfolio strategies in continuous time. Math. Oper. Res. 22(2), 468–493 (1997)MathSciNetCrossRef

Browne, S.: Beating a moving target: optimal portfolio strategies for outperforming a stochastic benchmark. Finance Stoch. 3, 275–294 (1999a)

Browne, S.: Reaching goals by a deadline: digital options and continuous-time active Portfolio management. Adv. Appl. Probab. 31(2), 551–577 (1999b)

Bühler, H., Gonon, L., Teichmann, J., Wood, B.: Deep hedging. Quant. Finance 19(8), 1271–1291 (2019)MathSciNetCrossRef

Cvitanić, J., Karatzas, I.: On dynamic measures of risk. Finance Stoch. 3, 451–482 (1999)MathSciNetCrossRef

Delbaen, F., Schachermayer, W.: The Mathematics of Arbitrage. Springer, Berlin (2006)

Föllmer, H., Leukert, P.: Quantile hedging. Finance Stoch. 3, 251–273 (1999)MathSciNetCrossRef

Föllmer, H., Leukert, P.: Efficient hedging: cost versus shortfall risk. Finance Stoch. 4, 117–146 (2000)MathSciNetCrossRef

Föllmer, H., Schied, A.: Stochastic Finance: An Introduction in Discrete Time. De Gruyter Studies in Mathematics, Walter de Gruyter, Birkhauser (2016)CrossRef

Ghalanos, A.: rmgarch: multivariate GARCH models. R package version 13-6 (2019)

Giron, K., et al.: Applying goal-based investing principles to the retirement problem. EDHEC-Risk Institute (2018)

Guo, I., Langrené, N., Loeper, G., Ning, W.: Portfolio optimization with a prescribed terminal wealth distribution. Quant. Finance 22(2), 333–347 (2021)MathSciNetCrossRef

Halperin, I.: QLBS: Q-learner in the black-Scholes(-Merton) worlds. J. Deriv. 28(1), 99–122 (2020)CrossRef

Leukert, P.: Absicherungsstrategien zur minimierung des verlustrisikos. Ph.D. Thesis, Humboldt-Universität zu Berlin (1999)

Markowitz, H.: Portfolio selection. J. Finance 7(1), 77–91 (1952)

Nakano, Y.: Efficient hedging with coherent risk measure. J. Math. Anal. Appl. 293(1), 345–354 (2004)MathSciNetCrossRef

Ni, H., Szpruch, L., Wiese, M., Liao, S., Xiao, B.: Conditional sigwasserstein gans for time series generation (2020). arxiv:2006.05421

Pham, H.: Minimizing shortfall risk and applications to finance and insurance problems. Ann. Appl. Probab. 12(1), 143–172 (2002)MathSciNetCrossRef

Platen, E.: On the role of the growth optimal Portfolio in finance. Aust. Econ. Pap. 44(4), 365–388 (2005)CrossRef

Protter, P.E.: Stochastic Integration and Differential Equations, 2nd edn. Springer, Berlin (2004)

Ruf, J., Wang, W.: Neural networks for option pricing and hedging: a literature review. J. Comput. Finance 24(1), 1–45 (2020)

Spivak, G., Cvitanić, J.: Maximizing the probability of a perfect hedge. Ann. Appl. Probab. 9(4), 1303–1328 (1999)MathSciNet

Szehr, O.: Hedging of financial derivative contracts via Monte Carlo tree search (2021). arxiv:2102.06274

Xu, M.: Minimizing shortfall risk using duality approach—an application to partial hedging in incomplete markets. Ph.D. Thesis, Carnegie Mellon University (2004)

Title: Hedging goals
Authors: Thomas Krabichler
Marcus Wunsch
Publication date: 17-11-2023
Publisher: Springer US
Published in: Financial Markets and Portfolio Management / Issue 1/2024
Print ISSN: 1934-4554
Electronic ISSN: 2373-8529
DOI: https://doi.org/10.1007/s11408-023-00437-y

Springer Professional

Hedging goals

Abstract

Publisher’s Note

1 Introduction

2 Main contributions

3 Outline of this paper

4 Preliminaries

4.1 The model

4.2 Goal-based investing and hedging

5 Risk neutrality and risk taking

6 Practical considerations when maximizing probabilities

7 Risk aversion

8 Downward protection

8.1 The nature of the claim with downward protection

9 Deep hedging

10 Conclusions and outlook

Publisher’s Note

Appendix A: Proofs

Appendix A.1: Proofs of the results with risk neutrality and risk taking (\(p\in [0, 1]\))

Appendix A.2: Proofs of the results with risk aversion

Appendix A.3: Proofs of the results with downward protection

Mean	Theoretical	Deep hedging		Discrete delta hedging
Mean	\(\kappa =0\)	\(\kappa =0\)	\(\kappa =0.005\)	\(\kappa =0\)	\(\kappa =0.005\)
\(p=1\)	93.18	91.07	89.10	93.19	88.39
\(p=1.5\)	88.52	88.53	87.44	91.55	87.97
\(p=5\)	80.17	80.50	79.89	80.28	79.81

\(5\%\)-quantile	Theoretical	Deep hedging		Discrete delta hedging
\(5\%\)-quantile	\(\kappa =0\)	\(\kappa =0\)	\(\kappa =0.005\)	\(\kappa =0\)	\(\kappa =0.005\)
\(p=1\)	0	48.71	48.98	4.05	\(-2.98\)
\(p=1.5\)	49.63	54.63	57.17	54.96	48.84
\(p=5\)	73.59	71.96	74.00	73.64	73.11

Springer Professional

Abstract

Publisher’s Note

1 Introduction

2 Main contributions

3 Outline of this paper

4 Preliminaries

4.1 The model

4.2 Goal-based investing and hedging

5 Risk neutrality and risk taking

6 Practical considerations when maximizing probabilities

7 Risk aversion

8 Downward protection

8.1 The nature of the claim with downward protection

9 Deep hedging

10 Conclusions and outlook

Publisher’s Note

Appendix A: Proofs

Appendix A.1: Proofs of the results with risk neutrality and risk taking (\(p\in [0, 1]\))

Appendix A.2: Proofs of the results with risk aversion

Appendix A.3: Proofs of the results with downward protection

Other articles of this Issue 1/2024

Evaluating the influence of financial technology (FinTech) on sustainable finance: a comprehensive global analysis

Report of the editor 2023

The Credit Suisse bailout in hindsight: not a bitter pill to swallow, but a case to follow

The palgrave handbook of FinTech and blockchain

Does analysts’ industrial concentration affect the quality of their forecasts?