main-content

## Weitere Artikel dieser Ausgabe durch Wischen aufrufen

02.04.2020 | Ausgabe 3/2020 Open Access

# No–arbitrage commodity option pricing with market manipulation

Zeitschrift:
Mathematics and Financial Economics > Ausgabe 3/2020
Autoren:
René Aïd, Giorgia Callegaro, Luciano Campi
Wichtige Hinweise

## Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## 1 Introduction

The methods and techniques of manipulation are limited only by the ingenuity of man,
in Cargill vs Hardin, US Court of Appeal, 8th circuit, December 7, 1971.
Price manipulation in financial markets is not the rare event as one may think. In their paper, Aggarwal and Wu [ 1] provide data on more than 140 cases of market stock price manipulation in the sole period of ten years from 1990 to 2001 released by the Security Exchange Commission. As the authors quote, those cases only correspond to those who were caught. On commodity markets, illegal practices of market manipulations are abundantly documented and can compete with stock markets (see Pirrong [ 20] for a survey of those practices). More recently, the LIBOR itself was the object of a coordinated manipulation by a cartel of banks. The LIBOR, created more than fifty years ago and controlled by the British Bankers Association, serves as a benchmark for loans and as an index in hundreds of trillions of nominal in derivatives. It is enough to read Duffie and Stein [ 7] to measure the extent of social welfare loss induced by the manipulators actions. In their report for the Federal Reserve Bank of New York on the LIBOR scandal, Hou and Skeie [ 14] explain that if the first motivation for this manipulation in the aftermath of the 2008 financial crisis was to maintain a signal of credit worthiness, the second motivation was the express intent of benefiting the bank’s derivatives positions.
Indeed, if the first generation of market price manipulation concentrated on using some market power to increase the market price and then resell the good at a higher price (unravelling strategy), it seems that the second generation of market manipulation will use the leverage effect provided by derivatives. Worrying enough to support this prognosis, the recent paper of Griffin and Shams [ 12] asserts the possibility of an on-going VIX manipulation using large position in the out-of-the money options used to compute the VIX. If true, it would mean that some traders are already engaged in what was thirty years ago a theoretical problem in derivative pricing when academics would relax the hypothesis of no-market impact in the Black and Scholes pricing framework (see Jarrow [ 15] for a seminal work on this subject).
In this paper, we take market price manipulation models one step further in considering the possibility of the joint control of the average (the drift) and of the volatility of a commodity price by the actions of a producer and a trader who exchange a derivative. To analyse the behaviours of both players and the distortion of the prices of the commodity and of the derivative, we design three continuous-time models of increasing complexity. In each model, the commodity price is impacted by the actions of a representative risk-neutral producer and a representative risk-neutral trader. Both agents want to maximize their respective expected profits. The representative producer has market power and can increase or decrease the price by reducing or increasing her production rate. Actions on the volatility can be performed either by randomizing the production rate or by spreading false information. Production randomization is just making a strategic use of outages and the question answered in this paper is when this device has an interest for the producer. Further, regarding the use of information on the volatility, we suppose that the trader or the producer has identified some channels allowing her to act on the nominal volatility of the underlying by an appropriate rate of information. For both agents, manipulation of the commodity price comes at some costs, which are included in their profit functions.
We consider first the case of production-based manipulation: the producer acts alone and can impact both the average price and its volatility by changing her average production rate and the volatility of her production rate. Second, we consider the case of production and information-based manipulation by a producer: the producer acts alone, she can affect the average price by changing her production rate and the volatility as mentioned above, e.g. by spreading appropriate information. Finally, we consider the case of a competition between a producer who can exert market power on the drift of the price and a trader who has an impact on the volatility of the price. In each case, we suppose that the producer contracts at time zero a constant (long or short) position of a European convex derivative and delivers (or receives) its payoff at maturity. The trader has the opposite position in the derivative. Since they have an asymmetric impact on the dynamics of the commodity price, we are able to assess which instrument is more efficient, manipulation of the drift or manipulation of the volatility. We aim at studying to which extent the producer and the trader can profit from their market power and how the prices of the commodity and its derivatives can be distorted by their actions.
In the classification of market manipulation provided by Allen and Gorton [ 3], the first model corresponds to an action-based manipulation (using physical means such as production); the second model is a mixture of action-based and information-based (spreading false rumours on commodity scarcity or accounting and earnings’ manipulation); the last one is a mixture of action-based, information-based and trading-based (buying to increase the price and then selling back).
To our knowledge, this is the first paper to study the joint manipulation of a commodity price and a derivative. We give now some reasons why our analysis might get even more relevant in a near future.
First, the commodity business has gone through a concentration trend in the last decades with the emergence of major players like Glencore/Xstrata, Rio Tinto or BHP Billinton. A small amount of international firms concentrate in their hands a significant volume of minerals production. For instance, Glencore concentrate 60% of the zinc, 50% of the copper, 45% of the lead and 38% of the aluminium. At the same time, they have to take significant position in the financial markets to hedge their big physical positions. For instance, Glencore [ 10, note 28, p. 201] shows a position of $3.2 billion of commodity related contracts including futures, options, swaps and physical forwards compared to an adjusted EBITDA of$15.8 billion or a total asset value of $128 billion. Rio Tinto [ 21, notes p. 193] presents an exposition in nominal value of derivatives in aluminum of$1.786 billion for an EBITDA for aluminum of $3.1 billion and operating asset value of$16.5 billion. Players of this size cannot ignore that the impact they may have on the price of a commodity will affect the value of their portfolio derivatives too.
Second, large commodity firms are not the only big players in financial markets to hold significant positions in commodity derivatives. With the financialization of commodity markets, large hedge funds, banks and institutional players have increased their position in commodity derivatives (see Cheng and Xiong [ 5] for an overview). Thus, when trying to move the price at her own advantage, a producer may find some opposition from financial actors harmed by her action. This problem is already documented in the case of large position of derivatives exchanged between financial institution (see the case of Merrill Lynch selling \$500 million of knock-in put options to Leiter’s International in Gallmeyer and Seppi [ 11]).
Third, the activities of trading in commodity firms are in general isolated in a subsidiary because they fall within the scope of financial regulation. Thus, the trading activity might end up in conflict with the production activity regarding the use of market power.
Our main results can be summarized as follows. For each model we provide closed-form solutions in terms of a coupled Riccati systems of ordinary differential equations. In each model, the price of the derivative is a fair market price in the sense that it is consistent with no-arbitrage condition. First, we find that in all models, the optimal production rate of the producer follows the same pattern: during a transitory phase, it reaches the production rate that maximizes the profit rate, then it stays there and at maturity, the production rate increases (resp. decreases) in case of short position (resp. long position). In the case of production-based manipulation, it is optimal for the producer to increase the volatility of the production rate to induce an increase in the volatility of the derivative if and only if she has a short position in the derivative exceeding a given threshold. Without derivative position, the value function of the producer is a non-increasing function of the volatility. Thus the producer prefers to reduce the volatility. But, when she holds a sufficiently large short position in the derivative, the increase in volatility that pushes the price of the derivative up can compensate the induced indirect cost of volatility. Since her impact on the average price is significant, the producer can compensate the loss in expected profit from production due to an increase of volatility by an increased profit from the derivative position. When the producer action on the volatility is information-based, the previous observations still hold, except that now her value function is increasing in the volatility, providing strong incentive to raise the volatility even without derivative position. Thus, it results that if the producer can impact the price to increase her profit in her derivative position, she does it.
What happens if the producer manipulation can be challenged by a trader taking an opposite position in the derivative? We find that the actions of the trader on the volatility only reduces the potential profit made by the producer on the derivative. Further, despite the asymmetry of powers of the two players, we find that when the production rate is already at its optimal stationary level, there is an amount of derivative position that makes both players better off entering the game.
There is a considerable financial economics literature about market manipulation, which follows in particular a game theoretic approach. A short review of the portion of such a literature related to stock markets has to start with the seminal work of Kyle [ 16] and the work of Allen and Gale [ 2] who provide a simple information condition under which an uninformed trader can make a profitable unravelling strategy (buying the stock, making the price rise and sell the stocks at an average higher price). Chatterjea and Jarrow [ 4] provide a game theoretic model of US Treasury Securities manipulation. Cooper and Donaldson [ 6] design a dynamic game theoretic model of corner strategy. Regarding commodity price manipulation strategy, thorough analysis are available in the works of Pirrong [ 1820].
It is also worth mentioning that our modelling and contribution are different from those in the rich literature on market impact, for which we refer the reader to, e.g., the recent book by Guéant [ 13] and the references therein. Indeed, while in market impact models the drift of the market price is affected by the traders as a consequence of an optimal execution of a market order, in our setting both drift and volatility are affected and the impact comes directly from market manipulation. Moreover, the modelled financial phenomena are different and so are the problems solved (optimal execution vs profit maximization).
The closest work to ours is the paper by Nyström and Parviainen [ 17]. The authors provide a zero-sum game between two players who can control the drifts and the volatilities of a multidimensional stock market and show under mild conditions that the game has a value and that it is given by the unique viscosity solution of degenerate parabolic PDE. Considering a more specific model of actors and impact functions, we are able to provide more insights in the gains of the producer and the trader and the distortion of prices.
The paper is organised in the following way. Sects.  2, 3 and 4 present respectively the model of manipulation through production, manipulation through information and competition of manipulation. Sect.  5 provides numerical illustration of the three models.

## 2 Production-based manipulation

This section contains all our results on the first model of a producer of a commodity, who can manipulate the price of the commodity through production.
More in detail, we consider a producer whose objective is to maximize her profit from production and from investment in a financial market over the time period [0,  T] for some $$T>0$$. The producer can increase her production rate $$q_t$$ with the instantaneous control rate $$u_t$$ at the expense of a cost $$\frac{\kappa }{2} u^2$$ with $$\kappa >0$$. We suppose that the production is entirely sold at a market price $${{\tilde{S}}}$$, which can be affected by the producer: the more the production rate the less the market price. This effect leads to observed market price $${{\tilde{S}}}_t := s_0 - a \, q_t$$ where $$a > 0$$ is some fixed parameter and $$s_0 > 0$$ is the market price before action of the producer (which in this case is constant). We will relax the hypothesis of a constant market price before impact in the second model (see Sect.  3). The former relation can also be seen as an inverse demand function of the good, where a is its elasticity. Thus, the instantaneous profit rate is $$P_t := q_t \, {{\tilde{S}}}_t$$.
We suppose that the production rate $$q_t$$ is affected by a random factor that gathers all the randomness that usually affects production processes (outages, strikes and so on). We suppose that, without intervention of the producer, uncertainty is normally distributed with standard deviation $$\sigma > 0$$. Further, we suppose that the producer has an effect on the uncertainty of his production rate. These hypotheses lead to the following dynamics for the production rate:
\begin{aligned} dq_t = u_t \, dt + \sigma \, \sqrt{1+z_t} \, dW_t, \quad q_0 \in {\mathbb {R}}, \end{aligned}
(2.1)
where W is a standard Brownian motion, defined on some probability space $$(\Omega , {\mathcal {F}},{\mathbb {P}})$$, and $$z_t$$ is the effect (in percentage) on the variance of the production rate. The information available to the producer is modelled by the natural filtration, $${({\mathcal {F}}_t)}_{t \in [0,T]} = {({\mathcal {F}}_t^W)}_{t \in [0,T]}$$, generated by the Brownian motion W and completed with all $${\mathbb {P}}$$-null sets. Hence, anywhere in this section adaptedness will always be referred to this filtration.
We suppose that controlling $$z_t$$ requires some financial cost $$\frac{g}{2} z^2$$ with $$g >0$$. The producer can choose either to decrease or to increase the volatility of the production rate $$q_t$$. Although the costs incurred to increase the volatility are less easy to grasp than the cost involved to decrease it, they can be interpreted as the costs of the actions needed to hide them.
At this stage of the model description, we notice that an increase of the volatility of $$q_t$$ has a negative impact on the expected instantaneous profit $${\mathbb {E}}[P_t] = s_0 {\mathbb {E}}[q_t] - a{\mathbb {E}}[q_t ^2]$$. In other terms, the producer is Gamma negative. Thus, he has no incentive to increase the volatility of his production facilities.
Most large commodity producers make an important use of financial market for hedging purposes. Hence, we suppose that the producer intervenes in the financial market for his production good by selling derivatives at the initial time. Since the producer is Gamma negative, a natural hedge would be to sell a Gamma positive derivative such as a call option. Here, we suppose that the producer has a net derivative position $$\lambda$$ which can be positive (short, sale) or negative (long, purchase) with maturity T and payoff $$h_T := {{\tilde{S}}}_T^2$$. Such a quadratic payoff can be seen as a position over a portfolio of call options with the same maturity T and different strike prices. We denote by $$h_0$$ the price at time 0 of that option, its precise definition will be given when specifying the set of admissible controls. Indeed, $$h_0$$ is not given from the outset, as it depends on the underlying which is in turn controlled.
The aim of the producer is to maximize the following objective functional
\begin{aligned} J^\lambda (u,z,h_0) := {\mathbb {E}}\left[ \int _0^{T} \left( P_t - \frac{\kappa }{2}u^2 _t - \frac{g}{2}z^2 _t \right) dt + \lambda \big ( h_0 - h_T\big ) \right] . \end{aligned}
(2.2)
Now, within the producer firm, there are two distinct departments, a production department and an investment department. The former takes any production-related decisions, i.e. it controls u and z, while the latter is responsible for selling/buying the derivatives at a fair price and pursuing the corresponding hedging strategy. It is natural to assume that the investment department is using the nowadays classical no-arbitrage machinery to propose a derivative’s price. The two departments are aware of the fact that their decision affects each other. In particular, the fact that the investment department uses the no-arbitrage approach to price derivatives implies the following natural constraints for the production side: the chosen production plan should not lead to arbitrage opportunities. We are going to incorporate this idea into the definition of admissible policies. After that, we will give the precise formulation of the producer optimization problem.
Definition 2.1
We say that any pair ( uz) is admissible if the following properties are satisfied:
(i)
$$(u_t ,z_t )_{t \in [0,T]}$$ are progressively measurable processes with values in $${\mathbb {R}} \times (-1,\infty )$$ such that
\begin{aligned} {\mathbb {E}}\left[ \int _0 ^T (u_t ^2 + z^2 _t)dt\right]< \infty , \quad {\mathbb {E}}\left[ \int _0 ^T q_t ^2 (1 + z_t) dt\right] < \infty ; \end{aligned}

(ii)
there exists a unique equivalent martingale measure $${\mathbb {Q}}^{u,z}$$ for the price process $${{\tilde{S}}}$$, equivalently for the production process $$(q_t)_{t\in [0,T]}$$, with $$h_T \in L^1 ({\mathbb {P}}) \cap L^1 ({\mathbb {Q}}^{u,z})$$;

(iii)
there exists a real-valued progressively measurable process $$(\Delta ^{u,z}_t)_{t\in [0,T]}$$ satisfying
\begin{aligned} {\mathbb {E}}\left[ \int _0^T \left( |\Delta ^{u,z}_t u_t| + |\Delta ^{u,z}_t |^2 (1+z_t) \right) dt \right] < \infty , \end{aligned}
and such that the following holds $${\mathbb {Q}}^{u,z}$$-a.s.
\begin{aligned} h^{u,z}_t := {\mathbb {E}}^{{\mathbb {Q}}^{u,z}} [ h_T| {\mathcal {F}}_t] = {\mathbb {E}}^{{\mathbb {Q}}^{u,z}} [ h_T] + \int _0 ^t \Delta ^{u,z}_s d\tilde{S}_s, \end{aligned}
for all $$t \in [0,T]$$.

The set of all admissible pairs will be denoted by $${\mathcal {A}}$$.
Hence $$h^{u,z}_0 = {\mathbb {E}}^{{\mathbb {Q}}^{u,z}} [ h_T]$$ can be viewed as the price of the option $$h_T$$ under the production control ( uz). Notice that such a price is clearly affected by the controls via the risk neutral measure in (ii). It can be interpreted as “commitment price”: after selling the option, the producer could deviate from the implementation of the hedging strategy that leads to the measure $${\mathbb {Q}}^{u,z}$$. Here we make the assumption that the producer implements the production controls leading to precisely that measure and thus, that price.
Notice that from q’s dynamics we have that the measure $${\mathbb {Q}}^{u,z}$$, whose existence is postulated in (ii) above, is necessarily given by the following Radon–Nikodym derivative
\begin{aligned} \frac{d{\mathbb {Q}}^{u,z}}{d{\mathbb {P}}} = \exp \left\{ -\int _0 ^T \delta _t dW_t - \frac{1}{2}\int _0 ^T \delta _t ^2 dt \right\} , \quad \delta _t = \frac{u_t}{\sigma \sqrt{1+ z_t}}. \end{aligned}
Before giving the final formulation of the producer optimization problem, we can exploit the admissibility properties above to rewrite the objective functional ( 2.2) as follows
\begin{aligned} J^\lambda (u,z,h_0) = {\mathbb {E}}\left[ \int _0^{T} \left( P_t - \frac{\kappa }{2}u^2 _t - \frac{g}{2}z^2 _t - \lambda \Delta _t^{u,z} u_t \right) dt\right] = : {{\tilde{J}}}^\lambda (u,z) . \end{aligned}
Indeed, condition (iii) implies that
\begin{aligned} h_T - h_0 = \int _0 ^T \Delta _t^{u,z} dq_t = \int _0 ^T \Delta _t^{u,z} (u_t dt + \sigma \sqrt{1+z_t}dW_t). \end{aligned}
Condition (iii) implies that the dW-part above has zero expectation under $${\mathbb {P}}$$. Moreover, we observe that the integrability assumptions in (i) and (iii) ensure that $${\tilde{J}}^\lambda (u,z)$$ is finite.
Finally, after all these preliminaries, we can formulate the producer’s optimization problem
\begin{aligned} \sup _{(u,z) \in {\mathcal {A}}} {{\tilde{J}}}^\lambda (u,z). \end{aligned}
(2.3)

### 2.1 Heuristics

In this part we develop the heuristics needed to obtain a candidate for the solution of problem ( 2.2). In the next sub-section, we will verify that the candidate is indeed the optimal solution according to the definition above.
First, notice that, since the market is complete, there exists only one possible no-arbitrage price for the derivative $$h_T$$, which also gives the initial wealth needed to fund the hedging strategy. The derivative can be perfectly replicated by trading in a self-financing way in the underlying $${{\tilde{S}}}_t = s_0 -aq_t$$ or, equivalently, in $$q_t$$. Therefore
\begin{aligned} h_T = {\mathbb {E}}^{\mathbb {Q}}[h_T] + \int _0 ^T \Delta _t dq_t , \end{aligned}
where $${\mathbb {Q}}$$ is the unique equivalent martingale measure for $${{\tilde{S}}}$$ (or, equivalently, for q), and $$\Delta$$ is the delta hedging. More precisely, using Girsanov’s theorem we get the dynamics of q under $${\mathbb {Q}}$$, which is
\begin{aligned} dq_t = \sigma \sqrt{1+z_t} dW_t ^{\mathbb {Q}}, \quad dW^{\mathbb {Q}}_t = dW_t - \delta _t dt, \end{aligned}
where $$\delta _t = \frac{u_t}{\sigma \sqrt{1+z_t}}$$ for $$t\in [0,T]$$. Hence, defining the price at time t of the derivative as
\begin{aligned} h_t = {\mathbb {E}}^{\mathbb {Q}}[ (s_0 -aq_T)^2 | q_t] := \varphi (t,q_t), \end{aligned}
we obtain the usual PDE for the price
\begin{aligned} \varphi _t + \frac{1}{2} \sigma ^2 (1+z(t,q)) \varphi _{qq} = 0, \quad \varphi (T,q)= (s_0 -aq)^2. \end{aligned}
(2.4)
Finally, we have the usual relationship $$\Delta _t = \varphi _q (t,q_t)$$.
Remark 2.1
Notice from Eq. ( 2.4) that $$\varphi$$ depends on z in a functional way. However we expect the optimal control to be Markovian, which justifies replacing $$z_t$$ (which could in principle depend on the whole path of the state variable $$(q_t)$$) with the function z( tq) of time and of the value q of state variable at time t. The PDE above needs to be solved together with the HJB equation for the value function, since the coefficient of the second derivative $$\varphi _{qq}$$ depends on the control z.
The perfect replicability of the derivative allows us to rewrite the objective function in a more suitable way as at the end of the previous sub-section. Indeed, observe first that
\begin{aligned} h_0 - h_T = - \int _0 ^T \Delta _t dq_t = -\int _0 ^T \varphi _q (t,q_t) (u_t dt + \sigma \sqrt{1+z_t} dW_t), \end{aligned}
where recall that W is a Brownian motion under $${\mathbb {P}}$$. Hence, given the hedging strategy $$\varphi _q(t,q_t)$$, the maximization problem on the production side can be expressed as follows
\begin{aligned} v^\lambda (0,q_0) := \sup _{u,z} {\mathbb {E}} \left[ \int _0 ^T \left( q_t (s_0 -aq_t) - \frac{g}{2}z_t ^2 - \frac{\kappa }{2}u_t ^2 - \lambda \varphi _q (t,q_t) u_t \right) dt \right] . \end{aligned}
(2.5)
In other terms, the gain coming from selling the derivative has been absorbed by the running profit term. Therefore, we can get the HJB equation
\begin{aligned} -v_t = \sup _{u,z} \left\{ q(s_0 -a q) - \frac{g}{2}z ^2 - \frac{\kappa }{2}u ^2 - \lambda \varphi _q (t,q) u + u v_q + \frac{\sigma ^2}{2}(1+z) v_{qq}\right\} , \end{aligned}
(2.6)
with terminal condition $$v(T,q)=0$$. Notice that the PDE for the price ( 2.4) and the HJB equation for the value function are clearly coupled as the optimal z appears in the pricing PDE, while the derivative of the price, $$\varphi _q$$, appears in the HJB equation (compare to Remark  2.1). The first order conditions give the two (candidate) optimal controls
\begin{aligned} {{\widehat{u}}} = \frac{1}{\kappa }\left( v_q-\lambda \varphi _q\right) , \quad {\widehat{z}} =\frac{\sigma ^2}{2g} v_{qq}. \end{aligned}
(2.7)
Notice that we have dropped the dependence upon $$\lambda$$ in the value function for sake of readability. In order to get the full solution, it is natural to make the following
Ansatz 2.1
Both solutions $$\varphi$$ and $$v^\lambda$$ are quadratic in q, i.e.
\begin{aligned} \varphi (t,q)= A(t)q^2 + B(t)q+C(t), \quad v^{\lambda }(t,q)=D(t)q^2 + E(t)q +F(t), \end{aligned}
where ABCDEF are deterministic functions of time, to be determined.
Solving for $$\varphi$$ . To ease the notation, we drop the dependence of time from AB and so on. First of all, applying the Ansatz  2.1 to the candidate optimal controls gives
\begin{aligned} {\widehat{u}} = \frac{1}{\kappa }\left( 2q(D-\lambda A) +E -\lambda B \right) , \quad {\widehat{z}} = \frac{\sigma ^2}{g}D. \end{aligned}
Next, we substitute the expression above for $${\widehat{u}}$$ and $${\widehat{z}}$$ in the pricing PDE ( 2.4) and we obtain
\begin{aligned} \sigma ^2 \left( 1+\frac{\sigma ^2}{g}D\right) A + A' q^2 + B' q + C' =0, \quad A(T)q^2 + B(T) q +C(T) = s_0 ^2-2as_0 q +a^2q^2. \end{aligned}
In particular, the terminal condition for $$\varphi$$ gives the corresponding terminal conditions for ABC as
\begin{aligned} A(T) = a^2, \quad B(T)=-2as_0 , \quad C(T)=s_0 ^2. \end{aligned}
By identification of the terms in q, we get the following ODEs for AB and C
\begin{aligned} A'=0, \quad B'=0, \quad C'=-\sigma ^2a^2\left( 1+ \frac{\sigma ^2}{g} D\right) , \end{aligned}
which can be easily solved using the terminal conditions above. Indeed, we obtain
\begin{aligned} A(t)=a^2, \quad B(t) = -2as_0 , \quad C(t) = s_0 ^2 + \sigma ^2 a^2 \int _t ^T \left( 1+ \frac{\sigma ^2}{g}D(r)\right) dr, \end{aligned}
(2.8)
for all $$t \in [0,T]$$. Notice that the function D( t) will be obtained when solving the HJB Eq. ( 2.6).
Solving for $$v^\lambda$$ . Substituting the Ansatz  2.1 for $$v^\lambda$$ (together with the optimal controls) in the HJB Eq. ( 2.6) and identifying the terms in q, we obtain the following ODEs for DE and F
\begin{aligned} -D'= & {} -a +\frac{2}{\kappa }(D-\lambda A)^2, \quad D(T)=0,\\ -E'= & {} s_0 + \frac{2}{\kappa }(D-\lambda A) (E-\lambda B), \quad E(T)=0,\\ -F'= & {} \frac{\sigma ^4}{2g}D^2 + \sigma ^2 D + \frac{1}{2 \kappa }(E-\lambda B)^2 , \quad F(T)=0. \end{aligned}
Now, using ( 2.8) implies
\begin{aligned} -D'= & {} -a +\frac{2}{\kappa }(D-\lambda a^2)^2, \end{aligned}
(2.9)
\begin{aligned} -E'= & {} s_0 + \frac{2}{\kappa }(D-\lambda a^2) (E + 2\lambda as_0 ), \end{aligned}
(2.10)
\begin{aligned} -F'= & {} \frac{\sigma ^4}{2g}D^2 + \sigma ^2 D +\frac{1}{2 \kappa }(E + 2 a \lambda s_0 )^2 , \end{aligned}
(2.11)
with null terminal conditions $$D(T)=E(T)=F(T)=0$$.
Remark 2.2
We observe that, while the equation for D is a one-dimensional Riccati ODE, the second one is linear and the third one can be solved just by integration. The Riccati Eq. ( 2.9) can be easily proved to have a unique solution over the whole time interval [0,  T]. Indeed, this is a direct consequence of, e.g., Lemma 10.12 in [ 9]. Moreover, that lemma also implies that
\begin{aligned} D(t) \le 0 \quad \hbox {for all }t \in [0,T] \quad \Leftrightarrow \quad D'(T) = a - \frac{2\lambda ^2 a^4}{\kappa } \ge 0, \end{aligned}
which will be important later for the interpretation of our results. The value $$a - \frac{2\lambda ^2 a^4}{\kappa }$$ corresponds to the slope of D close to T.
Set $$\theta = \sqrt{8a / \kappa }$$. Solving the equations above gives the following expressions
\begin{aligned} D(t)= & {} - \frac{2(a-\frac{2\lambda ^2 a^4}{\kappa })(e^{\theta (T-t)} -1)}{\theta (e^{\theta (T-t)} +1) +\frac{4\lambda a^2}{\kappa } (e^{\theta (T-t)} -1)}, \end{aligned}
(2.12)
\begin{aligned} E(t)= & {} s_0 \int _t ^{T} e^{\int _t ^{u} \frac{2}{\kappa }(D(r)-\lambda a^2)dr} \left[ 1 +\frac{4 a \lambda }{ \kappa }(D(u)-\lambda a^2 ) \right] du, \end{aligned}
(2.13)
\begin{aligned} F(t)= & {} \int _t ^T \left( \frac{\sigma ^4}{2g}D(u) ^2 + \sigma ^2 D(u) + \frac{1}{2 \kappa } {{(E(u) + 2a \lambda s_0 )}^2} du \right) , \end{aligned}
(2.14)
for all $$t \in [0,T]$$.

### 2.2 Verification

We conclude the section with the verification that the candidate described above is indeed a solution to problem ( 2.3).
Theorem 2.1
Let DE be deterministic functions of time as in, respectively, ( 2.12) and ( 2.13). Whenever $$H:= 1- \frac{2\lambda a}{\kappa }(\lambda a^2 - \frac{g}{\sigma ^2})>0$$, we assume that the maturity T is small enough, more precisely
\begin{aligned} T < T_{max} := \frac{2}{\theta } \coth ^{-1} \left( \frac{2\sigma ^2 a}{\theta g} H\right) . \end{aligned}
(2.15)
Then there exists an optimal policy $$({\widehat{u}}, {\widehat{z}}) \in {\mathcal {A}}$$ for problem ( 2.3), where the production policies are
\begin{aligned} {\widehat{u}}_t = \frac{1}{\kappa }\left( 2 {\widehat{q}}_t (D(t)-\lambda a^2 ) + 2as_0 \lambda +E(t) \right) , \quad {\widehat{z}}_t = \frac{\sigma ^2}{g}D(t), \quad t\in [0,T]. \end{aligned}
(2.16)
The no-arbitrage price process for the derivative $$h_T$$ is given by
\begin{aligned} {\widehat{h}}_t := h_t^{{\widehat{u}}, {\widehat{z}}}= (s_0 -a{\widehat{q}}_t)^2 + \sigma ^2 a^2 \int _t ^T \left( 1+ \frac{\sigma ^2}{g}D(u)\right) du, \quad t \in [0,T], \end{aligned}
(2.17)
and the hedging process is
\begin{aligned} {\widehat{\Delta }} _t := \Delta _t^{{\widehat{u}}, {\widehat{z}}} = 2 a (a {\widehat{q}}_t - s_0), \quad t\in [0,T], \end{aligned}
(2.18)
where the production rate is
\begin{aligned} {\widehat{q}}_t= & {} e^{R(t)}\left\{ q_0 + \int _0^t e^{-R(s)} \frac{1}{\kappa }(-2\lambda a^2 + 2as_0 \lambda + E(s))ds \right. \nonumber \\&\left. + \int _0 ^t e^{-R(s)} \sigma \sqrt{1+ \frac{\sigma ^2}{g}D(s)} dW_s \right\} , \end{aligned}
(2.19)
with $$R(t) := \int _0 ^t \frac{2}{\kappa } D(s)ds$$.
Proof
The proof is structured in two steps.
1.
Admissibility. Let us verify that the pair $$({\widehat{u}},{\widehat{z}})$$ given in the statement above belongs to $${\mathcal {A}}$$. We start from condition (i) in Definition 2.1. First, $${\widehat{u}}, {\widehat{z}}$$ are trivially progressively measurable and real valued. We need to check $${\widehat{z}}_t > -1$$, which is equivalent to $$\frac{\sigma ^2}{g}D(t) > -1$$. We distinguish two cases: if $$a \le \frac{2\lambda ^2a^4}{\kappa }$$ we have $$D(t) \ge 0$$ (this is a consequence of Remark 2.2 or, alternatively, the explicit formula ( 2.12)), hence $${\widehat{z}}_t >-1$$ for all $$t \in [0,T]$$. On the other hand, if $$a > \frac{2\lambda ^2a^4}{\kappa }$$, it follows from expression ( 2.12) that D( t) is nondecreasing with $$D(T)=0$$. Therefore, it suffices to check that $$D(0)> -g/\sigma ^2$$, where
\begin{aligned} D(0)= -\frac{2 \left( a - \frac{2\lambda ^2a^4}{\kappa }\right) }{\theta \coth (\frac{\theta T}{2}) + \frac{4\lambda a^2}{\kappa }}. \end{aligned}
After some computation, we obtain that $$D(0)> -g/\sigma ^2$$ if and only if
\begin{aligned} \coth \left( \frac{\theta T}{2}\right) > \frac{2\sigma ^2 a}{\theta g} H, \end{aligned}
where H is the constant defined in the statement. Now, if $$H <0$$ the inequality above is always satisfied as the LHS above is nonnegative. If $$H>0$$, the inequality above is guaranteed by the condition $$T < T_{max}$$. We can conclude that even in this second case, provided $$T< T_{max}$$, we have $${\widehat{z}}_t >-1$$ for all $$t\in [0,T]$$. Regarding the integrability properties, we verify now that
\begin{aligned} {\mathbb {E}}\left[ \int _0 ^T ({\widehat{u}}_t ^2 + {\widehat{z}}^2 _t)dt\right]< \infty , \quad {\mathbb {E}}\left[ \int _0 ^T {\widehat{q}}_t ^2 (1 + {\widehat{z}}_t) dt\right] < \infty , \end{aligned}
(2.20)
where $${\widehat{q}} = q^{{\widehat{u}},{\widehat{z}}}$$. Since $${\widehat{u}}$$ is affine in $${\widehat{q}}$$ with continuous time-dependent coefficients and $${\widehat{z}}$$ is deterministic and continuous in t, checking the properties above boils down to show
\begin{aligned} {\mathbb {E}}\left[ \int _0 ^T {\widehat{q}}_t ^2 dt\right] < \infty . \end{aligned}
First, we use Fubini’s theorem to get $${\mathbb {E}} [\int _0 ^T {\widehat{q}}_t ^2 dt ] = \int _0 ^T {\mathbb {E}}[{\widehat{q}}_t ^2] dt$$. Moreover, since $${\widehat{q}}_t$$ is a Gaussian random variable for any fixed t (see Remark 2.3 below), the function $$t \mapsto {\mathbb {E}}[{\widehat{q}}_t ^2]$$ is continuous over [0,  T], so its integral is finite. Regarding condition (ii), we need to show that there exists a unique EMM $$\widehat{{\mathbb {Q}}} = {\mathbb {Q}}^{{\widehat{u}},{\widehat{z}}}$$ for the production process $${\widehat{q}}$$. Let us recall that
\begin{aligned} \frac{d{\widehat{{\mathbb {Q}}}}}{d{\mathbb {P}}} = \exp \left\{ -\int _0 ^T {\widehat{\delta }}_t dW_t - \frac{1}{2}\int _0 ^T {\widehat{\delta }}_t ^2 dt \right\} , \quad {\widehat{\delta }}_t = \frac{{\widehat{u}}_t}{\sigma \sqrt{1+ {\widehat{z}}_t}}. \end{aligned}
We use [ 23, Theorem 2.1] to prove that under our assumptions the probability $${\widehat{{\mathbb {Q}}}}$$ is well-defined (see also [ 22] for more general results of the same type). According to that results, we need to check Assumption 2.2 in [ 23], which in our case is satisfied as long as $$\sigma ^2 (1+{\widehat{z}}_t) > 0$$ for all $$t\in [0,T]$$. By the same arguments used for condition (i), we get the result. A standard application of Girsanov theorem, together with the integrability properties in ( 2.20), yields immediately that $${\widehat{q}}$$ is a martingale under $${\widehat{{\mathbb {Q}}}}$$. To end checking condition (ii), we have to show $$h_T \in L^1({\mathbb {P}}) \cap L^1({\widehat{{\mathbb {Q}}}})$$. Now, $$h_T = (s_0 -a{\widehat{q}}_T)^2$$, hence quadratic in $${\widehat{q}}_T$$. Since under both probability measures $${\widehat{q}}_T$$ is a Gaussian random variable, we have $$q_T ^2 \in L^1({\mathbb {P}}) \cap L^1({\widehat{{\mathbb {Q}}}})$$, which gives the desired property. We pass to condition (iii) in Definition 2.1. First, $${\widehat{\Delta }}$$ is trivially a progressively measurable process with real values. For the integrability property, since both $${\widehat{u}}$$ and $${\widehat{\Delta }}$$ are linear in $${\widehat{q}}_t$$, we are again reduced to the square integrability $${\mathbb {E}}[\int _0 ^T {\widehat{q}}_t ^2 dt]<\infty$$, which has been proved just before. To conclude this part of the proof, it remains to check that, given $$({\widehat{u}}, {\widehat{z}})$$ as above, $${\widehat{h}}_t := h^{{\widehat{u}},{\widehat{z}}}_t = {\mathbb {E}}^{{\widehat{{\mathbb {Q}}}}} [ h_T| {\mathcal {F}}_t] = {\mathbb {E}}^{{\widehat{{\mathbb {Q}}}}} [ h_T] + \int _0 ^t {\widehat{\Delta }}_s dq_s$$ a.s. under $${\widehat{{\mathbb {Q}}}}$$, for all $$t \in [0,T]$$. This can be done by direct computation as follows: applying Itô’s formula to $${\widehat{h}}_t$$ in ( 2.17) we get
\begin{aligned} d{\widehat{h}}_t = -2a(s_0 -a{\widehat{q}}_t)d{\widehat{q}}_t \end{aligned}
whence, in integral form,
\begin{aligned} {\widehat{h}}_t = {\widehat{h}}_0 -2a \int _0 ^t (s_0 -a{\widehat{q}}_s)d{\widehat{q}}_s = {\widehat{h}}_0 + \int _0 ^t {\widehat{\Delta }}_s d{\widehat{q}}_s. \end{aligned}
Moreover, one easily find
\begin{aligned} \widehat{{\mathbb {E}}} [h_T] = \widehat{{\mathbb {E}}} [(s_0 -a{\widehat{q}}_T)^2] = s_0^2 - 2as_0 q_0 + a^2 \widehat{{\mathbb {E}}}[{\widehat{q}}_T ^2], \end{aligned}
where $$\widehat{{\mathbb {E}}}$$ denotes the expectation with respect to the measure $${\widehat{{\mathbb {Q}}}}$$. Using Itô’s isometry, we also have
\begin{aligned} \widehat{{\mathbb {E}}}[{\widehat{q}}_T ^2] = q_0 ^2 + \int _0 ^T \sigma ^2 \left( 1+\frac{\sigma ^2}{g}D(t)\right) dt, \end{aligned}
which leads to the remaining property in (iii).

2.
Optimality. To check the optimality condition, we are going to use the martingale optimality principle (see [ 8]). Let us define the process
\begin{aligned} Y^{u,z}_t := \int _0 ^t \left( q_r (s_0 -aq_r) - \frac{g}{2}z_r ^2 - \frac{\kappa }{2}u_r ^2 - \lambda {\widehat{\Delta }}_r u_r \right) dr + V(t,q_t), \end{aligned}
(2.21)
with
\begin{aligned} V(t,q) = D(t)q^2 + E(t)q + F(t). \end{aligned}
Thanks to the martingale optimality principle, proving that $$Y^{u,z}$$ is a supermartingale for all $$(u,z) \in {\mathcal {A}}$$ and a martingale for $$(u,z)=({\widehat{u}},{\widehat{z}})$$, will give us the result. Itô’s formula yields
\begin{aligned} dY^{u,z}_t&= \left[ q_t (s_0 -aq_t) - \frac{g}{2}z_t ^2 - \frac{\kappa }{2}u_t ^2 - \lambda {\widehat{\Delta }}_t u_t + V_t + V_q u_t + \frac{1}{2}V_{qq}\sigma ^2 (1+ z_t) \right] dt \nonumber \\&\quad + V_q \sigma \sqrt{1+z_t} dW_t, \end{aligned}
(2.22)
where $$V_t,V_q,V_{qq}$$ denote partial derivative of the function V( tq). We have omitted the dependence on ( tq) for sake of simplicity. First, notice that due to the integrability properties in Definition 2.1(i) the process $$\int _0 ^t V_q \sigma \sqrt{1+z_r} dW_r$$ is a true $${\mathbb {P}}$$-martingale. Therefore, it remains to show that the dt-part in ( 2.22) above is a nonincreasing process, that is it is lower or equal to zero almost everywhere. By construction (see heuristics), we know that the function V( tq) satisfies the HJB Eq. ( 2.6), with
\begin{aligned} \varphi _q (t,q) = 2a^2 q -2a s_0 , \end{aligned}
hence $$\varphi _q (t,q_t) = {\widehat{\Delta }}_t$$ for all t. We can conclude that the drift in ( 2.22) is lower or equal to zero a.e., yielding that $$Y^{u,z}$$ is a supermartingale for all $$(u,z) \in {\mathcal {A}}$$. To show that $$Y^{{\widehat{u}},{\widehat{z}}}$$ is a martingale, one proceeds in the same way getting equalities instead of inequalities. In particular one gets that the drift in ( 2.22) is equal to zero a.e., implying the martingale property. Finally, we can solve for $${\widehat{q}}_t$$ in explicit form, via standard resolution methods, since it is the unique solution of the following linear SDE
\begin{aligned} d{\widehat{q}}_t = \frac{1}{\kappa }\left( 2{\widehat{q}}_t (D(t)-\lambda a^2 ) + 2as_0 \lambda +E(t) \right) dt + \sigma \sqrt{1+\frac{\sigma ^2}{g} D(t)} dW_t , \quad {\widehat{q}}_0 = q_0. \end{aligned}
$$\square$$

Remark 2.3
Notice, from Eq. ( 2.19), that $${\widehat{q}}_t$$ is a Gaussian random variable with time-dependent mean and variance.

## 3 Production and information based manipulation

In this section we describe and solve explicitly a variant of the model presented before. We still have a producer of a commodity, who is maximizing her profit coming from both production and a short/long position in some derivative. The main differences are that the market price of the commodity is no longer a constant as it is driven by the Brownian motion W, that in turn does not affect the production rate anymore and, finally, the producer can directly control the volatility of the market price (by spreading false information on the state of his production, for instance). Thus, the dynamics of the state variables is now given by
\begin{aligned} \left\{ \begin{array}{rcl} dS_t &{} = &{} \mu \, dt + \sigma \, \sqrt{1+z_t} \, dW_t, \\ dq_t &{} = &{} u_t \, dt. \end{array} \right. \end{aligned}
(3.1)
Moreover in this model the market price $${{\tilde{S}}}$$ is given by $${{\tilde{S}}}_t = S_t -aq_t$$, $$t \in [0,T]$$. The objective of the producer is the same as before, i.e.
\begin{aligned} J^\lambda (u,z,h_0) := {\mathbb {E}}\left[ \int _0^{T} \left( P_t - \frac{\kappa }{2}u^2 _t - \frac{g}{2}z^2 _t \right) dt + \lambda \big ( h_0 - h_T\big ) \right] . \end{aligned}
(3.2)
Analogously to the previous model, we will be working with the following definition of admissible policies, which admits the same interpretation as before.
Definition 3.1
We say that any pair ( uz) is admissible if the following properties are satisfied:
(i)
$$(u_t ,z_t )_{t \in [0,T]}$$ are progressively measurable processes with values in $${\mathbb {R}} \times (-1,\infty )$$ such that
\begin{aligned} {\mathbb {E}}\left[ \int _0 ^T (u_t ^2 + z^2 _t)dt\right]< \infty , \quad {\mathbb {E}}\left[ \int _0 ^T {{\tilde{S}}}_t ^2 (1 + z_t) dt\right] < \infty ; \end{aligned}

(ii)
there exists a unique equivalent martingale measure $${\mathbb {Q}}^{u,z}$$ for the price process $${{\tilde{S}}}$$, with $$h_T \in L^1 ({\mathbb {P}}) \cap L^1 ({\mathbb {Q}}^{u,z})$$;

(iii)
there exists a real-valued progressively measurable process $$(\Delta _t^{u,z})_{t\in [0,T]}$$ satisfying
\begin{aligned} {\mathbb {E}}\left[ \int _0^T \left( |\Delta _t^{u,z} u_t| + |\Delta _t^{u,z} |^2 (1+z_t) \right) dt \right] < \infty , \end{aligned}
and such that the following holds $${\mathbb {Q}}^{u,z}$$-a.s.
\begin{aligned} h_t^{u,z} := {\mathbb {E}}^{{\mathbb {Q}}^{u,z}} [ h_T| {\mathcal {F}}_t] = {\mathbb {E}}^{{\mathbb {Q}}^{u,z}} [ h_T] + \int _0 ^t \Delta _s^{u,z} d {\widetilde{S}}_s , \end{aligned}
for all $$t \in [0,T]$$.

The set of all admissible pairs will be denoted by $${\mathcal {A}}$$.
Analogously as in the previous model, from $${{\tilde{S}}}$$’s dynamics we have that the measure $${\mathbb {Q}}^{u,z}$$, whose existence is postulated in (ii) above, is necessarily given by the following Radon–Nikodym derivative
\begin{aligned} \frac{d{\mathbb {Q}}^{u,z}}{d{\mathbb {P}}} = \exp \left\{ -\int _0 ^T \gamma _t dW_t - \frac{1}{2}\int _0 ^T \gamma _t ^2 dt \right\} , \quad \gamma _t = \frac{\mu - au_t}{\sigma \sqrt{1+ z_t}}. \end{aligned}
Before giving the final formulation of the producer’s optimization problem in this model as well, we can exploit the admissibility properties above to rewrite the objective functional ( 3.2) as follows
\begin{aligned} J^\lambda (u,z,h_0) = {\mathbb {E}}\left[ \int _0^{T} \left( P_t - \frac{\kappa }{2}u^2 _t - \frac{g}{2}z^2 _t - \lambda \Delta _t^{u,z} (\mu -au_t) \right) dt\right] = : {{\tilde{J}}}^\lambda (u,z) . \end{aligned}
Finally, the producer’s optimization problem, that we are going to solve in the next sub-section, is given by
\begin{aligned} \sup _{(u,z) \in {\mathcal {A}}} {{\tilde{J}}}^\lambda (u,z). \end{aligned}
(3.3)

### 3.1 Heuristics

In this sub-section we describe the heuristics that led us to propose some candidate solution. It follows the same lines as in the first model. Being the market complete, there exists a unique martingale measure $${\mathbb {Q}}$$ under which $${\widetilde{S}}$$ is a martingale. Indeed we have
\begin{aligned} d {\widetilde{S}}_t= & {} (\mu - a u_t) dt + \sigma \sqrt{1+z_t} d W_t \\= & {} (\mu - a u_t - \gamma _t \sigma \sqrt{1+z_t}) dt + \sigma \sqrt{1+z_t} d W_t^{{\mathbb {Q}}}, \end{aligned}
where $$W^{{\mathbb {Q}}}$$ is a $${\mathbb {Q}}$$-Brownian motion and by choosing $$\gamma _t = \frac{\mu - a u_t}{\sigma \sqrt{1+z_t}}$$, $${\widetilde{S}}$$ is a $${\mathbb {Q}}$$-martingale. The price at time $$t=0$$ of the European claim $$h_T$$ equals $${\mathbb {E}}^{{\mathbb {Q}}} [ {\widetilde{S}}_T^2 ]$$, so that we expect that $$h_T = h_0 + \int _0^T \Delta _t d {\widetilde{S}}_t$$, where $$\Delta$$ is the corresponding delta hedging strategy. Hence, the difference $$h_0 - h_T$$ also reads
\begin{aligned} h_0 - h_T = - \int _0^T \Delta _t d {\widetilde{S}}_t. \end{aligned}
(3.4)
The last quantity to be introduced is the financial claim’s price, that we denote by $$\varphi$$:
\begin{aligned} \varphi (t,q,s) := {\mathbb {E}}^{{\mathbb {Q}}} \left[ {\widetilde{S}}_T^2 \vert q_t =q , S_t = s \right] . \end{aligned}
Remark 3.1
Notice that the filtration generated by q and $${\widetilde{S}}$$ is the same as the one associated with q and S, namely $${\mathcal {F}}_t^{{\widetilde{S}}} \vee {\mathcal {F}}_t^q = {\mathcal {F}}_t^{S} \vee {\mathcal {F}}_t^q$$ for every $$t \in [0,T]$$. We conveniently choose to consider q and S as state variables.
We clearly expect $$\varphi$$ to solve the following PDE
\begin{aligned} \left\{ \begin{array}{rcl} \varphi _t + \frac{\sigma ^2}{2} (1+z(t,q,s)) \varphi _{ss} &{} = &{} 0 \\ \varphi ( T,q,s) &{} = &{} {(s-a q)}^2. \end{array} \right. \end{aligned}
(3.5)
Since we formally have $$\Delta := \frac{\partial \varphi }{ \partial {\widetilde{s}}} = \frac{\partial \varphi }{ \partial s} \frac{\partial s}{ \partial {\widetilde{s}}} = \frac{\partial \varphi }{ \partial s}$$, using the dynamics $${\widetilde{S}}$$ under $${\mathbb {P}}$$ together with ( 3.4) we find that the value function $$v^{\lambda }$$ satisfies
\begin{aligned} v^{\lambda }(0,q_0,s_0) = \sup _{u,z} \ {\mathbb {E}}_{q_0,s_0} \left[ \int _0^T \left( q_t (S_t - a q_t) - \frac{g}{2} z_t^2 - \frac{\kappa }{2} u_t^2 - \lambda \varphi _s (t,q_t, S_t) (\mu - a u_t) \right) dt \right] . \end{aligned}
The value function $$v^{\lambda }$$ is solution to the following HJB equation, which depends on $$\varphi$$ (satisfying ( 3.5))
\begin{aligned} \left\{ \begin{array}{l} \displaystyle - v_t = \sup _{u,z} \big \{ q(s- a q) - \frac{g}{2} z^2 - \frac{\kappa }{2} u^2 - \lambda \varphi _s (\mu - a u ) + v_q u + v_s \mu \\ ~~~~~~~~~~~+ \frac{ \sigma ^2}{2} (1+z) v_{ss} \big \} \\ v( T,q,s) = 0 \end{array} \right. \end{aligned}
(3.6)
Ansatz 3.2
We guess the value function v and the price $$\varphi$$ have the following form
\begin{aligned} v(t,q,s)= & {} A(t) q^2 + B(t) s^2 + C(t) qs + D(t) q + E(t) s + F(t), \\ \varphi (t,q,s)= & {} {\bar{A}}(t) q^2 + {\bar{B}}(t) s^2 + {\bar{C}}(t) qs + {\bar{D}}(t) q + {\bar{E}}(t) s + {\bar{F}}(t). \end{aligned}
The first order conditions on the ( 3.6) lead us to the candidate optimal controls
\begin{aligned} {\widehat{z}} = \frac{\sigma ^2}{2 g} v_{ss} \, , \qquad {\widehat{u}} = \frac{1}{\kappa } \left( a\lambda \varphi _s + v_q \right) , \end{aligned}
and thus, using the ansatz above,
\begin{aligned} {\widehat{z}}_t = \frac{\sigma ^2}{g} B(t) \qquad {\widehat{u}}_t = \frac{1}{k} \left\{ \lambda a \left[ 2 {\bar{B}}(t) s + {\bar{C}}(t) q + {\bar{E}}(t) \right] + 2 A(t) q + C(t) s + D(t) \right\} ,\nonumber \\ \end{aligned}
(3.7)
for all $$t \in [0,T]$$.
Solving for $$\varphi$$ . We can now explicitly find $$\varphi$$ by exploiting the Ansatz 3.2 and replacing the control pair $$({\widehat{u}}, {\widehat{z}})$$ into ( 3.5). We find:
\begin{aligned} \left\{ \begin{array}{rcl} {\bar{A}}(t) &{} \equiv &{} a^2\\ {\bar{B}}(t) &{} \equiv &{} 1 \\ {\bar{C}}(t) &{} \equiv &{} - 2 a \\ {\bar{D}}(t) &{} \equiv &{} 0 \\ {\bar{E}}(t) &{} \equiv &{} 0 \\ {\bar{F}}(t) &{} = &{} \displaystyle \int _t^T \sigma ^2 \left[ 1+\frac{\sigma ^2}{g} B(u) \right] du \\ \end{array} \right. \end{aligned}
Solving for $$v^{\lambda }$$. We proceed in the same way as above to find the value function $$v^{\lambda }$$. We find the following systems of ODEs
\begin{aligned} A'(t)&= a - \frac{2}{\kappa } {(A(t) - \lambda a^2)}^2 \end{aligned}
(3.8)
\begin{aligned} B'(t)&= - \frac{1}{2\kappa } {(2 \lambda a + C(t))}^2 \end{aligned}
(3.9)
\begin{aligned} C'(t)&= -1-\frac{2}{\kappa } (2 \lambda a + C(t)) (A(t) - \lambda a^2) \end{aligned}
(3.10)
\begin{aligned} D'(t)&= - \frac{2}{k} D(t) \big ( A(t) - a^2 \lambda \big ) - \mu \big ( C(t) + 2 a \lambda \big ) \end{aligned}
(3.11)
\begin{aligned} E'(t)&= - \frac{1}{k} D(t) \big (C(t) + 2 a \lambda \big ) - 2 \mu ( B(t) - \lambda ) \end{aligned}
(3.12)
\begin{aligned} F'(t)&= -\frac{1}{2} \frac{\sigma ^4 B(t)^2}{g} - \frac{D(t)^2}{2\kappa } - \mu E(t) - \sigma ^2 B(t) \end{aligned}
(3.13)
with null terminal conditions $$A(T)=\cdots = F(T)=0$$.
Remark 3.3
Notice, in particular, that B is a positive decreasing function of time, which implies (recall Eq. ( 3.7)) that the control $${\widehat{z}}$$ is always positive. This means that there is always interest in increasing the market price volatility, even if the producer buys the derivative.

### 3.2 Verification

Theorem 3.1
Let ABCD be solutions to the system ( 3.8)–( 3.13). There exists an optimal policy $$({\widehat{u}}, {\widehat{z}}) \in {\mathcal {A}}$$ for problem ( 3.3), where
\begin{aligned} {\widehat{u}}_t = \frac{1}{\kappa }\left[ (2\lambda a + C(t)) {\widehat{S}}_t + 2(A(t) -\lambda a^2) {\widehat{q}}_t + D(t) \right] , \quad {\widehat{z}}_t = \frac{\sigma ^2}{g}B(t), \quad t\in [0,T]. \end{aligned}
(3.14)
The no-arbitrage price process for the derivative $$h_T$$ is given by
\begin{aligned} {\widehat{h}}_t := h_t^{{\widehat{u}}, {\widehat{z}}}= ({\widehat{S}}_t -a{\widehat{q}}_t)^2 + \sigma ^2 \int _t ^T \left( 1+ \frac{\sigma ^2}{g}B(u)\right) du, \quad t \in [0,T], \end{aligned}
(3.15)
and the hedging process is
\begin{aligned} {\widehat{\Delta }} _t := \Delta _t^{{\widehat{u}}, {\widehat{z}}} = 2 ({\widehat{S}}_t -a{\widehat{q}}_t), \quad t\in [0,T]. \end{aligned}
(3.16)
Proof
The proof is very similar to the previous model (cf. Theorem 2.1), hence we give details only for those steps which are slightly different.
1.
Admissibility. First, we observe that checking the admissibility property (i), namely $${\widehat{z}}_t > -1$$ for all $$t\in [0,T]$$, is actually easier as no conditions on small T are needed. Indeed, $${\widehat{z}}_t > -1$$ if and only if $$B(t) > - g/\sigma ^2$$, where B solves the corresponding equation in the system ( 3.8)–( 3.13). The latter inequality is satisfied since, being $$B'(t) \le 0$$ and $$B(T) =0$$, we have $$B(t) \ge 0$$ for all $$t\in [0,T]$$. Regarding the integrability properties in (i), checking them is equivalent to show that
\begin{aligned} {\mathbb {E}}\left[ \int _0 ^T {\widehat{q}}_t ^2 dt\right]< \infty , \quad {\mathbb {E}}\left[ \int _0 ^T {\widehat{S}}_t ^2 dt\right] < \infty . \end{aligned}
As for the condition on $${\widehat{S}}$$, observe that under $${\mathbb {P}}$$ the process $${\widehat{S}}$$ satisfies $$d {\widehat{S}}_t = \mu dt + \sigma \sqrt{1 + \frac{\sigma ^2}{g} B(t)} dW_t,$$ namely it is a Gaussian process. So, as previously noticed in the proof of Theorem 2.1, we apply first of all Fubini’s theorem and then we remark that the function $$t \mapsto {\mathbb {E}}[{\widehat{S}}_t ^2]$$ is continuous over [0,  T] and its integral is finite. We now work on $${\mathbb {E}}[{\widehat{q}}_t ^2]$$, which is more delicate, since $$d {\widehat{q}}_t = {\widehat{u}}_t dt$$ and $${\widehat{u}}$$ depends also on $${\widehat{S}}$$ [see ( 3.14)]. We have
\begin{aligned} {\widehat{q}}_t = q_0 +\frac{1}{\kappa } \int _0^t (2\lambda a + C(s)) {\widehat{S}}_s ds + \frac{2}{\kappa } \int _0^t (A(s) -\lambda a^2) {\widehat{q}}_s ds + \frac{1}{\kappa } \int _0^t D(s) ds , \end{aligned}
and so
\begin{aligned} {\mathbb {E}}[{\widehat{q}}_t ^2]\le & {} 3 \left( q_0 + \frac{1}{\kappa } \int _0^t D(s) ds \right) ^2 + \frac{3}{\kappa ^2} {\mathbb {E}} \left( \int _0^t (2\lambda a + C(s)) {\widehat{S}}_s ds \right) ^2 \\&+ \frac{12}{\kappa ^2} {\mathbb {E}} \left( \int _0^t (A(s) -\lambda a^2) {\widehat{q}}_s ds \right) ^2 \\\le & {} \underbrace{3 \left( q_0 + \frac{1}{\kappa } \int _0^t D(s) ds \right) ^2 + \frac{3 t }{\kappa ^2} \int _0^t [2\lambda a + C(s)]^2 {\mathbb {E}}[ {\widehat{S}}_s^2] ds }_{:= \alpha (t)} \\&+ \frac{12 t}{\kappa ^2} \int _0^t [A(s) -\lambda a^2]^2 {\mathbb {E}}[{\widehat{q}}_s^2] ds . \end{aligned}
Now, $${\mathbb {E}}[ {\widehat{S}}_s^2]$$ is positive and finite and so we can safely introduce the positive continuous function $$\alpha$$ as above and write
\begin{aligned} {\mathbb {E}}[{\widehat{q}}_t ^2]\le & {} \alpha (t) + \frac{12 t}{\kappa ^2} \int _0^t [A(s) -\lambda a^2]^2 {\mathbb {E}}[{\widehat{q}}_s^2] ds . \end{aligned}
An application of Gronwall’s lemma leads to
\begin{aligned} {\mathbb {E}}[{\widehat{q}}_t ^2] \le \alpha (t) +\int _0^t \alpha (s) K(s) e^{\int _s^t K(u) du} ds, \end{aligned}
with $$K(u) := \frac{12 t}{\kappa ^2} [A(u) -\lambda a^2]^2 >0$$. So, $$t \rightarrow {\mathbb {E}}[{\widehat{q}}_t ^2]$$ is bounded by a continuous function and its integral over [0,  T] is finite. Regarding condition (ii), we proceed again as in the proof of Theorem 2.1 by checking Assumption 2.2 in [ 23], which in our case is satisfied as long as $$\sigma ^2 (1+{\widehat{z}}_t) > 0$$ for all $$t\in [0,T]$$. This is automatically true, since here $$B(t) \ge 0$$ for all $$t \in [0,T]$$ and so $$\sigma ^2 (1 + {\widehat{z}}_t) =\sigma ^2 ( 1 + \frac{\sigma ^2}{g} B(t)) \ge \sigma ^2 >0$$. To end checking condition (ii), we have to show $${\widehat{h}}_T \in L^1({\mathbb {P}}) \cap L^1({\widehat{{\mathbb {Q}}}})$$. Now, $${\widehat{h}}_T = ({\widehat{S}}_T -a{\widehat{q}}_T)^2$$, hence quadratic in both $${\widehat{S}}_t$$ and in $${\widehat{q}}_T$$. For what we have seen up to now $$q_T ^2$$ and $${\widehat{S}}_T$$ belong to $$L^1({\mathbb {P}})$$ and they also belong to $$L^1({\widehat{{\mathbb {Q}}}})$$, which gives the desired property. It remains to check (iii). We proceed again as in the previous theorem, except that now $${\widehat{h}}_t$$ in ( 3.15) is a function of both $${\widehat{S}}$$ and $${\widehat{q}}$$. First, notice that
\begin{aligned} {\widehat{h}}_t = {\widehat{h}}_0 + 2 \int _0 ^t ({\widehat{S}}_s - a {\widehat{q}}_s) (d {\widehat{S}}_s - a d {\widehat{q}}_s)= {\widehat{h}}_0 + \int _0 ^t {\widehat{\Delta }}_s d{\widetilde{S}}_s. \end{aligned}
Now, since $${\widehat{h}}_T = (s_0 - a q_0)^2 + 2 \int _0^T ({\widehat{S}}_s - a {\widehat{q}}_s) \sigma \sqrt{1 + {\widehat{z}}_s} d {\widehat{W}}_s$$ we have
\begin{aligned} \widehat{{\mathbb {E}}} [h_T \vert {\mathcal {F}}_t]= & {} (s_0 - a q_0)^2 + 2 \int _0^t ({\widehat{S}}_s - a {\widehat{q}}_s) \sigma \sqrt{1 + {\widehat{z}}_s} d {\widehat{W}}_s\\&+ 2 \ \widehat{{\mathbb {E}}} \left[ \int _t^T ({\widehat{S}}_s - a {\widehat{q}}_s) \sigma \sqrt{1 + {\widehat{z}}_s} d {\widehat{W}}_s \right] = {\widehat{h}}_t. \end{aligned}
Finally, progressive measurability and integrability of $${\widehat{\Delta }}$$ in condition (iii) in Definition 3.1 can be treated exactly as in the proof of Theorem 2.1, using (i).

2.
Optimality This can be proved by proceeding exactly as in the proof of Theorem  2.1, by taking into account that now the state variable is two-dimensional. $$\square$$

In this section, we finally consider a game between a producer who can manipulate the commodity price through the drift and a trader who can manipulate the volatility of the price. Hence, the trader is no longer price-taker or passive as in the previous two models. Here, she can affect the price of commodity by paying some (quadratic) cost.
The dynamics for S and q are as in ( 3.1). The controls are still given by ( uz) with the big difference that now only u is controlled by the producer, while z is controlled by the trader. Clearly, the corresponding costs are allocated accordingly. When the strategy profile for both players is ( uz) and the derivative price is $$h_0$$, the producer payoff is
\begin{aligned} J_{\mathrm{pr}}(u , z, h_0) = {\mathbb {E}}\left[ \int _0 ^T \left( q_t (S_t -aq_t) - \frac{\kappa }{2}u_t ^2 \right) dt + \lambda (h_0 -h_T) \right] . \end{aligned}
(4.1)
On the other hand, the trader who has the opposite position in the derivatives will get the payoff
\begin{aligned} J_{\mathrm{tr}} (u,z,h_0) = {\mathbb {E}}\left[ - \frac{g}{2} \int _0 ^T z_t ^2 dt - \lambda (h_0 -h_T) \right] . \end{aligned}
(4.2)
Now, we can give the definition of admissible policies, including the strategy profiles of both players together with the pricing and hedging strategy of the investment department in the production firm. Notice that it is the same as for the model in Sect. 3. We recall that $${{\tilde{S}}}_t = S_t -aq_t$$, for $$t \in [0,T]$$.
Definition 4.1
We say that any pair ( uz) is admissible if the following properties are satisfied:
(i)
$$(u_t ,z_t )_{t \in [0,T]}$$ are progressively measurable processes with values in $${\mathbb {R}} \times (-1,\infty )$$ such that
\begin{aligned} {\mathbb {E}}\left[ \int _0 ^T (u_t ^2 + z^2 _t)dt\right]< \infty , \quad {\mathbb {E}}\left[ \int _0 ^T {{\tilde{S}}}_t ^2 (1 + z_t)dt\right] < \infty ; \end{aligned}

(ii)
there exists a unique equivalent martingale measure $${\mathbb {Q}}^{u,z}$$ for the price process $${{\tilde{S}}}$$, with $$h_T \in L^1 ({\mathbb {P}}) \cap L^1 ({\mathbb {Q}}^{u,z})$$;

(iii)
there exists a real-valued progressively measurable process $$(\Delta _t^{u,z})_{t\in [0,T]}$$ satisfying
\begin{aligned} {\mathbb {E}}\left[ \int _0^T \left( |\Delta _t^{u,z} u_t| + |\Delta _t^{u,z} |^2 (1+z_t) \right) dt \right] < \infty , \end{aligned}
and such that the following holds $${\mathbb {Q}}^{u,z}$$-a.s.
\begin{aligned} h_t^{u,z} := {\mathbb {E}}^{{\mathbb {Q}}^{u,z}} [ h_T| {\mathcal {F}}_t] = {\mathbb {E}}^{{\mathbb {Q}}^{u,z}} [ h_T] + \int _0 ^t \Delta _s^{u,z} dq_s , \end{aligned}
for all $$t \in [0,T]$$.

The set of all admissible pairs will be denoted by $${\mathcal {A}}$$.
On the other hand the definition of solution is slightly different, due to the fact that the trader can also play strategically in this model. Before proceeding, we exploit the definition of admissibility, conditions (ii) and (iii) in particular, to rewrite as in the previous two models the payoffs of both the trader and the producer as follows
\begin{aligned} J_{\mathrm{pr}}(u , z, h_0)= & {} {\mathbb {E}}\left[ \int _0 ^T \left( q_t (S_t -aq_t) - \frac{\kappa }{2}u_t ^2 -\lambda (\mu -au_t) \Delta _t^{u,z} \right) dt \right] =: {{\tilde{J}}}_{\mathrm{pr}}(u , z),\\ J_{\mathrm{tr}} (u,z,h_0)= & {} {\mathbb {E}}\left[ \int _0 ^T \left( - \frac{g}{2} z_t ^2 +\lambda (\mu -au_t) \Delta _t^{u,z} \right) dt \right] =: {{\tilde{J}}}_{\mathrm{tr}} (u,z), \end{aligned}
for any admissible pair $$(u,z) \in {\mathcal {A}}$$.
Definition 4.2
We say that the pair $$({\widehat{u}}, {\widehat{z}}) \in {\mathcal {A}}$$ is a Nash equilibrium if it is a Nash equilibrium between the producer and the trader, i.e.,
\begin{aligned} {{\tilde{J}}}_{\mathrm{pr}}({\widehat{u}} , {\widehat{z}}) \ge {{\tilde{J}}}_{\mathrm{pr}}(u , {\widehat{z}}), \quad {{\tilde{J}}}_{\mathrm{tr}}({\widehat{u}} , {\widehat{z}}) \ge {{\tilde{J}}}_{\mathrm{tr}}({\widehat{u}} , z), \end{aligned}
(4.3)
for all deviations uz such that $$(u,{\widehat{z}})$$ and $$({\widehat{u}},z)$$ belong to $${\mathcal {A}}$$.

### 4.1 Heuristics

We want to compute explicitly a Nash equilibrium for the game between producer and trader, described just above. We start from some heuristics that would lead to some candidate equilibrium, while the rigorous verification is postponed to the next sub-section as usual.
First, assuming $$h_t = \varphi (t,q_t,S_t)$$ and consequently $$\Delta _t = \varphi _s (t,q_t,S_t)$$, while exploiting the market completeness as for the previous two models, we can rewrite the running best-response functions of the two players as follows
\begin{aligned} v(t,q,s; z)= & {} \sup _u {\mathbb {E}}\left[ \int _t ^T \left( q_r (S_r -aq_r) - \frac{\kappa }{2}u_r ^2 - \lambda (\mu -a u_r) \varphi _s \right) dr \mid q_t =q, S_t =s \right] , \\ w(t,q,s; u)= & {} \sup _z {\mathbb {E}}\left[ \int _t ^T \left( - \frac{g}{2} z_r ^2 + \lambda (\mu -au_r) \varphi _s \right) dr \mid q_t =q, S_t =s \right] , \end{aligned}
where $$\varphi _s$$ is the delta hedging of the derivative, that will have to be determined at the equilibrium. It is reasonable to expect that the derivative’s price $$\varphi (t,q,s)$$ is the solution to the following PDE:
\begin{aligned} \varphi _t + \frac{\sigma ^2}{2}(1+z) \varphi _{ss} =0, \quad \varphi (T, s,q) = (s-aq)^2, \end{aligned}
(4.4)
for some function $$z=z(t,q,s)$$ coming from the trader’s best-response. The PDE above is coupled with the following two HJB equations, arising from the best-response functions of the two players:
\begin{aligned} -v_t= & {} \sup _{u} \left\{ q(s- a q) - \frac{\kappa }{2} u^2 - \lambda \varphi _s (\mu - a u ) + v_q u + v_s \mu + \frac{\sigma ^2}{2} (1+z) v_{ss} \right\} , \end{aligned}
(4.5)
\begin{aligned} -w_t= & {} \sup _{z} \left\{ - \frac{g}{2} z^2 + \lambda \varphi _s (\mu - a u ) + w_q u + w_s \mu + \frac{\sigma ^2}{2} (1+z) w_{ss} \right\} , \end{aligned}
(4.6)
with terminal conditions $$v(T)=w(T)=0$$. Solving the optimization problems within the HJB equations above, we get the (candidate) equilibrium strategies for the producer and the trader in terms of the corresponding payoff functions:
\begin{aligned} {\widehat{u}} = \frac{1}{\kappa } \big ( v_q + \lambda a \varphi _s \big ), \quad {\widehat{z}} = \frac{\sigma ^2}{2 g} w_{ss}. \end{aligned}
The HJB equations for the producer and the trader become respectively as
\begin{aligned} - v_t = q (s - a q) + \frac{1}{2 \kappa } \big ( v_q + \lambda a \varphi _s \big )^2 - \mu \lambda \varphi _s + \mu v_s + \frac{\sigma ^2}{2} \left( 1 + \frac{\sigma ^2}{2 g} w_{ss} \right) v_{ss} \end{aligned}
and
\begin{aligned} - w_t = \lambda \mu \varphi _s - \frac{1}{\kappa } \big ( v_q + \lambda a \varphi _s \big ) \big ( \lambda a \varphi _s - w_q \big ) + \mu w_s + \frac{\sigma ^2}{2} w_{ss} + \frac{\sigma ^4}{8 g} w^2_{ss} . \end{aligned}
Furthermore, the PDE giving the option equilibrium price $$\varphi$$ is given:
\begin{aligned} \varphi _t + \frac{\sigma ^2}{2} \left( 1 + \frac{\sigma ^2}{2g} w_{ss} \right) \varphi _{ss} = 0, \hbox {with} \quad \varphi (T,q,s) = (s-aq)^2. \end{aligned}
Analogously as in the previous two models, we use the following ansatz for w:
\begin{aligned} w(t,q,s) = A_w(t) q^2 + B_w(t) s^2 + C_w(t) q s + D_w(t) q + E_w(t) s + F_w(t), \end{aligned}
and similarly for v and $$\varphi$$ with self-explanatory notation for their coefficients. Therefore, using the ansatz and proceeding in the usual way, we easily get
\begin{aligned} \varphi (t,q,s) = (s - a q) ^2 + F_{\varphi }(t), \end{aligned}
where
\begin{aligned} F_{\varphi }(t) := \sigma ^2 \int _t^T \left( 1 + \frac{\sigma ^2}{g} B_{w}(r) \right) dr. \end{aligned}
After tedious yet straightforward computations we obtain
\begin{aligned} - A_v^{'}&= - a + \frac{2}{\kappa } \left( A_v - a^2 \lambda \right) ^2 \end{aligned}
(4.7)
\begin{aligned} - B_v^{'}&= \frac{1}{2 \kappa } \left( C_v + 2 a \lambda \right) ^2 \end{aligned}
(4.8)
\begin{aligned} - C_v^{'}&= 1 + \frac{2}{\kappa } \left( A_v - a^2 \lambda \right) \left( C_v + 2 a \lambda \right) \end{aligned}
(4.9)
\begin{aligned} - D_v^{'}&= \mu \left( C_v + 2 a \lambda \right) + \frac{2}{\kappa } D_v \left( A_v - a^2 \lambda \right) \end{aligned}
(4.10)
\begin{aligned} - E_v^{'}&= 2 \mu \left( B_v - \lambda \right) + \frac{1}{\kappa } D_v \left( C_v + 2 a \lambda \right) \end{aligned}
(4.11)
\begin{aligned} - F_v^{'}&= \mu E_v + \frac{1}{2 \kappa } D^2_v + \sigma ^2 \left( 1 + \frac{\sigma ^2}{g} B_w \right) B_v \end{aligned}
(4.12)
\begin{aligned} - A_w^{'}&= \frac{4 }{\kappa } \left( A_v - a^2 \lambda \right) \left( A_w + a^2 \lambda \right) \end{aligned}
(4.13)
\begin{aligned} - B_w^{'}&= \frac{1}{\kappa } \left( C_w - 2 a \lambda \right) \left( C_v + 2 \lambda a \right) \end{aligned}
(4.14)
\begin{aligned} - C_w^{'}&= \frac{2 }{\kappa } \left[ ( A_v - \lambda a^2) ( C_w - 2 a \lambda ) + ( A_w + a^2 \lambda ) ( C_v + 2 \lambda a ) \right] \end{aligned}
(4.15)
\begin{aligned} - D_w^{'}&= \mu \left( C_w - 2 a \lambda \right) + \frac{2 }{\kappa } D_v \left( A_w + a^2 \lambda \right) + \frac{2 }{\kappa } D_w \left( A_v - a^2 \lambda \right) \end{aligned}
(4.16)
\begin{aligned} - E_w^{'}&= 2 \mu \left( B_w + \lambda \right) + \frac{1 }{\kappa } D_w \left( C_v + 2 a \lambda \right) + \frac{1 }{\kappa } D_v \left( C_w - 2 a \lambda \right) \end{aligned}
(4.17)
\begin{aligned} - F_w^{'}&= \mu E_w + \frac{1}{\kappa } D_v D_w + \sigma ^2 B_w + \frac{\sigma ^4}{2 g} B^2_w \end{aligned}
(4.18)
with zero terminal condition for all ODEs above. The first equation, which is a Riccati ODE, can be solved explicitly giving the same expression as for D in the first model:
\begin{aligned} A_v (t) = - \frac{2(a-\frac{2\lambda ^2 a^4}{\kappa })(e^{\theta (T-t)} -1)}{\theta (e^{\theta (T-t)} +1) +\frac{4\lambda a^2}{\kappa } (e^{\theta (T-t)} -1)}, \quad \theta := \sqrt{\frac{8a}{\kappa }}. \end{aligned}
The other equations are linear, hence they can be solved in integral form. For the moment, we give only the expressions for the coefficients that we need in order to compute the equilibrium strategy $${\widehat{z}}$$ of the trader. They are given by:
\begin{aligned} A_w (t)= & {} \frac{4a^2 \lambda }{\kappa } \int _t ^T e^{\frac{4}{\kappa }\int _t ^u (A_v(r) - a^2 \lambda ) dr} (A_v(u) - a^2 \lambda ) du,\\ B_w (t)= & {} \frac{1}{\kappa } \int _t ^T (C_w(r)-2a\lambda )( C_v(r) + 2a\lambda )dr ,\\ C_w(t)= & {} \frac{2}{\kappa } \int _t ^T e^{\frac{2}{\kappa }\int _t ^u (A_v(r) - a^2 \lambda ) dr} \left[ (A_w(u) + a^2 \lambda ) (C_v (u) + 2a\lambda ) - 2a\lambda (A_v (u) - a^2 \lambda ) \right] du ,\\ C_v (t)= & {} \int _t ^T e^{\frac{2}{\kappa }\int _t ^u (A_v(r) - a^2 \lambda ) dr} \left( 1+ \frac{4 a \lambda }{\kappa }(A_v (u) - a^2 \lambda ) \right) du. \end{aligned}

### 4.2 Verification

Theorem 4.1
Let $$A_v,C_v,D_v, B_w$$ be solutions to the system ( 4.7)–( 4.18) such that
\begin{aligned} B_w (t) > -\frac{g}{\sigma ^2}, \quad t \in [0,T]. \end{aligned}
(4.19)
Then there exists a Nash equilibrium $$({\widehat{u}}, {\widehat{z}}) \in {\mathcal {A}}$$ as in Definition 4.2, where
\begin{aligned} {\widehat{u}}_t = \frac{1}{\kappa }\left[ C_v (t) {\widehat{S}}_t + 2A_v (t) {\widehat{q}}_t + D_v (t) + 2\lambda a ({\widehat{S}}_t -a{\widehat{q}}_t) \right] , \quad {\widehat{z}}_t = \frac{\sigma ^2}{g}B_w(t), \quad t\in [0,T].\nonumber \\ \end{aligned}
(4.20)
The no-arbitrage equilibrium price process for the derivative $$h_T$$ is given by
\begin{aligned} {\widehat{h}}_t := h_t^{{\widehat{u}}, {\widehat{z}}} = ({\widehat{S}}_t -a{\widehat{q}}_t)^2 + \sigma ^2 \int _t ^T \left( 1+ \frac{\sigma ^2}{g}B_w(u)\right) du, \quad t \in [0,T], \end{aligned}
(4.21)
and the hedging process at equilibrium is
\begin{aligned} {\widehat{\Delta }} _t := \Delta _t^{{\widehat{u}}, {\widehat{z}}} = 2 ({\widehat{S}}_t -a{\widehat{q}}_t), \quad t\in [0,T]. \end{aligned}
(4.22)
Proof
Analogously as for the previous two models, the proof is structured in two steps.
1.
Admissibility. We start from the verification that the pair $$({\widehat{u}}, {\widehat{z}})$$ belongs to $${\mathcal {A}}$$. The two processes $${\widehat{u}},{\widehat{z}}$$ are clearly progressively measurable by definition, where $${\widehat{u}}$$ takes real values. Moreover, assumption ( 4.19) implies that $${\widehat{z}}_t > -1$$ for all $$t \in [0,T]$$. Concerning the integrability properties in Definition 4.1(i), since both $${\widehat{u}}$$ and $${\widehat{z}}$$ are affine in the state variables $${\widehat{q}}_t,{\widehat{S}}_t$$ with time continuous (hence bounded) coefficients, they boil down to checking
\begin{aligned} {\mathbb {E}}\left[ \int _0 ^T ({\widehat{q}}_t ^2 + {\widehat{S}}_t ^2) dt \right] < \infty . \end{aligned}
(4.23)
For the square integrability of $${\widehat{S}}$$, observe that since $${\widehat{z}}$$ is a deterministic continuous function of time, each $${\widehat{S}}_t$$ is normally distributed, hence it has every moment and they are continuous in time. Therefore $${\mathbb {E}}[\int _0 ^T {\widehat{S}}_t ^2 dt] < \infty$$. To verify the square integrability of $${\widehat{q}}$$, notice that at equilibrium we have
\begin{aligned} d{\widehat{q}}_t = [\alpha (t){\widehat{q}}_t + \beta (t){\widehat{S}}_t + \gamma (t)] dt, \quad {\widehat{q}}_0 =q_0, \end{aligned}
for some deterministic continuous functions of time $$\alpha , \beta$$ and $$\gamma$$. Such a linear ODE can be solved pathwise, giving
\begin{aligned} {\widehat{q}}_t = e^{\int _0 ^t \alpha (r) dr} \left( q_0 + \int _0 ^t e^{-\int _0 ^r \alpha (u)du}\left( \beta (t){\widehat{S}}_r + \gamma (r)\right) dr\right) , \quad t\in [0,T]. \end{aligned}
This implies that showing $${\mathbb {E}}[\int _0 ^T {\widehat{q}}_t ^2 dt] < \infty$$ reduces to $${\mathbb {E}}[ \int _0 ^T (\int _0 ^t {\widehat{S}}_ r dr)^2 dt] < \infty$$, which follows since $$\Sigma _t := \int _0 ^t {\widehat{S}}_r dr$$ is a Gaussian process with time continuous second moment. Regarding condition (ii), we need to show that there exists a unique EMM $$\widehat{{\mathbb {Q}}} = {\mathbb {Q}}^{{\widehat{u}},{\widehat{z}}}$$ for the production process $${\widehat{q}}$$. Let $${\widehat{\gamma }}_t := \frac{\mu -a{\widehat{u}}_t}{\sigma \sqrt{1+{\widehat{z}}_t}}$$ and let us consider
\begin{aligned} L_t ^{{\widehat{u}},{\widehat{z}}} := \exp \left\{ -\int _0 ^t {\widehat{\gamma }}_t dW_t - \frac{1}{2} \int _0 ^t {\widehat{\gamma }}_t ^2 dt \right\} ,\quad t \in [0,T]. \end{aligned}
We use once more [ 23, Theorem 2.1] to prove that under our assumptions the probability $$d{\widehat{{\mathbb {Q}}}} := L_T ^{{\widehat{u}},{\widehat{z}}} d{\mathbb {P}}$$ is well-defined (see also [ 22]). For this model, Assumption 2.2 in [ 23] is satisfied as long as $$\sigma ^2 (1+{\widehat{z}}_t) > 0$$ for all $$t\in [0,T]$$, which is immediately given by our assumption that $$B_w (t) > -g/\sigma ^2$$ ensuring, as we already saw above, that $${\widehat{z}}_t > -1$$ for all $$t \in [0,T]$$. To end this part, we need to check property (iii) in Definition 4.1. The first part, relative to $${\widehat{h}}$$, is done as in the proof of Theorem 2.1, hence the details are omitted. Concerning $${\widehat{\Delta }}$$, first notice that it is clearly progressively measurable and takes real values. Due to the fact that $${\widehat{\Delta }}_t$$ is linear in both $${\widehat{q}}_t$$ and $${\widehat{S}}_t$$, its integrability property is equivalent to ( 4.23), which has already been checked before.

2.
Equilibrium. We are going to use the martingale optimality principle here as well to verify that at the proposed equilibrium $$({\widehat{u}}, {\widehat{z}})$$ both players are implementing an optimal response to each other strategy. Let us consider the producer first and define the following process
\begin{aligned} Y^{u,{\widehat{z}}}_t := \int _0 ^t \left( q_r ({\widehat{S}}_r -aq_r) - \frac{\kappa }{2}u_r ^2 - \lambda \Delta _r^{u, {\widehat{z}}} (\mu -a u_r) \right) dr + W(t,q_t, {\widehat{S}}_t), \end{aligned}
(4.24)
with
\begin{aligned} W(t,q,s) = A_w(t) q^2 + B_w(t) s^2 + C_w(t) q s + D_w(t) q + E_w(t) s + F_w(t). \end{aligned}
Similarly as in the proof of Theorem 2.1, one can verify by applying Itô’s formula and the HJB equation ( 4.5) satisfied by the function W( tqs) by construction in the heuristics part, that $$Y^{u,{\widehat{z}}}_t$$ is a supermartingale for all u such that $$(u, {\widehat{z}}) \in {\mathcal {A}}$$, and a martingale for $$u={\widehat{u}}$$. For the trader, we consider the process
\begin{aligned} Z^{{\widehat{u}}, z}_t := \int _0 ^t \left( - \frac{g}{2}z_r ^2 + \lambda \Delta _r^{{\widehat{u}}, z} (\mu -a {\widehat{u}}_r) \right) dr + U(t, {\widehat{q}}_t, S_t), \end{aligned}
(4.25)
with
\begin{aligned} U(t,q,s) = A_v(t) q^2 + B_v(t) s^2 + C_v(t) q s + D_v(t) q + E_v(t) s + F_v(t). \end{aligned}
Applying the same arguments as for the producer, one can easily check that $$Z^{{\widehat{u}}, z}$$ is a supermartingale for all z such that $$({\widehat{u}}, z) \in {\mathcal {A}}$$ and a martingale for $$z={\widehat{z}}$$. Finally, an application of the martingale optimality principle combined with the admissibility of $$({\widehat{u}}, {\widehat{z}})$$, gives that the latter is a Nash equilibrium as in Definition 4.2. Therefore, the proof is complete. $$\square$$

Remark 4.1
Observe that in the theorem above we assumed that $$B_w (t) > -g/\sigma ^2$$ for all $$t \in [0,T]$$. This is satisfied when the maturity T is small enough. Indeed, one can reason heuristically in the following way: when $$T \approx 0$$, using the Eq. ( 4.14) for $$B_w$$ we have $$B'_w (T) \approx \frac{1}{\kappa }(2\lambda a)^2 >0$$ and we also have $$B_w (T) = 0$$. Therefore, it is natural to expect $$B_w (t) > -g/\sigma ^2$$ for all $$t \in [0,T]$$ when T is small enough, which would also guarantee that the Radon–Nikodym derivative $$d{\mathbb {Q}}^{{\widehat{u}},{\widehat{z}}}/d{\mathbb {P}} = L_T ^{{\widehat{u}},{\widehat{z}}}$$ is well-defined (see the second part of the proof above). Unfortunately, the study of the function $$B_w$$ is much more difficult in this case than in the model of Sect.  2, where we were able to quantify precisely how small T must be.

## 5 Numerical illustration

In this section, we use numerical simulations to illustrate and explain the behaviours of the producer in the three models and of the trader in the third one. We set the drift of the commodity price to zero, $$\mu =0$$, in Models 2 and 3, to simplify the analysis.
Model 1: production-based manipulation. The understanding of the first model is quite straightforward and it is illustrated by the first column of Fig.  1. Starting from a zero production rate q, the optimal strategy of the producer, whether or not she holds a derivative position, is to reach as fast as possible the optimal production rate that maximizes the running profit $$\displaystyle q^\star := \frac{s_0}{2 a}$$. We have seen that when the producer has no position in the derivative market, she has no interest in increasing the volatility using some randomization of her production rate q. Indeed, her expected profit is proportional to $${\mathbb {E}}\big [-q_t^2]$$ and thus, increasing the volatility decreases her expected profit. On the contrary, since controlling the volatility has a cost, she makes costly efforts to reduce it. When the producer holds a derivative position, we first note that she uses her market power to drive the price of the commodity at maturity to a level that suits her profit. If she has bought (resp. sold) the derivative, she drives the commodity price up (resp. down). For instance, in case of a sale, the derivative is sold at, say, 100, but at maturity its payoff is close to zero, ensuring a profit of nearly 100. Figure  2 (left) gives the value function of the producer at time zero as a function of the derivative position. We see that selling derivatives ( $$\lambda >0$$) can only make her better off while buying derivatives requires a certain amount of sales before it is worth the cost.
Regarding the volatility, since the price $$h_0$$ of the derivative is an increasing function of the realized volatility, the producer may have an interest in increasing the volatility to push the value of the derivative up. But, since increasing the volatility has a negative effect on the expected profit, the producer has to assess this trade-off. Using the Remark 2.2 together with the expression for $${\widehat{z}}$$ in (2.8) and noting that it makes sense to increase the volatility only in case the producer has sold the derivative ( $$\lambda >0$$) we have that
\begin{aligned} {\widehat{z}} \ge 0 \quad \Leftrightarrow \quad \lambda \ge \sqrt{\frac{\kappa }{2 a^3}}. \end{aligned}
(5.1)
If the net position exceeds the threshold above, the benefit of increasing the volatility outweighs the cost. The higher the market power, the lower the threshold. Besides, it is worth noting that this threshold does not depend on the cost of intervention g to reduce the volatility. It only depends on the parameters affecting the drift of the commodity price process. In Fig.  1, we choose a large short position of $$\lambda =1$$ which makes the profit on the derivative as important as the profit from production. In that case, the producer increases more than by half the volatility. Figure  3 illustrates the variation of the price of the derivative $$h_0$$, of the expected payoff $${\mathbb {E}}^{\mathbb {P}}[ h_T]$$ but also of the price of the derivative if no volatility manipulation was undertaken, noted $$h_0^{z=0}$$, as a function of the holding position $$\lambda$$. In these simulations, we started the initial production rate at its optimal stationary level $$q^\star$$ to get rid of transitory effects. For the first model, we observe that $$h_0$$ is an increasing function of the position, but it varies much less than the expected terminal payoff. It means that much of the benefit from holding a derivative position comes from the manipulation of the price at maturity.
Model 2: production and information based manipulation. The story for the second model is illustrated by the second column of Fig.  1 and it has many points in common with the first model: the producer optimal production strategy is to reach the stationary optimal level $$q^\star$$ and to drive the price at maturity up in case of a purchase and down in case of a sale. But, now, contrary to the former case, as pointed out in Remark  3.3, the producer always increases the volatility whatever her net position is, long or short, because her profit rate is an increasing function of the volatility. As a consequence, we observe in the second column of Fig.  3 that $$h_0$$ is always greater than $$h_0^{z=0}$$. Further, the variation of $${\mathbb {E}}^{\mathbb {P}}[h_T] - h_0$$ is much larger now, when the producer can separate the manipulation of the drift and of the volatility, than in the first model. Figure  2 (middle) provides the value function of the producer at time zero as a function of the derivative position. The situation here is very similar to what we observed in the first model, namely selling derivatives ( $$\lambda >0$$) results in a profit, while buying derivatives is worth the cost as soon as the amount of sales exceeds a threshold depending on the model parameters.
In both models 1 and 2, the capacity of driving the price of the commodity at maturity at a desired level reveals itself an efficient tool to take advantage of a derivative position. If the producer has sold the derivative at, say, 100, she increases her production rate so that at terminal date, the price of the commodity decreases, making the price of the derivative decrease below the initial price and thus ensuring a profit on the derivative.
Model 3: producer-trader competition. What happens when the producer is facing an opponent who can control the level of volatility? The third column of Fig.  1 illustrates the interaction between the producer and the trader. The fact that the producer now faces an opponent does not change her overall production strategy: she still reaches the stationary optimal level of production rate $$q ^\star$$ and she manipulates the commodity price at maturity at her own advantage. But, the actions of the trader on the volatility reduces the potential profit made by the producer on the derivative. When the producer has sold (resp. bought) the derivative to the trader, the trader reduces (resp. increases) the volatility to push the price $$h_0$$ of the derivative down. The third column of Fig.  3 shows a much lower variation of the derivative profit $$h_0 - {\mathbb {E}}^{\mathbb {P}}[h_T]$$ than in the first two models. Besides, we observe that for $$\lambda >0$$, we have $$h_0- {\mathbb {E}}^{\mathbb {P}}[h_T] > 0$$ and for $$\lambda <0$$, we have $$h_0- {\mathbb {E}}^{\mathbb {P}}[h_T] < 0$$, meaning that in each case, the trader is making a loss.
We have seen that the producer can drive the price at maturity at a level that would make her derivative position a profitable trade for her, providing her with an efficient tool in this asymmetric game of price manipulation. In this situation, considered the potential strong asymmetry of power in this game, a natural question on whether an exchange level $$\lambda$$ that would make both players better off exchanging might arise. Figure  2 (right) gives the value functions of the producer and of the trader at time zero as a function of the derivative position. We observe that even if the value function of the producer exhibits the same pattern as in the first two models, her expected profit is now considerably reduced due to the counteraction of the trader. Besides, the value function of the trader is concave and admits an optimum at a position that makes the producer worse off trading. Further, we observe that in this situation, with a zero initial rate of production, neither the producer nor the trader are better off trading.
But, if we consider that the production rate starts at its optimal stationary level $$q^\star$$, we find that whatever the market power of the producer, there is an exchange position making both the producer and the trader better off than not making a trade. Figure  4 presents the value functions of the producer and the trader at initial time for different values of the market power parameter a and different cost of intervention for the trader g. In each case, we chose as an initial production rate $$q_0$$ $$=$$ $$q^\star$$ $$=$$ $$s_0/(2a)$$, the stationary level of production, avoiding in this way the transitory phase to optimal production rate. When the producer and the trader do not trade ( $$\lambda =0$$), their respective value $$v(0,q_0,s_0)$$ and $$w(0,q_0,s_0)$$ stand at the intersection of the black axis. As the trader starts to sell the derivative, $$\lambda$$ becomes negative and both values are greater than when $$\lambda =0$$, showing that both are better off making the trade. Further, we observe that both are worse off in the case where the trader buys the derivative from the producer.

## Acknowledgements

This work was supported by University of Padova, via the research grant BIRD172407/17 “New perspectives in stochastic methods for finance and energy markets”.

## Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Literatur
Über diesen Artikel

Zur Ausgabe