Open Access 22032021
An ImpulseRegime Switching Game Model of Vertical Competition
Published in: Dynamic Games and Applications  Issue 4/2021
Abstract
We study a new kind of nonzerosum stochastic differential game with mixed impulse/switching controls, motivated by strategic competition in commodity markets. A representative upstream firm produces a commodity that is used by a representative downstream firm to produce a final consumption good. Both firms can influence the price of the commodity. By shutting down or increasing generation capacities, the upstream firm influences the price with impulses. By switching (or not) to a substitute, the downstream firm influences the drift of the commodity price process. We study the resulting impulseregime switching game between the two firms, focusing on explicit thresholdtype equilibria. Remarkably, this class of games naturally gives rise to multiple potential Nash equilibria, which we obtain thanks to a verificationbased approach. We exhibit three candidate types of equilibria depending on the ultimate number of switches by the downstream firm (zero, one or an infinite number of switches). We illustrate the diversification effect provided by vertical integration in the specific case of the crude oil market. Our analysis shows that the diversification gains strongly depend on the passthrough from the crude price to the gasoline price.
1 Introduction
Since Hotelling’s [20] seminal study of commodity prices, considerable efforts have been undertaken to understand the dynamics of the equilibrium price of commodities and in particular, its longrun properties. The cyclical nature of price dynamics is driven by the substitution effect, whereby consumers will switch to a different commodity if prices rise too high. In a deterministic setting, the switching time to the substitute is simple to analyze, but with the stochastic economic cycle consumers face a huge challenge in determining when is the appropriate moment to switch. The succession of booms and busts of commodity prices complicates the switching timing. In the long run, production capacities adapt to demand and make the price oscillate around a longterm equilibrium. Indeed, the longrun behavior of commodity prices exhibits supercycle patterns. The econometric studies in Leon and Soto [24], Erten and Ocampo [15], Jacks [21] and more recently Stuemer [31] all find the presence of supercycles of several decades in the price of commodities. This phenomenon makes one wonder whether it is even necessary for the consumers to ever switch and whether it is not preferable to just wait for the prices to crash again.
The longrun dynamics of commodity prices not only heavily weighs on consumers’ longterm decisions but is also an important driver of the industrial organization of the production (upstream) and transformation (downstream) segments. In particular, from the perspective of industrials, having one foot in each of these two sides of the commodity market is supposed to induce a natural physical hedge against the undesirable effects of supercycles. For instance, Helfat and Teece [19] and Levin [23] find a positive hedging effect procured by vertical integration in the oil industry, while Mansur [27] and Aïd et al. [2] find the same effect in electricity markets.^{1}
Advertisement
In this paper, we design a dynamic model of competition between production and consumption of a commodity used as an intermediate good, allowing to draw conclusions on the longrun dynamics of the commodity price and its effects on vertical integration. In our model, two factors drive the price of the commodity: on the one hand, shortterm but persistent shocks of demand and/or production, and on the other hand, strategic decisions of the (representative) upstream production firm and of the (representative) downstream consumer firm. The upstream producer extracts the commodity at cost \(c_p\) and sells it for a price X. The downstream industry buys the commodity and converts it into a final good that has a price P, nondecreasing in X. This framework covers a wide range of industries. One might think, for example, of the agricultural sector where soy enters as an input for the food industry to produce a large range of consumer goods. In the aluminum industry, upstream smelters produce aluminum to be used by the automotive, transportation and computer industries. In the oil industry, the crude is extracted by production firms, then transformed into gasoline and kerosene by downstream refineries, and then consumed in the retail market. For the sake of simplicity, we identify the downstream firm that transforms the commodity with the final consumer and this downstream firm’s profit with the consumer’s surplus.
We focus on the role of the commodity price X that intrinsically creates competition between the representative agents of producers and consumers. In a nutshell, producers prefer high price X, while consumers prefer low price X. This competition is dynamic and manifests itself through strategic price effects actuated by the two industries. Therefore, X is (partially) jointly controlled by the producers/consumers, leading to gametheoretic impacts.
On the upstream production side, the producer needs the commodity price X to be high enough to make a profit margin. We suppose that the dynamics of investment and disinvestment in upstream production is driven by production capacity shocks that cause jumps in the price X. This assumption is consistent with the theory of real options that predicts the existence of threshold prices triggering the decision of entry and the exit from the market (see MacDonald and Siegel [26] and Dixit and Pindyck [14]). It is also consistent with the observations of quick swings in investment and disinvestment in production, see, e.g., the boom and bust of commodity prices in 2008–2010.
On the downstream consumer side, consumers induce a longterm effect on the commodity price only if they switch to a substitute, and they switch to a substitute only if they anticipate that X will remain high enough for a long time. The downstream side faces slower dynamics because it involves the transformation of many local installations using the commodity. To have an example in mind, one may think of the thousands of adjustments required to change heating systems in buildings, or of the slow effect of the energy saving programs launched by OECD after the 1970s oil shocks. Thus, in our model, the downstream market for the final good can be in contraction or expansion regime. The contraction regime corresponds to a decreasing demand for the primary commodity, i.e., the market is abandoning the use of the commodity for a substitute, while the expansion mode corresponds to an increasing demand for the commodity. Depending on the state of the downstream retail market, the drift of the commodity price takes either a constant positive value in the expansion mode or a negative value in the contraction mode. Because such consumer shifts are slow and expensive, the state is persistent (i.e., piecewise constant in time) and changing the state of the finalgood market incurs heavy switching costs. This toggling of the price trend can be interpreted as endogenous regime switching, a common way of modeling commodity prices through the business cycle. Indeed, since the seminal paper by Hamilton [17] on the modeling of financial timeseries with regimeswitching models, this type of models has been successfully applied to a large range of commodities spot prices, such as crude oil (Alizadeh et al. [4]), precious metals (Choi and Hammoudeh [11]), lumber (Chen and Insley [10]) and power (Haldrup and Nielson [16]). Beyond the impact of producers and consumers decisions, the commodity price is subject to exogenous shortterm stochastic shocks, captured through a Brownian motion driving risk factor.
Advertisement
Our aim is to construct and characterize the dynamic equilibrium in the commodity market due to this vertical competition. Our major contribution is to provide an endogenous, gametheoretic basis for two key stylized features of commodity markets: (i) supercycles that manifest as longterm meanreversion; (ii) fundamental impact of supply and demand that maintains the price in a range of values rather than a single equilibrium value. Furthermore, our model allows for three potential types of equilibria depending on the number of demand switches undertaken by the consumer at equilibrium: zero, one, or an infinite number of switches. All thresholdtype equilibria exhibit the latter qualitative properties. Besides, the higher the consumer’s switching cost, the more she is compelled to endure an unfavorable range of prices.
We name the three types of potential equilibria in our model as generic, transitory and preemptive. In the generic equilibrium described precisely in Sect. 4.1, both the upstream and downstream representative firms act on the price perpetually: the downstream firm repeatedly switches from expansion to contraction, and the upstream firm applies investment and disinvestment impulses. This equilibrium is likely in sectors where there are easily available substitutes, e.g., for agricultural products. In the transitory equilibrium described in Sect. 4.2, the consumer and the producer both prefer a given regime and thus, the consumer switches at most once when the market is initialized in the opposite regime. Afterward, only the producer acts to maintain the price within her preferred range of values by appropriate investment/disinvestment actions. In the last type of equilibrium, the consumer is also stuck forever in a regime, but it is a regime she wishes to leave. In that case described in Sect. 4.3, only the producer acts. Starting in the expansion regime, for instance, the consumer would like to switch to the contraction regime when the price reaches a threshold. But the producer, who prefers perpetual expansion, preempts the switch by acting just before the action of the consumer. Transitory and preemptive equilibria are likely to be observed in sectors where substitutes are hard to come by and where upstream firms dominate.
Our model relates to the vast literature on the dynamics of commodity prices. An important stream of the economic literature deals with the joint dynamics of futures and spot prices and its relation with storage. The seminal references are the papers by Deaton and Laroque [13] and Routledge et al. [30]. The monograph by Pirrong [29] provides a thorough discussion of this field of research. In this paper, our focus is to explain the cycles of commodity prices as the result of vertical competition. The paper which is the closest to our work is Cassasus et al. [8] addressing longterm competitive equilibrium of crude oil, which also exhibits meanreversion and regime switching produced by episodes of over and undersupply, generated by lumpy irreversible investment performed by a representative agent. Although our model shares some features with Cassasus et al. [8] (the absence of storage for instance), we get a richer set of equilibria generated by the antagonist objectives of the upstream and downstream sectors.
Along the way, we also make mathematical contributions to the literature on nonzerosum stochastic games (see Martyr and Moriarty [25], Attard [5], De Angelis et al. [9], Aïd et al. [1]). To our knowledge ours is the first paper that: (i) considers a mixed impulse–control/switching–control stochastic game; (ii) explicitly constructs a potential impulseswitching thresholdtype equilibria in nonzerosum games; provides new verification theorems regarding bestresponse strategies for (iii) an impulsing agent in a regimeswitching setting and (iv) switching agent with an impulsed state process. While our solution is nonexhaustive in the sense that we a priori focus on a special class of equilibria (leaving open the question of existence of other equilibrium families), it is highly tractable. Namely, we are able to provide closedform description of the dynamic equilibrium, offering precise quantitative insights regarding the producer and consumer roles and their equilibrium behavior.
To emphasize the latter point, beyond several synthetic examples that illustrate and visualize our model features, we also present a detailed case study of the diversification effect provided by vertical integration in the crude oil market circa 2019, viewed as a competition between crude oil producers and oil refiners that convert crude into gasoline and other consumer goods. In our case study, we fitted the oil market to the generic type of equilibrium, considering that oil consumption experiences alternate phases of expansion when the price is low and contraction when the price is too high. In this setting, we consider a small downstream firm asking herself whether she has an interest in getting more vertically integrated. We show that the gains from integration are directly linked to the passthrough parameter that links the crude oil price to the retail gasoline price. The higher this passthrough, the higher production activity dominates the retail activity both in terms of expected rate of profit and the standard deviation of rate of profit.
The rest of the paper is organized as follows. Section 2 sets up the competitive producer–consumer commodity market. Section 3 then constructs the respective candidates for thresholdtype impulseswitching equilibria by considering the producer and consumer bestresponse strategies. Section 4 illustrates and discusses the different types of emergent equilibria using toy examples. Section 5 presents the above vertical integration case study, and Sect. 6 concludes. All the proofs, as well as additional comparative statics, are delegated to Sect. 7.
2 The Model
2.1 Description
We use \((X_t)\) to denote the (preequilibrium) commodity price, modeled as a continuoustime stochastic process. The two players are denoted as producer and consumer. In what follows subindex p (resp. c) in the notation will always refer to the producer (resp. consumer). The market involves the original raw commodity that is being produced and the goods market (e.g., gasoline). The producer extracts the commodity at cost \(c_p\) and sells it for price x. The consumer buys it for price x, converts it into a final good, and sells it for price P.
Profit rates The price x of the commodity influences the volume of trade, captured by the demand function \(D_p (x)\). A similar phenomenon plays out in the finalgood market: the goods price P leads to sales volume \(D_c (P)\). Since the consumer is in effect the intermediary between the commodity and the goods market, she will pass some of her input price shocks to the output price \(P \equiv P(x)\).
We ignore the players’ fixed costs because they can be considered to be integrated in the investment costs, and concentrate on the variable costs and revenues that are driven by the respective input/output prices. Based on the above discussion, the instantaneous profit rate of the producer isLet \(c_c\) be the processing/conversion cost from input commodity to final good and \(\alpha \) be the respective conversion factor, so that one unit of commodity becomes \(\alpha \) units of the final good (e.g., barrels of crude oil, converted into barrels of gasoline). Then, the instantaneous profit rate of the consumer isWe note that while the consumer has market power, he is not the only user of the commodity (e.g., crude oil is also used by the petrochemical industry), so there is no direct link between production volume and consumption volume. Thus, while there is a physical link between the consumer input volume \(D_c(P)/\alpha \) and her output volume \(D_c(P)\), there is no direct link between \(D_c(P)/\alpha \) and aggregate commodity demand \(D_p(x)\).
$$\begin{aligned} \pi _p (x) := (x  c_p) D_p (x). \end{aligned}$$
(1)
$$\begin{aligned} \pi _c (x) := D_c (P) P  \frac{D_c (P)}{\alpha } (x + c_c). \end{aligned}$$
(2)
We shall consider linear inverse demandIf we further assume that \(P(x) = p_0 + p_1 x\) (the price of the final good is linearly proportional to the commodity price), and \(D_c (P) = d'_0  d'_1 P\) (finalgood demand is linearly decreasing in its price P), the profit rate of the consumer becomes:The consumer profit is concave in the commodity price x, \(\gamma _2 < 0\), if and only if the passthrough coefficient \(p_1\) is higher than the conversion factor \(1/\alpha \). It means that the finalgood price increases faster than the need of the downstream industry to produce one more good, which is a sound economic condition for having a sustainable downstream industry. To sum up, the profit rates of both producer and consumer are concave and quadratic in x.
$$\begin{aligned} D_p (x) = d_0  d_1 x. \end{aligned}$$
$$\begin{aligned} \pi _c (x)&= D_c (P(x)) \cdot \left( P(x)  \frac{(x+c_c)}{\alpha }\right) \nonumber \\&= \left( d'_0d'_1p_0 \right) \left( p_0\frac{c_c}{\alpha }\right) \nonumber \\&\quad +\, \left( \left( d'_0d'_1p_0 \right) \left( p_1\frac{1}{\alpha }\right) d'_1p_1\left( p_0\frac{c_c}{\alpha }\right) \right) x +d'_1p_1\left( \frac{1}{\alpha }p_1\right) x^2 \nonumber \\&=: \gamma _0 + \gamma _1 x + \gamma _2 x^2. \end{aligned}$$
(3)
Market Conditions We model the commodity price process \((X_t)\) as a controlled Itô diffusion of the formThe Brownian motion \((W_t)\) captures exogenous price shocks due to random demand or production fluctuations, or pertinent economic shocks for the industry. In this sense, the model is agnostic in the reasons why the commodity price fluctuates around its mean trend. The point process \(N_t := \sum _{i\ge 1} \xi _i {\mathbf {1}}_{\{ \tau _i \le t \} }\) captures the producer interventions at times \((\tau _i)_{i \ge 1}\) and impulses \((\xi _i)_{i \ge 1}\). A positive impulse is triggered by an investment phase and has a negative impact on the price. A negative impulse is induced by a disinvestment phase and has a positive impact on the price.
$$\begin{aligned} \text {d}X_t = \mu _t \text {d}t + \sigma \text {d}W_t  \text {d}N_t. \end{aligned}$$
(4)
The drift process \((\mu _t)\) represents the state of the retail market for the final good. It is either in expansion or in contraction state. When in expansion, demand is growing faster than the available production capacity; hence, prices tend to rise: \(\mu _t = \mu _+ > 0\). When in contraction, the demand is shrinking faster than the production capacity; thus, the price tends to decrease, and thus, \(\mu _t = \mu _ < 0\). This modeling corresponds to an imperfect adjustment of the market as in a sticky price model in macroeconomics. The drift is fully controlled by the consumer,where \(\sigma _i\) is the ith switching instance taken by the consumer in the case \(\mu _{0}=\mu _+\) (with the convention \(\sigma _0 =0\), so that \(\sigma _1\) is the first switching time) and analogously when \(\mu _{0}=\mu _\) by interchanging odd and even switching times. Thus, both players influence \((X_t)\), although their actions are of distinct types, namely impulse control \((N_t)\) by the producer and switchingdrift control \((\mu _t)\) by the consumer. The resulting controlled price dynamics are denoted as \(X^{(\mu ,N)}\).
$$\begin{aligned} \mu _t = \mu _+ \sum _{i=0}^\infty {\mathbf {1}}_{\{ \sigma _{2i} \le t< \sigma _{2i+1} \} } + \mu _ \sum _{i=1}^\infty {\mathbf {1}}_{\{ \sigma _{2i1} \le t < \sigma _{2i} \} },\qquad t \ge 0, \end{aligned}$$
The quadratic nature of the upstream and downstream profit rate functions \(\pi _p(\cdot )\) and \(\pi _c(\cdot )\) implies that each player has their own natural habitat given by the intervals \((x_c^1, x_c^2)\) and \((x_p^1,x_p^2)\) for commodity price levels with:Players make a positive profit only when the price is in the interval \((x^1_i,x^2_i)\), \(i \in \{c,p\}\). The concavity of the profit functions implies that players have preferred commodity levels \({\bar{X}}_p, {\bar{X}}_c\) that maximize their profit rates, namely:Typically, we expect that \({\bar{X}}_c < {\bar{X}}_p\), so that the preferred commodity price of the consumer is lower than that of the producer. The stochastic fluctuations coming from \((W_t)\) can generate three different market conditions:In the first and last cases, both players have the same preferences to raise or decrease \(X_t\); in the intermediate case, they compete against each other. Because both players can in principle push \((X_t)\) in either direction, the market organization is influenced by their relative gain of doing so, as well as their action costs. In cases I and III, the players are in waiting mode because of the secondmover advantage, hoping that the other will act first which allows the second to benefit from the price effect without paying the cost of (dis)investment or of switching. In case II, they are in preemption mode, with the player who moves first being able to increase her profits at the expense of the other. These dynamic shifts between waiting and preemption is an important feature of vertical competition.
$$\begin{aligned} x^1_p&:= \min \Big \{c_p,\frac{d_0}{d_1}\Big \} , \qquad&x^2_p := \max \Big \{c_p,\frac{d_0}{d_1}\Big \} , \end{aligned}$$
(5)
$$\begin{aligned} x^1_c&:= \min \Big \{ \frac{p_0c_c/\alpha }{1/\alpha p_1},\frac{d'_0d'_1p_0}{p_1d'_1}\Big \} , \qquad&x^2_c := \max \Big \{\frac{p_0c_c/\alpha }{1/\alpha p_1},\frac{d'_0d'_1p_0}{p_1d'_1}\Big \}. \end{aligned}$$
(6)
$$\begin{aligned} {\bar{X}}_p := \frac{d_0 + c_p d_1}{2 d_1}, \qquad {\bar{X}}_c := \frac{\gamma _1}{2 \gamma _2}. \end{aligned}$$
(7)
$$\begin{aligned} X_t< {\bar{X}}_c&\quad \text {I: abnormally low prices}; \\ {\bar{X}}_c \le X_t \le {\bar{X}}_p&\quad \text {II: vertical competition}; \\ {\bar{X}}_p < X_t&\quad \text {III: abnormally high prices}. \end{aligned}$$
Objective Functions and Admissible Strategies The objective functionals of the players consist of integrated profit rates \(\pi _{\cdot }(x)\), discounted at constant rate \(\beta >0\) and subtracting the control costs that are paid at respective intervention epochs. We take the investment cost function of the producer to be some convex function \(K_p : {\mathbb {R}} \rightarrow {\mathbb {R}}\), and of the consumer as \(H: \{\mu _, \mu _+\} \rightarrow {\mathbb {R}}_+\). We denote the latter as \(H(\mu _) = h_, H(\mu _+) = h_+\). Depending on the initial drift \(\mu _0\) being positive/negative, the producer’s objective function is given by:and, similarly, the representative consumer’s objective function is:In order for the state variable dynamics and players’ expected payoffs to be well defined, we give the following definition of admissible strategies. To this end, let \((\Omega , {\mathcal {F}}, ({\mathcal {F}}_t)_{t \ge 0}, {\mathbb {P}})\) be a probability space with a filtration satisfying the usual conditions and supporting an \(({\mathcal {F}}_t)_{t \ge 0}\)Brownian motion \((W_t)\).
$$\begin{aligned} J_p ^\pm (x; N,\mu ) := {\mathbb {E}}\Big [ \int _0^\infty e^{ \beta \, t} \big ( X_t  c_p) D_p(X_t) \text {d}t  \sum _i e^{\beta \, \tau _i} K_p (\xi _{i}) \Big  \, \mu _0 = \mu ^\pm , X_0 = x\Big ], \end{aligned}$$
(8)
$$\begin{aligned} J_c ^\pm (x; N,\mu ) := {\mathbb {E}}\Big [ \int _0^\infty e^{ \beta \, t} \big ( \gamma _0 + \gamma _1 X_t + \gamma _2 X_t ^2) \text {d}t  \sum _j e^{\beta \, \sigma _j} H(\mu _{\sigma _j}) \Big  \, \mu _0 = \mu ^\pm , X_0 = x \Big ].\nonumber \\ \end{aligned}$$
(9)
Definition 1
(Admissible strategies) We say that \((\tau _i, \xi _i)_{i \ge 1}\) is an admissible strategy for the producer if the following properties hold: Similarly, we say that the sequence \((\sigma _j)_{j \ge 1} \) is an admissible strategy for the consumer if The set of all producer’s (resp. consumer’s) admissible strategies is denoted by \({\mathcal {A}}_p\) (resp. \({\mathcal {A}}_c\)).
1.
\((\tau _i)_{i \ge 1}\) is a sequence of \([0,\infty ]\)valued stopping times such that \(0 \le \tau _1< \tau _2 < \cdots \) and \(\lim _{i \rightarrow \infty } \tau _i =\infty \) a.s., with the convention that \(\tau _i = \infty \) for some \(i \ge 1\) implies \(\tau _k = \infty \) for all \(k \ge i\);
2.
\((\xi _i)_{i \ge 1}\) is a sequence of realvalued \({\mathcal {F}}_{\tau _i}\)measurable random variables;
3.
the sequence \((\tau _i , \xi _i)_{i \ge 1}\) satisfies \(\sum _{i \ge 1} e^{\beta \tau _i } \xi _i \in L^2 ({\mathbb {P}})\).
4.
each \(\sigma _j\) is a \([0,\infty ]\)valued stopping time, \(0 \le \sigma _1< \sigma _2 < \cdots \), with the convention that \(\sigma _j = \infty \) for some \(j \ge 1\) implies \(\sigma _k = \infty \) for all \(k \ge j\);
5.
\(\sum _{j \ge 1} e^{\beta \sigma _j} \in L^2 ({\mathbb {P}})\).
Remark 1
Observe that the property 1 above implies that the producer intervention times do not accumulate in finite time, so that for all \(t >0\) the process \(N_t = \sum _{i\ge 1} \xi _i {\mathbf {1}}_{\{ \tau _i \le t \} }\), \(t \ge 0\), is well defined, adapted and finitevalued. Moreover, the integrability condition in 5 gives that \(\sigma _j \rightarrow \infty \) (as \(j \rightarrow \infty \)), i.e., the switching times of the consumer do not accumulate in finite time either, so that the dynamics of the controlled state variable (4) is well defined too. Regarding the expected profits of the players, they are both finite due to integrability properties in 3 and 5 above.
Remark 2
According to the definition of admissibility above, neither player can intervene more than once at a time. However, simultaneous interventions coming from both of them are not excluded. As discussed, the dynamics of the intervention in upstream production is much faster than the switching of the consumption regime for final good. Thus, in case both players try to act simultaneously, we assume that the producer has priority. This avoids unnecessary technicalities and allows for a consistent modeling of the vertical competition.
2.2 Equilibrium
Using this notion of admissible strategies, we give the definition of Nash equilibrium.
Definition 2
(Nash equilibrium) A Nash equilibrium is any pair \(((\xi _i, \tau _i)_{i \ge 1}, (\sigma _j)_{j \ge 1}) \in {\mathcal {A}}_p \times {\mathcal {A}}_c\) satisfying the following property:for any other pair of strategies \(((\xi _i ', \tau '_i )_{i \ge 1},(\sigma '_j)_{j \ge 1}) \in {\mathcal {A}}_p \times {\mathcal {A}}_c\), where in the payoffs \(J^ _r (x; \cdot )\), \(r \in \{c,p\}\), above we have \(N'_t = \sum _{i \ge 1} \xi '_i {\mathbf {1}}_{\{ \tau '_i \le t \} }\) and \(\mu '_t = \mu _+ \sum _{i=0}^\infty 1_{\{ \sigma '_{2i} \le t< \sigma '_{2i+1} \} } + \mu _ \sum _{i=1}^\infty {\mathbf {1}}_{\{ \sigma '_{2i1} \le t < \sigma '_{2i} \} }\) for \(t\ge 0\), \(\mu ' _{0}=\mu _+\) and the convention \(\sigma ^\prime _0 =0\) (analogously in the other case \(\mu '_{0}=\mu _\) by interchanging odd and even switching times).
$$\begin{aligned} J_p ^\pm (x; N',\mu ) \le J_p ^\pm (x; N,\mu ), \qquad J_c ^\pm (x; N,\mu ') \le J_c ^\pm (x; N,\mu ), \qquad \forall x \in {\mathbb {R}}, \end{aligned}$$
In line with the envisioned Markovian structure and in order to maximize tractability, we concentrate on a specific class of dynamic equilibria. Namely, we aim to construct thresholdtype feedback Nash equilibria which are of the formandwherefor some measurable function \(\delta : {\mathbb {R}} \rightarrow {\mathbb {R}}\) and some suitable Borel sets \(\Gamma _p ^\pm ,\Gamma _c ^\pm \subset {\mathbb {R}}\). Thus, (10), (11) imply that players act based solely on the current price \((X_t)\) and demand regime \((\mu _t)\), ruling out historydependent strategies, and moreover, the strategies are characterized through fixed action regions \(\Gamma _p^\pm , \Gamma _c^\pm \) and impulse maps \(\delta (\cdot )\). We will denote by \(\tau '_k\) the aggregated intervention times coming jointly from the two players. The fact that in (10) producer’s intervention times \(\tau _i\) are defined via \(\Gamma _p (t)\) translates the assumption that in case of simultaneous interventions, the producer plays first and so her thresholds naturally depend on the drift \(\mu _{t}\) just before her and consumer’s actions (compare to Remark 2).
$$\begin{aligned} \tau _0 = 0, \quad \tau _i = \inf \{ t > \tau _{i1} : X_t \in \Gamma _p (t) \}, \quad i \ge 1, \qquad \xi _i = \delta ( X_{\tau _i}, \mu _{\tau _i}), \end{aligned}$$
(10)
$$\begin{aligned} \sigma _0 =0, \quad \sigma _j = \inf \{ t > \sigma _{j1} : X_t \in \Gamma _c (t) \}, \quad j \ge 1, \end{aligned}$$
(11)
$$\begin{aligned} \Gamma _r (t) = \Gamma ^+ _r {\mathbf {1}}_{\{\mu _t = \mu _+\}} + \Gamma ^ _r {\mathbf {1}}_{\{\mu _t = \mu _\}}, \quad r\in \{c,p\}, \end{aligned}$$
The action regions \(\Gamma _p ^\pm ,\Gamma _c ^\pm \) are expected to be as follows. The impulse intervention region of the upstream production \(\Gamma ^\pm _p =(x^\pm _\ell , x^\pm _h)\) is twosided: the producer will act whenever \(X_t\) reaches \(x^\pm _h\) from below or drops to \(x^\pm _\ell \) from above. Note that these thresholds \(x_\ell ^\pm , x_h^\pm \) are \(\mu \)dependent. On the consumption side, when \(\mu _t = \mu _+\) (expansion regime), the consumer will switch to \(\mu _\) if \(X_t\) gets too high: \(\Gamma ^+_c = (y_h, \infty )\). Similarly when \(\mu _t = \mu _\) (contraction regime), she will switch to \(\mu _+\) if \(X_t\) gets too low \(\Gamma ^_c = (\infty , y_\ell )\). Finally, when the producer intervenes, he will bring \(X_t\) to her impulse level \(x^{\pm *}_{r}\) so that the impulse amount is \(\xi ^{\pm }_{r} = x^{\pm } _{r} x^{\pm *}_{r}\). The natural ordering we expect is the producer impulses toward \({\bar{X}}_p\)and the consumer switches toward \({\bar{X}}_c\),so that when acting both players try to move X toward their preferred levels. However, the precise ordering between the impulse thresholds \(x^{\pm }_r\) and the switching thresholds y’s is not clear a priori and will emerge as part of the overall equilibrium construction.
$$\begin{aligned} x^{\pm }_\ell< x^{\pm *}_\ell \quad \text {and} \quad x^{\pm *}_h < x^{\pm } _h , \end{aligned}$$
(12)
$$\begin{aligned} y_\ell< {\bar{X}}_c < y_h , \end{aligned}$$
(13)
2.3 Illustration of Competitive Dynamics
To further understand the market evolution under competition of the producer and consumer, we focus on the case where both players are active. The producer’s strategy is summarized via a \(2\times 4\) matrix \({\mathcal {C}}_p\) which lists the thresholds \(x^{\pm }_\ell ,x^{\pm }_h\) and the target levels \(x^{\pm *}_\ell ,x^{\pm *}_h\). Thus, the nointervention regions are \([x^{\pm }_\ell , x^{\pm }_h]\) and impulse amounts are \(x^{\pm }_hx^{\pm *}_h, x^{\pm *}_\ell x^{\pm }_\ell \):The consumer has two switching thresholds \(y_\ell , y_h\); in a typical setup, we expect them to satisfy the following orderingNote that in the expansion regime (drift \(\mu _+\)), we assume that \(y_h < x^+_h\). Therefore, coming from below, \(X^*_t\) hits \(y_h\) first, causing the consumer to switch into the contraction regime with drift \(\mu _\). As a result, the impulse threshold \(x^+_h\) is not effective, i.e., it will never get triggered along an equilibrium path of \((X_t)\). Similar argument implies that \(x^_\ell \) is not effective either if \(x^_\ell <y_\ell \). In the left panel of Fig. 1, we illustrate such thresholdbased vertical competition among the two players.
$$\begin{aligned} {\mathcal {C}}_p=\begin{bmatrix} x^+_\ell , &{} x^{+*}_\ell , &{} x^{+*}_h, &{} x^{+}_h\\ x^_\ell , &{} x^{*}_\ell , &{} x^{*}_h, &{} x^{}_h\\ \end{bmatrix}. \end{aligned}$$
(14)
$$\begin{aligned} x^_\ell<y_\ell<y_h<x_h^+. \end{aligned}$$
(15)
×
To illustrate competitive dynamics, the right panel of Fig. 1 shows a sample trajectory of \((X^*_t)\) (the superscript emphasizing the fact that we are now looking at equilibrium) with producer and consumer strategiesAccording to the above discussion, the effective thresholds are \((x^+_\ell , y_h)\) when \(\mu _t=\mu _+\), or \((y_\ell , x^_h)\) when \(\mu _t=\mu _\). In other words, in the expansion regime, \((X_t)\) will be between [1.0, 1.8] and in the contraction regime it will be between [1.2, 2.0]. In Fig. 1 (Right), we start in the contraction regime with \(X_0=1.5\) and \(\mu _0=\mu _\). On this trajectory, \((X^*_t)\) moves down until it touches the consumer’s threshold \(y_\ell \), where the consumer switches to a positive drift to draw the price up. Nevertheless, the price keeps decreasing and hits \(x^+_\ell =1.0\), whereby the producer intervenes and pushes it to \(x^{+*}_\ell =1.3\). Prices then continue to rise up to \(y_h = 1.8\) at which point the consumer switches again and starts pushing them back down (supposedly she wishes to keep them somewhere around 1.5). This cyclic behavior continues ad infinitum, yielding a stationary distribution for the pair \((X^*_t, \mu ^*_t)\). Note that the consumer uses her switching control to keep \(X^*_t\) from going too high or too low, essentially cycling between \(y_\ell \) and \(y_h\). Indeed, starting at \(X^*_t = y_\ell \), the consumer switches to expansion which causes prices to trend up; once they hit \(y_h\), the consumer switches to contraction, causing prices to trend down. As a result, \(\mu _t\) alternates between \(\mu _+, \mu _\) generating a meanreverting behavior. Throughout, the producer acts as a “backup,” explicitly forcing prices from becoming extreme (namely from falling in the expansion regime, or rising in the contraction regime). These additional interventions by the producer make the domain of \((X^*_t)\) bounded.
$$\begin{aligned} {\mathcal {C}}_p=\begin{bmatrix} 1.0, &{}1.3,&{} 1.7,&{} 2.0\\ 1.0,&{} 1.3, &{} 1.7, &{} 2.0 \end{bmatrix}, \quad \quad (y_\ell , y_h)=(1.2, 1.8). \end{aligned}$$
It is also possible that, say, \(x^+_h < y_h\) so that in the expansion regime the producer will act first both when \((X^*_t)\) falls (impulse threshold \(x^+_\ell \)) and when \((X^*_t)\) rises (\(x^+_h\)), making the consumer inactive. In that case it is plain to see that the drift \(\mu _t \equiv \mu _+\) will stay positive forever; \((X_t)\) will be forced to a bounded domain but will not have meanreverting dynamics since the drift is constant. Instead, it will experience repeated impulses downward to counteract the upward trend due to ongoing consumption growth.
3 BestResponse Functions
To obtain a thresholdtype feedback Nash equilibrium, we view it as a fixed point of the producer and consumer bestresponse maps. Therefore, our overall strategy is to (i) characterize thresholdtype switching strategies for the consumer given a prespecified, thresholdtype behavior by the producer; (ii) characterize thresholdtype impulse strategies for the producer who faces a prespecified regimeswitching behavior of \((X_t)\); (iii) employ tâtonnement, i.e., iteratively apply the bestresponse controls alternating between the two players to construct an interior, nonpreemptive equilibrium satisfying the ordering (15).
To analyze bestresponse strategies, we utilize stochastic control theory, rephrasing the related dynamic optimization objectives through variational inequalities (VI) for the jumpdiffusion dynamics (4). The competitor thresholds then act as boundary conditions in the VIs. To establish the desired equilibrium, we need to verify that the best response is also of thresholdtype and solves the expected systems of equations. We note that all three pieces above are new and we have not been able to find precise analogues of the needed verification theorems in the extant literature. Nevertheless, they do build upon similar singleagent control formulations, so the overall technique is conceptually clear.
3.1 Consumer Best Response
Fixing impulse thresholds \(x^\pm _r\) (\(r=h,l\)), the consumer faces a twostate switching control problem on the bounded domain \((x^\pm _\ell , x^\pm _h)\). Namely, given a producer’s impulse strategy \((\tau _i ,\xi _i)_{i \ge 1}\) with \(\tau = \inf \{ t : X_t \notin [x^{\pm }_\ell , x^{\pm }_h ] \}\), we expect the following stochastic representation for her value functions \(w^\pm (x)\) with \(x \in [x^{\pm }_\ell , x^{\pm }_h ]\)where \({\mathbb {E}}_{x,\pm }\) denotes expectation with respect to \(\mu _t \in \{\mu _, \mu _+\}\) and \(h_{\pm }\) are the fixed intervention costs of the consumer. The above is a system of two coupled equations, which locally resembles an optimal stopping problem with running payoff \(\pi _c(\cdot )\), reward \(w^\mp (\cdot )\) (last term), and stoploss payoff (middle term) \(w^\mp (\cdot )\) due to the producer impulse at \(\tau \). This is almost the formulation as considered in [3] except with two modifications:Now, given a producer strategy \({\mathcal {C}}_p\), if the consumer’s response is such that \(y_\ell < x^_\ell \) and \(x^+_h < y_h\), the consumer will be stuck forever in the initial regime because the price touches \( x^_\ell \) before \(y_\ell \) in the contraction regime and \(x^+_h\) before \(y_h\) in the expansion regime. In this case, the price will oscillate between \(x^_\ell \) and \(x^_h\) if the initial market is in the contraction regime, and between \(x^+_\ell \) and \(x^+_h\) in the expansion regime.
$$\begin{aligned} w^\pm (x)&= \sup _{ \sigma \in {\mathcal {T}} }{\mathbb {E}}_{x,\pm }\Bigg [\int _0^{{\tau }\wedge \sigma }e^{\beta t} \pi _c (X_t) \text {d}t +e^{\beta \underline{\tau }}\mathbb {1}_{\{\tau < \sigma \}} \Big (w^\pm (X_{\tau }\xi )\Big ) \nonumber \\&\quad +\,e^{\beta \underline{\tau }}\mathbb {1}_{\{\tau > \sigma \}}\Big (w^\mp (X_{\sigma })  h_\pm \Big )\Bigg ], \end{aligned}$$
(16)

The domain is bounded on both sides (previously there was a onesided stoploss region).

The boundary condition \(w^+(x_\ell ) = w^+(x^{+*}_\ell )\) is autonomous but nonlocal. Therefore, the two stoppingtype VIs for the consumer are coupled only through the free boundaries, not through the stoploss thresholds as in [3].
In the case where the consumer’s response satisfies \(y_\ell < x^_\ell \) and \(y_h < x^+_h\), depending on the initial state, the consumer will switch once to the expansion regime or will be stuck in the initial expansion regime. If the initial regime is \(\mu _+\), the price will touch \(y_h\), the regime will switch to contraction, the price will never touch \(y_\ell \) and will oscillate between \(x^ _\ell \) and \(x^_h\). If the initial state is already \(\mu _+\), no switch of regime will ever occur. The same reasoning applies for the symmetric case where \(x^_\ell < y_\ell \) and \(x^+_h < y_h\).
Finally, if the consumer’s response satisfies \(x^_\ell < y_\ell \) and \(y_h < x^+_h\), then whatever the initial regime, the state \((\mu _t)\) will switch many times between the two regimes.
The best response of the consumer consists in picking the best response among the three possible ones above. Thus, we distinguish three cases:
(a) NoSwitch
The consumer is completely inactive and simply collects her payoff based on the strategy \((x^{\pm }_{\ell ,h})\).
(b) Single Switch
The consumer always prefers one regime to the other. Then, she is inactive (like in case (a) above) in the preferred regime and faces an optimal stopping (since there is only a single switch to consider) problem in the other regime.
(c) Multiple Switch
The consumer switches back and forth between both regimes: the continuation region is \((y_\ell ,y_h)\).
Proposition 1 provides the value function of the consumer in case (a). The system (24) characterizes the game payoff in case (b), and Proposition 2 provides the value function of the consumer in case (c).
3.1.1 NoSwitch
Regardless of the consumer strategy, in the continuation region, a direct application of the Feynman–Kac formula on (16) shows that her value function solves the following ordinary differential equation (ODE)Solving this inhomogeneous secondorder ODE, we obtain \(w^{\pm }(x)=\widehat{\omega }^{\pm }(x)+u^{\pm }(x)\), where letting \(\theta _2 ^\pm<0< \theta _1^\pm \) be the two real roots of the quadratic equation \(\beta + \mu _\pm z + \frac{1}{2}\sigma ^2 z^2 =0\),When the consumer is inactive (denoted by \(w^\pm _0\)), the continuation region is \([x_\ell ^\pm , x_h^\pm ]\) with the boundary conditions at the impulse levelsFrom (19) the respective coefficients \(\lambda ^{\pm }_{1,0}, \lambda ^{\pm }_{2,0}\) are solved from the following uncoupled linear system:For \(x > x^{\pm }_h\), we take \(w^\pm _0(x) = w^\pm _0(x^{\pm *}_h)\) and similarly in the contraction regime, we take \(w^\pm _0(x) = w^\pm _0(x^{\pm *}_\ell )\) for \(x<x^{\pm }_\ell \).
$$\begin{aligned}  \beta w + \mu _\pm w_x + \frac{1}{2} \sigma ^2 w_{xx} + \pi _c(x) = 0. \end{aligned}$$
(17)

\(u^{\pm }(x)=\lambda ^{\pm }_{1}e^{\theta ^{\pm }_1 x}+\lambda ^{\pm }_{2}e^{\theta ^{\pm }_2 x}\) solves the homogeneous ODE \(\beta u + \mu _\pm u_x + \frac{1}{2}\sigma ^2 u_{xx} =0\) and \(\lambda ^\pm _{i,0}\), \(i=1,2\) are to be determined from appropriate boundary conditions;

\(\widehat{\omega }^{\pm }(x)\) is a particular solution to (17), given by$$\begin{aligned} \widehat{\omega }^\pm (x)= & {} E x^2 + F_\pm x + G_\pm \qquad \text { where }\nonumber \\ E= & {} \frac{\gamma _2}{\beta }, \quad F_\pm = \frac{1}{\beta }\Big ( \gamma _1 + 2 \mu _\pm \frac{\gamma _2}{\beta }\Big ), \quad G_\pm = \frac{1}{\beta } \big (\gamma _0 + \sigma ^2 \frac{\gamma _2}{\beta } + \mu _\pm F_\pm \big ). \end{aligned}$$(18)
$$\begin{aligned} w^{\pm }_0(x^{\pm }_r)=w^{\pm }_0(x^{\pm *}_r), \qquad r\in \{\ell , h\}. \end{aligned}$$
(19)
$$\begin{aligned} \lambda ^{\pm }_{1,0}\cdot \big [e^{\theta ^{\pm }_1 x^{\pm }_\ell }e^{\theta ^{\pm }_1 x^{\pm *}_\ell }\big ]+\lambda ^{\pm }_{2,0}\cdot \big [e^{\theta ^{\pm }_2 x^{\pm }_\ell }e^{\theta ^{\pm }_2 x^{\pm *}_\ell }\big ]=\widehat{\omega }^\pm (x^{\pm *}_\ell )\widehat{\omega }^\pm (x^{\pm }_\ell ), \end{aligned}$$
(20)
$$\begin{aligned} \lambda ^{\pm }_{1,0}\cdot \big [e^{\theta ^{\pm }_1 x^{\pm }_h}e^{\theta ^{\pm }_1 x^{\pm *}_h}\big ]+\lambda ^{\pm }_{2,0}\cdot \big [e^{\theta ^{\pm }_2 x^{\pm }_h}e^{\theta ^{\pm }_2 x^{\pm *}_h}\big ]=\widehat{\omega }^\pm (x^{\pm *}_h)\widehat{\omega }^\pm (x^{\pm }_h). \end{aligned}$$
(21)
Proposition 1
Let \((\lambda ^\pm _{1,0}, \lambda ^\pm _{2,0}) \in {\mathbb {R}}^4\) be the solution to the system (20), (21). Then the functions \(w^\pm _0 (x)\), \(x \in [x_\ell ^\pm , x_h ^\pm ]\), are the value functions for an inactive consumer, i.e., \(w_0 ^\pm (x) = J_c ^\pm (x; N, \mu ^\pm )\), where N is the producer impulse strategy associated with the thresholds \((x_\ell ^\pm , x_\ell ^{\pm *}; x_h ^\pm , x_h ^{\pm *})\) with \(x_\ell ^\pm < x_h ^\pm \).
The role of \(w^\pm _0(\cdot )\) is important for judging the other two cases, and moreover for deciding whether the best response ought to be of thresholdtype.
3.1.2 Single Switch
We next consider the situation where the payoff in the expansion regime is higher than the contraction one for any price x, so that the consumer is never incentivized to switch to the contraction regime. We then expect the consumer’s corresponding best response to be either a singleswitch strategy (to the preferred regime) or noswitch (if already there). Economically, this corresponds to \(y_h > x_h^{+}\) so that as the price rises, the producer impulses \((X_t)\) down, and the consumer is not intervening to decrease her demand. As a result, the consumer never switches (except perhaps the first time from negative to positive drift) and \(\lim _{t\rightarrow \infty } \mu _t = \mu _+\). This can be observed when demand switching is very expensive, so that the producer has full market power and is able to keep prices consistently low. The consumer is forced to be in the expansion regime forever, and she is not able to influence \((X_t)\).
Suppose that the consumer prefers expansion regime (\(\mu _t=\mu _+\)) and adopts thresholdtype strategies. Given \({\mathcal {C}}_p\), her strategy is summarized byand the resulting contractionregime value function \(w^\) should be a solution to the variational inequalitywhere \(w^+_0\) is from Proposition 1 and the continuation region is \([y_\ell , x_h^]\). This is a standard optimal stopping problem. Note that while the above equation for \(w^\) depends on \(w^+_0\), the equation for \(w^+_0\) is autonomous—the system of equations becomes decoupled because the two regimes of \((\mu _t)\) no longer communicate.
$$\begin{aligned} y_\ell > x^_\ell , \qquad y_h=+\infty , \end{aligned}$$
$$\begin{aligned} \sup \big \{\beta w^ +\mu _ w^_x+\frac{1}{2}\sigma ^2 w^_{xx}+ \pi _c ;\, w^+_0h_w^\big \}=0, \end{aligned}$$
(22)
To solve (22), we posit that her best response is of the formwith the smoothpasting and boundary conditions:The system (24) is to be solved for the three unknowns \(y_\ell , \lambda ^_{1}, \lambda ^_2\), while \(\lambda ^+_{1,0},\lambda ^+_{2,0}\) are the coefficients of the consumer’s payoff associated with the noswitch strategy in the \(\mu _+\) regime, see previous subsection. We can rewrite it as first solving for \(\lambda ^{}_{1,2}\) from the linear systemand then determining \(y_\ell \) from the smooth pasting \({\mathcal C}^1\)regularityThe case of a single switch from expansion to contraction regime can be treated analogously in a symmetric way.
$$\begin{aligned} w^(x)&={\left\{ \begin{array}{ll} w^+_0(x)h_, &{} x\le y_\ell ,\\ \widehat{\omega }^ (x)+\lambda ^ _1e^{\theta ^ _1x}+\lambda ^ _2e^{\theta ^ _2x}, &{} y_\ell< x < x^_h,\\ w^(x^{*}_h), &{} x^_h \le x, \end{array}\right. } \end{aligned}$$
(23)
$$\begin{aligned} {\left\{ \begin{array}{ll} \widehat{\omega }^(y_\ell )+\lambda ^_1e^{\theta ^_1y_\ell }+\lambda ^_2e^{\theta ^_2y_\ell }=\widehat{\omega }^+(y_\ell )+\lambda ^+_{1,0}e^{\theta ^+_1y_\ell }+\lambda ^+_{2,0}e^{\theta ^+_2y_\ell }h_, &{} ({\mathcal {C}}^0\text { at }y_\ell )\\ \widehat{\omega }^(x^_h)+\lambda ^_1e^{\theta ^_1x^_h}+\lambda ^_2e^{\theta ^_2x^_h}=\widehat{\omega }^(x^{*}_h)+\lambda ^_1e^{\theta ^_1x^{*}_h}+\lambda ^_2e^{\theta ^_2x^{*}_h}, &{} ({\mathcal {C}}^0\text { at }x^_h)\\ \widehat{\omega }^_x(y_\ell )+\lambda ^_1\theta ^_1e^{\theta ^_1y_\ell }+\lambda ^_2\theta ^_2e^{\theta ^_2y_\ell }=\widehat{\omega }^+_x(y_\ell )+\lambda ^+_{1,0}\theta ^+_1e^{\theta ^+_1y_\ell }+\lambda ^+_{2,0}\theta ^+_2e^{\theta ^+_2y_\ell }. &{} ({\mathcal {C}}^1\text { at }y_\ell ) \end{array}\right. } \end{aligned}$$
(24)
$$\begin{aligned} \begin{bmatrix} e^{\theta ^_1y_\ell } &{} e^{\theta ^_2y_\ell } \\ e^{\theta ^_1 x^_h}e^{\theta ^_1 x^{*}_h} &{} e^{\theta ^_2 x^_h}e^{\theta ^_2 x^{*}_h} \end{bmatrix} \cdot \begin{bmatrix} \lambda ^_1\\ \lambda ^_2 \end{bmatrix} =\begin{bmatrix} w^+_0(y_\ell )\widehat{\omega }^(y_\ell )h_\\ \widehat{\omega }^(x^{*}_h)\widehat{\omega }^(x^{}_h) \end{bmatrix} \end{aligned}$$
(25)
$$\begin{aligned} w^_x(y_\ell )=w^+_{0,x}(y_\ell ). \end{aligned}$$
(26)
3.1.3 Double Switch
Finally, we consider the main case where the consumer adopts thresholdtype switches, i.e., the ordering in (15) holds. Given \({\mathcal {C}}_p\), the \(w^\pm \) are then supposed to be a solution to the coupled variational inequalitieswhere we expect continuation regions of the form \((x_\ell ^+, y_h)\) and \((y_\ell , x_h ^)\). To set up a verification argument for the consumer’s best response, we make the ansatz This yields 6 equations:The six equations can be split into a linear system for the four coefficients \(\lambda ^\pm _{1,2}\)’sand the smoothpasting conditions determining the two switching thresholds \(y_{\ell ,h}\) (viewed as free boundaries)
$$\begin{aligned} \sup \big \{\beta w^+ + \mu _+w^+_x+\frac{1}{2}\sigma ^2 w^+_{xx}+\pi _c;\, \max \{w^h_+, w^+ \}w^+\big \}&=0, \end{aligned}$$
(27)
$$\begin{aligned} \sup \big \{\beta w^ + \mu _w^_x+\frac{1}{2}\sigma ^2 w^_{xx}+\pi _c;\, \max \{w^+h_, w^\}w^\big \}&=0, \end{aligned}$$
(28)
$$\begin{aligned} w^+(x)&={\left\{ \begin{array}{ll} w^+(x^{+*}_\ell ), &{} x\le x^+_\ell ,\\ \widehat{\omega }^+(x)+\lambda ^+_1e^{\theta ^+_1x}+\lambda ^+_2e^{\theta ^+_2x}, &{} x^+ _\ell< x < y_h,\\ w^(x)h_+, &{} x \ge y_h, \end{array}\right. } \end{aligned}$$
(29a)
$$\begin{aligned} w^(x)&={\left\{ \begin{array}{ll} w^+(x)h_, &{} x \le y_\ell ,\\ \widehat{\omega }^(x)+\lambda ^_1e^{\theta ^_1x}+\lambda ^_2e^{\theta ^_2x}, &{} y_\ell< x < x^_h ,\\ w^(x^{*}_h), &{} x\ge x^_h . \end{array}\right. } \end{aligned}$$
(29b)
$$\begin{aligned} {\left\{ \begin{array}{ll} \widehat{\omega }^+(y_\ell )+\lambda ^+_1e^{\theta ^+_1y_\ell }+\lambda ^+_2e^{\theta ^+_2y_\ell } h_ =\widehat{\omega }^(y_\ell )+\lambda ^_1e^{\theta ^_1y_\ell }+\lambda ^_2e^{\theta ^_2y_\ell }, &{}({\mathcal {C}}^0\text { at } y_\ell )\\ \widehat{\omega }^+(x^+_\ell )+\lambda ^+_1e^{\theta ^+_1x^+_\ell }+\lambda ^+_2e^{\theta ^+_2x^+_\ell }=\widehat{\omega }^+(x^{+*}_\ell )+\lambda ^+_1e^{\theta ^+_1x^{+*}_\ell }+\lambda ^+_2e^{\theta ^+_2x^{+*}_\ell }, &{} ({\mathcal {C}}^0\text { at }x^+_\ell )\\ \widehat{\omega }^(y_h)+\lambda ^_1e^{\theta ^_1y_h}+\lambda ^_2e^{\theta ^_2y_h} h_+=\widehat{\omega }^+(y_h)+\lambda ^+_1e^{\theta ^+_1y_h}+\lambda ^+_2e^{\theta ^+_2y_h} , &{}({\mathcal {C}}^0\text { at } y_h)\\ \widehat{\omega }^(x^{}_h)+\lambda ^_1e^{\theta ^_1x^{}_h}+\lambda ^_2e^{\theta ^_2x^{}_h}=\widehat{\omega }^(x^{*}_h)+\lambda ^_1e^{\theta ^_1x^{*}_h}+\lambda ^_2e^{\theta ^_2x^{*}_h}, &{} ({\mathcal {C}}^0\text { at }x^_h)\\ \widehat{\omega }^+_x(y_\ell )+\lambda ^+_1\theta ^+_1e^{\theta ^+_1y_\ell }+\lambda ^+_2\theta ^+_2e^{\theta ^+_2y_\ell }=\widehat{\omega }^_x(y_\ell )+\lambda ^_1\theta ^_1e^{\theta ^_1y_\ell }+\lambda ^_2\theta ^_2e^{\theta ^_2y_\ell }, &{}({\mathcal {C}}^1\text { at } y_\ell )\\ \widehat{\omega }^_x(y_h)+\lambda ^_1\theta ^_1e^{\theta ^_1y_h}+\lambda ^_2\theta ^_2e^{\theta ^_2y_h}=\widehat{\omega }^+_x(y_h)+\lambda ^+_1\theta ^+_1e^{\theta ^+_1y_h}+\lambda ^+_2\theta ^+_2e^{\theta ^+_2y_h}. &{}({\mathcal {C}}^1\text { at } y_h)\\ \end{array}\right. } \end{aligned}$$
(30)
$$\begin{aligned}&\begin{bmatrix} e^{\theta ^+_1y_\ell } &{} e^{\theta ^+_2y_\ell } &{} e^{\theta ^_1y_\ell } &{} e^{\theta ^_2y_\ell } \\ e^{\theta ^+_1 x^+_\ell }e^{\theta ^+_1 x^{+*}_\ell } &{} e^{\theta ^+_2 x^+_\ell }e^{\theta ^+_2 x^{+*}_\ell } &{} 0 &{} 0 \\ e^{\theta ^+_1y_h} &{} e^{\theta ^+_2y_h} &{} e^{\theta ^_1y_h} &{} e^{\theta ^_2y_h} \\ 0 &{} 0 &{} e^{\theta ^_1 x^_h}e^{\theta ^_1 x^{*}_h} &{} e^{\theta ^_2 x^_h}e^{\theta ^_2 x^{*}_h} \end{bmatrix} \cdot \begin{bmatrix} \lambda ^+_1\\ \lambda ^+_2\\ \lambda ^_1\\ \lambda ^_2 \end{bmatrix}\nonumber \\&\quad =\begin{bmatrix} \widehat{\omega }^(y_\ell )\widehat{\omega }^+(y_\ell )h_+\\ \widehat{\omega }^+(x^{+*}_\ell )\widehat{\omega }^+(x^{+}_\ell )\\ \widehat{\omega }^+(y_h)\widehat{\omega }^(y_h)h_\\ \widehat{\omega }^(x^{*}_h)\widehat{\omega }^(x^{}_h) \end{bmatrix} \end{aligned}$$
(31)
$$\begin{aligned} w^+_x(y_r)=w^_x(y_r), \qquad r\in \{\ell , h\}. \end{aligned}$$
(32)
Proposition 2
Let the 6tuple \((\lambda ^\pm _1, \lambda ^\pm _2, y_h, y_\ell )\) be a solution to the system (31), (32) such that the order in (15) is fulfilled. Then, the functions defined in (29) give the bestresponse payoffs of consumer, and a bestresponse strategy is given by \(({\hat{\sigma }}_i)_{i \ge 1}\), wherewith \(\Gamma _c ^+ = [y_\ell ,+\infty )\) and \(\Gamma _c ^ = (\infty , y_h]\).
$$\begin{aligned} {\hat{\sigma }}_{0} =0, \quad {\hat{\sigma }}_i = \inf \left\{ t > {\hat{\sigma }}_{i1} : X_t \in \Gamma _c (t) \right\} , \quad i \ge 1, \end{aligned}$$
Figure 2 illustrates the shapes of the consumer’s value function in the different case of best response. For the strategy given, we have a dominant function in the contraction regime (\(w^_0\)) and a dominant function in the expansion regime (\(w_0^+\)).
×
Remark
For comparison purposes, it is also useful to know the continuation region of the consumer when she alone controls the market price \((X_t)\). As usual, this region is \((\infty ,y_h)\) in the expansion regime and \((y_\ell , +\infty )\) in the contraction regime, with the natural ordering \(y_\ell < y_h\). The value functions \(w^\pm \) satisfy:
$$\begin{aligned} \sup \big \{ \beta w^+ + \mu _+ w^+_x +\frac{1}{2}\sigma ^2 w^+_{xx} + \pi _c;\, w^  h_+ \big \}&=0, \end{aligned}$$
(33)
$$\begin{aligned} \sup \big \{ \beta w^ + \mu _ w^_x + \frac{1}{2}\sigma ^2 w^_{xx} + \pi _c; \, w^+  h_ \big \}&=0. \end{aligned}$$
(34)
To set up a verification argument for the consumer’s best response, we make the ansatz Furthermore, in the expansion regime, to keep \(w^+(x)\) bounded as \(x \rightarrow \infty \) we must have \(\lambda _{2,0}^+ =0\) because \(\theta _2^+<0\). In the contraction regime, a similar argument gives \(\lambda _{1,0}^ = 0\). We are left with the four unknowns \(y_\ell , y_h\) and \(\lambda _{1,0}^p\) and \(\lambda _{2,0}^\) determined from the following smoothpasting conditions:\(\square \)
$$\begin{aligned} w^+(x)&= {\left\{ \begin{array}{ll} w^(y_h)  h_+, &{} x \ge y_h, \\ \widehat{\omega }^+(x) + \lambda _{1,0}^+ e^{\theta _1^+ x} + \lambda _{2,0}^+ e^{\theta _2^+ x}, &{} x < y_h, \end{array}\right. } \end{aligned}$$
(35a)
$$\begin{aligned} w^(x)&= {\left\{ \begin{array}{ll} \widehat{\omega }^(x) + \lambda _{1,0}^ e^{\theta _1^ x} + \lambda _{2,0}^ e^{\theta _2^ x}, &{} x > y_\ell ,\\ w^+(y_\ell ) h_, &{} x \le y_\ell . \end{array}\right. } \end{aligned}$$
(35b)
$$\begin{aligned} {\left\{ \begin{array}{ll} \widehat{\omega }^(y_\ell ) + \lambda ^_{2,0} e^{\theta ^_2 y_\ell } = \widehat{\omega }^+(y_\ell ) + \lambda _{1,0}^+ e^{\theta _1^+ y_\ell }  h_, &{}({\mathcal {C}}^0\text { at } y_\ell )\\ \widehat{\omega }^+(y_h) + \lambda ^+_{1,0} e^{\theta ^+_1 y_h} = \widehat{\omega }^(y_h) + \lambda _{2,0}^ e^{\theta _2^ y_h}  h_+, &{}({\mathcal {C}}^0\text { at } y_h) \\ \widehat{\omega }^_{x}(y_\ell ) + \lambda ^_{2,0} \theta ^_2 e^{\theta ^_2 y_\ell } = \widehat{\omega }^+_{x}(y_\ell ) + \lambda ^+_{1,0} \theta ^+_1 e^{\theta ^+_1 y_\ell }, &{}({\mathcal {C}}^1\text { at } y_\ell )\\ \widehat{\omega }^+_{x}(y_h) + \lambda ^+_{1,0} \theta ^+_1 e^{\theta ^+_1 y_h} = \widehat{\omega }^_{x}(y_h) + \lambda ^_{2,0} \theta ^_2 e^{\theta ^_2 y_h}. &{}({\mathcal {C}}^1\text { at } y_h)\\ \end{array}\right. } \end{aligned}$$
(36)
3.2 Producer Best Response
We now consider the best response of the producer, given the consumer’s switching strategy denoted by \({\mathcal {C}}_c:=[y_\ell , y_h]\). Once again, we face three cases:
1.
The producer is a monopolist, i.e., the consumer is completely inactive;
2.
The consumer adopts a singleswitch strategy;
3.
The consumer adopts a doubleswitch strategy.
3.2.1 Producer as Sole Optimizer
To begin with, we determine the monopolylike strategy of the producer assuming the consumer adopts a noswitch strategy. In that case, \(\mu _t\) is constant throughout and the functions \(v^{\pm }\) of the producer satisfy the variational inequality (VI):Note that the two VIs for \(v^+\) and \(v^\) are autonomous, hence uncoupled from each other. In the continuation region, the general solution of the ODEis of the form \(v^\pm (x)\) \(=\) \({\widehat{v}}^\pm (x) + u^\pm (x)\), where \(u^\pm = \nu ^\pm _1e^{\theta ^\pm _1x}+\nu ^\pm _2e^{\theta ^\pm _2x}\), with \(\theta ^\pm _1, \theta _2 ^\pm \) as before, satisfies the homogenous ODE \( \beta u +\mu _\pm u_x + \frac{1}{2} \sigma ^2 u_{xx} = 0\), and \({\widehat{v}}^\pm (x)\) is a particular solution given bywhere the coefficients \(A, B_\pm , C_\pm \) are identified as:Assuming the producer adopts thresholdtype impulse strategies defined by \(\xi ^*(x)\) in the intervention region, her expected payoff is of the form:When applying the optimal impulse \(\xi ^{\pm *}(x)\) at the threshold \(x_{r}^\pm , r = \ell ,h\), the producer brings \(X_t\) back to the price level \(x^{\pm *}_{r}\) \(:=\) \(x^\pm _{r}  \xi ^{\pm *}(x^\pm _{r})\). For optimality, the respective impulse amounts satisfy the first order conditionsWe reinterpret the above as the equation to be satisfied by \(\xi ^*(x_r^{\pm })\) which are treated temporarily as unknowns and plugged into further equations. To ensure that the value function is continuous at \(x^\pm _{r}\), we further needFinally, making the hypothesis that the value function is differentiable at the borders of the intervention region, we have:
$$\begin{aligned} \sup \Big \{  \beta v^\pm + \mu _{\pm } v^\pm _x + \frac{1}{2} \sigma ^2 v^\pm _{xx} + \pi _p \; , \sup _{\xi } \big \{ v^\pm (\cdot + \xi )  v^\pm (\cdot )  K_p (\xi ) \big \} \Big \}&= 0. \end{aligned}$$
(37)
$$\begin{aligned}  \beta v + \mu _{\pm } v_x + \frac{1}{2} \sigma ^2 v_{xx} + \pi _p(x) = 0 \end{aligned}$$
$$\begin{aligned} {\widehat{v}}^\pm (x) = A x^2 + B_\pm x + C_\pm , \end{aligned}$$
(38)
$$\begin{aligned} A&= \frac{d_1}{\beta }, \quad B_\pm = \frac{1}{\beta }\Big ( d_0  \frac{2 \, \mu _\pm \, d_1}{\beta } + c_p \, d_1\Big ), \quad C_\pm = \frac{1}{\beta } \big (\mu _\pm B_\pm + A \sigma ^2  c_p d_0 \big ). \end{aligned}$$
$$\begin{aligned} v^\pm (x)&={\left\{ \begin{array}{ll} v^\pm (x^{\pm *}_h)K_p(\xi ^*(x)) &{} x\ge x^{\pm }_h,\\ \widehat{v}^\pm (x)+\nu ^\pm _1e^{\theta ^\pm _1x}+\nu ^\pm _2e^{\theta ^\pm _2x}, &{} x^{\pm }_\ell<x<x^{\pm }_h,\\ v^\pm (x^{\pm *}_\ell )K_p( \xi ^*(x)) &{} x\le x^{\pm }_\ell . \end{array}\right. } \end{aligned}$$
(39)
$$\begin{aligned} v_x^\pm (x^{\pm *}_h)&= \partial _\xi K_p(\xi ^*(x^\pm _h)), \qquad v_x^\pm (x^{\pm *}_\ell ) =\partial _\xi K_p(\xi ^*(x^\pm _\ell )). \end{aligned}$$
(40)
$$\begin{aligned} v^\pm (x^\pm _{r})&= v^\pm (x^{\pm *}_{r})  K_p (\xi ^{\pm *} _r). \end{aligned}$$
(41)
$$\begin{aligned} v^\pm _x(x^\pm _{\ell })&= v^\pm _x(x^{\pm *}_{\ell })  \partial _\xi K_p(\xi ^*(x^\pm _\ell )) , \end{aligned}$$
(42a)
$$\begin{aligned} v^\pm _x(x^\pm _{h})&= v^\pm _x(x^{\pm *}_{h})  \partial _\xi K_p(\xi ^*(x^\pm _h)). \end{aligned}$$
(42b)
We consider two cases of impulse costs: (i) constant \(K_p(\xi ) =\kappa _0\) and (ii) linear \(K_p(\xi ) = \kappa _0 + \kappa _1 \xi \). In case (i), because the impulse cost is independent of the intervention amount there will be an optimal impulse level \(x^{\pm *}_r\) so that for any x in the intervention region the strategy is to impulse back to \(x^{\pm *}_r\) which is the same at the two thresholds. In case (ii), \(\partial _\xi K_p = \pm \kappa _1\) and all the smooth pasting and boundary conditions can be gathered in the following system:Note that there are two uncoupled linear systems for \(v^+\) and \(v^\). The \({\mathcal C}^0\) conditions are from (41), the first two \({\mathcal C}^1\) conditions are from (40) which determines the optimal impulse destination, and the last two \({\mathcal C}^1\) conditions are from (42).
$$\begin{aligned} {\left\{ \begin{array}{ll} \widehat{v}^\pm (x^{\pm }_h)+\nu ^\pm _1e^{\theta ^\pm _1x^{\pm }_h}+\nu ^\pm _2e^{\theta ^\pm _2x^{\pm }_h}=\widehat{v}^\pm (x^{\pm *}_h)+\nu ^\pm _1e^{\theta ^\pm _1x^{\pm *}_h}+\nu ^\pm _2e^{\theta ^\pm _2x^{\pm *}_h}\kappa _0\kappa _1(x^{\pm }_hx^{\pm *}_h), &{} ({\mathcal {C}}^0\text { at }x^{\pm }_h)\\ \widehat{v}^\pm (x^{\pm }_\ell )+\nu ^\pm _1e^{\theta ^\pm _1x^{\pm }_\ell }+\nu ^\pm _2e^{\theta ^\pm _2x^{\pm }_\ell }=\widehat{v}^\pm (x^{\pm *}_\ell )+\nu ^\pm _1e^{\theta ^\pm _1x^{\pm *}_\ell }+\nu ^\pm _2e^{\theta ^\pm _2x^{\pm *}_\ell }\kappa _0\kappa _1(x^{\pm *}_\ell x^{\pm }_\ell ), &{} ({\mathcal {C}}^0\text { at }x^{\pm }_\ell )\\ \widehat{v}^\pm _x(x^{\pm *}_h)+\nu ^\pm _1\theta ^\pm _1e^{\theta ^\pm _1x^{\pm *}_h}+\nu ^\pm _2\theta ^\pm _2e^{\theta ^\pm _2x^{\pm *}_h}=\kappa _1&{} ({\mathcal {C}}^1\text { at }x^{\pm *}_h)\\ \widehat{v}^\pm _x(x^{\pm *}_\ell )+\nu ^\pm _1\theta ^\pm _1e^{\theta ^\pm _1x^{\pm *}_\ell }+\nu ^\pm _2\theta ^\pm _2e^{\theta ^\pm _2x^{\pm *}_\ell }=\kappa _1, &{} ({\mathcal {C}}^1\text { at }x^{\pm *}_\ell )\\ \widehat{v}^\pm _x(x^{\pm }_h)+\nu ^\pm _1\theta ^\pm _1e^{\theta ^\pm _1x^{\pm }_h}+\nu ^\pm _2\theta ^\pm _2e^{\theta ^\pm _2x^{\pm }_h}=\widehat{v}^\pm _x(x^{\pm *}_h)+\nu ^\pm _1\theta ^\pm _1e^{\theta ^\pm _1x^{\pm *}_h}+\nu ^\pm _2\theta ^\pm _2e^{\theta ^\pm _2x^{\pm *}_h}\kappa _1, &{} ({\mathcal {C}}^1\text { at }x^{\pm }_h)\\ \widehat{v}^\pm _x(x^{\pm }_\ell )+\nu ^\pm _1\theta ^\pm _1e^{\theta ^\pm _1x^{\pm }_\ell }+\nu ^\pm _2\theta ^\pm _2e^{\theta ^\pm _2x^{\pm }_\ell }=\widehat{v}^\pm _x(x^{\pm *}_\ell )+\nu ^\pm _1\theta ^\pm _1e^{\theta ^\pm _1x^{\pm *}_\ell }+\nu ^\pm _2\theta ^\pm _2e^{\theta ^\pm _2x^{\pm *}_\ell }+\kappa _1. &{} ({\mathcal {C}}^1\text { at }x^{\pm }_\ell ) \end{array}\right. } \end{aligned}$$
(43)
By a standard verification argument, one can show that if both systems above admit solutions \(\nu ^\pm _{1,2}\) and \(x^\pm _{\ell ,h}\), where the latter satisfy the order condition \(x_\ell ^\pm < x_h ^\pm \), then the functions \(v^\pm (x)\) as in (39) are the value functions of the producer and his optimal strategies are given by the thresholds \(x_{\ell ,h}^\pm \) and impulse amounts \(\xi ^{*}(x_{\ell ,h}^{\pm *})\). This can be done by following exactly the arguments in, e.g., [7] (see also their Remark 2.1), which are very standard in the literature of impulse control problems. Therefore, details are omitted.
3.2.2 NonPreemptive Response
Suppose the following ordering, which is similar to (15), holds:We then expect \(v^\pm \) to solve the VIsTo obtain the producer best response, it suffices to identify the two active impulse thresholds \(x^+_\ell ,x^_h\) and the respective target levels \(x^{+*}_\ell , x^{*}_h\). The other two boundary conditions take place at the consumer thresholds \(y_\ell , y_h\), so that the strategy (see (47)) is \({\mathcal {C}}_p=\begin{bmatrix} x^+_\ell ,&{} x^{+*}_\ell , &{} ,&{}+\infty \\ \infty , &{}  ,&{} x^{*}_h,&{} x^{}_h \end{bmatrix}.\) The game coupling shows up in the additional boundary condition that when the consumer switches, the producer’s value is unaffected:Accordingly, our ansatz is To simplify the presentation, let us concentrate on the proportional impulse costs \(K_p(\xi ) = \kappa _0 + \kappa _1 \xi \). We have the smooth pasting \({\mathcal {C}}^1\) and boundary conditions:Unlike the singleagent setting (43), Eq. (48) are coupled. The coefficients \(\nu ^{\pm }_{1,2}\) are the solution to the linear systemand the thresholds \(x_h^+, x_\ell ^\) are determined by the \({\mathcal C}^1\) smooth pasting (recall that \(x^{*}_h = x^{}_h \xi ^*(x^_h)\), \(x^{+*}_\ell = x^{+}_\ell \xi ^*(x^+_\ell )\)):and the first order conditions (FOCs) giving the optimal impulses:
$$\begin{aligned} x_\ell ^\pm< y_l< y_h < x_h ^\pm .\end{aligned}$$
(44)
$$\begin{aligned} {\left\{ \begin{array}{ll} \sup \big \{\beta v^+ +\mu _+ v^+_x+\frac{1}{2}\sigma ^2v^+_{xx} + \pi _p\; ;\; \sup _\xi (v^+(\cdot \xi )v^+K_p(\xi ))\big \}=0,\\ \sup \big \{\beta v^ + \mu _ v^_x+\frac{1}{2}\sigma ^2v^_{xx} + \pi _p\; ;\; \sup _\xi (v^(\cdot \xi )v^K_p(\xi ))\big \}=0.\\ \end{array}\right. } \end{aligned}$$
(45)
$$\begin{aligned} v^+ ( y) = v^ ( y), \qquad y \in (\infty , y_\ell ] \cup [y_h, +\infty ). \end{aligned}$$
(46)
$$\begin{aligned} v^(x)&={\left\{ \begin{array}{ll} v^(x^{*}_h)K_p( \xi ^*( x)), &{} x\ge x^_h,\\ \widehat{v}^(x)+\nu ^_1e^{\theta ^_1x}+\nu ^_2e^{\theta ^_2x}, &{} y_\ell<x<x^_h,\\ v^+(x), &{} x\le y_\ell , \end{array}\right. } \end{aligned}$$
(47a)
$$\begin{aligned} v^+(x)&={\left\{ \begin{array}{ll} v^(x), &{} x\ge y_h,\\ \widehat{v}^+(x)+\nu ^+_1e^{\theta ^+_1x}+\nu ^+_2e^{\theta ^+_2x}, &{} x^+_\ell<x<y_h,\\ v^+(x^{+*}_\ell )K_p( \xi ^*(x)), &{} x\le x^+_\ell . \end{array}\right. } \end{aligned}$$
(47b)
$$\begin{aligned} {\left\{ \begin{array}{ll} \widehat{v}^+(y_\ell )+\nu ^+_1e^{\theta ^+_1y_\ell }+\nu ^+_2e^{\theta ^+_2y_\ell }=\widehat{v}^(y_\ell )+\nu ^_1e^{\theta ^_1y_\ell }+\nu ^_2e^{\theta ^_2y_\ell }, &{} ({\mathcal {C}}^0\text { at }y_\ell )\\ \widehat{v}^(y_h)+\nu ^_1e^{\theta ^_1y_h}+\nu ^_2e^{\theta ^_2y_h}=\widehat{v}^+(y_h)+\nu ^+_1e^{\theta ^+_1y_h}+\nu ^+_2e^{\theta ^+_2y_h}, &{} ({\mathcal {C}}^0\text { at }y_h)\\ \widehat{v}^+(x^+_\ell )+\nu ^+_1e^{\theta ^+_1x^+_\ell }+\nu ^+_2e^{\theta ^+_2x^+_\ell }=\widehat{v}^+(x^{+*}_\ell )+\nu ^+_1e^{\theta ^+_1x^{+*}_\ell }+\nu ^+_2e^{\theta ^+_2x^{+*}_\ell }K_p( \xi ^*( x^+_\ell )), &{} ({\mathcal {C}}^0\text { at }x^+_\ell )\\ \widehat{v}^(x^{}_h)+\nu ^_1e^{\theta ^_1x^{}_h}+\nu ^_2e^{\theta ^_2x^{}_h}=\widehat{v}^(x^{*}_h)+\nu ^_1e^{\theta ^_1x^{*}_h}+\nu ^_2e^{\theta ^_2x^{*}_h}K_p(\xi ^*( x^_h)), &{}({\mathcal {C}}^0\text { at }x^_h)\\ \widehat{v}^+_x(x^+_\ell )+\nu ^+_1\theta ^+_1e^{\theta ^+_1x^+_\ell }+\nu ^+_2\theta ^+_2e^{\theta ^+_2x^+_\ell }=\widehat{v}^+_x(x^{+*}_\ell )+\nu ^+_1\theta ^+_1e^{\theta ^+_1x^{+*}_\ell }+\nu ^+_2\theta ^+_2e^{\theta ^+_2x^{+*}_\ell } {\kappa _1}, &{} ({\mathcal {C}}^1\text { at }x^+_\ell )\\ \widehat{v}^_x(x^{}_h)+\nu ^_1\theta ^_1e^{\theta ^_1x^{}_h}+\nu ^_2\theta ^_2e^{\theta ^_2x^{}_h}=\widehat{v}^_x(x^{*}_h)+\nu ^_1\theta ^_1e^{\theta ^_1x^{*}_h}+\nu ^_2\theta ^_2e^{\theta ^_2x^{*}_h} {+\kappa _1}. &{}({\mathcal {C}}^1\text { at }x^_h)\\ \widehat{v}^+_x(x_\ell ^{+*})+\nu ^+_1\theta ^+_1e^{\theta ^+_1x_\ell ^{+*}}+\nu ^+_2\theta ^+_2e^{\theta ^+_2 x_\ell ^{+*}} =  \kappa _1&{} {({\mathcal {C}}^1\text { at } x_\ell ^{+*}) }\\ \widehat{v}^_x(x_h^{*})+\nu ^_1\theta ^_1e^{\theta ^_1 x_h^{*}}+\nu ^_2\theta ^_2e^{\theta ^_2 x_h^{*}} = \kappa _1, &{} {({\mathcal {C}}^1\text { at } x_h^{*})} \end{array}\right. } \end{aligned}$$
(48)
$$\begin{aligned}&\begin{bmatrix} e^{\theta ^+_1y_\ell } &{} e^{\theta ^+_2y_\ell } &{} e^{\theta ^_1y_\ell } &{} e^{\theta ^_2y_\ell } \\ e^{\theta ^+_1 x^+_\ell }e^{\theta ^+_1 x^{+*}_\ell } &{} e^{\theta ^+_2 x^+_\ell }e^{\theta ^+_2 x^{+*}_\ell } &{} 0 &{} 0 \\ e^{\theta ^+_1y_h} &{} e^{\theta ^+_2y_h} &{} e^{\theta ^_1y_h} &{} e^{\theta ^_2y_h} \\ 0 &{} 0 &{} e^{\theta ^_1 x^_h}e^{\theta ^_1 x^{*}_h} &{} e^{\theta ^_2 x^_h}e^{\theta ^_2 x^{*}_h} \end{bmatrix} \cdot \begin{bmatrix} \nu ^+_1\\ \nu ^+_2\\ \nu ^_1\\ \nu ^_2 \end{bmatrix}\nonumber \\&\quad =\begin{bmatrix} \widehat{v}^(y_\ell )\widehat{v}^+(y_\ell )\\ \widehat{v}^+(x^{+*}_\ell )\widehat{v}^+(x^{+}_\ell )K_p\\ \widehat{v}^+(y_h)\widehat{v}^(y_h)\\ \widehat{v}^(x^{*}_h)\widehat{v}^(x^{}_h)K_p \end{bmatrix} \end{aligned}$$
(49)
$$\begin{aligned} {\left\{ \begin{array}{ll} v^_x(x^_h)=v^_x(x^{*}_h),\\ v^+_x(x^+_\ell )=v^+_x(x^{+*}_\ell ), \end{array}\right. } \end{aligned}$$
(50)
$$\begin{aligned} v_x^(x_h^{*}) = \partial _\xi K_p(\xi ^*(x^_h)) \qquad v_x^+(x_\ell ^{+*})&=  \partial _\xi K_p(\xi ^*(x^+_\ell )). \end{aligned}$$
(51)
Proposition 3
Let the 8tuple \((\nu ^\pm _1, \nu ^\pm _2, x^+_h, x^ _\ell , x^{+*}_h, x^{*} _\ell )\) be a solution to the system (48), such that the order in (44) is fulfilled and \(x^+ _\ell< x_\ell ^{+*}, x_h ^{*} < x_h ^\). Let \(v^\pm \) be defined in (47) and assumeThen, the functions \(v^\pm \) are the bestresponse payoffs of the producer, and a bestresponse strategy is given bywith \(\Gamma _p (t) = \Gamma ^+ _p {\mathbf {1}}_{\{\mu _t = \mu _+\}} + \Gamma ^ _p {\mathbf {1}}_{\{\mu _t = \mu _\}}\), where \(\Gamma _p ^+ = (\infty , x^+_\ell ]\) and \(\Gamma _p ^ = [x^ _h , +\infty )\), while \((X^*_t)\) follows the dynamics corresponding to the consumer’s strategy \((\sigma _i)_{i \ge 1}\) and the producer’s impulse strategy \((\tau ^* _i , \xi _i ^*)_{i \ge 1}\).
$$\begin{aligned} v_{xx} ^+(x_\ell ^{+*})< 0, \qquad v_{xx} ^ (x_h ^{*}) < 0. \end{aligned}$$
(52)
$$\begin{aligned} \tau ^* _0 = 0,&\quad \tau ^* _i = \inf \left\{ t > \tau ^* _{i1} : X^*_t \in \Gamma _p (t) \right\} , \end{aligned}$$
(53)
$$\begin{aligned} \xi ^*_i (x_\ell ^+)&= x_\ell ^{+*}  x_\ell ^+ , \qquad \xi ^*_i (x_h ^) = x_h ^{}  x_h ^{*} , \qquad i \ge 1, \end{aligned}$$
(54)
We remark that while we do not have a direct result regarding existence of solutions to (48), we provide nonetheless a verification theorem that connects a solution 8tuple to a bestresponse strategy.
3.2.3 Preemptive Response
It is possible that the static discounted future profit of the producer satisfies, say, \(v^+(x) \ge v^(x)\) for any x, so that he always prefers expansion regime to contraction regime.
In that case, the consumer switching at \(y_h\) from expansion to contraction hurts the producer and one possible strategy for him is to preempt in order to prevent the consumer from switching the drift to \(\mu _\). This situation could be viewed as looking for best \(x^+_h < y_h\), given \(y_h\). In the latter case the constrained solution could be \(x^+_h = y_h\), whereby the system (43) does not hold and the best response is to impulse \((X_t)\) right before it hits \(y_h\), \(x^+_h = y_h\). This strategy is not well defined (i.e., the supremum is not achieved on the open interval \((x_\ell ^+, y_h)\)), but the resulting preemptive bestresponse value in the \(\mu _+\) regime can be obtained by using the ansatz (where we slightly abuse the notation to write \(x_h^{+*} = y_h  \xi ^*(y_h)\) for the target impulse level at \(y_h\))and the boundary conditions for determining the target impulse levelsNote that we now have 5 unknowns, \(\nu ^+_{1,2}, x^+_\ell , x^{+*}_{\ell }, x^{+*}_h\) rather than six as we “fixed” \(x^+_h = y_h\). This yields the following systemPreemption in the contraction regime writes in a symmetric way.
$$\begin{aligned} v^+(x)&={\left\{ \begin{array}{ll} v^+(x^{+*}_h)K_p(\xi ^*(x)), &{} x\ge y_h,\\ \widehat{v}^+(x)+\nu ^+_1e^{\theta ^+_1x}+\nu ^+_2e^{\theta ^+_2x}, &{} x^+_\ell< x < y_h,\\ v^+(x^{+*}_\ell )K_p(\xi ^*(x)), &{} x\le x^+_\ell , \end{array}\right. } \end{aligned}$$
(55)
$$\begin{aligned} v_x^+(x^{+*}_h)&=  \kappa _1, \quad v_x^+(x_\ell ^{+*}) = +\kappa _1. \end{aligned}$$
(56)
$$\begin{aligned} {\left\{ \begin{array}{ll} \widehat{v}^+(y_h)+\nu ^+_1e^{\theta ^+_1y_h}+\nu ^+_2e^{\theta ^+_2y_h}=\widehat{v}^+(x^{+*}_h)+\nu ^+_1e^{\theta ^+_1(x^{+*}_h)}+\nu ^+_2e^{\theta ^+_2(x^{+*}_h)}K_p(\xi ^*(y_h)) &{} ({\mathcal {C}}^0\text {at }y_h)\\ \widehat{v}^+(x^{+}_\ell )+\nu ^+_1e^{\theta ^+_1x^{+}_\ell }+ \nu ^+_2 e^{\theta ^+_2x^{+}_\ell }=\widehat{v}^+(x^{+*}_\ell )+\nu ^+_1e^{\theta ^+_1x^{+*}_\ell }+\nu ^+_2e^{\theta ^+_2x^{+*}_\ell }K_p(\xi ^*(x_\ell ^+)) &{}({\mathcal {C}}^0\text { at }x^+_\ell )\\ \widehat{v}^+_x(x^{+}_\ell )+\nu ^+_1\theta ^+_1e^{\theta ^+_1x^{+}_\ell }+\nu ^+_2\theta ^+_2e^{\theta ^+_2x^{+}_\ell }=\widehat{v}^+_x(x^{+*}_\ell )+\nu ^+_1\theta ^+_1e^{\theta ^+_1x^{+*}_\ell }+\nu ^+_2\theta ^+_2e^{\theta ^+_2x^{+*}_\ell } +\kappa _1 &{}({\mathcal {C}}^1\text { at }x^+_\ell ) \\ {\widehat{v}}_x^+(x^{+*}_h) + \nu ^+_1\theta ^+_1e^{\theta ^+_1 x_h^{+*} }+\nu ^+_2\theta ^+_2e^{\theta ^+_2 x_h^{+*} } =  \kappa _1 &{} ({\mathcal {C}}^1\text { at }x^{+*}_h) \\ \widehat{v}^+_x(x^{+*}_\ell )+\nu ^+_1\theta ^+_1e^{\theta ^+_1x^{+*}_\ell }+\nu ^+_2\theta ^+_2e^{\theta ^+_2x^{+*}_\ell } = \kappa _1. &{} ({\mathcal {C}}^1\text { at }x^{+*}_\ell )\\ \end{array}\right. } \end{aligned}$$
(57)
In general, we need to manually verify whether \(x_h^+ > y_h\) (the “normal” case) or \(x^+ _h = y_h\) (the preemptive case) whenever we consider the producer best response. The two situations lead to different boundary conditions at the upper threshold, and hence cannot be directly compared. Considering the optimization problem for \(x_h^+\), we expect his value function to increase in \(x_h^+\) on \((x_\ell ^+, y_h)\) and experience a positive jump at \(y_h\), i.e., conditional on someone acting, the producer prefers the consumer’s switch to applying his impulse. However, if this is not the case, the consumer action hurts the producer and assuming the impulse costs are low, the best response is \(x_h^+ = y_h\). This corner solution arises due to the underlying discontinuity: on \((x_\ell ^+, y_h)\) the producer compares the value of waiting to the value of doing an optimal impulse, but at \(y_h\) he compares the value of switching to that of doing an optimal impulse. So it could be that “waiting” > impulsing > switching at \(y_h\), leading to preemptive impulse to prevent the worst (for the producer) outcome.
Proposition 4
Assume \(\mu _{0}=\mu _+\). Let the 5tuple \((\nu ^+_1, \nu ^+ _2, x^+_\ell , x^{+*}_\ell , x^{+*} _h)\) be a solution to the system (57), such that the order in (44) is fulfilled and \(x^+ _\ell< x_\ell ^{+*}, x_h ^{+*} < y_h\). Let \(v^+\) be defined as in (55) and assumeThen, the function \(v^+\) is the bestresponse payoff of the producer in the expansion regime, and a bestresponse strategy is given bywith \(\Gamma _p (t) = \Gamma ^+ _p (t) = (\infty , x^+_\ell ] \cup [y_h , +\infty )\), while \(X^*\) follows the dynamics corresponding to the producer’s impulse strategy \((\tau ^* _i , \xi _i ^*)_{i \ge 1}\).
$$\begin{aligned} v_{xx} ^+(x_\ell ^{+*})<0, \qquad v_{xx} ^+ (x_h ^{+*}) <0. \end{aligned}$$
(58)
$$\begin{aligned} \tau ^* _0 = 0,&\quad \tau ^* _i = \inf \{ t > \tau ^*_{i1} : X_t \in \Gamma _p (t) \}, \end{aligned}$$
(59)
$$\begin{aligned} \xi ^*_i (x_\ell ^+)&= x_\ell ^{+*}  x_\ell ^+ , \quad \xi ^*_i (y_h) = y_h  x_h ^{+*} , \quad i \ge 1, \end{aligned}$$
(60)
×
Figure 3 illustrates the shapes of the producer’s value function in the different cases of best response. For the given consumer strategy, we have a dominant function in the contraction regime (\(v^_1\)) and a dominant function in the expansion regime (\(v_1^+\)).
4 Equilibria
The bestresponse functions defined in Sect. 3 lead to three types of potential market equilibria, depending on the equilibrium behavior of the consumer and characterized by the relative positions of the consumer and producer thresholds:In equilibrium type I, the consumer switches back and forth forever between the two expansion and contraction regimes. The optimal policy of the consumer is given by the threshold \(y_\ell \) in the contraction regime and \(y_h\) in the expansion regime, while the optimal policy of the producer is formed by the pair \((x_\ell ^+,x_\ell ^{+,*})\) in the expansion regime and symmetrically by the pair \((x_h^{+,*,}x_h^{})\) in the contraction regime. We anticipate this to be the most common equilibrium type; it is precisely described and illustrated in Sect. 4.1.

Type I—generic: \( y_\ell \le x^_h \) and \( x^+_\ell \le y_h\).

Type II—transitory: \( \infty = y_\ell \le x^_\ell \) and \( x^+_\ell \le y_h\); or \(y_\ell \le x^_h \) and \(y_h = +\infty \).

Type III—preemptive: \( y_\ell \le x^_h \) and \( x^+_\ell = y_h\); or \( y_\ell = x^_h \) and \( x^+_\ell \le y_h\).
In equilibrium type II, the consumer and the producer both prefer a given regime and thus, the consumer switches at most once when the market is initialized in the opposite regime. Afterward, only the producer acts to maintain the price between \((x_\ell ,x_h)\). Consider the case of a single switch from expansion to contraction; the consumer’s optimal policy consists then in only one threshold, \(y_h\). The optimal policy of the producer is more complicated: in the expansion regime, it consists of the pair \((x_\ell ^+,x_\ell ^{+,*})\) and in the contraction regime, it consists of a quadruplet \((x^_\ell ,x^{,*}_\ell ,x^{,*}_h,x^_h)\). The same reasoning applies in the other single switch case. This equilibrium is described in Sect. 4.2.
The last type of equilibrium, named type III, resembles the preceding one in the sense that at most one switch can be observed. But it differs because here the consumer is stuck forever in a state she wishes to leave. In that case described in Sect. 4.3, only the producer acts. Starting in the expansion regime, for instance, the consumer would like to switch to the contraction regime when the price reaches a threshold \(y_h\). But the producer, who prefers perpetual expansion, preempts the switch by acting at the threshold \(y_h^\), just before the action of the consumer.
Thresholdtype equilibria offer analytical tractability to describe the longrun market behavior. The latter can be summarized by the stationary distribution of the commodity price \((X^*_t)\) and the consumer regimes \((\mu ^*_t)\) as induced by the equilibrium strategies \((N^*_t, \mu ^*_t)\). To quantify these effects, we define an auxiliary discretetime jump chain \((M^*_n)_{n=0}^\infty \) which takes values in the state spaceThe chain \(M^*\) keeps track of the sequential actions of the players, where \(S_\pm \) represents the switches of the consumer (“\(S_{+}\)” stands for the switch \(\mu _\rightarrow \mu _+\) and “\(S_{}\)” for \(\mu _+\rightarrow \mu _\)) and \(I^\pm _r\) the impulses (up/down at the two impulse boundaries) of the producer. Thus, \(M^*\) summarizes the sequence of market interventions stored within \(\tau _i, \sigma _i\) stopping times. Note that states \(M^*_n \in \{S_+, I^+_{\ell h}\}\) imply a positive drift \(\mu _+\) of \(X^*\), while the rest imply a negative drift \(\mu _\). Moreover, if the consumer adopts a doubleswitch strategy and the producer adopts a nonpreemptive strategy as discussed in Sect. 4.1, then the thresholds \(x^+_h\) and \(x^_\ell \) will be hit at most once by \(X^*\) and therefore the corresponding states \(I^+_h\) and \(I^_\ell \) of \(M^*_n\) are transient.
$$\begin{aligned} E:= \left\{ S_{+}, S_{}, I^_\ell , I^_h, I^+_\ell , I^+_h\right\} . \end{aligned}$$
(61)
Because the dynamics of \(X^*\) between interventions are always Brownian motion (BM) with drift, the transition probabilities of \(M^*\) can be described in terms of hitting probabilities of a BM. This offers closedform expressions for the transition probability matrix \({\mathbf {P}}\) of \(M^*\), and its invariant distribution denoted by \(\vec {\Pi }\). Moreover, the sojourn times of \(M^*\) correspond to \((X^*_t)\) hitting the various thresholds (in terms of the original continuoustime “t”) and are similarly linked to BM first passage times. Combining the above ideas, we can then derive a complete description of \((\mu ^*_t)\), namely the longrun proportion of time that the commodity demand is in expansion/contraction regimes and the respective expected switching time, see (71).
In the following section otherwise stated, we use the parameter values in Table 1, such that \(\pi _i(x) = a_i (xx^1_i)(x^2_ix)\), \(i\in \{c,p\}\). This yields consumer and producer preferred price levels of \({\bar{X}}_c= 3, {\bar{X}}_p=4\). The same set of parameters yields an equilibrium of each type, showing the nonuniqueness of equilibria in this model.
Table 1
Model parameters for Sect. 4.1
Market  Consumer  Producer  

\(\beta \)  0.1  \(x^1_c\)  1  \(x^1_p\)  2 
\(\sigma \)  0.25  \(x^2_c\)  5  \(x^2_p\)  6 
\(\mu _+\)  0.1  \(a_c\)  0.75  \(a_p\)  0.25 
\(\mu _\)  \(0.1\)  \(h_\pm \)  10  \(\kappa _0\)  3 
\(\kappa _1\)  0 
×
We remark that while our verification results in Propositions 2–4 rigorously characterize player strategies, we do not have any existence results for the game equilibrium itself. In particular, we do not have existence results for the corresponding large systems of equations that describe all the thresholds. Global existence (i.e., for any admissible competitor strategy) seems to be out of reach, and it is challenging to formulate any algebraic conditions that would hold in equilibrium and ensure existence (Fig. 4). Thus, ultimately we numerically find a fixed point to the (specific type of) bestresponse strategy maps and then view it as an approximate (up to numerical roundoff error, since we cannot show existence directly) thresholdtype equilibrium. Consequently, technically our discussion below concerns candidate equilibria that satisfy the equilibrium equations up to machine precision.
4.1 Type I—Generic
We look for an interior, nonpreemptive equilibrium satisfying the ordering (15), i.e., a pair of consumer and producer strategies of the form \((y_\ell ^*,y_h^*)\) and \((x^+_\ell ,x^{+*}_\ell ,x^{*}_h,x^_h)\). To construct this equilibrium, we employ tâtonnement, i.e., iteratively apply the bestresponse controls alternating between the two players. This corresponds to the interpretation of Nash equilibrium as a fixed point of bestresponse maps BR. The equilibrium is obtained using two different fixedpoint algorithms. Given strategies \({\mathcal {C}}_p^0\) and \({\mathcal {C}}_c^0\), we have either an asynchronous or synchronous algorithm, namelyThe resulting equilibrium found using both algorithms is the same and is
$$\begin{aligned}&{\mathcal {C}}_p^{k+1} = BR({\mathcal {C}}_c^k), \quad&{\mathcal {C}}_p^{k+1} = BR({\mathcal {C}}_c^k), \\&{\mathcal {C}}_c^{k+1} = BR({\mathcal {C}}_p^{k+1}), \quad&{\mathcal {C}}_c^{k+1} = BR({\mathcal {C}}_p^{k}), \\&\text {asynchronous}&\text {synchronous}. \end{aligned}$$
$$\begin{aligned} {\mathcal {C}}^{{\mathrm{I}},*}_p=\begin{bmatrix} 2.0, &{} 3.6, &{} , &{} +\infty \\ \infty , &{} , &{} 4.5, &{} 6.1 \\ \end{bmatrix}, \qquad {\mathcal {C}}^{{\mathrm{I}},*}_c=[2.2, 4.4]. \end{aligned}$$
(62)
×
The dynamic equilibrium of the commodity price \(X^*\) is illustrated in Fig. 5 (Left). The market starts in the expansion regime, \(\mu ^*_0 = \mu _+\). We observe that \(x^{*}_h\) is close to \(y_h^{*}\), implying that once the price has reached the switching level \(y_h^*\), it is likely to touch soon thereafter the threshold \(x^{*}_h\), making the price drop to \(x^{*}_h\). The producer “backs up” this meanreversion by impulsing down if prices rise too much and impulsing up if they drop too much. Otherwise, she lets the consumer be in charge via switching control that benefits him as well.
At equilibrium, the price \(X^*\) fluctuates in a range of values where neither the producer nor the consumer have negative profit rate. If alone in the market, the optimal monopolistic strategies of the producer and the consumer areWe see that the equilibrium strategy of the producer \({\mathcal {C}}^{{\mathrm{I}},*}_p\) is quite close to what he would have done if alone in the market. On his side, the consumerinduced equilibrium price range is wider than he would prefer (2.6 against 2.2 if alone). In equilibrium, it is as if the producer lets the consumer do the job of bringing back the price to his preferred level \({\bar{X}}_p\). The producer intervenes only if \(X^*_t\) drops too low or gets too high, after the regime switching has occurred. But, in the long run, the average price \(\lim _{t\rightarrow \infty }{\mathbb {E}}[X^*_t]\) is close to 3.5, which is the midvalue between \({{\bar{X}}}_p\) and \({{\bar{X}}}_c\).
$$\begin{aligned} {\mathcal {C}}^m_p:=\begin{bmatrix} 1.9, &{} 3.5, &{} 3.5, &{} 5.6 \\ 2.4, &{} 4.5, &{} 4.5, &{} 6.1 \\ \end{bmatrix}, \qquad {\mathcal {C}}^{m}_c:=[1.7, 4.3]. \end{aligned}$$
(63)
The players’ equilibrium strategy profile yields a stationary distribution for the pair \((X^*_t, \mu ^*_t)\). The macromarket \(\mu ^*\) switches between the expansion and the contraction regimes back and forth, while the jointly controlled price \((X^*_t)\) is bounded in the range \([x^+_\ell , x^_h]\) and fluctuates in a meanreverting pattern due to alternating signs of its drift. These stylized features can be broadly traced in the world commodity markets which undergo cyclical Expansion/Contraction patterns.
Dynamics of \((X^*_t)\) in the Equilibrium The dynamics of the commodity price \((X^*_t)\) are less tractable due to the impulses applied by the producer. Let \(\phi ^*(\cdot )\) denote the longrun (i.e., stationary) distribution of \((X^*_t)\). In Fig. 6, we show \(\phi ^*\) obtained from an empirical density based on a long trajectory of \((X^*_t)\), relying on Monte Carlo simulations and the ergodicity of the recurrent, bounded process \((X^*_t)\). For additional interpretability, we also plot the invariant distributions \(\phi ^*_\pm \) conditional on \(\mu _t = \mu _\pm \).
×
4.2 Type II—Transitory
In another type of equilibrium, the consumer switches only in one regime, with the other being absorbing. For this reason, we name it transitory. To fix ideas, suppose that the consumer only switches from expansion regime to contraction regime. In that case, the producer effectively acts like a profit maximizing monopoly in the contraction regime with twosided impulses; in the expansion regime, she will apply a onesided impulse as in the equilibrium type I.
To solve for such equilibrium, we first compute the producer strategy in the contraction regime which is a decoupled VI as in (39) leading to the 6 equations in (43) but only for \(v^\), \(x_r^, x_r^{*}\), \(r \in \{ \ell , h \}\). This solution induces the corresponding noswitch solution \(\omega _0^\) of the consumer as in (20), (21). Both \(v^, \omega _0^\) are then fixed and act as source terms to solve for the equilibrium in the expansion regime. For the latter, we need to compute \(v^+(\cdot )\) and the associated thresholds \(x_\ell ^+, x_\ell ^{+*}\) (only one threshold), as well as \(\omega ^+(x)\) and the switching threshold \(y_h\) (note that there is no \(y_\ell \)). The boundary conditions are \(v^+(y_h) = v^(y_h), \omega ^+(y_h) = \omega ^(y_h)  h_0\).
This reasoning leads to the following algorithm. If \(x^+_\ell \) and \(x^{+,*}_\ell \) are fixed, we compute the best response of the consumer by solving the variational problem for the consumer value function \(\omega ^+\) such that:This is exactly the best response in the singleswitch case with the solution given by the system (24) and which provides the consumer’s threshold \(y_h\). Now, if we consider that \(y_h\) is fixed, we can compute the best response of the producer by solving a VI for the value function \(v^+\) that satisfiesThe boundary conditions giving the four unknowns \((x^+_\ell ,x^{+,*}_\ell , \nu ^+_1,\nu ^+_2)\) are:Now, we can perform the iterations \(y_h^{0} \rightarrow (x^{+,(0)}_\ell , x^{+,*(0)}_\ell ) \rightarrow y_h^{1} \rightarrow (x^{+,(1)}_\ell , x^{+,*(1)}_\ell ) \ldots \).
$$\begin{aligned} w^+(x)&= {\left\{ \begin{array}{ll} w_0^(x)  h_0, &{} y_h \le x,\\ \widehat{\omega }^+(x) + \lambda ^+_1 e^{\theta ^+_1x} + \lambda ^+_2 e^{\theta ^+_2x}, &{} x^+_\ell< x < y_h,\\ w^+(x^{+,*}_\ell ), &{} x \le x^+_\ell . \end{array}\right. } \end{aligned}$$
$$\begin{aligned} v^+(x)&= {\left\{ \begin{array}{ll} v^(x), &{} y_h \le x,\\ \widehat{v}^+(x) + \nu ^+_1e^{\theta ^+_1x} + \nu ^+_2e^{\theta ^+_2x}, &{} x^+_\ell< x < y_h,\\ v^+(x^{+*}_\ell )  \kappa _0  \kappa _1 (x^{+,*}_\ell  x), &{} x \le x^+_\ell . \end{array}\right. } \end{aligned}$$
$$\begin{aligned} {\left\{ \begin{array}{ll} v^(y_h)=\widehat{v}^+(y_h)+\nu ^+_1e^{\theta ^+_1y_h}+\nu ^+_2e^{\theta ^+_2y_h}, &{} ({\mathcal {C}}^0\text { at }y_h)\\ \widehat{v}^+(x^+_\ell )+\nu ^+_1e^{\theta ^+_1x^+_\ell }+\nu ^+_2e^{\theta ^+_2x^+_\ell }=\widehat{v}^+(x^{+*}_\ell )+\nu ^+_1e^{\theta ^+_1x^{+*}_\ell }+\nu ^+_2e^{\theta ^+_2x^{+*}_\ell } \!\! \kappa _0  \kappa _1 ( x^{+,*}_\ell  x^+_\ell ), &{} ({\mathcal {C}}^0\text { at }x^+_\ell )\\ \widehat{v}^+_x(x^+_\ell )+\nu ^+_1\theta ^+_1e^{\theta ^+_1x^+_\ell }+\nu ^+_2\theta ^+_2e^{\theta ^+_2x^+_\ell }=\widehat{v}^+_x(x^{+*}_\ell )+\nu ^+_1\theta ^+_1e^{\theta ^+_1x^{+*}_\ell }+\nu ^+_2\theta ^+_2e^{\theta ^+_2x^{+*}_\ell } +\kappa _1, &{} ({\mathcal {C}}^1\text { at }x^+_\ell )\\ \widehat{v}^+_x(x_\ell ^{+*})+\nu ^+_1\theta ^+_1e^{\theta ^+_1x_\ell ^{+*}}+\nu ^+_2\theta ^+_2e^{\theta ^+_2 x_\ell ^{+*}} = \kappa _1. &{} ({\mathcal {C}}^1\text { at } x_\ell ^{+*}) \end{array}\right. } \end{aligned}$$
(64)
We find the following fixed point of the bestresponse functions of the producer and the consumer:The system starts in the expansion regime, and once the price reaches level \(X^*_t =4.3\), the consumer switches to contraction and the systems remains in that state forever. After that, she relies on the producer to impulse \((X^*_t)\) up/down when prices get too low/too high but never reverts to the expansion regime. Thus, in the long run \((X^*_t)\) is simply a Brownian motion with negative drift \(\mu _\) that has two impulse boundaries \(x^_\ell = 2.4, x^_h =6.1\).
$$\begin{aligned} {\mathcal {C}}^{{\mathrm{II}},*}_p=\begin{bmatrix} 1.9, &{} 3.6, &{} , &{} +\infty \\ 2.4, &{} 4.5, &{} 4.5, &{} 6.1 \\ \end{bmatrix}, \qquad {\mathcal {C}}^{{\mathrm{II}},*}_c=[\infty , 4.3], \end{aligned}$$
(65)
Compared to the doubleswitch equilibrium of the previous section, the above market equilibrium in (65) has two important differences. First, as \(t \rightarrow \infty \) we have that \(\mu ^*_t \rightarrow \mu _\) so that in the long run the market will be in the contraction regime and the consumer becomes inactive. Second, because the producer eventually “takes over,” she will intervene much more frequently (see center panel of Fig. 5), benefiting himself and reducing consumer value.
4.3 Type III—Preemptive
The producer may have an interest to preempt the switch, say, from the expansion regime to the contraction regime to avoid decline in the consumption of the commodity he produces. In this case, the equilibrium is a fixed point of the bestresponse function of the producer described in Sect. 3.2.3 and the bestresponse function of the consumer described in Sect. 3.1.2. We look for an equilibrium where the consumer would like to switch at \(y_h\) in the expansion regime, but where the producer makes \(y_h\) his own intervention threshold to impulse the price down.
Using the same protocol as in type I and type II equilibrium research, we find the following threshold strategy for the producer and the consumer:In the preemptive equilibrium, the price fluctuates in a narrower range than the other two equilibria. Here, \((X^*_t)\) oscillates between \(x^+_\ell = 1.7\) and \(x^+_h = y_h =4.3\).
$$\begin{aligned} {\mathcal {C}}^{{\mathrm{III}},*}_p:=\begin{bmatrix} 1.7, &{} 3.1, &{} 3.1, &{} 4.3 \\ , &{} , &{} , &{}  \\ \end{bmatrix}, \qquad {\mathcal {C}}^{{\mathrm{III}},*}_c:=[, 4.3]. \end{aligned}$$
(66)
4.4 Equilibrium NonUniqueness
There are at least three potential equilibria. A natural question is thus whether one of them is preferable to the others. Figure 7 shows the value functions of the producer and of the consumer in the two market regimes (expansion and contraction) and for the different equilibria from type I to type III. We observe that the producer would prefer in both regimes to live in a type I equilibrium. The function \(v_1^{\pm }\) dominates all the other ones (note that there is no \(v_3^\) because in equilibrium type III contraction never happens). However, the consumer would rather be in the preemptive equilibrium type III: her value function \(w^+_3\) dominates the other two. Intuitively, we may think that the switching costs she saves by letting the producer do all the work of maintaining the price around its longterm average value compensate for the inconvenience of having prices that are higher than preferred.
×
4.5 Impact of Market Volatility
As one example of comparative statistics that are possible in our model, we investigate the impact of volatility parameter \(\sigma \) of X on the equilibrium profits and behavior of \(X^*\). In Table 2, we list statistics of \(\phi ^*\) for a range of market volatilities \(\sigma \). We also quantify the profitability of the two players through their average percentage of optimality (APOO)which is the ratio between average profit rate \(\pi _r(X^*_t)\) in equilibrium and the maximum profit that could be hypothetically obtained at the firstbest level \(\pi _r({\bar{X}}_r)\), \(r \in \{c, p\}\).
$$\begin{aligned} \mathrm {APOO} := \frac{\int _{{\mathcal {D}}} \pi _r (x)\phi ^*( dx )}{\pi _r ({\bar{X}}_r)}, \end{aligned}$$
In all types of equilibria, both players are worse off in terms of expected profit rate as \(\sigma \) increase. This occurs even though in type I and in type II equilibria the average price \({\mathbb {E}}[ X^*]\) increases. However, that gain is dominated by the losses due to higher \(\mathrm {Var}(X^*)\) which implies that prices tend to be further from their preferred levels \({\bar{X}}_r\) decreasing \({\mathbb {E}}_{\phi ^*}\left[ \pi _r \right] \).
Table 2
Longrun mean and variance of \(X^*\), longrun profit rates, and frequency of regime switches as market volatility \(\sigma \) changes (APOO in parentheses)
\(\sigma \)  \({\mathbb {E}}_{\phi ^*}[X^*] \)  \(\mathrm {Var}_{\phi ^*}(X^*)\)  \({\mathbb {E}}_{\phi ^*}\big [\pi _p\big ]\)  \({\mathbb {E}}_{\phi ^*}\big [\pi _c\big ]\)  Switch (per yr)  

Type I  0.25  3.52  0.73  0.81 (80%)  2.4 (80%)  0.021 
0.3  3.62  0.80  0.80 (79%)  2.2 (73%)  0.021  
0.4  3.77  0.94  0.76 (75%)  1.90 (62%)  0.020  
Type II  0.25  3.73  0.68  0.87 (85%)  2.3 (74%)  0.020 
0.3  3.76  0.74  0.85 (84%)  2.2 (71%)  0.020  
0.4  3.81  0.85  0.81 (80%)  1.95 (64%)  0.020  
Type III  0.25  3.41  0.45  0.86 (85%)  2.7 (90%)  0.0 
0.3  3.35  0.51  0.83 (82%)  2.7 (90%)  0.0  
0.4  3.28  0.61  0.78 (77%)  2.7 (88%)  0.0 
4.6 Effect of Consumer’s Switching Cost
A key parameter that controls which equilibrium type we face is the consumer’s switching cost \(h_0\). Starting from the doubleswitch situation, as \(h_0\) increases (\(>0.6\)), the consumer is less incentivized to switch from \(\mu ^+\) to \(\mu ^\) and we enter the singleswitch scenario of Sect. 3.1.2. Consequently, she receives the noswitch payoff \(\omega _0^+(x)\) when \(\mu _t=\mu ^+\) and solving for her bestresponse boils down to solve for \(y_h\) only. Once \(h_0\) gets very large, her best response is simply the noswitch response \(\omega _0^{\pm }\). Conversely, as \(h_0 \downarrow 0\) her actions become free. In that situation, we can reduce the producer problem to a single, piecewise VI with a free boundary \({\tilde{X}}_c\):with the \({{{\mathcal {C}}}}^0\) regularity at \({\tilde{X}}_c\): \(\lim _{x \uparrow {\tilde{X}}_c} v(x) = \lim _{x \downarrow {\tilde{X}}_c} v(x)\).
$$\begin{aligned} \sup \Big \{  \beta v(x) + \mu _{} v_x + \frac{1}{2} \sigma ^2 v_{xx} + \pi _p (x) \; ; \; \sup _{\xi } \big \{ v(x+\xi )  v(x)  K_p (\xi ) \big \} \Big \}&= 0 \qquad x > {\tilde{X}}_c, \end{aligned}$$
(67)
$$\begin{aligned} \sup \Big \{  \beta v(x) + \mu _{+} v_x + \frac{1}{2} \sigma ^2 v_{xx} + \pi _p (x)\; ; \; \sup _{\xi } \big \{ v(x+\xi )  v(x)  K_p (\xi ) \big \} \Big \}&= 0 \qquad x < {\tilde{X}}_c, \end{aligned}$$
(68)
Figure 8 shows that for low \(h_0\), \(x^{+*}_h < y_\ell \) and \(x^+_h\) is greater but close to \(y_h\). Thus, when consumption switches from expansion to contraction, it is very likely that the price touches \(x_h^+\) soon thereafter and is impulsed back to \(x ^{+*}_h\) and thus, the regime rapidly switches back to expansion again. When the switching cost increases, this solution disappears.
×
Remark 3
It is possible for the impulse amounts to be so large as to lead to a double simultaneous control: producer’s impulse instantaneously followed by the consumer switching. In this setting, the producer effectively forces the consumer to switch the regime by impulsing \(X^*\) hard enough. This situation corresponds to \(x^{*}_h < y_\ell \), so that the impulse in the contraction regime moves \(X^*\) into the respective switching region \((\infty , y_\ell )\), and as a result the consumer immediately switches to the expansion regime. This situation occurs if, for instance, the drifts are \(\mu ^ = 0.01,\mu ^+ = 0.1\), so that the consumer is not able to ever efficiently lower prices. Consequently, the producer is forced to fully control price reduction. We observe in the above situation the equilibrium thresholds of \(x^{*}_h = 3.04 < y_\ell = 3.69\). \(\square \)
5 Case Study: Diversification Effect of Vertical Integration
In this section, we accordingly study whether or not downstream or upstream firms have an interest in being vertically integrated. To this end, we consider a small firm that has no market power regarding the commodity price X and focus on the case of the market equilibrium type I (generic case). We then investigate whether the firm can benefit from a diversification effect by having activity both in the downstream consumer side and the upstream conversion side.
To make the case study concrete, we consider a simplified version of the crude oil and gasoline markets, the latter a shorthand for refined products, calibrated to the ballpark of the 2019 state of the world. Currently, world oil consumption is about 100 Mb/d (millions of barrels per day), normalized to 1 “barrel” per day. We take as a nominal initial price \(X_0\) \(=\) 50 USD/b and a nominal volatility of crude \(\sigma \) \(=\) 10 USD/b. To calibrate our model, we consider that crude oil producers have a preferred range of prices that goes from \(x^1_p = 30\) USD/b to \(x^2_p = 100\) USD/b and that the average cost of oil extraction is \(c_p = 30\) USD/b. This leads to a demand function \(D_p(x) = 1  0.01 x\), which captures the low sensitivity of the demand of crude to prices. The crude is transformed into gasoline with a small amount of losses 5%, so that the conversion factor is \(\alpha =0.95\).
We set the transfer function of crude oil price to average price of gasoline to \(P(x) = 10 + 1.1x\), where P(x) is also expressed in USD/b. There is evidence that the (pretax) price of gasoline is a linear function of the crude. For instance, using monthly data of the Energy Information Agency of the US Department of Energy on refined products prices from January 1983 to November 2019,^{2} we regressed the US Total gasoline Retail sales by refineries \({\hat{P}}\) to the monthly crude oil price \({\hat{X}}\) and found a linear relationwith a regression \(R^2 = 95\%\). Considering that the basket of refined products includes not just gasoline (even if it accounts for the largest share), we simplified the relation. Note that the condition \(p_1 \ge \alpha \) for having a downstream convex profit function holds. Furthermore, refinery costs \(c_c\) are highly variable between 4 to 10 USD/b. We take the higher value of \(c_c=10\). Finally, we consider that the demand function for refined products, \(D_c (P)=d'_0  d'_1 P\) is such that \(d'_0=5\) b/d of crude equivalent refined products and \(d'_1=0.05\). With these parameters, the preferred range of crude prices for the consumer is between \(x^1_c = 11\) and \(x^2_c = 82\) USD/b. We consider fixed action costs both for the production firm and the downstream firm. We consider that the producer and the downstream firm lose two years of profit at optimal price to change state making \(\kappa _0 = 2 \pi _p({{\bar{X}}}_p)\) and \(h_0 = 2 \pi _c({{\bar{X}}}_c)\). Finally, we take \(\mu _\pm = \pm 0.15\) per year, which implies that it takes 10 years for the crude price to increase by 1.5 USD.
$$\begin{aligned} {\hat{P}}_m = 1.2 {\hat{X}}_m + 14 + \epsilon _m \end{aligned}$$
Table 3
Nominal values for model parameters for the crude oil case study
Value  Interpretation  Units  

\(\beta \)  0.1  Discount rate  %/year 
\(X_0\)  50  Initial oil price  USD/b 
\(d_0\)  1  Demand function for oil: intercept  Mb/d 
\(d_1\)  0.01  Demand function for oil: slope  Mb/d/(USD/b) 
\(d'_0\)  5  Demand function for gasoline: intercept  Mb/d 
\(d'_1\)  0.05  Demand function for gasoline: slope  Mb/d/(USD/b) 
\(\alpha \)  0.95  Transformation rate  dimensionless 
\(p_0\)  10  Crude–gasoline price function: intercept  USD/b 
\(p_1\)  1.1  Crude–gasoline transfer price function: slope  USD/b/(USD/b) 
\(c_p\)  30  Oil production cost  USD/b 
\(c_c\)  5  Refining cost  USD/b 
\(\mu _\pm \)  \(\pm \, 0.15\)  Annualized crude drift parameters  USD/b 
\(\sigma \)  10  Annualized crude volatility  USD/b 
\(h_0\)  \(2 \pi _c({{\bar{X}}}_c) = 29 \)  Consumption switching cost  USD 
\(\kappa _0\)  \(2 \pi _p({{\bar{X}}}_p) = 24.5\)  Production switching cost: fixed  USD 
\(\kappa _1\)  0  Production switching cost: proportional  USD/b 
The resulting equilibrium type I producer impulse strategy \({\mathcal {C}}^{\mathrm{I,*}}_p\) and consumer’s switching strategy \({\mathcal {C}} ^{\mathrm{I,*}}_c\) associated with the calibration summarized in Table 3 are given byThus, in equilibrium, the crude price \(X^*\) fluctuates between 22.5 and 87 USD/b, with potential excursions up to 104 or down to 26 USD/b at which point producers intervene.
$$\begin{aligned} {\mathcal {C}}^{\mathrm{I,*}}_p=\begin{bmatrix} 26, &{} 62, &{} , &{} +\infty \\ \infty , &{} , &{} 69, &{} 104 \\ \end{bmatrix}, \qquad {\mathcal {C}}^{\mathrm{I,*}}_c=\begin{bmatrix} 22.5, &{} 87 \\ \end{bmatrix}. \end{aligned}$$
(69)
×
×
Now let us consider a small firm engaging in a fraction \(\lambda \in (0,1)\) of activity in the downstream sector and \(1\lambda \) in the upstream sector. Her profit rate is thus \(\pi _\lambda := \lambda \pi _c + \big ( 1\lambda \big ) \pi _p\). The firm is vertically integrated when \(0< \lambda <1\). Denote by \(\sigma (\pi _\lambda )\) the standard deviation of her profit rate \(\pi _\lambda (\cdot )\) integrated against the stationary distribution \(\phi ^*\) of \(X^*\), and by \({\mathbb {E}}\big [\pi _\lambda \big ] = \int \pi _\lambda (x) \phi ^*(dx)\) the respective expected profit rate. To fix ideas and because the analysis is symmetric, we are interested in situations where a pure downstream firm (\(\lambda =0\)) would be better off having part of her activity in the upstream sector. This will take place when the upstream activity provides a higher expected profit rate and/or a lower risk as measured by \(\sigma (\pi _\lambda )\). Figure 9 presents the risk–return curves \(\lambda \mapsto (\sigma (\pi _\lambda ),{\mathbb {E}}\big [\pi _\lambda \big ])\) as the passthrough parameter \(p_1\) increases from the nominal value of 1.1 to 1.18. We observe that for low values of \(p_1\) diversification gains are limited: expected profit rate goes up, but risk also increases. For moderate \(p_1\), a pure downstream firm unambiguously benefits from some upstream activity: she can achieve the same level of risk with a higher expected profit. For high \(p_1\), the upstream sector dominates completely with lower risk and higher average profit. Figure 10 (right) shows the critical integration level \(\lambda ^*\) that minimizes the risk \(\sigma (\pi _\lambda )\) and captures the “variance–minimal” business model.
We observe that for high enough passthrough values, being a producer \((\lambda =1)\) dominates any other combination of activity. This phenomenon happens even though the maximum profit rate of the downstream firm \(\pi _c({{\bar{X}}}_c)\) increases and gets higher than the producer’s maximum profit rate function \(\pi _p({{\bar{X}}}_p)\) as shown in the left panel of Fig. 10. As shown by the evolution of equilibrium price range in Fig. 10 (Middle), as \(p_1\) increases, the equilibrium is getting more and more detrimental to the downstream firm. The shaded salmon area represents the interval \([ {\mathbb {E}}_{\phi ^*}[X^*]  \sigma _{\phi ^*}(X^*), {\mathbb {E}}_{\phi ^*}[X^*] + \sigma _{\phi ^*}(X^*)]\) where commodity prices tend to reside. The average commodity price remains stable around 65 USD/b, and its standard deviation is not affected much by \(p_1\) either, while \({{\bar{X}}}_c\) is steadily decreasing. Thus, since the expected profit rate of the integrated firm is a function of the expected price and its standard deviation, it does not change much. But its variance grows as a function of \(p_1\) and thus increases significantly. To conclude, in our model we do observe a diversification effect obtained by mixing upstream and downstream activities; however, the integration gains depend closely on the passthrough parameter \(p_1\) which serves as a transmission channel of the volatility of the commodity price to the retail price.
6 Conclusion
We showed how a simple model of competition between upstream and downstream representative firms having different pace of intervention can lead to a rich variety of equilibria, potentially nonunique. The fact that the upstream firm can impact the price more rapidly than the downstream firm gives the producer a significant advantage, enabling him to lock the consumer in the producer’s preferred range of prices. Further, in the case of the crude oil market and its refinery products, we stressed how the passthrough parameter \(p_1\) plays a key role for the diversification effect induced by vertical integration. Vertical integration is beneficial for low values of \(p_1\), while for higher values, production dominates downstream activity both in terms of expected profit rate and profit standard deviation.
7 Proofs
7.1 Proof of Proposition 1
Proof
The proof is standard; nonetheless, we give some details for the reader’s convenience. To ease the notation, let us consider only the case \(\mu = \mu _+\), the other case being identical. Let \(w^+ _0 (x) = \widehat{\omega }^{+}(x)+u^{+}(x)\), where the parameters \( (\lambda ^+ _{1,0}, \lambda ^+ _{2,0}) \in {\mathbb {R}}\) solve the system (20), (21). By construction, the function \(w_0 ^+\) is of class \({\mathcal {C}}^2\) everywhere. Hence we can apply Itô’s formula to \(e^{\beta s} w_0 ^+ (X_s)\) on the time interval \([0, t \wedge \zeta _n)\), yieldingwhere \((\zeta _n)_{n \ge 1}\) is a localizing sequence of stopping times along which the local martingale part above is in fact a true martingale. We use the notation \(^\prime \) and \(^{\prime \prime }\) for, respectively, the first and second derivative in x. Taking expectations on both sides, using the fact that \(w_0 ^+\) solves the ordinary differential Eq. (17) and letting \(n \rightarrow \infty \), we obtainNow, notice that on the jumps of X we have \(\Delta w_0 ^+ (X_s) = w_0 ^+ (x^{+ *} _r)  w_0 ^+ (x^{+} _r)\), which is zero by the boundary conditions (19), hence the jump part in the equation above vanishes. Moreover, being \(X_t \in [x_\ell ^+ , x_h ^+]\) for all \( t \ge 0\), we have by dominated convergence that \({\mathbb {E}}\left[ e^{\beta t } w_0 ^+ (X_t) \right] \rightarrow 0\) as \( t \rightarrow \infty \). Therefore, letting \(t \rightarrow \infty \) we can conclude that \(w_0 ^+ (x) = J_c ^+ (x; N, \mu ^+)\) for all \(x \in [x_\ell ^+ , x_h ^+]\). \(\square \)
$$\begin{aligned} e^{\beta (t\wedge \zeta _n) } w_0 ^+ (X_{t\wedge \zeta _n})&= w_0 ^+(x) + \int _{0+} ^{t \wedge \zeta _n} e^{\beta s} \left[ w_0 ^{+ \prime }(X_{s}) (\mu _+ \text {d}s + \sigma \text {d}W_s \text {d}N_s) \beta w_0 ^+ (X_{s})\text {d}s \right] \\&\quad + \frac{\sigma ^2}{2} \int _{0+} ^{t \wedge \zeta _n} e^{\beta s} w_0 ^{+ \prime \prime } (X_{s}) \text {d}s +\! \sum _{0<s\le t \wedge \zeta _n} e^{\beta s} \left[ \Delta w_0 ^+ (X_s) + w_{0} ^{+ \prime } (X_{s})\Delta N_s \right] , \end{aligned}$$
$$\begin{aligned} {\mathbb {E}}\left[ e^{\beta t } w_0 ^+ (X_t) \right] = w_0 ^+ (x)  {\mathbb {E}} \left[ \int _{0+} ^t e^{\beta s} \pi _c (X_s) \text {d}s + \sum _{0<s\le t} \Delta w_0 ^+ (X_s)\right] . \end{aligned}$$
Figure 11a illustrates the fact that a threshold switching strategy might not be optimal in all potential situations by considering the shape of \(w^\pm _0(x)\). In the right panel, we have comonotonicity between \(w^+\) and \(w^\): the consumer is incentivized to switch to \(\mu ^+\) when \(X_t\) is low and to \(\mu ^\) when \(X_t\) is high. In that situation, we expect that a thresholdtype strategy is a best response. In contrast, on the left panel two other cases are illustrated. First, we see that it is possible that \(w^+(\cdot ) \ll w^(\cdot )\); in other words, the consumer has a strong preference to one regime over the other. In that case, the expansion regime could be absorbing, i.e., it is optimal to never switch to \(\mu _\). In the plot, this would happen if \(h_0\) is low (dashed line), whereby \(w^(x) > w^+(x)  h_0\) and it is optimal to switch to \(\mu _\) at any x (therefore \(\mu _+\) would never be observed in the resulting game evolution). At the same time, we see that if \(h_0\) is moderate (the solid line), then the region where \(w^_0(x) > w^+_0(x)  h_0\) is disconnected, so it is likely that a twothreshold switching strategy is an optimal response.
×
7.2 Proof of Proposition 2
Proof of Proposition 2
By construction, the functions \(w^\pm (x)\) in (29) solve the system of VIs in (27), (28) and satisfy \(w^+ \in {\mathcal {C}}^2 ((x_\ell ^+ , x_h ^+) \setminus \{y_\ell \}) \cap {\mathcal {C}}^1 ((x_\ell ^+, x_h ^+)) \cap {\mathcal {C}}^0 ({\mathbb {R}})\) and \(w^ \in {\mathcal {C}}^2 ((x_\ell ^, x_h ^) \setminus \{y_h\}) \cap {\mathcal {C}}^1 ((x_\ell ^ , x_h ^)) \cap {\mathcal {C}}^0 ({\mathbb {R}})\). Let N denote the pure jump component in X’s dynamics associated with the producer’s strategy with thresholds \((x_\ell ^\pm , x_\ell ^{\pm *}; x_h ^\pm , x_h ^{\pm *})\). The proof is structured in two steps.
Step 1: optimality The following verification argument proves that such functions coincide with the bestresponse payoffs of the consumer and that the switching times \({\hat{\sigma }}_i\) as in the statement are optimal provided they are admissible. First, by an approximation procedure as in the first part of the proof in [1, Theorem 3.3], we can assume without loss of generality that \(w^+ \in {\mathcal {C}}^2 ((x_\ell ^+, x_h ^+)) \cap {\mathcal {C}}^0 ({\mathbb {R}})\). Let \(\mu _{0} = \mu _+\). Consider two consecutive switching times of any consumer admissible strategy, say \(\sigma _{2i}\) and \(\sigma _{2i +1}\), for \(i \ge 0\) with the convention \(\sigma _0 =0\), and recall that over \([\sigma _{2i}, \sigma _{2i + 1})\) the state process X has drift \(\mu _+\). Applying Itô’s formula to \(e^{\beta t} w^+ (X_t)\) over the interval \([\sigma _{2i} \wedge T , \sigma _{2i +1}\wedge T)\), for some finite \(T >0\), we obtainUsing the dynamics \(\text {d}X_s = \mu _+ \text {d}s + \sigma \text {d}W_s \text {d}N_s\) between the two switching times above, localizing the martingale part through a suitable sequence of stopping times \(\zeta _n\) and taking expectation on both sides, we obtainwhere we set \(\zeta _{n,T}^{k} := \sigma _{k}\wedge \zeta _n \wedge T\). Notice first that the third summand above vanishes since between \(\sigma _{2i}\) and \(\sigma _{2i +1}\), the state process X can jump only due to the impulses of the producer; hence, at any of such jumps the \({\mathcal {C}}^0\)pasting condition at \(x^+_h\) yieldsRegarding the second summand, we use the variational inequality (27) so that we can writeNow, letting \(n \rightarrow \infty \) we obtain by dominated convergence thatAnalogously, we can get the same inequality between the switching times \(\sigma _{2i 1}\) and \(\sigma _{2i}\) for \(i \ge 1\) with \(w^\) replacing \(w^+\), so summing them all up we haveNote that by admissibility \(\sum _{i \ge 1} e^{\beta \sigma _i} \in L^2 ({\mathbb {P}})\), which implies \(\sup _{i \ge 1} \sigma _i = +\infty \) almost surely. Then, using the \({\mathcal {C}}^0\)pasting conditions in (30) and letting \(T \rightarrow \infty \), we finally obtainfor any admissible consumer strategy \((\sigma _i)\). Applying the same arguments to the sequence \(({\hat{\sigma }}_i)\) we would get equalities instead of inequalities everywhere. The proof for the case \(\mu _{0}=\mu _\) is analogous and therefore is omitted.
$$\begin{aligned} e^{\beta (\sigma _{2i +1}\wedge T)} w^+ (X_{\sigma _{2i +1}\wedge T})&= e^{\beta (\sigma _{2i}\wedge T)} w^+ (X_{\sigma _{2i}\wedge T})\\&\quad + \int _{\sigma _{2i} \wedge T} ^{\sigma _{2i +1} \wedge T} e^{\beta s} \left\{ w^{+}_x (X_s ) \text {d}X_s + \frac{\sigma ^2}{2}\omega _{xx} ^{+} (X_s ) \text {d}s \beta X_s \text {d}s \right\} \\&\quad + \sum _{\sigma _{2i} \wedge T < u \le \sigma _{2i+1} \wedge T} e^{\beta s}\left\{ \Delta w^+ (X_s) + w_x ^{+} (X_s) \Delta N_s\right\} . \end{aligned}$$
$$\begin{aligned}&{\mathbb {E}} \left[ e^{\beta \zeta ^{2i+1} _{n,T}} w^+ (X_{\zeta ^{2i+1}_{n,T}})\right] = {\mathbb {E}} \left[ e^{\beta \zeta ^{2i} _{n,T}} w^+ (X_{\zeta ^{2i} _{n,T}})\right] \\&\quad +\, {\mathbb {E}} \left[ \int _{\zeta ^{2i} _{n,T}} ^{\zeta ^{2i+1} _{n,T}} e^{\beta s} \left\{ w_x ^{+} (X_s )\mu _+ + \frac{\sigma ^2}{2} w_{xx} ^{+} (X_s ) \beta X_s \right\} \text {d}s \right] \\&\quad +\, {\mathbb {E}} \left[ \sum _{\zeta ^{2i} _{n,T} \le s < \zeta ^{2i+1} _{n,T}} e^{\beta s} \Delta w^+ (X_s)\right] , \end{aligned}$$
$$\begin{aligned} \Delta w^+ (X_s ) = (w^+ (X_s )  w^+ (X_{s} )){\mathbf {1}}_{(\Delta X_s \ne 0)} = (w^+ (x_h ^*) w^+ (x_h ^+)){\mathbf {1}}_{(\Delta X_s \ne 0)} = 0. \end{aligned}$$
$$\begin{aligned} {\mathbb {E}} \left[ e^{\beta \zeta ^{2i+1} _{n,T}} w^+ (X_{\zeta ^{2i+1} _{n,T}})\right] \le {\mathbb {E}} \left[ e^{\beta \zeta ^{2i} _{n,T}} w^+ (X_{\zeta ^{2i} _{n,T}})\right]  {\mathbb {E}} \left[ \int _{\zeta ^{2i} _{n,T}} ^{\zeta ^{2i+1} _{n,T}} e^{\beta s} \pi _c (X_s ) \text {d}s \right] . \end{aligned}$$
$$\begin{aligned}&{\mathbb {E}} \left[ e^{\beta \sigma _{2i+1} \wedge T} w^+ (X_{\sigma _{2i+1}\wedge T})\right] \le {\mathbb {E}} \left[ e^{\beta \sigma _{2i} \wedge T} w^+ (X_{\sigma _{2i}\wedge T})\right] \\&\quad \, {\mathbb {E}} \left[ \int _{\sigma _{2i}} ^{\sigma _{2i+1}} e^{\beta s} \pi _c (X_{s \wedge T} ) \text {d}s \right] , \quad i \ge 0. \end{aligned}$$
$$\begin{aligned}  {\mathbb {E}} \left[ \int _0 ^{(\sup _i \sigma _i) \wedge T} \pi _c (X_s) \text {d}s\right]&\ge \sum _{i \ge 0} {\mathbb {E}} \left[ e^{\beta \sigma _{2i+1} \wedge T} w^+ (X_{\sigma _{2i+1}\wedge T})  e^{\beta \sigma _{2i} \wedge T} w^+ (X_{\sigma _{2i}\wedge T}) \right] \\&\quad + \sum _{i \ge 1} {\mathbb {E}} \left[ e^{\beta \sigma _{2i} \wedge T} w^ (X_{\sigma _{2i}\wedge T})  e^{\beta \sigma _{2i1} \wedge T} w^ (X_{\sigma _{2i1}\wedge T}) \right] . \end{aligned}$$
$$\begin{aligned} {\mathbb {E}} \left[ \int _0 ^{+\infty } \pi _c (X_s) ds\right] + \sum _{i \ge 1} {\mathbb {E}} \left[ e^{\beta \sigma _i} h_0 \right] \le w^+ (x) , \end{aligned}$$
Step 2: admissibility To conclude we show that the switching times \({\hat{\sigma }}_i\) are admissible, i.e., they belong to the set \({\mathcal {A}}_c\). To do so, notice first that \(({\hat{\sigma }}_i)\) is a sequence of \([0,\infty )\)valued stopping times. Hence, it remains to show that a.s. \({\hat{\sigma }}_i < {\hat{\sigma }}_{i+1}\) for all \(i\ge 0\), and \(\sum _{i \ge 1} e^{\beta {\hat{\sigma }}_i} \in L^2 ({\mathbb {P}})\). The former follows from \(y_\ell < y_h\). For the latter, we can proceed as in the proof of [1, Prop. 4.7], whose main idea is to write each \(\sigma _i\) as a sum of independent exit times for some (scaled) Brownian motion with possibly different drifts and initial conditions. First, let us denote \((\tau ^\prime _k)_{k \ge 1}\) the increasing sequence of stopping times exhausting the intervention times of both players. Therefore, we havehence it suffices to prove that \(\sum _{k \ge 1} e^{ 2\beta \tau ^\prime _k } \in L^1 ({\mathbb {P}})\). Now, notice that \(\tau ^\prime _k\), \(k \ge 1\), can be represented as \(\sum _{r=1}^k \zeta _r\), where \(\zeta _r\) is a sequence of independent random variables distributed as the exit time, say \(\zeta ^{z,\mu }\), of one of the processes \(z + \mu t + \sigma W_t\) withfrom the respective intervalsDue to the independence of the sequence \(\zeta _r\), we havewhich is a convergent geometric series, due to \(\beta >0\) and the fact that \(\zeta ^{z,\mu } > 0\) almost surely for all \((z,\mu ) \in {\mathcal {Z}}^\pm \). This shows that sequence of switching times \({\hat{\sigma }}_i\) is an admissible consumer’s strategy and concludes the proof. \(\square \)
$$\begin{aligned} {\mathbb {E}}\left[ \left( \sum _{i \ge 1} e^{\beta \sigma _i}\right) ^2\right]&\le {\mathbb {E}}\left[ \left( \sum _{k \ge 1} e^{\beta \tau ^\prime _k}\right) ^2\right] \le \lim _{m \rightarrow \infty } 2 {\mathbb {E}}\left[ \sum _{1\le k \le r \le m} e^{ \beta (\tau ^\prime _r + \tau ^\prime _k)}\right] \\&\le \lim _{m \rightarrow \infty } 2 {\mathbb {E}}\left[ \sum _{1\le k \le m} e^{ 2\beta \tau ^\prime _k }\right] = 2 {\mathbb {E}}\left[ \sum _{k \ge 1} e^{ 2\beta \tau ^\prime _k }\right] , \end{aligned}$$
$$\begin{aligned} (z, \mu ) \in {\mathcal {Z}}^\pm := \left\{ (y_h, \mu _+),(y_{\ell }, \mu _ ), (x_{h}^{+ *} , \mu _+), ( x_\ell ^{ *} , \mu _) \right\} , \end{aligned}$$
$$\begin{aligned} (\infty , y_h ), \quad (y_\ell , +\infty ), \quad (\infty , x_h ^+), \quad (x_\ell ^ , +\infty ). \end{aligned}$$
$$\begin{aligned} {\mathbb {E}}\left[ \sum _{k \ge 1} e^{ 2\beta \tau ^\prime _k }\right] = \sum _{k \ge 1} \prod _{r=1}^k {\mathbb {E}}\left[ e^{ 2\beta \zeta _r }\right] \le \sum _{k \ge 1} \left( {\mathbb {E}}\left[ e^{ 2\beta \min _{(z,\mu ) \in {\mathcal {Z}}^\pm }\zeta ^{z,\mu } }\right] \right) ^k , \end{aligned}$$
7.3 Proofs of Propositions 3 and 4
Proof of Proposition 3
Let \(v: \{\mu _ , \mu _+ \} \times {\mathbb {R}} \rightarrow {\mathbb {R}}\) be the function defined as \(v(\mu _\pm , x) = v^\pm (x)\), with \(v^\pm \) as in (47). By construction, the functions \((v^+, v^)\) solve the system of VIs in (45) and moreover \(v^\pm \in {\mathcal {C}}^2 ((x_\ell ^+ , x_h ^) \setminus \{y_\ell , y_h\}) \cap {\mathcal {C}}^0 ({\mathbb {R}})\), hence not necessarily \({\mathcal {C}}^1\) at the points \(y_\ell , y_h\). Recall that \(\mu _t = \mu _+ \sum _{i=0}^\infty {\mathbf {1}}_{\{ \sigma _{2i} \le t< \sigma _{2i+1} \} } + \mu _ \sum _{i=1}^\infty {\mathbf {1}}_{\{ \sigma _{2i1} \le t < \sigma _{2i} \} }\), \(t \ge 0\), where without loss of generality we can assume \(\sigma _i\) is the ith switching instance taken by the consumer in the case \(\mu _{0}=\mu _+\) (remember the convention \(\sigma _0 =0\)). The other case \(\mu _{0}=\mu _\) can be treated in a similar way, it is therefore omitted. We split the rest of the proof in two steps.
Step 1: optimality The following verification argument proves that such functions coincide with the bestresponse payoffs of the producer and that the impulse strategy as in the statement is optimal provided it is admissible. First, by an approximation procedure as in the first part of the proof in [1, Theorem 3.3], we can assume without loss of generality that \(v^\pm \in {\mathcal {C}}^2 ((x_\ell ^+, x_h ^)) \cap {\mathcal {C}}^0 ({\mathbb {R}})\). Consider any producer admissible strategy \((\tau _i, \xi _i)_{i \ge 1}\) as in the first part of Definition 1. Applying Itô’s formula to \(e^{\beta t} v (\mu _t, X_t)\) over the interval \([ \sigma _{2i} \wedge T , \sigma _{2i +1}\wedge T)\), for some finite \(T >0\), we obtainwhere the first equality comes from the fact that over \([ \sigma _{2i} \wedge T , \sigma _{2i +1}\wedge T)\), the drift equals \(\mu _+\) (remember that \(\mu _{0}=\mu _+\)). Using the dynamics \(d\text {d}_s = \mu _+ \text {d}s + \sigma \text {d}W_s \text {d}N_s\), with \(N_t := \sum _{i \ge 1} \xi _i {\mathbf {1}}_{\{ \tau _i \le t \} }\), between the two switching times above, localizing the martingale part through a suitable sequence of stopping times \(\zeta _n\) and taking expectation on both sides, we obtainwhere we set \(\zeta _{n,T}^{k} := \sigma _{k}\wedge \zeta _n \wedge T\). For the third summand above, notice that between \(\sigma _{2i}\) and \(\sigma _{2i +1}\), due to the nonlocal term in the variational inequality (45), the state process X can jump only due to the impulses of the producer and at any of such jumps we haveimplyingRegarding the second summand, we use the variational inequality (45) so that we can writeNow, due to \(X_t \in [ x_\ell ^+ , x_h ^]\) for all \(t \ge 0\), letting \(n \rightarrow \infty \) we obtain by dominated convergence thatAnalogously, we can get the same inequality between the switching times \(\sigma _{2i 1}\) and \(\sigma _{2i}\) for \(i \ge 1\) with \(v^\) replacing \(v^+\), so summing them all up we haveNote that by admissibility \(\sum _{i \ge 1} e^{\beta \sigma _i} \in L^2 ({\mathbb {P}})\), which implies \(\sup _{i \ge 0} \sigma _i = +\infty \) almost surely. Then, using the \({\mathcal {C}}^0\)pasting conditions in (48) and letting \(T \rightarrow \infty \), we finally obtainfor any admissible producer’s strategy \((\tau _i , \xi _i)_{i \ge 1}\). Applying the same arguments to the impulse strategy \((\tau ^* _i , \xi ^* _i)_{i \ge 1}\) as in the statement we would get equalities instead of inequalities everywhere. Notice that the secondorder conditions (52) guarantee the optimality of the impulses \(\xi _i ^*\).
$$\begin{aligned} e^{\beta (\sigma _{2i +1}\wedge T)} v (\mu _{\sigma _{2i +1}\wedge T}, X_{\sigma _{2i +1}\wedge T})&= e^{\beta (\sigma _{2i +1}\wedge T)} v^+ (X_{\sigma _{2i +1}\wedge T}) \\&\quad = e^{\beta (\sigma _{2i}\wedge T)} v^+ (X_{\sigma _{2i}\wedge T})\\&\qquad + \int _{\sigma _{2i} \wedge T} ^{\sigma _{2i +1} \wedge T} e^{\beta s} \left\{ v^{+}_x (X_s ) dX_s + \frac{\sigma ^2}{2} v^+ _{xx} (X_s ) ds \beta v^+(X_s) ds \right\} \\&\qquad + \sum _{\sigma _{2i} \wedge T < s \le \sigma _{2i+1} \wedge T} e^{\beta s}\left\{ \Delta v^+ (X_s) + v_x ^{+} (X_s) \Delta N_s \right\} , \end{aligned}$$
$$\begin{aligned}&{\mathbb {E}} \left[ e^{\beta \zeta ^{2i+1} _{n,T}} v^+ (X_{\zeta ^{2i+1}_{n,T}})\right] = {\mathbb {E}} \left[ e^{\beta \zeta ^{2i} _{n,T}} v^+ (X_{\zeta ^{2i} _{n,T}})\right] \\&\quad +\, {\mathbb {E}} \left[ \int _{\zeta ^{2i} _{n,T}} ^{\zeta ^{2i+1} _{n,T}} e^{\beta s} \left\{ v_x ^{+} (X_s )\mu _+ + \frac{\sigma ^2}{2} v_{xx} ^{+} (X_s ) \beta v^+(X_s) \right\} \text {d}s \right] \\&\quad +\, {\mathbb {E}} \left[ \sum _{\zeta ^{2i} _{n,T} \le s < \zeta ^{2i+1} _{n,T}} e^{\beta s} \Delta v^+ (X_s)\right] , \end{aligned}$$
$$\begin{aligned} \Delta v^+ (X_{\tau _i} )&\le K_p (\xi _i), \quad i \ge 0, \end{aligned}$$
$$\begin{aligned} {\mathbb {E}} \left[ \sum _{\zeta ^{2i} _{n,T} \le s< \zeta ^{2i+1} _{n,T}} e^{\beta s} \Delta v^+ (X_s)\right] \le {\mathbb {E}} \left[ \sum _{j: \zeta ^{2i} _{n,T} \le \tau _j < \zeta ^{2i+1} _{n,T}} e^{\beta s} K_p (\xi _j )\right] . \end{aligned}$$
$$\begin{aligned} {\mathbb {E}} \left[ e^{\beta \zeta ^{2i+1} _{n,T}} v^+ (X_{\zeta ^{2i+1} _{n,T}})\right] \le&\, {\mathbb {E}} \left[ e^{\beta \zeta ^{2i} _{n,T}} v^+ (X_{\zeta ^{2i} _{n,T}})\right]  {\mathbb {E}} \left[ \int _{\zeta ^{2i} _{n,T}} ^{\zeta ^{2i+1} _{n,T}} e^{\beta s} \pi _p (X_s ) \text {d}s \right] \\&+ {\mathbb {E}} \left[ \sum _{j: \zeta ^{2i} _{n,T} \le \tau _j < \zeta ^{2i+1} _{n,T}} e^{\beta s} K_p (\xi _j )\right] . \end{aligned}$$
$$\begin{aligned} {\mathbb {E}} \left[ e^{\beta \sigma _{2i+1} \wedge T} v^+ (X_{\sigma _{2i+1}\wedge T})\right]&\le \, {\mathbb {E}} \left[ e^{\beta \sigma _{2i} \wedge T} v^+ (X_{\sigma _{2i}\wedge T})\right]  {\mathbb {E}} \left[ \int _{\sigma _{2i}} ^{\sigma _{2i+1}} e^{\beta s} \pi _p (X_{s \wedge T} ) \text {d}s \right] \\&\quad +\, {\mathbb {E}} \left[ \sum _{j: \sigma _{2i} \le \tau _j < \sigma _{2i+1}} e^{\beta s} K_p (\xi _j )\right] , \quad i \ge 0. \end{aligned}$$
$$\begin{aligned}  {\mathbb {E}} \left[ \int _0 ^{(\sup _i \sigma _i) \wedge T} \pi _p (X_s) ds\right]&\ge \sum _{i \ge 0} {\mathbb {E}} \left[ e^{\beta \sigma _{2i+1} \wedge T} v^+ (X_{\sigma _{2i+1}\wedge T})  e^{\beta \sigma _{2i} \wedge T} v^+ (X_{\sigma _{2i}\wedge T}) \right] \nonumber \\&\quad + \sum _{i \ge 1} {\mathbb {E}} \left[ e^{\beta \sigma _{2i} \wedge T} v^ (X_{\sigma _{2i}\wedge T})  e^{\beta \sigma _{2i1} \wedge T} v^ (X_{\sigma _{2i1}\wedge T}) \right] . \end{aligned}$$
(70)
$$\begin{aligned} {\mathbb {E}} \left[ \int _0 ^{+\infty } \pi _p (X_s) ds\right] + \sum _{i \ge 1} {\mathbb {E}} \left[ e^{\beta \tau _i} K_p (\xi _i) \right] \le v^+ (x) , \end{aligned}$$
Step 2: admissibility To conclude the proof, we need to show that the impulse strategy \((\tau _i ^* , \xi _i ^*)_{i \ge 1}\) is admissible as in the first part of Definition 1. Property 1 is granted by the dynamics of the state variable X and the fact that producer’s thresholds satisfy \(x_\ell ^+< x_\ell ^{+*}, x_h ^{*} < x_h ^\).
Property 2 is obviously satisfied by definition of the optimal impulses \(\xi _i ^*\) as in the statement. Hence, we are left with showing property 3, i.e., \(\sum _{i \ge 1} e^{\beta \tau ^* _i } \xi ^* _i \in L^2 ({\mathbb {P}})\). We can proceed once more as in the proof of [1, Prop. 4.7] and in the second part of Proposition 2’s proof. We provide all details for reader’s convenience. First, let us denote \((\tau ^\prime _k)_{k \ge 1}\) the increasing sequence of stopping times exhausting the intervention times of both players. Since the optimal impulses \((\xi _i ^*)_{i \ge 1}\) are uniformly bounded by some positive constant, say \(\kappa \), we havehence it suffices to prove that \(\sum _{k \ge 1} e^{ 2\beta \tau ^\prime _k } \in L^1 ({\mathbb {P}})\). Now, notice that \(\tau ^\prime _k\), \(k \ge 1\), can be represented as \(\sum _{r=1}^k \zeta _r\), where \(\zeta _r\) is a sequence of independent random variables distributed as the exit time, say \(\zeta ^{z,\mu }\), of one of the processes \(z + \mu t + \sigma W_t\) withfrom the respective intervalsDue to the independence of the sequence \(\zeta _r\) we havewhich is a convergent geometric series, due to \(\beta >0\) and the fact that \(\zeta ^{z,\mu } > 0\) almost surely for all \((z,\mu ) \in {\mathcal {Z}}^\pm \). This shows that \((\tau ^*_i , \xi _i ^*)_{i \ge 1}\) is an admissible producer’s impulse strategy and concludes the proof. \(\square \)
$$\begin{aligned} {\mathbb {E}}\left[ \left( \sum _{i \ge 1} \xi ^*_i e^{\beta \tau _i}\right) ^2\right]&\le \kappa ^2 {\mathbb {E}}\left[ \left( \sum _{k \ge 1} e^{\beta \tau ^\prime _k}\right) ^2\right] \le \lim _{m \rightarrow \infty } 2 {\mathbb {E}}\left[ \sum _{1\le k \le r \le m} e^{ \beta (\tau ^\prime _r + \tau ^\prime _k)}\right] \\&\le \lim _{m \rightarrow \infty } 2\kappa ^2 {\mathbb {E}}\left[ \sum _{1\le k \le m} e^{ 2\beta \tau ^\prime _k }\right] = 2\kappa ^2 {\mathbb {E}}\left[ \sum _{k \ge 1} e^{ 2\beta \tau ^\prime _k }\right] , \end{aligned}$$
$$\begin{aligned} (z, \mu ) \in {\mathcal {Z}}^\pm := \{ (y_h, \mu _+),(y_{\ell }, \mu _ ), (x_{h}^{ *} , \mu _), ( x_\ell ^{+ *} , \mu _+)\} , \end{aligned}$$
$$\begin{aligned} (\infty , y_h ), \quad (y_\ell , +\infty ), \quad (\infty , x_h ^), \quad (x_\ell ^+ , +\infty ). \end{aligned}$$
$$\begin{aligned} {\mathbb {E}}\left[ \sum _{k \ge 1} e^{ 2\beta \tau ^\prime _k }\right] = \sum _{k \ge 1} \prod _{r=1}^k {\mathbb {E}}\left[ e^{ 2\beta \zeta _r }\right] \le \sum _{k \ge 1} \left( {\mathbb {E}}\left[ e^{ 2\beta \min _{(z,\mu ) \in {\mathcal {Z}}^\pm }\zeta ^{z,\mu } }\right] \right) ^k , \end{aligned}$$
Proof of Proposition 4
Here, notice that \(x_h ^+ = y_h\) so that, given producer’s priority in case of simultaneous interventions (cf. Remark 2), the drift is always equal to \(\mu _+\) (recall that we are in the case \(\mu _{0} = \mu _+\)). Hence, this proof can be performed as the one of Proposition 3, by ignoring the intervals where the drift is \(\mu _\) so that the second half in the RHS of inequality (70) is zero. The admissibility is proved in the same way. The details are therefore omitted. \(\square \)
7.4 Equilibrium Dynamics Computation
Let (a, b) be an arbitrary interval and \(x \in (a,b)\) be an interior starting location. We define \(\delta _+(x;a,b)\) to be the first passage time associated with the interval (a, b) of a Brownian Motion with drift \(\mu _+\) starting from x and \(P_+(x; a, b)\) to be the probability that this BM hits a before b (similarly for \(\delta _(x;a,b)\) and \(P_(x; a, b)\) associated with drift \(\mu _\)). These quantities admit explicit expressions, see [6].
The expected time \(\tau _ := \inf \{ t : \mu _t = \mu _\}\) for \(\mu _t\) to switch from \(\mu _+\) to \(\mu _\) within a doubleswitch and one–sided impulse equilibrium is thenwhere \({\mathbf {P}}\) is the transition matrix of \(M^*_n\). Above, the first term denotes the time to either reach \(x_\ell ^+\) (producer impulses up) or \(y_h\) (switch to contraction); the second term counts the additional time if \(x_\ell ^+\) is reached first multiplied by the respective probability \(P_+(x_0; x^+_\ell , y_h)\). Let \(\vec {\zeta }\) be the resulting vector of expected sojourn times. Then, the longrun proportion of time that \(X^*\) carries a positive drift (\(\mu _+\)) isand similarly the longrun proportion associated with a negative drift (\(\mu _\)) is \(\rho _=1\rho _+\).
$$\begin{aligned} {\mathbb {E}}\big [ \tau _\big ]={\mathbb {E}}\big [\delta _+(x_0; x^+_\ell , y_h)\big ]+\frac{P_+(x_0; x^+_\ell , y_h)}{{\mathbf {P}}_{I^+_h, S_}}{\mathbb {E}}_+\big [\delta (x^{+*}_\ell ; x^+ _\ell , y_h)\big ], \end{aligned}$$
(71)
$$\begin{aligned} \rho _+ =\frac{\Pi _{S_+}\zeta _{S_+}+\Pi _{I^+_\ell }\zeta _{I^+_\ell }+\Pi _{I^+_h}\zeta _{I^+_h}}{\vec {\Pi }\cdot \vec {\zeta }^\dagger }, \end{aligned}$$
(72)
Acknowledgements
We thank the associate editor and two anonymous reviewers for their helpful feedback on the earlier version of the article. Aid acknowledges the Finance for Energy Market Research Initiative. Ludkovski is partially supported by NSF DMS1736439.
Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.