In this paper, we study variational discretization method for parabolic optimization problems. Firstly, we obtain some convergence and superconvergence analysis results of the approximation scheme. Secondly, we derive a posteriori error estimates of the approximation solutions. Finally, we present variational discretization approximation algorithm and adaptive variational discretization approximation algorithm for parabolic optimization problems and do some numerical experiments to confirm our theoretical results.
1 Introduction
Optimal control problems (OCPs) play an important role in scientific and engineering numerical simulation, and nowadays are strongly utilized in biology, economics, and finance. Finite element approximation is one of the widely used numerical methods in computing OCPs. A systematic introduction of finite element method (FEM) for partial differential equations or OPCs can be found in [1, 2, 8, 20, 30, 33].
There have been abundant researches on FEM approximation for elliptic optimal control problems (EOCPs). The pioneering work of the late 1970s in the area of finite element approximation for EOCPs includes [10, 11], where a priori error estimates were established. Then a lot of superconvergence and a posteriori error estimate results of FEM solving different kinds of EOCPs were obtained. For instance, superconvergence of FEM for linear, bilinear, and semilinear EOCPs were derived in [27, 32], and [4], respectively, while residual- and recovery-based a posteriori error estimates of FEM for distributed OCPs were constructed in [22] and [18], respectively. Moreover, adaptive FEM for EOCPs were presented in [3, 13, 17]. It is worth mentioning that some similar results of mixed FEM for EOCPs can be found in [5, 7, 16].
Anzeige
In the last decade, numerical solution of parabolic optimal control problems (POCPs) became a hot research topic. A priori error estimates of FEM, space-time FEM, and Petrov–Galerkin Crank–Nicolson approximation for POCPs were given in [12, 25], and [26], respectively. In [24], the authors constructed an adaptive space-time FEM for POCPs. There are also some results on residual-type a posteriori error estimates of FEM or mixed FEM for POCPs which can be found in [6, 23, 31], where the authors do not give any adaptive FEM approximation for POCPs.
Hinze presented a variational discretization (VD) concept for control constrained optimization problems in [14]. It can not only save some computation cost but also improve the error of the control variable. Recently, VD approximation of convection dominated diffusion EOCP with control constraints and POCP with pointwise state constraints were investigated in [15] and [9], respectively. We have investigated VD approximation for a linear POCP in [28].
The purpose of this work is to investigate VD approximation for a POCP with control constraints. We first analyze the convergence and superconvergence of the VD approximation scheme and derive a posteriori error estimates, then construct an adaptive VD approximation algorithm for the POCP. Finally, two numerical examples are provided to verify theoretical results.
where \(\nu >0\) represents the weight of the cost of the control, Ω is a bounded domain in \(\mathbb{R}^{2}\) with a Lipschitz boundary ∂Ω, \(0< T<+\infty \) and \(J=[0,T]\). The coefficient \(A(x)=(a_{ij}(x))_{2\times 2}\in (W^{1,\infty }(\bar{\Omega }))^{2 \times 2}\) is such that for any \(\xi \in \mathbb{R}^{2}\), \((A(x)\xi )\cdot \xi \geq c| \xi | ^{2}\) with \(c>0\). We assume that K is a nonempty closed convex set in \(L^{2}(J;L^{2}(\Omega ))\), defined by
In this paper, we adopt the standard notation \(W^{m,q}(\Omega )\) for Sobolev spaces on Ω with norm \(\|\cdot \|_{W^{m,q}(\Omega )}\) and seminorm \(|\cdot |_{W^{m,q}(\Omega )}\). We denote \(W^{m,2}(\Omega )\) by \(H^{m}(\Omega )\) and set \(H_{0}^{1}(\Omega )\equiv \{ v \in H^{1}(\Omega ): v|_{\partial \Omega } =0 \} \). We denote by \(L^{s}(J;W^{m,q}(\Omega ))\) the Banach space of \(L^{s}\) integrable functions from J into \(W^{m,q}(\Omega )\) with norm \(\|v\|_{L^{s}(J;W^{m,q}(\Omega ))}= (\int _{0}^{T}\|v\|_{W^{m,q}( \Omega )}^{s}\,dt )^{\frac{1}{s}}\) for \(s\in [1,\infty )\) and the standard modification for \(s=\infty \). As in [21], we can define the space \(H^{l}(J;W^{m,q}(\Omega ))\). In addition, c or C denotes a generic positive constant independent of h and k.
The outline of the paper is as follows. In Sect. 2, we give a VD approximation for the model problem. In Sect. 3, we derive some convergence and superconvergence analysis results for the control, state, and costate variables. In Sect. 4, we establish a posteriori error estimates for the approximation scheme. We present a VD approximation algorithm and adaptive VD approximation algorithm for the POCP and do some numerical experiments to confirm our theoretical results in the last section.
2 VD approximation of POCP
In this section, we present a VD approximation for the model problem (1). For ease of exposition, we denote \(L^{p}(J;W^{m,q}(\Omega ))\) by \(L^{p}(W^{m,q})\). Let \(W=H_{0}^{1}(\Omega )\) and \(U=L^{2}(\Omega )\). Moreover, we denote \(\|\cdot \|_{H^{m}(\Omega )}\) and \(\|\cdot \|_{L^{2}(\Omega )}\) by \(\|\cdot \|_{m}\) and \(\|\cdot \|\), respectively.
It is well known (see, e.g., [20, 23]) that problem (2) has a unique solution \((y,u)\), and the pair \((y,u)\in (H^{1}(L^{2})\cap L^{2}(H^{1}) )\times K\) is the solution of (2) if and only if there is an adjoint state \(p\in H^{1}(L^{2})\cap L^{2}(H^{1})\) such that the triplet \((y,p,u)\) satisfies the following optimality conditions:
Let \(\mathcal{T}^{h}\) be regular triangulations of Ω and \(\bar{\Omega }=\bigcup_{\tau \in \mathcal{T}^{h}}\bar{\tau }\). Let \(h=\max_{\tau \in \mathcal{T}^{h}}\{h_{\tau }\}\), where \(h_{\tau }\) denotes the diameter of the element τ. Moreover, we set
where \(\mathbb{P}_{1}\) is the space of polynomials up to order 1.
We now consider the time discretization for problem (2). Let \(k>0\), \(N=T/k\in \mathbb{Z}^{+}\), \(t_{n}=n k\), \(n=0,1,\ldots ,N\). Set \(\varphi ^{n}=\varphi (x,t_{n})\) and
where \(l=0\) for the control u and the state y and \(l=1\) for the adjoint state p, with the standard modification for \(p=\infty \). Just for simplicity, we denote \(|\!|\!|\cdot |\!|\!|_{l^{p}(J;W^{m,q}(\Omega ))}\) by \(|\!|\!|\cdot |\!|\!|_{l^{p}(W^{m,q})}\) and let
where \(P_{h}\) is an elliptic projection operator which will be specified later.
It follows (see, e.g., [15, 23]) that the control problem (9) has a unique solution \((y_{h}^{n},u_{h}^{n} )\), \(n=1,2,\ldots ,N\), and \((y_{h}^{n},u_{h}^{n} )\in W^{h}\times K\), \(n=1,2,\ldots ,N\), is the solution of (9) if and only if there is an adjoint state \(p_{h}^{n-1}\in W^{h}\), \(n=1,2,\ldots ,N\), such that the triplet \((y_{h}^{n},p_{h}^{n-1},u_{h}^{n} )\in W^{h}\times W^{h} \times K\), \(n=1,2,\ldots ,N\), satisfies the following optimality conditions:
For the approximation scheme (10)–(14), some convergence and superconvergence results will be derived in this section. We define the following intermediate variables. Let \((y_{h}^{n}(u),p_{h}^{n-1}(u) )\in W^{h}\times W^{h}\), \(n=1,2, \ldots ,N\), satisfy the following system:
Let\((y,p,u)\)and\((y_{h}(u),p_{h}(u) )\)be the solutions of (3)–(7) and (16)–(18), respectively. Suppose that\(y,p\in l^{2}_{D}(H^{2})\cap H^{1}(H^{2})\cap H^{2}(L^{2})\). Then
$$\begin{aligned} \bigl|\!\bigl|\!\bigl|y_{h}(u)-y\bigr|\!\bigr|\!\bigr|_{l^{2}(L^{2})}+\bigl|\!\bigl|\!\bigl|p_{h}(u)-p\bigr|\!\bigr|\!\bigr|_{l^{2}(L^{2})}\leq C \bigl(h^{2}+k\bigr). \end{aligned}$$
(22)
Proof
From the definition of \(P_{h}\), (3) and (16), for any \(w_{h}\in W^{h}\), \(n=1,2,\ldots ,N\), we have
For the control variable, we derive the following convergence result.
Theorem 3.1
Let\((y,p,u)\)and\((y_{h},p_{h},u_{h} )\)be the solutions of (3)–(7) and (10)–(14), respectively. Assume that all the conditions in Lemma3.1are satisfied. Then, we have
$$ |\!|\!|u-u_{h}|\!|\!|_{l^{2}(L^{2})}\leq C \bigl(h^{2}+k \bigr).$$
For the control, state and costate variables, we have the following results.
Theorem 3.2
Let\((y,p,u )\)and\((y_{h},p_{h},u_{h} )\)be the solutions (3)–(7) and (10)–(14), respectively. Assume that all the conditions in Theorem3.1are valid. Then
$$ |\!|\!|P_{h}y-y_{h} |\!|\!|_{l^{2}(H^{1})}+|\!|\!|P_{h}p-p_{h}|\!|\!|_{l^{2}(H^{1})} \leq C \bigl(h^{2}+k \bigr). $$
(36)
Proof
From (3) and (10), for any \(w_{h}\in W^{h}\), \(n=1,2,\ldots ,N\), we obtain the following error equation:
In this section, a posteriori error estimates of recovery type will be established by utilizing the superconvergence results derived in Sect. 3. Similar to the Z–Z patch recovery in [34, 35], we introduce recovery operators \(R_{h}\) and \(G_{h}\) for the state and the adjoint state. Let \(R_{h}v\) be a continuous piecewise linear function (without zero boundary constraint) and let the values of \(R_{h}v\) on the nodes be defined by a least-squares argument on element patches surrounding the nodes. The gradient recovery operator \(G_{h}v= (R_{h}v_{x_{1}},R_{h}v_{x_{2}} )\). The details also can be found in [18].
Theorem 4.1
Let\((y,p,u)\)and\((y_{h},p_{h},u_{h})\)be the solutions of (3)–(7) and (10)–(14), respectively. Suppose that all the conditions in Theorem3.2are valid and\(y,p\in L^{2}_{D}(J;H^{3}(\Omega ))\). Then
$$ |\!|\!|G_{h}y_{h}- \nabla y|\!|\!|_{l^{2}(L^{2})}+|\!|\!|G_{h}p_{h}- \nabla p|\!|\!|_{l^{2}(L^{2})} \leq C \bigl(h^{2}+k \bigr).$$
(47)
Proof
Let \(y_{I}\) be the piecewise linear Lagrange interpolation of y. From Theorem 2.1.1 in [19], we have
$$ \Vert G_{h}y-y_{I} \Vert _{1}\leq Ch^{2} \Vert y \Vert _{3}.$$
(48)
According to the standard interpolation error estimate technique (see, e.g., [8]), we have
$$ \Vert G_{h}y_{I}- \nabla y \Vert \leq Ch^{2} \vert y \vert _{3}.$$
We do some numerical experiments to demonstrate our theoretical results. For an acceptable error Tol, we present a VD approximation algorithm (see Algorithm 5.1) for the constrained POCP (1). For ease of exposition, we have omitted the subscript h.
×
Similar to [18], by selecting the same meshes for the state and the adjoint state and using \(\eta _{1}\) and \(\eta _{2}\) as mesh refinement indicators for the state and the adjoint state, for an acceptable error \(\mathit{Tol}^{\prime }\), we construct adaptive VD approximation algorithm (see Algorithm 5.2).
×
The following numerical examples were solved with AFEPack which is freely available. Just for simplicity, we let \(\Omega =[0,1]\times [0,1]\), \(T=1\), \(A(x)=E\) be the \(2\times 2\) identity matrix and denote \(|\!|\!|\cdot |\!|\!|_{l^{2}(L^{2})}\) and \(|\!|\!|\cdot |\!|\!|_{l^{2}(H^{1})}\) by \(|\!|\!|\cdot |\!|\!|\) and \(|\!|\!|\cdot |\!|\!|_{1}\), respectively. The convergence order is computed by the following formula: \(\mathit{Rate}=\frac{\log (e_{i+1})-\log (e_{i})}{\log (h_{i+1})-\log (h_{i})}\), where \(e_{i} (e_{i+1})\) denotes the error when the spatial partition size is \(h_{i} (h_{i+1})\). We solve the following POCP:
We use the Algorithm 5.1 to solve the first example. In Fig. 1, we plot the profile of the numerical solution \(u_{h}\) at \(t=0.5\) when \(h=\frac{1}{80}\) and \(k=\frac{1}{640}\). In Table 1, the errors \(|\!|\!|u-u_{h}|\!|\!|\), \(|\!|\!|P_{h}y-y_{h}|\!|\!|_{1}\), and \(|\!|\!|P_{h}p-p_{h}|\!|\!|_{1}\) on a sequence of uniformly refined meshes are shown. It is easy to see \(|\!|\!|u-u_{h}|\!|\!|=\mathcal{O} (h^{2}+k )\), \(|\!|\!|P_{h}y-y_{h}|\!|\!|_{1}=\mathcal{O} (h^{2}+k )\), and \(|\!|\!|P_{h}p-p_{h}|\!|\!|_{1}=\mathcal{O} (h^{2}+k )\), which confirm our theoretical results.
We take a small time size \(k=\frac{1}{100}\) and solve the second example by using the Algorithms 5.1 and 5.2, respectively. Numerical results based on a sequence of uniformly refined meshes and adaptive meshes are listed in Tables 2 and 3, respectively. It is clear that the adaptive meshes generated via the error estimators \(\eta _{1}\) and \(\eta _{2}\) are able to save substantial computational work, in comparison with the uniform meshes.
Although there has been extensive research on FEMs for various POCPs, they mostly focused on convergence and superconvergence (see, e.g., [12, 23‐26, 28, 29]), and the results on convergence and superconvergence were \(\mathcal{O}(h+k)\) and \(\mathcal{O}(h^{\frac{3}{2}}+k)\), respectively. Recently, VD were used to deal with different OCPs in [9, 14, 15, 28], while there was little work on POCPs. Hence, our results on adaptive VD for POCPs are new.
Acknowledgements
The authors are very grateful to both referees for carefully reading of the paper and their comments and suggestions.
Competing interests
The authors declare that they have no competing interests.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.