Published in: Journal of Scientific Computing 1/2018

Open Access 27.09.2017

hp-Adaptive Galerkin Time Stepping Methods for Nonlinear Initial Value Problems

Authors: Irene Kyza, Stephen Metcalfe, Thomas P. Wihler


Abstract

This work is concerned with the derivation of an a posteriori error estimator for Galerkin approximations to nonlinear initial value problems with an emphasis on finite-time existence in the context of blow-up. The structure of the derived estimator leads naturally to the development of both h and hp versions of an adaptive algorithm designed to approximate the blow-up time. The adaptive algorithms are then applied in a series of numerical experiments, and the rate of convergence to the blow-up time is investigated.
Acknowledgements
The authors acknowledge the support of the Swiss National Science Foundation (SNF).

1 Introduction

In this paper we focus on continuous Galerkin (cG) and discontinuous Galerkin (dG) time stepping discretizations as applied to abstract initial value problems of the form
$$\begin{aligned} u'(t)=\mathcal F(t,u(t)),\quad t\in (0,T),\qquad u(0)=u_0. \end{aligned}$$
(1.1)
Here, \(u:(0,T)\rightarrow H\), for some \(T>0\), denotes the unknown solution with values in a real Hilbert space H with inner product  \((\cdot ,\cdot )_H\) and induced norm \(\Vert \cdot \Vert _H\). The initial value \(u_0 \in H\) prescribes the value of the solution at \(t=0\), and \(\mathcal F:[0,T]\times H\rightarrow H\) is a possibly nonlinear operator. We emphasize that we include the case of \(\mathcal F\) being unbounded in the sense that
$$\begin{aligned} \frac{\Vert \mathcal F(t,x)\Vert _H}{\Vert x\Vert _H}\rightarrow \infty \text { as }\Vert x\Vert _H\rightarrow \infty ,\qquad 0\le t\le T. \end{aligned}$$
Note that the existence interval for solutions may be arbitrarily small even for smooth \(\mathcal F\). Indeed, for certain data the solution of (1.1) can become unbounded in finite time, i.e.,
$$\begin{aligned} \Vert u(t)\Vert _H<\infty \text { for }0<t<T_\infty ,\qquad \qquad \lim _{t\nearrow T_\infty }\Vert u(t)\Vert _H = \infty . \end{aligned}$$
This effect is commonly termed finite-time blow-up or sometimes just blow-up and the quantity \(T_{\infty }\) is called the blow-up time.
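To fix ideas with a standard example (not part of the paper's analysis), take \(H=\mathbb {R}\) and \(\mathcal F(t,u)=u^2\): the solution of \(u'=u^2\), \(u(0)=\lambda >0\), is \(u(t)=\lambda /(1-\lambda t)\), which blows up at \(T_\infty =1/\lambda \). A minimal Python check of this behavior:

```python
# Model blow-up problem u' = u^2, u(0) = lam > 0 (illustrative example only).
# Separation of variables gives u(t) = lam / (1 - lam * t), so |u(t)| stays
# finite for t < 1/lam and diverges as t -> 1/lam from below.

def u_exact(t, lam):
    """Exact solution of u' = u^2, u(0) = lam, valid for t < 1/lam."""
    return lam / (1.0 - lam * t)

lam = 2.0
T_blowup = 1.0 / lam        # blow-up time T_infinity = 1/lam = 0.5

# The norm grows monotonically as t approaches the blow-up time.
for t in [0.4, 0.49, 0.499, 0.4999]:
    assert abs(u_exact(t, lam)) < abs(u_exact(t + 0.00005, lam))
```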
The main contributions of this paper are as follows:
  • The derivation of conditional a posteriori error bounds for hp-cG and hp-dG approximations to the nonlinear initial value problem (1.1).
  • The design of efficient adaptive algorithms that lead to accurate approximation of the blow-up time in the case where problem (1.1) exhibits finite time blow-up.
To the best of our knowledge, this is the first time that an adaptive algorithm has been developed for hp-cG and hp-dG time-stepping methods based on rigorous a posteriori error control for problems of the form (1.1). The adaptive procedure that we propose includes both h-adaptive and hp-adaptive variants. Indeed, one of the contributions of this paper is to motivate how we can choose effectively between h- and p-adaptivity locally while taking into account the possible singular behavior of the problem under consideration. In this sense, these results extend the h-adaptive algorithm analyzed for some special cases of (1.1) and Euler-type time discretizations in [5]. In particular, the inclusion of higher order time-stepping schemes and hp-adaptivity allows us to overcome key limitations encountered in [5].
A posteriori error estimators for linear problems tend to be unconditional, that is, they always hold independent of the problem data and the size of the discretization parameters. For nonlinear problems, the situation is more complicated since the existence of a solution to an appropriate error equation (and, thus, of an error bound) usually requires that either the data or the discretization parameters are sufficiently small. As a result, a posteriori error estimators for nonlinear problems tend to be conditional, that is, they only hold provided that an a posteriori verifiable condition (which can be either explicit or implicit) is satisfied. For nonlinear time-dependent problems, there are two commonly used approaches for deriving conditional a posteriori error bounds: continuation arguments, cf. [5, 14], and fixed point arguments, cf. [6, 15]. The a posteriori error bounds that we derive here are obtained by utilizing a local continuation argument.
Galerkin time stepping methods for initial value problems are based on weak formulations and, for both the cG and dG time stepping schemes, the test spaces consist of polynomials that are discontinuous at the time nodes. In this way, the discrete Galerkin formulations decouple into local problems on each time step and the discretizations can therefore be understood as implicit one-step schemes. In the literature, Galerkin time stepping schemes have been extensively analyzed for ordinary differential equations (ODEs), cf. [3, 7–10, 13].
A key feature of Galerkin time stepping methods is their great flexibility with respect to the size of the time steps and the local approximation orders which lends these schemes well to an hp-framework. The hp-versions of the cG and dG time stepping schemes were introduced and analyzed in the works [19, 20, 22, 24]. In particular, in the articles [19, 24] which focus on initial value problems with uniform Lipschitz nonlinearities, the use of the Banach fixed point theorem made it possible to prove existence and uniqueness results for the discrete Galerkin solutions which are independent of the local approximation orders; these results have been extended to discrete Peano-type existence results in the context of more general nonlinearities in [12]. We emphasize that the hp-approach is well known for its ability to approximate smooth solutions with possible local singularities at high algebraic or even exponential rates of convergence; see, e.g., [4, 20, 21, 23] for the numerical approximation of problems with start-up singularities. In light of this, a main aim of this paper is to establish through numerical experiments whether or not hp-refinement, utilizing the derived a posteriori error estimator, can lead to exponential convergence towards the blow-up time for the case where (1.1) exhibits finite time blow-up.

1.1 Outline

The remainder of our article is organized as follows. In Sect. 2, we introduce the hp-cG and hp-dG time stepping schemes while in Sect. 3 we develop a posteriori error bounds for these schemes. We design h as well as hp version adaptive algorithms to approximate the blow-up time in Sect. 4 before applying them to some numerical experiments in Sect. 5. Finally, we draw conclusions and comment on possible future research in Sect. 6.

1.2 Notation

Let H denote a real Hilbert space with inner product \((\cdot ,\cdot )_H\) and induced norm \(\Vert \cdot \Vert _H\) as before. Given an interval \(I=(a,b)\), the Bochner space \(C(\bar{I};H)\) consists of all functions \(u:\bar{I}\rightarrow H\) that are continuous on \(\bar{I}\) with values in H. Moreover, for \(1\le p\le \infty \), we define the norm
$$\begin{aligned} \Vert u\Vert _{L^p(I;H)}:={\left\{ \begin{array}{ll}\displaystyle \left( \int _I\Vert u(t)\Vert _H^p\,\text {d}t\right) ^{{1/p}} \qquad &{} \text {if } 1\le p<\infty \\ \displaystyle \mathop {\hbox {ess sup}}\limits _{t\in I}\Vert u(t)\Vert _H \qquad &{} \text {if } p=\infty \end{array}\right. }, \end{aligned}$$
and we let \(L^p(I;H)\) be the space of all measurable functions \(u:I\rightarrow H\) such that the corresponding norm is bounded. Note that \(L^2(I;H)\) is a Hilbert space with inner product and norm given by
$$\begin{aligned} (u,v)_{L^2(I;H)}:=\int _I(u(t),v(t))_H\,\text {d}t,\qquad \text {and}\qquad \Vert u\Vert _{L^2(I;H)}:= \left( \int _I\Vert u(t)\Vert ^2_H\,\text {d}t\right) ^{{1/2}}, \end{aligned}$$
respectively.

2 Galerkin Time Discretizations

In this section, we briefly recall the hp-cG and hp-dG time stepping schemes for the discretization of (1.1). To this end, on the open interval \(I=(0,T)\), we introduce a set of time nodes, \(0 := t_0< t_1< \cdots< t_{M-1} < t_M := T\), which define a time partition \(\mathcal M:=\{I_m\}_{m=1}^M\) of I into M open time intervals \(I_m:=(t_{m-1},t_m)\), \(m=1,\ldots ,M\). The length \(k_m := t_m - t_{m-1}\) (which may be variable) of the time interval \(I_m\) is called the time step length. Furthermore, to each interval \(I_m\) we associate a polynomial degree \(r_m \in \mathbb {N}_0\) which takes the role of a local approximation order. Then, given a (real) Hilbert space \(X\subseteq H\) and some \(r\in \mathbb {N}_0\), the set
$$\begin{aligned} \mathcal {P}^{r}(J;X):=\left\{ p\in C(\bar{J};X):\,p(t)=\sum _{i=0}^rx_it^i,\, x_i\in X\right\} , \end{aligned}$$
signifies the space of all polynomials of degree at most r on an interval \(J\subset \mathbb {R}\) with values in X.
In practical computations, the Hilbert space H on which (1.1) is based will typically be replaced by a finite-dimensional subspace \(H_m\subset H\) on each interval \(I_m\), \(1\le m\le M\). In this context, it is useful to define the H-orthogonal projection  \(\pi _m:\,H\rightarrow H_m\) given by
$$\begin{aligned} x\mapsto \pi _m x,\qquad (x-\pi _mx,y)_H=0\qquad \forall y\in H_m. \end{aligned}$$
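As a concrete illustration (with H and \(H_m\) chosen purely for the sketch), take \(H=\mathbb {R}^3\) with the Euclidean inner product and \(H_m\) spanned by two basis vectors; the defining orthogonality condition then reduces to the least-squares normal equations:

```python
import numpy as np

# Sketch of the H-orthogonal projection pi_m : H -> H_m for the illustrative
# choice H = R^3 with the Euclidean inner product and H_m = span of the
# columns of B.  The defining condition (x - pi_m x, y)_H = 0 for all y in
# H_m is exactly the normal equation B^T (x - B c) = 0.

def orthogonal_projection(B, x):
    """Project x onto the column span of B in the Euclidean inner product."""
    c = np.linalg.solve(B.T @ B, B.T @ x)   # normal equations
    return B @ c

B = np.array([[1.0, 0.0],
              [0.0, 1.0],
              [0.0, 0.0]])                  # H_m = the x-y plane inside R^3
x = np.array([1.0, 2.0, 3.0])
px = orthogonal_projection(B, x)

# The residual x - pi_m x is H-orthogonal to every basis vector of H_m.
assert np.allclose(B.T @ (x - px), 0.0)
```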

2.1 hp-cG Time Stepping

With the above definitions, the (fully-discrete) hp-cG time marching scheme is iteratively given as follows: For \(1\le m\le M\), we seek \(U|_{I_m}\in \mathcal {P}^{r_m}(I_m;H_m)\) such that
$$\begin{aligned} \begin{aligned} \int _{I_m} \bigg (\frac{\,\text {d}U}{\,\text {d}t},V\bigg )_H\,\text {d}t&=\int _{I_m} (\mathcal F(t,U),V)_H \,\text {d}t \qquad \forall V\in \mathcal {P}^{r_m-1}(I_m;H_m), \end{aligned} \end{aligned}$$
(2.1)
with the initial condition \(U(t_{0}) = \pi _1 u_0\) for \(m=1\), and \(U(t_{m-1}) = \pi _m U(t_{m-1}^-)\) for \(m \ge 2\); here, the one-sided limits of a piecewise continuous function U at each time node \(t_m\) are given by
$$\begin{aligned} U(t_m^+) :=\lim _{s\searrow 0} U(t_m+s), \qquad \qquad U(t_m^-) :=\lim _{s\searrow 0} U(t_m-s). \end{aligned}$$
Note that in order to enforce the initial condition on each time step, the local trial space has one degree of freedom more than the local test space in the hp-cG scheme. Furthermore, if \(H_1=\cdots =H_M\), we remark that the hp-cG solution U is globally continuous on [0, T]. Finally, we observe that if \(r_m = 1\) then (2.1) in fact yields the well known Crank-Nicolson method provided that the midpoint quadrature rule is applied to the right-hand side of (2.1).
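For a scalar ODE, a single cG(1) step with midpoint quadrature can be sketched as follows; the fixed-point solver and the test problem are illustrative choices, not part of the scheme's definition:

```python
import math

# Hedged sketch of one cG(1) step of (2.1) for a scalar ODE u' = F(t, u).
# With r_m = 1 the trial function is linear on I_m and the test functions
# are constants, so (2.1) reads U_m - U_{m-1} = integral of F(t, U) over I_m.
# Midpoint quadrature, together with the fact that a linear U equals the
# average of its endpoint values at the midpoint, gives the implicit
# midpoint / Crank-Nicolson update solved below by fixed-point iteration.

def cg1_step(F, t0, U0, k, n_iter=50):
    """Advance U from t0 to t0 + k with the midpoint-quadrature cG(1) step."""
    tm = t0 + 0.5 * k               # interval midpoint
    U1 = U0                         # initial guess for the new value
    for _ in range(n_iter):
        U1 = U0 + k * F(tm, 0.5 * (U0 + U1))
    return U1

# Sanity check on u' = -u, u(0) = 1 (exact solution exp(-t)).
U, t, k = 1.0, 0.0, 0.01
for _ in range(100):
    U = cg1_step(lambda t, u: -u, t, U, k)
    t += k
assert abs(U - math.exp(-1.0)) < 1e-5
```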
The strong form of (2.1) on \({I}_m\) is
$$\begin{aligned} \begin{aligned} U'(t) = \Pi ^{r_m - 1}_m \mathcal F(t,{U}(t)), \end{aligned} \end{aligned}$$
(2.2)
where \(\Pi ^{r_m - 1}_m\) denotes the \(L^2\)-projection operator onto the space \(\mathcal {P}^{r_m-1}(I_m;H_m)\); see [12] for details. Whilst the strong form (2.2) can be exploited for the purpose of deriving a posteriori error estimates, it is well known that employing such a straightforward approach leads to suboptimal error estimates, cf. [2]. This issue will be addressed in the derivation of our error bound.

2.2 hp-dG Time Stepping

The (fully-discrete) hp-dG time marching scheme is iteratively given as follows: For  \(1\le m\le M\), we seek \(U|_{I_m} \in \mathcal {P}^{r_m}(I_m;H_m)\) such that
$$\begin{aligned} \begin{aligned} \int _{I_m} \bigg (\frac{\,\text {d}U}{\,\text {d}t},V\bigg )_H \,\text {d}t&+ \left( {[[U]]_{m-1}}, V(t^+_{m-1})\right) _H = \int _{I_m}(\mathcal F(t,U),V)_H \,\text {d}t\quad \forall V\in \mathcal {P}^{r_m}(I_m;H_m), \end{aligned} \end{aligned}$$
(2.3)
where the discontinuity jump of U at \(t_m\), \(0\le m \le M-1\), is given by \({[[U]]_m} := U(t_m^+) - U(t_m^-)\) with \(U(t^-_0) := u_0\). We emphasize that, in contrast to the continuous Galerkin formulation, the trial and test spaces are the same for the dG scheme; this is due to the fact that the initial values are weakly imposed (by means of an upwind flux) on each time interval. Note that if \(r_m = 0\) then (2.3) is in fact just the implicit Euler method provided that the right rectangle quadrature rule is applied to the right-hand side of (2.3).
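The corresponding dG(0) step with right rectangle quadrature is, for a scalar ODE, exactly an implicit Euler update; a minimal sketch (the fixed-point solver is an illustrative choice):

```python
# Hedged sketch of one dG(0) step of (2.3) for a scalar ODE u' = F(t, u).
# With r_m = 0 the trial function is constant on I_m, the time-derivative
# term vanishes, and the jump term contributes U_m - U_{m-1}; applying the
# right rectangle rule to the right-hand side of (2.3) yields implicit Euler,
#   U_m = U_{m-1} + k_m * F(t_m, U_m),
# solved below by simple fixed-point iteration.

def dg0_step(F, t0, U0, k, n_iter=50):
    """Advance U from t0 to t0 + k with the right-rectangle dG(0) step."""
    t1 = t0 + k
    U1 = U0
    for _ in range(n_iter):
        U1 = U0 + k * F(t1, U1)
    return U1

# Implicit Euler for u' = -u gives exactly U_1 = U_0 / (1 + k).
assert abs(dg0_step(lambda t, u: -u, 0.0, 1.0, 0.1) - 1.0 / 1.1) < 1e-12
```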
In order to find the strong formulation of  (2.3), we require the use of a lifting operator. More precisely, given some real Hilbert space \(X\subseteq H\), we define \(\mathsf {L}^{r_m}_m:\,X\rightarrow \mathcal {P}^{r_m}(I_m;X)\) for \(1\le m\le M\) uniquely through
$$\begin{aligned} z\mapsto \mathsf {L}^{r_m}_m z,\qquad \int _{I_m}(\mathsf {L}^{r_m}_m z,V)_H\,\text {d}t=\left( z,V(t^+_{m-1})\right) _H\qquad \forall V\in \mathcal {P}^{r_m}(I_m;X), \end{aligned}$$
(2.4)
cf. [22, Section 4.1]. Then, in view of this definition with \(X=H_m\) and proceeding as in [12], we obtain the strong formulation of the dG time stepping method (2.3) on \({I}_m\), viz.,
$$\begin{aligned} U'(t)+\left( \mathsf {L}^{r_m}_m\pi _m{[[U]]_{m-1}}\right) (t)= \Pi ^{r_m}_m\mathcal F(t,{U}(t)), \end{aligned}$$
(2.5)
where \(\Pi ^{r_m}_m\) denotes the \(L^2\)-projection onto \(\mathcal {P}^{r_m}(I_m;H_m)\).

3 A Posteriori Error Analysis

The goal of this section is to derive \(L^\infty \) a posteriori error bounds for the hp-cG and hp-dG approximations to (1.1). To this end, we require some structural assumptions on the nonlinearity \(\mathcal F\). Specifically, \(\mathcal F:[0,T]\times H\rightarrow H\) is assumed to satisfy \(\mathcal F(\cdot ,0) \in L^1((0,T);H)\) along with the local H-Lipschitz estimate
$$\begin{aligned} \Vert \mathcal F(t,v)-\mathcal F(t,w)\Vert _H \le \mathcal {L}(t,\Vert v\Vert _H,\Vert w\Vert _H)\Vert v-w\Vert _H \qquad \forall t \in [0,T] \quad \forall v,w \in H. \end{aligned}$$
(3.1)
Here, \(\mathcal {L}:[0,T]\times \mathbb {R}^+_0 \times \mathbb {R}^{+}_0 \rightarrow \mathbb {R}^{+}_0\) is a known function that satisfies \(\mathcal {L}(\cdot ,a,b) \in {L^1(0,T)}\) for any \(a,b \in \mathbb {R}^+_0\) and that is continuous and monotone increasing in the second and third arguments. By employing a classical Picard approach, it can be shown under these assumptions on \(\mathcal F\) that problem (1.1) admits a unique (local) solution \(u \in C([0,T];H)\) for \(T>0\) sufficiently small.

3.1 Preliminary Error Bound

In order to remedy the suboptimal error estimates that would result from using (2.2) and (2.5) directly, we follow the approach proposed in  [2] by introducing the reconstruction \({\widehat{U}}_m\) of \(U |_{I_{m}}\) which is defined over each closed interval \(\bar{I}_m\), \(1 \le m \le M\), by
$$\begin{aligned} \begin{aligned} {\widehat{U}}_m(t) :=\pi _m U(t^-_{m-1}) + \int _{t_{m-1}}^t \Pi _m^{r_m} \mathcal F(s,U) \,\text {d}s. \end{aligned} \end{aligned}$$
(3.2)
This formulation of the reconstruction \({\widehat{U}}_m\) is the same as that introduced in [2] for the cG time stepping scheme, and in [17] for the dG time stepping scheme, but with the initial value projected onto the space \(H_m\) for convenience; it was also used in [1, 16] for the Crank-Nicolson method (i.e., the cG method with \(r_m=1\)). For \(t \in I_m\), \(1 \le m \le M\), we define the error \(e(t) := u(t) - U(t) \in H\) where U is either the hp-cG solution from (2.1) or the hp-dG solution from (2.3). Since we will be dealing with the reconstruction \({\widehat{U}}_m\), it will also be necessary to introduce the reconstruction error given by \(\widehat{e}_m(t) := u(t) -{\widehat{U}}_m(t) \in H\), \(t \in \bar{I}_m\), \(1 \le m \le M\). We will proceed with the error analysis by first proving an \(L^{\infty }\)-error bound for \(\widehat{e}_m\).
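For a scalar ODE, the reconstruction (3.2) can be computed explicitly by realizing \(\Pi _m^{r_m}\) in a Legendre basis; the following sketch (with an arbitrary illustrative profile for U and an arbitrary degree) shows one way to do this:

```python
import numpy as np
from numpy.polynomial import legendre as L

# Hedged sketch of the reconstruction (3.2) on one interval I_m = (t0, t1)
# for a scalar ODE.  Pi_m^r F(s, U) is realized as the L^2-projection of
# s -> F(s, U(s)) onto polynomials of degree r (Legendre basis, Gauss
# quadrature), and the reconstruction is its antiderivative started at U(t0).

def l2_project(f, t0, t1, r, n_quad=20):
    """L^2(t0,t1)-projection of f onto polynomials of degree <= r, returned
    as Legendre coefficients on the reference interval [-1, 1]."""
    x, w = L.leggauss(n_quad)                     # Gauss-Legendre nodes/weights
    t = 0.5 * (t1 - t0) * x + 0.5 * (t0 + t1)     # map nodes to (t0, t1)
    coeffs = np.zeros(r + 1)
    for j in range(r + 1):
        Pj = L.legval(x, np.eye(r + 1)[j])        # j-th Legendre polynomial
        # (f, P_j) / (P_j, P_j) with (P_j, P_j) = 2 / (2j + 1) on [-1, 1]
        coeffs[j] = np.dot(w, f(t) * Pj) * (2 * j + 1) / 2.0
    return coeffs

def reconstruction(F, U, t0, t1, r):
    """Return Uhat(t) = U(t0) + int_{t0}^t Pi^r F(s, U(s)) ds as a callable."""
    c = l2_project(lambda s: F(s, U(s)), t0, t1, r)
    cint = L.legint(c)                            # antiderivative on [-1, 1]
    def Uhat(t):
        x = 2.0 * (t - t0) / (t1 - t0) - 1.0
        scale = 0.5 * (t1 - t0)                   # ds = scale * dx
        return U(t0) + scale * (L.legval(x, cint) - L.legval(-1.0, cint))
    return Uhat

# By construction the reconstruction matches U at the left endpoint.
F = lambda t, u: u * u
U = lambda t: 1.0 / (1.0 - t)                     # illustrative profile only
Uhat = reconstruction(F, U, 0.0, 0.5, r=3)
assert abs(Uhat(0.0) - U(0.0)) < 1e-12
```

From such a callable, the residual estimator \(\eta ^{{\mathtt {res}}}_m\) could then be approximated by sampling \(\mathcal {R}\) from (3.5) over \(I_m\).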
To formulate the error equation, we begin by substituting (1.1) into the definition of \(\widehat{e}_m\), viz.,
$$\begin{aligned} \begin{aligned} \widehat{e}_m(t) = u(t_{m-1}) - {\widehat{U}}_m(t) + \int _{t_{m-1}}^t \mathcal F(s,u) \,\text {d}s. \end{aligned} \end{aligned}$$
(3.3)
Adding and subtracting additional terms yields
$$\begin{aligned} \begin{aligned} \widehat{e}_m(t) = \widehat{e}_m(t_{m-1}) + \mathcal {R}(t) + \int _{t_{m-1}}^t \mathcal F(s,u) -\mathcal F(s,{\widehat{U}}_m) \,\text {d}s, \end{aligned} \end{aligned}$$
(3.4)
where \(\mathcal {R}\) denotes the residual given by
$$\begin{aligned} \begin{aligned} \mathcal {R}(t) := {\widehat{U}}_{m}(t_{m-1})- {\widehat{U}}_m(t)+ \int _{t_{m-1}}^t \mathcal F(s,{\widehat{U}}_m) \,\text {d}s, \qquad t \in {I}_m. \end{aligned} \end{aligned}$$
(3.5)
Using the triangle inequality and Bochner’s Theorem implies that
$$\begin{aligned} \begin{aligned} \Vert {\widehat{e}}_m(t)\Vert _H \le \Vert \widehat{e}_m(t_{m-1})\Vert _H+ \Vert \mathcal {R}(t)\Vert _H+\int _{t_{m-1}}^t \Vert \mathcal F(s,u) - \mathcal F(s,{\widehat{U}}_m)\Vert _H \,\text {d}s. \end{aligned} \end{aligned}$$
(3.6)
Moreover, applying the local H-Lipschitz estimate (3.1) together with the monotonicity of \(\mathcal {L}\) yields
$$\begin{aligned} \begin{aligned} \Vert {\widehat{e}}_m(t)\Vert _H \le \Vert \widehat{e}_m(t_{m-1})\Vert _H+ \eta ^{{\mathtt {res}}}_m+\int _{t_{m-1}}^t \mathcal {L}(s,\Vert \widehat{e}_m\Vert _H+\Vert {\widehat{U}}_m\Vert _H,\Vert \widehat{U}_m\Vert _H) \Vert \widehat{e}_m\Vert _H \,\text {d}s, \end{aligned} \end{aligned}$$
(3.7)
for \(t \in \bar{I}_m\). Here, \(\displaystyle \eta ^{{\mathtt {res}}}_m\) denotes the residual estimator given by \(\displaystyle \eta ^{{\mathtt {res}}}_m := \Vert \mathcal {R}\Vert _{L^{\infty }(I_m;H)}\).
On the first interval, recalling that \(U(t_0^-)=u_0=u(t_0)\), we can estimate \(\Vert \widehat{e}_m(t_{m-1})\Vert _H\) directly by \(\Vert \widehat{e}_1(t_0)\Vert _H = \eta _0^{{\mathtt {proj}}}\) where the projection estimator \(\eta ^{{\mathtt {proj}}}_m\) is given by
$$\begin{aligned} \eta ^{{\mathtt {proj}}}_m := \left\| U(t_m^-) - \pi _{m+1}U(t_{m}^-)\right\| _H, \qquad m\ge 0. \end{aligned}$$
(3.8)
For later intervals, the unknown term \(\Vert \widehat{e}_m(t_{m-1})\Vert _H\) needs to be replaced with the known term \(\Vert \widehat{e}_{m-1}(t_{m-1})\Vert _H\). To this end, for \(m \ge 1\), we have
$$\begin{aligned} \begin{aligned} \widehat{e}_{m+1}(t_{m}) - \widehat{e}_{m}(t_{m})&= {\widehat{U}}_{m}(t_{m}) - {\widehat{U}}_{m+1}(t_{m}) \\&= \pi _{m}U(t_{m-1}^-) - \pi _{m+1}U(t_{m}^-) + \int _{I_m} \Pi _{m}^{r_{m}} \mathcal F(s,U) \,\text {d}s. \end{aligned} \end{aligned}$$
(3.9)
Recall that the hp-cG method (2.1) satisfies the strong form (2.2) on \(I_m\). Thus, we have that
$$\begin{aligned} \begin{aligned} {U}(t^-_{m}) =\pi _{m} U(t^-_{m-1}) + \int _{I_m} \Pi _{m}^{r_{m}-1} \mathcal F(s,U) \,\text {d}s. \end{aligned} \end{aligned}$$
(3.10)
By the definition of the \(L^2\)-projection an equivalent formulation of the above is
$$\begin{aligned} \begin{aligned} {U}(t^-_{m}) =\pi _{m} U(t^-_{m-1}) + \int _{I_m} \Pi _{m}^{r_{m}} \mathcal F(s,U) \,\text {d}s. \end{aligned} \end{aligned}$$
(3.11)
Substituting this into (3.9) implies that for the hp-cG method we have
$$\begin{aligned} \begin{aligned} \widehat{e}_{m+1}(t_{m}) - \widehat{e}_{m}(t_{m}) = U(t_m^-) - \pi _{m+1}U(t_{m}^-). \end{aligned} \end{aligned}$$
(3.12)
For the hp-dG method (2.3), we have the strong form (2.5) on \(I_m\). Thus, it follows that
$$\begin{aligned} \begin{aligned} {U}(t^-_{m}) =U(t^+_{m-1}) + \int _{I_m} \Pi _{m}^{r_{m}} \mathcal F(s,U) \,\text {d}s - \int _{I_m} \left( \mathsf {L}^{r_m}_m\pi _m{[[U]]_{m-1}}\right) (s) \,\text {d}s. \end{aligned} \end{aligned}$$
(3.13)
By definition of the lifting operator \(\mathsf {L}^{r_m}_m\) from (2.4) we arrive at
$$\begin{aligned} \begin{aligned} {U}(t^-_{m}) =U(t^+_{m-1})- \pi _m{[[U]]_{m-1}} + \int _{I_m} \Pi _{m}^{r_{m}} \mathcal F(s,U) \,\text {d}s. \end{aligned} \end{aligned}$$
(3.14)
Equivalently,
$$\begin{aligned} \begin{aligned} {U}(t^-_{m}) = \pi _m U(t^-_{m-1}) + \int _{I_m} \Pi _{m}^{r_{m}} \mathcal F(s,U) \,\text {d}s. \end{aligned} \end{aligned}$$
(3.15)
Since this coincides with the form (3.11) obtained for the hp-cG method, we also have for the hp-dG method that
$$\begin{aligned} \begin{aligned} \widehat{e}_{m+1}(t_{m}) - \widehat{e}_{m}(t_{m}) = U(t_m^-) - \pi _{m+1}U(t_{m}^-). \end{aligned} \end{aligned}$$
(3.16)
Applying the triangle inequality to (3.12) and (3.16) yields
$$\begin{aligned} \begin{aligned} \Vert \widehat{e}_{m+1}(t_{m})\Vert _H \le \Vert \widehat{e}_{m}(t_{m})\Vert _H + \eta ^{{\mathtt {proj}}}_{m}, \end{aligned} \end{aligned}$$
(3.17)
with \(\eta ^{{\mathtt {proj}}}_m\) from (3.8). Substituting this result into (3.7) gives
$$\begin{aligned} \begin{aligned} \Vert {\widehat{e}}_m(t)\Vert _H \le \psi _m+\int _{t_{m-1}}^t \mathcal {L}\left( s,\Vert \widehat{e}_m\Vert _H+\Vert {\widehat{U}}_m\Vert _H,\Vert \widehat{U}_m\Vert _H\right) \Vert \widehat{e}_m\Vert _H \,\text {d}s, \end{aligned} \end{aligned}$$
(3.18)
where \(\psi _m\) is chosen such that
$$\begin{aligned} \psi _m \ge {\left\{ \begin{array}{ll}\displaystyle \eta ^{{\mathtt {proj}}}_{m-1}+ \eta ^{{\mathtt {res}}}_m \qquad &{} \text {if } m = 1 \\ \displaystyle \Vert \widehat{e}_{m-1}(t_{m-1})\Vert _H + \eta ^{{\mathtt {proj}}}_{m-1}+ \eta ^{{\mathtt {res}}}_m \qquad &{} \text {if } m \ne 1 \end{array}\right. }. \end{aligned}$$
(3.19)
Finally, applying Gronwall’s inequality to (3.18) for \(t \in \bar{I}_m\), \(1 \le m \le M\), yields the following result.
Proposition 3.1
For the cG and dG time stepping schemes (2.1) and (2.3), respectively, there holds the error bound
$$\begin{aligned} \begin{aligned} \Vert \widehat{e}_m(t)\Vert _H&\le G_m(t)\psi _m,\qquad t \in \bar{I}_m,\quad 1 \le m \le M, \end{aligned} \end{aligned}$$
(3.20)
where  \(\psi _m\) satisfies (3.19) and \(G_m\) is given by
$$\begin{aligned} \begin{aligned} G_m(t) := \exp \bigg (\int _{t_{m-1}}^{t} \mathcal {L}(s,\Vert \widehat{e}_m\Vert _H+\Vert {\widehat{U}}_m\Vert _H,\Vert \widehat{U}_m\Vert _H) \,\text {d}s \bigg ). \end{aligned} \end{aligned}$$
(3.21)

3.2 Continuation Argument

In order to transform (3.20) into an a posteriori error bound, we will employ a continuation argument, cf., e.g., [5]. To this end, for \(1\le m\le M\), we define the set
$$\begin{aligned} \mathcal {I}_m:= \left\{ t\in \bar{I}_m:\,\Vert \widehat{e}_m\Vert _{C([t_{m-1},t];H)}\le \delta \psi _m\right\} , \end{aligned}$$
(3.22)
where \(\delta > 1\) is a parameter to be chosen. Note that \(\mathcal {I}_m\) is closed since \(\widehat{e}_m\) is time-continuous and, obviously, bounded. Moreover, \(\mathcal {I}_m\) is assumed to be non-empty; this is certainly true for the first interval since \(0 \in \mathcal {I}_1\) and is a posteriori verifiable for later intervals. To state the full error bound, we require some additional definitions. Specifically, define the function \(\varphi _m:[1,\infty ) \rightarrow \mathbb {R}\) by
$$\begin{aligned} \varphi _m(\delta ):=\exp \left( \int _{I_m} \mathcal {L}(s,\delta \psi _m+\Vert {\widehat{U}}_m\Vert _H,\Vert \widehat{U}_m\Vert _H) \,\text {d}s\right) -\delta . \end{aligned}$$
(3.23)
Additionally, if it exists, we introduce
$$\begin{aligned} \delta _m := \inf \{\delta > 1 : \varphi _m(\delta ) < 0\} \in [1, \infty ), \end{aligned}$$
(3.24)
for \(m \ge 1\) and we let \(\delta _0 := 1\) for convenience.
We are now ready to establish the following conditional a posteriori error bound for both Galerkin time stepping methods.
Theorem 3.1
For any \(m \ge 1\), if \(\delta _1, \ldots , \delta _{m}\) exist then the hp-cG scheme (2.1) and the hp-dG scheme (2.3) satisfy the a posteriori error bound
$$\begin{aligned} \Vert \widehat{e}_m\Vert _{L^\infty (I_m;H)}\le \delta _m\psi _m, \end{aligned}$$
(3.25)
on \(I_m\).
Proof
We omit the trivial case \(\psi _m=0\). Since \(\delta _m\) exists, there is some \(\delta > 1\) with \(\varphi _m(\delta )<0\). Suppose for a moment that the existence of \(\widehat{e}_m(t_{m-1})\) is taken for granted; then \(t_{m-1} \in \mathcal {I}_m\), so \(\mathcal {I}_m\) is non-empty, closed and bounded and thus has a maximal time \(t^{\star }\). Assume first that \(t^{\star } < t_m\) and work towards a contradiction. Substituting the error bound from \(\mathcal {I}_m\) into (3.20) and using the monotonicity of \(\mathcal {L}\) implies that
$$\begin{aligned} \begin{aligned} \Vert \widehat{e}_m\Vert _{C([t_{m-1},t^{\star }];H)}&\le \exp \bigg (\int _{I_m} \mathcal {L}(s,\delta \psi _m+\Vert {\widehat{U}}_m\Vert _H,\Vert \widehat{U}_m\Vert _H) \,\text {d}s \bigg )\psi _m\\&=\left( \varphi _m(\delta )+\delta \right) \psi _m<\delta \psi _m. \end{aligned} \end{aligned}$$
This strict inequality, together with the time-continuity of \(\widehat{e}_m\), implies that \(\mathcal {I}_m\) contains points to the right of \(t^{\star }\), contradicting the assumption that \(t^{\star }<t_m\) is the maximal time of \(\mathcal {I}_m\). Hence, \(t^{\star }=t_m\) and it follows that \(\Vert \widehat{e}_m\Vert _{L^\infty (I_m;H)} \le \delta \psi _m\). Taking the limit \(\delta \rightarrow \delta _m\), we deduce (3.25). To complete the proof, note that, by recursion, \(\widehat{e}_m(t_{m-1})\) exists provided that \(\delta _0, \ldots , \delta _{m-1}\) exist and \(0 \in \mathcal {I}_1\). Since \(0 \in \mathcal {I}_1\) trivially and \(\delta _0, \ldots , \delta _{m-1}\) exist by premise, the temporary assumption that \(\widehat{e}_m(t_{m-1})\) exists can be removed. \(\square \)
An important question arises with regard to Theorem 3.1—is it possible that \(\delta _m\) never exists for certain nonlinearities \(\mathcal F\)? The following lemma provides an answer to this question.
Lemma 3.1
For any \(m \ge 1\), if \(\delta _0, \ldots , \delta _{m-1}\) exist and the time step length \(k_m>0\) is chosen sufficiently small then the set \(\{\delta >1:\,\varphi _m(\delta )<0\}\) is non-empty, \(\delta _m\ge 1\) from (3.24) exists and \(\varphi _m(\delta _m)=0\).
Proof
Since \(\delta _0, \ldots , \delta _{m-1}\) exist, Theorem 3.1 implies that \(\psi _m\) exists and thus \(\varphi _m\) is well-defined. For fixed \(\delta ^\star >1\) and upon setting \(\epsilon :={1/2}(\delta ^\star -1)>0\), we can choose \(k_m>0\) small enough so that
$$\begin{aligned} \exp \left( \int _{I_m} \mathcal {L}(s,\delta ^{\star } \psi _m+\Vert {\widehat{U}}_m\Vert _H,\Vert \widehat{U}_m\Vert _H) \,\text {d}s\right) <1+\epsilon . \end{aligned}$$
A quick calculation reveals that \(\varphi _m(\delta ^\star )<1+\epsilon -\delta ^\star ={1/2}(1-\delta ^\star )<0\). Therefore, for \(k_m>0\) sufficiently small, the set \(\{\delta >1:\,\varphi _m(\delta )<0\}\) is non-empty. Furthermore, since \(\varphi _m\) is continuous and \(\varphi _m(1) \ge 0\), it follows that \(\delta _m\) exists and satisfies \(\varphi _m(\delta _m) = 0\). \(\square \)
In some sense, \({\widehat{U}}_m\) is a better approximation to \(u |_{I_{m}}\) than \(U |_{I_{m}}\); from a practical standpoint, it is therefore often best to use Theorem 3.1 directly. For some applications, however, a bound on the error rather than on the reconstruction error may be required, so we state the following corollary.
Corollary 3.1
For any \(m \ge 1\), if \(\delta _1, \ldots , \delta _{m}\) exist then the hp-cG scheme (2.1) and the hp-dG scheme (2.3) satisfy the a posteriori error bound
$$\begin{aligned} \Vert {e}\Vert _{L^\infty (I_m;H)}\le \delta _m\psi _m + \Vert U - {\widehat{U}}_m\Vert _{L^{\infty }(I_m;H)}, \end{aligned}$$
(3.26)
on \(I_m\).
Proof
The bound follows directly from rewriting the error, viz., \(e = \widehat{e}_m + {\widehat{U}}_m - U\), the triangle inequality and the reconstruction error bound of Theorem 3.1. \(\square \)

3.3 Computable Error Bound

In order to yield fully computable error bounds we must give an explicit characterization of \(\psi _m\) from (3.19). Theorem 3.1 implies that
$$\begin{aligned} \Vert \widehat{e}_{m-1}(t_{m-1})\Vert _H \le \Vert \widehat{e}_{m-1}\Vert _{L^{\infty }(I_{m-1};H)} \le \delta _{m-1}\psi _{m-1}. \end{aligned}$$
(3.27)
Thus, we can define \(\psi _m\) by
$$\begin{aligned} \psi _m := {\left\{ \begin{array}{ll}\displaystyle \eta ^{{\mathtt {proj}}}_{m-1}+ \eta ^{{\mathtt {res}}}_m \qquad &{} \text {if } m = 1 \\ \displaystyle \delta _{m-1}\psi _{m-1}+ \eta ^{{\mathtt {proj}}}_{m-1}+ \eta ^{{\mathtt {res}}}_m \qquad &{} \text {if } m \ne 1 \end{array}\right. }. \end{aligned}$$
(3.28)
Defining \(\psi _m\) in this way yields a recursive error estimator and makes the error bounds of Theorem 3.1 and Corollary 3.1 fully computable.
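The recursion (3.28) is straightforward to realize in code; the following sketch (with purely illustrative input numbers, and all estimator components assumed precomputed) accumulates the bounds of Theorem 3.1 across intervals:

```python
# Hedged sketch of the recursive estimator (3.28): given per-interval
# residual estimators eta_res[m], projection estimators eta_proj[m]
# (eta_proj[m] plays the role of eta_proj_{m-1}) and growth factors
# delta[m] (all assumed precomputed), accumulate psi_m and the
# reconstruction-error bound delta_m * psi_m of Theorem 3.1.

def accumulate_estimator(eta_res, eta_proj, delta):
    """Return the list of error bounds delta_m * psi_m, m = 1, ..., M."""
    bounds, psi = [], 0.0
    for m in range(len(eta_res)):
        # psi_m = delta_{m-1} psi_{m-1} + eta_proj_{m-1} + eta_res_m,
        # with the first-interval convention delta_0 psi_0 = 0.
        psi = psi + eta_proj[m] + eta_res[m]
        bound = delta[m] * psi          # Theorem 3.1 on I_m
        bounds.append(bound)
        psi = bound                     # becomes delta_m psi_m for I_{m+1}
    return bounds

# Illustrative numbers only: three intervals with unit growth on the first.
print(accumulate_estimator([1e-3, 2e-3, 4e-3], [0.0, 1e-4, 1e-4], [1.0, 1.1, 1.2]))
```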
In order to develop adaptive algorithms that can approximate the blow-up time of a nonlinear initial value problem, it is important to interpret the role that \(\delta _m\) plays in the error bound of Theorem 3.1. Recalling the bound and applying (3.28), we see that the reconstruction error on \(I_m\) satisfies
$$\begin{aligned} \begin{aligned} \Vert \widehat{e}_m\Vert _{L^{\infty }(I_m;H)} \le \delta _m \left( \delta _{m-1}\psi _{m-1}+ \eta ^{{\mathtt {proj}}}_{m-1}+ \eta ^{{\mathtt {res}}}_m\right) . \end{aligned} \end{aligned}$$
(3.29)
Of the three components that make up the error estimator, the term \(\delta _{m-1}\psi _{m-1}\) is a bound for the error on the previous time step while \(\displaystyle \eta ^{{\mathtt {proj}}}_{m-1}+ \eta ^{{\mathtt {res}}}_m\) represents the local (additive) contribution to the error estimator on the current time step. The correct interpretation of \(\delta _m\), then, is that it is an a posteriori approximation to the growth rate of the error on \(I_m\). Indeed, the following corollary shows that \(\delta _m\) is precisely the expected local growth rate when \(\mathcal F\) is globally H-Lipschitz.
Corollary 3.2
Suppose that \(\mathcal F\) is globally H-Lipschitz with constant \(\mathcal {L} \equiv L \ge 0\). Then \(\delta _m = \mathrm {e}^{Lk_m}\) and, thus, the error bound of Theorem 3.1 holds unconditionally on \(I_m\), viz.,
$$\begin{aligned} \Vert \widehat{e}_m\Vert _{L^\infty (I_m;H)}\le \mathrm {e}^{Lk_m}\psi _m. \end{aligned}$$
(3.30)
Proof
From the definition of the function \(\varphi _m\), it follows that \(\varphi _m(\delta ) = \mathrm {e}^{L k_m} - \delta \). Therefore, \(\displaystyle \delta _m = \inf \{\delta > 1 : \mathrm {e}^{L k_m} - \delta < 0 \} = \mathrm {e}^{L k_m}\). Since \(\delta _m\) exists and is finite for any \(k_m\), we conclude that the error bound of Theorem 3.1 holds unconditionally on \(I_m\). \(\square \)

4 Adaptive Algorithms

The error estimators derived in the previous section will form the basis of an adaptive strategy to estimate the blow-up time of (1.1). In particular, we will consider both an h-adaptive approach and an hp-adaptive approach. For the remainder of this section, we assume that \(H_1 = \cdots = H_M\) for simplicity but remark that both adaptive algorithms can be easily modified to account for variable finite-dimensional subspaces.

4.1 An h-Adaptive Approach

The first difficulty encountered in the construction of an h-adaptive algorithm is that both (2.1) and (2.3) are implicit methods, which means that the existence of a numerical solution is not guaranteed. It is tempting to conduct an existence analysis such as in [12] to obtain bounds on the length of the time steps needed in order to yield a numerical solution; however, such bounds are inherently pessimistic. Since existence can be determined a posteriori, our h-adaptive algorithm simply reduces the time step length until a numerical solution exists.
The second difficulty is how to approximate \(\delta _m\); unfortunately, it is impossible to give a precise characterization of how to do this for arbitrary \(\mathcal F\), primarily because \(\mathcal F\) may be ‘pathological’. Fortunately, however, most nonlinearities of interest do not fall into this category. Suppose that \(\mathcal F\) is chosen such that \(\varphi _m\) has at most a small finite number of zeros; then it should be possible to approximate the zeros of \(\varphi _m\) numerically. Since \(\delta _m\) satisfies \(\varphi _m(\delta _m) = 0\), we then only need to check the roots of \(\varphi _m\) and verify that one of our numerical approximations, \(\displaystyle \tilde{\delta }_m\), satisfies \({\displaystyle \varphi _m(\tilde{\delta }_m) < 0}\). In our numerical experiments, we employ a Newton method to find \(\tilde{\delta }_m\) with an initial guess close to one on \(I_1\) (the proof of Lemma 3.1 implies that \(\delta _m \approx 1\) for \(k_m \approx 0\)) and an initial guess close to \({\delta }_{m-1}\) on \(I_{m}\) for \(m \ge 2\); this approach works well for the studied problems.
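For the model nonlinearity \(\mathcal F(u)=u^2\), where \(|\mathcal F(v)-\mathcal F(w)| \le (|v|+|w|)|v-w|\) gives \(\mathcal {L}(t,a,b)=a+b\), such a Newton iteration can be sketched as follows; the constant approximation of \(\Vert {\widehat{U}}_m\Vert _H\) on \(I_m\) is an assumption made purely for the sketch:

```python
import math

# Hedged sketch of the delta_m computation for F(u) = u^2, i.e.
# L(t, a, b) = a + b.  Assuming (for illustration only) that |Uhat_m| is
# approximately a constant c on I_m, (3.23) reduces to
#   phi(delta) = exp(k * (delta * psi + 2 * c)) - delta,
# whose smallest root above 1 is located by a Newton iteration started
# near 1, as described above.

def find_delta(k, psi, c, guess=1.0 + 1e-8, n_iter=100):
    """Newton iteration for the smallest root of phi(delta) = 0 above 1."""
    d = guess
    for _ in range(n_iter):
        phi = math.exp(k * (d * psi + 2.0 * c)) - d
        dphi = k * psi * math.exp(k * (d * psi + 2.0 * c)) - 1.0
        d = d - phi / dphi
    return d

# For a small time step the root sits just above 1, consistent with Lemma 3.1.
delta_m = find_delta(k=1e-3, psi=1e-2, c=1.0)
assert delta_m > 1.0
assert abs(math.exp(1e-3 * (delta_m * 1e-2 + 2.0)) - delta_m) < 1e-9
```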
As is standard in finite element algorithms for linear problems, the time step is halved and the numerical solution recomputed until \(\displaystyle \eta ^{{\mathtt {res}}}_m\) is below the tolerance \(\mathtt {tol}\), however, we must also account for the nonlinear scaling that enters through \(\delta _m\). The structure of the error estimator implies that the interval \(I_1\) is the most important since the residual estimator on \(I_1\) propagates through the remainder of the error estimator. Similarly, each successive subinterval \(I_m\) is less important than the previous subinterval \(I_{m-1}\) with the term \(\delta _{m-1}\) measuring the extent to which this is true. To account for this, we increase the tolerance by the factor \(\delta _m\) after computations on \(I_m\) are complete.
We then advance to the next interval using the previous time step length as a reference and continue in this way until \(\delta _m\) no longer exists; the adaptive algorithm is then terminated and it outputs the total number of degrees of freedom (DoFs) used along with the sum of all the time step lengths, T, as an estimate for \(T_{\infty }\). The pseudocode for the h-adaptive algorithm is given in Algorithm 1.

4.2 An hp-Adaptive Approach

The basic outline of the hp-adaptive algorithm will be the same as that of the h-adaptive algorithm; the only difficulty lies in how we choose locally between h-refinement and p-refinement. The theory of the hp-FEM suggests that p-refinement is superior to h-refinement if the solution is ‘smooth enough’, so we perform p-refinement if \(U |_{I_m}\) is smooth; otherwise, we do h-refinement. The pseudocode for the hp-adaptive algorithm is very similar to Algorithm 1; the difference lies in replacing the simple time step bisections in lines (8:) and (19:) by the following procedure:
In the numerical experiments, we consider only real-valued ODEs, and so we remark specifically on the estimation of smoothness in this special case. There are many ways to estimate smoothness of the numerical solution (see [18] for an overview); we choose to use a computationally simple approach from [11] based on Sobolev embeddings. Here, the basic idea is to monitor the constant in the Sobolev embedding \(H^1(I_m)\hookrightarrow L^\infty (I_m)\) in order the classify a given function as locally smooth or not. Specifically, we define the smoothness indicator \(\theta _m:\,H^1(I_m)\rightarrow [0,1]\) by
$$\begin{aligned} \begin{aligned} \theta _m[u] := {\left\{ \begin{array}{ll} \displaystyle \Vert u\Vert _{L^{\infty }(I_m)} \left( k^{-{1/2}}_m\Vert u\Vert _{L^2(I_m)} + \frac{1}{\sqrt{2}}k_m^{{1/2}}\Vert u'\Vert _{L^2(I_m)} \right) ^{-1} \qquad &{} \text {if } u \not \equiv 0,\\ \displaystyle 1&{} \text {if } u\equiv 0, \end{array}\right. } \end{aligned} \end{aligned}$$
(4.1)
which, intuitively, takes values close to one if u is smooth and values close to zero otherwise; see the reasoning of [11] for details. Following [11], we characterize \(U|_{I_m} \in \mathcal {P}^{r_m}(I_m;\mathbb {R})\) as smooth if
$$\begin{aligned} \begin{aligned} \theta _m \left[ \frac{\,\text {d}^{r_m-1} U}{\,\text {d}t^{r_m-1}} \right] \ge \theta ^{\star }; \end{aligned} \end{aligned}$$
(4.2)
here, values around \(\theta ^{\star } = 0.85\) were observed to produce the best results in our numerical experiments below.
We conclude this section with a corollary on the magnitude of the reconstruction error under either the h-adaptive algorithm or the hp-adaptive algorithm. In order to state the corollary, we require some additional notation. To that end, we denote the initial tolerance by \(\mathtt {tol}^*\) and define the a posteriori approximation to the growth rate of the error on \((0,t_m)\) by
$$\begin{aligned} \widehat{\delta }_m := \prod _{i=0}^m \delta _i. \end{aligned}$$
We are now ready to state the corollary.
Corollary 4.1
Suppose that \(H_1=\cdots =H_M\) and that \(\delta _1, \ldots , \delta _M\) exist then under the h-adaptive algorithm or the hp-adaptive algorithm, the reconstruction error satisfies
$$\begin{aligned} \begin{aligned} \max _{1 \le m \le M} \Vert \widehat{e}_m\Vert _{L^{\infty }(I_m;H)} \le M \widehat{\delta }_M \mathtt {tol}^\star . \end{aligned} \end{aligned}$$
Proof
To begin the proof, we will first prove by induction that \(\psi _m \le m\widehat{\delta }_{m-1} \mathtt {tol}^\star \). For \(I_1\), we have that \(\psi _1 = \eta ^{{\mathtt {res}}}_1 \le \mathtt {tol}^\star \), so the base case is verified. Assuming that the bound is true for \(m-1\), then recalling the definition of \(\psi _m\) from (3.28) as well as the scaling nature of the tolerances in the h-adaptive and hp-adaptive algorithms, that is, \(\eta ^{{\mathtt {res}}}_m\le \widehat{\delta }_{m-1}{} \mathtt{{tol}}^\star \), we have
$$\begin{aligned} \begin{aligned} \psi _m&= \delta _{m-1}\psi _{m-1}+ \eta ^{{\mathtt {res}}}_m \\&\le (m-1)\delta _{m-1}\widehat{\delta }_{m-2}{} \mathtt{{tol}}^\star + \widehat{\delta }_{m-1}{} \mathtt{{tol}}^\star \\&= (m-1)\widehat{\delta }_{m-1}{} \mathtt{{tol}}^\star +\widehat{\delta }_{m-1}\mathtt {tol}^\star \\&= m\widehat{\delta }_{m-1}\mathtt {tol}^\star . \end{aligned} \end{aligned}$$
Thus the stated bound holds for any \(1 \le m \le M\). Theorem 3.1 combined with this bound yields
$$\begin{aligned} \begin{aligned} \max _{1 \le m \le M} \Vert \widehat{e}_m\Vert _{L^{\infty }(I_m;H)} \le \max _{1 \le m \le M} \delta _m \psi _m \le \max _{1 \le m \le M} m \widehat{\delta }_{m} \mathtt{{tol}}^\star = M\widehat{\delta }_M \mathtt{{tol}}^\star . \end{aligned} \end{aligned}$$
This completes the proof. \(\square \)

5 Numerical Experiments

In this section, we will apply the adaptive algorithms developed in the previous section to two real-valued initial value problems. In both numerical experiments, we approximate the implicit Galerkin methods (2.1) and (2.3) with an explicit Picard-type iteration; cf. [12].

5.1 Example 1

In this numerical example, we consider (1.1) with the polynomial nonlinearity \(\mathcal F(t,u) = u^2\). Through separation of variables the exact solution is given by
$$\begin{aligned} \begin{aligned} u(t) = \frac{u_0}{1-u_0t}, \end{aligned} \end{aligned}$$
which has blow-up time given by \(T_{\infty } = u_0^{-1}\). Note that for any \(v_1, v_2 \in \mathbb {R}\), the nonlinearity \(\mathcal F\) satisfies
$$\begin{aligned} \begin{aligned} |\mathcal F(v_1)-\mathcal F(v_2)| = |v_1^2 - v_2^2 | \le |v_1 - v_2|(|v_1|+|v_2|), \end{aligned} \end{aligned}$$
(5.1)
so we set \(\mathcal {L}(|v_1|,|v_2|) = |v_1|+|v_2|\). Thus, in this case, we have
$$\begin{aligned} \begin{aligned} \varphi _m(\delta ) = \exp \bigg (k_m\delta \psi _m+2 \int _{I_m} |{\widehat{U}}_m| \,\text {d}s \bigg )-\delta \end{aligned} \end{aligned}$$
(5.2)
in (3.23).
We begin by applying the h-adaptive algorithm to (1.1) with initial condition \(u(0) = 1\) for both Galerkin time stepping methods utilizing polynomials up to and including degree \(r=4\). We reduce the tolerance and observe the rate of convergence of the blow-up time error \(|T - T_{\infty }|\). The results given in Fig. 1 show that
https://static-content.springer.com/image/art%3A10.1007%2Fs10915-017-0565-x/MediaObjects/10915_2017_565_Equ63_HTML.gif
for both Galerkin methods where r is the polynomial degree and #DoFs signifies the total number of degrees of freedom. Next, we apply the hp-adaptive algorithm to (1.1) with the same data. We reduce the tolerance and observe exponential convergence of the blow-up time error for both Galerkin methods, viz.,
https://static-content.springer.com/image/art%3A10.1007%2Fs10915-017-0565-x/MediaObjects/10915_2017_565_Equ64_HTML.gif
with some constant \(b>0\), as shown in Fig. 1.
For the hp-adaptive algorithm under the hp-cG scheme (2.1), we also observe the effectivity indices of the error estimator (with respect to the reconstruction error) given by
$$\begin{aligned} \begin{aligned} \text {Effectivity index } |_{I_m} := \frac{\delta _m \psi _m}{\displaystyle \max _{1 \le k \le m} \Vert \widehat{e}_k\Vert _{L^{\infty }(I_k)} }, \end{aligned} \end{aligned}$$
over the course of each computational run for different tolerances and record the best effectivity indices observed in a given computational run versus the inverse of the distance to the blow-up time in Fig. 2. The results show that the error estimator seems to account well for the behaviour of the reconstruction error in certain situations with an effectivity index of 70 for a blow-up problem being particularly impressive; in other situations, however, the effectivity indices observed are much larger. Such a large variation in the effectivity indices observed is actually to be expected since in order to provide an upper bound for the error in any situation, the error estimator must account for the worst possible scenario to a blow-up problem which may not actually be realised in practise.
Finally, for the hp-adaptive algorithm under the hp-cG scheme (2.1), we plot the value of \(\widehat{\delta }_m\) versus the inverse of the distance to the blow-up time over the course of the final computation and display the results in Fig. 2. If we denote the distance from the blow-up time at time \(t_m\) by \(\varepsilon _m := |t_m - T_{\infty }|\) then Fig. 2 implies the relationship
$$\begin{aligned} \begin{aligned} \widehat{\delta }_m \propto \varepsilon ^{-2}_m. \end{aligned} \end{aligned}$$
Thus from Corollary 4.1, we infer that there exists some constant \(C>0\) that is independent of the distance to the blow-up time and the maximum time step length such that
$$\begin{aligned} \begin{aligned} \max _{1 \le m \le M} \Vert \widehat{e}_m\Vert _{L^{\infty }(I_m)} \le CM\varepsilon ^{-2}_M \mathtt {tol}^{\star }. \end{aligned} \end{aligned}$$
Since \(\displaystyle \Vert u\Vert _{L^\infty (0,T)} = \varepsilon _M^{-1}\), we conclude that the error estimator blows up at a faster rate than the exact solution in this example; moreover, we infer from Fig. 2 that in the best case scenario (for the error estimator), this rate appears to be quasi-optimal in the sense that it mirrors the rate of the reconstruction error. Finally, we remark that this result gives weight to the interpretation of \(\widehat{\delta }_M\) as an a posteriori approximation to the blow-up rate of the error on (0, T) for nonlinear \(\mathcal F\).

5.2 Example 2

In this numerical example, we consider (1.1) with the exponential nonlinearity \(\mathcal F(t,u) = \mathrm {e}^u\). Through separation of variables the exact solution is given by
$$\begin{aligned} \begin{aligned} u(t) = \log \bigg (\frac{\mathrm {e}^{u_0}}{1-\mathrm {e}^{u_0}t}\bigg ), \end{aligned} \end{aligned}$$
which has blow-up time given by \(T_{\infty } = \mathrm {e}^{-u_0}\). Note that for any \(v_1, v_2 \in \mathbb {R}\), the nonlinearity \(\mathcal F\) satisfies
$$\begin{aligned} \begin{aligned} |\mathcal F(v_1)-\mathcal F(v_2)| = |\mathrm {e}^{v_1} -\mathrm {e}^{v_2}| \le \frac{1}{2}|v_1 - v_2|(\mathrm {e}^{|v_1|}+\mathrm {e}^{|v_2|}), \end{aligned} \end{aligned}$$
(5.3)
so we set \(\mathcal {L}(|v_1|,|v_2|) = {1/2}(\mathrm {e}^{|v_1|}+\mathrm {e}^{|v_2|})\). Thus for this nonlinearity, we have
$$\begin{aligned} \begin{aligned} \varphi _m(\delta ) = \exp \bigg (\frac{1}{2}(1+\mathrm {e}^{\delta \psi _m})\int _{I_m} \mathrm {e}^{|{\widehat{U}}_m|} \,\text {d}s \bigg )-\delta \end{aligned} \end{aligned}$$
(5.4)
in (3.23).
As in Example 1, we apply the h-adaptive algorithm to (1.1) with initial condition \(u(0) = 1\) for both Galerkin methods utilizing polynomials up to and including degree \(r=4\). We reduce the tolerance and observe the rate of convergence to the blow-up time. The results given in Fig. 3 show that for this example we also have that
https://static-content.springer.com/image/art%3A10.1007%2Fs10915-017-0565-x/MediaObjects/10915_2017_565_Equ65_HTML.gif
for both Galerkin methods where r is the polynomial degree. Next, we apply the hp-adaptive algorithm to (1.1) with the same data. As in Example 1, the tolerance is reduced and the results given in Fig. 3 show exponential convergence to the blow-up time, viz.,
https://static-content.springer.com/image/art%3A10.1007%2Fs10915-017-0565-x/MediaObjects/10915_2017_565_Equ66_HTML.gif
with some constant \(b>0\), for both Galerkin methods.
Additionally, for the hp-adaptive algorithm under the hp-cG scheme (2.1), we observe the effectivity indices (with respect to the reconstruction error) over the course of each computational run for different tolerances and record the best effectivity indices observed in a given computational run in Fig. 4. The range of effectivity indices observed over all computational runs is comparable to the range observed in the previous example with large variation and an effectivity index of around 60 in the best case, cf. Fig. 4.
Finally, for the hp-adaptive algorithm under the hp-cG scheme (2.1), we plot the value of \(\widehat{\delta }_m\) over the course of the final computation with the results displayed in Fig. 4. With the notation from the previous example, we deduce that
$$\begin{aligned} \begin{aligned} {\widehat{\delta }}_m \propto \varepsilon ^{-1}_m, \end{aligned} \end{aligned}$$
for this example. Thus from Corollary 4.1, we infer that there exists some constant \(C>0\) that is independent of the distance to the blow-up time and the maximum time step length such that
$$\begin{aligned} \begin{aligned} \max _{1 \le m \le M} \Vert \widehat{e}_m\Vert _{L^{\infty }(I_m)} \le CM\varepsilon ^{-1}_M \mathtt {tol}^{\star }. \end{aligned} \end{aligned}$$
Since \(\displaystyle \Vert u\Vert _{L^\infty (0,T)} = \log (\varepsilon _M^{-1})\), we remark that the error estimator blows up at a faster rate than the exact solution in this example as well. Moreover, Fig. 4 implies that this rate is again quasi-optimal in the best case scenario for the error estimator. To conclude, we remark that this result gives additional weight to the interpretation of \(\widehat{\delta }_M\) as an a posteriori approximation to the blow-up rate of the error on (0, T) when \(\mathcal F\) is nonlinear.

6 Conclusions

In this paper, we derived a conditional a posteriori error bound through a continuation argument for Galerkin time discretizations of general nonlinear initial value problems; the derived bound was then used to drive h and hp versions of an adaptive algorithm designed to approximate the blow-up time. Numerical experiments indicate that the h-version of the adaptive algorithm attains algebraic convergence towards the blow-up time while the hp-version of the adaptive algorithm attains exponential convergence towards the blow-up time. This work constitutes an important step towards deriving hp-version a posteriori error bounds for the semilinear heat equation that are robust with respect to the distance from the blow-up time, thereby extending the results in the works [5, 15].
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://​creativecommons.​org/​licenses/​by/​4.​0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
Literatur
1.
Zurück zum Zitat Akrivis, G., Makridakis, C., Nochetto, R.H.: A posteriori error estimates for the Crank-Nicolson method for parabolic equations. Math. Comput. 75(254), 511–531 (2006)MathSciNetCrossRefMATH Akrivis, G., Makridakis, C., Nochetto, R.H.: A posteriori error estimates for the Crank-Nicolson method for parabolic equations. Math. Comput. 75(254), 511–531 (2006)MathSciNetCrossRefMATH
2.
Zurück zum Zitat Akrivis, G., Makridakis, C., Nochetto, R.H.: Optimal order a posteriori error estimates for a class of Runge-Kutta and Galerkin methods. Numer. Math. 114(1), 133–160 (2009)MathSciNetCrossRefMATH Akrivis, G., Makridakis, C., Nochetto, R.H.: Optimal order a posteriori error estimates for a class of Runge-Kutta and Galerkin methods. Numer. Math. 114(1), 133–160 (2009)MathSciNetCrossRefMATH
3.
Zurück zum Zitat Bangerth, W., Rannacher, R.: Adaptive Finite Element Methods for Differential Equations. Lectures in Mathematics, ETH Zürich. Birkhäuser Verlag, Basel (2003)CrossRefMATH Bangerth, W., Rannacher, R.: Adaptive Finite Element Methods for Differential Equations. Lectures in Mathematics, ETH Zürich. Birkhäuser Verlag, Basel (2003)CrossRefMATH
4.
Zurück zum Zitat Brunner, H., Schötzau, D.: \(hp\)-discontinuous Galerkin time-stepping for Volterra integrodifferential equations. SIAM J. Numer. Anal. 44, 224–245 (2006)MathSciNetCrossRefMATH Brunner, H., Schötzau, D.: \(hp\)-discontinuous Galerkin time-stepping for Volterra integrodifferential equations. SIAM J. Numer. Anal. 44, 224–245 (2006)MathSciNetCrossRefMATH
5.
Zurück zum Zitat Cangiani, A., Georgoulis, E.H., Kyza, I., Metcalfe, S.: Adaptivity and blow-up detection for nonlinear evolution problems. SIAM J. Sci. Comput. 38, A3833–A3856 (2016) Cangiani, A., Georgoulis, E.H., Kyza, I., Metcalfe, S.: Adaptivity and blow-up detection for nonlinear evolution problems. SIAM J. Sci. Comput. 38, A3833–A3856 (2016)
6.
Zurück zum Zitat Cuesta, E., Makridakis, C.: A posteriori error estimates and maximal regularity for approximations of fully nonlinear parabolic problems in Banach spaces. Numer. Math. 110, 257–275 (2008)MathSciNetCrossRefMATH Cuesta, E., Makridakis, C.: A posteriori error estimates and maximal regularity for approximations of fully nonlinear parabolic problems in Banach spaces. Numer. Math. 110, 257–275 (2008)MathSciNetCrossRefMATH
7.
Zurück zum Zitat Delfour, M., Hager, W., Trochu, F.: Discontinuous Galerkin methods for ordinary differential equations. Math. Comput. 36, 455–473 (1981)MathSciNetCrossRefMATH Delfour, M., Hager, W., Trochu, F.: Discontinuous Galerkin methods for ordinary differential equations. Math. Comput. 36, 455–473 (1981)MathSciNetCrossRefMATH
8.
Zurück zum Zitat Delfour, M.C., Dubeau, F.: Discontinuous polynomial approximations in the theory of one-step, hybrid and multistep methods for nonlinear ordinary differential equations. Math. Comput. 47(175), 169–189, S1–S8 (1986)MathSciNetCrossRefMATH Delfour, M.C., Dubeau, F.: Discontinuous polynomial approximations in the theory of one-step, hybrid and multistep methods for nonlinear ordinary differential equations. Math. Comput. 47(175), 169–189, S1–S8 (1986)MathSciNetCrossRefMATH
9.
Zurück zum Zitat Estep, D.: A posteriori error bounds, global error control for approximation of ordinary differential equations. SIAM J. Numer. Anal. 32, 1–48 (1995)MathSciNetCrossRefMATH Estep, D.: A posteriori error bounds, global error control for approximation of ordinary differential equations. SIAM J. Numer. Anal. 32, 1–48 (1995)MathSciNetCrossRefMATH
10.
Zurück zum Zitat Estep, D., French, D.: Global error control for the continuous Galerkin finite element method for ordinary differential equations. RAIRO Modél. Math. Anal. Numér. 28, 815–852 (1994)MathSciNetCrossRefMATH Estep, D., French, D.: Global error control for the continuous Galerkin finite element method for ordinary differential equations. RAIRO Modél. Math. Anal. Numér. 28, 815–852 (1994)MathSciNetCrossRefMATH
11.
Zurück zum Zitat Fankhauser, T., Wihler, T.P., Wirz, M.: The hp-adaptive FEM based on continuous Sobolev embeddings: isotropic refinements. Comput. Math. Appl. 67(4), 854–868 (2014)MathSciNetCrossRefMATH Fankhauser, T., Wihler, T.P., Wirz, M.: The hp-adaptive FEM based on continuous Sobolev embeddings: isotropic refinements. Comput. Math. Appl. 67(4), 854–868 (2014)MathSciNetCrossRefMATH
12.
Zurück zum Zitat Janssen, B., Wihler, T.P.: Existence results for the continuous and discontinuous Galerkin time stepping methods for nonlinear initial value problems, arXiv preprint. arXiv:1407.5520 (2014) Janssen, B., Wihler, T.P.: Existence results for the continuous and discontinuous Galerkin time stepping methods for nonlinear initial value problems, arXiv preprint. arXiv:​1407.​5520 (2014)
13.
Zurück zum Zitat Johnson, C.: Error estimates and adaptive time-step control for a class of one-step methods for stiff ordinary differential equations. SIAM J. Numer. Anal. 25, 908–926 (1988)MathSciNetCrossRefMATH Johnson, C.: Error estimates and adaptive time-step control for a class of one-step methods for stiff ordinary differential equations. SIAM J. Numer. Anal. 25, 908–926 (1988)MathSciNetCrossRefMATH
14.
Zurück zum Zitat Kessler, D., Nochetto, R.H., Schmidt, A.: A posteriori error control for the Allen-Cahn problem: circumventing Gronwall’s inequality. ESAIM Math. Model. Numer. Anal. 38(1), 129–142 (2004). (Eng)MathSciNetCrossRefMATH Kessler, D., Nochetto, R.H., Schmidt, A.: A posteriori error control for the Allen-Cahn problem: circumventing Gronwall’s inequality. ESAIM Math. Model. Numer. Anal. 38(1), 129–142 (2004). (Eng)MathSciNetCrossRefMATH
15.
Zurück zum Zitat Kyza, I., Makridakis, C.: Analysis for time discrete approximations of blow-up solutions of semilinear parabolic equations. SIAM J. Numer. Anal. 49(1), 405–426 (2011)MathSciNetCrossRefMATH Kyza, I., Makridakis, C.: Analysis for time discrete approximations of blow-up solutions of semilinear parabolic equations. SIAM J. Numer. Anal. 49(1), 405–426 (2011)MathSciNetCrossRefMATH
16.
Zurück zum Zitat Lozinski, A., Picasso, M., Prachittham, V.: An anisotropic error estimator for the Crank-Nicolson method: application to a parabolic problem. SIAM J. Sci. Comput. 31(4), 2757–2783 (2009)MathSciNetCrossRefMATH Lozinski, A., Picasso, M., Prachittham, V.: An anisotropic error estimator for the Crank-Nicolson method: application to a parabolic problem. SIAM J. Sci. Comput. 31(4), 2757–2783 (2009)MathSciNetCrossRefMATH
17.
Zurück zum Zitat Makridakis, C., Nochetto, R.H.: A posteriori error analysis for higher order dissipative methods for evolution problems. Numer. Math. 104(4), 489–514 (2006)MathSciNetCrossRefMATH Makridakis, C., Nochetto, R.H.: A posteriori error analysis for higher order dissipative methods for evolution problems. Numer. Math. 104(4), 489–514 (2006)MathSciNetCrossRefMATH
18.
Zurück zum Zitat Mitchell, W.F., McClain, M.A.: A comparison of hp-adaptive strategies for elliptic partial differential equations. ACM Trans. Math. Softw. (TOMS) 41(1), 2 (2014)MathSciNetCrossRefMATH Mitchell, W.F., McClain, M.A.: A comparison of hp-adaptive strategies for elliptic partial differential equations. ACM Trans. Math. Softw. (TOMS) 41(1), 2 (2014)MathSciNetCrossRefMATH
19.
Zurück zum Zitat Schötzau, D., Schwab, C.: An \(hp\) a-priori error analysis of the DG time-stepping method for initial value problems. Calcolo 37, 207–232 (2000)MathSciNetCrossRefMATH Schötzau, D., Schwab, C.: An \(hp\) a-priori error analysis of the DG time-stepping method for initial value problems. Calcolo 37, 207–232 (2000)MathSciNetCrossRefMATH
20.
Zurück zum Zitat Schötzau, D., Schwab, C.: Time discretization of parabolic problems by the \(hp\)-version of the discontinuous Galerkin finite element method. SIAM J. Numer. Anal. 38, 837–875 (2000)MathSciNetCrossRefMATH Schötzau, D., Schwab, C.: Time discretization of parabolic problems by the \(hp\)-version of the discontinuous Galerkin finite element method. SIAM J. Numer. Anal. 38, 837–875 (2000)MathSciNetCrossRefMATH
21.
Zurück zum Zitat Schötzau, D., Schwab, C.: \(hp\)-discontinuous Galerkin time-stepping for parabolic problems. C. R. Acad. Sci. Paris, Série I 333, 1121–1126 (2001)MathSciNetCrossRefMATH Schötzau, D., Schwab, C.: \(hp\)-discontinuous Galerkin time-stepping for parabolic problems. C. R. Acad. Sci. Paris, Série I 333, 1121–1126 (2001)MathSciNetCrossRefMATH
22.
Zurück zum Zitat Schötzau, D., Wihler, T.P.: A posteriori error estimation for \(hp\)-version time-stepping methods for parabolic partial differential equations. Numer. Math. 115(3), 475–509 (2010)MathSciNetCrossRefMATH Schötzau, D., Wihler, T.P.: A posteriori error estimation for \(hp\)-version time-stepping methods for parabolic partial differential equations. Numer. Math. 115(3), 475–509 (2010)MathSciNetCrossRefMATH
23.
Zurück zum Zitat Werder, T., Gerdes, K., Schötzau, D., Schwab, C.: \(hp\)-discontinuous Galerkin time-stepping for parabolic problems. Comput. Methods Appl. Mech. Engrg. 190, 6685–6708 (2001)MathSciNetCrossRefMATH Werder, T., Gerdes, K., Schötzau, D., Schwab, C.: \(hp\)-discontinuous Galerkin time-stepping for parabolic problems. Comput. Methods Appl. Mech. Engrg. 190, 6685–6708 (2001)MathSciNetCrossRefMATH
24.
Zurück zum Zitat Wihler, T.P.: An a-priori error analysis of the \(hp\)-version of the continuous Galerkin FEM for nonlinear initial value problems. J. Sci. Comput. 25, 523–549 (2005)MathSciNetCrossRefMATH Wihler, T.P.: An a-priori error analysis of the \(hp\)-version of the continuous Galerkin FEM for nonlinear initial value problems. J. Sci. Comput. 25, 523–549 (2005)MathSciNetCrossRefMATH
Metadaten
Titel
hp-Adaptive Galerkin Time Stepping Methods for Nonlinear Initial Value Problems
verfasst von
Irene Kyza
Stephen Metcalfe
Thomas P. Wihler
Publikationsdatum
27.09.2017
Verlag
Springer US
Erschienen in
Journal of Scientific Computing / Ausgabe 1/2018
Print ISSN: 0885-7474
Elektronische ISSN: 1573-7691
DOI
https://doi.org/10.1007/s10915-017-0565-x

Weitere Artikel der Ausgabe 1/2018

Journal of Scientific Computing 1/2018 Zur Ausgabe