Low-storage implicit/explicit Runge–Kutta schemes for the simulation of stiff high-dimensional ODE systems

doi:10.1016/j.jcp.2015.01.031

Journal of Computational Physics

Volume 286, 1 April 2015, Pages 172-193

https://doi.org/10.1016/j.jcp.2015.01.031 Get rights and content

Abstract

Implicit/explicit (IMEX) Runge–Kutta (RK) schemes are effective for time-marching ODE systems with both stiff and nonstiff terms on the RHS; such schemes implement an (often A-stable or better) implicit RK scheme for the stiff part of the ODE, which is often linear, and, simultaneously, a (more convenient) explicit RK scheme for the nonstiff part of the ODE, which is often nonlinear. Low-storage RK schemes are especially effective for time-marching high-dimensional ODE discretizations of PDE systems on modern (cache-based) computational hardware, in which memory management is often the most significant computational bottleneck. In this paper, we develop and characterize eight new low-storage implicit/explicit RK schemes which have higher accuracy and better stability properties than the only low-storage implicit/explicit RK scheme available previously, the venerable second-order Crank–Nicolson/Runge–Kutta–Wray (CN/RKW3) algorithm that has dominated the DNS/LES literature for the last 25 years, while requiring similar storage (two, three, or four registers of length N) and comparable floating-point operations per timestep.

Introduction

Although a wide variety of methods have been used for spatial discretization and subgrid-scale modeling in the Direct Numerical Simulation (DNS) and Large Eddy Simulation (LES) of turbulent flows, time marching schemes for such systems have relied, in most cases, on an implicit scheme for the advancement of the stiff terms and an explicit scheme for the advancement of the nonstiff terms. Among these so-called IMEX schemes, an approach that gained favor due to [11] and [12] coupled the (implicit, second-order) Crank–Nicolson (CN) scheme for the stiff terms with the (explicit) second-order Adams–Bashforth (AB2) scheme for the nonstiff terms. This approach was refined in [13], which used the (implicit) CN scheme for the stiff terms, at each RK substep, together with the (explicit) third-order low-storage Runge–Kutta–Wray (RKW3) scheme [22] for the nonstiff terms. This venerable IMEX algorithm, dubbed CN/RKW3, still enjoys extensive use today, and is particularly appealing, as only two registers are required for advancing the ODE in time, though if three registers are used, the number of flops required by the algorithm may be significantly reduced. In high-dimensional discretizations of 3D PDE systems on modern computational hardware, the reduced memory footprint of this time marching algorithm, in its two-register or three-register form, can significantly reduce the execution time of a simulation. However, the CN/RKW3 scheme has the considerable disadvantage of being only second-order accurate, and its implicit part is only A-stable. In recent years, there have been relatively few attempts to refine the CN/RKW3 time-marching scheme for turbulence simulations, perhaps due to a mistaken notion that modifying it to achieve higher order might result in either increased storage requirements, significantly more computation per timestep, or the loss of A stability of the implicit part. It turns out that this is untrue; in fact, there is much to be gained by revising this algorithm.

When using an IMEX scheme, such as those described above, to march the incompressible Navier–Stokes equation, one natural choice is to treat the (linear) diffusion terms as the “stiff terms” and the (nonlinear) convective terms as the “nonstiff terms”. Note that a better choice for discretizations with significant grid clustering implemented in one or more spatial directions, as usually present when simulating wall-bounded turbulent flows, is to treat the diffusion and linearized convection terms with derivatives in the direction of most significant grid clustering (e.g., in the direction normal to the nearest wall) as the “stiff” terms, and the remaining terms as the “nonstiff” terms, as suggested by [1]. Note further that so-called fractional step methods are often combined with such IMEX schemes in order to enforce the incompressibility constraint (see, e.g., [13]). The present paper focuses exclusively on the IMEXRK part of such time-advancement algorithms; various creative choices for which terms to take implicitly at different points in the physical domain of interest, and various methods for implementing fractional step techniques for enforcing exactly the divergence-free constraint, may subsequently be addressed in an identical manner as discussed in [1] and [13], and elsewhere in the literature.

Over the last 30 years, there has been significant development of (full-storage) IMEXRK algorithms. A comprehensive review of this literature is given in [9], and a brief summary of this subject is given in Section 1.1 below, including the general structure of full-storage IMEXRK schemes, their general implementation, conditions on their parameters for second-, third-, and fourth-order accuracy, and characterizations of their stability.

Further, in the years since the development of RKW3 in [22], there has been significant development of alternative low-storage explicit RK schemes; a comprehensive review of this literature is given in [10], and a brief summary of this subject is given in Section 1.2 below, including the extension to implicit RK schemes, the introduction of a general 2-register IMEXRK form, efficient 3-register and 2-register implementations of this form, as well as the introduction of a general 3-register IMEXRK form, and efficient 4-register and 3-register implementations of this form.

We then develop eight new low-storage IMEXRK schemes well suited for turbulent flow simulations, and other computational grand challenge applications, using two, three, or four registers of length N (the dimension of the ODE under consideration). With an eye on the computational cost of their implementation, we focus on schemes with the smallest number of stages possible for a given order, stability, and storage requirement. A comprehensive summary of the schemes developed in this paper is given in Table 1. In short:

•
Section 2 presents two second-order, 2-register IMEXRK schemes:
- –
  the classic 3-stage, A-stable, CN/RKW3 scheme, and
- –
  a new, $(2, 3)$ -stage [that is, a scheme with 2 implicit stages and 3 explicit stages], L-stable, strong-stability-preserving scheme, dubbed IMEXRKCB2.
•
Section 3 presents five new third-order, 2-register IMEXRK schemes:
- –
  a $(2, 3)$ -stage, strongly A-stable scheme, dubbed IMEXRKCB3a,
- –
  a $(3, 4)$ -stage, strongly A-stable scheme with ESDIRK implicit part, dubbed IMEXRKCB3b, and
- –
  three $(3, 4)$ -stage, L-stable schemes:
  - –
    one with coefficients selected to maximize stability of the ERK part on the negative real axis while being strong stability preserving, dubbed IMEXRKCB3c,
  - –
    one with coefficients selected to be strong stability preserving for the maximum possible timestep, dubbed IMEXRKCB3d, and
  - –
    one with coefficients selected to maximize accuracy of the ERK part, dubbed IMEXRKCB3e.
•
Section 4 presents a new third-order, 3-register, 4-stage, L-stable, stage-order-2 scheme dubbed IMEXRKCB3f.
•
Section 5 presents a new fourth-order, 3-register, 6-stage, L-stable, stage-order-2 scheme dubbed IMEXRKCB4.

In Section 6, we provide an analysis of the well-known order reduction phenomenon arising during the integration of very stiff ODEs using these IMEXRK schemes. Finally, Section 7 considers the application of all of these low-storage IMEXRK schemes, and some of their full-storage IMEXRK competitors, to a representative test problem in order to compare their computational efficiency.

A comprehensive review of (full-storage) IMEXRK schemes is given by Kennedy, Carpenter, and Lewis [9]. In short, IMEXRK schemes incorporate a coordinated pair of Diagonally Implicit Runge–Kutta (DIRK, with lower-triangular A) and Explicit Runge–Kutta (ERK, with strictly lower-triangular A) schemes, with parameters as summarized in the standard Butcher tableaux for the time advancement of an ODE of the form $\frac{d x (t)}{d t} = f (x, t) + g (x, t),$ where $f (x, t)$ represents the stiff part of the RHS [advanced with the DIRK method at left in (1)], and $g (x, t)$ represents the nonstiff part of the RHS [simultaneously advanced with the ERK method at right in (1)].

If the stiff part of the ODE is linear [that is, if $f (x, t) = A x$ ] then, denoting the efficient solution of $A x = b$ as $A^{- 1} b$ , a full-storage implementation of the IMEXRK scheme in (1) to advance from $x = x_{n}$ to $x = x_{n + 1}$ proceeds as follows $for k = 1 : s$ $if k = = 1, y = x, else, y = x + \sum_{i = 1}^{k - 1} a_{k, i}^{IM} Δ t f_{i} + \sum_{j = 1}^{k - 1} a_{k, j}^{EX} Δ t g_{j}, end$ $f_{k} = A {(I - a_{k, k}^{IM} Δ t A)}^{- 1} y [equivalently, f_{k} = {(I - a_{k, k}^{IM} Δ t A)}^{- 1} A y]$ $g_{k} = g (y + a_{k k}^{IM} Δ t f_{k}, t_{n} + c_{k}^{EX} Δ t)$ $end$ $x \leftarrow x + \sum_{i = 1}^{s} b_{i}^{IM} Δ t f_{i} + \sum_{j = 1}^{s} b_{j}^{EX} Δ t g_{j}$ $\hat{x} \leftarrow \hat{x} + \sum_{i = 1}^{s} {\hat{b}}_{i}^{IM} Δ t f_{i} + \sum_{j = 1}^{s} {\hat{b}}_{j}^{EX} Δ t g_{j}$ Line (3c) above is simply $f_{k} = f (z, t_{n} + c_{k}^{IM} Δ t)$ , where z is the solution of $e (z) = z - y - a_{k k}^{IM} Δ t f (z, t_{n} + c_{k}^{IM} Δ t) = 0$ [that is, where $z = y + a_{k k}^{IM} Δ t f (z, t_{n} + c_{k}^{IM} Δ t)$ ], in the special case that $f (x, t) = A x$ . More generally, if the stiff part $f (x, t)$ is nonlinear, then line (3c) is replaced by a Newton–Raphson iteration (see [16]) to find the z such that $e (z) = 0$ : $\begin{matrix} Initialize : & z_{0} = y + a_{k k}^{IM} Δ t f (y, t_{n} + c_{k}^{IM} Δ t) \\ Iterate : & (I - a_{k k}^{IM} Δ t {\frac{\partial f (x, t_{n} + c_{k}^{IM} Δ t)}{\partial x} |}_{x = z_{m}}) (z_{m + 1} - z_{m}) = - z_{m} + y + a_{k k}^{IM} Δ t f (z_{m}, t_{n} + c_{k}^{IM} Δ t) \\ Upon exit : & f_{k} = f (z_{converged}, t_{n} + c_{k}^{IM} Δ t) \end{matrix}}$ The Jacobian used in this iteration may be computed analytically or approximated numerically. The low-storage IMEXRK algorithms developed in this work may be applied in the linear or nonlinear setting, mutatis mutandis; Sections 1.2.1 General three-register implementation of [2R] IMEXRK schemes, 1.2.2 General two-register implementation of [2R] IMEXRK schemes, 1.2.3 General four-register implementation of [3R] IMEXRK schemes, 1.2.4 General three-register implementation of [3R] IMEXRK schemes provide low-storage pseudocode implementations in the case in which the stiff part of the ODE is linear.

Finally, note that the ${\hat{b}}_{i}^{IM}$ and ${\hat{b}}_{i}^{EX}$ coefficients in the Butcher tableaux, if provided, are used to form a so-called embedded scheme to advance the solution at each timestep with an order of accuracy reduced by one with respect to the main scheme. Using this embedded scheme, one may estimate the error of the simulation at each timestep, and adjust the stepsize at the next iteration accordingly.

As is well known (see, e.g., [3]), for the DIRK and ERK components in (1), when used in isolation, to be first-order accurate, it is required that $τ_{1}^{IM (1)} = \sum_{i} b_{i}^{IM} - 1 = 0 τ_{1}^{EX (1)} = \sum_{i} b_{i}^{EX} - 1 = 0,$ for these schemes, when used in isolation, to be second-order accurate, it is additionally required that $τ_{1}^{IM (2)} = \sum_{i} b_{i}^{IM} c_{i}^{IM} - 1 / 2 = 0 τ_{1}^{EX (2)} = \sum_{i} b_{i}^{EX} c_{i}^{EX} - 1 / 2 = 0,$ for these schemes, when used in isolation, to be third-order accurate, it is additionally required that $τ_{1}^{IM (3)} = (1 / 2) \sum_{i} b_{i}^{IM} c_{i}^{IM} c_{i}^{IM} - 1 / 6 = 0 τ_{1}^{EX (3)} = (1 / 2) \sum_{i} b_{i}^{EX} c_{i}^{EX} c_{i}^{EX} - 1 / 6 = 0$ $τ_{2}^{IM (3)} = \sum_{i, j} b_{i}^{IM} a_{i j}^{IM} c_{j}^{IM} - 1 / 6 = 0 τ_{2}^{EX (3)} = \sum_{i, j} b_{i}^{EX} a_{i j}^{EX} c_{j}^{EX} - 1 / 6 = 0,$ and for these schemes, when used in isolation, to be fourth-order accurate, it is additionally required that $τ_{1}^{IM (4)} = (1 / 6) \sum_{i} b_{i}^{IM} c_{i}^{IM} c_{i}^{IM} c_{i}^{IM} - 1 / 24 = 0 τ_{1}^{EX (4)} = (1 / 6) \sum_{i} b_{i}^{EX} c_{i}^{EX} c_{i}^{EX} c_{i}^{EX} - 1 / 24 = 0$ $τ_{2}^{IM (4)} = (1 / 3) \sum_{i, j} b_{i}^{IM} c_{i}^{IM} a_{i j}^{IM} c_{j}^{IM} - 1 / 24 = 0 τ_{2}^{EX (4)} = (1 / 3) \sum_{i, j} b_{i}^{EX} c_{i}^{EX} a_{i j}^{EX} c_{j}^{EX} - 1 / 24 = 0$ $τ_{3}^{IM (4)} = (1 / 2) \sum_{i, j} b_{i}^{IM} a_{i j}^{IM} c_{j}^{IM} c_{j}^{IM} - 1 / 24 = 0 τ_{3}^{EX (4)} = (1 / 2) \sum_{i, j} b_{i}^{EX} a_{i j}^{EX} c_{j}^{EX} c_{j}^{EX} - 1 / 24 = 0$ $τ_{4}^{IM (4)} = \sum_{i, j, k} b_{i}^{IM} a_{i j}^{IM} a_{j k}^{IM} c_{k}^{IM} - 1 / 24 = 0 τ_{4}^{EX (4)} = \sum_{i, j, k} b_{i}^{EX} a_{i j}^{EX} a_{j k}^{EX} c_{k}^{EX} - 1 / 24 = 0 .$ Recall that, in the scalar case, the exact solution of $x^{'} = f (x) + g (x)$ has the following terms: $x_{n + 1} = x_{n} + Δ t x_{n}^{'} + {(Δ t)}^{2} x_{n}^{″} / 2! + {(Δ t)}^{3} x_{n}^{‴} / 3! + O ({(Δ t)}^{4}) = x_{n} + Δ t {f + g}_{(x_{n}, t_{n})} + \frac{{(Δ t)}^{2}}{2!} {f^{'} f + f^{'} g + g^{'} f + g^{'} g}_{(x_{n}, t_{n})} + \frac{{(Δ t)}^{3}}{3!} {f^{″} f f + 2 f^{″} f g + f^{″} g g + g^{″} f f + 2 g^{″} f g + g^{″} g g + f^{'} f^{'} f + f^{'} g^{'} f + g^{'} f^{'} f + g^{'} g^{'} f + f^{'} f^{'} g + f^{'} g^{'} g + g^{'} f^{'} g + g^{'} g^{'} g}_{(x_{n}, t_{n})} + O ({(Δ t)}^{4});$ note in particular that there are 2 terms at second order and 10 terms at third order that involve both f and g. For the DIRK and ERK components in (1), when used together in an IMEX fashion, to be second-order accurate, it is thus additionally required that $τ_{1}^{IMEX (2)} = \sum_{i} b_{i}^{IM} c_{i}^{EX} - 1 / 2 = 0 τ_{2}^{IM EX (2)} = \sum_{i} b_{i}^{EX} c_{i}^{IM} - 1 / 2 = 0,$ for these schemes, when used together in an IMEX fashion, to be third-order accurate, it is additionally required that $τ_{1}^{IMEX (3)} = (1 / 2) \sum_{i} b_{i}^{IM} c_{i}^{EX} c_{i}^{EX} - 1 / 6 = 0 τ_{2}^{IMEX (3)} = (1 / 2) \sum_{i} b_{i}^{EX} c_{i}^{IM} c_{i}^{IM} - 1 / 6 = 0$ $τ_{3}^{IMEX (3)} = (1 / 2) \sum_{i} b_{i}^{IM} c_{i}^{IM} c_{i}^{EX} - 1 / 6 = 0 τ_{4}^{IMEX (3)} = (1 / 2) \sum_{i} b_{i}^{EX} c_{i}^{IM} c_{i}^{EX} - 1 / 6 = 0$ $τ_{5}^{IMEX (3)} = \sum_{i, j} b_{i}^{IM} a_{i j}^{EX} c_{j}^{EX} - 1 / 6 = 0 τ_{6}^{IMEX (3)} = \sum_{i, j} b_{i}^{EX} a_{i j}^{IM} c_{j}^{IM} - 1 / 6 = 0$ $τ_{7}^{IMEX (3)} = \sum_{i, j} b_{i}^{EX} a_{i j}^{EX} c_{j}^{IM} - 1 / 6 = 0 τ_{8}^{IMEX (3)} = \sum_{i, j} b_{i}^{IM} a_{i j}^{IM} c_{j}^{EX} - 1 / 6 = 0$ $τ_{9}^{IMEX (3)} = \sum_{i, j} b_{i}^{IM} a_{i j}^{EX} c_{j}^{IM} - 1 / 6 = 0 τ_{10}^{IMEX (3)} = \sum_{i, j} b_{i}^{EX} a_{i j}^{IM} c_{j}^{EX} - 1 / 6 = 0,$ and for these schemes, when used together in an IMEX fashion, to be fourth-order accurate, 44 additional constraints are required (see [9]), which for brevity aren't listed here.

The stability of an RK scheme may be characterized by considering the model problem $d x / d t = λ x$ and defining $z = λ Δ t$ , $σ (z) = x_{n + 1} / x_{n}$ , and $σ (\infty) ≜ \lim_{| z | \to \infty} σ (z)$ . The stability function of an RK scheme with Butcher tableau parameters A and b is then given by $σ (z) = 1 + z b^{T} {(I - z A)}^{- 1} 1$ , where 1 denotes a vector of ones; the RK scheme is said to be stable for any z such that $| σ (z) | \leq 1$ . Further, considering its application to stiff systems, an RK scheme is said to be

•
A-stable if $| σ (z) | \leq 1$ over the entire LHP of z,
•
strongly A-stable if it is A-stable and $| σ (\infty) | < 1$ , and
•
L-stable if it is A-stable and $σ (\infty) = 0$ .

The stability of an IMEXRK scheme is a bit more difficult to characterize. Of course, one may start by characterizing the stability of the implicit and explicit parts considered in isolation. To evaluate the stability of the implicit and explicit components of an IMEX scheme working together, we consider the model problem $d x / d t = λ_{f} x + λ_{g} x$ , where the first term on the RHS is handled implicitly, and the second term on the RHS is handled explicitly. Defining $z^{IM} = λ_{f} Δ t$ , $z^{EX} = λ_{g} Δ t$ , and $σ (z^{IM}; z^{EX}) = x_{n + 1} / x_{n}$ , we may write (see [9]) $σ (z^{IM}; z^{EX}) = \frac{det [I - z^{IM} A^{IM} - z^{EX} A^{EX} + z^{IM} 1 {(b^{IM})}^{T} + z^{EX} 1 {(b^{EX})}^{T}]}{det [I - z^{IM} A^{IM}]} .$ We may then characterize the stability of the implicit and explicit parts of an IMEXRK scheme working in concert, when the implicit part of the problem is stiff, by looking at $σ (z^{IM}; z^{EX})$ as $z^{IM} \to \infty$ for finite $z^{EX}$ .

Consider the 1D hyperbolic PDE $\partial u / \partial t = - \partial f (u) / \partial x;$ denoting by $u_{i} (t)$ the discretization of $u (x, t)$ on N spatial grid points $x_{i}$ , and by $u (t)$ a vector containing all of the $u_{i} (t)$ , we write the spatial discretization of this PDE as the ODE $d u / d t = L (u) .$ If a TVD spatial discretization is used, such as a Godunov or MUSCL scheme with an appropriate flux limiter incorporated (see [14]), then applying a simple Explicit Euler time discretization to (7), $u^{n + 1} = u^{n} + Δ t L (u^{n}),$ under the appropriate CFL condition on the timestep, $Δ t \leq Δ t_{CFL}$ , results in a simulation exhibiting a total variation of the discrete solution which does not increase in time, that is, $TV (u^{n + 1}) \leq TV (u^{n}), where TV (u^{n}) = \sum_{j} | u_{j + 1}^{n} - u_{j}^{n} | .$ Strong-stability preserving (SSP) explicit time-discretization methods (see [17] and [18]) are simply higher-order time discretization methods that conserve this total variation diminishing property with a modified CFL condition on the timestep, $Δ t \leq c Δ t_{CFL}$ .

In [18] (see also [6]), a condition for an explicit Runge–Kutta scheme to be SSP has been developed. This condition states that if an s-stage explicit Runge–Kutta scheme is written in incremental form, that is, $u^{(0)} = u^{n}$ $u^{(i)} = \sum_{j = 0}^{i - 1} (α_{i j} u^{(j)} + Δ t β_{i j} L (u^{(j)})) for i = 1, \dots, s$ $u^{n + 1} = u^{(s)},$ where all of the $α_{i j} \geq 0$ , and if the forward Euler method applied to the ODE (7) arising from a TVD spatial discretization of the hyperbolic PDE (6) is strongly stable under the appropriate CFL restriction, then such an explicit Runge–Kutta method is SSP provided that all of the $β_{i j} \geq 0$ and that the following CFL restriction is fulfilled: $Δ t \leq c Δ t_{CFL}, c = \min_{i, j} \frac{α_{i j}}{β_{i j}} .$ In case an explicit scheme is coupled with an implicit scheme, as in an IMEXRK formulation, then, provided the implicit scheme used to integrate the stiff part of the ODE is L-stable, in the stiff limit the time integration scheme becomes the explicit Runge–Kutta scheme, and the order of accuracy of the limiting scheme is greater than or equal to the order of accuracy of the IMEXRK scheme itself. Hence, as stated in [15], if the explicit part of the IMEXRK scheme is SSP, then the IMEXRK scheme will also be SSP in the stiff limit.

In [15], three full-storage second-order and two full-storage third-order IMEXRK schemes are presented which are SSP in the stiff limit; no other IMEXRK schemes with this SSP property were found in our review of the IMEXRK literature. The present paper derives three new IMEXRK schemes which are SSP in the stiff limit (one which is second-order and two which are third-order); unlike the schemes in [15], the IMEXRK schemes derived here are of the low-storage variety.

The existing literature on low-storage RK schemes to date appears to focus exclusively on explicit schemes. Note that a cavalier implementation of a full-storage ERK scheme [see the explicit part of (3a), (3b), (3c), (3d), (3e), (3f), (3g)] requires storage of the state vector [x], the intermediate vector [y], and s values of the RHS vectors [ $g_{k}$ ]; that is, $s + 2$ vectors of length N, where $x = x_{N \times 1}$ . We now summarize the two main classes of low-storage ERK schemes,¹ a comprehensive review of which is given in Kennedy, Carpenter, and Lewis [10].

The two-register Williamson class of ERK schemes [20], denoted “ $[2 N]$ ” schemes, may be written to advance from $x = x_{n}$ to $x = x_{n + 1}$ as $\begin{matrix} for k = 1 : s \\ if k = = 1, Δ x \leftarrow Δ t g (x, t_{n} + c_{k} Δ t), else \\ Δ x \leftarrow α_{k} Δ x + Δ t g (x, t_{n} + c_{k} Δ t) \\ end \\ x \leftarrow x + β_{k} Δ x \\ end \end{matrix}$ If handled with care, such schemes can often be implemented efficiently in two registers of length N, x and $Δ x$ .

The two-register van der Houwen class of schemes [19], denoted “[2R]” schemes, restrict the parameters $a_{i j}$ below the first subdiagonal in the Butcher tableau of the ERK scheme to be equal to the parameters $b_{j}$ of the corresponding column, and may thus be written to advance from $x = x_{n}$ to $x = x_{n + 1}$ as $\begin{matrix} for k = 1 : s \\ if k = = 1, y \leftarrow x, else \\ y \leftarrow x + (a_{k, k - 1} - b_{k - 1}) Δ t g (y, t_{n} + c_{k - 1} Δ t) \\ end \\ x \leftarrow x + b_{k} Δ t g (y, t_{n} + c_{k} Δ t) \\ end \end{matrix}$ Such schemes can often be implemented efficiently in two registers of length N (namely, x and y). If implemented with three registers, however, the function $g (y, t_{n} + c_{k} Δ t)$ can be computed just once per timestep (instead of twice). RKW3 [22] is a commonly-used example of a two-register, three-stage, third-order van der Houwen ERK scheme, with a Butcher tableau of

In the three-register van der Houwen class of schemes, denoted “[3R]” schemes, only the parameters $a_{i j}$ below the second subdiagonal of the Butcher tableau of the ERK scheme must equal the parameters $b_{j}$ of the corresponding column. An effective implementation of such [3R] schemes that uses only three registers of length N (namely, x, y and z) is given by $\begin{matrix} for k = 1 : s \\ if k = = 1, y \leftarrow x, z \leftarrow x, else, \\ z \leftarrow y + a_{k, k - 1} Δ t g (y, t_{n} + c_{k - 1} Δ t) \\ if k < s, y \leftarrow x + (a_{k + 1, k - 1} - b_{k - 1}) g (y, t_{n} + c_{k - 1} Δ t), end \\ end \\ x \leftarrow x + b_{k} Δ t g (y, t_{n} + c_{k} Δ t) \\ end \end{matrix}$ Again, if implemented with four registers, the function $g (y, t_{n} + c_{k} Δ t)$ can be computed just once per timestep (instead of thrice). In the present work, we extend the two- and three-register van der Houwen classes of ERK schemes to the DIRK case, which can be accomplished with precisely the same restrictions on the (lower triangular) DIRK Butcher tableau as in the (strictly lower triangular) ERK case, as specified above. Further, we will develop coordinated pairs of such [2R] and [3R] DIRK and ERK schemes in the IMEX setting described in Section 1.1. In particular, we will develop a [2R] second-order IMEX scheme, [2R] and [3R] third-order IMEX schemes, and a [3R] fourth-order IMEX scheme.

As shown in Section 1.1, six constraints on the parameters of the IMEX Butcher tableaux (1) must be satisfied for second-order accuracy, fourteen additional constraints must be satisfied for third-order accuracy, and fifty-two additional constraints must be satisfied for fourth-order accuracy. Before proceeding, we thus introduce some significant simplifying assumptions. Following [15] and [9] and the CN/RKW3 scheme of [13], we synchronize the stages of DIRK and ERK components by imposing $c_{i}^{IM} = c_{i}^{EX} = c_{i}$ for $i = 1, \dots, s$ . We also coordinate the constituent DIRK and ERK components such that $b_{i}^{IM} = b_{i}^{EX} = b_{i}$ for $i = 1, \dots, s$ , as also done in [15] and [9], but which is not satisfied by CN/RKW3. Finally, for each stage, a stage-order of one is also imposed such that $\sum_{j = 1}^{i} a_{i j}^{IM} = \sum_{j = 1}^{i - 1} a_{i j}^{EX} = c_{i} for i = 1, \dots, s;$ it follows that $c_{1} = a_{11}^{IM} = a_{11}^{EX} = 0$ . As a result of these assumptions, the number of constraints on the IMEX parameters [see (4a), (4b), (4c), (4d), (4e), (4f), (4g), (4h), (4i), (4j), (4k), (4l), (4m), (4n)] for second-order accuracy is reduced to just two, the number of constraints for third-order accuracy is reduced to five, and the number of constraints for fourth-order accuracy is reduced to fourteen.

For several of the IMEXRK schemes developed in this paper, a lower-order embedded scheme is also developed, relaxing the ${\hat{b}}_{i}^{IM} = {\hat{b}}_{i}^{EX}$ restriction to provide increased freedom during the design phase. As a general guideline, none of the leading-order truncation terms of an embedded scheme should vanish, so that each of these terms will contribute to the error estimate (subject to this restriction, the remaining free parameters of the embedded scheme are then optimized to maximize the magnitude of the leading-order truncation terms). Unfortunately, this is not always achievable; as a result, not all of the schemes developed in this paper are listed with embedded schemes. For all of the embedded schemes we do report, the DIRK part of the embedded scheme is at least A-stable, which is a property of the embedded scheme recommended by [8]; note, however, that the embedded scheme is not used for time marching, it is only used to adjust the timestep.

The IMEX Butcher tableaux in (1) for coordinated pairs of [2R] DIRK and ERK schemes are thus simplified to and the IMEX Butcher tableaux for coordinated pairs of [3R] DIRK and ERK schemes are simplified to Note also that, as the DIRK component, the IMEXRK form considered above has an explicit first stage, its stability function (5) may be written $σ (z^{IM}; z^{EX}) = \frac{1 + \sum_{i = 1}^{s} p_{i} (z^{EX}) {[z^{IM}]}^{i}}{1 + \sum_{i = 1}^{s - 1} q_{i} {[z^{IM}]}^{i}} where p_{i} (z^{EX}) = \sum_{j = 0}^{s - i} {\hat{p}}_{i j} {[z^{EX}]}^{j} .$

Note that, if the stiff part of the ODE is linear [that is, if $f (x, t) = A x$ ] then, denoting the efficient solution of $A x = b$ as $A^{- 1} b$ , a straightforward implementation of the low-storage IMEXRK scheme in (16) that uses three registers² of length N to advance from $x = x_{n}$ to $x = x_{n + 1}$ proceeds as follows: $\begin{matrix} for k = 1 : s \\ if k = = 1, y \leftarrow x, else \\ y \leftarrow x + (a_{k, k - 1}^{IM} - b_{k - 1}^{IM}) Δ t z + (a_{k, k - 1}^{EX} - b_{k - 1}^{EX}) Δ t y \\ end \\ z = {(I - a_{k, k}^{IM} Δ t A)}^{- 1} A y \\ y \leftarrow g (y + a_{k, k}^{IM} Δ t z, t_{n} + c_{k}^{EX} Δ t) \\ x \leftarrow x + b_{k}^{IM} Δ t z + b_{k}^{EX} Δ t y \\ \hat{x} \leftarrow \hat{x} + {\hat{b}}_{k}^{IM} Δ t z + {\hat{b}}_{k}^{EX} Δ t y \\ end \end{matrix}$ where z and y store the implicit and explicit parts of the RHS at each stage, x is used to advance the solution of the main scheme,³ and $\hat{x}$ stores the solution of the embedded scheme if adaptive time stepping is implemented. Note that one linear solve of the form ${(I - c A)}^{- 1} b$ , one matrix/vector product $A y$ , and one nonlinear function evaluation $g (y, t)$ are computed per stage, in addition to various level-1 BLAS (basic linear algebra subroutine) operations. As discussed in Section 1.1, implementation in the case of a nonlinear stiff part is a straightforward extension.

By applying the matrix inversion lemma ${(\hat{A} + \hat{B} \hat{C} \hat{D})}^{- 1} = {\hat{A}}^{- 1} - {\hat{A}}^{- 1} \hat{B} {({\hat{C}}^{- 1} + \hat{D} {\hat{A}}^{- 1} \hat{B})}^{- 1} \hat{D} {\hat{A}}^{- 1}$ with $\hat{A} = \hat{C} = I$ , $\hat{D} = A$ , and $B = - a_{k, k}^{IM} Δ t$ , the algorithm laid out in Section 1.2.1 may be rewritten in a form that only requires two registers² of length N: $\begin{matrix} for k = 1 : s \\ if k = = 1, y \leftarrow x, else \\ y \leftarrow x + (a_{k, k - 1}^{IM} - b_{k - 1}^{IM}) Δ t A y + (a_{k, k - 1}^{EX} - b_{k - 1}^{EX}) Δ t g (y, t_{n} + c_{k - 1}^{EX} Δ t) \\ end \\ y \leftarrow {(I - a_{k, k}^{IM} Δ t A)}^{- 1} y \\ x \leftarrow x + b_{k}^{IM} Δ t A y + b_{k}^{EX} Δ t g (y, t_{n} + c_{k}^{EX} Δ t) \\ \hat{x} \leftarrow \hat{x} + {\hat{b}}_{k}^{IM} Δ t A y + {\hat{b}}_{k}^{EX} Δ t g (y, t_{n} + c_{k}^{EX} Δ t) \\ end \end{matrix}$ In this case, one linear solve of the form ${(I - c A)}^{- 1} b$ and two operations of the form⁴ $x + c A y + d g (y, t)$ are computed per stage, in addition to various level-1 BLAS operations. However, the storage requirement is reduced from three registers of length N to only two, which is quite significant. In many cases, some of the coefficients in the above algorithm turn out to be zero, so the increased computational cost associated with the extra nonlinear function evaluations and matrix/vector products in this implementation is not as bad as one might initially anticipate, as quantified in Section 7.

For the development of the stage-order-two schemes IMEXRKCB3f and IMEXRKCB4 in Section 4 and Section 5, the [3R] IMEXRK structure (17) will be used to provide increased freedom during the design phase. Such schemes admit the following four-register implementation: $\begin{matrix} for k = 1 : s \\ if k = = 1, y \leftarrow x, z^{IM} = x, z^{EX} \leftarrow x, else \\ z^{EX} \leftarrow y + a_{k, k - 1}^{EX} Δ t z^{EX} \\ if k < s, y \leftarrow x + (a_{k + 1, k - 1}^{IM} - b_{k - 1}^{IM}) Δ t z^{IM} + (a_{k + 1, k - 1}^{EX} - b_{k - 1}^{EX}) (z^{EX} - y) / a_{k, k - 1}^{EX}, end \\ z^{EX} \leftarrow z^{EX} + a_{k, k - 1}^{IM} Δ t z^{IM} \\ end \\ z^{IM} = {(I - a_{k, k}^{IM} Δ t A)}^{- 1} A z^{EX} \\ z^{EX} \leftarrow g (z^{EX} + a_{k, k}^{IM} Δ t z^{IM}, t_{n} + c_{k}^{EX} Δ t) \\ x \leftarrow x + b_{k}^{IM} Δ t z^{IM} + b_{k}^{EX} Δ t z^{EX} \\ \hat{x} \leftarrow \hat{x} + {\hat{b}}_{k}^{IM} Δ t z^{IM} + {\hat{b}}_{k}^{EX} Δ t z^{EX} \\ end \end{matrix}$ where $z^{IM}$ and $z^{EX}$ store the implicit and explicit parts of the RHS at each stage, y is a temporary variable which contributes to advance the solution to the next stage, x is used to advance the solution of the main scheme, and $\hat{x}$ stores the solution of the embedded scheme if adaptive timestepping is used. As in the three-register implementation of the [2R] scheme, only one linear solve of the form ${(I - c A)}^{- 1} b$ , one matrix/vector product, and one nonlinear function evaluation are computed per stage.

Leveraging matrix inversion lemma as done in Section 1.2.2, we obtain a general three-register implementation of any [3R] IMEXRK scheme: $\begin{matrix} for k = 1 : s \\ if k = = 1, y \leftarrow x, z \leftarrow x, else \\ if k < s \\ z \leftarrow y + a_{k, k - 1}^{IM} Δ t A z \\ y \leftarrow A^{- 1} (z - y) / (a_{k, k - 1}^{IM} Δ t) \\ z \leftarrow z + a_{k, k - 1}^{EX} Δ t g (y, t_{n} + c_{k - 1}^{EX} Δ t) \\ y \leftarrow x + (a_{k + 1, k - 1}^{IM} - b_{k - 1}^{IM}) Δ t A y + (a_{k + 1, k - 1}^{EX} - b_{k - 1}^{EX}) Δ t g (y, t_{n} + c_{k - 1}^{EX} Δ t) \\ else \\ z \leftarrow y + a_{k, k - 1}^{IM} Δ t A z + a_{k, k - 1}^{EX} Δ t g (y, t_{n} + c_{k - 1}^{EX} Δ t) \\ end \\ end \\ z \leftarrow {(I - a_{k, k}^{IM} Δ t A)}^{- 1} z \\ x \leftarrow x + b_{k}^{IM} Δ t A z + b_{k}^{EX} Δ t g (z, t_{n} + c_{k}^{EX} Δ t) \\ \hat{x} \leftarrow \hat{x} + {\hat{b}}_{k}^{IM} Δ t A z + {\hat{b}}_{k}^{EX} Δ t g (z, t_{n} + c_{k}^{EX} Δ t) \\ end \end{matrix}$ Note that this algorithm requires the invertibility of the matrix A, a condition that is often true when A arises from the discretization of a PDE. In this case, two linear systems, three matrix/vector products, and three nonlinear function evaluations must be performed per stage (except for the last stage), plus an additional matrix/vector product and one nonlinear function evaluation if the embedded scheme is used for adaptive timestepping.

Finally, note that a (hardware-dependent) trade-off between flops and storage must ultimately be conducted to select between the two-register and three-register implementation of any [2R] scheme, or between the three-register and four-register implementation of any [3R] scheme.

Section snippets

Two second-order, 2-register IMEXRK schemes

The classical second-order, A-stable CN/RKW3 method may easily be written in the low-storage IMEXRK Butcher tableaux form (16) (albeit with the $b_{i}^{IM} = b_{i}^{EX} = b_{i}$ constraint relaxed) with the four-stage IMEX Butcher tableaux A DIRK scheme with $c_{1} = 0$ and $c_{s} = 1$ [such as that shown at left in (23)] is known as a first-same-as-last (FSAL) scheme. In such a scheme, the implicit part of the last stage of one timestep is precisely the implicit part of the first stage of the next timestep, and thus an FSAL

A $(2, 3)$ -stage, strongly A-stable scheme

As suggested by (24), to streamline the implementation, we can suppress the first stage of the DIRK scheme by imposing $b_{1} = a_{21}^{IM} = 0$ . Following this simplification, the entire first column of the DIRK scheme is zero, thus leading to a scheme with $s - 1$ implicit stages and s explicit stages. In the $s = 3$ case, the IMEXRK Butcher tableaux take the general form To achieve third-order accuracy, after imposing stage-order-one conditions on both implicit and explicit part, we arrive at five nonlinear

A third-order, 3-register, 4-stage, L-stable scheme

All of the schemes so-far described have stage-order one for both the implicit and explicit components. It is well known in the literature (see [7]) that this limits the order of convergence of such methods when applied to stiff ODEs. In particular, if the stiffness is so high that the ODE turns into an index-1 DAE, some variables convert from differential to algebraic and their convergence rate is determined by the stage-order of the method. For this reason, an attempt has been made to improve

A fourth-order, 3-register, 6-stage, L-stable scheme

Solving the nonlinear system of equations arising from the imposition of the fourth-order accuracy constraints is a difficult task. For this reason, stage-order conditions higher than one are usually imposed, as pointed out in [8]. These conditions simplify the search for a solution by significantly reducing the nonconvexity of the corresponding optimization problem. For this reason, after imposing the same $b_{i}$ and $c_{i}$ over the explicit and implicit components and stiff accuracy for the implicit

Order reduction

We now consider the order reduction present when the schemes developed above are applied to the van der Pol equation. It is well documented in the literature (see, e.g., [8]) that whenever an RK method is used to integrate a singular perturbation problem (that is, an ODE characterized by a stiffness parameter ε whose behavior transitions towards that of an index-1 DAE as the stiffness increases), the observed convergence rate appears to be lower than the nominal order of accuracy of the RK

Computational cost

To illustrate the relative computational cost of our new low-storage IMEXRK schemes on a representative PDE model problem discretized on $N ≫ 1$ gridpoints, we now compare the efficient implementation of each of the methods developed herein to CN/RKW3 and several full-storage IMEX Runge–Kutta schemes available in literature. We consider as a model PDE problem the one-dimensional Kuramoto–Sivashinsky equation $\frac{\partial u}{\partial t} = - u \frac{\partial u}{\partial x} - \frac{\partial^{2} u}{\partial x^{2}} - \frac{\partial^{4} u}{\partial x^{4}}$ over the domain $x \in [- L / 2, L / 2]$ with $u = \partial u / \partial x = 0$ at $x = \pm L / 2$ , where L is

Conclusions

We have developed eight new IMEX Runge–Kutta schemes with reduced storage requirements, the properties of which are succinctly summarized and compared with competing schemes in Table 1. It is seen that:

•
IMEXRKCB2 is second-order accurate, like CN/RKW3; IMEXRKCB3a–3f are third-order accurate, and IMEXRKCB4 is fourth-order accurate.
•
IMEXRKCB2 and 3a–3e, like CN/RKW3, admit both two-register and three-register implementations, with the three-register implementations requiring slightly fewer flops.
•

Acknowledgements

The authors would like to thank one of the reviewers for the especially valuable comments and suggestions, which contributed to improve the results of the present work. The authors also gratefully acknowledge the financial support of AFOSR FA9550-12-1-0046 and NSF CNS-1035828.

References (22)

U.M. Ascher et al.
Implicit–explicit Runge–Kutta methods for time-dependent partial differential equations
Appl. Numer. Math.
(1997)
M.P. Calvo et al.
Linearly implicit Runge–Kutta methods for advection–reaction–diffusion equations
Appl. Numer. Math.
(2001)
C.A. Kennedy et al.
Additive Runge–Kutta schemes for convection–diffusion–reaction equations
Appl. Numer. Math.
(2003)
C.A. Kennedy et al.
Low-storage, explicit Runge–Kutta schemes for the compressible Navier–Stokes equations
Appl. Numer. Math.
(2000)
J. Kim et al.
Application of a fractional-step method to incompressible Navier–Stokes equations
J. Comput. Phys.
(1985)
H. Le et al.
An improvement of fractional step methods for the incompressible Navier–Stokes equations
J. Comput. Phys.
(1991)
C.W. Shu et al.
Efficient implementation of essentially non-oscillatory shock-capturing schemes
J. Comput. Phys.
(1988)
J.H. Williamson
Low-storage Runge–Kutta schemes
J. Comput. Phys.
(1980)
K. Akselvoll et al.
Large eddy simulation of turbulent confined coannular jets and turbulent flow over a backward facing step
(1995)
J.C. Butcher
Numerical Methods for Ordinary Differential Equations
(2008)

M.H. Carpenter et al.

Fourth-order Runge–Kutta schemes for fluid mechanics applications

J. Sci. Comput.

(2005)

Cited by (50)

A high-order finite difference method for moving immersed domain boundaries and material interfaces
2024, Journal of Computational Physics
We present a high-order sharp treatment of immersed moving domain boundaries and material interfaces, and apply it to the advection-diffusion equation in two and three dimensions. The spatial discretization combines dimension-split finite difference schemes with an immersed boundary treatment based on a weighted least-squares reconstruction of the solution, providing stable discretizations with up to sixth order accuracy for diffusion terms and third order accuracy for advection terms. The temporal discretization relies on a novel strategy for maintaining high-order temporal accuracy in problems with moving boundaries that minimizes implementation complexity and allows arbitrary explicit or diagonally-implicit Runge-Kutta schemes. The approach is broadly compatible with popular PDE-specialized Runge-Kutta time integrators, including low-storage, strong stability preserving, and diagonally implicit schemes. Through numerical experiments we demonstrate that the full discretization maintains high-order spatial and temporal accuracy in the presence of complex 3D geometries and for a range of boundary conditions, including Dirichlet, Neumann, and flux conditions with large jumps in coefficients.
Effect of pulsating flow on flow-induced vibrations of circular and square cylinders in the laminar regime
2024, Ocean Engineering
Through fluid-structure interaction simulations, this study assesses the dynamic response characteristics of elastically mounted circular and square cylinders subjected to pulsating inflow conditions, providing valuable insights into the analysis and optimization of these systems. The main focus of the present work is on analyzing the effects of two factors: (i) the ratio of the oscillatory velocity component to the steady velocity component in pulsating flow (flow ratio) and (ii) the ratio of the oscillation frequency of pulsating flow to the natural frequency of the structure (frequency ratio). The simulation results for different parameters of interest are analysed using Fourier analysis and Poincaré maps of time series data, and contour plots of vorticity. For the circular cylinder, it is found that cylinder loses synchronization in lock-in as the flow and frequency ratios are increased. Three distinct vibration patterns of vortex-induced vibration are observed for selected combinations of flow and frequency ratios at a Reynolds number of 110 for circular cylinder. For the galloping of square cylinder at a Reynolds number of 250, it is found that the instability and nonlinearity of vortex shedding become more pronounced as the flow ratio increases.
A nonhydrostatic atmospheric dynamical core on cubed sphere using multi-moment finite-volume method
2023, Journal of Computational Physics
A nonhydrostatic dynamical core has been developed by using the multi-moment finite volume method that ensures the rigorous numerical conservation of mass in this study. To represent the spherical geometry free of the polar problems, the cubed-sphere grid is adopted. A fourth-order multi-moment discretization formulation is applied to solve the governing equations cast in the local curvilinear coordinates on each patch of the cubed sphere through a gnomonic projection. In the vertical direction, the height-based terrain-following coordinate is used to deal with the topography and a finite difference scheme, also assuring the conservation of mass, is adopted for the spatial discretization. The proposed dynamical core adopts the nonhydrostatic governing equations. To get around the CFL stability restriction imposed by the sound wave and the relatively small grid spacing in the vertical direction, the dimensional splitting time integration algorithm using the HEVI (horizontally-explicit and vertically-implicit) strategy is implemented by applying the IMEX (implicit-explicit) Runge-Kutta method. The proposed model was checked by the widely-used benchmark tests in this study. The numerical results show that the multi-moment model has the comparable solution quality in comparison with the existing advanced ones and the great practical potential as a numerical platform for development of the atmospheric general circulation models.
High order all-speed semi-implicit weighted compact nonlinear scheme for the isentropic Navier–Stokes equations
2022, Journal of Computational and Applied Mathematics
This paper presents a high order all-speed semi-implicit weighted compact nonlinear scheme (WCNS) for the isentropic Navier–Stokes system. To avoid the severe CFL stability restriction, the pressure and viscous terms are treated implicitly in time, while the other terms are treated explicitly in time. The third-order IMEX Runge–Kutta methods and the fifth-order WCNS are used for time discretization and spatial discretization, respectively. The generated linear equations of velocity components are solved by the GMRES iterative algorithm. Numerical results in one, two and three dimensions in both compressible and incompressible regimes are presented to show the performance of the designed scheme.
An assessment of implicit-explicit time integrators for the pseudo-spectral approximation of Boussinesq thermal convection in an annulus
2022, Journal of Computational Physics
Citation Excerpt :
Schemes located in the top right quadrant of each panel are more accurate and yet more efficient than CNAB2. Six IMEX-RK schemes are systematically located in this quadrant, one scheme of order 2, SMR432 [29], four schemes of order 3, ARS343 [17], CB443 [32], CFN343 [58], KC443 [51] and one scheme of order 4, KC664 [51]. At the operational, dissipation-based limit of stability, it is remarkable to note that ARS343 and KC664 enable a significant improvement of accuracy at a lower cost, regardless of the configuration.
We analyze the behavior of an ensemble of time integrators applied to the semi-discrete problem resulting from the spectral discretization of the equations describing Boussinesq thermal convection in a cylindrical annulus. The equations are cast in their vorticity-streamfunction formulation that yields a differential algebraic equation (DAE). The ensemble comprises 28 members: 4 implicit-explicit multistep schemes, 22 implicit-explicit Runge-Kutta (IMEX-RK) schemes, and 2 fully explicit schemes used for reference. The schemes whose theoretical order varies from 2 to 5 are assessed for 11 different physical setups that cover laminar and turbulent regimes. Multistep and order 2 IMEX-RK methods exhibit their expected order of convergence under all circumstances. IMEX-RK methods of higher-order show occasional order reduction that impacts both algebraic and differential field variables. We ascribe the order reduction to the stiffness of the problem at hand and, to a larger extent, the presence of the DAE. Using the popular Crank-Nicolson Adams-Bashforth of order 2 (CNAB2) integrator as reference, performance is defined by the ratio of maximum admissible time step to the cost of performing one iteration; the maximum admissible time step is determined by inspection of the time series of viscous dissipation within the system, which guarantees a physically acceptable solution. Relative performance is bounded between 0.5 and 1.5 across all studied configurations. Considering accuracy jointly with performance, we find that 6 schemes consistently outperform CNAB2, meaning that in addition to allowing for a more efficient calculation, the accuracy that they achieve at their operational, dissipation-based limit of stability yields a lower error. In our most turbulent setup, where the behavior of the methods is almost entirely dictated by their explicit component, 13 IMEX-RK integrators outperform CNAB2 in terms of accuracy and efficiency.
High order semi-implicit weighted compact nonlinear scheme for the full compressible Euler system at all Mach numbers
2022, Computers and Mathematics with Applications
A high order semi-implicit weighted compact nonlinear scheme (WCNS) is presented to solve the full compressible Euler equations at all Mach numbers. To avoid the stringent acoustic CFL restriction for an explicit time discretization method, the pressure splitting methodology is introduced to split the Euler equations into nonstiff and stiff terms. The nonstiff and stiff terms are treated explicitly and implicitly, respectively, based on the high order IMEX Runge-Kutta time discretization method that is asymptotic preserving and asymptotically accurate in the zero Mach number limit. A fifth-order WCNS and fourth-order centered finite difference schemes with zero numerical viscosity are used for the spatial discretization. Numerical tests in one and two space dimensions are displayed to demonstrate the performance of the high order semi-implicit WCNS in compressible and incompressible regimes. Comparisons with the third-order explicit weighted essentially non-oscillatory (WENO) scheme are also made in order to better assess the proposed scheme.

View all citing articles on Scopus

View full text

Published by Elsevier Inc.

Low-storage implicit/explicit Runge–Kutta schemes for the simulation of stiff high-dimensional ODE systems

Abstract

Introduction

Section snippets

Two second-order, 2-register IMEXRK schemes

A (2,3)-stage, strongly A-stable scheme

A third-order, 3-register, 4-stage, L-stable scheme

A fourth-order, 3-register, 6-stage, L-stable scheme

Order reduction

Computational cost

Conclusions

Acknowledgements

Appl. Numer. Math.

Appl. Numer. Math.

Appl. Numer. Math.

Appl. Numer. Math.

J. Comput. Phys.

J. Comput. Phys.

J. Comput. Phys.

J. Comput. Phys.

Large eddy simulation of turbulent confined coannular jets and turbulent flow over a backward facing step

Numerical Methods for Ordinary Differential Equations

Fourth-order Runge–Kutta schemes for fluid mechanics applications

J. Sci. Comput.

A $(2, 3)$ -stage, strongly A-stable scheme