Finds documents with both search terms in any word order, permitting "n" words as a maximum distance between them. Best choose between 15 and 30 (e.g. NEAR(recruit, professionals, 20)).
Finds documents with the search term in word versions or composites. The asterisk * marks whether you wish them BEFORE, BEHIND, or BEFORE and BEHIND the search term (e.g. lightweight*, *lightweight, *lightweight*).
The article explores the application of Newton’s method with Armijo step sizes to solve the regularized p-Stokes equations, which are crucial for modeling ice-sheet dynamics. The authors introduce a convex functional to control step sizes, ensuring global convergence. They validate their method through numerical experiments, including the ISMIP-HOM B test case and a sliding block scenario. The results demonstrate the effectiveness of their approach, which outperforms traditional methods like the Picard iteration. The study also discusses the convergence of the regularized solution to the non-regularized problem under specific conditions.
AI Generated
This summary of the content was generated with the help of AI.
Abstract
The motion of glaciers can be simulated with the \(\varvec{p}\)-Stokes equations. Up to now, Newton’s method to solve these equations has been analyzed in finite-dimensional settings only. We analyze the problem in infinite dimensions to gain a new viewpoint. We do that by proving global convergence of the infinite-dimensional Newton’s method with Armijo step sizes to the solution of these equations. We only have to add an arbitrarily small diffusion term for this convergence result. We prove that the additional diffusion term only causes minor differences in the solution compared to the original \(\varvec{p}\)-Stokes equations under the assumption of some regularity. Finally, we test our algorithms on two experiments: A reformulation of the experiment ISMIP-HOM \(\varvec{B}\) without sliding and a block with sliding. For the former, the approximation of exact step sizes for the Picard iteration and exact step sizes and Armijo step sizes for Newton’s method are superior in the experiment compared to the Picard iteration. For the latter experiment, Newton’s method with Armijo step sizes needs many iterations until it converges fast to the solution. Thus, Newton’s method with approximately exact step sizes is better than Armijo step sizes in this experiment.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
1 Introduction
Ice-sheet models spend most of the computation time solving the momentum equations, [1]. These equations are nonlinear partial differential equations named p-Stokes equations.
We prove that Newton’s method with Armijo step sizes converges to the solution of the p-Stokes equations if we add a small diffusion term. We control the step size with a convex functional, which is the anti-derivative of the p-Stokes equations. Evaluating this functional has nearly no computational cost. Moreover, the convexity of the functional allows us to approximate the exact step sizes. We slightly modify the numerical experiment ISMIP-HOM B to test Newton’s method with Armijo step sizes. Moreover, we test this algorithm with a sliding block. We test both experiments with the Picard iteration and Newton’s method with approximations of exact step sizes.
Advertisement
The small diffusion term is necessary to imply Gâteaux differentiability of the p-Stokes equations in the variational formulation. Furthermore, the shear-thinning viscosity term has a negative exponent. Thus, we need for differentiability a positive constant in this term. To conclude the theory, we show that the regularized solution converges to the solution of the p-Stokes equations under a slight regularity assumption.
Regarding the well-posedness of the equations, there are different types of literature results: nonlinear friction boundary conditions [2], an implicitly given viscosity [3] with a differentiable shear-thinning term, or a differentiable shear-thinning term with Dirichlet boundary conditions, [4]. One recent publication with more general boundary conditions than we consider in two dimensions uses Newton’s method in a finite-dimensional setting, see [5]. However, we need a combination of a differentiable, explicitly given shear-thinning viscosity, and nonlinear friction boundary conditions. Glaciologists use such a formulation to simulate glaciers, [6]. Gâteaux differentiability results are mainly devoted to optimal control and the necessary control-to-state mapping, see for example, [7] for the p-Laplace equations or [8] for the Navier-Stokes equations with a nonlinear p-Stokes term. In [8], the Gâteaux differentiability is shown for \(p\ge 2\). The idea of how to prove differentiability for \(1<p<2\) is briefly mentioned in [9, section 6]. That paper uses the additional diffusion term for optimal control. It proves convergence in Sobolev spaces for vanishing regularization of the diffusion term. However, that paper has different requirements: It considers optimal control in a slightly different formulation, has Dirichlet boundary conditions, and has a convective term. It has more restrictions on the exponent of the shear-thinning fluids than we have. In [3], local quadratic convergence of Newton’s method is shown for the finite-dimensional case. Glaciologists already consider Newton’s method, [10], but we consider a different approach by adding a small diffusive term and using a convex functional for step size control. There is also a multigrid approach with Newton’s method and a reformulation of the partial differential equation with first derivatives available, [11].
The paper is structured as follows: In Section 2, we derive the variational formulation, project to divergence-free spaces, introduce a minimization problem that is equivalent to solving the full-Stokes equations, and verify existence and uniqueness of a solution. In Section 3, we consider Gâteaux differentiability. In Section 4, we verify that Newton steps can be calculated. In Section 5, we use the functional to verify global convergence. Additionally, we prove that the solution of the regularized p-Stokes equations converges to the solution of the p-Stokes equations for vanishing regularization under some regularity assumptions. In Section 6, we consider one experiment without sliding and one with. We summarize our results in the final Section 7.
2 The p-Stokes equations
In this section, we formulate the p-Stokes equations in both the classical and the variational formulations. The p-Stokes equations can be used to, e.g., model the motion of glaciers. Their complexity results from nonlinear viscosity, also described as shear-thinning, [6]. Mass conservation and incompressibility lead to a divergence-free velocity \(\varvec{v}\). Let \(N \in \{2,3\}\), \(\Omega \subseteq \mathbb {R}^N\) be a Lipschitz domain. Let \(\sigma \) be the stress tensor, \(\varvec{v}\) the velocity, \(\rho \) the density, and \(\varvec{g}\) the gravitational acceleration. The p-Stokes equations are:
The norm \(|\cdot |\) is the Frobenius norm. For vectors \(\varvec{v}\in \mathbb {R}^n\), we interpret \(S^p\) as a mapping from \(\mathbb {R}^n\rightarrow \mathbb {R}^n\) with
The case \(p=2\) reduces to the Stokes problem. In glaciological applications \(p=4/3\), see [6, section 1.4.3], or in more recent approaches \(p=5/4\), see [12, Abstract], is used. The following result states information about the integrability of \(S^p\):
Lemma 2.2
Let \(p\in (1,2)\). For all \(P \in L^p(\Omega )^{N \times N}\) follows \(S^p(P)\in L^{p'}(\Omega )^{N \times N}\) with the dual exponent \(p'\).
Proof
With the dual exponent \(p'=p/(p-1)\) and \(p\in (1,2)\) follows
Let \(\partial \Omega = \Gamma _d \cup \Gamma _a \cup \Gamma _b\). The Dirichlet boundary condition \(\varvec{v}=0\) is satisfied for those parts of the glacier frozen to the ground \(\Gamma _d\). The interaction of the glacier with the air is given by \(\sigma \cdot \varvec{n}=0\) on \(\Gamma _a\) with the matrix-vector multiplication of the matrix \(\sigma \) and the unit normal vector \(\varvec{n}\). We assume nonlinear sliding at parts of the bedrock \(\Gamma _b\) that are not frozen to the ground. This sliding is represented by tangential sliding. For these parts of the bedrock, we assume that the normal component of the velocity is 0 because we neglect the melting of ice or freezing of water at the bedrock. We remind of the definition of the tangential components
The restriction \(|\Gamma _d|>0\) is also fulfilled in common applications, see for example [12, Abstract] or [13, Abstract].
The Poincaré inequality implies that the \(W_p\)-norm is equivalent to the \(W^{1,p}(\Omega )^N\)-norm. For example, the Poincaré inequality was proved in [14, Theorem 1.5] for \(p=2\). But the proof is identical for \(p\in (1,2)\).
2.1 Variational formulation
We derive the variational formulation in the space \(W_p\). We define the double scalar product and obtain the following equation for \(\sigma ,\tau \in \mathbb {R}^{N \times N}\) immediately:
Let \(p'\) be the dual exponent, \(\varvec{\phi }\in W_{p'}\), \(\sigma \in W^{1,p}(\Omega )^{N \times N}\). We conclude by multiplication with the test function \(\varvec{\phi }\) and partial integration for the left-hand side of (1)
Let \(\varvec{v}\in W_{p}\), \(\pi \in L^{p'}(\Omega )\). We use \(\sigma =-\pi I+BS^p(D\varvec{v})\) and \(I:\nabla \varvec{\phi }=\textrm{div}\varvec{\phi }\) to obtain for the first summand on the right-hand side of (11)
The second summand on the right-hand side of (11) vanishes on \(\Gamma _d\) because the test function \(\varvec{\phi }\in W_{p'}\) vanishes on \(\Gamma _d\), see the definition of \(W_{p'}\) in Definition 2.3. Moreover, \(\sigma \cdot \varvec{n}=0\) is valid on \(\Gamma _a\), see (6). Thus, the integral over this domain disappears, too. It follows
We split the boundary integral on \(\Gamma _b\) into normal and tangential components and use the boundary conditions on \(\Gamma _b\), see (7), and the definition of \(\varvec{\phi }\in W_{p'}\), namely \(\varvec{\phi }\cdot \varvec{n}=0\) on \(\Gamma _b\), see Definition 2.3, to obtain
We set \(L^{p'}_0(\Omega ):=\{\pi \in L^{p'}(\Omega )\text {; } \int _{\Omega }\pi \, dx=0\}\). Then a weak solution of the problem is given by \((\varvec{v},\pi )\in (W_p,L^{p'}_0(\Omega ))\) with
Now, we examine surjectivity of the restricted operator \(A:V_p\rightarrow V_p^*\),
$$\begin{aligned} \langle A\varvec{v},\varvec{\phi }\rangle _{V_p^*,V_p}=\int _{\Omega }\rho \varvec{g}\cdot \varvec{\phi }\, dx \quad \text { for all }\phi \in V_p. \end{aligned}$$
(14)
By solving the divergence-free problem, we solve the original problem: The inf-sup condition is proved by [15, Corollary 3.2] for Dirichlet boundary conditions:
The inf-sup condition is also fulfilled for \(W_p\supseteq W^{1,p}_0(\Omega )^N\). Thus, we can apply [16, Theorem IV.1.4]. This yields for each solution in the divergence-free formulation a unique solution for the original problem. This theorem is stated in Hilbert spaces, but the proof is identical for operators on Banach spaces with an existing dual operator.
2.2 Regularization of the equation and the divergence-free space
In this subsection, we introduce a regularization and the divergence-free space. We add a small diffusion term that allows us to obtain a solution in \(V_2\). We need solutions in \(V_2\) and not only in \(V_p\) to obtain Gâteaux differentiability in Section 3. We define for \(\mu _0>0\) the operator \(A:V_2\rightarrow V_2^*\),
(Weak solution of the p-Stokes equations) Let \(p\in (1,2)\), \(|\Gamma _d|>0\), \(\delta >0\), and \(\mu _0>0\). A solution of
$$\begin{aligned} \langle A\varvec{v},\varvec{\phi }\rangle _{V_2^*,V_2}=\int _{\Omega }\rho \varvec{g}\cdot \varvec{\phi }\, dx\quad \text { for all }\varvec{\phi } \in V_2. \end{aligned}$$
(16)
is called weak solution of the p-Stokes equations.
For \(\mu _0=0\) and \(\delta =0\) existence and uniqueness of a solution to (14) is shown in [2]; for \(\mu =0\), \(\delta >0\), and \(\Gamma _d=\partial \Omega \), see [4]. We consider \(\mu _0>0\) and \(\delta >0\) because we want Gâteaux differentiability in the infinite-dimensional space. In finite dimensions, \(\mu _0>0\) is not necessary, see [4, section 3].
For vanishing \(\mu _0\) and \(\delta \), the weak solution of the p-Stokes equations converges to the solution of \(\mu _0=0\) and \(\delta =0\) under slight regularity assumptions, see Section 5.3.
2.3 Equivalent minimization problem
In this subsection, we introduce a minimization problem, which is equivalent to finding a solution to the regularized p-Stokes equations. In [4], the convex functional was introduced for \(\mu _0=0\) and Dirichlet boundary conditions. In [2], it was introduced for \(\mu _0=0\) and \(\delta =0\) for more general boundary conditions. We use those formulations for our convex functional with \(\mu _0,\delta \in [0,\infty )\).
Definition 2.6
(Convex functional) Let \(\mu _0,\delta \in [0,\infty )\), \(J_{\mu _0,\delta }:V_r\rightarrow \mathbb {R}\) with \(r=2\) for \(\mu _0>0\) and \(r=p\) for \(\mu _0=0\),
For \(\delta >0\), the Gâteaux differentiability of the first and fourth summand are discussed in [4] on \(W^{1,p}_0(\Omega )^N\). Because the boundary conditions do not influence this integral, the Gâteaux differentiability is also clear on \(V_2\). The second summand can be handled identically to the first summand. Also the term \(\varvec{v}\mapsto \mu _0(\nabla \varvec{v},\nabla \varvec{v})\) is Gâteaux differentiable.
For \((\mu _0,\delta )=0\), differentiability is proved in [2]. Adding the summand \(\mu _0(\nabla \varvec{v},\nabla \varvec{v})\) does not change the differentiability result.
Thus, \(J_{\mu _0,\delta }\) is Gâteaux differentiable. Because A and \((\varvec{v},\varvec{w})\mapsto (\rho \varvec{g},\varvec{w})\) are continuous, \(J_{\mu _0,\delta }\) is Fréchet differentiable. \(\square \)
2.4 Existence and uniqueness of weak solutions in the divergence-free space
Differentiability and uniqueness were proven for \(\delta =0=\mu _0\) in [2] by using the strictly convex functionals as in Definition 2.6. For Dirichlet zero boundary conditions existence and uniqueness of solutions were stated without detailed proof via the Browder-Minty Theorem, e.g., in [17]. For completeness, we prove existence and uniqueness of a solution with the Browder-Minty Theorem as our boundary conditions, see equations (5) to (8) are more complicated than the standard Dirichlet boundary conditions and with \(\delta >0\) and \(\mu _0>0\) different to the problem in [2]. Thus, we prove a well-known result for a slightly different formulation.
Before we can state the existence and uniqueness result, we have to introduce the following definitions:
Definition 2.8
(Strict monotonicity, coercivity) Let X be a reflexive Banach space, \(A:X\rightarrow X^*\). The operator A is called strictly monotone, if we have for all \(\varvec{v},\varvec{w}\in X\)
The conditions for existence and uniqueness are formulated in the following theorem.
Theorem 2.9
(Browder-Minty Theorem) Let X be a reflexive separable Banach space and \(A:X\rightarrow X^*\) a strictly monotone, coercive, and continuous operator. Let \(f\in X^*\). Then, there exists a unique solution \(u\in X\) for the equation
$$\begin{aligned} Au=f\quad \text { in }X^*. \end{aligned}$$
Proof
The existence of \(u\in X\) is proved in [18] Theorem 2 in a more general version. Uniqueness follows immediately with the strict monotonicity of A. \(\square \)
We analyze \(S^p\) and \(S^s\) to verify that A is strictly monotone, coercive, and continuous. For that purpose, we need the following result:
Lemma 2.10
Let \(r\in (1,2)\), \(S:\mathbb {R}^{N \times N}\rightarrow \mathbb {R}^{N \times N}\) with
$$\begin{aligned} \bigg |\frac{\partial S_{ij}(P)}{\partial P_{k\ell }}\bigg |&\le c_2(|P|+\delta )^{r-2}\quad \text { for all }i,j,k,\ell \in \{1,...,N\} \end{aligned}$$
(19)
for all \(P,Q\in \mathbb {R}^{N \times N}\), \(c_1,c_2\in (0,\infty )\), \(\delta \ge 0\). Then, there exist \(c,C\in \mathbb {R}\) independent of \(\delta \) for all \(P,Q\in \mathbb {R}^{N \times N}\) with
$$\begin{aligned} (S(P)-S(Q)):(P-Q)&\ge c (\delta +|P|+|Q|)^{r-2}|P-Q|^2,\\ |S(P)-S(Q)|&\le C (\delta +|P|+|Q|)^{r-2}|P-Q|. \end{aligned}$$
We set \(P:=I_{ij}p\) and \(Q:=I_{ij}q\) with \(p,q\in \mathbb {R}^N\) for the vector-valued situation. \(\square \)
We verify the three properties for the Browder-Minty Theorem, see Theorem 2.9 in the three following lemmata. We verify Lipschitz continuity of A instead of continuity:
Lemma 2.12
Let \(|\Gamma _d|>0\), \(\delta >0\), \(\mu _0>0\), and \(p\in (1,2)\). The operator \(A:V_2\rightarrow V_2^*\), see (15) is Lipschitz continuous.
Proof
Let \(\varvec{v},\varvec{w},\varvec{\phi }\in V_2\). We obtain for the second summand of the operator A with Lemma 2.11 and \(C\in \mathbb {R}\)
The same arguments are valid for the first summand of the operator A with \(\Omega \) instead of \(\Gamma _b\), B instead of \(\tau \), and \(s=p\). Thus, we obtain Lipschitz continuity with \(C_2\in \mathbb {R}\) and
Let \(|\Gamma _d|>0\), \(\delta >0\), \(\mu _0>0\), and \(p\in (1,2)\). There exists exactly one weak solution of the divergence-free regularized p-Stokes equations, see Definition 2.5.
Proof
(In [4] the case \(\Gamma _d=\partial \Omega \) is analyzed.) Lemma 2.12 yields Lipschitz continuity of the operator A and thus continuity. We verified the coercivity of A in Lemma 2.13. The operator A is also strictly monotone, see Lemma 2.14. We conclude the existence and uniqueness of a solution with the Browder-Minty Theorem, see Theorem 2.9. \(\square \)
3 Differentiability
We calculate the Gâteaux derivative of the operator A to apply Newton’s method. At first, we only consider the first summand of the operator A.
Definition 3.1
(Root problem) Let \(\delta >0\), \(\mu _0>0\), \(B\in L^{\infty }(\Omega )\), and \(\tau \in L^{\infty }(\Gamma _b)\). We define the operator \(G:V_2\rightarrow V_2^*\) for all \(\varvec{\phi } \in V_2\) by
Let \(\delta >0\), \(\mu _0>0\), \(B\in L^{\infty }(\Omega )\), \(\tau \in L^{\infty }(\Gamma _b)\), and \(\varvec{v},\varvec{w}\in V_2\). The directional derivative has the form
Furthermore, the operator \(G:V_2\rightarrow V_2^*\) is Gâteaux differentiable.
Proof
First, we consider the first summand on the right-hand side of (21).
The o-notation is the limit to zero in the following calculations. We prove pointwise convergence. For all \(i,j \in \{1,...,N\}\), we calculate the Taylor expansion of \(S^p\) in \(D\varvec{v}\) with the continuous derivative, see (20). This yields almost everywhere
for all \(i,j\in \{1,...,N\}\) almost everywhere. We obtained the pointwise convergence. To apply the dominated convergence theorem, we calculate a majorant. Using the Lipschitz continuity of \(S^p\), see Lemma 2.11, and \(C\in \mathbb {R}\), we conclude
The directional derivative for the second summand on the right-hand side of (21) and the boundedness follow in the same ways if we use as the integration area \(\Gamma _b\) instead of \(\Omega \) and \(V,\Phi \) with \(V_{ij}:=I_{ij}v_j\) and \(\Phi _{ij}:=I_{ij}\phi _j\) instead of \(D\varvec{v}\) and \(D\varvec{\phi }\). With this notation, we relate the vector-valued expressions to the matrix-valued expressions. The last summand in (22) follows trivially due to the linearity of \(\varvec{v}\mapsto \mu _0(\nabla \varvec{v},\nabla \varvec{\phi })\), \(\varvec{\phi }\in V_2\).
Thus, \(\varvec{w}\mapsto G'(\varvec{v};\varvec{w})\) is a bounded linear operator. Hence, G is Gâteaux differentiable. \(\square \)
The operator G has only a directional derivative on \(V_2\) not on \(V_p\):
Remark 3.3
Let \(\delta >0\). The second summand of \(G'(\varvec{v};\varvec{w})\) in Theorem 3.2 is not well-defined for all \(\varvec{w},\varvec{\phi }\in V_p\): Set \(B\equiv 1\), \(\varvec{v}\equiv 0\). Then we have for all \(\varvec{w},\varvec{\phi }\in V_2\)
Thus, the integral is not defined for suitable \(\varvec{w},\varvec{\phi }\in V_p\).
4 Infinite-dimensional Newton’s method
In this section, we state Newton’s method in infinite dimensions and prove that we can calculate the Newton iterations. To our knowledge, this result is new as the Gâteaux derivative exists only in all directions for \(\mu _0>0\) and the combination of Newton’s method and \(\mu _0>0\) was not considered before. Newton’s method is:
Choose \(\varvec{v_0}\in V_2\) sufficiently close to the solution \(\varvec{\overline{v}}\in V_2\) of \(G(\varvec{\overline{v}})=0\). For \(k=0,1,2,...:\) Obtain \(\varvec{w_k}\) by solving
and set \(\varvec{v_{k+1}}:=\varvec{v_k}+\varvec{w_k}\).
The problem to calculate a Newton step, see (24), is linear because we handle the linear problem \((\varvec{w},\varvec{\phi })\mapsto \langle G'(\varvec{v})\varvec{w},\varvec{\phi }\rangle _{V_2^*,V_2}\) for all \(\varvec{w},\varvec{\phi } \in V_2\). Hence, we can apply Lax-Milgram’s Lemma.
Lemma 4.1
Let \(\mu _0>0\), \(\delta >0\), and \(\varvec{v}\in V_2\). There exists exactly one solution \(\varvec{w}=\varvec{w}(\varvec{v})\in V_2\) such that (24) is fulfilled. Moreover, we have
Let \(\varvec{v}\in V_2\). We show the continuity of \((\varvec{w},\varvec{\phi })\mapsto \langle G'(\varvec{v})\varvec{w},\varvec{\phi }\rangle _{V_2^*,V_2}\) for all \(\varvec{w},\varvec{\phi }\in V_2\). Since the arguments in the calculation of relation (23) are also valid for the boundary term, we find \(\tilde{C}\in \mathbb {R}\) with the trace operator, see [20, Theorem 1.12], such that
Hence, the conditions for applying Lax-Milgram’s Lemma are validated, and a unique solution for each Newton step, see (24), exists. The coercivity constant is \(\mu _0\). This implies the uniform bound
In this section, we prove global convergence of Newton’s method with a step size control. To our knowledge, an analysis of Newton’s method with a step size control was only performed in finite dimensions; for details of this analysis, see, e.g., [4]. Note that our minimization functional \(J_{\mu _0,\delta }\) is not two times continuously differentiable as the second derivative is only a Gâteaux derivative. From the applied perspective, the usage of approximations of exact step sizes, see Definition 5.3, could be interesting as they perform well in our numerical examples, see chapter 6.
To prove the convergence, we use the convex functional \(J_{\mu _0,\delta }\) introduced in Section 2.3 for a step size control. We verify the equivalence of minimizing the convex functional and finding a weak solution of the p-Stokes equations:
Lemma 5.1
Let \(\mu _0,\delta \in [0,\infty )\). Then, we have
with \(r=2\) for \(\mu _0>0\) and \(r=p\) for \(\mu _0=0\).
Proof
The strict convexity for \((\mu _0,\delta )=(0,0)\) was proved in [2] on \(V_p\). Thus, the strict convexity follows for \(\mu _0>0\) on the smaller space \(V_2\). Thus, the case \(\delta >0\) remains. We verified in Lemma 4.1 positive definitness of \(J''_{\mu _0,\delta }=G'\) for \((\mu _0,\delta )\in (0,\infty )^2\). This implies strict convexity. The case \(\mu _0=0\) and \(\delta >0\) is considered in [4].
Because the necessary first-order optimality condition is sufficient for strict convex functions, the claim follows. \(\square \)
5.1 Step size controls
In this subsection, we remind of step size controls. Step size controls guarantee convergence under certain conditions by setting \(\varvec{v_{k+1}}:=\varvec{v_k}+\alpha _k \varvec{w_k}\) with \(\alpha _k\in (0,\infty )\) instead of \(\varvec{v_{k+1}}:=\varvec{v_k}+\varvec{w_k}\), see the begin of chapter 4. The choice of \(\alpha _k\) should be such that the new functional value \(J_{\mu _0,\delta }(\varvec{v_k}+\alpha _k\varvec{w_k})\) reduces compared to the old functional value \(J_{\mu _0,\delta }(\varvec{v_k})\) and a small reduction already implies convergence, see Lemma 5.4 condition c) for the mathematical details. A commonly used step size control that guarantees these properties under some conditions on \(J_{\mu _0,\delta }\) is the Armijo step size, see for example [20, section 2.2.1.1]:
Definition 5.2
(Armijo step sizes) Let \(J_{\mu _0,\delta }:V_2\rightarrow \mathbb {R}\) be continuously differentiable, \(\gamma \in (0,1)\), \(\varvec{v_k}\in V_2\) the point, and \(\varvec{w_k}\in V_2\) the direction. Determine the biggest \(\alpha _k\in \{1,1/2,1/2^2,...\}\) such that
For convex functions, we can try to find \(\alpha _k\) that minimizes \(J_{\mu _0,\delta }(\varvec{v_k}+\alpha _k\varvec{w_k})\):
Definition 5.3
(Exact step sizes) Let \(J_{\mu _0,\delta }:V_2\rightarrow \mathbb {R}\) be continuously differentiable, \(\varvec{v_k}\in V_2\) the point, and \(\varvec{w_k}\in V_2\) the direction. Determine \(\alpha _k\) with
In our case, we can only approximately solve this problem, which we discuss in the numerical experiment.
5.2 Global convergence
In this subsection, we verify global convergence. We obtain global convergence by employing a step size control. The step sizes are calculated by reducing the functional \(J_{\mu _0,\delta }\), see Definition 2.6, in each iteration. We verify the conditions for the following convergence result:
Lemma 5.4
Let \(J_{\mu _0,\delta }:V_2\rightarrow \mathbb {R}\) be continuously Fréchet differentiable, bounded from below, \(\varvec{v_0}\in V_2\), \(\varvec{v_{k+1}}:=\varvec{v_k}+\alpha _k\varvec{w_k}\) with the direction \(\varvec{w_k}\in V_2\), and the step sizes \(\alpha _k \in (0,\infty )\). We additionally assume that we have
Let \(\mu _0>0\), \(\delta >0\). Newton steps fulfill the angle condition for the functional \(J_{\mu _0,\delta }\), see Definition 2.6, and are descent directions.
Proof
We calculated in Lemma 4.1 the coercivity constant \(\mu _0\) and the continuity constant C independent of k such that
In [21, Theorem 1.2], the proof of the angle condition is done with \(\eta \ge \mu _0/C\) for the finite-dimensional case. The proof for the infinite-dimensional case is identical. The angle condition implies that the Newton steps are descent directions.\(\square \)
To verify large enough step sizes, we need the following lemma:
Lemma 5.6
Let \(J_{\mu _0,\delta }'\) be uniformly continuous. Let the step sizes \((\alpha _k)_k\) be Armijo step sizes and let the direction \(\varvec{w_k}\) fulfill
with some \(\varphi :[0,\infty )\rightarrow [0,\infty )\) monotonically increasing and satisfying \(\varphi (t)>0\) for all \(t>0\). Then the step sizes \((\alpha _k)_k\) are admissible.
Let \(\mu _0>0\), \(\delta >0\), and \(|\Gamma _d|>0\). For Newton’s method, the Armijo step sizes, see Definition 5.2, are admissible.
Proof
We want to apply Lemma 5.6. We verify the conditions. The function \(J'_{\mu _0,\delta }\) is Lipschitz continuous, because the operator A is Lipschitz continuous, see Lemma 2.12, and \((\varvec{v},\varvec{w})\mapsto \mu _0(\varvec{v},\varvec{w})\) is trivially Lipschitz continuous. Therefore, \(J_{\mu _0,\delta }'\) is uniformly continuous. We already proved that the directions \(\varvec{w_k}\) are descent directions. It remains to show that the descent directions are not too short. We know with the Newton steps and the right inequality in relation (27)
Therefore, the step size is bounded from below by the monotonically increasing function \(\varphi :[0,\infty )\rightarrow [0,\infty )\), \(\varphi (t)=t/C\) with \(\varphi (t)>0\) for \(t>0\). Hence, the step sizes are large enough, and we can apply Lemma 5.6. \(\square \)
Corollary 5.8
Let \(\mu _0>0\), \(\delta >0\), and \(|\Gamma _d|>0\). Newton’s method with Armijo step sizes is globally convergent.
Proof
We already explained why the step sizes are admissible, see Lemma 5.7. Moreover, the directions \(\varvec{w_k}\) are descent directions. Furthermore, we validated the angle condition. Trivially, the functional \(J_{\mu _0,\delta }\) is bounded from below. Thus, all conditions for Lemma 5.4 are fulfilled. Hence, we conclude \(G(\varvec{v_k})=J_{\mu _0,\delta }'(\varvec{v_k})\rightarrow 0\in V_2^*\).
Let \(\varvec{v^*}\in V_2\) be the solution of \(G(\varvec{v^*})=0\). For \(\varvec{v_k}=\varvec{v^*}\) the claim is clear. For \(\varvec{v_k}\ne \varvec{v^*}\), the strict monotonicity of A, see Lemma 2.14, implies
Due to \(G(\varvec{v_k})=J'_{\mu _0,\delta }(\varvec{v_k})\rightarrow 0\) for \(k\rightarrow \infty \) and \(G(\varvec{v^*})=0\) follows \(\varvec{v_k}\rightarrow \varvec{v^*}\in V_2\). \(\square \)
5.3 Convergence to the non regularized problem
In this subsection, we argue that the solution \(\varvec{v}=\varvec{v_{\mu _0,\delta }}\in V_2\) converges to \(\varvec{v_{0,0}}\in V_2\). We only have to assume \(\varvec{v_{0,0}}\in V_2\). From [2], we only know \(\varvec{v_{0,0}}\in V_p\).
Theorem 5.9
(Convergence for smooth solutions) Let \(|\Gamma _d|>0\) and \(\varvec{v_{\mu _0,\delta }}\) be the solution of \(G_{\mu _0,\delta }(\varvec{v})=0\) with the variables \(\mu _0,\delta \in (0,\infty )\) or \((\mu _0,\delta )=(0,0)\) as in the definition of the operator G. Assume \(\varvec{v_{0,0}}\in V_2\). Then there exists \(\tilde{c}\in \mathbb {R}\) with
We follow the idea in [4, Theorem 4.1]. We set \(\varvec{\tilde{v}}:=\varvec{v_{0,0}}-\varvec{v_{\mu _0,\delta }}\) to shorten the notation. We use in the upcoming calculation the monotonicity of \(S^p\) and \(S^s\), \(J_{\mu _0,\delta }'(\varvec{v_{\mu _0,\delta }})=0\), and the main theorem of calculus:
In this section, we describe two numerical experiments: ISMIP-HOM B formulated in [22] and a sliding block. We test these experiments with Newton’s method with Armijo step sizes and compare them with approximately exact step sizes that we describe in the next subsection. We also use the approximately exact step sizes to modify the Picard iteration. We implemented the experiments in FEniCS, [23], version 2019.1.0. FEniCS is a tool that solves partial differential equations with finite elements. We use the Taylor-Hood element \(P_2-P_1\) for the velocity \(\varvec{v}\) and the pressure \(\pi \). In all experiments, we calculate the initial guess by replacing \(\big (|D\varvec{v}|^2+\delta ^2\big )^{(p-2)/2}\) with \(10^6\) and solving the resulting Stokes problem.
Let \(\varvec{v}\in V_2\). Then \(G(\varvec{v})\in V_2^*\) with the norm
To evaluate this norm, we use the Riesz isomorphism:
Definition 6.1
(Norm of the Riesz isomorphism) Let \(|\Gamma _d|>0\), \(\mu _0>0\), \(\delta >0\), and \(\varvec{v}\in V_2\). Let \(\varvec{\tilde{v}}\in V_2\) be the solution of
with \(\varvec{\tilde{v}}\cong G(\varvec{v})\in V_2^*\).
If the norm of the Riesz isomorphism is small for \(\varvec{v}\in V_2\), it follows \(G(\varvec{v})\approx 0\).
6.1 Numerical solvers and computational effort
The Picard iteration was suggested for modeling glaciers in [24] and is used in ice models like ISSM, see [25]. The Picard iteration is: For given \(\varvec{v_k}\in V_2\) find \(\varvec{v_{k+1}}\in V_2\) with
for all \(\varvec{\phi } \in V_2\). This algorithm is said to be often globally convergent in practice but slow, [10]. It is, for example, used for the Navier-Stokes equations with proved convergence theory, [16]. We will compare the following algorithms:
The Picard iteration.
The Picard iteration with approximations of exact step sizes, see Definition 5.3, with \(\varvec{w_k}:=\varvec{v_{k+1}}-\varvec{v_k}\).
Newton’s method with approximations of exact step sizes, see Definition 5.3.
Newton’s method with Armijo step sizes, see Definition 5.2.
We can calculate the exact step size arbitrarily precise because we have a convex functional. Let \(0\le a<b<\infty \). For the actual velocity \(\varvec{v_k}\) and the direction \(\varvec{w_k}\), we can calculate
We use a simple bisection and set \(b:=(a+b)/2\) for \(J_{help}'((a+b)/2)\ge 0\) and \(a:=(a+b)/2\) for \(J_{help}'((a+b)/2)<0\) until we obtain the wanted accuracy. Finally, we set \(t:=(a+b)/2\). The chain rule yields for \(t\in (a,b)\)
Furthermore, we can do these calculations with mixed elements:
Remark 6.2
The step-size control is identical with mixed elements. Only the calculation of the direction changes.
Proof
We calculate the initial guess \((\varvec{v_0},\pi _0)\in (W_2,L^2_0(\Omega ))\) by replacing \(\big (|D\varvec{v}|^2+\delta ^2\big )^{(p-2)/2}\) with \(10^6\) and solving the resulting Stokes problem. This initial guess is divergence-free. Then, we calculate a divergence-free direction \(\varvec{w_0}\) for the velocity and a direction \(\psi _0\) for the pressure. Hence, we have for the convex functional
Additionally, our iteratives \(\varvec{v_k}\) are divergence-free as a linear combination of the divergence-free \(\varvec{v_{k-1}}\) and \(\varvec{w_{k-1}}\).
Thus, the step size control is identical with mixed elements. \(\square \)
Now, we consider the computational effort. The computation consists of three parts: Assembling the matrices, solving the linear system of equations, and calculating the step size.
In [26], multigrid methods are used to solve the p-Stokes equations, even on real-world examples like the Antarctic ice sheet. In [27], the computational effort of multigrid method is discussed. They measured that evaluating the residual takes less computation time than, e.g., assembling the Jacobi matrix or necessary matrix-vector multiplications. For their results, they used a simplified version of the p-Stokes equations. As calculating the residual is not the computationally expensive part also calculating the step size control is computationally affordable compared to the other parts.
6.2 Computational domain
A unit square is too simple to represent real-world glaciers. Instead, in the ISMIP-HOM experiments, see [22], a sinusoidal bedrock, see Fig. 1, is suggested, and Dirichlet boundary conditions are imposed at the bedrock. We set \(\alpha :=0.5\)\(^{\circ }\), \(L:=5000\) m, and define the upper and lower boundary of the domain by \(z_s:[0,L]\rightarrow \mathbb {R}\), \(z_b:[0,L]\rightarrow \mathbb {R}\),
with \(\omega =2\pi /L\). The experiment needs periodic boundary conditions on the left and the right side of the domain and \(\sigma \cdot \varvec{n}=0\) at the surface.
6.3 Parameters
In this subsection, we discuss setting constants and forcing boundary conditions. We set the constants for the experiment corresponding to the ISMIP-HOM experiments, see [22]: The ice parameter \(B:=0.5\cdot (10^{-16})^{-1/3}\) Pa\(^{-3}\)a\(^{-1}\), the density \(\rho :=910\) kg m\(^{-3}\), the gravitational acceleration \(\textbf{g}:=(0,-9.81)\) m s\(^{-2}\), and the seconds per year \(31\, 556\, 926\). The nonlinear viscosity is \((0.5|D\varvec{v}|^2+\delta ^2)^{(p-2)/2}\) with \(p:=4/3\). The factor 0.5 within the nonlinear term is given by [22]. However, we could modify B and \(\delta \) to formulate the equations as in our convergence analysis before.
Periodic boundary conditions have some difficulties in implementation for unstructured grids. Moreover, they are not necessary for real-world applications. Thus, we extend our domain by three copies to the left and the right. We apply Dirichlet boundary conditions at these boundaries. This approach is suggested in the Supplement of [22]. The resulting domain is shown in Fig. 1.
We rotate the gravity to obtain a horizontal upper surface. Formally, we should rotate the left and right boundaries, too. However, this only changes the position in x-direction by a maximum of \(1500 m \cdot \sin (0.5\)\(^{\circ })\)\(\approx 13 m \). This is neglectable compared to the horizontal extent of \(35\, 000 m \). Thus, we simplify the domain with vertical left and right boundaries and a horizontal surface boundary. Moreover, this approach is more flexible because we can simply change the angle \(\alpha \) without changing the domain to generate different experiments. Furthermore, changes between using periodic boundary conditions and the variant with copies of the glacier are easier.
The value \(\delta ^2\) should be small compared to typical values of \(0.5|D\varvec{v}|^2\). The range of \(|D\varvec{v}|\) is typically between \(10^{-7}\) /s and \(10^{-11}\) /s. The calculations are done in years. Hence, we obtain as a necessary condition
with the machine precision \(\textrm{eps}:= 10^{-16}\). By rounding down to a factor of 10, we obtain \(\delta :=10^{-12}/a \). We should choose \(\mu _0\) such that
Consequently, we choose \(\mu _0:=10^{-17}\)\(kg a/ (m s ^2)\). The unit for \(\mu _0\) follows from \([\mu _0 |\nabla \varvec{v}|^2]=[-\rho \varvec{g}\cdot \varvec{v}]\).
6.4 Results and interpretation
The ISMIP-HOM B experiment measures the norms of the surface velocity \(v_r\):
On the Fig. 2 are the surface velocities for all used algorithms restricted to the original glacier \(x\in [0,5000]\). The surface velocities are approximately the same for all our methods. Moreover, the velocities are similar to [22, Figure 6].
Fig. 2
The velocity norms at the surface are shown in the left figure. All methods produce nearly the same surface velocity as the result
All step size controls use the convex functional, see Definition 2.6, as the minimization term. In Fig. 3, we see the relative norm of the Riesz isomorphism, which compares the actual norm of the Riesz isomorphism of the iteration with the first calculated norm of the Riesz isomorphism, see Definition 6.1. Newton’s method with Armijo step sizes reduces the norm of the Riesz isomorphism compared to the initial norm of the Riesz isomorphism by approximately \(5\cdot 10^3\). Newton’s method with Armijo step sizes starts with a quadratic convergence. After a few iterations, this tends to a linear convergence. Then, the norm of the Riesz isomorphism is constant. The norm of the Riesz isomorphism of the Picard iteration decreases linearly but at a slower rate than Newton’s method with Armijo step sizes. Newton’s method and the Picard iteration with approximately exact step sizes converge similarly fast as Newton’s method with Armijo step sizes.
Fig. 3
The relative norms of the Riesz isomorphism are visualized with 701 grid points in x-direction and 20 in y-direction
In [4], it was discussed that a higher value of \(\delta \) leads to higher accuracy and a lower number of necessary iterations. We reproduce this result in Fig. 4. Additionally, we see that the velocity field at the surface is nearly identical for \(\delta :=10^{-4}\). With \(\delta :=10^{-4}\), Newton’s method is much better than the Picard iteration for both step size controls, see Fig. 5.
Fig. 4
Left: the relative norms of the Riesz isomorphism are visualized for different values of \(\delta \). Right: the velocity norms at the surface are shown in the right figure
To verify our claim regarding the computation time, see Section 6.1, we calculated the mean and the standard deviation for 100 iterations for each method, see Table 1. In our experiment, the computation time with a step size control is within the standard deviation of the computation time for the Picard iteration. Moreover, the step size control needs only 1 % of the whole computation time for each iteration. Another important result is that the Picard iteration and Newton’s method need comparable computation times.
Table 1
Computation time in seconds
Computation time each iteration
Computation time step size
Mean
Standard deviation
Mean
Standard deviation
Picard
98.39
2.39
−
−
Picard with exact step sizes
99.47
2.16
1.00
0.01
Newton with Armijo step sizes
98.89
2.40
0.41
0.12
Newton with Armijo step sizes
98.55
2.29
0.08
0.01
and minimum step size=0.5
Newton with exact step sizes
99.71
2.23
1.01
0.01
We fixed the number of integral evaluations for the Armijo step size to a maximum of 20 integral evaluations for each iteration and for the exact step sizes to 25. Newton’s method with a minimum step size of 0.5 needs one or two iterations. Thus, Table 1 implies that both step size controls need a similar computation time for each iteration. We did not try to reduce the number of iterations for the step size because they are not important compared to solving the linear system of equations.
6.5 Numerical experiment with friction
In this subsection, we describe an experiment with friction. The experiment has a similar structure to the experiment ISMIP-HOM B. We set \(\Omega :=(0,5000)\times (0,1000)\), \(\Gamma _d:=\{(0,y);\, y\in [0,1000]\}\cup \{(5000,y);\, y\in [0,1000]\}\), \(\Gamma _a:=\{(x,1000);\, x\in (0,5000)\}\), and \(\Gamma _b:=\{(x,0);\, x\in (0,5000)\}\). We set B, \(\rho \), g, and p as in the experiment ISMIP-HOM B. We set \(s:=p\). Again, we rotate the gravity instead of the domain.
We test our experiment with \(\tau \in \{10^3,10^7\}\) Pa a\(^{p-1}\)m\(^{1-p}\). As prescribed by the problem, we have zero velocities at the left and right boundary. In Fig. 6, we see the norms of the Riesz isomorphism for the high friction coefficient with two different initial guesses. On the left, we have the same initial guess as in the ISMIP-HOM experiments. On the right, we added the term \(\int _{\Gamma _b} \tau \varvec{v_0}\cdot \varvec{\phi }\, ds\) to the variational formulation of the initial Stokes problem.
Fig. 6
The friction coefficient is \(\tau :=10^7\). Left: the initial guess is the same as in experiment ISMIP-HOM B. Right: the initial guess has the additional term \(\int _{\Gamma _b}\tau \varvec{v_0}\cdot \varvec{\phi }\, ds\)
Interestingly, Newton’s method with Armijo step sizes performs worse than all other algorithms for a high friction coefficient. It converges linearly and needs many iterations until it reaches quadratic convergence. The other initial guess does not resolve this issue. Newton’s method with approximately exact step sizes reaches quadratic convergence much earlier. Also, the Picard iteration with approximately exact step sizes is good.
Fig. 7
The friction coefficient is \(\tau :=10^3\). Left: the initial guess is the same as in experiment ISMIP-HOM B. Right: the initial guess has the additional term \(\int _{\Gamma _b}\tau \varvec{v_0}\cdot \varvec{\phi }\, ds\)
For a low friction coefficient, all algorithms behave similarly regarding convergence to experiment ISMIP-HOM B, see Fig. 7, independent of the initial guess.
7 Conclusion
In conclusion, it is possible to obtain a global convergent Newton method with step size control with a convex functional. Moreover, the approximation of exact step sizes is also good. For a nonlinear sliding boundary, the convergence rate of Newton’s method depends on the friction coefficient. Overall, the approximately exact step sizes seem slightly better because the convergence rate was in all experiments good. However, the convergence rate depends on the size of \(\delta \). More experiments should be done in three dimensions, with physically motivated sliding coefficients, and with different values of \(\delta \). Furthermore, an implementation in ice sheet models could be interesting.
Acknowledgements
I thank my supervisor Prof. Thomas Slawig from Kiel University, for many helpful discussions about mathematical and glaciological problems. Moreover, I thank Prof. Angelika Humbert and Dr. Thomas Kleiner from Alfred-Wegener-Institut in Bremerhaven, Dr. Martin Rückamp from the Bavarian Academy of Sciences and Humanities, and Prof. Andreas Rademacher from the University of Bremen for helpful discussions. Finally, I thank the reviewers for their helpful feedback.
Declarations
Competing Interests
The authors declare no competing interests.
Consent to Publication
The author consents to publish the article.
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Fischler, Y., Rückamp, M., Bischof, C., Aizinger, V., Morlighem, M., Humbert, A.: A scalability study of the ice-sheet and sea-level system model (issm, version 4.18). Geosci. Model Dev. 15(9), 3753–3771 (2022). https://doi.org/10.5194/gmd-15-3753-2022CrossRef
2.
Chen, Q., Gunzburger, M., Perego, M.: Well-Posedness Results for a Nonlinear Stokes Problem Arising in Glaciology. SIAM J. Math. Anal. 45(5), 2710–2733 (2013). https://doi.org/10.1137/110848694MathSciNetCrossRef
3.
Jouvet, G., Rappaz, J.: Analysis and finite element approximation of a nonlinear stationary stokes problem arising in glaciology. Adv. Numer. Anal. 2011, 1–24 (2011). https://doi.org/10.1155/2011/164581MathSciNetCrossRef
4.
Hirn, A.: Finite element approximation of singular power-law systems. Math. Comput. 82(283), 1247–1268 (2013)MathSciNetCrossRef
5.
Diego, G.G., Farrell, P.E., Hewitt, I.J.: On the finite element approximation of a semicoercive stokes variational inequality arising in glaciology. SIAM J. Numer. Anal. 61(1), 1–25 (2023). https://doi.org/10.1137/21m1437640MathSciNetCrossRef
Casas, E., Fernandez, L.A.: Distributed control of systems governed by a general class of quasilinear elliptic equations. J. Differ. Equ. 104, 20–47 (1993)MathSciNetCrossRef
Fraters, M.R.T., Bangerth, W., Thieulot, C., Glerum, A.C., Spakman, W.: Efficient and practical Newton solvers for non-linear Stokes systems in geodynamic problems. Geophys. J. Int. 218(2), 873–894 (2019). https://doi.org/10.1093/gji/ggz183CrossRef
11.
Allen, J., Leibs, C., Manteuffel, T., Rajaram, H.: A fluidity-based first-order system least-squares method for ice sheets. SIAM J. Sci. Comput. 39(2), 352–374 (2017). https://doi.org/10.1137/140974973MathSciNetCrossRef
MacGregor, J.A., Fahnestock, M.A., Catania, G.A., Aschwanden, A., Clow, G.D., Colgan, W.T., Gogineni, S.P., Morlighem, M., Nowicki, S.M.J., Paden, J.D., Price, S.F., Seroussi, H.: A synthesis of the basal thermal state of the greenland ice sheet. J. Geophys. Res. Earth Surf. 121(7), 1328–1350 (2016). https://doi.org/10.1002/2015jf003803CrossRef
Amrouche, C.S., Girault, V.: Decomposition of vector spaces and application to the stokes problem in arbitrary dimension. Czechoslovak Math. J. 44, 109–140 (1994)MathSciNetCrossRef
16.
Girault, V., Raviart, P.-A.: Finite Element Methods for Navier-Stokes Equations. Springer, New York (1986)CrossRef
Diening, L., Ebmeyer, C., Růžička, M.: Optimal convergence for the implicit space-time discretization of parabolic systems with p-structure. SIAM J. Numer. Anal. 45(2), 457–472 (2007)MathSciNetCrossRef
Pattyn, F., Perichon, L., Aschwanden, A., Breuer, B., Smedt, B., Gagliardini, O., Gudmundsson, G.H., Hindmarsh, R.C.A., Hubbard, A., Johnson, J.V., Kleiner, T., Konovalov, Y., Martin, C., Payne, A.J., Pollard, D., Price, S., Rückamp, M., Saito, F., Souček, O., Sugiyama, S., Zwinger, T.: Benchmark experiments for higher-order and full-Stokes ice sheet models (ISMIP–HOM). Cryosphere 2(2), 95–108 (2008). https://doi.org/10.5194/tc-2-95-2008CrossRef
23.
Logg, A., Mardal, K.-A., Wells, G. (eds.): Automated Solution of Differential Equations by the Finite Element Method. Springer, Berlin (2012). https://doi.org/10.1007/978-3-642-23099-8
Larour, E., Seroussi, H., Morlighem, M., Rignot, E.: Continental scale, high order, high spatial resolution, ice sheet modeling using the ice sheet system model (ISSM). J. Geophys. Res. Earth Surf. 117(F1) (2012). https://doi.org/10.1029/2011jf002140
26.
Isaac, T., Stadler, G., Ghattas, O.: Solution of nonlinear stokes equations discretized by high-order finite elements on nonconforming and anisotropic meshes, with application to ice sheet dynamics. SIAM J. Sci. Comput. 37(6), 804–833 (2015). https://doi.org/10.1137/140974407MathSciNetCrossRef