nach oben

Theory of Computing Systems

Erschienen in:

Open Access 26.10.2021

Stable Multi-Level Monotonic Eroders

verfasst von: Péter Gács, Ilkka Törmä

Erschienen in: Theory of Computing Systems | Ausgabe 1/2022

Aktivieren Sie unsere intelligente Suche, um passende Fachinhalte oder Patente zu finden.

search-config

KI-gestützte Suche

Patentsuche

Aus

Abstract

Eroders are monotonic cellular automata with a linearly ordered state set that eventually wipe out any finite island of nonzero states. One-dimensional eroders were studied by Gal’perin in the 1970s, who presented a simple combinatorial characterization of the class. The multi-dimensional case has been studied by Toom and others, but no such characterization has been found. We prove a similar characterization for those one-dimensional monotonic cellular automata that are eroders even in the presence of random noise.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

1 Introduction

Cellular automata, CA for short, are a class of dynamical systems that are discrete in both space and time. A CA consists of a finite or infinite grid of identical finite state machines, usually called cells, that interact with only finitely many neighbors. At each time step, each machine synchronously enters a new state based on the current states of itself and its neighbors. The grid is usually assumed to be homogeneous, that is, each machine has a neighborhood of identical shape and uses the same update function.

In this article, we consider monotonic cellular automata with random errors. In a monotonic CA, the state set of the finite state machines is totally ordered, and the dynamics of the system respects this order. They are closely related to bootstrap percolation, introduced in [1], in which the state set is {0,1}, a cell cannot change its state from 1 to 0, and the system is initiated from a random configuration. In general, we denote by 0 the bottom element of the state set. The dynamical properties of arbitrary monotonic CA in dimension 1 were studied by Gal’perin [3, 4], who gave a computable characterization of those automata that erase arbitrary finite islands in a sea of 0-states. We introduce randomness into the model by allowing each machine to make an error with some fixed small probability, turning it into a probabilistic cellular automaton. See [6] and references therein for a survey on this topic. In fact, since we allow the errors of different states to be dependent, our model is somewhat more general than probabilistic cellular automata. We prove a version of Gal’perin’s result in this extended setting. More explicitly, we characterize those one-dimensional monotonic cellular automata for which the asymptotic density of 0-states in the trajectory started from the all-0 configuration tends to zero with the error rate. We call a CA that satisfies this condition a stable eroder. As a corollary, we also show that it is decidable whether a given monotonic automaton is a stable eroder.

The proof of our result is split into two sections, one for each direction of the equivalence. One direction of the proof, that our condition only holds for stable eroders, uses combinatorial objects that record the causal relationships between nonzero states in a trajectory of the CA, inspired by the work of Toom [11]. We show that a nonzero state is always accompanied by such an object, as well as a set of errors whose size is comparable to the size of the object. A counting argument then bounds the probability of the nonzero state. For the converse direction, we prove that if a CA does not satisfy our condition, one can find a finite island of nonzero states that persists forever with an arbitrarily high probability. We show how randomly occurring errors will “repair” the borders and internal structure of the island faster than the CA can erode it away.

2 Definitions

Let $\mathbb {Z}^{d}$ denote the d-dimensional integer lattice. We fix a finite state set S, endow it with the discrete topology, and endow $S^{\mathbb {Z}^{d}}$ with the product topology. The shift by $\textbf {n} \in \mathbb {Z}^{d}$ is the function $\sigma ^{\textbf {n}} : S^{\mathbb {Z}} \to S^{\mathbb {Z}}$ defined by σⁿ(x)_v = x_v+n for all $x \in S^{\mathbb {Z}^{d}}$ and $\textbf {v} \in \mathbb {Z}^{d}$. If d = 1, we denote σ = σ¹. A cellular automaton is a function from $S^{\mathbb {Z}^{d}}$ to itself that is continuous and commutes with each σⁿ. By the Curtis-Lyndon-Hedlund Theorem, it has a finite neighborhood: the value of f(x)₀ depends only on finitely many coordinates of x. A radius of f is an integer r ≥ 0 such that f(x)₀ depends only on $x|_{[-r, r]^{d}}$. A radius gives rise to a local rule: a function $F : S^{[-r, r]^{d}} \to S$ such that $f(x)_{\textbf 0} = F(x|_{[-r, r]^{d}})$. Indexing an element $\eta \in S^{\mathbb {Z}^{d} \times \mathbb {N}}$ by $(\textbf {n}, t) \in \mathbb {Z}^{d} \times \mathbb {N}$ is denoted $\eta _{\textbf {n}}^{t}$, and $\eta ^{t} \in S^{\mathbb {Z}^{d}}$ is the t’th d-dimensional slice of η.

Denote by ${\mathscr{M}}(S^{\mathbb {Z}^{d}})$ the set of Borel probability measures on $S^{\mathbb {Z}^{d}}$, and similarly for $S^{\mathbb {Z}^{d} \times \mathbb {N}}$. Consider a mapping R from ${\mathscr{M}}(S^{\mathbb {Z}^{d}})$ to ${\mathscr{M}}({{S}^{\mathbb {Z}^{d} \times \mathbb {N}}})$. For a measure μ, we consider R(μ) as a random variable with values in $S^{\mathbb {Z}^{d} \times \mathbb {N}}$. We say that R is a stochastic symbolic process, if

it is continuous in the weak topologies,
it is linear, that is, R(λμ + (1 − λ)ν) = λR(μ) + (1 − λ)R(ν) holds for $\mu , \nu \in {\mathscr{M}}(S^{\mathbb {Z}^{d}})$ and 0 ≤ λ ≤ 1, and
R(μ)⁰ = μ.

Such a mapping is determined by its images on point measures. If μ is a point measure concentrated on $x \in S^{\mathbb {Z}^{d}}$, we denote R(μ) = R(x), and call it a random trajectory with initial condition x. We say that R is an 𝜖-perturbation of f, if for any $x \in S^{\mathbb {Z}^{d}}$ and any finite set $C \subset \mathbb {Z}^{d} \times \mathbb {N}$, the probability that $R(x)_{\textbf {v}}^{t+1} \neq f(R(x)^{t})_{\textbf {v}}$ holds for all (v, t) ∈ C is at most 𝜖^|C|. In this context, the set of coordinates $(\textbf {v}, t) \in \mathbb {Z}^{d} \times \mathbb {N}$ with $R(x)_{\textbf {v}}^{t+1} \neq f(R(x)^{t})_{\textbf {v}}$ is usually called the error set of R(x).

For now on, consider only linearly ordered state sets: S = {0,1,…,m} for some m ≥ 1. For $x, y \in S^{\mathbb {Z}^{d}}$, we denote x ≥ y if x_v ≥ y_v holds for all $\textbf {v} \in \mathbb {Z}^{d}$. A cellular automaton $f : S^{\mathbb {Z}^{d}} \to S^{\mathbb {Z}^{d}}$ is monotonic if x ≥ y implies f(x) ≥ f(y) for all $x, y \in S^{\mathbb {Z}^{d}}$. For a ∈ S, the all-a configuration is denoted by $\hat {0}$. The state a is called quiescent for f, if $f(\hat {0}) = \hat {0}$. We will always assume that the extremal states 0 and m are quiescent. A configuration $x \in S^{\mathbb {Z}^{d}}$ is an a-island, if $\hat {0} \leq x$ and x_v = a for all but finitely many $\textbf {v} \in \mathbb {Z}^{d}$. A 0-island will simply be called an island.

3 Eroders and Zero Sets

The notion of an eroder has appeared many times in the literature with different names, including nilpotency on finite configurations. It simply means that a CA removes all islands in a finite (but not necessarily uniform) number of steps.

Definition 3.1

We say that a monotonic cellular automaton f is an eroder, if for every 0-island $x \in S^{\mathbb {Z}^{d}}$ there exists $n \in \mathbb {N}$ such that $f^{n}(x) = \hat {0}$.

With the terminology of [11], this means that the all-0 trajectory is attractive. Our aim is to extend this notion to cellular automata with random perturbations. Of course, a nontrivial 𝜖-perturbation of a deterministic CA will almost surely never reach the uniform zero configuration $\bar {0}$. Thus we present the following definition, which is equivalent to the all-0 trajectory being stable, again using the terminology of [11].

Definition 3.2

Let f be a monotonic cellular automaton on $S^{\mathbb {Z}^{d}}$, and denote

$$ P(\epsilon) = \inf \{ \text{Pr} [R(\hat{0})_{\textbf{0}}^{t} = 0] | {t \ \in \mathbb{N}, \ R \ \text{is} \ \text{an} \ \epsilon-\text{perturbation of}\ f} \}. $$

We say that f is a stable eroder, if P(𝜖)→1 as 𝜖→0.

The following result justifies our terminology.

Proposition 3.3

All stable eroders are eroders.

Proof

Suppose that $f : S^{\mathbb {Z}^{d}} \to S^{\mathbb {Z}^{d}}$ is a monotonic cellular automaton that is not an eroder. Then there exists an island $x \in S^{\mathbb {Z}^{d}}$ such that $f^{n}(x) \neq \hat {0}$ for all $n \in \mathbb {N}$. We choose a sequence of coordinates $(\textbf {v}_{i})_{i \in \mathbb {N}}$ such that $f^{i}(x)_{\textbf {v}_{i}} \neq 0$.

Let 𝜖 > 0 be arbitrary, and let R_𝜖 be the 𝜖-perturbation of f where

$$ R_{\epsilon}(y)^{t+1}_{\textbf{v}} = \left\{ \begin{array}{ll} m & \mathrm{with~probability~} \epsilon \\ f(R_{\epsilon}(y)^{t})_{\textbf{v}} & \mathrm{with~probability~} 1 - \epsilon \end{array} \right. $$

independently for every $(\textbf {v}, t) \in \mathbb {Z}^{d} \!\times \! \mathbb {N}$ and $y \!\in \! S^{\mathbb {Z}^{d}}$, where m ∈ S is the maximal state. Note that R_𝜖(y)^t+ 1 ≥ f(R_𝜖(y)^t) holds for all $t \in \mathbb {N}$. Because of this, R_𝜖(y)^t ≥ σⁿ(x) implies R_𝜖(y)^t+i ≥ σⁿ(fⁱ(x)), and thus $R_{\epsilon }(y)^{t+i}_{\textbf {v}_{i} - \textbf {n}} \neq 0$, for all $i \in \mathbb {N}$ and $\textbf {n} \in \mathbb {Z}^{d}$.

For every $\textbf {n} \in \mathbb {Z}^{d}$ and t > 0, the probability that $R_{\epsilon }(\hat {0})^{t} \geq \sigma ^{\textbf {n}}(x)$ is bounded from below by a positive constant, say δ > 0. Then we have

$$ \text{Pr}[R_{\epsilon}(\hat{0})_{0}^{t} \neq 0] \geq \text{Pr}[\exists i \leq t : R_{\epsilon}(\hat{0})^{t-i} \geq \sigma^{\textbf{v}_{i}}(x)] \geq 1 - (1 - \delta)^{t} \stackrel{t \to \infty}{\longrightarrow} 1 $$

Since 𝜖 was arbitrary, f is not a stable eroder. □

It is known that for automata with more than two states, the converse does not hold: there exist automata that are eroders but not stable eroders. One such example is given in [10], and we present a slight modification of it here for completeness.

Example 3.4

Let S = {0,1,2}, and let $f : S^{\mathbb {Z}} \to S^{\mathbb {Z}}$ be the radius-1 cellular automaton defined by the local rule

$$ F(a,b,c) = \begin{cases} 0, & \text{if~} a = 0, b \leq 1, c \leq 1, \\ 1, & \text{if~} b = 2, c \leq 1, \\ 2, & \text{if~} a + b \geq c = 2, \\ b, & \text{otherwise.} \end{cases} $$

A simple (yet tedious) case analysis shows that f is monotonic. Consider the 0-island $x = {}^{\infty } 0 2^{N} 0^{\infty }$ consisting of a run of 2-states of length N. The CA f erodes the 2-states from the right: for 0 ≤ t ≤ N we have $f^{t}(x) = {}^{\infty } 0 2^{N-t} 1^{t} 0^{\infty }$. When only 1-states remain, they are eroded from the left: for 0 ≤ s ≤ N we have $f^{N + s}(x) = {}^{\infty } 0 0^{s} 1^{N-s} 0^{\infty }$. In particular, f^2N(x) is the all-0 configuration. Since all 0-islands are majored by an island like x, this implies that f is an eroder.

On the other hand, f is not a stable eroder. We only present a high-level idea of the proof. Consider a perturbation R of f where on each space-time coordinate $(i, t) \in \mathbb {Z} \times \mathbb {N}$, an error occurs with probability 𝜖 > 0 independently of all other coordinates, and always produces the state 2. Consider the random trajectory R(x) with $x = {}^{\infty } 0 2^{N} 0^{\infty }$, where the leftmost 2 is at coordinate 0. Let ℓ(0) = 0 and r(0) = N − 1, and for each t ≥ 0, define $[\ell (t), r(t)] \subset \mathbb {N}$ as the largest interval of non-0 cells in R(x) containing [ℓ(t − 1) + 1,r(t − 1) − 1]. Note that this interval is well-defined: if $R(x)^{t}_{[a,b]}$ consists of non-0 cells, then so does $R(x)^{t+1}_{[a+1,b-1]}$.

If $R(x)^{t}_{\ell (t)} = 2$, then ℓ(t + 1) ≤ ℓ(t), and if an error occurs at (ℓ(t) − 1,t), then ℓ(t + 1) ≤ ℓ(t) − 1. This means that as long as the left end of the interval contains a 2-state, the sequence ℓ(t) performs a random walk and moves to the left with average speed at least 𝜖. Suppose that $R(x)^{s}_{\ell (s)} = 2$ for all s < t. If we have $R(x)^{t}_{\ell (t)} = 1$, then ℓ(t) = ℓ(s) and $R(x)^{t-1}_{\ell (s)+1} = 1$ as well, by the local rule of f. This can be extended to $R(x)^{t-s'}_{\ell (s)+s'} = 1$ for all s^′≤ N − ℓ(s). The probability of this event is (1 − 𝜖)^N−ℓ(s), which drops exponentially as the border ℓ(s) moves to the left. One can verify that as N grows, for small enough 𝜖 the probability of maintaining the left border in state 2 forever approaches unity. In this case $R(x)^{t}_{0} \geq 1$ for all t ≥ 0.

Finally, consider the random trajectory R(0) and a fixed large N. As t grows, the probability that for some s ≤ t, all coordinates of {0,…,N − 1}×{s} contain errors, also approaches unity. If this event occurs, then $R(0)^{s} \geq {}^{\infty } 0 2^{N} 0^{\infty }$, and with a high probability $R(0)^{s'}_{0} \geq 1$ for all s^′≥ s. Thus f is not a stable eroder.

Definition 3.5

Let $V \subset \mathbb {Z}^{d}$ be a finite set, let k > 0, and let a < b ∈ S be quiescent states. We say that V is a, b-forcing at level k for a CA f, if for every configuration $x \in S^{\mathbb {Z}^{d}}$ such that $x \leq \hat {b}$ and x_v ≤ a for all v ∈ V, we have f^k(x)₀ ≤ a. The family of all minimal a, b-forcing sets at level k for f is denoted by $\mathcal {V}^{k}_{a,b}(f)$.

Note that if U is a, b-forcing at level k and V is a, b-forcing at level ℓ, then U + V is a, b-forcing at level k + ℓ, and thus some subset W ⊂ U + V satisfies $W \in \mathcal {V}^{k + \ell }_{a,b}(f)$. Forcing sets are analogous to zero sets as usually defined for binary cellular automata, and in that context, they can be used to characterize eroders.

Definition 3.6

For a set $A \subset \mathbb {R}^{d}$, denote by C(A) the convex hull of A. For a CA f and quiescent states a < b, denote

$$ \tau^{k}_{a,b}(f) = \bigcap\limits_{V \in \mathcal{V}^{k}_{a,b}(f)} C(V) \subset \mathbb{R}^{d} $$

where each a, b-forcing set $V \in \mathcal {V}^{k}_{a,b}(f)$ is interpreted as a subset of $\mathbb {R}^{d}$. We say f is a, b-shrinking, if $\tau ^{k}_{a,b}(f) = \emptyset $ for some k > 0.

It is not immediately clear whether one can algorithimically decide if a cellular automaton is a, b-shrinking for given states a, b, since Definition 3.6 refers to an unbounded variable k, but there may exist a bound K, computable from the radius of f and the number of states, such that f is a, b-shrinking precisely when $\tau ^{k}_{a,b}(f) = \emptyset $ for some k ≤ K. We show by an example that it is not sufficient to consider the case k = 1. However, the condition turns out to be decidable for one-dimensional cellular automata. This follows from the results of [3, 4], and we repeat it explicitly in Lemma 4.4. The decidability of the condition in the multi-dimensional case is left open.

Example 3.7

We show that the case k = 1 is not enough to determine the shrinking condition for two given states. Namely, consider the one-dimensional cellular automaton f with radius 1 and state set S = {0,1,…,m} defined by the local rule

$$ F(a,b,c) = \begin{cases} b-1, & \text{if~} \ b > 0 \text{~and~} c = 0, \\ b, & \text{otherwise.} \end{cases} $$

This automaton decrements a nonzero state by one if its right neighbor is 0, and otherwise keeps the state fixed. Consider the families $\mathcal {V}^{k}_{0,m}(f)$ of 0,m-forcing sets. We have $\{0\} \in \mathcal {V}^{k}_{0,m}(f)$ for all k ≥ 1, since the state 0 always stays as 0. For 1 ≤ k < m, the configuration $x = {}^{\infty } 0 . m 0^{\infty }$ satisfies f^k(x)₀≠ 0, which implies $\mathcal {V}^{k}_{0,m}(f) = \{ \{0\} \}$. However, f^m(y)₀ = 0 for any $y \in S^{\mathbb {Z}}$ with y₁ = 0, which implies $\{1\} \in \mathcal {V}^{m}_{0,m}(f)$. It follows that the CA f is 0,m-shrinking, but this fact cannot be deduced from $\mathcal {V}^{k}_{0,m}(f)$ for any k < m.

For monotonic binary cellular automata, the 0,1-shrinking condition is always decidable, as it suffices to consider only k = 1. Furthermore, it is equivalent to being an eroder and a stable eroder.

Proposition 3.8 (Section IV in 11)

Let S = {0,1}, and let f be a monotonic CA on $S^{\mathbb {Z}^{d}}$. The following conditions are equivalent.

f is an eroder.
f is a stable eroder.
f is 0,1-shrinking.
$\tau ^{1}_{a,b}(f) = \emptyset $.

It is known that this characterization of eroders generalizes to larger alphabets in the one-dimensional case, but not in the multi-dimensional case. This was proved in [3], and we repeat it in Theorem 4.6. In this article, we will show that the characterization of stable eroders likewise generalizes to larger alphabets in the one-dimensional case (although the generalization is different). The multi-dimensional case is left open.

Definition 3.9

Let S = {0,1,…,m}, and let f be a monotonic CA on $S^{\mathbb {Z}^{d}}$. We say that f satisfies the stability condition if there exist quiescent states 0 = a₁ < a₂ < ⋯ < a_k = m such that f is a_i, a_i+ 1-shrinking for all 1 ≤ i < k.

Theorem 3.10

A one-dimensional monotonic automaton f is a stable eroder if and only if it satisfies the stability condition.

The next three sections are devoted to the proof of this result.

4 Eroders in One Dimension: Gal’perin Rates

In this section, we fix a one-dimensional monotonic automaton $f : S^{\mathbb {Z}} \to S^{\mathbb {Z}}$ with radius r ≥ 0 and consider the evolution of certain configurations under f.

Definition 4.1

An increasing ladder is a configuration $x \in S^{\mathbb {Z}}$ such that x_i ≥ x_j for all i ≥ j. Decreasing ladders are defined analogously. Let a≠b ∈ S be quiescent states. The step of type a, b is the ladder $x \in S^{\mathbb {Z}}$ defined by

$$ x_{i} = \left\{ \begin{array}{ll} a, & \mathrm{if~} i < 0, \\ b, & \mathrm{if~} i \geq 0. \end{array} \right. $$

For $t \in \mathbb {N}$, we denote

$$ \begin{array}{@{}rcl@{}} L^{t}_{a,b} & {} = \max \{ i \in \mathbb{Z} | f^{t}(x)_{i} = a \} \\ R^{t}_{a,b} & {} = \min \{ i \in \mathbb{Z} | f^{t}(x)_{i} = b \} \end{array} $$

In the limit, we denote $L_{a,b} = \lim _{t \to \infty } L^{t}_{a,b} / t$, and similarly for R_{a, b}.

See Fig. 1 for a visualization of the definition. We know from the work of Gal’perin [3] that these quantities, which we call the Gal’perin rates of f, always exist and are rational numbers. Furthermore, they can be effectively computed from the local rule of f [4, 9]. The following result is also convenient.

Lemma 4.2

[3] There exists a constant K > 0 such that

$$ \left| L^{t}_{a,b} - t \cdot L_{a,b} \right| \leq K \text{~and~} \left| R^{t}_{a,b} - t \cdot R_{a,b} \right| \leq K $$

hold for all quiescent a≠b ∈ S and $t \in \mathbb {N}$.

We now show how these rates are connected to the forcing sets of f.

Lemma 4.3

Let a≠b ∈ S be quiescent. Then L_{a, b} ≤ R_{a, b}. If there is no quiescent state c between a and b, then L_{a, b} = R_{a, b}.

Proof

The first claim is clear, since $L^{t}_{a,b} \leq R^{t}_{a,b}$ holds for all $t \in \mathbb {N}$. For the second claim, suppose that a < b and no state a < c < b is quiescent. Since f is monotonic, there exists n ≥ 1 such that $f^{n}(\bar {c}) \in \{\bar {a}, \bar {b}\}$ for all such c. If $C = R^{t}_{a,b} - L^{t}_{a,b}$ is large enough, there exist a < c < b and $L^{t}_{a,b} + C/3 < i < R^{t}_{a,b} - C/3$ such that f^t(x)|_{[i−rn, i+rn]} = c^2rn+ 1, where r ≥ 0 is the radius of f. Then f^t+n(x)_i ∈{a, b}, and we have either $L^{t+n}_{a,b} > L^{t}_{a,b} + C/3$ or $R^{t+n}_{a,b} < R^{t}_{a,b} - C/3$. Lemma 4.2 implies $|L^{t+n}_{a,b} - L^{t}_{a,b}| \leq n L_{a,b} + K$, and similarly for $R^{t}_{a,b}$, so C is bounded by a constant. Then we have L_{a, b} = R_{a, b}. □

Lemma 4.4

Let a < b ∈ S be quiescent. Then

$$ \begin{array}{@{}rcl@{}} L_{a,b} & {} = \sup \{ -\max U / k | k > 0, U \in \mathcal{V}^{k}_{a,b}(f) \} \\ R_{b,a} & {} = \inf \{ -\min V / k | k > 0, V \in \mathcal{V}^{k}_{a,b}(f) \} \end{array} $$

Proof

Let $x \in S^{\mathbb {Z}}$ be the step of type a, b, and let $U \in \mathcal {V}^{k}_{a,b}(f)$ be arbitrary. Denote $u = {\max \limits } U$. For $t \in \mathbb {N}$, we have $f^{t}(x)_{L^{t}_{a,b} + i - u} = a$ for all i ∈ U. Since U is a, b-forcing, this implies $f^{t+k}(x)_{L^{t}_{a,b} - u} = a$. Because f is monotonic, f^t+k(x) is an increasing ladder, and we have $L^{t+k}_{a,b} \geq L^{t}_{a,b} - u$. An inductive argument now shows $L^{t}_{a,b} \geq - t u / k$, which gives L_{a, b} ≥−u/k.

For the other direction, let $U_{t} = \{- r t, - r t + 1, \ldots , - L^{t}_{a,b} - 1 \} \subset \mathbb {Z}$ for t > 0, where $r \in \mathbb {N}$ is the radius of f. If $y \in S^{\mathbb {Z}}$ is a configuration satisfying $y \leq \hat {b}$ and y_i ≤ a for all i ∈ U_t, then $y_{i} \leq \sigma ^{L^{t}_{a,b}}(x)_{i}$ for all − rt ≤ i ≤ rt. This implies $f^{t}(y)_{0} \leq f^{t}(x)_{L^{t}_{a,b}} = a$, so U_t is an a, b-forcing set at level t. We also have $- {\max \limits } U_{t} / t = (L^{t}_{a,b} + 1) / t \longrightarrow L_{a,b}$ as t grows.

The second statement follows by symmetry. □

Lemma 4.5

The following conditions are equivalent.

f is a, b-shrinking.

For some k > 0, there exist $U,V \in \mathcal {V}^{k}_{a,b}(f)$ such that ${\max \limits } U < {\min \limits } V$.

L_{a, b} > R_{b, a}.

Furthermore, they are algorithmically decidable from the local rule of f.

Proof

If f is a, b-shrinking, then there exists $k \in \mathbb {N}$ with $\tau ^{k}_{a,b} = \emptyset $. Since the convex hull C(V ) is an interval for each $V \in \mathcal {V}^{k}_{a,b}$, this means that there exist forcing sets $U, V \in \mathcal {V}^{k}_{a,b}$ with ${\max \limits } U < {\min \limits } V$. Conversely, (2) implies U ∩ V = ∅, so f is a, b-shrinking. Thus (1) and (2) are equivalent.

If the (2) holds, then $L_{a,b} \geq -{\max \limits } U / k > {\min \limits } V / k \leq R_{b,a}$ by Lemma 4.4, so we obtain (3). Suppose finally L_{a, b} > R_{b, a}. Then there exist $k, \ell \in \mathbb {N}$, and two sets $U \in \mathcal {V}^{k}_{a,b}(f)$ and $V \in \mathcal {V}^{\ell }_{a,b}(f)$ with ${\max \limits } U / k < {\min \limits } V / \ell $. Denote U^′ = U + U + ⋯ + U (a sum of ℓ sets), and V^′ = V + V + ⋯ + V (a sum of k sets). By the remark after Definition 3.5, there exist subsets $\hat U \subset U'$ and $\hat V \subset V'$ that are in $\mathcal {V}^{k \ell }_{a,b}$ and $\mathcal {V}^{k \ell }_{b,a}$ respectively. This implies ${\max \limits } \hat U \leq \ell {\max \limits } U < k {\min \limits } V \leq {\min \limits } \hat V$. Thus the second condition holds.

The decidability follows from the results of [4]. □

The characterization of eroders in one dimension is the following.

Theorem 4.6 (3)

The cellular automaton f is an eroder if and only if R_0,a > L_a,0 holds for all quiescent states a ∈ S ∖{0}.

This condition is not equivalent to f being 0,a-shrinking for all quiescent a ∈ S ∖{0}. More explicitly, if f is 0,m-shrinking where m ∈ S is the maximal state, then it is an eroder, but the converse does not hold in general. In the one-dimensional case, see Example 4.8. Also, the condition implies that the time required for f to erode an island is at most linear in the diameter of the island, and in two dimensions examples of slower eroders are known, even in the case that the automaton is decreasing, that is, f(x) ≤ x holds for all $x \in S^{\mathbb {Z}^{2}}$ [8].

We will now reformulate our main result, Theorem 3.10, in terms of Gal’perin rates.

Theorem 4.7

Let S = {0,1,…,m}. A one-dimensional monotonic automaton f on $S^{\mathbb {Z}}$ is a stable eroder if and only if there exist quiescent states 0 = a₁ < a₂ < ⋯ < a_k = m such that $L_{a_{i}, a_{i+1}} > R_{a_{i+1}, a_{i}}$ for all 1 ≤ i < k.

Example 4.8

Consider again the three-state CA f of Example 3.4. We compute the Gal’perin rates of f for all pairs of states, since every state of f is quiescent. Consider first R_0,1. In the step configuration ${}^{\infty } 0 . 1^{\infty }$, the leftmost 1 turns into a 0 in one application of f. Thus R_0,1 = 1. In a similar manner, we compute

$$ \begin{array}{cccccc} L_{0,1} = 1 & L_{1,0} = 0 & L_{0,2} = 0 & L_{2,0} = -1 & L_{1,2} = -1 & L_{2,1} = -1 \\ R_{0,1} = 1 & R_{1,0} = 0 & R_{0,2} = 0 & R_{2,0} = 0 & R_{1,2} = -1 & R_{2,1} = -1 \end{array} $$

From this table, one can check that the condition of Theorem 4.6 holds, so f is an eroder. Note also that f is not 0,2-shrinking. The condition of Theorem 4.7 does not hold, so f is not a stable eroder.

We list here some generally useful lemmas.

Lemma 4.9

Let a < b ∈ S be quiescent. Then we have

$$ L_{a,b} \geq L_{a,b+1} \text{~and~} R_{b,a} \leq R_{b+1,a} $$

(1)

together with

$$ R_{a,b} \geq R_{a+1,b} \text{~and~} L_{b,a} \leq L_{b,a+1}. $$

(2)

Proof

We only prove the first inequality of (1); the others follow by symmetry, either by swapping left and right, or inverting the order of the state set. Let $x, y \in S^{\mathbb {Z}}$ be the steps of type a, b and a, b + 1, respectively. Then we have $\bar {a} \leq f^{t}(x) \leq f^{t}(y)$ for all $t \in \mathbb {N}$ by monotonicity, and in particular, f^t(y)_i = a implies f^t(x)_i = a for all $i \in \mathbb {Z}$. This implies $L^{t}_{a,b} \geq L^{t}_{a,b+1}$, and the claim follows by taking the limit. □

Lemma 4.10

Let a < b ∈ S be quiescent states. Then there exists a quiescent state c ∈ S with a < c ≤ b such that L_{c, a} = R_{b, a}.

Of course, there also exists a < d ≤ b such that R_{a, d} = L_{a, b} by symmetry.

Proof

This follows directly from Lemma 5(δ) of [3]. □

5 Stability Condition is Sufficient

In this section, we prove the first part of Theorem 4.7: automata that satisfy the stability condition are stable eroders. The proof follows the ideas presented in [11]. The high level idea is that we take an 𝜖-perturbation of f, and consider a single coordinate (0,T) in a random trajectory starting fom the uniform zero configuration. Assuming the coordinate has a nonzero state, we construct a geometric object that records the ‘reason’ for this event, that is, traces it back to some finite subset of coordinates where errors have occurred. We prove that the size of this subset grows linearly with the size of the object, and there is an exponential number of objects of a given size. A simple calculation then shows that the probability of (0,T) having a nonzero state approaches 0 with 𝜖. The main difference to [11] is that our geometric objects are polygons instead of trees.

For the remainder of this section, fix a monotonic cellular automaton f on $S^{\mathbb {Z}}$ that satisfies the stability condition for the quiescent states 0 = a₁ < ⋯ < a_k = m. For convenience, we denote $L_{n} = L_{a_{n}, a_{n+1}}$, $R_{n} = R_{a_{n+1}, a_{n}}$, S_n = {a ∈ S|a_n < a ≤ a_n+ 1} and $S_{n}^{+} = \cup _{\ell \geq n} S_{\ell }$ for n ∈{1,…,k − 1}. By Lemma 4.4, there exist k_n > 0 and forcing sets $U_{n}, V_{n} \in \mathcal {V}^{k_{n}}_{a_{n}, a_{n+1}}(f)$ such that U_n, V_n ⊂{−k_nr,…,k_nr} and $-k_{n} L_{n} \leq u_{n} = {\max \limits } U_{n} < {\min \limits } V_{n} = v_{n} \leq -k_{n} R_{n}$. Recall the number K from Lemma 4.2. We may assume that

$$ 2 K < v_{n} - u_{n}, \qquad k_{n} = 1 $$

(3)

hold for all n; this can be guaranteed by taking a large enough power of f that is divisible by each k_n. Let R be an 𝜖-perturbation of f for some small 𝜖 > 0, and consider the random trajectory $\eta = R(\hat {0})$. We will prove that ${\text {Pr}[\eta _{0}^{T}} > 0] \stackrel {\epsilon \to 0}{\longrightarrow } 0$ uniformly in T.

We define some more auxiliary concepts before proceeding with the proof. Define four sets of integer vectors:

$$ \begin{array}{@{}rcl@{}} {{\Delta}^{L}_{n}} & {} = \{ (i, -1) | -r \leq i \leq u_{n}\}, \qquad {{\Delta}^{R}_{n}} & {} = \{ (i, 1) | -r \leq i \leq -v_{n} \}, \\ {{\Delta}^{C}_{n}} & {} = \{ (i, 0) | 0 < i \leq 2 r \}, \qquad {{\Delta}^{B}_{n}} & {} = - {{\Delta}^{C}_{n}} \end{array} $$

Finally, define the random error set by $E = \{ (i, t) \in \mathbb {Z} \times \mathbb {N} | \eta ^{t+1}_{i} \neq f(\eta ^{t})_{i} \}$.

Let T ≥ 1 be an arbitrary positive integer. We will now construct a system of geometric shapes on the vertex set $W = \{ (i, t) \in \mathbb {Z} \times \mathbb {N} | t \leq T, |i| \leq r (T - t) \}$ parametrized by the states S_n ⊂ S and a set C ⊂ W such that ${\eta ^{t}_{i}} \in S_{n}$ for all (i, t) ∈ C. For a set $P \subset \mathbb {R}^{2}$, denote its border by ∂P. By a polygon we mean a closed and bounded subset of $\mathbb {R}^{2}$ that is a finite union of triangles and line segments. In particular, a polygon may not be equal to the closure of its interior.

Definition 5.1

For n ∈{1,…,k}, define a space-time polygon of level n as a pair 〈P, X〉 satisfying the following conditions.

a) $P \subset \mathbb {R}^{2}$ is a non-empty simply connected polygon.
b) X = 〈w₁,…,w_ℓ〉 is a cyclic list of vertices in W ∩ ∂P, which occur in counterclockwise order on ∂P, and ∂P is the union of the line segments $\overline {w_{i} w_{i+1}}$.
c) For all i ∈{1,…,ℓ} we have $w_{i+1} - w_{i} \in {{\Delta }^{L}_{n}} \cup {{\Delta }^{R}_{n}} \cup {{\Delta }^{C}_{n}} \cup {{\Delta }^{B}_{n}}$.
d) If (i, t),(j, t) ∈ X and 0 < |i − j|≤ 2r, then the line segment from (i, t) to (j, t) is a subset of P.
e) Each w = (i, t) ∈ X satisfies ${\eta ^{t}_{i}} \in S_{n}$ and one of the following conditions:
1.

There exist $\tilde n > n$ and $j \in \{- \tilde n r, \ldots , \tilde n r\}$ with $\eta _{i+j}^{t-1} \in S_{\tilde n}$. Then w has type 1, the coordinate v = (i + j, t − 1) ∈ W is the support point of w at level $\tilde n$, and we denote v = supp(w). If there are several candidates for the support point, we choose one that maximizes the level $\tilde n$.

2.

There is an error at the coordinate that precedes w in the trajectory: we have (i, t − 1) ∈ E. Then w has type 2.

3.

There exist j ∈ U_n and j^′∈ V_n such that the triangle spanned by w, (i + j, t − 1) and (i + j^′, t − 1) is a subset of P. Then w has type 3, and w is a child of (i + j, t − 1) and (i + j^′, t − 1).

Item (d) in Definition 5.1 is a technical constraint that is easy to enforce during the construction and makes the proof of Lemma 5.2 simpler. If the condition does not hold, we say that the pair (i, t),(j, t) violates condition (d) in 〈P, X〉. Similarly, the constraint on maximizing $\tilde n$ in item (e) simplifies the proofs of Lemma 5.4 and Lemma 5.6 without affecting the construction in any other way.

See Fig. 2 for a visualization of a space-time polygon. The shaded area is P, the black dots are the vertices w_i in the list X, and each arrow is a line segment $\overline {w_{i} w_{i+1}}$. An edge is in ${{\Delta }^{C}_{n}}$ if it points east, in ${{\Delta }^{B}_{n}}$ if it points west, in ${{\Delta }^{L}_{n}}$ if southwest and in ${{\Delta }^{R}_{n}}$ if northwest. Some edges in the figure are labeled with the set they belong to. Note that horizontal line segments may be traversed twice, as is the case near the rightmost vertex. The darker shaded areas at the bottom are parts of a space-time polygon of level $\tilde n$ for some $\tilde n > n$, and the dashed lines denote the support point relation. The vertices in X with dashed lines have type 1. Vertices enclosed in boxes are produced by errors, and they have type 2. Other vertices have type 3; one of the triangles is depicted in the figure.

Intuitively, a space-time polygon on level n records the ‘reason’ for the vertices in the set C being in the state n. It resembles the notion of truss in [11]. The idea of the proof is that a coordinate $(i, t) \in \mathbb {Z}^{2}$ with ${\eta ^{t}_{i}} \neq 0$ gives rise to a finite collection of space-time polygons, a constant fraction of whose vertices have type 2, that is, are caused by errors. The number of such collections with N vertices in total grows exponentially with N, and by choosing the error rate 𝜖 small enough, we can bound the probability that any collection of polygons enables ${\eta ^{t}_{i}} \neq 0$. We begin by showing that in a single polygon 〈P, X〉, the number of vertices of type 1 or 2 grows linearly with the size of the set X.

Lemma 5.2

For all n ∈{1,…,k}, there exists δ_n > 0 such that for any space-time polygon 〈P, X〉 of level n, there are at least δ_n|X| elements of type 1 or 2 in X.

Proof

Let X = 〈w₁,…,w_ℓ〉, and denote $I = \{i \in \{1, \ldots , \ell \} | w_{i+1} - w_{i} \in {{\Delta }^{C}_{n}} \}$ and J = {1,…,ℓ}∖ I. Define a linear function $M_{n} : \mathbb {R}^{2} \to \mathbb {R}$ by M_n(i, t) = i + t(u_n + v_n)/2. We claim that M_n is positive on ${{\Delta }^{C}_{n}}$ and negative on ${{\Delta }^{L}_{n}} \cup {{\Delta }^{R}_{n}} \cup {{\Delta }^{B}_{n}}$. The case of ${{\Delta }^{C}_{n}}$ and ${{\Delta }^{B}_{n}}$ is clear, since we have t = 0, and i is positive on ${{\Delta }^{C}_{n}}$ and negative on ${{\Delta }^{B}_{n}}$. Let then $(i, -1) \in {{\Delta }^{L}_{n}}$, so that i ≤ u_n. Then we have M_n(i,− 1) = i − (u_n + v_n)/2 ≤ (u_n − v_n)/2 < 0, since u_n < v_n. For $(i, 1) \in {{\Delta }^{R}_{n}}$ we have i ≤−v_n, which implies M_n(i,1) = i + (u_n + v_n)/2 ≤ (u_n − v_n)/2 < 0.

Since these sets are finite, there exists 0 < δ < 1 such that δ < M_n(v) < δ^− 1 for all $v \in {{\Delta }^{C}_{n}}$ and − δ^− 1 < M_n(v) < −δ for all $v \in {{\Delta }^{R}_{n}} \cup {{\Delta }^{L}_{n}} \cup {{\Delta }^{B}_{n}}$. Since X is a circular list, we have ${\sum }_{i=1}^{\ell } M_{n}(w_{i+1} - w_{i}) = 0$ by linearity of M_n. The sets I and J form a partition of {1,…,ℓ}, so we have δ²|J|≤|I|≤ δ^− 2|J|. From this we also deduce |I|≥|X|/(1 + δ^− 2).

We present a geometric argument for the fact that the number of vertices of type 1 or 2 in X is at least |I|/2. Let i ∈{1,…,ℓ} be such that $w_{i+1} - w_{i} \in {{\Delta }^{C}_{n}}$, that is, i ∈ I. Suppose that both w_i and w_i+ 1 have type 3. Then there exist $u, v \in \mathbb {Z}^{2}$ with $w_{i} - u \in {{\Delta }^{R}_{n}}$, $v - w_{i+1} \in {{\Delta }^{L}_{n}}$ and $\overline {u w_{i}}, \overline {w_{i+1} v} \subset P$. The angle $\angle w_{i-1} w_{i} w_{i+1}$ cannot be larger than $\angle u w_{i} w_{i+1} < \pi $, which implies $w_{i} - w_{i-1} \in {{\Delta }^{R}_{n}}$. By a symmetric argument we have $w_{i+2} - w_{i+1} \in {{\Delta }^{L}_{n}}$. Denoting w_i− 1 = (a, t) and w_i+ 2 = (b, s), a simple calculation shows that t = s and |a − b|≤ 2r. We must also have a≠b, since P is simply connected. This violates condition (d), so one of w_i or w_i+ 1 has type 1 or 2. See Fig. 3 for a visualization of this argument. It follows that the number of type-1 or 2 vertices in X is at least |I|/2 ≥|X|/(2 + 2δ^− 2). We define δ_n = (2 + 2δ^− 2)^− 1, which finishes the proof. □

We will now construct a set Q(C) of disjoint space-time polygons of level n whose union contains C, the set of initial vertices. We construct the polygons iteratively, maintaining a set Q_p(C) of ‘incomplete polygons’ that are allowed to violate condition (d) and the second part of condition (e) in Definition 5.1, meaning that the vertices at time t may not have a type. We start with Q₀(C) being the collection of single-vertex polygons 〈{w},〈w〉〉 for w ∈ C.

Assume then that we have constructed the set Q_p(C) for some p ≥ 0. There are three possible conditions that prevent us from choosing Q(C) = Q_p(C), which we refer to as violations:

Some vertex w ∈ W occurs in the list of some polygon of Q_p(C), and has no type in any polygon in which it occurs.

Some pair of vertices v, w occurs on the list of some polygon of Q_p(C) (not necessarily consecutively), and violates condition (d) in every polygon in which it occurs.

Some polygons of Q_p(C) have nonempty intersection.

If none of these violations hold, then Q_p(C) consists of disjoint space-time polygons of level n. Namely, if w ∈ W is a vertex of some polygon 〈P, X〉∈ Q_p(C), then it has a type in one of them (since violation 1 does not hold), which must be 〈P, X〉 since the polygons are disjoint. Similarly, if v, w ∈ W occur in the list of some polygon 〈P, X〉∈ Q_p(C), then it does not violate condition (d) in some polygon (since violation 2 does not hold), which must be 〈P, X〉. We now show how to handle the violations one by one. The first two cases involve adding new simple polygons to the set Q_p(C), and they are visualized in Fig. 4. In the last case, we show how to merge two intersecting polygons into one, which is a more involved process.

Suppose that violation 1 holds: there exist 〈P, X〉∈ Q_p(C) and w = (i, t) ∈ X that has no type in any polygon of Q_p(C) it belongs to. In particular, (i, t − 1) does not contain an error and $\eta ^{t-1}_{i + j} \notin S^{+}_{n+1}$ for all − r ≤ j ≤ r. Since U_n and V_n are a_n, a_n+ 1-forcing sets and ${\eta ^{t}_{1}} \in S_{n}$, there exist j_L ∈ U_n and j_R ∈ V_n such that $\eta ^{t-i}_{i + j_{L}}, \eta ^{t-1}_{i + j_{R}} \in S_{n}$. We also have $(j_{L}, -1) \in {{\Delta }^{R}_{n}}$, $(j_{R}, 1) \in {{\Delta }^{L}_{n}}$ and $(j_{R} - j_{L}, 0) \in {{\Delta }^{C}_{n}}$. Denote X^′ = 〈w,(i + j_L, t − 1),(i + j_R, t − 1)〉, and let $P' \subset \mathbb {R}^{2}$ be the triangle spanned by these three points. Then 〈P^′, X^′〉 is a (possibly incomplete) three-vertex polygon and w has type 3 in it. We define Q_p+ 1(C) = Q_p(C) ∪{〈P^′, X^′〉}, and w is no longer a witness for violation 1.

Suppose then that violation 2 holds, so that some polygon 〈P, X〉∈ Q_p(C) violates condition (d): some v = (i, t),w = (j, t) ∈ X satisfy 0 < i − j ≤ 2r. Thus we have $v - w \in {{\Delta }^{C}_{n}}$ and $w - v \in {{\Delta }^{B}_{n}}$. Let $P' = \overline {v w}$ and X^′ = 〈v, w〉. Then 〈P^′, X^′〉 is a polygon with two vertices that satisfies every condition of Definition 5.1 except the latter part of (e). In particular, the pair v, w does not violate condition (d) in 〈P^′, X^′〉. We define Q_p+ 1(C) = Q_p(C) ∪{〈P^′, X^′〉}, and then v, w is no longer a witness for violation 2.

Suppose finally that violation 3 holds: there exist 〈P, X〉,〈P^′, X^′〉∈ Q_p(C) such that P ∩ P^′≠∅. We construct a new polygon that contains the union P ∪ P^′. Let $\tilde P$ be the polygon obtained by filling all holes in P ∪ P^′, and let $\tilde X = \langle {w_{1}, \ldots , w_{z}}\rangle $ be the list obtained by traversing the border of $\tilde P$ in the counterclockwise direction and enumerating the elements of X ∪ X^′ in the order they are encountered. Finally, let $\hat P$ be the simply connected polygon whose border is exactly the union of the line segments $\overline {w_{i} w_{i+1}}$. We claim that $(\hat P, \tilde X)$ is a valid space-time polygon. It is easy to see that $w_{i} \in W \cap \partial \hat P$ for all i ∈{1,…,z}, and we now show that the differences of successive elements of the list $\tilde X$ have the correct form.

Lemma 5.3

For all i ∈{1,…,z} we have $w_{i+1} - w_{i} \in {{\Delta }^{L}_{n}} \cup {{\Delta }^{R}_{n}} \cup {{\Delta }^{C}_{n}} \cup {{\Delta }^{B}_{n}}$.

Proof

We first show that the vertical distance between w_i = (j, s) and w_i+ 1 = (j^′, s^′) is at most 1, which implies that it is either 0 or 1. Namely, there is a path from w_i to w_i+ 1 along the border of $\tilde P$, and assuming s^′≥ s + 1 (the case s ≥ s^′ + 1 being analogous), this path runs through some coordinate $(j^{\prime \prime }, s + 1)$ with $j^{\prime \prime } \in \mathbb {R}$; let this be the first such coordinate along the path. Then $(j^{\prime \prime }, s + 1)$ is the endpoint of some line segment formed by consecutive elements of X or X^′, so in particular we have $(j^{\prime \prime }, s+1) \in X \cup X'$, which implies $(j^{\prime \prime }, s + 1) = w_{i+1}$.

Suppose now that s^′ = s + 1. If the path L from w_i to w_i+ 1 along $\partial \tilde P$ is a single line segment, then it is contained in the border of either P or P^′, and we are done. Otherwise, it is formed from two segments, one from P and one from P^′, and there are coordinates $j_{1}, j_{2} \in \mathbb {Z}$ such that (j₁, s) and w_i+ 1 are consecutive elements in one of the lists (say X), and w_i and (j₂, s + 1) are consecutive elements in the other list (say X^′), and L is formed form parts of the associated line segments, which have a single crossing point. See Fig. 5. Furthermore, we are traversing the border of P ∪ P^′ in the counterclockwise direction, which means that (j₁, s) is to the left of w_i and (j₂, s + 1) is to the left of w_i+ 1. Consequently, j₁ ≤ j and j₂ ≤ j^′. Since (j₂, s + 1) − w_i and w_i+ 1 − (j₁, s) are elements of ${{\Delta }^{R}_{n}}$, we have

$$ -r \leq j_{2} - j \leq j' - j \leq j' - j_{1} \leq -v_{n} $$

which implies $w_{i+1} - w_{i} \in {{\Delta }^{R}_{n}}$. The case for s^′ = s − 1 is similar, but with ${{\Delta }^{L}_{n}}$ in place of ${{\Delta }^{R}_{n}}$.

Finally, suppose s^′ = s. Then there is a path γ in ∂P from w_i to w_i+ 1 that either is a line segment, or consists of finitely many line segments and crosses an integral y-coordinate only at its endpoints. In the former case, γ is contained in a line segment between consecutive elements of either X or X^′, which implies |j − j^′|≤ 2r and thus $w_{i+1} - w_{i} \in {{\Delta }^{C}_{n}} \cup {{\Delta }^{B}_{n}}$. Suppose then that the latter case holds, and let $w_{i} = \hat w_{1}, \hat w_{2}, \ldots , \hat w_{\ell } = w_{i+1}$ be the endpoints of the line segments that form γ; equivalently, they are the vertices of $\tilde P$ that lie on γ. We may assume that the y-coordinates of the $\hat w_{p}$ are between s − 1 and s, and that j^′ < j. The cases of the y-coordinates lying between s and s + 1 and/or j^′ > j are symmetric, and in fact cannot actually happen in space-time polygons.

Each line segment L_p from $\hat w_{p}$ to $\hat w_{p+1}$ is part of a longer segment between some (j_p, s) and (jp′,s − 1) in ∂P or ∂P^′. We say that the segment L_p is decreasing if (j_p, s) is closer to $\hat w_{p}$ than $\hat w_{p+1}$, and increasing otherwise. The interior angle α_p of $\tilde P$ at any endpoint $\hat w_{p}$ for 1 < p < ℓ is greater than π, for otherwise $\hat w_{p}$ would be a vertex of P or P^′, and would lie in X ∪ X^′. This implies two things. First, there exists 1 ≤ p^′ < ℓ such that L_p is decreasing for each p ≤ p^′, and increasing for each p > p^′. Second, j_p < j_p+ 1 holds whenever 1 ≤ p < ℓ and p≠p^′. Since j₁ = j and j_ℓ = j^′, this means j_p ≥ j for p ≤ p^′, and j_p ≤ j^′ for p > p^′. Each $\hat {w}_{p}$ also has y-coordinate strictly above s − 1, and hence we have jp^′′ < jp^′+ 1′. We now compute $|j - j'| \leq |j_{p'} - j'_{p'}| + |j_{p'+1} - j'_{p'+1}| \leq 2 r$, so that $w_{i+1} - w_{i} \in {{\Delta }^{C}_{n}} \cup {{\Delta }^{B}_{n}}$. See Fig. 6 for a visualization of this argument. □

To finish the treatment of violation 3, we define $Q_{p+1}(C) = (Q_{p}(C) \setminus \{\langle P, X \rangle , \langle P', X' \rangle \}) \cup \{ \langle \hat P, \tilde X \rangle \}$.

Now, all three constructions detailed above have the property that the union of all polygons in Q_p+ 1(C) contains the union of all polygons in Q_p(C). In particular, if a vertex or pair of vertices witnesses violation 1 or 2 and we apply the construction to them, they cannot witness the violation again at any later stage. Since there are finitely many sets of polygons on the vertex set W, we eventually reach a set Q_p(C) that avoids violations 1, 2 and 3, and choose Q(C) = Q_p(C). As claimed, it is a set of disjoint space-time polygons of level n whose union contains C. At this point we also remark that in any polygon 〈P, X〉∈ Q(C), any point w ∈ X whose time coordinate is maximal is an element of C. In particular, X ∩ C≠∅. However, note also that C is not necessarily a subset of X, since some of its elements may cease to be vertices as polygons are merged, and become interior points instead.

We will now use the construction of space-time polygons to bound the probability of a given coordinate (0,T) having a nonzero state in the random trajectory η. For this purpose, suppose that ${\eta ^{T}_{0}} > 0$, and let n₀ ∈{1,…,k − 1} be such that ${\eta ^{T}_{0}} \in S_{n_{0}}$. Define the initial set of vertices as $C_{n_{0}} = \{(0, T)\}$, and construct the set of level-n₀ space-time polygons $Q(C_{n_{0}})$. Recall that each vertex w = (i, t) that has type 1 in some polygon of $Q(C_{n_{0}})$ has a support point supp(w) = (j, t − 1) at some level n > n₀ with |i − j|≤ nr. Inductively, for each n₀ < n ≤ k, we define C_n ⊂ W as the set of level-n support points of the polygons in $\bigcup _{p < n} Q(C_{p})$:

$$ C_{n} = \{ (i,t) | p < n, \langle P, X \rangle \in Q(C_{p}), w \in X, (i,t) = \text{supp}(w), {\eta^{t}_{i}} \in S_{n} \} $$

This defines an indexed family $(Q(C_{n}))_{n = n_{0}}^{k}$ of sets of space-time polygons, where the polygons of each set Q(C_n) are disjoint.

Fix two numbers n₀ ≤ n < ℓ ≤ k, and let 〈P, X〉∈ Q(C_ℓ) and 〈P^′, X^′〉∈ Q(C_n) be two space-time polygons of different levels. This means that 〈P, X〉 is constructed at a later stage than 〈P^′, X^′〉, and its vertices have higher states. The statements of Lemmas 5.4, 5.5 and 5.6 refer to these polygons. The series of results shows that the two polygons can intersect only if P ⊂ P^′. First, we show that the vertex lists of the polygons are separated horizontally by at least r steps.

Lemma 5.4

Let w = (i, t) ∈ X be arbitrary. Then there are no elements w^′ = (j, t) ∈ X^′ with |i − j|≤ r.

Proof

Suppose on the contrary that such an element exists, and consider the first point in the construction of Q(C_n) where w^′ is added to the vertex set of some polygon. If w^′ was introduced as a parent of another vertex (j^′, t + 1) that was a witness to violation 1, then |j − j^′|≤ r, which implies |i − j^′|≤ 2r ≤ ℓr. Then (j^′, t + 1) has type 1 in any polygon it belongs to, since w is its potential support point, which contradicts the assumption that it witnessed violation 1 by having no type. Thus w^′ was not introduced as a parent of another vertex. Since this is the only way of adding new elements to the vertex lists of polygons, the vertex w^′ must have been present in the first phase Q₀(C_n), which implies w^′∈ C_n.

The initial vertex (0,T) is the only vertex with time T, and the only element of $C_{n_{0}}$. Since w and w^′ have the same y-coordinate, this implies w^′≠(0,T) and n > n₀. Thus the vertex w^′ is the level-n support point of some vertex v = (j^′, t + 1) with $\eta ^{t+1}_{j'} \in S_{n'}$, n^′ < n and |j − j^′|≤ nr. But then |i − j^′|≤|i − j| + |j − j^′|≤ (n + 1)r ≤ ℓr, and w is also a potential support point for v. Since ℓ > n and the support point is chosen in a way that maximizes its level, the vertex w^′ cannot be the support point of v, which contradicts w^′∈ C_n. □

Lemma 5.5

∂P ∩ ∂P^′ = ∅

Proof

Assume for contradiction that ∂P intersects ∂P^′. Then there are consecutive vertices v, w ∈ X and v^′, w^′∈ X^′ such that the line segments $\overline {v w} \subset P$ and $\overline {v' w'} \subset P'$ intersect. The vertical distance between the endpoints of each segment is at most 1, so some pair of endpoints, say v and v^′, have the same y-coordinate, say v = (i, t) and v^′ = (j, t) with i≠j. If the segment $\overline {v w}$ is horizontal, then one of v or w is within distance r of v^′, contradicting Lemma 5.4, and analogously if $\overline {v' w'}$ is horizontal. Thus both segments are non-horizontal, and we may assume w = (i^′, t + 1) and w^′ = (j^′, t + 1) with i^′≠j^′. Since the segments intersect, we have either i < j and i^′ > j^′, or i > j and i^′ < j^′. In both cases, |i − i^′|≤ r and |j − j^′|≤ r imply $\min \limits (|i - j|, |i' - j'|) \leq r$, again a contradiction with Lemma 5.4. □

Lemma 5.6

There does not exist w ∈ X^′ with w ∈ P.

Proof

Suppose on the contrary that some w ∈ X^′ satisfies w ∈ P. We may assume that n is the minimal element of {n₀,…,ℓ − 1} that allows this. We have w ∈ P ∩ ∂P^′≠∅, so Lemma 5.5 implies P^′⊂ P. Let v = (i, t) ∈ C_n ∩ P^′ be arbitrary. Since P^′⊂ P, there exists a vertex of X with the same y-coordinate as v. As (0,T) is the only vertex of $C_{n_{0}}$ and the only vertex with y-coordinate equal to T, we have v≠(0,T) and n > n₀. Then there exist m < n, a polygon $\langle {\hat P, \hat X}\rangle \in Q(C_{m})$ and a vertex $u = (i', t+1) \in \hat X$ with v = supp(u). By the minimality of n, we have u∉P. Then the line segment $I = \overline {v u}$ intersects ∂P, so there are consecutive elements w₁, w₂ ∈ X such that the segment $\overline {w_{1} w_{2}}$ intersects I. See Fig. 7.

As in the proof of Lemma 5.5, we can assume w₁ = (j, t) and w₂ = (j^′, t + 1) with i≠i^′ and j≠j^′, and we have either i < j and i^′ > j^′, or i > j and i^′ < j^′. We also have |i − i^′|≤ nr and |j − j^′|≤ r, and a simple calculation shows |i^′− j|≤ (n + 1)r ≤ ℓr. Thus w₁ is also a potential support point for u with a higher level, which contradicts v = supp(u). □

Next, we show that every element of each set of support points C_ℓ is close to the border of the polygon that contains it. We will use this fact to show that the total perimeter of all polygons in Q(C_ℓ) is bounded from below by a constant multiple of |C_ℓ|.

Lemma 5.7

Let n₀ ≤ ℓ ≤ k, let w = (i, t) ∈ C_ℓ, and let 〈P, X〉∈ Q(C_ℓ) be the space-time polygon containing w. Then there exists w^′ = (i^′, t^′) ∈ X such that |i − i^′|≤ ℓr and |t − s|≤ 1.

Proof

The case of ℓ = n₀ is clear since (0,T) is the only vertex of $C_{n_{0}}$ and the only vertex with y-coordinate at least T. Suppose thus that ℓ > n₀. Since w ∈ C_ℓ, there exist n < ℓ, a polygon 〈P^′, X^′〉∈ Q(C_n) and a vertex v = (j, t + 1) ∈ X^′ such that $\eta ^{t+1}_{j} \in S_{n}$ and supp(v) = w. In particular, we have |i − j|≤ ℓr. Lemma 5.6 implies v∉P. Then the line segment $\overline {w v}$, which is of length at most ℓr + 1, intersects some segment of the border ∂P. One endpoint w^′ = (i^′, t^′) ∈ X of that segment satisfies |i − i^′|≤ ℓr and |t − t^′|≤ 1, and the claim holds. □

We now show that out of all vertices in the system $(Q(C_{n}))_{n=n_{0}}^{k}$, a positive fraction have type 2. Recall that we showed already in Lemma 5.2 that for each individual space-time polygon, a positive fraction of its vertices have type 1 or 2. The idea is that since type 1 vertices have support points, they give rise to new polygons of higher levels, and the polygons of level k, the maximum, have no type 1 vertices.

Lemma 5.8

There exists β > 0, depending only on the automaton f, with the following property. Let $\mathcal {X} = \bigcup _{n = n_{0}}^{k} \bigcup _{\langle P,X \rangle \in Q(C_{n})} X$ be the set of all border vertices in the system of polygons starting from the initial vertex $C_{n_{0}} = \{(0, T)\}$. Then $|\{ w \in \mathcal {X} | \text {\textit {w} has type 2} \}| \geq \upbeta |\mathcal {X}|$.

Proof

For n₀ ≤ n ≤ k, denote $\mathcal {X}^{a}_{n} = \{ (i, t) \in \mathcal {X} | (i,t) \text {has type} \textit {a} \text {and} \eta ^t_i \in S_n \}$ and $\mathcal {X}_{n} = \bigcup _{a \in \{1,2,3\}} \mathcal {X}^{a}_{n}$. We define a process where each set $\mathcal {X}_{n}$ is given a non-negative weight D(n), and the weights are iteratively re-distributed in a way that preserves their sum, which is precisely the number of vertices of type 2. We prove a positive lower bound on $D(n) / |\mathcal {X}_{n}|$ for each n at the end of the process, which implies the claim of the Lemma.

More formally, for all n₀ ≤ ℓ ≤ k + 1, we define a weight distribution D_ℓ : {n₀, $ \ldots ,k\} \to \mathbb {R}$ with ${\sum }_{n = n_{0}}^{k} D_{\ell }(n) = |\{ w \in \mathcal {X} | \text {\textit {w} has type 2} \}|$. The initial distribution D_k+ 1 is defined by $D_{k+1}(n) = |\mathcal {X}^{2}_{n}|$. Denote β_k+ 1 = 1 and β_n = δ_nβ_n+ 1/(2rk + 2) < β_n+ 1 for n ≥ k, where δ_n > 0 is given by Lemma 5.2. In the course of the construction, we maintain the following invariants:

For each n₀ ≤ ℓ ≤ n ≤ k, we have $D_{\ell }(n) \geq {\upbeta }_{n} |\mathcal {X}_{n}|$.
For each n₀ < ℓ ≤ k + 1, we have $D_{\ell }(\ell -1) \geq \delta _{\ell -1} {\upbeta }_{\ell } |\mathcal {X}_{\ell -1}|$.

For ℓ = k, k − 1,…,n₀, we compute the new distribution D_ℓ from D_ℓ+ 1 as follows. For each p < ℓ and each pair $(v, w) \in \mathcal {X}_{p} \times C_{\ell }$ such that w is the support point of v, we transfer β_ℓ units of weight from $\mathcal {X}_{\ell }$ to $\mathcal {X}_{p}$. The resulting weight distribution is D_ℓ.

We now show that the invariants hold for D_ℓ, and for the first one, let n₀ ≤ ℓ ≤ n ≤ k. If ℓ < n, then $D_{\ell }(n) = D_{\ell +1}(n) \geq {\upbeta }_{n} |\mathcal {X}_{n}|$ by the induction hypothesis and the fact that $\mathcal {X}_{n}$ retains its weight in the construction of D_ℓ. Suppose then that ℓ = n. By Lemma 5.7, each w ∈ C_ℓ is within distance rk from some vertex of $\mathcal {X}_{\ell }$ with the same y-coordinate, implying $|C_{\ell }| \leq (2 k r + 2) |\mathcal {X}_{\ell }|$. Since each such w can be the support point for at most 2rk + 1 other vertices, at most ${\upbeta }_{\ell } (2 r k + 2)(2 r k + 1) = \delta _{\ell } {\upbeta }_{n+1} \frac {2 r k + 1}{2 r k + 2}$ units of weight were transferred from $\mathcal {X}_{\ell }$ via C_ℓ. On the other hand, we have $D_{\ell +1}(\ell ) \geq \delta _{\ell } {\upbeta }_{\ell +1} |\mathcal {X}_{\ell }|$ by the induction hypothesis. This implies

$$ D_{\ell}(\ell) \geq D_{\ell+1}(\ell) - |\mathcal{X}_{\ell}| \delta_{\ell} {\upbeta}_{\ell+1} \frac{2 r k + 1}{2 r k + 2} \geq {\upbeta}_{\ell} |\mathcal{X}_{\ell}| $$

For the second invariant, let n₀ < ℓ ≤ k + 1 and consider the weight D_ℓ(ℓ − 1). In the case ℓ = k + 1, we have $D_{k+1}(k) = |\mathcal {X}^{2}_{k}| \geq \delta _{k} {\upbeta }_{k+1} |\mathcal {X}_{k}|$ by Lemma 5.2, since $\mathcal {X}^{1}_{k}$ is empty and β_k+ 1 = 1 by definition. Suppose then that ℓ ≤ k. For each type-1 vertex $w \in \mathcal {X}_{\ell -1}$ with support point of level n ≥ ℓ, exactly β_n ≥β_ℓ units of weight have been transferred to $\mathcal {X}_{\ell -1}$. In addition, it has the initial weight of $|\mathcal {X}^{2}_{\ell -1}|$ given by the type-2 vertices. Since $|\mathcal {X}^{1}_{\ell -1}| + |\mathcal {X}^{2}_{\ell -1}| \geq \delta _{\ell -1} |\mathcal {X}_{\ell -1}|$ by Lemma 5.2, this implies

$$ D_{\ell}(\ell-1) \geq {\upbeta}_{\ell} |\mathcal{X}^{1}_{\ell-1}| + |\mathcal{X}^{2}_{\ell-1}| > \delta_{\ell-1} {\upbeta}_{\ell} |\mathcal{X}_{\ell-1}| $$

which is what we wanted to prove.

All in all, we have shown that $D_{n_{0}}(\ell ) \geq {\upbeta }_{n_{0}} |\mathcal {X}_{\ell }|$ holds for all n₀ ≤ ℓ ≤ k. Since ${\upbeta }_{n_{0}}$ depends only on the automaton f, we can choose $\upbeta = {\upbeta }_{n_{0}}$, and the proof is complete. □

Next, we estimate the number of different space-time polygon systems rooted at the coordinate (0,T) having a certain perimeter size $|\mathcal {X}| = H$. Knowing this size, we can characterize the entire system $(Q(C_{n}))_{n = n_{0}}^{k}$ as follows. First, for each polygon 〈P, X〉∈ Q(C_n) in the system, there exists a vertex w = (i, t) ∈ X ∩ C_n (for example, one that maximizes t). If n > n₀, there also exist ℓ < n, another polygon 〈P^′, X^′〉∈ Q(C_ℓ), and a vertex w^′ = (j, t + 1) ∈ X^′ such that w = supp(w^′) and |i − j|≤ kr. We fix one such pair (w, w^′) and call it the base of 〈P, X〉. Now, we construct a list L of vertices as follows. We begin with the polygon 〈P, X〉 containing the topmost vertex w₁ = (0,T), and set L₁ = X = 〈w₁,…,w_|X|〉.

Suppose now that we have constructed a list L_p for some p ≥ 1. If there is a polygon 〈P^′, X^′〉 that has not been processed yet, and its base (w^′, w) has a vertex w ∈ L_p, then we replace w in L_p by the list of vertices 〈w, w^′, w1′,…,wq′,w^′, w〉 where X^′ = 〈w^′, w1′,…,wq′〉, and denote the resulting list by L_p+ 1. If such a polygon does not exist, then the list L_p = L = 〈w₁,…,w_s〉 contains every vertex in $\mathcal {X}$ at least once and at most 2kr times, so that |L|≤ 2krH. Furthermore, for each index i ∈{1,…,s} we have w_i+ 1 − w_i ∈{−kr,…,kr}×{− 1,0,1}. Since the list L, together with the mapping L →{1,…,k} that sends each vertex (i, t) to the number n such that ${\eta ^{t}_{i}} \in S_{n}$, characterizes $\mathcal {X}$ completely, the number of different systems $(Q(C_{n}))_{n = n_{0}}^{k}$ with $|\mathcal {X}| = H$ is at most ${\sum }_{s = H}^{2 k r H} (3 k (2 k r + 1))^{s} \leq (3 k (2 k r + 1))^{(2 k r + 1) H}$.

Now, if we have ${\eta ^{T}_{0}} > 0$, then there exists a system $\mathcal {P} = (Q(C_{n}))_{n = 1}^{k}$ of time-space polygons of some size H > 0 rooted at (0,T); we denote this event by $W(\mathcal {P})$. By Lemma 5.8, such a system contains at least βH vertices of type 2, which correspond to a subset of the random error set E. Since η is generated by an 𝜖-perturbation of the cellular automaton f, the probability of $W(\mathcal {P})$ is at most 𝜖^βH. Thus we have

$$ \begin{array}{@{}rcl@{}} \text{Pr}[ {\eta^{T}_{0}} > 0 ] & \leq& \sum\limits_{H = 1}^{\infty} \sum\limits_{\mathcal{P}} \text{Pr}[ W(\mathcal{P}) ] \leq \sum\limits_{H = 1}^{\infty} (3 k (2 k r + 1))^{(2 k r + 1) H} \epsilon^{\upbeta H} \\ & =& \sum\limits_{H = 1}^{\infty} \left( \frac{(3 k (2 k r + 1))^{2 k r + 1}}{\epsilon^{\upbeta}} \right)^{H} \overset{{\epsilon \to 0} }{{\kern1pt}\longrightarrow}{0} \end{array} $$

since we have 𝜖^β < (3k(2kr + 1))^2kr+ 1 for all small enough 𝜖. Then f is a stable eroder, and we have proved the first half of Theorem 4.7.

6 Stability Condition is Necessary

In this section, we prove the second half of Theorem 4.7: every one-dimensional stable eroder satisfies the stability condition. The idea of the proof is the following. We assume that a CA f does not satisfy the stability condition, and our goal is to show that for all 𝜖 > 0, there exists a finite island that f almost surely never erodes, if each coordinate of the trajectory contains an error with probability of 𝜖 independently of the others. Intuitively, the random errors extend the borders of the island faster than the automaton can erode it. We begin by proving an alternative formulation of the stability condition for one-dimensional automata.

Lemma 6.1

The automaton f satisfies the stability condition if and only if for all quiescent a ∈ S ∖{0} there exist quiescent b < a such that L_{b, a} > R_{a, b}.

Proof

Suppose that f satisfies the latter condition. Denote a₁ = m. Then there exist quiescent a₂ < a₁ such that $L_{a_{2},a_{1}} > R_{a_{1},a_{2}}$. Iterating this argument at most |S|− 1 times, we find a sequence $(a_{i})_{i=1}^{k}$ of quiescent states such that m = a₁ > a₂ > ⋯ > a_k = 0 and $L_{a_{i+1},a_{i}} > R_{a_{i},a_{i+1}}$ for all i ∈{1,…,k − 1}. This is precisely the stability condition.

Suppose then that f satisfies the stability condition with the sequence 0 = a₁ < ⋯ < a_k = m, and let a ∈ S be quiescent. There is a unique i ∈{1,…,k − 1} such that a_i < a ≤ a_i+ 1. By the definition of a_i and a_i+ 1, we have $L_{a_{i},a_{i+1}} > R_{a_{i+1},a_{i}}$. From (1) we deduce $L_{a_{i},a} \geq L_{a_{1},a_{i+1}}$, and (2) gives $R_{a_{i+1},a_{i}} \geq R_{a,a_{i}}$. Thus we have $L_{a_{i},a} > R_{a,a_{i}}$, which is what we wanted to show. □

Now, we define a special kind of 𝜖-perturbation of a cellular automaton: one where errors occur independently with probability 𝜖, and always produce a fixed state a ∈ S unless it would cause the state to decrease. Our goal is to prove that if a CA fails to satisfy the condition of Lemma 6.1, then these perturbations form a counterexample to it being a stable eroder.

Definition 6.2

Let f be a cellular automaton on $S^{\mathbb {Z}}$, let a ∈ S, and let 𝜖 > 0. The independent a, 𝜖-perturbation of f is the stochastic symbolic process R defined as follows. Let $E \subset \mathbb {Z} \times \mathbb {N}$ be a random set, where each $v \in \mathbb {Z} \times \mathbb {N}$ belongs to E independently with probability 𝜖. Then R(x) = R_E(x) is defined by

$$ R(x)^{t+1}_{i} = \left\{ \begin{array}{ll} \max(a, f(R(x)^{t})_{i}), & \text{if~} (i, t) \in E. \\ f(R(x)^{t})_{i}, & \text{if~} (i, t) \notin E. \end{array} \right. $$

The set E is called the underlying error set.

Note that R(x)^t+ 1 ≥ f(R(x)^t) holds for all $x \in S^{\mathbb {Z}}$ and $t \in \mathbb {N}$, since an error can only increase the value of a coordinate.

The following result is a stronger version of Proposition 3.3. It tells us that in order to prove that a CA f is not a stable eroder, it suffices to find a finite island that any given independent perturbation of f never erodes away with arbitrarily high probability. Note that we fix a sequence of coordinates that the evolving island must contain. Also, we cannot require that an island survives forever with probability 1, since f might be an eroder, and there is a small but positive probability that no errors occur in the vicinity of the island before it is eroded away.

Lemma 6.3 (Proposition 3 of 10)

Let f be a one-dimensional monotonic CA, let a ∈ S, and let $R^{a}_{\epsilon }$ be the independent a, 𝜖-perturbation of f. Suppose that for all $N \in \mathbb {N}$, there exist a 0-island $x^{N} \in S^{\mathbb {Z}}$ with $x^{N} \leq \hat {a}$ and a sequence of coordinates $({i^{N}_{t}})_{t \in \mathbb {N}}$ such that for all 𝜖 > 0, we have

$$ \inf_{t \in \mathbb{N}} \text{Pr} [R^{a}_{\epsilon}(x^{N})^{t}_{{i^{N}_{t}}} = a] \stackrel{N \to \infty}{\longrightarrow} 1 $$

Then f is not a stable eroder.

For the remainder of this section, we fix a monotonic automaton f that does not satisfy the condition of Lemma 6.1. Then there exists a quiescent state ω ∈ S ∖{0} such that L_{a, ω} ≤ R_{ω, a} for all quiescent a < ω. If there are several such states, let ω be the lowest one. We fix a small 𝜖 > 0, and let R be the independent ω, 𝜖-perturbation of f. Note that a random trajectory R(x) will not contain a state greater than ω if $x \leq \hat {\omega }$, so we can safely forget the states {ω + 1,…,m} and assume S = {0,…,ω}.

Definition 6.4

Let a < ω be a quiescent state of f. We say that a is inductive if there exist positive real numbers 0 < δ_a < 1, 0 < 𝜃_a ≤ r and Q_a > 0 with the following property for all large enough N ≥ 1: If $x \in S^{\mathbb {Z}}$ is such that $x \geq \hat {a}$ and x_i ≥ ω for − N ≤ i ≤ N, then

$$ \text{Pr} \left[ \forall t \in \mathbb{N}, t(L_{a, \omega} - \theta_{a}) < i < t(R_{\omega,a} + \theta_{a}) : R(x)^{t}_{i} = \omega \right] \geq 1 - \delta_{a}^{N^{Q_{a}}} $$

The intuition for an inductive state a is that with a high probability, every large enough island of ω-states surrounded by a-states will spread at an average speed strictly greater than L_{a, ω} − R_{ω, a}. This causes a “cone” of ω-states to appear in the trajectory of the configuration. Our goal in the remainder of this section is to prove Proposition 6.5: every state a < ω is inductive. In particular, the lowest state 0 is inductive, and then we can apply Lemma 6.3 to the island configurations ${}^{\infty } 0 \omega ^{N} . \omega ^{N} 0^{\infty }$ to prove that f is not a stable eroder.

Proposition 6.5

Every quiescent state a < ω is inductive.

We will actually prove the following one-sided versions of inductivity, which makes the arguments conceptually simpler.

Lemma 6.6

Let 0 ≤ a < ω, 𝜃 > 0, 0 < δ < 1 and Q > 0. Suppose that we have

$$ \begin{array}{@{}rcl@{}} \text{Pr}[ \forall t \in \mathbb{N}, i < t(R_{\omega,a} + \theta_{a}) : R(x)^{t}_{i} = \omega ] \geq 1 - \delta^{N^{Q}} \end{array} $$

(4)

$$ \begin{array}{@{}rcl@{}} \text{Pr}[ \forall t \in \mathbb{N}, i > t(L_{a,\omega} - \theta_{a}) : R(y)^{t}_{i} = \omega ] \geq 1 - \delta^{N^{Q}} \end{array} $$

(5)

for $x = {}^{\infty } \omega . \omega ^{N} a^{\infty }$ and $y = {}^{\infty } a \omega ^{N} . \omega ^{\infty }$ and all large enough N. Then a is an inductive state.

Proof

Let K > 0 be given by Lemma 4.2. Denote $x = {}^{\infty } \omega . \omega ^{2 N} a^{\infty }$ and $y = {}^{\infty } a \omega ^{2 N} . \omega ^{\infty }$. Consider the configuration $z = {}^{\infty } a \omega ^{2 N} . \omega ^{2 N} a^{\infty }$, and construct a coupling of the three random trajectories R(x), R(y) and R(z) where the same underlying error set is used for each. Denote by P the event that $R(y)^{t}_{i} = R(x)^{t}_{i} = \omega $ for all $t \in \mathbb {N}$ and t(L_{a, ω} − 𝜃_a) − N ≤ i ≤ t(R_{ω, a} + 𝜃_a) + N. From (4), and (5), the assumption L_{a, ω} ≤ R_{ω, a} and the union bound, we obtain $\text {Pr} [P] \geq 1 - 2 \delta ^{N^{Q}}$.

If P occurs and N > r, then for all $t \in \mathbb {N}$, the distance between the boundaries of ω-states in R(y)^t and R(x)^t is at least 2r. This means that to the cellular automaton f, the trajectory R(z) looks locally like R(y) to the left of t(R_{ω, a} + 𝜃_a) + N, and like R(x) to the right of t(L_{a, ω} − 𝜃_a) − N. More formally, we can prove by induction on t that $R(z)^{t}_{i} = R(y)^{t}_{i}$ for all i ≤ t(R_{ω, a} + 𝜃_a) + N, and $R(z)^{t}_{i} = R(x)^{t}_{i}$ for all i ≥ (L_{a, ω} − 𝜃_a) − N. In particular, this shows that $R(z)^{t}_{i} = \omega $ for all $t \in \mathbb {N}$ and t(L_{a, ω} − 𝜃_a) ≤ i ≤ t(R_{ω, a} + 𝜃_a).

We choose δ_a = 2^1/Mδ where M ≥ 1 is such that δ_a < 1. By the above, these constants satisfy the conditions of Definition 6.4. □

Most of the remainder of this section is devoted to the proof of Proposition 6.5. It suffices to prove (4), since the other case is symmetric. We will use an auxiliary result about biased random walks.

Lemma 6.7

Let $(X_{n})_{n \in \mathbb {N}}$ be a sequence of independent random variables with Pr[X_n = 1] = p and Pr[X_n = 0] = 1 − p, where 0 < p < 1. For all 0 < q < p, there exists 0 < d < 1 such that

$$ \text{Pr} \left[ \exists n \geq 0 : N + \sum\limits_{i = 0}^{n} X_{n} \leq n q \right] \leq d^{N} $$

(6)

for all N ≥ 1.

Proof

Denote $S_{n} = {\sum }_{i = 0}^{n} X_{n}$ and δ = p − q. The expected value of S_n is of course pn. Using Hoeffding’s inequality for binomial distributions [5, Theorem 1], we compute

$$ \begin{array}{@{}rcl@{}} \text{Pr}[S_{n} \leq q n - N] &= & \text{Pr}[S_{n} \leq p n - (\delta n + N)] \\ {} &\leq& \exp(-2(\delta n + N)^{2}/n) \\ {} &\leq& \exp(-2 \delta N) \cdot \exp(-2 \delta^{2} n). \end{array} $$

By the union bound,

$$ \begin{array}{@{}rcl@{}} \text{Pr}[\exists n \geq 0: S_{n} \leq q n - N] \leq {} & \exp(-2 \delta N) {\sum}_{n \geq 0} \exp(-2 \delta^{2} n) \\ {} = {} & \exp(-2\delta N) / (1 - \exp(-2 \delta^{2})). \end{array} $$

Thus any $\exp (-2\delta ) < d < 1$ satisfies (6) for large enough N. By choosing d close enough to 1 we can establish it for all N. □

We prove Proposition 6.5 by downward induction on the state a, and the idea of the proof is this. We consider the configuration of (4) for a large $N \in \mathbb {N}$. By Lemma 4.10, there exists a quiescent state a < b ≤ ω with L_{b, a} = R_{ω, a}. Either b = ω or b is inductive. In the former case, we essentially have a biased random walk on the ω, a-interface of the configuration ${}^{\infty } \omega . \omega ^{N} a^{\infty }$. The random walk is slightly biased to the right of R_{ω, a}, and we can apply Lemma 6.7 to it. In the latter case, we maintain two regions in the random trajectory: an inner region of ω-states, and an outer region of quiescent states at least b. The surface of the outer region behaves like a random walk as in the first case, and the surface of the inner region is constantly “repaired” by randomly appearing patches of ω-states that produce small ω-cones as per the induction hypothesis. Before proceeding with the proof, we formalize the argument about random walks, since it is used in both cases. Recall the number K > 0 from Lemma 4.2.

Lemma 6.8

Let 0 < d < 1 be given by Lemma 6.7 for p = 𝜖^K+ 1 and q = p/2. Let a < b ∈ S be quiescent states with L_{b, a} = R_{b, a}. Let $M \in \mathbb {N}$, and let $x = {}^{\infty } b . b^{M + K} a^{\infty }$ be the step of type b, a shifted M + K steps to the right. Then

$$ \text{Pr} [ \forall t \in \mathbb{N}, i \leq (R_{b,a} + q) t : R(x)^{t}_{i} \geq b ] \geq 1 - d^{M} $$

Proof

We know that f^t(x)_i = b, and thus $R(x)^{t}_{i} \geq b$, holds for all t ≥ 0 and i < L_{b, a}t + M. Let $E \subset \mathbb {Z} \times \mathbb {N}$ be the underlying error set of R(x). Let us define a sequence of random variables $(X(t))_{t \in \mathbb {N}}$ with values in {0,1} as follows. For $t, s \in \mathbb {N}$, let ${i^{s}_{t}} \in \mathbb {Z}$ be the largest integer with f^s(R(x)^t)_i ≥ b for all $i \leq {i^{s}_{t}}$. Note that $i^{0}_{t+s} \geq {i^{s}_{t}} \geq {i^{0}_{t}} + L_{b,a} s - K$ for all $t, s \in \mathbb {N}$ (the latter inequality follows from Lemma 4.2). We set X(t) = 1 if and only if $({i^{1}_{t}} + j, t - 1) \in E$ holds for all j ∈{1,…,K + 1}. In that case we have $i^{0}_{t+1} \geq {i^{1}_{t}} + K + 1$, which implies $i^{0}_{t+s} \geq {i^{0}_{t}} + L_{b,a} s - K + 1$ for all s ≥ 1. In general, if 1 ≤ t₁ < t₂ < ⋯ < t_n are such that X(t_k) = 1 for all 1 ≤ k ≤ n, then $i^{0}_{t_{n} + s} \geq {i^{0}_{0}} + L_{b,a} s - K + n = L_{b,a} s + M + n$ for all s ≥ 1.

Each X(t) has value 1 independently with probability p = 𝜖^K+ 1, so the sum ${\sum }_{t = 1}^{T} X_{t}$ forms a simple random walk. By Lemma 6.7, the probability that ${i^{0}_{t}} < (L_{b,a} + q) t$ for some $t \in \mathbb {N}$ is then at most d^M. This is equivalent to $R(x)^{t}_{i} < b$ for some $t \in \mathbb {N}$ and i ≤ (L_{b, a} + q)t = (R_{b, a} + q)t. □

We proceed with the proof of Proposition 6.5 by (4). Assume first that b = ω, so that L_{ω, a} = R_{ω, a}. When N > K, we can apply Lemma 6.8 to M = N − K, and choose 𝜃_a = q = 𝜖^K+ 1/2, δ_a = d and Q_a = 1 to obtain (4).

Suppose now that a < b < ω, so that b is inductive by the induction hypothesis. Without loss of generality, we assume that R_{ω, a} > 0. If this is not the case, we compose f with a large right shift, which does not change its eroding properties. Define $x = {}^{\infty } \omega . \omega ^{N} a^{\infty }$ and 𝜃_a = 𝜖^K+ 1/3. We will determine the values of δ and Q later.

Definition 6.9

We say that a trajectory R(x) is ω-good if

$$ \forall t \in \mathbb{N}, i \leq t (R_{\omega,a} + \theta_{a}) : R(x)^{t}_{i} = \omega $$

(7)

and b-good if

$$ \forall t \in \mathbb{N}, i \leq t (R_{\omega,a} + 2 \theta_{a}) : R(x)^{t}_{i} \geq b $$

(8)

Our goal in this proof is to maintain an inner region of ω-states and a slightly wider outer region of states that are at least b. The above definitions formalize this goal. A trajectory is ω-good if we are able to maintain the inner region indefinitely, and b-good if we maintain the outer region indefinitely. As in the proof of the case b = ω, maintaining the outer region indefinitely is relatively simple.

Lemma 6.10

Let 0 < d < 1 be given by Lemma 6.7 for p = 𝜖^K+ 1 and q = p/2. For N > K, the probability of R(x) being b-good is at least 1 − d^N−K.

Proof

This follows directly from Lemma 6.8. □

We now turn to the problem of maintaining the inner region. For convenience, we make the following observation about the state b: in a trajectory that starts from a configuration above $\hat {b}$ and contains a long sequence of ω-states, with a high probability one finds a large rectangle of ω-states. This is simply because one expects to find an infinite cone of ω-states by the inductive hypothesis of Proposition 6.5, inside which the rectangle can be picked. See Fig. 8 for a visualization. Note that we have a lot of freedom in choosing the numbers α and β. In particular, we can choose α as large as we want, and β as small as we want. Recall the definition D_b = (L_{b, ω} + R_{ω, b})/2.

Observation 6.11

There exist numbers α > 0, 0 <β< 1 and 0 < λ < 1 with the following property. Let C, M ≥ 1 (not necessarily integers), and let $y \in S^{\mathbb {Z}}$ be such that y_i = ω for 0 ≤ i ≤ M and y_i ≥ b for i ≤ Cα. If M and C/M are large enough, then

$$ \text{Pr} \left[ \forall 0 \leq i \leq (R_{\omega,a} + \theta_{a}) C \upbeta, 0 \leq t \leq 2 C \upbeta : R(y)^{\lfloor C + t \rfloor}_{\lfloor C D_{b} + i \rfloor} = \omega \right] \geq 1 - \lambda^{M^{Q_{b}}} $$

If this event occurs, we call the set [CD_b, CD_b + (R_{ω, a} + 𝜃_a)Cβ] × [C, C + 2Cβ] an ω-rectangle.

The constant α in the Observation should be chosen so that any coordinate of y to the right of Cα has no influence on the evolution of the ω-rectangle. Such a choice is possible due to the finite radius of f. The rectangle is nonempty for large enough C, since we assumed R_{a, ω} > 0. We will now dispose of the requirement of ω-states in Observation 6.11 by considering a smaller ω-rectangle.

Lemma 6.12

Let C ≥ 1, and let $y \in S^{\mathbb {Z}}$ satisfy y_i ≥ b for i ≤ C(α + rβ). If C is large enough, then

$$ \begin{array}{@{}rcl@{}} && \text{Pr} \left[ \forall 0 \leq i \leq (R_{\omega,a} + \theta_{a}) C \upbeta, C \upbeta \leq t \leq 2 C \upbeta : R(y)^{t + \lfloor C \rfloor}_{i + \lfloor C D_{b} \rfloor} = \omega \right] \\ && \geq {} 1 - \lambda^{C^{Q_{b}/3}} \end{array} $$

with the notation of Observation 6.11.

Proof

The idea of the proof is to consider a family of $\sqrt {C \upbeta }$ rectangles that overlap in the desired region. Each of them can be generated by a short horizontal segment of ω-states that happens to occur in the trajectory at the right position. Instead of applying Observation 6.11 directly to these segments, we only consider those segments that grow at a linear pace for $\sqrt {C \upbeta }$ steps, which produces a better estimate for the probability of the final rectangle.

Denote $u = \lfloor \sqrt {C \upbeta } \rfloor $, and for 0 ≤ n < u, denote t_n = nu. Let M ≥ 1 be an integer constant so large that Observation 6.11 holds for it, and $\gamma = \epsilon ^{M + 1} + \delta _{b}^{M^{Q_{b}}} < 1$. Denote by P₁(n) the event that (i −⌈D_bu⌉,t_n − 1) ∈ E for all 0 ≤ i ≤ M, and by P₂(n) the event that $R(y)^{t_{n+1}}_{i} = \omega $ for all 0 ≤ i ≤ (R_{ω, a} + 𝜃_a)uβ.

We now estimate the probabilities of P₂(n) and the occurrence of the ω-rectangle. We have Pr[P₁(n)] = 𝜖^M+ 1 for each 0 ≤ n < u − 1, since the elements of E are chosen independently. If P₁(n) occurs, then $R(y)^{t_{n}}_{i - \lceil D_{b} u \rceil } = \omega $ for each 0 ≤ i ≤ M. Furthermore, we have

$$ R(y)^{t}_{i} \geq b \text{~for all~} t \geq 0, i \leq C (\alpha + r \upbeta) - r t $$

(9)

since r is a radius for f. Since t_n ≤ Cβ and b is a quiescent state, this implies $R(y)^{t_{n}}_{i - \lceil D_{b} u \rceil } \geq b$ for all i ≤ Cα + ⌈D_bu⌉. Noticing that Cα + ⌈D_bu⌉≥ uα, we now apply Observation 6.11 to (a shifted version of) $R(y)^{t_{n}}$ with the variables M and u. If C is large enough, this gives us $R(y)^{t_{n+1}}_{i} = \omega $ for all 0 ≤ i ≤ (R_{ω, a} + 𝜃_a)uβ with probability at least $1 - \lambda ^{M^{Q_{b}}}$. Hence we have $\text {Pr} [P_{2}(n) | P_{1}(n)] \geq 1 - \delta _{b}^{M^{Q_{b}}}$. This bound is valid even when conditioned on any combination of P₁(k) and P₂(k) for any k < n, since it only depends on the composition of the error set E between time steps t_n and t_n+ 1. As long as C is large enough, the probability of P₂(n) occurring for some 0 ≤ n < u − 1 is thus at least 1 − (1 − γ)^u− 1.

Next, let P₃(n) denote the event that P₂(n) holds but P₂(k) does not hold for any k < n. If P₃(n) holds, then (9) implies $R(y)^{t_{n+1}}_{i} \geq b$ for all i ≤ Cα, since t_n+ 1 ≤ Cβ. We can then apply Observation 6.11 to $R(y)^{t_{n+1}}$ with the variables (R_{ω, a} + 𝜃_a)uβ and C. This yields

$$ \begin{array}{@{}rcl@{}} \begin{array}{ll} &\!\!\!\!\!\!\!\!\!\! \text{Pr} \left[ \forall 0 \leq i \leq (R_{\omega,a} + \theta_{a}) C \upbeta, 0 \leq s \leq 2 C \upbeta : R(y)^{t_{n+1} + \lfloor C + s \rfloor}_{\lfloor C D_{b} + i \rfloor} = \omega \middle| P_{3}(n) \right] \\ &\!\!\!\!\!\!\!\!\!\!\geq 1 - \lambda^{((R_{\omega,a} + \theta_{a}) u \upbeta)^{Q_{b}}} \end{array} \end{array} $$

(10)

which implies the event in the statement of this Lemma, as t_n+ 1 ≤ Cβ < C.

If C is large enough, we have $\lambda ^{C^{Q_{b}/3}} \geq (1 - \gamma )^{u-1} + \lambda ^{((R_{\omega ,a} + \theta _{a}) u \upbeta )^{Q_{b}}}$. The claim now follows from inequality (10), the union bound and the fact that the events P₃(n) form a partition of the union of the events P₂(n). □

In the case that the outer region can be maintained indefinitely, in the sense that the trajetory is b-good, the evolution of the inner region can be seen as a two-dimensional generalized bootstrap percolation process. Initially, all cells (i,0) with i ≤ N are active. Lemma 6.12 implies the following for all large enough C. If i + C(α + rβ) ≤ t(R_{ω, a} + 2𝜃_a), so that all cells between (i, t) and (i + C(α + rβ),t) are within the outer region, then with a combined probability of at least $1 - \lambda ^{C^{Q_{b}/3}}$, the cell (i + j, t + s) is active for each (j, s) ∈ [CD_b, CD_b + (R_{ω, a} + 𝜃_a)Cβ] × [C, C + Cβ]. These probabilities are independent for those choices of (i, t) that are far enough from each other, depending on the respective choices of C, and all of them are positively correlated. In this way, we obtain an initial distribution $A_{0} \subset \mathbb {Z} \times \mathbb {N}$ of active cells in a two-dimensional random configuration in the form of a half-infinite line at time 0 and a random collection of rectangles. The set A_k+ 1 contains all cells of A_k, and each cell (i, t) such that (i + j, t − 1) ∈ A_k for each − r ≤ j ≤ r. As a limit of this percolation process, we obtain a set $A_{\infty } = \bigcup _{k = 0}^{\infty } A_{k}$ of active cells, with the property that $R(x)^{t}_{i} = \omega $ for all $(i, t) \in A_{\infty }$. Thus, the conditional probability of R(x) being ω-good given that it is b-good is at least the probability of $(i, t) \in A_{\infty }$ for all t ≥ 0 and i ≤ t(R_{ω, a} + 𝜃_a).

The following lemma establishes a lower bound for the probability of maintaining the inner region indefinitely, expressed as a property of the set $A_{\infty }$. The idea of the proof is to show that with high probability the rectangles in A₀, together with the cells activated by the initial half-line, cover the entire discrete line of coordinates (i, t) with i ≈ t(R_{ω, a} + 𝜃_a).

Lemma 6.13

There exists ξ > 0 with the following property. Suppose that R(x) is b-good. If N is large enough, the conditional probability of R(x) being ω-good is at least $1 - \xi ^{Q_{b}/3}$.

Proof

Let α,β,λ be given by Observation 6.11. Denote

$$ \rho = \frac{\theta_{a}}{\alpha + r \upbeta - D_{b} + R_{\omega,a} + 2 \theta_{a}} $$

We may choose α so large that 0 < ρ < 1. We define a sequence of time steps by $t_{k} = \lceil \frac {N}{4 r} (1 + \rho \upbeta /2)^{k/2} \rceil $ for all k ≥ 0. Consider one of these time steps t_k, and set i_k = ⌊t_k(R_{ω, a} + 𝜃_a)⌋. Denote also C_k = ρt_k, j_k = i_k −⌈C_kD_b⌉ and s_k = t_k −⌊C_k⌋. If α and N are large enough, we have j_k, s_k ≥ 1 for all k ≥ 0. Denote by P(k) the event that (i_k + i, t_k + t) ∈ A₀ for all 0 ≤ i ≤ (R_{ω, a} + 𝜃_a)C_kβ and 0 ≤ t ≤ C_kβ. This is the rectangle we are seeking to produce. See Fig. 9 for a visualization.

We claim that if P(k) holds for all k ≥ 0, then $(i,t) \in A_{\infty }$ for all $t \in \mathbb {N}$ and i ≤ t(R_{ω, a} + 𝜃_a). First, we have $(i,0) \in A_{\infty }$ for all i ≤ N, and hence $(i,t) \in A_{\infty }$ for all $t \leq t_{0} = \lceil \frac {N}{4 r} \rceil $ and i ≤ N − rt by the definition of the sets A_k. Since R_{ω, a} + 𝜃_a ≤ 2r, this holds for all t ≤ t₀ and i ≤ t(R_{ω, a} + 𝜃_a) as long as N is large enough. Hence the claim holds up to t₀.

Suppose then that the claim holds up to some t_k. Since P(k) holds, we have (i, t) ∈ A₀ for all t_k ≤ t ≤ (1 + ρβ)t_k and i_k ≤ i ≤ i_k + ρβt_k(R_{ω, a} + 𝜃_a). If N (and hence t_k) is large enough, we have (1 + ρβ)t_k > t_k+ 1 and

$$ i_{k} + \rho \upbeta (R_{\omega,a} + \theta_{a}) t_{k} \geq t_{k} (\rho \upbeta + 1) (R_{a, \omega} + \theta_{a}) - 1 > i_{k+1} + r $$

so that (i, t) ∈ A₀ for all t_k ≤ t ≤ t_k+ 1 and i_k ≤ i ≤ i_k+ 1 + r. By the induction hypothesis, we also have $(i, t_{k}) \in A_{\infty }$ for all i ≤ t_k(R_{ω, a} + 𝜃_a). For each t_k ≤ t < t_k+ 1, the condition $(i, t) \in A_{\infty }$ for all i ≤ i_k+ 1 + r implies $(i, t+1) \in A_{\infty }$ for all i ≤ i_k+ 1 by the definition of the A_k, so by an inductive argument we obtain $(-\infty , i_{k+1}] \times [t_{k}, t_{k+1}] \subset A_{\infty }$. Since i_k+ 1 ≥ t_k+ 1(R_{ω, a} + 𝜃_a) ≥ t(R_{ω, a} + 𝜃_a) in this time interval, we have shown that the claim holds up to t_k+ 1.

We now estimate the probability of the P(k), given that R(x) is b-good. We compute

$$ \begin{array}{@{}rcl@{}} j_{k} + C_{k}(\alpha + r \upbeta) &\leq {} & t_{k} (R_{\omega,a} + \theta_{a}) + C_{k} (\alpha + r \upbeta - D_{b}) \\ &{} \stackrel{(*)}{=} {} & t_{k} (R_{\omega,a} + 2 \theta_{a}) - C_{k} (R_{\omega,a} + 2 \theta_{a}) \\ {}& \leq {} & s_{k} (R_{\omega,a} + 2 \theta_{a}) \end{array} $$

where the equality (∗) follows from C_k = ρt_k. By the definition of A₀, this implies (i_k + i, t_k + t) ∈ A₀ for each 0 ≤ i ≤ (R_{ω, a} + 𝜃_a)C_kβ and 0 ≤ t ≤ C_kβ – which is exactly the event P(k) – with probability at least $1 - \lambda ^{C_{k}^{Q_{b}/3}}$. From the union bound we obtain $\text {Pr} [ \forall k \geq 0 : P(k) | G_{b}] \geq 1 - {\sum }_{k = 0}^{\infty } \lambda ^{C_{k}^{Q_{b}/3}}$, where G_b is the event that R(x) is b-good.

We now compute $C_{k}^{Q_{b}/3} \geq (\frac {\rho N}{4 r} (1 + \rho \upbeta )^{k/2})^{Q_{b}/3} = \gamma N^{Q_{b}/3} \phi ^{k}$, where $\gamma = (\frac {\rho }{4 r})^{Q_{b}/3}$ and $\phi = (1 + \rho \upbeta )^{Q_{b}/6}$ are constants independent of N and k. For large enough k, we have ϕ^k > k, so that

$$ \begin{array}{@{}rcl@{}} \sum\limits_{k = 0}^{\infty} \lambda^{C_{k}^{Q_{b}/3}} &\leq {} & \sum\limits_{k = 0}^{k_{0}-1} \lambda^{\gamma N^{Q_{b}/3} \phi^{k}} + \sum\limits_{k=k_{0}}^{\infty} (\lambda^{\gamma N^{Q_{b}/3}})^{k} \\ &{} = {} & \sum\limits_{k = 0}^{k_{0}-1} \lambda^{\gamma N^{Q_{b}/3} \phi^{k}} + \frac{\lambda^{\gamma k_{0} N^{Q_{b}/3}}}{1 - \lambda^{\gamma N^{Q_{b}/3}}} \end{array} $$

Since this is a finite sum with a constant number of terms, each of which is $\exp (-$ ${\Theta }(N^{Q_{b}/3}))$, the claim follows. □

Together with the union bound, Lemma 6.10 and Lemma 6.13 imply that the probability of R(x) being ω-good and b-good is at least $1 - d^{N - K} - \xi ^{N^{Q_{b}/3}}$ for large enough N. We now choose ξ < δ_a < 1 arbitrarily, and set Q_a = Q_b/3. With these choices (4) holds, which finishes the proof of Proposition 6.5.

Proof of second half of Theorem 4.7

Let $f : S^{\mathbb {Z}} \to S^{\mathbb {Z}}$ be a monotonic cellular automaton that does not satisfy the stability condition. Let 𝜖 > 0 and let R be the independent maximizing 𝜖-perturbation of f. By Lemma 6.1, there exists a quiescent state ω ∈ S ∖{0} such that L_{a, ω} ≤ R_{ω, a} for all a < ω. By Proposition 6.5, every quiescent state a < ω is inductive, so in particular 0 is an inductive state. Thus there exist 0 < δ₀ < 1 and Q₀ > 0 such that, denoting $x = {}^{\infty } 0 \omega ^{N} . \omega ^{N} 0^{\infty }$, we have

$$ \text{Pr} [ \forall t \in \mathbb{N} : R(x)^{t}_{\lfloor t (L_{a,\omega} + R_{\omega,a}) / 2 \rfloor} = \omega ] \geq 1 - \delta_{0}^{N^{Q_{0}}} $$

for all large enough N. Lemma 6.3 implies that f is not a stable eroder. □

7 Further Results

We have presented a characterization of those one-dimensional monotonic cellular automata that erode finite islands in the presence of sufficiently low random noise. The characterization was given in terms of forcing sets (Definition 3.9 and Theorem 3.10), and alternatively in terms of Gal’perin rates (Theorem 4.7). Since the Gal’perin rates of a one-dimensional monotonic CA can be computed from the local rule [4, 9], we further obtain the following.

Corollary 7.1

Given the local rule of a one-dimensional monotonic cellular automaton, it is decidable whether the automaton is a stable eroder.

In the context of probabilistic cellular automata, another interesting property is ergodicity, meaning the phenomenon where the system’s initial state eventually becomes irrelevant. The ergodicity of random perturbations of deterministic cellular automata has recently been studied in [7]. The authors consider several classes of noisy CA, including nilpotent, permutive and linear, but not monotonic CA.

Definition 7.2

Let S be a state set and d ≥ 1, and let R be a stochastic symbolic process on $S^{\mathbb {Z}^{d}}$. We say R is ergodic, if the marginals R(μ)^t converge weakly to the same measure on $S^{\mathbb {Z}^{d}}$ for every choice of $\mu \in {\mathscr{M}}({S^{\mathbb {Z}^{d}}})$.

Consider a one-dimensional monotonic CA f on S = {0,…,m}, suppose that the states 0 and m are quiescent, and let R be an 𝜖-perturbation of f. We present some necessary or sufficient conditions for the ergodicity of R. Suppose first that R only introduces increasing errors (that is, R(x)^t+ 1 ≥ f(R(x)^t) holds for all $t \in \mathbb {N}$). Then we have $\lim _{t \to \infty } R(\hat {m})^{t} = \hat {m}$. If f a stable eroder, then $\lim _{t \to \infty } R(\hat {0})^{t} \neq \hat {m}$ for small enough 𝜖, so R is not ergodic.

Suppose then that f is not a stable eroder. Lemma 6.1 gives us a quiescent state ω > 0 with L_{a, ω} ≤ R_{ω, a} for all a < ω. We again choose ω to be minimal. In this case, if R is the independent ω, 𝜖-perturbation of f, the results of Section 6 imply that $\lim _{t \to \infty } \text {Pr} [\forall i \in C : R(\hat {0})^{t}_{i} \geq \omega ] = 1$ for any finite set $C \subset \mathbb {Z}$. Since f is monotonic, we can replace $\hat {0}$ by an arbitrary measure and the result still holds. If ω = m, this implies that R is ergodic. However, if ω < m, then R is not ergodic, since $R(\hat {0})^{t}$ converges to the point measure on $\hat {\omega }$. Furthermore, it can be the case that even the independent maximizing m, 𝜖-perturbation of f is not ergodic for any 𝜖. We show this by an example.

Example 7.3

Recall the CA $f : S^{\mathbb {Z}} \to S^{\mathbb {Z}}$ from Example 3.4. Let T = {0,1,2,3}, and for a ∈ T, denote $\bar {a} = \min \limits (a, 2)$. Let $g : T^{\mathbb {Z}} \to T^{\mathbb {Z}}$ be the cellular automaton defined by the local rule

$$ G(a,b,c) = \begin{cases} F(\bar{a},\bar{b},\bar{c}), & \text{if~} b \leq 2, \\ 3, & \text{if~} a = b = c = 3, \\ \bar{b}, & \text{otherwise.} \end{cases} $$

The CA g behaves exactly like f on $S^{\mathbb {Z}}$. There is also a new state, 3, that always erodes away at linear speed. It is easy to see that g is a monotonic eroder, and it is not a stable eroder, since f is not. Consider then the restriction of g on $\{2,3\}^{\mathbb {Z}}$. This binary CA satisfies L_2,3 = 1 > − 1 = R_3,2, so it is a stable eroder. In particular, the independent 3,𝜖-perturbation R of g is not ergodic for any 𝜖 > 0, since $\lim _{t \to \infty } R(\hat {2})^{t} \neq R(\hat {3})$. In this example, we have ω = 2.

Finally, consider the two-dimensional cellular automaton f with state set {0,1} and neighborhood N = {(0,0),(1,0),(0,1)}, where the local rule always chooses the majority state in the three cells of N. This automaton was first studied by Toom. It is monotonic, and we can apply Proposition 3.8 to show that it is a stable eroder. In fact, since f is symmetric with respect to switching the states 0 and 1, it is also a stable eroder toward 1, meaning that if a small perturbation of f is initialized on the all-1 configuration, the probability of any single cell to contain 0 is low. With our results, we can show that in the one-dimensional case, such an automaton does not exist.

Proposition 7.4

Let S = {0,…,m}, and let $f : S^{\mathbb {Z}} \to S^{\mathbb {Z}}$ be a monotonic CA. Let $g : S^{\mathbb {Z}} \to S^{\mathbb {Z}}$ be the CA defined by g(x)_i = m − x_i. If f is a stable eroder, then h = g ∘ f ∘ g is not an eroder.

Proof

Since f is a stable eroder, Theorem 4.7 implies the existence of a quiescent state a < m with L_{a, m} > R_{m, a}. If h was an eroder, we would have R_{m, b} > L_{b, m} for all b < m, which is impossible. □

On the other hand, there does exist a one-dimensional monotonic CA that is an eroder in both directions. The classical non-monotonic example of this phenomenon is the Gács-Kurdyumov-Levin automaton [2], which has two states. A four-state radius-1 example is given in [12, Section 8.2]. It is monotonic, but the alphabet is only partially ordered ({0,1}² with component-wise comparisons). We present a monotonic three-state CA with a linearly ordered alphabet.

Example 7.5

Let S = {0,1,2}, and let $f : S^{\mathbb {Z}} \to S^{\mathbb {Z}}$ be the radius-2 CA defined by the local rule

$$ F(a,b,c,d,e) = \begin{cases} 0, & \text{if~} \max(a, b, c, d) \leq 1, e = 0, \\ 1, & \text{if~} c = 0, \min(d, e) \geq 1, \\ 1, & \text{if~} \max(a, b) \leq 1, c = 2, \\ 2, & \text{if~} a = 2, \min(b, c, d, e) \geq 1, \\ c, & \text{otherwise.} \end{cases} $$

A case analysis shows that f is monotonic. By analyzing the behavior of f on different steps, we can compute R_0,1 = − 1 > − 2 = L_1,0 and R_0,2 = 1 > 0 = L_2,0. By Theorem 4.6, f is an eroder. If we define g(x)_i = 2 − x_−i, then g ∘ f ∘ g = f, which means that the symbol-inverted version of f behaves like the left-right-inverted version of f. In particular, the former is also an eroder. Of course, f is not a stable eroder.

8 Open Problems

The results proved in this article concern one-dimensional cellular automata on linearly ordered alphabets of size at least 3: we prove that such a CA is a stable eroder if and only if it satisfies the stability condition. As stated in Section 3, we do not know if our results generalize to the multi-dimensional case, nor whether the class of multi-dimensional stable eroders is decidable. For the binary alphabet, the answer to both questions is positive due to Proposition 3.8.

Problem 8.1

Let d ≥ 2, m ≥ 2 and S = {0,1,…,m}. Does a monotonic cellular automaton $f : S^{\mathbb {Z}^{d}} \to S^{\mathbb {Z}^{d}}$ satisfy the stability condition if and only if it is a stable eroder?

Note that we have formulated the stability condition in all dimensions (see Definition 3.9), so the direct generalization of Theorem 3.10 from $\mathbb {Z}$ to $\mathbb {Z}^{d}$ for all d ≥ 1 is well-defined. Of course, it is also possible that stable eroders are characterized by a simple geometric condition, but our version is not the correct one.

The class of multi-dimensional monotonic eroders with more than two states is likewise mysterious. To our knowledge, its decidability status is unknown, and no simple characterization has been conjectured. Another property related to eroders is the speed of erosion. We discussed this subject briefly in Section 4. For a fixed one-dimensional eroder f, every finite island x satisfies $f^{n}(x) = \hat {0}$ for some n ≥ 0 that is linear in the diameter of x, and the rate is computable from f. In the multi-dimensional context this is no longer true [8], and it is unknown whether the time of erosion can exhibit busy beaver-type behavior.

Problem 8.2

Let d ≥ 2, m ≥ 2 and S = {0,1,…,m}. Let $f : S^{\mathbb {Z}^{d}} \to S^{\mathbb {Z}^{d}}$ be a monotonic eroder. Is there necessarily a computable function $T : \mathbb {N} \to \mathbb {N}$ such that $f^{T(n)}(x) = \hat {0}$ for all islands $x \in S^{\mathbb {Z}^{d}}$ with diameter n? Is there such a T that is uniformly computable from d, m and/or f?

Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Vorheriger Artikel The Complexity of Counting CSPd

Nächster Artikel Multi-Pass Streaming Algorithms for Monotone Submodular Function Maximization

Chalupa, J., Leath, P.L., Reich, G.R.: Bootstrap percolation on a Bethe lattice. J. Phys. C: Solid State Phys. 12(1), L31 (1979). http://stacks.iop.org/0022-3719/12/i=1/a=008

Gács, P., Kurdjumov, G.L., Levin, L.A.: One-dimensional uniform arrays that wash out finite islands. Probl. Pered. Inf. 14(3), 92–96 (1978). Engl. Transl. Probl. Inf. Transmission 14(3), 223–226 (1978)

Gal’perin, G.A.: One-dimensional automata networks with monotone local interaction. Probl. Pered. Inf. 12(4), 74–87 (1976). Engl. Transl. Probl. Inf. Transmission 12(4), 299–310 (1977)

Gal’perin, G.A.: Rates of the propagation of the interaction in one-dimensional automata networks. Probl. Pered. Inf. 13(1), 73–81 (1977). Engl. Transl. Probl. Inf. Transmission 13(1), 52–58 (1977)

Hoeffding, W.: Probability inequalities for sums of bounded random variables. J. Amer. Statist. Assoc. 58, 13–30 (1963). https://doi.org/10.2307/2282952MathSciNetCrossRef

Mairesse, J., Marcovici, I.: Around probabilistic cellular automata. Theor. Comput. Sci. 559, 42–72. https://doi.org/10.1016/j.tcs.2014.09.009. http://www.sciencedirect.com/science/article/pii/S0304397514006744. Non-uniform Cellular Automata (2014)

Marcovici, I., Sablik, M., Taati, S.: Ergodicity of some classes of cellular automata subject to noise. Electron. J. Probab. 24, 44. https://doi.org/10.1214/19-EJP297 (2019)

de Menezes, M.L., Toom, A.: A non-linear eroder in presence of one-sided noise. Brazil. J. Probab. Stat. 20(1), 1–12. http://www.jstor.org/stable/43601070 (2006)

de Santana, L.H., Leite, A., Toom, A.: Computing directional galperin’s rates. Proc. Ser. Brazil. Soc. Appl. Comput. Math. 2(1) (2014)

10.

Toom, A.: Unstable multicomponent systems. Probl. Pered. Inf. 12(3), 78–84. English translation: Problems of Information Transmission 12(3), 220–225 (1977) (1976)

11.

Toom, A.: Stable and Attractive Trajectories in Multicomponent Systems. In: Multicomponent Random Systems, Adv. Probab. Related Topics, vol. 6, pp 549–575. Dekker, New York (1980)

12.

Toom, A.L.: Cellular Automata with Errors: Problems for Students of Probability. In: Snell, J.L. (ed.) Topics in Contemporary Probability and Its Applications, Probability and Stochastic Series, pp 117–157. CRC Press, Boca Raton (1995)

Titel: Stable Multi-Level Monotonic Eroders
verfasst von: Péter Gács
Ilkka Törmä
Publikationsdatum: 26.10.2021
Verlag: Springer US
Erschienen in: Theory of Computing Systems / Ausgabe 1/2022
Print ISSN: 1432-4350
Elektronische ISSN: 1433-0490
DOI: https://doi.org/10.1007/s00224-021-10061-w

Springer Professional

Stable Multi-Level Monotonic Eroders

Abstract

Publisher’s Note

1 Introduction

2 Definitions

3 Eroders and Zero Sets

4 Eroders in One Dimension: Gal’perin Rates

5 Stability Condition is Sufficient

6 Stability Condition is Necessary

7 Further Results

8 Open Problems

Publisher’s Note

Premium Partner

Springer Professional

Abstract

Publisher’s Note

1 Introduction

2 Definitions

3 Eroders and Zero Sets

4 Eroders in One Dimension: Gal’perin Rates

5 Stability Condition is Sufficient

6 Stability Condition is Necessary

7 Further Results

8 Open Problems

Publisher’s Note

Weitere Artikel der Ausgabe 1/2022

FKT is Not Universal — A Planar Holant Dichotomy for Symmetric Constraints

The Power of the Weighted Sum Scalarization for Approximating Multiobjective Optimization Problems

Univariate Ideal Membership Parameterized by Rank, Degree, and Number of Generators

Multi-Pass Streaming Algorithms for Monotone Submodular Function Maximization

The Complexity of Counting CSPd

An Alternating Algorithm for Finding Linear Arrow-Debreu Market Equilibria

Premium Partner