Top

Cryptography and Communications

Published in:

Open Access 07-07-2021

Maximum order complexity of the sum of digits function in Zeckendorf base and polynomial subsequences

Authors: Damien Jamet, Pierre Popoli, Thomas Stoll

Published in: Cryptography and Communications | Issue 5/2021

Activate our intelligent search to find suitable subject content or patents.

search-config

AI-assisted search

Patentsearch

Off

Abstract

Automatic sequences are not suitable sequences for cryptographic applications since both their subword complexity and their expansion complexity are small, and their correlation measure of order 2 is large. These sequences are highly predictable despite having a large maximum order complexity. However, recent results show that polynomial subsequences of automatic sequences, such as the Thue–Morse sequence, are better candidates for pseudorandom sequences. A natural generalization of automatic sequences are morphic sequences, given by a fixed point of a prolongeable morphism that is not necessarily uniform. In this paper we prove a lower bound for the maximum order complexity of the sum of digits function in Zeckendorf base which is an example of a morphic sequence. We also prove that the polynomial subsequences of this sequence keep large maximum order complexity, such as the Thue–Morse sequence.

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

1 Introduction

Feedback shift register (FSR) sequences are used for many cryptographic applications such as pseudorandom number generators for stream cipher cryptosystems, see [10]. A binary n-stage feedback shift register (FSR) is a mapping $\mathfrak {F}$ from $\mathbb {F}_{2}^{n}$ to $\mathbb {F}_{2}^{n}$ of the form

$$ \mathfrak{F}: (x_{0},x_{1},\ldots,x_{n-1}) \mapsto (x_{1},x_{2},\ldots,x_{n-1},f(x_{0},x_{1},\ldots,x_{n-1})), $$

where f is a function from $\mathbb {F}_{2}^{n}$ to $\mathbb {F}_{2}$. Consider the binary sequence $\mathcal {S}=(s_{i})_{i\geq 0}$ whose first n terms are given and the remaining terms are uniquely determined by the recurrence relation

$$ s_{i+n}=f(s_{i},\ldots,s_{i+n-1}), \quad i\geq 0. $$

The sequence $\mathcal {S}$ is called the output sequence of the FSR. An output sequence of a short FSR is considered weak in cryptographic applications. In order to determine this shortness for an infinite sequence $\mathcal {S}$, Jansen [12, 13] introduced the notion of N th maximum order complexity of $\mathcal {S}$, denoted as $M(\mathcal {S},N)$, which is the length of the shortest FSR that generates the first N elements of $\mathcal {S}$. If the mapping $\mathfrak {F}$ is a linear transformation, then the FSR is called a linear feedback shift register (LFSR). This leads to the notion of the linear complexity profile of $\mathcal {S}$ denoted as $L(\mathcal {S},N)$. We have obviously $M(\mathcal {S},N)\leq L(\mathcal {S},N)$. The sequence $(M(\mathcal {S},N))_{N}$ is called the maximum order complexity profile of $\mathcal {S}$ and $M(\mathcal {S})=\sup _{N\geq 1} M(\mathcal {S},N)$ is called the maximum order complexity of $\mathcal {S}$.

A sequence $\mathcal {S}$ is said to have a perfect linear complexity profile if

$$ L(\mathcal{S},N) =\left\lceil \frac{n}{2} \right\rceil, \quad n\geq 1. $$

An explicit construction of sequences with perfect linear complexity profile is given in [1]. These are known under the name of apwenian sequences (see [17]). Diem [7] observed that these sequences and sequences based on function expansion into expansion series can be efficiently computed from relatively short sequences. This leads to the notion of expansion complexity, denoted as $E(\mathcal {S},N)$, see Definition 2 below for more details. It was proved by Mérai, Niederreiter and Winterhof [16] that the expansion complexity and the linear complexity of an infinite sequence satisfy $E(\mathcal {S},N)\leq L(\mathcal {S},N)+1$.

A natural question is to ask about the mutual relationship between the maximum order complexity and the expansion complexity. It is known that these two complexities have expected behavior $\log N$ for the maximum order complexity, see [12, 20], and $\sqrt {N}$ for the expansion complexity, see [17, Theorem 6.2] and [16]. This suggests that maximum order complexity is smaller than the expansion complexity. However, the Thue–Morse sequence has a large maximum order complexity and a bounded expansion complexity. Thus in order to determine the unpredictability of a sequence, both the expansion complexity and the maximum order complexity are useful and needed.

In recent years, research focused in particular on automatic sequences, i.e. sequences that are generated by a deterministic finite automaton. These sequences are not suitable for cryptographic applications as we will state later. Nevertheless, their pseudorandom behavior changes radically when the sequence is rarefied along special subsequences such as polynomial subsequences. One can extend the definition of automatic to morphic sequences which is an again larger class.

The aim of the present article is to study pseudorandomness of some morphic sequences and its polynomially rarefied subsequences. We first introduce several measures of complexity (Section 1.1) and illustrate their behavior with a simple example. We then introduce automatic and morphic sequences (Section 1.2) and give an overview about the results known for their measures of complexity. We then indicate our main results of this paper (Section 1.3).

1.1 Measures of complexity

Let us first introduce some measures of complexity that we will focus on in the present paper.

Definition 1 (Maximum order complexity)

Let N be a positive integer with N ≥ 2, and $\mathcal {S}=\left (s_{n} \right )_{n\geq 0}$ be a sequence over {0,1} with (s₀,…,s_N− 2)≠(a,…,a) for a = 0 or 1. The Nth maximum order complexity $M(\mathcal {S},N)$ is the smallest positive integer M such that there is a polynomial $f(x_{1},\ldots ,x_{M}) \in \mathbb {F}_{2}[x_{1},\ldots ,x_{M}]$ with

$$ \begin{array}{@{}rcl@{}} s_{i+M}=f(s_{i},\ldots,s_{i+M-1}), \quad 0\leq i\leq N-M-1. \end{array} $$

If s_i = a for i = 0,…,N − 2, we define $M(\mathcal {S},N)=0$ if s_N− 1 = a and $M(\mathcal {S},N)=N-1$ else.

A sequence with small maximum order complexity is not suitable for cryptographic applications. Indeed, such a sequence can be built with relatively short blocks of consecutive terms. However, a sequence with large maximum order complexity can still be predictable, as it is known for the Thue–Morse sequence, see [15, 28]. Note also that the largest possible order of magnitude of $M(\mathcal {S},N)$ is N and the expected value of $M(\mathcal {S},N)$ is $\log N$, see [12, 20].

The maximum order complexity has been studied by several authors, for general results see [11‐13, 20, 31] or [21, 27‐29] for applications to some particular sequences such as the Thue–Morse sequence or the Rudin–Shapiro sequence. From a computational perspective, Jansen [12, Proposition 3.17] showed how Blumer’s DAWG (Direct Acyclic Weighted Graph) algorithm [3] can be used to compute the maximum order complexity in linear time and memory. In the last section of this paper, we use DAWG algorithm to formulate some conjectures (Section 3.2).

Diem [7] introduced the expansion complexity of a sequence as follows.

Definition 2 (Expansion complexity)

Let N be a positive integer, $\mathcal {S}=\left (s_{n} \right )_{n\geq 0}$ be a sequence over {0,1} and $G_{\mathcal {S}}(x)$ its generating function defined by

$$ G_{\mathcal{S}}(x)= \underset{i\geq 0}{\sum} s_{i}x^{i}. $$

The Nth expansion complexity $E(\mathcal {S},N)$ is defined as the least total degree of a nonzero polynomial $h(x,y) \in \mathbb {F}_{2}[x,y]$ with

$$ h(x,G_{\mathcal{S}}(x)) \equiv 0 \pmod{x^{N}}, $$

if s₀,…,s_N− 1 are not all equal to 0, and $E(\mathcal {S},N)=0$ otherwise.

A truly random sequence has expected value $E(\mathcal {S},N)$ of order N^1/2, see [17].

Binary, and more generally, p-ary automatic sequences are not good pseudorandom sequences since their expansion complexity satisfies $E(\mathcal {S},N)<+\infty $ by Christol’s theorem, see [6]. In fact, automatic sequences generated by a finite automaton do not have sufficiently many different factors of a fixed length.

The subword complexity, denoted by $p_{\mathcal {S}}(k)$, counts how many different subwords of fixed length k the sequence contains. It can again serve as a measure of pseudorandomness. A sequence $\mathcal {S}$ over {0,1} is normal if for every k ≥ 1 and for any block of length (b₀,…,b_k− 1) ∈{0,1}^k, we have

$$ \underset{N \to \infty }{\lim} \frac{1}{N} \text{Card}\{i<N: s_{i}=b_{0},\ldots,s_{i+k-1}=b_{k-1} \}=\frac{1}{2^{k}}. $$

One can associate to a binary sequence a symbolic dynamical system based on the shift on $\{0,1\}^{\mathbb {N}}$. The topological entropy of such a dynamical system is $\lim _{k\rightarrow +\infty } \frac {\log _{2} p_{\mathcal {S}}(k)}{k}$, where $\log _{2}$ denotes the base 2-logarithm. A sequence with 0 topological entropy is said to be deterministic and it should not provide a pseudo-random sequence since such a sequence has a small subword complexity, see [23].

1.2 Automatic and morphic sequences

Let k ≥ 2 and Σ be a finite alphabet. We denote by Σ^∗ the set of all finite or infinite words over Σ. A morphism f is a mapping $f:{{{\varSigma }}}^{*} \rightarrow {{{\varSigma }}}^{*}$ satisfying f(uv) = f(u)f(v) for all words u and v. A morphism is said to be k-uniform if |f(x)| = k for all x ∈Σ, where |x| is the length of the word x. If there is b ∈Σ such that f(b) starts with b, i.e. f(b) = bu for some u ∈Σ^∗, we say that f is b-prolongable. In this case, let us denote f^ω(b) the fixed point of f given by f^ω(b) = bf(u)f²(u)…

Definition 3 (Automatic and morphic sequences)

Let $\mathcal {S}$ be a sequence over an alphabet Σ. $\mathcal {S}$ is morphic if $\mathcal {S}=\pi (f^{\omega }(b))$ where $f:{{{\varSigma }}}^{*} \rightarrow {{{\varSigma }}}^{*}$ is a b-prolongeable morphism and $\pi :{{{\varSigma }}}\rightarrow {{{\varDelta }}}$ is a morphism, named coding. $\mathcal {S}$ is said k-automatic if f is k-uniform.

For equivalent definitions of automatic sequences, we refer to [2]. An emblematic example of automatic sequence is the Thue–Morse sequence, or Prouhet–Thue–Morse sequence, independently introduced by Thue [30], Morse [18] and Prouhet sixty years before [22]. This sequence appears in many different fields of mathematics, see [14], and is still an important object of recent work.

Definition 4 (Thue–Morse sequence)

For an integer n ≥ 0, we write $n={\sum }_{i \geq 0}\varepsilon _{i}2^{i}$ with ε_i ∈{0,1} for all i and (n)₂ = ⋯ε₁ε₀. The binary sum-of-digits of n equals $s_{2}(n)={\sum }_{i \geq 0} \varepsilon _{i}$. The Thue–Morse sequence $\mathcal {T}=(t(n))_{n\geq 0}$ is defined by t(n) = s₂(n) mod 2.

The Thue–Morse sequence is 2-automatic and the corresponding morphism is

$$ \begin{array}{@{}rcl@{}} f:\left\{ \begin{array}{lll} 0 \mapsto 01 \\ 1 \mapsto 10. \end{array} \right. \end{array} $$

The Thue–Morse sequence has a large maximum order complexity, see [28], but this sequence is far from being pseudorandom. Indeed, it is well-known that $E(\mathcal {T},N)\leq 5$ for all N since h(x,y) = (x + 1)³y² + (x + 1)²y + x satisfies $h(x,G_{\mathcal {T}}(x))=0$ where $G_{\mathcal {T}}(x)$ is the generating function of $\mathcal {T}$, see [2, Example 12.1.12]. Also, its subword complexity is small, $p_{\mathcal {T}}(k)\ll k$ (see [2, Corollary 10.3.2] for a general result for all automatic sequences).

The behavior of this sequence regarding the defined pseudorandomness measures changes when this sequence is rarefied along specific subsequences. Drmota, Mauduit et Rivat [8] showed that the Thue–Morse sequence along squares is normal and Moshe [19] showed that the polynomial subsequences of degree d ≥ 2 of the Thue–Morse sequence have an exponential subword complexity. Sun and Winterhof [27] and Popoli [21] showed that the maximum order complexity of the Thue–Morse sequence along polynomial subsequences remains comparatively large. A large maximum order complexity is desired, however, it should be noted that it should be not too large since the correlation measure of order 2 gets large in that case (see [12, Proposition 3.1]).

Furthermore, these polynomial subsequences are no longer automatic sequences, see [2, Theorem 6.10.1], and their expansion complexity is no longer bounded by Christol’s theorem. These statements mean that the polynomial subsequences of the Thue–Morse sequence are better candidates for cryptographic applications than the original sequence.

In order to generalize the investigation to other morphic sequences and sum of digits function, let us introduce the Zeckendorf base sum of digits function, which is related to the Fibonacci sequence (see [2, Theorem 3.8.1], [32]).

Definition 5 (Zeckendorf base)

Let $\mathcal {F}=(F_{n})_{n\geq 0}$ be the Fibonacci sequence with initial values F₀ = 0, F₁ = 1 and F_n+ 2 = F_n+ 1 + F_n for all n ≥ 0. Let us denote $\varphi =(1+ \sqrt {5})/2$ the golden ratio. Each integer n can be represented uniquely by

$$ n=\underset{i \geq 0}{\sum} \varepsilon_{i}(n) F_{i+2}, $$

with ε_i(n) ∈{0,1} and ε_i(n)ε_i+ 1(n) = 0 for all i ≥ 0.

Since $F_{n}=\lfloor \frac {\varphi ^{n}}{\sqrt {5}}+\frac {1}{2} \rfloor $ for n ≥ 0, the index n(a) of the largest Fibonacci number that is not greater than an integer a > 1 is $\lfloor \frac {\log (a \sqrt {5}+1/2)}{\log (\varphi )} \rfloor $. We call n(a) − 1 the length of a.

Thus we can define the sum of digits function in Zeckendorf base by $s_{Z}(n)={\sum }_{i \geq 0} \varepsilon _{i}(n)$, see [9, 24, 26] or sequence A007895 in On-Line Encyclopedia of Integer Sequences (OEIS) for discussions on this sequence. We denote by $\mathcal {S}_{Z}$ the sequence defined by $\mathcal {S}_{Z}=\left (s_{Z}(n)\mod 2\right )_{n\geq 0}$, an analog of the Thue–Morse sequence. It is known that $\mathcal {S}_{Z}$ is a morphic sequence, see for example [4, p.14] or [2, Examples 7.8.2, 7.8.4]. An example of the corresponding morphism and coding is

$$f:\left\{ \begin{array}{lll} a \mapsto ab \\ b \mapsto c \\ c \mapsto cd \\ d \mapsto a \end{array} \right. \quad \text{and} \quad g:\left\{ \begin{array}{lll} a \mapsto 0 \\ b \mapsto 1 \\ c \mapsto 1 \\ d \mapsto 0. \end{array} \right. $$

Thus, the sequence modulo 2 is obtained by iterating f on the letter a and recoding with g : $\mathcal {S}_{Z}=\texttt {0}\texttt {1}\texttt {1}\texttt {1}\texttt {0}\texttt {1}\texttt {0}\texttt {0}\texttt {1}\texttt {0}\texttt {0}\texttt {0}\texttt {1}\texttt {1}\texttt {0}\texttt {0}\texttt {0}\texttt {1}\texttt {0}\texttt {1}\texttt {1}\ldots $ This sequence is not automatic, see [9, Remark 1]. Indeed the authors show that the k-kernel is infinite for every k, see [2] for more details on equivalent definitions of automatic sequences. This implies that the expansion complexity of $\mathcal {S}_{Z}$ is not bounded, since this sequence is not automatic, and motivates our study of this sequence. In fact, morphic non-automatic sequences are better candidates for cryptographic applications regarding this complexity. However morphic sequences have subword complexity at most bounded by a quadratic function, see [2, Corollary 10.4.9], and their associated dynamical system has 0 topological entropy, see [23, Corollary V.20] for an alternative proof.

Example 1.1 (Carry propagations in Zeckendorf base)

In Zeckendorf base, the carry propagations work in a different way than in the usual q-base with $q\in \mathbb {N}$ and q ≥ 2. Indeed, since we cannot have two consecutive 1-bits, the carry is “transversal” by the Fibonacci recurrence relation and if we have two 1-bits on the same column we use the formulas 2F_j = F_j+ 1 + F_j− 2 for all j ≥ 2 in order to get an admissible expansion. For example, 5 = F₅, 6 = F₅ + F₂ and 5 + 6 = 11 = F₆ + F₄ as shown by the following calculation:

https://static-content.springer.com/image/art%3A10.1007%2Fs12095-021-00507-w/MediaObjects/12095_2021_507_Figa_HTML.png

In this kind of calculation, we first do not take into consideration the restriction of non-adjacent terms then we normalize to obtain the Zeckendorf representation with non-adjacent terms. This simple example shows that the carry propagation is on both sides of the expansion since 2F_j = F_j+ 1 + F_j− 2 for all j ≥ 2. We will see later that this specific carry propagation, compared to the usual q-base carry propagation, will play an important role in our investigation.

As the sum of digits function for the usual q-base, s_Z is a subadditive function, i.e. for any integers n₁,n₂ ≥ 0 we have s_Z(n₁ + n₂) ≤ s_Z(n₁) + s_Z(n₂), see [26, Proposition 1]. Consider the Zeckendorf expansion of n₁ and n₂,

$$ \begin{array}{@{}rcl@{}} n_{i}=\sum \limits_{j=k_{i}}^{K_{i}}\varepsilon_{j}^{(i)}F_{j+2}, \qquad i=1,2; \end{array} $$

with $\varepsilon _{k_{i}}^{(i)}=\varepsilon _{K_{i}}^{(i)}=1$. Without loss of generality suppose that k₁ ≤ k₂. We say that n₁ and n₂ are non-interfering if k₂ − K₁ ≥ 2. This means that non-interfering integers have digital blocks that do not overlap and no normalization after the addition has to be executed. We therefore have

$$ s_{Z}(n_{1}+n_{2})=s_{Z}(n_{1})+s_{Z}(n_{2}). $$

(1)

Remark 1.1

For usual q-base we have an analogous definition. Let a,b be integers such that a < q^ℓ for some ℓ ≥ 0, thus we have the identity

$$ \begin{array}{@{}rcl@{}} s_{q}(a+q^{\ell}b)=s_{q}(a)+s_{q}(b), \end{array} $$

(2)

where s_q denotes the sum of digits function in q-base. For Zeckendorf base there are two major differences with respect to the usual q-base. First, we have a transversal carry propagation and this translates into the condition k₂ − K₁ ≥ 2 instead of k₂ − K₁ ≥ 1 that we would have for the usual q-base. Secondly, the identity (2) is not true in general when we replace q^ℓ by F_ℓ and s_q by s_Z. We will use Lucas numbers to replace the usual shift q^ℓ.

1.3 Main results

In the following we will determine a lower bound for the maximum order complexity of $\mathcal {S}_{Z}=(s_{Z}(n) \mod 2)_{n}$ and for $\mathcal {S}_{Z,P}=(s_{Z}(P(n)) \mod 2)_{n}$ with $P \in \mathbb {Z}[X]$ a monic polynomial such that $P(\mathbb {N}_{0}) \subset \mathbb {N}_{0}$. In what follows, we use Vinogradov’s notation f ≪ g is defined as |f|≤ c|g| for some c > 0.

Theorem 1.1

There exists N₀ > 0 such that for all N > N₀ we have

$$ M(\mathcal{S}_{Z},N) \geq \frac{1}{\varphi+\varphi^{3}}N+1. $$

Since the correlation measure is bounded below by the maximum order complexity (see [17]), Theorem 1.1 shows that $\mathcal {S}_{Z}$ is non-random in this specific respect. We mention also the recent result of Shutov [25] that shows that the autocorrelation of order 2 with lag equal to 1 of $(-1)^{s_{Z}(n)}$ is large. This serves as motivation to look at subsequences, that we address in following result.

Theorem 1.2

Let $P(X) \in \mathbb {Z}[X]$ be a monic polynomial of degree d ≥ 2 with $P(\mathbb {N}_{0}) \subset \mathbb {N}_{0}$. Then $\mathcal {S}_{Z,P}$ satisfies

$$ M(\mathcal{S}_{Z,P},N)\gg N^{1/(2d)}, \qquad N \to \infty, $$

where the implied constant only depends on P.

We will discuss the heuristics that supports the real growth of the maximum order complexity in the final part of the paper (Section 3.2). In particular, we conjecture that the lower bound is indeed the actual growth of the maximal order complexity.

The rest of the paper is structured as follows. In Section 2, we prove Theorems 1.1 and 1.2 where we first prove the linear case and the particular case P(X) = X^d for d ≥ 2 before tackling the general case. We conclude the paper with some final remarks (Section 3) about possible generalizations of this result to other numerations systems and some heuristically supported conjectures.

2 Sum of digits function in Zeckendorf base

In order to prove a lower bound for the Nth maximum order complexity of a sequence, we use a tool from [12, Proposition 3.1].

Lemma 1 (12)

Let $\mathcal {S}$ be a sequence over $\mathbb {F}_{2}$ of length n. Let k be the length of the longest subsequence of $\mathcal {S}$ that occurs at least twice with different successors. Then $\mathcal {S}$ has maximum order complexity k + 1.

The proofs of Theorem 1.1 and Theorem 1.2 are split into two parts. Let N be a sufficiently large integer. First we built two subsequences of the sequence $\mathcal {S}$ of same length L(N), depending on N, which coincide by using non-interfering terms. Then, for these subsequences, we look for different successors, by studying precisely the involved carry propagations. Thus, by Lemma 1, we then will have $M(\mathcal {S},N)\geq L(N)+1$.

2.1 Linear case

We first study the maximum order complexity of $\mathcal {S}_{Z}$.

Lemma 2

Let ℓ ≥ 2, for all 0 ≤ n < F_ℓ we have

$$ \begin{array}{@{}rcl@{}} s_{Z}(n+F_{\ell+1})&=&s_{Z}(n+F_{\ell+2}), \\ s_{Z}(F_{\ell}+F_{\ell+1}) \bmod 2&\neq& s_{Z}(F_{\ell}+F_{\ell+2}) \bmod 2. \end{array} $$

Proof

If n < F_ℓ, the terms n and F_ℓ+ 1, respectively, n and F_ℓ+ 2 are non-interfering, see (1). The second line follows from s_Z(F_ℓ + F_ℓ+ 1) = 1 and s_Z(F_ℓ + F_ℓ+ 2) = 2. □

We are now able to prove Theorem 1.1.

Proof of Theorem 1.1

We choose ℓ ≥ 2 such as F_ℓ + F_ℓ+ 2 ≤ N < F_ℓ+ 1 + F_ℓ+ 3. This implies N ≥ F₂ + F₄ = 4. Then by Lemma 2, $(s_{Z}(n+F_{\ell +1}))_{0\leq n <F_{\ell }}$ and $(s_{Z}(n+F_{\ell +2}))_{0\leq n <F_{\ell }}$ are two subsequences of same length with different successors. Thus by Lemma 1, we have $M(\mathcal {S}_{Z},N)\geq F_{\ell }+1$. Furthermore, since $\lim _{n\to \infty } \frac {F_{n+k}}{F_{n}}= \varphi ^{k}$, for k ≥ 1, we have N < (φ + φ³)F_ℓ for ℓ large enough. Thus, the theorem is proved. □

2.2 Monomial subsequences

We now study the sequence $\mathcal {S}_{Z}$ along polynomial values. In a classical q-base, xq^j is a shift of length j for the expansion of x. In Zeckendorf base, the Fibonacci numbers do not have this property since a power of a Fibonacci number is in general not a Fibonacci number (note that the only exceptions of pure powers that are Fibonacci numbers are 1,8 and 144, see [5]). The Lucas numbers are an interesting analogue of powers of q in q-base.

Definition 6 (Lucas numbers)

Let ${\mathscr{L}}=(L_{n})_{n}$ be the sequence defined by L₀ = 2, L₁ = 1 and L_j+ 2 = L_j+ 1 + L_j for all j ≥ 0.

We have for all j ≥ 1, the basic relation L_j = F_j+ 1 + F_j− 1, which means that the expansion of Lucas numbers in Zeckendorf base is simple. Moreover we have the following formulas.

Lemma 3 (26)

For all k ≥ ℓ ≥ 0 and h ≥ 0, we have

L_kL_ℓ = L_k+ℓ + (− 1)^ℓL_k−ℓ,

L_kF_ℓ = F_k+ℓ − (− 1)^ℓF_k−ℓ,

${L_{k}^{h}}=\sum \limits _{i=0}^{\frac {h-1}{2}}\binom {h}{i}(-1)^{ik}L_{(h-2i)k},\qquad h$ odd,

${L_{k}^{h}}=\sum \limits _{i=0}^{\frac {h}{2}-1}\binom {h}{i}(-1)^{ik}L_{(h-2i)k}+\binom {h}{h/2}(-1)^{hk/2},\qquad h$ even.

Furthermore, let m > 0, there are non-adjacent terms ε_−(2u+ 1)(m),…, ε_2u+v(m) ∈{0,1}, ε_{− 2u− 1}(m) = ε_2u+v(m) = 1 for some u,v integers depending only on m such that for all k ≥ 2u + 3,

$$ \begin{array}{@{}rcl@{}} mL_{k}=\sum \limits_{j=-(2u+1)}^{2u+v}\varepsilon_{j}(m)F_{k+j} \end{array} $$

and $[-(2u+1),2u+v] \subseteq \left [k-\ell -1,k+\ell +1\right ]$ where ℓ is such that F_ℓ ≤ m < F_ℓ+ 1.

The last statement of Lemma 3 has an important role for the construction of sums with non-interfering terms. Indeed, for an integer m and k large enough (of order of length of m), the expansion of mL_k is the expansion of m centered around F_k. Notice that this result is different from the q-base since the digits here appear on both sides of the expansion of m. We will see the impact in our main result Theorem 1.2 with the occurence of N^1/(2d) in the place of N^1/d that we got in the case of q-base expansion, see [21].

Let us start with the case P(X) = X^d for d ≥ 2. In the following we write $\overline {\lambda }=\lambda \mod 2$ with $\overline {\lambda }\in \{0,1\}$. This case is the building brick for the general case of $P(X) \in \mathbb {Z}[X]$ with $P(\mathbb {N}_{0}) \subset \mathbb {N}_{0}$.

To begin with, we study the expression of (n + L_ℓ)^d in terms of a sum of distinct Lucas numbers.

Lemma 4

Let n ≥ 0 be an integer and ℓ ≥ 0 be an even integer. We have

$$ (n+L_{\ell})^{d}=\sum \limits_{i=0}^{\frac{d-\overline{d}}{2}} \eta_{i,0}n^{d-2i}+\sum \limits_{\lambda=1}^{d}\left( \sum \limits_{i=\frac{\lambda-\overline{\lambda}}{2}}^{\frac{d-\overline{d}}{2}-\overline{\lambda}}\eta_{i,\lambda} n^{d-2i-\overline{\lambda}} \right) L_{\lambda \ell}, $$

with $\eta _{i,\lambda }=\binom {d}{2i+\overline {\lambda }}\binom {2i+\overline {\lambda }}{i-\frac {\lambda -\overline {\lambda }}{2}}$.

Proof

Let d be even. Then by Lemma 3 we have

$$ \begin{array}{@{}rcl@{}} (n+L_{\ell})^{d}&=&\sum \limits_{i=0}^{d/2} \binom{d}{2i} n^{d-2i} L_{\ell}^{2i}+\sum \limits_{i=0}^{d/2-1}\binom{d}{2i+1}n^{d-2i-1}L_{\ell}^{2i+1}, \\ &=&n^{d}+ \sum \limits_{i=1}^{d/2} \binom{d}{2i} n^{d-2i} \left( \sum \limits_{j=0}^{i-1} \binom{2i}{j} L_{(2i-2j)\ell}+\binom{2i}{i} \right) \\ &&+\sum \limits_{i=0}^{d/2-1}\binom{d}{2i+1}n^{d-2i-1} \left( \sum \limits_{j=0}^{i} \binom{2i+1}{j} L_{(2i-2j+1)\ell} \right), \\ &=&\sum \limits_{i=0}^{\frac{d}{2}} \binom{d}{2i}\binom{2i}{i}n^{d-2i}\\ &&+ \sum \limits_{\lambda=1}^{d}\left( \sum \limits_{i=\frac{\lambda-\overline{\lambda}}{2}}^{\frac{d}{2}-\overline{\lambda}}\binom{d}{2i+\overline{\lambda}}\binom{2i+\overline{\lambda}}{i-\frac{\lambda-\overline{\lambda}}{2}} n^{d-2i-\overline{\lambda}} \right) L_{\lambda \ell}. \end{array} $$

For d odd the proof runs along the same lines and we have

$$ \begin{array}{@{}rcl@{}} (n+L_{\ell})^{d}&=&\sum \limits_{i=0}^{\frac{d-1}{2}} \binom{d}{2i}\binom{2i}{i}n^{d-2i}\\ &&+ \sum \limits_{\lambda=1}^{d}\left( \sum \limits_{i=\frac{\lambda-\overline{\lambda}}{2}}^{\frac{d-1}{2}-\overline{\lambda}}\binom{d}{2i+\overline{\lambda}}\binom{2i+\overline{\lambda}}{i-\frac{\lambda-\overline{\lambda}}{2}} n^{d-2i-\overline{\lambda}} \right) L_{\lambda \ell}. \end{array} $$

□

Lemma 5

Let n ≥ d be an integer, d ≥ 3 an odd integer and 1 ≤ λ ≤ d. Then we have

$$ \sum \limits_{i=\frac{\lambda-\overline{\lambda}}{2}}^{\frac{d-1}{2}-\overline{\lambda}}\binom{d}{2i+\overline{\lambda}}\binom{2i+\overline{\lambda}}{i-\frac{\lambda-\overline{\lambda}}{2}} n^{d-2i-\overline{\lambda}}<\sum \limits_{i=0}^{\frac{d-1}{2}} \binom{d}{2i}\binom{2i}{i}n^{d-2i}. $$

Proof

We distinguish the two following cases:

If $\overline {\lambda }=0$, we have

$$ \sum \limits_{i=0}^{\frac{d-1}{2}} \binom{d}{2i}\binom{2i}{i}n^{d-2i}=\sum \limits_{i=0}^{\frac{\lambda}{2}-1} \binom{d}{2i}\binom{2i}{i}n^{d-2i}+\sum \limits_{i=\frac{\lambda}{2}}^{\frac{d-1}{2}} \binom{d}{2i}\binom{2i}{i}n^{d-2i}. $$

Since $\binom {2i}{i}\geq \binom {2i}{i-\frac {\lambda }{2}}$ for all $i \geq \frac {\lambda }{2}$, we have the result.

If $\overline {\lambda }=1$, we have

$$ \begin{array}{@{}rcl@{}} \sum \limits_{i=0}^{\frac{d-1}{2}} \binom{d}{2i}\binom{2i}{i}n^{d-2i}&=&\sum \limits_{i=0}^{\frac{\lambda-1}{2}-1} \binom{d}{2i}\binom{2i}{i}n^{d-2i}\\ && +\sum \limits_{i=\frac{\lambda-1}{2}}^{\frac{d-1}{2}-1} \binom{d}{2i}\binom{2i}{i}n^{d-2i}+d\binom{d-1}{\frac{d-1}{2}}n, \end{array} $$

and

$$ \begin{array}{@{}rcl@{}} &&\sum \limits_{i=\frac{\lambda-1}{2}}^{\frac{d-1}{2}-1}\binom{d}{2i+1}\binom{2i+1}{i-\frac{\lambda-1}{2}} n^{d-2i-1}\\ &\leq& \frac{d-(\lambda-1)}{n} \sum \limits_{i=\frac{\lambda-1}{2}}^{\frac{d-1}{2}-1}\binom{d}{2i}\binom{2i}{i-\frac{\lambda-1}{2}} n^{d-2i}. \end{array} $$

Again we get the result for n ≥ d.

□

Remark 2.1

The condition n ≥ d in Lemma 5 has no impact on the quality of the lower bound of the complexity since it does not depend on ℓ.

Now, for k ≥ 1 consider

$$ \begin{array}{@{}rcl@{}} t(k)=m_{3}L_{6k}-m_{2}L_{4k}+m_{1}L_{2k}+m_{0}L_{0}, \end{array} $$

with real parameters m_i, see [26]. For d ≥ 1 set

$$ \begin{array}{@{}rcl@{}} T_{d}(k)=t(k)^{d}= \underset{0\leq i \leq 3d}{\sum}c_{i}L_{2ik}. \end{array} $$

Note that for k sufficiently large, the coefficients c_i are independent from k.

We will need the following auxiliary result [26, Lemma 4].

Lemma 6 (26)

Let M ≥ 1, and $m_{0},m_{1},m_{2},m_{3} \in \mathbb {R}$ with

$$ \begin{array}{@{}rcl@{}} 1\leq m_{0},m_{1},m_{3}<M, \\ 0<m_{2}<\frac{1}{d^{3}(32M)^{d}}. \end{array} $$

Then we have c_3d > 0, c_3d− 1 < 0 and c_i > 0 for i = 0,1,…,3ℓ − 2.

This lemma will be useful to construct two elements with different sums of digits in Zeckendorf base when we take their d th powers. Indeed, only the subdominant coefficient is negative so we are able to create a block of digits 1010⋯10 of length k for any k large enough. Indeed, as we will state later, the transition from k to k + 1 adds exactly one block of 10. We have already used a similar method in the usual 2-base, see [21, proof of Lemma 6].

In the following, let α ≥ 1 be an integer such that

$$ \begin{array}{@{}rcl@{}} \varphi^{\alpha}>d^{3}\varphi(32\varphi)^{d}, \end{array} $$

and m₂ = 1 and m₀,m₁,m₃ integers such that

$$ \begin{array}{@{}rcl@{}} \varphi^{\alpha-1}\leq m_{0},m_{1},m_{3}<\varphi^{\alpha}. \end{array} $$

(3)

Under these conditions, T_d(k) and T_d(k + 1) have all positive integral coefficients with the only exception of the coefficient of L_{2k(3d− 1)} and L_{2(k+ 1)(3d− 1)} respectively, see [26]. Notice that these coefficients are the same for these two polynomials for k large enough.

Lemma 7

Let d ≥ 3 be odd and k ≥ 0 be a sufficiently large integer. Let n be an integer such that n ≥ d and

$$ \sum \limits_{i=0}^{(d-1)/2} \binom{d}{2i}\binom{2i}{i}n^{d-2i} < F_{3k}. $$

(4)

Then we have

$$ \begin{array}{@{}rcl@{}} s_{Z}((n+L_{6k+2})^{d}) &=& s_{Z}((n+L_{6k+4})^{d}), \end{array} $$

(5)

$$ \begin{array}{@{}rcl@{}} s_{Z}(T_{d}(k)) \bmod 2&\neq& s_{Z}(T_{d}(k+1)) \bmod 2. \end{array} $$

(6)

Proof

Let ℓ ≥ 0 be an even integer, we have already proved in Lemma 4 that for odd d we have

$$ (n+L_{\ell})^{d}=\sum \limits_{i=0}^{\frac{d-1}{2}} \eta_{i,0}n^{d-2i}+\sum \limits_{\lambda=1}^{d}\left( \sum \limits_{i=\frac{\lambda-\overline{\lambda}}{2}}^{\frac{d-1}{2}-\overline{\lambda}}\eta_{i,\lambda} n^{d-2i-\overline{\lambda}} \right) L_{\lambda \ell}. $$

Condition (4) and Lemma 5 imply that all coefficients in front of each L_λℓ, for 1 ≤ λ ≤ d, are < F_3k. Since the aim is to build a sum with non-interfering terms, we study for the range of digits of each term in the sum with the help of Lemma 3. Hence for sufficiently large ℓ, the range of digits of

$$ \left( \sum \limits_{i=\frac{\lambda-\overline{\lambda}}{2}}^{d/2-\overline{\lambda}}\binom{d}{2i+\overline{\lambda}}\binom{2i+\overline{\lambda}}{i-\frac{\lambda-\overline{\lambda}}{2}} n^{d-2i-\overline{\lambda}} \right) L_{\lambda \ell} $$

is included in the interval $\left [\lambda \ell - 3k,\lambda \ell + 3k\right ]$ for λ > 0 and $\left [0,3k-1\right ]$ for λ = 0. If we suppose that these intervals are disjoints plus one small gap, see Remark 1.1, we have a non-interfering sum, i.e when we suppose

$$ \begin{array}{@{}rcl@{}} &&[0,3k-1] \cap [ \ell -3k-1,\ell +3k+1]=\emptyset,\\ &&[ \ell -3k-1,\ell +3k+1] \cap [2\ell -3k-1,2\ell +3k+1]=\emptyset, \\ &&\ldots, \\ &&[(d-1)\ell-3k-1,(d-1)\ell+3k+1]\cap [d\ell -3k-1,d\ell +3k+1]=\emptyset. \end{array} $$

For this to happen, it is sufficient to suppose ℓ > 6k and we have imposed ℓ even. So we can choose ℓ = 6k + 2 in order to have a non-interfering sum. An identical proof works for ℓ = 6k + 4 and (5) is proved.

The second part follows directly from [26, Lemma 3]. For m₁,m₂ ≥ 1 and k₁ > k₂ large enough, we have

$$ s_{Z}(m_{1}L_{2k_{1}}-m_{2}L_{2k_{2}})=k_{1}-k_{2}+C(m_{1},m_{2}), $$

where C(m₁,m₂) only depends on m₁ and m₂. For k sufficiently large we therefore have

$$ \begin{array}{@{}rcl@{}}s_{Z}(t(k)^{d})&=&s_{Z}(c_{3d}L_{6kd}-(-c_{3d-1})L_{2k(3d-1)}+\cdots+c_{0}L_{0}) \\ &=&(3dk-k(3d-1))+C(c_{3d},c_{3d-1})+\kappa \\ &=&k+C(c_{3d},c_{3d-1})+\kappa, \end{array} $$

for some κ independent from k and

$$ \begin{array}{@{}rcl@{}} s_{Z}(t(k+1)^{d})&=&s_{Z}(c_{3d}L_{6(k+1)d}-(-c_{3d-1})L_{2(k+1)(3d-1)}+\cdots+c_{0}L_{0}) \\ &=&(3d(k+1)-(k+1)(3d-1))+C(c_{3d},c_{3d-1})+\kappa \\ &=&k+1+C(c_{3d},c_{3d-1})+\kappa. \end{array} $$

This proves (6). □

We are now able to prove Theorem 1.2 in the particular case P(X) = X^d.

Proof

(Proof of Theorem 1.2) We suppose the same hypotheses as in Lemma 7 and we choose k ≥ 0 such that

$$ \begin{array}{@{}rcl@{}} t(k+1)<N\leq t(k+2). \end{array} $$

(7)

Thus condition (4) gives n^d ≪ F_3k and $n \ll F_{k}^{3/d}$. As we have already stated in Lemma 1, we have

$$ M(\mathcal{S}_{Z,P},N) \gg F_{k}^{3/d} $$

since we have two blocks of length $\ll F_{k}^{3/d}$ with two different successors. It remains to ensure that

$$ t(k) \gg F_{k}^{3/d}+L_{6k+2}, $$

(8)

since we need t(k)^d a successor of the non-interfering block. We have $t(k)\sim m_{3}L_{6k}\sim m_{3}\varphi ^{6k}$ and $F_{k}^{3/d}+L_{6k+2}\sim L_{6k+2} \sim \varphi ^{6k+2}$ as $k\to +\infty $. Since by (3), m₃ ≥ d³(32φ)^d, we have t(k) ≫ 32^dd³φ^6k+d for $k\to +\infty $. Therefore for sufficiently large k, (8) is satisfied. For the same reasons, we need

$$ t(k+1) \gg F_{k}^{3/d}+L_{6k+4}. $$

(9)

A very similar proof of (8) gives (9) for k sufficiently large. Furthermore, (7) gives

$$ N\leq t(k+2) \ll L_{6(k+2)} \ll {F_{k}^{6}}. $$

We finally get

$$ M(\mathcal{S}_{Z,P},N) \gg F_{k}^{3/d} \gg N^{1/(2d)}, $$

and the theorem is proved if d is odd.

For d even, we prove a similar result as Lemma 7 and a similar proof works for the same reasons (we omit the details). Thus Theorem 1.2 is proved in the case P(X) = X^d. □

2.3 Polynomial subsequences

We consider now the general case with P(X) = α_dX^d + ⋯ + α₀ a polynomial such that $P(\mathbb {N}_{0}) \subset \mathbb {N}_{0}$ and α_d = 1. Such as before, we shall determine exactly P(n + L_k) for the integers n ≥ 0 et k ≥ 0.

Lemma 8

We have for integers n ≥ 0 et k ≥ 0,

$$ \begin{array}{@{}rcl@{}} P(n+L_{k})=\underset{0 \leq \lambda \leq d}{\sum}\beta_{\lambda}(n)L_{\lambda k} \end{array} $$

with $\beta _{\lambda }(n)=\underset {\lambda \leq i \leq d}{\sum }\alpha _{i} \sum \limits _{j=\frac {\lambda -\overline {\lambda }}{2}}^{\lfloor \frac {i-\overline {\lambda }}{2} \rfloor }\gamma _{i,j,\lambda }n^{i-2j-\overline {\lambda }}$ and $ \gamma _{i,j,\lambda }=\binom {i}{2j+\overline {\lambda }}\binom {2j+\overline {\lambda }}{j-\frac {\lambda -\overline {\lambda }}{2}}$.

Proof

The proof is similar to Lemma 4. □

Lemma 9

Let n ≥ C_d and λ ≥ 1 be integers with C_d an absolute constant depending only on d. Then we have β_λ(n) < β₀(n).

Proof

The proof is again similar to Lemma 5. □

Let μ be an integer. We have by Lemma 8

$$ P(n+L_{2d\mu k+2})=\beta_{0}(n)+\beta_{1}(n)L_{2d\mu k+2}+\cdots+\beta_{d}(n)L_{d(2d\mu k+2)}. $$

(10)

Remark 2.2

We introduce μ since it is sufficient to adjust the non-interfering block to have the same result. Later, we shall take μ depending only on d.

To prove a lower bound on the Nth maximum order complexity of $\mathcal {S}_{Z,P}$, we look for an analogue of Lemma 7. The analogue of the first part of this lemma is described as follows.

Lemma 10

Let k ≥ 0 be a sufficiently large integer and r > 1. For any integer C_d < n < F_μk we have

$$ s_{Z}(P(n+L_{2d\mu k+2}))=s_{Z}(P(n+L_{2d\mu k+2r})). $$

(11)

Proof

For n < F_μk we have β₀(n) ≪ L_dμk. Then the terms in (10) are non-interfering for k large enough such as in the proof of Lemma 7 where Lemma 5 is replaced by Lemma 9. Thus we have

$$ s_{Z}(P(n+L_{2d\mu k}))=s_{Z}(\beta_{0}(n))+s_{Z}(\beta_{1}(n))+\cdots+s_{Z}(\beta_{d}(n)). $$

Under the same hypothesis, we have for any r > 1,

$$ s_{Z}((P(n+L_{2d\mu k+2r}))=s_{Z}(\beta_{0}(n))+s_{Z}(\beta_{1}(n))+\cdots+s_{Z}(\beta_{d}(n)). $$

Therefore, for any n < F_μk, k large enough, and r > 1, we have

$$ s_{Z}(P(n+L_{2d\mu k}))=s_{Z}((P(n+L_{2d\mu k+2r})). $$

Thus the lemma is proved. □

In order to obtain the analogue of the second part of Lemma 7, we are now looking for an integer n of the form L_λ for some λ and r > 1 such that

$$ s_{Z}(P(L_{\lambda}+L_{2d \mu k+2})) \neq s_{Z}(P(L_{\lambda}+L_{2d \mu k+2r})). $$

(12)

According to Lemma 10, we need λ ≥ μk. Note that we want to prove that $M(\mathcal {S}_{Z,P},N)\gg N^{1/(2d)}$ in order to have the same bound as the one for the monomial case. The following result gives sufficient conditions for the size of λ and r for this bound.

Lemma 11

If λ ≤ 2dμk and r is constant, then we have $M(\mathcal {S}_{Z,P},N)\gg N^{1/(2d)}$.

Proof

If we have λ such that (12) is verified, we show in the same way as before that

$$M(\mathcal{S}_{Z,P},N) \geq F_{\mu k},$$

where k is chosen in a way that

$$ L_{\lambda}+L_{2d\mu k+2r}<N\leq L_{\lambda}+L_{2d\mu (k+1)+2r}. $$

This implies

$$ \begin{array}{@{}rcl@{}} N &\ll& L_{\lambda}+L_{2d\mu k+2r}\\ & \ll& F_{\mu k}^{\lambda/\mu k}+F_{\mu k}^{2d+2r/ \mu k} \end{array} $$

If λ ≤ 2dμk and r is constant, we have $N\ll F_{\mu k}^{2d}$ and $M(\mathcal {S}_{Z,P},N)\gg N^{1/(2d)}$. Notice that μ independent from k is also required to have this result. □

We are now looking for (λ,r) such that μk ≤ λ ≤ 2dμk, r constant and (12) is verified.

As we will state later, only two Lucas numbers will interfere. The following lemma describes all the possibilities for this interference.

Lemma 12

Let k,ℓ be two integers such that k ≥ ℓ ≥ 5. We have

$$ s_{Z}(L_{k}+L_{\ell}) \equiv 1 \pmod 2 \quad \Longleftrightarrow \quad k=\ell +2. $$

Proof

We have L_k = F_k+ 1 + F_k− 1 and L_ℓ = F_ℓ+ 1 + F_ℓ− 1. So we shift two blocks of 101 one against the other and look for transversals in the carry propagations. Note that if k ≥ ℓ + 4, there is no carry propagation at all and s_Z(L_k + L_ℓ) = 4 ≡ 0 (mod 2). By symmetry, there remain only four cases to consider. We suppose ℓ ≥ 5:

Case k = ℓ:

Then we have s_Z(L_k + L_ℓ) ≡ 0 (mod 2).
Case k = ℓ + 1:

Then we have s_Z(L_k + L_ℓ) ≡ 0 (mod 2).
Case k = ℓ + 2:

Then we have s_Z(L_k + L_ℓ) ≡ 1 (mod 2).
Case k = ℓ + 3:

Then we have s_Z(L_k + L_ℓ) ≡ 0 (mod 2).

Thus the lemma is proved. □

Proof

(Proof of Theorem 1.2) We now investigate the interferences in

$$ \begin{array}{@{}rcl@{}} P(L_{\lambda}+L_{2d\mu k+2})=\beta_{0}(\lambda)+\beta_{1}(\lambda)L_{2d\mu k+2}+\cdots+\beta_{d}(\lambda)L_{d(2d\mu k+2)}. \end{array} $$

For each β_i(λ)L_{i(2dμk+ 2)} we locate the least significant digit and the most significant digit. Then we have the following possible interferences

$$ \begin{array}{@{}rcl@{}} &&L_{\lambda d}+(-1)^{(d-1)\lambda}L_{2d\mu k+2 -(d-1)\lambda} \\ &&L_{2d\mu k+2 +(d-1)\lambda}+(-1)^{(d-2)\lambda}L_{2(2d\mu k+2) -(d-2)\lambda} \\ &&{\cdots} \\ &&L_{(d-1)(2d\mu k+2)+\lambda}+L_{d(2d\mu k+2)} \end{array} $$

since α_d = 1.

We will now choose λ even. Thus we have for the first interference L_λd + L_{2dμk−(d− 1)λ+ 2}. By Lemma 12 we have for sufficiently large k

$$ \begin{array}{@{}rcl@{}} s_{Z}(L_{\lambda d}+L_{2d\mu k -(d-1)\lambda+2})&=&1 \Leftrightarrow 2d\mu k-(d-1)\lambda+2=\lambda d +2 \\\Leftrightarrow \lambda&=&\frac{2d\mu k}{2d-1}. \end{array} $$

(13)

Hence it is sufficient to take μ = 2d − 1 and λ = 2dk. Furthermore, all other possible interferences are non-existing by this choice of λ and μ. We finally need to check if λ satisfies all the conditions: λ is even and (2d − 1)k ≤ λ ≤ 2d(2d − 1)k. Furthermore we can take r = 2 for example and all the appearing blocks do not interfere. Summing up, we have proved that $s_{Z}\left (P\left (L_{\lambda }+L_{2d(2d-1) k +2}\right )\right )\neq s_{Z}\left (P\left (L_{\lambda }+L_{2d(2d-1) k +4}\right )\right )$. Thus, we have proved the general case of Theorem 1.2. □

Remark 2.3

The choice of L_2dμk+ 2 in the proof is motivated to have an integer solution for λ in (13). Indeed, if we had chosen L_2dμk instead, (13) would not have provided an integer for all k.

3 Final remarks

3.1 Generalizations

Our result can be generalized to other numeration systems without new methods. Let us first comment on Ostrowski’s α-numeration system, see [2, Theorem 3.9.1], related to the continued fraction of an irrational number.

Definition 7 (Continued fraction)

Let α be a real number, we define the sequence (a_i)_i such that

$$\alpha=a_{0}+\frac{1}{a_{1}+\frac{1}{a_{2}+\cdots}}=\left[a_{0},a_{1},\ldots\right] $$

and we define p_− 2 = 0, p_− 1 = 1, q_− 2 = 1, q_− 1 = 0, and p_n = a_np_n− 1 + p_n− 2, q_n = a_nq_n− 1 + q_n− 2. Then we have $\frac {p_{n}}{q_{n}}=\left [a_{0},a_{1},\ldots ,a_{n}\right ]$. The sequence (q_n)_n is called denominators of the convergents of the continued fraction of α.

Definition 8 (Ostrowski’s α-numeration system)

Let α be a non rational real number, and let (q_n) be the sequence of the denominators of the convergents of the continued fraction of α. Then every non-negative integer n can be represented uniquely in the form

$$ \begin{array}{@{}rcl@{}} n=\underset{0\leq i \leq j}{\sum}\varepsilon_{i} q_{i} \end{array} $$

where the ε_i are integers satisfying the following conditions:

0 ≤ b₀ < a₁

0 ≤ b_i ≤ a_i+ 1, for i ≥ 1.

For i ≥ 1, if b_i = a_i+ 1 then b_i− 1 = 0.

For the specific case α = φ, we have the Zeckendorf expansion since φ = [1,1,1,…].

Our result might be generalized to a quadratic irrational α such that the continued fraction of α is of the form α = [1,a,a,…]. In this particular case, we have $q_{n+2}=\frac {1}{a}q_{n+1}+q_{n}$. We denote by γ the zero of the polynomial $x^{2}-\frac {1}{a}x-1$ such that its Galois conjugate $\overline {\gamma }$ verifies $|\overline {\gamma }|<|\gamma |$. Thus we have $q_{n}=\frac {1}{\sqrt {a^{-2}+4}}(\gamma ^{n}-\overline {\gamma }^{n})$, an analogous formula that for Fibonacci numbers. We define an analogue of Lucas numbers by $L^{\prime }_{n}=\gamma ^{n}+\overline {\gamma }^{n}$ for all n ≥ 0. Then, in this particular case, our new notion of “Lucas numbers” verifies $L^{\prime }_{n}=q_{n+1}+q_{n-1}$ and an analogous formula of Lemma 3 holds true. So it might be possible to generalize our result in this case since no more input in the proof is needed.

3.2 Conjectures

We can compute the maximum order complexity thanks to Blumer’s DAWG (Direct Acyclic Weighted Graph) algorithm (see [3] and [12]). The C++ program that computes the maximum order complexity of a sequence is available on the web page of the second author.¹ By using this program, we are able to formulate some heuristically supported conjectures.

Conjecture 3.1

We conjecture that $M(\mathcal {S}_{Z},N) \sim \frac {1}{1+\varphi ^{2}}N$. Figure 1 indicates that the maximum order complexity for $\mathcal {S}_{Z}$ is linear with coefficient $\frac {1}{1+\varphi ^{2}}=0.27639\ldots $

Let us compare the maximum order complexity along polynomial subsequences of the Thue–Morse sequence to the Zeckendorf expansion sequence.

The plots for squares and cubes (and the large values of R²), see Figs. 2 and 3, indicate that the growth is close to x^1/2 and x^1/3, respectively. We therefore formulate the following conjecture :

Conjecture 3.2

The Thue–Morse sequence along polynomial subsequences, denoted by $\mathcal {T}_{P}$ for a polynomial P of degree d, verifies $M(\mathcal {T}_{P},N)\asymp N^{1/d}$, i.e. there are c,C > 0 such as for all N large enough we have

$$ cN^{1/d}\leq M(\mathcal{T}_{P},N) \leq CN^{1/d}. $$

A proof of this conjecture would imply that the lower bound proved by Popoli [21] is optimal.

The maximum order complexity of $\mathcal {S}_{\varphi }$ is algorithmically more difficult to handle. Motivated by the results on Thue–Morse we conjecture the following:

Conjecture 3.3

The sequence $\mathcal {S}_{Z}$ along polynomial subsequences, denoted by $\mathcal {S}_{Z,P}$ for a polynomial P of degree d ≥ 2, verifies $M(\mathcal {S}_{Z,P},N)\asymp N^{1/(2d)}$, i.e. there are c,C > 0 such as for all N large enough we have

$$ cN^{1/(2d)}\leq M(\mathcal{S}_{Z,P},N) \leq CN^{1/(2d)}. $$

This is, to some extent, supported by the plot for squares, see Fig. 4. Again, a proof of this conjecture would imply that the lower bound in Theorem 1.2 is sharp. Notice that there is a larger gap for the maximum order complexity between the linear case and the quadratic case for $\mathcal {S}_{Z}$ compared to the classical Thue–Morse sequence.

We conclude the discussion by a surprising phenomenon on steps. Since the maximum order complexity is integer-valued and an increasing function, it is a step function. Each time $M(\mathcal {S},N)$ has a different value from $M(\mathcal {S},N-1)$, we say that N is a step. The ratio of successive steps is the ratio N₁/N₂ for two steps N₁,N₂ such that there is no N ≥ 1 with $M(\mathcal {S},N_{1})<M(\mathcal {S},N)<M(\mathcal {S},N_{2})$.

We observed that the ratio of successive steps seems to converge.

Conjecture 3.4

The ratio of successive steps for the maximum order complexity of a sequence related to the Thue–Morse sequence, respectively, the sequence related to the Zeckendorf sum of digits, tends to 2, respectively, the golden ration φ.

This phenomenon seems to be related to numeration systems, and not directly to the fact that a sequence is automatic or morphic.

N	Ratio of steps in in Fig. 1.
2	–
5	2,5
12	2,4
19	1,583333333
30	1,578947368

48	1,6
77	1,604166667
124	1,61038961
200	1,612903226
323	1,615
522	1,616099071
844	1,616858238
1365	1,617298578
2208	1,617582418
3572	1,617753623
5779	1,617861142
9350	1,617926977
15128	1,617967914
24477	1,617993125
39604	1,618008743
64080	1,618018382
103683	1,618024345
167762	1,618028028
271444	1,618030305
439205	1,618031712
710648	1,618032582

N	Ratio of steps in Fig. 2.
2	–
4	2
10	2,5
23	2,3
37	1,608695652
52	1,405405405
73	1,403846154
138	1,890410959
270	1,956521739
530	1,962962963
1048	1,977358491
2082	1,986641221
4146	1,991354467
8258	1,991799325
16478	1,995398402
32898	1,996480155
65720	1,997689829
131330	1,998326233
262510	1,998857839

524802	1,999169555
1049302	1,999424545
2098178	1,999594016
4195754	1,999713084
8390658	1,999797414

N	Ratio of steps in Fig. 4.
2	–
4	2
6	1,5
8	1,33333333
18	2,25
24	1,33333333
42	1,75
59	1,40476191
171	2,89830509
448	2,61988304
18367	40,9977679
22118	1,20422497
28371	1,28271091
83517	2,94374538
167773	2,00884850
317825	1,89437514
439221	1,38195863
832054	1,89438574
1346308	1,61805364
2178338	1,61800866
3524634	1,61803816
5702922	1,61801821
9227522	1,61803405
14930387	1,61802779
24157917	1,61803689
39088244	1,61803040
63246131	1,61803460
102334245	1,61803170
165580287	1,61803399
267914386	1,61803311
433494698	1,61803442
701408926	1,61803346

N	Ratio of steps in Fig. 5.
2	–
9	4,5
37	4,11111111
59	1,59459460
67	1,13559322
167	2,49253731
273	1,63473054
1677	6,14285714
1960	1,16875373
2178	1,11122449
6378	2,92837466
11095	1,73957353
17046	1,53636773
20398	1,19664437
40879	2,00406903
57958	1,41779398
510998	8,81669485
567415	1,11040552
1356065	2,38989981
1390403	1,02532180
2264908	1,62895794
2860200	1,26283275
6783276	2,37160898
9626340	1,41912846
54170633	5,62733427
86568697	1,59807431
175721520	2,02985058
449510050	2,55808196

The plot for cubes, see Fig. 5, does not enlighten Conjecture 3.3 and Conjecture 3.4. With our program and our machine, it is not possible to compute the maximum order complexity of a sequence any further than 10⁹ terms. Nevertheless, we believe that if it were possible to compute for a few more terms, both of theses conjectures should appear more clearly.

To perform all these plots, we used the cluster yeti that consists of 4 nodes, 4 x Intel Xeon Gold 6130 CPU and 16 cores / CPU, with 768 GiB of memory, see https://www.grid5000.fr/w/Grenoble:Hardware#yeti.

Acknowledgements

We would like to thank Arne Winterhof for very helpful discussions and comments on a preliminary version of the paper, and for sending us the bachelor thesis of his student Jan-Michael Holzinger. This work was supported partly by the French PIA project “Lorraine Université d’Excellence”, reference ANR-15-IDEX-04-LUE, and by the projects ANR-18-CE40-0018 (EST) and ANR-20-CE91-0006 (ArithRand).

Declarations

Conflict of Interests

The authors declare that they have no conflict of interest.

Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

previous article New nonexistence results on circulant weighing matrices

next article Formal self duality

https://iecl.univ-lorraine.fr/membre-iecl/popoli-pierre/

Allouche, J.P., Han, G.N., Niederreiter, H.: Perfect linear complexity profile and apwenian sequences. Finite Fields Appl. 68(101761), 13 (2020). https://doi.org/10.1016/j.ffa.2020.101761MathSciNetMATH

Allouche, J.P., Shallit, J.: Automatic sequences. Cambridge University Press, Cambridge (2003). https://doi.org/10.1017/CBO9780511546563. Theory, applications, generalizationsCrossRef

Blumer, A., Blumer, J., Haussler, D., Ehrenfeucht, A., Chen, M.T., Seiferas, J.: The smallest automaton recognizing the subwords of a text. pp. 31–55. https://doi.org/10.1016/0304-3975(85)90157-4. Special issue: Eleventh international colloquium on automata, languages and programming (Antwerp, 1984) (1985)

Bruyère, V.: Automata and numeration systems. Sém Lothar. Combin. 35, Art. B35b, approx. 19 (1995)MathSciNetMATH

Bugeaud, Y., Mignotte, M., Siksek, S.: Classical and modular approaches to exponential Diophantine equations. I. Fibonacci and Lucas perfect powers. Ann. of Math. (2) 163(3), 969–1018 (2006). https://doi.org/10.4007/annals.2006.163.969MathSciNetCrossRef

Christol, G.: Ensembles presque periodiques k-reconnaissables. Theoret. Comput. Sci. 9(1), 141–145 (1979). https://doi.org/10.1016/0304-3975(79)90011-2MathSciNetCrossRef

Diem, C.: On the use of expansion series for stream ciphers. LMS J. Comput. Math. 15, 326–340 (2012). https://doi.org/10.1112/S146115701200109XMathSciNetCrossRef

Drmota, M., Mauduit, C., Rivat, J.: Normality along squares. J. Eur. Math. Soc. (JEMS) 21(2), 507–548 (2019). https://doi.org/10.4171/JEMS/843MathSciNetCrossRef

Drmota, M., Müllner, C., Spiegelhofer, L.: Möbius orthogonality for the Zeckendorf sum-of-digits function. Proc. Amer. Math. Soc. 146(9), 3679–3691 (2018). https://doi.org/10.1090/proc/14015MathSciNetCrossRef

10.

Golomb, S. W.: Shift register sequences. With portions co-authored by Lloyd R. Welch, Richard M. Goldstein, and Alfred W. Hales. Holden-Day, Inc., San Francisco, Calif.-Cambridge-Amsterdam (1967)

11.

Işık, L., Winterhof, A.: Maximum-order complexity and correlation measures. Cryptography 1. https://doi.org/10.3390/cryptography1010007 (2017)

12.

Jansen, C.J.A.: Investigations on nonlinear streamcipher systems: Construction and evaluation methods. ProQuest LLC, Ann Arbor, MI. Thesis (Dr.) – Technische Universiteit Delft (The Netherlands) (1989)

13.

Jansen, C.J.A.: The maximum order complexity of sequence ensembles. In: Davies, D.W. (ed.) Advances in Cryptology — EUROCRYPT ’91, pp 153–159. Springer, Berlin (1991)

14.

Mauduit, C.: Multiplicative properties of the Thue-Morse sequence. Period. Math. Hungar. 43(1-2), 137–153 (2001). https://doi.org/10.1023/A:1015241900975MathSciNetMATH

15.

Mauduit, C., Sárközy, A.: On finite pseudorandom binary sequences. II. The Champernowne, Rudin-Shapiro, and Thue-Morse sequences, a further construction. J. Number Theory 73(2), 256–276 (1998). https://doi.org/10.1006/jnth.1998.2286MathSciNetCrossRef

16.

Mérai, L., Niederreiter, H., Winterhof, A.: Expansion complexity and linear complexity of sequences over finite fields. Cryptogr. Commun. 9(4), 501–509 (2017). https://doi.org/10.1007/s12095-016-0189-2MathSciNetCrossRef

17.

Mérai, L., Winterhof, A.: Pseudorandom sequences derived from automatic sequences. arXiv:2105.030862105.03086 (2021)

18.

Morse, H.M.: Recurrent geodesics on a surface of negative curvature. Trans. Amer. Math. Soc. 22(1), 84–100 (1921). https://doi.org/10.2307/1988844MathSciNetCrossRef

19.

Moshe, Y.: On the subword complexity of Thue-Morse polynomial extractions. Theoret. Comput. Sci. 389(1-2), 318–329 (2007). https://doi.org/10.1016/j.tcs.2007.10.015MathSciNetCrossRef

20.

Niederreiter, H., Xing, C.: Sequences with high nonlinear complexity. IEEE Trans. Inform. Theory 60(10), 6696–6701 (2014). https://doi.org/10.1109/TIT.2014.2343225MathSciNetCrossRef

21.

Popoli, P.: On the maximum order complexity of Thue-Morse and Rudin-Shapiro sequences along polynomial values. Unif. Distrib. Theory 15(2), 9–22 (2020). https://doi.org/10.2478/udt-2020-0008MathSciNetCrossRef

22.

Prouhet, E.: Mémoire sur quelques relations entre les puissances des nombres. C. R. Acad. Sc. Paris 33, 225 (1851)

23.

Queffélec, M.: Substitution dynamical systems—spectral analysis. Lecture Notes in Mathematics, vol. 1294. Springer, Berlin (1987). https://doi.org/10.1007/BFb0081890

24.

Shallit, J.: Subword complexity of the Fibonacci-Thue-Morse sequence: The proof of Dekking’s conjecture. Indag. Math. (N.S.) 32(3), 729–735 (2021). https://doi.org/10.1016/j.indag.2021.03.004MathSciNetCrossRef

25.

Shutov, A.: On the sum of digits of the Zeckendorf representations of two consecutive numbers. Fibonacci Quart. 58(3), 203–207 (2020)MathSciNetMATH

26.

Stoll, T.: Combinatorial constructions for the Zeckendorf sum of digits of polynomial values. Ramanujan J. 32(2), 227–243 (2013). https://doi.org/10.1007/s11139-012-9422-6MathSciNetCrossRef

27.

Sun, Z., Winterhof, A.: On the maximum order complexity of subsequences of the Thue-Morse and Rudin-Shapiro sequence along squares. Int. J. Comput. Math. Comput. Syst. Theory 4(1), 30–36 (2019). https://doi.org/10.1080/23799927.2019.1566275MathSciNetCrossRef

28.

Sun, Z., Winterhof, A.: On the maximum order complexity of the Thue-Morse and Rudin-Shapiro sequence. Unif. Distrib. Theory 14(2), 33–42 (2019)MathSciNetCrossRef

29.

Sun, Z., Zeng, X., Lin, D.: On the N th maximum order complexity and the expansion complexity of a Rudin-Shapiro-like sequence. Cryptogr. Commun. 12(3), 415–426 (2020). https://doi.org/10.1007/s12095-019-00396-0MathSciNetCrossRef

30.

Thue, A.: Über die gegenseitige Lage gleicher Teile gewisser Zeichenreihen. Skrifter udgivne af Videnskabsselskabet i Christiania. 1, Math.Nat.wiss.Kl.1912,1 Jacob Dybwad (1912)

31.

Topuzoğlu, A., Winterhof, A.: Pseudorandom sequences Topics in geometry, coding theory and cryptography. Springer, Dordrecht, https://doi.org/10.1007/1-4020-5334-4_4, vol. 6, pp 135–166 (2007)

32.

Zeckendorf, E.: Représentation des nombres naturels par une somme de nombres de Fibonacci ou de nombres de Lucas. Bull. Soc. Roy. Sci. Liè,ge 41, 179–182 (1972)MathSciNetMATH

Title: Maximum order complexity of the sum of digits function in Zeckendorf base and polynomial subsequences
Authors: Damien Jamet
Pierre Popoli
Thomas Stoll
Publication date: 07-07-2021
Publisher: Springer US
Published in: Cryptography and Communications / Issue 5/2021
Print ISSN: 1936-2447
Electronic ISSN: 1936-2455
DOI: https://doi.org/10.1007/s12095-021-00507-w

Springer Professional

Maximum order complexity of the sum of digits function in Zeckendorf base and polynomial subsequences

Abstract

Publisher’s note

1 Introduction

1.1 Measures of complexity

1.2 Automatic and morphic sequences

1.3 Main results

2 Sum of digits function in Zeckendorf base

2.1 Linear case

2.2 Monomial subsequences

2.3 Polynomial subsequences

3 Final remarks

3.1 Generalizations

3.2 Conjectures

Acknowledgements

Declarations

Conflict of Interests

Publisher’s note

Premium Partner

Springer Professional

Abstract

Publisher’s note

1 Introduction

1.1 Measures of complexity

1.2 Automatic and morphic sequences

1.3 Main results

2 Sum of digits function in Zeckendorf base

2.1 Linear case

2.2 Monomial subsequences

2.3 Polynomial subsequences

3 Final remarks

3.1 Generalizations

3.2 Conjectures

Acknowledgements

Declarations

Conflict of Interests

Publisher’s note

Other articles of this Issue 5/2021

2-Adic complexity of two constructions of binary sequences with period 4N and optimal autocorrelation magnitude

New nonexistence results on circulant weighing matrices

Quantum synchronizable codes from the Whiteman’s generalized cyclotomy

Quasi-cyclic constructions of asymmetric quantum error-correcting codes

Several classes of minimal binary linear codes violating the Ashikhmin-Barg bound

Efficient generation of quadratic cyclotomic classes for shortest quadratic decompositions of polynomials

Premium Partner