1 Introduction
There are various methods to study the complexity of symbolically defined dynamical systems. Block (word) complexity, which counts the number of blocks of size k which appear in a subshift, was introduced in 1938 by Morse and Hedlund as a way to determine the ‘size’ of different subshifts. Their first paper, [Reference Morse and HedlundMH38], contains the proof of the Morse–Hedlund theorem, which states that any aperiodic sequence must have block complexity at least
$k+1$
for all
$k \in \mathbb {N}$
. Their second paper, [Reference Morse and HedlundMH40], introduces Sturmian sequences, defined to be infinite sequences over two letters that have block complexity
$k+1$
for all
$k \in \mathbb {N}$
and that can be represented as a coding of an irrational circle rotation.
For larger alphabets, a similar proof shows that the block complexity of an aperiodic sequence over
$\ell \geq 2$
letters must be at least
$k +\ell - 1$
. A Sturmian sequence over
$\ell \geq 2$
letters is defined to be an aperiodic sequence with block complexity
$k+\ell - 1$
for all
$k \in \mathbb {N}$
[Reference Pytheas FoggFog02].
The maximal pattern complexity of an infinite sequence
$\alpha $
, denoted
$p_{\alpha }^{*}(k)$
, was introduced by Kamae and Zamboni in 2002 in [Reference Kamae and ZamboniKZ02b] as a way to link word complexity and sequence entropy. In their first paper, they showed that the maximal pattern complexity of any aperiodic sequence must be at least
$2k$
for all
$k \in \mathbb {N}$
, which is a direct analog to the Morse–Hedlund theorem.
In [Reference Kamae and ZamboniKZ02b], they also introduced pattern Sturmian sequences as an analog to Sturmian sequences and defined pattern Sturmian sequences over two letters to be aperiodic sequences with the lowest possible maximal pattern complexity,
$2k$
. In [Reference Kamae and ZamboniKZ02a], they furthered their study of pattern Sturmian sequences, showing that sequences generated by a coding of an irrational circle rotation by any two interval partitions of a circle and simple Toeplitz sequences are examples of pattern Sturmian sequences over two letters.
In 2006, Kamae and Rao extended the study of maximal pattern complexity for sequences over finite alphabets of size greater than two in [Reference Kamae and RaoKR06]. They determined that a sequence over
$\ell \geq 2$
letters which has maximal pattern complexity less than
$\ell k$
for some
$k \in \mathbb {N}$
is periodic by projection, meaning there is a non-constant, coordinatewise, factor map f that maps the sequence to a periodic sequence. They extended the definition of pattern Sturmian sequences over two letters to pattern Sturmian sequences over
$\ell \geq 2$
letters, defining them to be sequences of maximal pattern complexity
$\ell k$
that are not periodic by projection.
As stated in their paper, their results hold for sequences in which every letter appears infinitely often. We will make this same assumption for all sequences in this paper, since the ability for a letter to show up a finite number of times can be eliminated with a finite shift.
Although [Reference Kamae and RaoKR06] isolates the minimal maximal pattern complexity for aperiodic sequences which are not periodic by projection, there are aperiodic sequences with lower maximal pattern complexity. The goal of this work is to isolate the lowest maximal pattern complexity for any aperiodic sequence and determine the structure of such sequences. We define a strong pattern Sturmian sequence over
$\ell \geq 2$
letters to be an aperiodic sequence with maximal pattern complexity
$2k + \ell - 2$
. As seen in Theorem 2.11, this is the lowest possible maximal pattern complexity an aperiodic sequence can have (given all letters of the alphabet appear in the sequence) and this notion aligns closely with the definition of a Sturmian sequence over
$\ell $
letters. Furthermore, since the notions of pattern Sturmian sequences and strong pattern Sturmian sequences coincide when the alphabet is two letters, we will restrict our study to sequences for which the size of the alphabet is greater than two.
In this paper, we introduce three main results. The first result, Theorem 1.1, gives a general structure for aperiodic sequences with low maximal pattern complexity. Example 6.3 demonstrates that the bound of
$4k$
in Theorem 1.1 cannot be substantially improved.
Theorem 1.1. Let
$\alpha $
be an aperiodic sequence such that
$\#\mathcal {A}_{\alpha } = \ell \geq 3$
. If
then there exists some decomposition of
$\alpha $
into
$m>1$
residues
such that exactly one of these residues is aperiodic over eventually only two letters, a and b, and any residue containing finitely many a and b symbols is eventually constant.
If
$\alpha $
is uniformly recurrent, then the bound in (1.1) can be improved to
$4k$
.
The second result, Theorem 1.2, states that any strong pattern Sturmian sequence must consist of a pattern Sturmian sequence over two letters threaded together with constant sequences. Example 6.1 demonstrates the existence of sequences which are strong pattern Sturmian.
Theorem 1.2. Let
$\alpha $
be an aperiodic sequence such that
$\#\mathcal {A}_{\alpha } = \ell \geq 3$
. If
$p_{\alpha }^*(k) =\, 2k + \ell -2$
for all
$k \in \mathbb {N}$
, then the aperiodic residue from Theorem 1.1 must be a pattern Sturmian over two letters and all other residues must be constant.
The third result, Theorem 1.3, is a partial converse of Theorem 1.2, which states that a sequence that can be decomposed into a pattern Sturmian sequence threaded together with constant sequences must have maximal pattern complexity
$2k+C$
for some constant
$C \in \mathbb {N}$
. Example 6.2 demonstrates that a literal converse to Theorem 1.2 is untrue. When
$C=\ell - 2$
, the sequence is strong pattern Sturmian.
Theorem 1.3. Let
$\alpha $
be an aperiodic sequence such that
$\#\mathcal {A}_{\alpha } = \ell \geq 3$
. If
$\alpha $
can be decomposed into residues such that one residue is a pattern Sturmian over two letters and all other residues are constant, then for sufficiently large k,
$p_{\alpha }^*(k) = 2k + C$
for some
$C \in \mathbb {N}$
with
$C \geq \ell -2$
.
It is important to note that Theorem 1.1 allows for certain residues to be eventually periodic over two letters, but any non-constant periodic residue must be periodic over the alphabet of the aperiodic residue. However, Theorem 1.2 requires that any residue be aperiodic or constant. Strong pattern Sturmian sequences also cannot have eventually constant residues and the aperiodic residue can only witness two letters.
It is also important to recognize that in the case where
$\ell = 3$
, these results give a general structure for all aperiodic sequences
$\alpha $
that have maximal pattern complexity less than
$3k$
. These sequences require a single aperiodic residue that is eventually over two letters, a and b, threaded together with eventually constant sequences of the third letter c.
2 Definitions and preliminaries
This section is dedicated to introducing definitions and results needed in the following sections.
Definition 2.1. Let
$\mathcal {A}$
be a finite set and
$\alpha = \alpha _0\alpha _1\alpha _2\ldots \in \mathcal {A}^{\mathbb {N}_0}$
. Then,
$\alpha $
is a (one-sided) infinite sequence over finite alphabet
$\mathcal {A}$
.
As mentioned in the introduction, we only consider sequences in which a letter appearing once in the sequence means the letter appears infinitely in the sequence.
Throughout this paper,
$\mathcal {A}^{\mathbb {N}_0}$
is endowed with the product topology.
Definition 2.2. Let
$\sigma : \mathcal {A}^{\mathbb {N}_0} \to \mathcal {A}^{\mathbb {N}_0}$
be the shift map defined by
$\sigma (\alpha _0\alpha _1\alpha _2\ldots ) = \alpha _1\alpha _2\ldots $
. We say that a finite shift of
$\alpha $
is the sequence
$\sigma ^t(\alpha ) = \alpha _t\alpha _{t+1}\alpha _{t+2}\ldots $
for any integer
$t \in \mathbb {N}_0$
. We say that the orbit of
$\alpha $
, denoted
$\mathcal {O}(\alpha )$
, is the set
$\mathcal {O}(\alpha ) :=\{\sigma ^t(\alpha ): t \in \mathbb {N}_0\}$
. The orbit closure of
$\alpha $
is the orbit of
$\alpha $
together with its limit points and is denoted
$\overline {\mathcal {O}(\alpha )}$
.
While
$\alpha $
uses letters from
$\mathcal {A}$
, it may not use all letters from
$\mathcal {A}$
. Thus, to denote the letters that appear in an infinite sequence
$\alpha $
, we use
Definition 2.3. A sequence
$\alpha $
is periodic if there exists some
$t \in \mathbb {N}$
such that
$\alpha = \sigma ^t(\alpha )$
. A sequence
$\alpha $
is eventually periodic if there exists some
$t \in \mathbb {N}$
such that
$\sigma ^t(\alpha )$
is periodic. A sequence
$\alpha $
is aperiodic if
$\alpha $
is not periodic or eventually periodic.
Definition 2.4. A sequence
$\alpha $
is recurrent if for all
$L \in \mathbb {N}$
, there exists some
$M \in \mathbb {N}$
such that
$\alpha _0\ldots \alpha _{L-1} = \alpha _{M}\ldots \alpha _{M+L-1}$
. If for all
$L \in \mathbb {N}$
, the set
$R_L=\{M: \alpha _0\ldots \alpha _{L-1} = \alpha _{M}\ldots \alpha _{M+L-1}\}$
is syndetic (there are bounded gaps between subsequent elements), then
$\alpha $
is uniformly recurrent.
Definition 2.5. A
${k}$
-sized window
${\tau }$
is an increasing set of k indices beginning with zero:
We say a
$(k+1)$
-sized window
$\tau '$
is an immediate extension of a k-sized window
$\tau $
if
$\tau ' = \{\tau (0), \tau (1), \ldots , \tau (k-1), h\}$
for some
$h> \tau (k-1)$
.
Given a k-sized window
$\tau $
, we use the notation
$m\tau $
to denote the k-sized window
$m\tau : = \{m\tau (0), m\tau (1),\ldots , m\tau (k-1)\}$
.
Definition 2.6. A
${\tau }$
-word over
${\alpha }$
is a finite word of length k denoted
$\alpha [n + \tau ]$
and defined as
for some
$n \in \mathbb {N}_0$
.
The set of all
${\tau }$
-words over
${\alpha }$
is the set denoted by
$F_{\alpha }(\tau )$
and defined by
Since
$F_{\alpha }(\tau )$
is finite, there exists some
$L \in \mathbb {N}$
such that
$F_{\alpha }(\tau ) = \{\alpha [n + \tau ] : n \in \mathbb {N} \cap [0, L)\}$
.
Definition 2.7. The maximal pattern complexity of an infinite sequence
$\alpha $
is the function
$p_{\alpha }^*:\mathbb {N} \to \mathbb {N}$
defined by
where the supremum is taken over all k-sized windows
$\tau $
.
We note that for any
$k \in \mathbb {N}$
, there exists some k-sized window
$\tau $
such that
${p_{\alpha }^*(k) = \#F_{\alpha }(\tau )}$
and we call any such window
$\tau $
a maximal k-sized window
${\tau }$
.
In [Reference Kamae and ZamboniKZ02b], Kamae and Zamboni proved the following two theorems for maximal pattern complexity, the first that acts as an analog to the Morse–Hedlund theorem for block complexity and the second that relates the number of
$\tau $
-words to the number of
$\tau '$
-words in an aperiodic sequence
$\alpha $
for some immediate extension
$\tau '$
of
$\tau $
.
Theorem 2.8. [Reference Kamae and ZamboniKZ02b]
An infinite sequence
$\alpha \in \mathcal {A}^{\mathbb {N}_0}$
for a finite alphabet
$\mathcal {A}$
is eventually periodic if and only if
$p_{\alpha }^*(k) \leq 2k - 1$
for some
$k \in \mathbb {N}$
.
Theorem 2.9. [Reference Kamae and ZamboniKZ02b]
For an arbitrary word
$\alpha $
over a finite set
$\mathcal {A}$
, if there exists
${k \in \mathbb {N}}$
and a window
$\tau $
of size k such that
$\#F_{\alpha }(\tau ') \leq \#F_{\alpha }(\tau ) + 1$
holds for any immediate extension
$\tau '$
of
$\tau $
, then
$\alpha $
is eventually periodic.
The following fact is an extension of the Morse–Hedlund theorem for any window
$\tau $
, which follows from the same proof.
Fact 2.10. For any aperiodic sequence
$\alpha $
, any
$k \in \mathbb {N}$
, and any k-sized window
$\tau $
,
$\#F_{\alpha }(\tau ) \geq k+1$
.
Using Theorem 2.9, it is easy to extend Theorem 2.8 to the case where
$\mathcal {A}_{\alpha } = \mathcal {A}$
as shown in the following theorem.
Theorem 2.11. If an infinite sequence
$\alpha $
is aperiodic and
$\#\mathcal {A}_{\alpha } = \ell \geq 2$
, then
$p_{\alpha }^*(k) \geq 2k + \ell -2$
for all
$k \in \mathbb {N}$
.
Proof. Suppose
$\alpha $
is aperiodic and
$\#\mathcal {A}_{\alpha } = \ell \geq 2$
. Clearly, since
$\#\mathcal {A}_{\alpha } = \ell $
,
$p_{\alpha }^*(1) = \ell $
.
Suppose for some
$k> 1$
, we have
$p_{\alpha }^*(k) \geq 2k + \ell -2$
.
Let
$\tau $
be a maximal k-sized window for
$\alpha $
. Then,
$\#F_{\alpha }(\tau ) \geq 2k + \ell - 2$
. By Theorem 2.9, since
$\alpha $
is aperiodic, there exists some immediate extension
$\tau '$
of
$\tau $
such that
$\#F_{\alpha }(\tau ') \geq \#F_{\alpha }(\tau ) + 2$
. Thus,
Hence, for all
$k \in \mathbb {N}$
,
$p_{\alpha }^*(k) \geq 2k + \ell -2$
.
In [Reference Kamae and ZamboniKZ02b], Kamae and Zamboni defined the maximal pattern complexity analog for a Sturmian sequence over a two-letter alphabet.
Definition 2.12. A pattern Sturmian sequence over a two-letter alphabet is an aperiodic sequence
$\alpha $
for which
$p_{\alpha }^*(k) = 2k$
for all
$k \in \mathbb {N}$
.
Kamae and Rao introduced the singular decomposition of an infinite sequence in [Reference Kamae and RaoKR06]. In what follows is a quick outline of this process, but a more in depth exploration can be found in [Reference Kamae and RaoKR06].
In Definition 2.15, the singular decomposition of a recurrent sequence is defined and in Definition 2.17, the singular decomposition of a non-recurrent sequence is defined. A singular decomposition requires a recurrent sequence, so for a non-recurrent sequence
$\alpha $
, we need to find some recurrent sequence
$\beta $
in the orbit closure of
$\alpha $
that satisfies some technical conditions outlined in Definition 2.16, then decompose
$\alpha $
with respect to the decomposition of
$\beta $
. The following definitions outline the singular decomposition of any sequence
$\alpha $
.
Definition 2.13. A sequence
$\alpha $
is periodic by projection if there exists some proper, non-empty subset of the alphabet,
$S \subset \mathcal {A}$
, such that the sequence
$1_{S}(\alpha _0)1_{S}(\alpha _1)1_{S}(\alpha _2)\ldots \in \{0, 1\}^{\mathbb {N}_0}$
is eventually periodic.
A letter
$b \in \mathcal {A}_{\alpha }$
is singular if the sequence
$1_{\{b\}}(\alpha _0)1_{\{b\}}(\alpha _1)1_{\{b\}}(\alpha _2)\ldots \in \{0, 1\}^{\mathbb {N}_0}$
is periodic.
Definition 2.14. Let
$\beta $
be some recurrent sequence. We say
$m \in \mathbb {N}$
is the decomposition cycle of
$\beta $
if m is the smallest integer which is a period of
$1_{\{b\}}(\alpha _0)1_{\{b\}}(\alpha _1)1_{\{b\}}(\alpha _2)\ldots $
for each singular letter b of
$\beta $
. If
$\beta $
has no singular letters, then we set
$m=1$
.
Definition 2.15. The singular decomposition of a recurrent sequence
$\beta $
is the set of sequences
where m is the decomposition cycle of
$\beta $
.
Definition 2.16. Choose some
$r \in \{0, 1, \ldots , m-1\}$
such that
$\lim _{n \to \infty } \sigma ^{mk_n+r}(\alpha )$
converges for some sequence
$(k_n)_{n\in \mathbb {N}}$
. Let
$\beta = \sigma ^{m-r}(\lim _{n \to \infty } \sigma ^{mk_n+r}(\alpha ))$
. Here,
$\beta $
is an auxiliary word of
${\alpha }$
if the following conditions are true:
-
(1)
$\beta $
is recurrent; -
(2)
$\beta \in \overline {\mathcal {O}(\alpha )}$
; -
(3)
$\beta ^{(i)} \in \overline {\mathcal {O}(\alpha ^{(i)})}$
for all
$i \in \mathbb {N} \cap [0, m-1]$
.
In [Reference Kamae and RaoKR06], Kamae and Rao proved that any aperiodic sequence has at least one auxiliary word.
Note that if
$\beta $
is an auxiliary word of
$\alpha $
, then
$p_{\alpha }^*(k) \geq p_{\beta }^*(k)$
for all
$k \in \mathbb {N}$
since for any k-sized window
$\tau $
,
$F_{\beta }(\tau ) \subseteq F_{\alpha }(\tau )$
.
Definition 2.17. The singular decomposition of a non-recurrent word
${\alpha }$
with respect to an auxiliary word
${\beta }$
is the set of sequences
for m the decomposition cycle of the auxiliary word
$\beta $
.
Definition 2.18. Define an undirected graph
$\Gamma $
on the residues of the singular decomposition of
$\alpha $
by
$V(\Gamma ) = \{0, 1, 2, \ldots , m-1\}$
, the residues of
$\alpha $
, and
$E(\Gamma ) = \{(i, j) : \mathcal {A}_{\alpha ^{(i)}} \cap \mathcal {A}_{\alpha ^{(i)}} \neq \emptyset \}$
.
Definition 2.19. For any infinite sequence
$\alpha $
, k-sized window
$\tau $
, and residue i,
$0 \leq i \leq m-1$
for m the decomposition cycle of
$\alpha $
, we define the set of
${\tau }$
-words beginning on the ith residue as the set
We note that
$\bigcup _{i=0}^{m-1} C_i(\tau , \alpha ) = F_{\alpha }(\tau )$
.
In [Reference Kamae and RaoKR06], Kamae and Rao also proved the following theorems about the maximal pattern complexity of sequences and found a threshold under which a non-trivial singular decomposition is forced.
Theorem 2.20. [Reference Kamae and RaoKR06]
Let
$\beta $
be a recurrent sequence over
$\ell $
letters,
$\ell \geq 2$
. If
${p_{\beta }^*(k) < \ell k}$
holds for some
$k \in \mathbb {N}$
, then
$\beta $
contains at least one singular letter.
Theorem 2.21. [Reference Kamae and RaoKR06]
If
$\ell \geq 2$
and
$p_{\alpha }^*(k)< \ell k$
holds for some
$k \in \mathbb {N}$
, then
$\Gamma $
is disconnected.
Definition 2.22. [Reference Kamae and RaoKR06]
A sequence
$\alpha $
over
$\ell \geq 2$
letters is called a pattern Sturmian sequence if
$p_{\alpha }^*(k) = \ell k$
and
$\alpha $
is not periodic by projection.
We define the following slightly more restrictive condition.
Definition 2.23. A sequence
$\alpha $
over
$\ell \geq 2$
letters is called a strong pattern Sturmian sequence if
$p_{\alpha }^*(k) = 2k + \ell -2$
.
Clearly, if
$\ell = 2$
, the notions of pattern Sturmian and strong pattern Sturmian are the same.
In general, two sequences may have very different maximal windows. However, when both sequences are uniformally recurrent, the following theorem shows we can find a window which gives at least
$2k$
words for each sequence. This is a shared maximal window when both sequences are pattern Sturmian. We prove this with an argument from the proof of Theorem 2.9, which can be found in [Reference Kamae and ZamboniKZ02b].
Theorem 2.24. For any two aperiodic sequences
$\alpha $
and
$\beta $
that are uniformly recurrent and for each
$k \in \mathbb {N}$
, there exists some k-sized window
$\tau $
such that
$\#F_{\alpha }(\tau ) \geq 2k$
and
$\#F_{\beta }(\tau ) \geq 2k$
.
Proof. Consider the product space
$\overline {\mathcal {O}(\alpha )} \times \overline {\mathcal {O}(\beta )}$
with the map
$\sigma \times \sigma $
.
$\overline {\mathcal {O}(\alpha )} \times \overline {\mathcal {O}(\beta )}$
has a recurrent point (by a simple application of the Poincaré recurrence theorem), call it
$(\gamma , \eta )$
.
Since
$\alpha $
and
$\beta $
are uniformly recurrent,
$\gamma $
and
$\eta $
are both aperiodic.
Clearly,
$\gamma $
and
$\eta $
share a window of size one, and since they are both aperiodic, this window gives at least two words of length one for each sequence.
Now, suppose for some
$k>1$
, there exists a
$(k-1)$
-sized window
$\tau $
such that
$F_{\gamma }(\tau ) \geq 2(k-1)$
and
$F_{\eta }(\tau ) \geq 2(k-1)$
.
Let
$L \in \mathbb {N}$
such that
$\gamma _0\gamma _1\ldots \gamma _{L-1}$
and
$\eta _0\eta _1\ldots \eta _{L-1}$
witness all
$\tau $
words over
$\gamma $
and
$\eta $
, respectively. Since
$(\gamma , \eta )$
is recurrent, there exists
$n \in \mathbb {N}$
such that for all
$0 \leq i < L$
,
$(\gamma , \eta )_i = (\gamma , \eta )_{n + i}$
. Therefore, for all
$0 \leq i < L$
,
$\gamma _i = \gamma _{i+ n}$
and
$\eta _i = \eta _{i + n}$
.
Define
$\tau '$
to be an immediate extension of
$\tau $
by
$\tau '(i) = \tau (i)$
for all
$0 \leq i \leq k-2$
and
$\tau '(k-1) = n$
.
In the proof of Theorem 2.9 from [Reference Kamae and ZamboniKZ02b], the following fact was proved: if
$\delta $
is aperiodic and there is some
$\tau ' = \tau \cup \{n\}$
for some n such that
$awa \in \#F_{\delta }(\tau ')$
for all
${aw \in \#F_{\delta }(\tau )}$
, a a single letter, and w a
$k-2$
-sized
$\tau \setminus \{0\}$
word, then
$\#F_{\delta }(\tau ') \geq \#F_{\delta }(\tau ) + 2$
.
Thus, for
$\delta \in \{\gamma , \eta \}$
,
$\#F_{\delta }(\tau ') \geq \#F_{\delta }(\tau ) + 2$
, and by the inductive hypothesis,
$\#F_{\alpha }(\tau ') \geq 2k$
and
$\#F_{\beta }(\tau ') \geq 2k$
.
3 Recurrent case
The purpose of this section is to prove Theorem 1.1 in the case where
$\alpha $
is recurrent.
For the entirety of this section, we assume
$\alpha $
is aperiodic, recurrent,
$\#\mathcal {A}_{\alpha } = \ell \geq 3$
, and
$\liminf _{k \to \infty } p_{\alpha }^*(k) - 3k = -\infty $
(or in the case where
$\alpha $
is uniformly recurrent, that
$\liminf _{k \to \infty } p_{\alpha }^*(k) - 4k = -\infty $
).
By Theorem 2.20, since
$p_{\alpha }^*(k) < 3k \leq \ell k$
for some
$k\in \mathbb {N}$
,
$\alpha $
has a singular letter. Let m be the decomposition cycle of
$\alpha $
,
$m \geq 2$
since the decomposition is non-trivial by the existence of a singular letter.
Theorem 3.1. If
$\alpha $
is aperiodic, recurrent, and
$\liminf _{k \to \infty } p_{\alpha }^*(k) - 3k = -\infty $
, then exactly one residue in the singular decomposition of
$\alpha $
is aperiodic. In the case where
$\alpha $
is uniformly recurrent, the implication holds for
$4k$
instead of
$3k$
.
Proof. Suppose
$\alpha $
is aperiodic, recurrent, and
$\liminf _{k \to \infty } p_{\alpha }^*(k) - 3k = -\infty $
. Let m be the decomposition cycle of
$\alpha $
,
$m\geq 2$
.
Since
$\alpha $
is aperiodic, there exists some p,
$0 \leq p \leq m-1$
, such that
$\alpha ^{(p)}$
is aperiodic. Let
$k> m$
and let
$\tau $
be a maximal
$(k-(m-1))$
-sized window over
$\alpha ^{(p)}$
. Then,
$\#F_{\alpha ^{(p)}}(\tau ) \geq 2k - 2(m-1)$
by Theorem 2.8.
Define a k-sized window
$\tau '$
over
$\alpha $
by
$\tau ' = m\tau \cup \{h_i : 1 \leq i \leq m-1\}$
for
${h_1> m\tau (k-(m-1) -1)}$
,
$h_{i+1}> h_i$
for all
$1 \leq i \leq m-2$
, and
$h_i \equiv i \,\mod m$
for all
${1 \leq i \leq m-1}$
.
Define
$x \in (\mathcal {A}\cup \{?\})^{\mathbb {N}}$
by
$x_i = \alpha _i$
if
$\alpha _i$
is a singular letter and
$x_i = \,?$
if not. Then, x has least period m since a smaller least period for x would result in the singular letters of
$\alpha $
sharing a period smaller than m.
Therefore,
$C_i(\tau ', \alpha )\, \cap \, C_j(\tau ', \alpha ) = \emptyset $
for all
$0 \leq i < j \leq m-1$
since the position of the singular letters uniquely determines to which set
$C_i(\tau ', \alpha )$
a word belongs. Furthermore,
$\#C_p(\tau ', \alpha ) \geq 2k-2(m-1)$
since every word in
$F_{\alpha ^{(p)}}(\tau )$
must appear as a prefix for some word in
$C_p(\tau ', \alpha )$
.
Suppose toward a contradiction that there exists an i,
$0 \leq i \leq m-1$
and
$i \neq p$
, such that
$\alpha ^{(i)}$
is also aperiodic. Consider
$\tau $
over
$\alpha ^{(i)}$
. Since
$\alpha ^{(i)}$
is aperiodic, then by Fact 2.10,
$\#F_{\alpha ^{(i)}}(\tau ) \geq k - (m-1) +1$
. Thus,
$\#C_i(\tau ', \alpha ) \geq k-(m-1)+1$
, and since
$C_p(\tau ', \alpha )$
and
$C_i(\tau ', \alpha )$
are disjoint,
$\#F_{\alpha }(\tau ') \geq 3k-3(m-1) + 1$
. Therefore,
Since m is fixed, this is a contradiction.
In the case where
$\alpha $
is uniformly recurrent, any aperiodic residue must also be uniformly recurrent. This is because the appearance of singular letters in any block of
$\alpha $
forces the points where the block begins again to fall on exactly the same residue as the original block. Thus, blocks in each residue must also appear syndetically, as they directly correspond to the uniform recurrence of the original sequence. Therefore, if
$\alpha ^{(p)}$
and
$\alpha ^{(i)}$
are both aperiodic, then they are both uniformly recurrent, and by Theorem 2.24, there exists a
$(k-(m-1))$
-sized window
$\tau $
such that
$\#F_{\alpha ^{(p)}}(\tau ) \geq 2k - 2(m-1)$
and
$\#F_{\alpha ^{(i)}}(\tau ) \geq 2k - 2(m-1)$
. Define a k-sized window
$\tau '$
over
$\alpha $
by
$\tau ' = m\tau \cup \{h_i : 1 \leq i \leq m-1\}$
for
$h_1> m\tau (k-(m-1) -1)$
,
$h_{i+1}> h_i$
for all
$1 \leq i \leq m-2$
, and
$h_i \equiv i \,\mod m$
for all
$1 \leq i \leq m-1$
. Then,
$C_p(\tau ', \alpha ) \cap C_i(\tau ', \alpha ) = \emptyset $
,
$\#C_p(\tau ', \alpha ) \geq 2k - 2(m-1)$
, and
$\#C_i(\tau ', \alpha ) \geq 2k-2(m-1)$
. Therefore,
which is a contradiction since m is fixed.
Thus, exactly one residue of the singular decomposition of
$\alpha $
is aperiodic.
Proposition 3.2. If
$\alpha $
is aperiodic, recurrent, and
$\liminf _{k \to \infty } p_{\alpha }^*(k) - 3k = -\infty $
, then the aperiodic residue from Theorem 3.1 must be over two letters and for all
$c \in \mathcal {A}$
if c is not a letter in the aperiodic residue, c is singular for
$\alpha $
.
Proof. Let
$\alpha ^{(p)}$
be the aperiodic residue from Theorem 3.1,
$0 \leq p \leq m-1$
. Suppose toward a contradiction that
$\#\mathcal {A}_{\alpha ^{(p)}} \geq 3$
. Since for any
$k \in \mathbb {N}$
and k-sized window
$\tau $
over
$\alpha ^{(p)}$
,
$F_{\alpha ^{(p)}}(\tau ) \subset F_{\alpha }(m\tau )$
, then
$p_{\alpha ^{(p)}}^*(k) \leq p_{\alpha }^*(k) < 3k$
. Thus, by Theorem 2.20,
$\alpha ^{(p)}$
must have a singular letter, call it c. Since all other residues are periodic and c appears in
$\alpha ^{(p)}$
periodically, c must appear in
$\alpha $
periodically, and so c is singular for
$\alpha $
and thus cannot appear in
$\alpha ^{(p)}$
, which is a contradiction. Thus,
$\#\mathcal {A}_{\alpha ^{(p)}} = 2$
.
Suppose
$c \in \mathcal {A}$
,
$c \notin \mathcal {A}_{\alpha ^{(p)}}$
. Then, c only appears in residues which are periodic. Thus, c appears periodically in
$\alpha $
, so c is singular for
$\alpha $
.
Clearly, Theorem 3.1 and Proposition 3.2 together prove Theorem 1.1 in the case when
$\alpha $
is recurrent.
4 Non-recurrent case
This section is dedicated to proving Theorem 1.1 in the non-recurrent case. For the entirety of this section, we assume that
$\alpha $
is a non-recurrent, aperiodic sequence,
$\#\mathcal {A}_{\alpha } = \ell \geq 3$
, and
$\liminf _{k \to \infty } p_{\alpha }^*(k) - 3k = -\infty $
.
Let
$\beta $
be an auxiliary word for
$\alpha $
. By Theorem 2.21, since
$p_{\alpha }^*(k) < 3k \leq \ell k$
for some
$k \in \mathbb {N}$
,
$\Gamma $
is disconnected. Let m be the decomposition cycle of the singular decomposition of
$\alpha $
with respect to
$\beta $
,
$m \geq 2$
since the decomposition is non-trivial.
For each eventually periodic
$\alpha ^{(i)}$
, there exists an
$n_i \in \mathbb {N}_0$
such that
$\sigma ^{n_i}(\alpha ^{(i)})$
is periodic. For each aperiodic
$\alpha ^{(i)}$
, there exists an
$n_i \in \mathbb {N}_0$
such that each letter in
$\mathcal {A}_{\sigma ^{n_i}(\alpha ^{(i)})}$
appears in
$\sigma ^{n_i}(\alpha ^{(i)})$
infinitely often. Let
$t = \max \{n_i : 0 \leq i \leq m-1\}$
. Then, for the sequence
${\alpha ' = \sigma ^{mt}(\alpha )}$
, all residues of
$\alpha '$
are aperiodic or periodic, every letter appearing in a residue of
$\alpha '$
appears in that residue infinitely often,
$\alpha = w\alpha '$
for some finite word w, and
$p_{\alpha }^*(k) = p_{\alpha '}^*(k) + C'$
for some constant
$C' \in \mathbb {N}_0$
and k sufficiently large (since for any window, a finite prefix can only contribute a finite number of words). We further note that
$\beta $
is an auxiliary word for
$\alpha '$
and the singular decomposition of
$\alpha '$
is
$\{\alpha ^{\prime (i)} = \sigma ^{t}(\alpha ^{(i)}) : 0\leq i\leq m-1\}$
. The graph
$\Gamma '$
of the residues of
$\alpha '$
is such that
$V(\Gamma ) = V(\Gamma ')$
and
$E(\Gamma ') \subset E(\Gamma )$
, and so
$\Gamma '$
is also disconnected.
Lemma 4.1. Exactly one connected component of
$\Gamma '$
contains aperiodic residues and all other connected components of
$\Gamma '$
contain only singular residues.
Proof. Since
$\alpha '$
is aperiodic, at least one connected component of
$\Gamma '$
contains residues which are aperiodic.
Suppose toward a contradiction that there are two disjoint connected components of
$\Gamma '$
which contain aperiodic residues. Then, there are two residues,
$i \neq j$
for which
$\alpha ^{\prime (i)}$
and
$\alpha ^{\prime (j)}$
are both aperiodic on disjoint alphabets. Let
$k \in \mathbb {N}$
and let
$\tau $
be a maximal k-sized window over
$\alpha ^{\prime (i)}$
. Then, by Theorem 2.8,
$\#F_{\alpha ^{\prime (i)}}(\tau ) \geq 2k$
. Furthermore,
$F_{\alpha ^{\prime (i)}}(\tau ) \cap F_{\alpha ^{\prime (j)}}(\tau ) = \emptyset $
since the alphabets of
$\alpha ^{\prime (i)}$
and
$\alpha ^{\prime (j)}$
are disjoint, and by Fact 2.10,
$\#F_{\alpha ^{\prime (j)}}(\tau ) \geq k+1$
since
$\alpha ^{\prime (j)}$
is aperiodic. Then,
$\#F_{\alpha '}(m\tau ) \geq 2k + k + 1$
, which is a contradiction.
Let
$A \subset \{0, 1, \ldots , m-1\}$
be the set of residues in the aperiodic connected component and
$S = \{0, 1, \ldots , m-1\} \setminus A$
be all other residues.
Let
$i \in S$
. Then,
$\alpha ^{\prime (i)}$
is periodic and so
$\beta ^{(i)}$
is also periodic. Since the letters appearing in
$\beta ^{(i)}$
only appear in residues from S, those letters must appear periodically in
$\beta $
and thus are singular for
$\beta $
. Therefore, since the letters are singular for
$\beta $
and periodic in
$\alpha $
, they must be singular for
$\alpha '$
, and for all
$i \in S$
,
$\alpha ^{\prime (i)}$
is constant.
For the remainder of this section, let A be the set of residues in the aperiodic connected component and S be the collection of all other residues.
Lemma 4.2. If more than two letters appear in a single aperiodic residue,
$\alpha ^{\prime (i)}$
, then one of the letters must appear periodically within
$\alpha ^{\prime (i)}$
.
Proof. Suppose
$\alpha ^{\prime (i)}$
is an aperiodic residue for
$\alpha '$
and
$\#\mathcal {A}_{\alpha ^{(i)}} \geq 3$
. Then,
$p_{\alpha ^{\prime (i)}}^*(k) \leq p_{\alpha '}^*(k) < 3k$
for sufficiently large k.
If
$\alpha ^{\prime (i)}$
is recurrent, then by Theorem 2.20, it must have some singular letter and, by definition, this letter is periodic in
$\alpha ^{\prime (i)}$
.
If
$\alpha ^{\prime (i)}$
is not recurrent, then by Theorem 2.21, the graph
$\Gamma _{i}'$
of
$\alpha ^{\prime (i)}$
is disconnected, and by Lemma 4.1, only one connected component has aperiodic pieces and all other pieces are singular for
$\alpha ^{\prime (i)}$
. Thus, there is at least one letter that appears periodically in
$\alpha ^{\prime (i)}$
.
Lemma 4.3. If
$c \in \mathcal {A}_A := \bigcup _{i \in A} \mathcal {A}_{\alpha ^{\prime (i)}}$
, then there exists some residue j,
$0\leq j \leq m-1$
such that c appears aperiodically in
$\alpha ^{\prime (j)}$
.
Proof. Let
$c \in \mathcal {A}_A$
. If c appears periodically in each
$\alpha ^{\prime (j)}$
,
$0 \leq j \leq m-1$
, then c would appear periodically in each
$\beta ^{(j)}$
. Thus, c would be singular for
$\beta $
and since c is periodic in every residue of
$\alpha '$
, c would be singular for
$\alpha '$
, which is a contradiction.
Theorem 4.4. The alphabet of the aperiodic residues,
$\mathcal {A}_A = \bigcup _{i \in A} \mathcal {A}_{\alpha ^{\prime (i)}}$
, has exactly two letters.
Proof. Clearly,
$\#\mathcal {A}_A \geq 2$
or else no residue from A would be aperiodic. Thus, suppose toward a contradiction that
$\#\mathcal {A}_A> 2$
.
Since
$\beta $
is contained in the orbit closure of
$\alpha '$
,
$p_{\beta }^*(k) \leq p_{\alpha '}^*(k)$
. Thus, since
$\beta $
is recurrent,
$\beta $
either has the form outlined in the conclusion of Theorem 1.1 by §3, or
$\beta $
is periodic.
Let
$a, b, $
and c be distinct letters in
$\mathcal {A}_A$
and let
$k> m$
.
Case I. Suppose
$\beta $
has the form outlined in the conclusion of Theorem 1.1.
Suppose
$\beta ^{(p)}$
,
$0 \leq p \leq m-1$
, is the only aperiodic residue and is over the letters
$\{a, b\}$
, and all other residues of
$\beta $
are either non-constant periodic over
$\{a, b\}$
or constant.
Let
$\tau $
be a maximal k-sized window over
$\alpha ^{\prime (p)}$
. Since
$\beta $
is in the orbit closure of
$\alpha '$
,
$F_{\beta ^{(p)}}(\tau ) \subset F_{\alpha ^{\prime (p)}}(\tau )$
and since
$\beta ^{(p)}$
is aperiodic,
$F_{\alpha ^{\prime (p)}}(\tau )$
contains at least
$2k \,\tau $
-words which only have a and b as letters.
By Lemma 4.3, since
$c \in \mathcal {A}_A$
, there exists some residue i,
$0\leq i \leq m-1$
, such that c appears aperiodically (and infinitely) in
$\alpha ^{\prime (i)}$
. Since
$c \neq a, b$
, then either
$\beta ^{(i)} = c^{\infty }$
or c does not appear in
$\beta ^{(i)}$
because any non-constant residue of
$\beta $
can only be over the letters a and b.
If
$\beta ^{(i)} = c^{\infty }$
, then by the aperiodicity of
$\alpha ^{\prime (i)}$
, there are
$k-2 \,\tau $
-words over
$\alpha ^{\prime (i)}$
of the form
$c^jd_ju_j$
,
$0 < j< k$
,
$d_j \neq c$
, and
$u_j \in \mathcal {A}^{k-j-1}$
. Thus,
$p_{\alpha }^*(k) \geq \#F_{\alpha '}(m\tau ) \geq 3k-2$
, which is a contradiction.
If c does not appear in
$\beta ^{(i)}$
, then there are at least
$k-2 \,\tau $
-words over
$\alpha ^{\prime (i)}$
with the form
$u_jcu_j'$
for
$0 < j < k$
, and some
$u_j \in (\mathcal {A} \setminus \{c\})^{j}$
,
$u_j' \in \mathcal {A}^{k-j-1}$
. Thus,
$p_{\alpha }^*(k) \geq \#F_{\alpha '}(m\tau ) \geq 3k-2$
for all
$k> m$
, which is a contradiction.
Case II. Suppose
$\beta $
is periodic.
Since every letter of
$\mathcal {A}_A$
must appear aperiodically in some residue of
$\alpha '$
and
$\#\mathcal {A}_A\geq 3$
, by Lemma 4.2, there must be at least two residues of
$\alpha '$
which are aperiodic,
$\alpha ^{\prime (p)}$
and
$\alpha ^{\prime (i)}$
, for some
$0\leq p < i \leq m-1$
. Since
$\beta $
is periodic, every letter in
$\mathcal {A}_{\beta }$
is singular for
$\beta $
. Then, without loss of generality, either
$a^{\infty } = \beta ^{(p)} \neq \beta ^{(i)} = b^{\infty }$
or
$a^{\infty } = \beta ^{(p)} = \beta ^{(i)}$
.
Let
$\tau $
be a maximal k-sized window over
$\alpha ^{\prime (p)}$
.
Case IIa. Suppose that
$a^{\infty } = \beta ^{(p)} \neq \beta ^{(i)} = b^{\infty }$
. Then, b must appear in
$\alpha ^{\prime (p)}$
infinitely often (and likewise, a must appear in
$\alpha ^{\prime (i)}$
infinitely often) or otherwise there are at least
$2k \,\tau $
-words in
$\alpha ^{\prime (p)}$
with no b in them and
$k-2 \,\tau $
-words of the form
$b^jdu_j$
for
$0<j<k$
,
$d \neq b$
, and
$u_j \in \mathcal {A}^{k-j-1}$
in
$\alpha ^{(i)}$
. Thus,
$\#F_{\alpha }(m\tau ) \geq 2k + k -2$
, which is a contradiction.
Since c must appear in some residue of
$\alpha '$
aperiodically, then either there exists some q,
$0\leq q \leq m-1$
and
$q \neq p,i$
such that
$\beta ^{(q)} = c^{\infty }$
, or c appears in
$\alpha ^{\prime (p)}$
and
$\alpha ^{\prime (i)}$
infinitely often (since appearing in one and not the other would give at least
$2k$
words with no c in one residue and
$k-2$
words of the form
$u_jcu_j'$
for
$0<j<k$
in the other residue).
If there exists some q,
$0\leq q \leq m-1$
, and
$q \neq p,i$
such that
$\beta ^{(q)} = c^{\infty }$
, then there are
$k-2 \,\tau $
-words of the form
$a^jdu_j$
for
$0<j<k$
,
$d \neq b$
, and
$u_j \in \mathcal {A}^{k-j-1}$
in
$\alpha ^{(p)}$
;
$k-2$
$\tau $
-words of the form
$b^jeu_j$
for
$0<j<k$
,
$e \neq b$
, and
$u_j \in \mathcal {A}^{k-j-1}$
in
$\alpha ^{(i)}$
; and
$k-2 \, \tau $
-words of the form
$c^jfu_j$
for
$0<j<k$
,
$f \neq c$
, and
$u_j \in \mathcal {A}^{k-j-1}$
in
$\alpha ^{(q)}$
. Thus,
$\#F_{\alpha }(m\tau ) \geq 3k - 6$
for all
$k> m$
, which is a contradiction.
Thus, c appears in
$\alpha ^{\prime (p)}$
infinitely often. Since
$\alpha ^{\prime (p)}$
sees
$a, b,$
and c infinitely often, by Lemma 4.2, there is at least one letter in
$\alpha ^{\prime (p)}$
that appears periodically in
$\alpha ^{\prime (p)}$
, which is a contradiction.
Case IIb. Now, assume that
$\beta ^{(p)} = \beta ^{(i)} = a^{\infty }$
. Further, assume that there are no residues q for which
$\alpha ^{\prime (q)}$
is aperiodic and
$\beta ^{(q)} = d^{\infty }$
for
$d \neq a$
or else we can reduce to Case IIa.
Thus, b and c must appear aperiodically in either
$\alpha ^{\prime (p)}$
or
$\alpha ^{\prime (i)}$
by Lemma 4.3.
If, without loss of generality, b only appears in
$\alpha ^{\prime (i)}$
and not in
$\alpha ^{\prime (p)}$
, then there are at least
$2k \,\tau $
-words from
$\alpha ^{\prime (p)}$
that do not contain a b and
$k-2 \,\tau $
-words from
$\alpha ^{\prime (i)}$
of the form
$u_jbu_j'$
for
$0<j<k$
,
$u_j \in (\mathcal {A}\setminus \{b\})^j$
and
$u_j' \in \mathcal {A}^{k-j-1}$
.
Thus, b and c appear in
$\alpha ^{\prime (p)}$
infinitely often. Then, by Lemma 4.2, there is some letter in
$\alpha ^{\prime (p)}$
that appears periodically, which is a contradiction.
Therefore, by Cases I and II,
$\#\mathcal {A}_A = 2$
.
Theorem 4.5. There exists some decomposition of
$\alpha '$
into
$1< m_0 \leq m$
residues, where m is the singular decomposition cycle of
$\alpha '$
, such that exactly one residue of the decomposition is aperiodic over two letters and any letter not in
$\mathcal {A}_A$
is singular for
$\alpha '$
.
Proof. Suppose that
$\alpha ^{\prime (p)}$
,
$0\leq p \leq m-1$
, is aperiodic.
Let
$k> m$
and let
$\tau $
be a maximal
$(k-(m-1))$
-sized window over
$\alpha ^{\prime (p)}$
. Then,
$\#F_{\alpha ^{\prime (p)}}(\tau ) \geq 2k - 2(m-1)$
by Theorem 2.8.
Define a k-sized window
$\tau '$
over
$\alpha '$
by
$\tau ' = m\tau \cup \{h_i : 1 \leq i \leq m-1\}$
for
${h_1> m\tau (k-(m-1) -1)}$
,
$h_{i+1}> h_i$
for all
$1 \leq i \leq m-2$
, and
$h_i \equiv i \,\mod m$
for all
${1 \leq i \leq m-1}$
.
Define
$x \in (\mathcal {A} \cup \{?\})^{\mathbb {N}}$
by
$x_i = \alpha ^{\prime }_i$
if
$i \in S$
and
$x_i = ?$
if
$i \in A$
.
If x has least period m, then
$C_i(\tau ', \alpha ') \cap C_j(\tau ', \alpha ') = \emptyset $
for all
$0 \leq i < j \leq m-1$
. Furthermore,
$\#C_p(\tau ', \alpha ') \geq 2k-2(m-1)$
since
$C_p(\tau ', \alpha ')$
witnesses
$\tau $
over
$\alpha ^{\prime (p)}$
.
If a second residue
$\alpha ^{\prime (i)}$
,
$1 \leq i \leq m-1$
, were also aperiodic, then
$\#F_{\alpha ^{\prime (i)}}(\tau ) \geq k - (m-1) +1$
by Fact 2.10. Thus,
$\#C_i(\tau ', \alpha ') \geq k-(m-1)+1$
, and since
$C_p(\tau ', \alpha ')$
and
$C_i(\tau ', \alpha ')$
are disjoint,
$\#F_{\alpha '}(\tau ') \geq \#F_{\alpha '}(\tau ') \geq 3k-3(m-1) + 1$
for all
$k> m$
, which is a contradiction.
If x has least period
$m_0 < m$
, then
$m_0$
divides m. Suppose that there is a second aperiodic residue,
$\alpha ^{\prime (i)}$
. By Lemma 4.1,
$i \in A$
.
If
$i \not \equiv p \,\mod m_0$
, then
$C_p(\tau ', \alpha ') \cap C_i(\tau ', \alpha ') = \emptyset $
. Thus, since
$\#C_p(\tau ', \alpha ') \geq 2k - 2(m-1)$
and by Fact 2.10,
$\#C_i(\tau ', \alpha ') \geq k - (m-1) + 1$
, then
$\#F_{\alpha '}(\tau ')\geq 3k - 3(m-1) + 1$
, which is a contradiction.
If
$i \equiv p \,\mod m_0$
and for all
$j \not \equiv 0 \,\mod m_0$
,
$\alpha ^{\prime (j)}$
is periodic, and by Theorem 4.4,
$\#\mathcal {A}_A = 2$
, then
$\alpha '$
can be decomposed with respect to
$m_0$
. We denote this decomposition
$\{\alpha ^{\prime [i]} : 0 \leq i \leq m_0-1\}$
. Then, the only aperiodic residue is
$\alpha ^{\prime [p]}$
and all other residues either belong to S and thus are singular for
$\alpha '$
, or are periodic over the alphabet of
$\alpha ^{\prime [p]}$
.
Thus, we have exactly one aperiodic residue over two letters.
Suppose
$c \notin \mathcal {A}_{\alpha ^{\prime (p)}}$
. Then, c only appears in periodic residues. Thus, c appears periodically in
$\alpha '$
and thus, must appear periodically in
$\beta $
. Hence, c is singular for
$\beta $
and must be singular for
$\alpha '$
.
By Theorem 4.5 and the discussion at the beginning of this section, the conclusion for Theorem 1.1 has been proved for when
$\alpha $
is not recurrent, completing the proof of Theorem 1.1.
5 Strong pattern Sturmian sequences
In this section, we prove Theorems 1.2 and 1.3. First, Lemma 5.1 and its immediate corollary introduce the relationship between the maximal pattern complexity of a sequence and the maximal pattern complexity of its aperiodic residue, which is unique by Theorem 1.1.
Lemma 5.1. If
$\liminf _{k\to \infty }p_{\alpha }^*(k) - 3k = -\infty $
and
$\alpha ^{(p)}$
is the aperiodic residue from Theorem 1.1, then
$p_{\alpha }^*(k) \geq p_{\alpha ^{(p)}}^*(k) + \ell - 2$
.
Proof. Let
$k \in \mathbb {N}$
and let
$\tau $
be a maximal k-sized window over
$\alpha ^{(p)}$
. Consider the window
$m\tau $
over
$\alpha $
. Then,
$\#F_{\alpha }(m\tau ) \geq \#F_{\alpha ^{(p)}}(\tau ) + \ell - 2$
since there is at least one word beginning with each letter of the alphabet. Thus,
$p_{\alpha }^*(k) \geq \#F_{\alpha }(m\tau ) \geq p_{\alpha ^{(p)}}^*(k) + \ell -2$
.
An immediate corollary of Lemma 5.1 is as follows.
Corollary 5.2. If
$p_{\alpha }^*(k) = 2k + C$
for some
$C \geq \ell - 2$
and
$\alpha ^{(p)}$
is the only aperiodic residue,
$0 \leq p \leq m-1$
, then
$p_{\alpha ^{(p)}}^*(k) \leq 2k + D$
for some
$0 \leq D \leq C - (\ell - 2)$
and all
$k \in \mathbb {N}$
.
Lemma 5.3 sets the foundation for proving Theorem 1.2 by showing that a non-trivial prefix of a non-recurrent sequence, as discussed in the beginning of §4, cannot occur when the maximal pattern complexity is restricted to
$2k + \ell -2$
.
Lemma 5.3. If
$p_{\alpha }^*(k) = 2k + \ell -2$
for all
$k \in \mathbb {N}$
, then the aperiodic residue from Theorem 1.1 is over two letters, a and b, and for any
$c \in \mathcal {A}$
that is not equal to a or b, c is singular.
Proof. If
$\alpha $
is recurrent, this is clearly true by Theorem 3.1 and Proposition 3.2. Therefore, we will assume that
$\alpha $
is not recurrent. By the discussion at the beginning of §4, there is a word w such that
$\alpha = w\alpha '$
, all residues of
$\alpha '$
are aperiodic or periodic, and since
$p_{\alpha '}^*(k) \leq p_{\alpha }^*(k)$
, by Theorem 4.5,
$\alpha '$
is a sequence with exactly one aperiodic residue over two letters, a and b, and for any
$c \in \mathcal {A}$
,
$c \neq a, b$
, c is singular. Thus, it suffices to show that w is trivial.
Assume, toward a contradiction, that w is non-trivial.
Define
$x \in (\mathcal {A}_{\alpha } \cup \{?\})^{\mathbb {N}_0}$
by
$x_i = ?$
if
$\mathcal {A}_{\alpha ^{\prime (i)}} \subseteq \{a, b\}$
and
$x_i = \alpha ^{\prime }_i$
if
$\mathcal {A}_{\alpha ^{\prime (i)}} \not \subseteq \{a, b\}$
. Without loss of generality, we can assume x has least period m, or else
$\alpha $
could be decomposed with respect to
$m_0 < m$
, and
$\alpha = w'\alpha "$
, where w is still non-trivial and
$\alpha "$
has the same structure as
$\alpha '$
.
Let
$\alpha ^{\prime (p)}$
,
$0 \leq p \leq m-1$
, be the only aperiodic residue of
$\alpha '$
.
Since w is non-trivial, there is some residue i such that either there exists some
${c \in \mathcal {A}_{\alpha ^{(i)}}}$
such that
$c \notin \mathcal {A}_{\alpha ^{\prime (i)}}$
and
$\mathcal {A}_{\alpha ^{(i)}} \neq \{a, b\}$
or
$\mathcal {A}_{\alpha ^{(i)}} = \{a, b\}$
,
$\alpha ^{\prime (i)}$
is periodic (potentially constant), and for any
$t \in \mathbb {N}$
,
$\sigma ^{t}(\alpha ^{(i)}) \neq \alpha ^{(i)}$
.
Consider the first case, where there exists some
$c \in \mathcal {A}_{\alpha ^{(i)}}$
such that
$c \notin \mathcal {A}_{\alpha ^{\prime (i)}}$
and
$\mathcal {A}_{\alpha ^{(i)}} \neq \{a, b\}$
.
Let
$k \geq 2$
and let
$\tau $
be a maximal k-sized window over
$\alpha ^{\prime (p)}$
. Then,
$F_{\alpha '}(m\tau ) = F_{\alpha ^{\prime (p)}}(\tau ) \cup \{d^k : d \in \mathcal {A}_{\alpha } \text { and } d \neq a, b\}$
, and so
$\#F_{\alpha '}(m\tau ) \geq 2k + \ell -2$
. We also note that
$F_{\alpha ^{\prime (p)}}(\tau )$
only contains words over the letters a and b.
However, since there is some residue i for which there exists some
$c \in \mathcal {A}_{\alpha ^{(i)}}$
such that
$c \notin \mathcal {A}_{\alpha ^{\prime (i)}}$
, then either the
$m\tau $
-word
$cd^{k-1}$
for
$c \notin \{a, b\}$
,
$cd^{k-1}$
for
$d \notin \{a, b\}$
, or
$cu$
for
$c \notin \{a, b\}$
and
$u \in F_{\alpha ^{\prime (p)}}(\tau \setminus \{\tau (k-1)\})$
must also appear in
$F_{\alpha }(m\tau )$
, and so
$\#F_{\alpha }(m\tau ) \geq 2k + \ell - 1$
, which is a contradiction.
Now, consider the second case, where
$\mathcal {A}_{\alpha ^{(i)}} = \{a, b\}$
,
$\alpha ^{\prime (i)}$
is periodic (potentially constant), and for any
$t \in \mathbb {N}$
,
$\sigma ^{t}(\alpha ^{(i)}) \neq \alpha ^{(i)}$
. We will further assume that the first case cannot occur, so constant residues in
$\alpha '$
over letters other than a and b must also be constant in
$\alpha $
.
Let q be the least period of
$\alpha ^{\prime (i)}$
. Since m is the least period of x, there exists some j,
$0 \leq j \leq m-1$
, such that
$\mathcal {A}_{\alpha ^{(i + j \,\mod m)}} \cap \mathcal {A}_{\alpha ^{(p + j \,\mod m)}} = \emptyset $
. Without loss of generality, suppose that
$p + j \not \equiv i \,\mod m$
(otherwise, use the distance
$-j \,\mod m$
instead).
Define a q-sized window
$\tau $
by
$\tau = \{0, m, 2m, \ldots , (q-1)m\}$
. Define an immediate extension
$\tau '$
of
$\tau $
by
$\tau ' = \tau \cup \{h\}$
for some
$h> (q-1)m$
and
$h \equiv j \,\mod m$
. Then,
$C_i(\tau ', \alpha ') \cap C_p(\tau ', \alpha ') = \emptyset $
and
$\{\sigma ^t(\alpha ^{\prime (i)})_0\sigma ^t(\alpha ^{\prime (i)})_1\ldots \sigma ^t(\alpha ^{\prime (i)})_{q-1} : 0 \leq \, t \leq q-1\} \cup \{\alpha ^{(i)}_0\alpha ^{(i)}_1\ldots \alpha ^{(i)}_{q-1}\} \subset C_i(\tau , \alpha )$
, so
$\#C_i(\tau , \alpha ) \geq q + 1$
.
If
$i + j \equiv p \,\mod m$
, then
$\#C_i(\tau ', \alpha ) \geq q + 2$
since at least one word from
$C_i(\tau , \alpha )$
must have at least two extensions in
$C_i(\tau ', \alpha )$
or else
$\alpha ^{(p)}$
would be eventually periodic. Furthermore,
$\#C_p(\tau ', \alpha ) \geq q+1$
by Fact 2.10 since
$\alpha ^{(p)}$
is aperiodic. Since there are at least
$\ell - 2$
residues that are constant over letters not equal to a or b, there are at least
$\ell - 2$
more
$\tau '$
-words. Thus,
$\#F_{\alpha }(\tau ') \geq 2q + 2 + \ell - 1$
, which is a contradiction.
If
$i + j \not \equiv p \,\mod m$
, then
$\#C_i(\tau ', \alpha ) \geq q + 1$
,
$\#C_p(\tau ', \alpha ) \geq q+1$
by Fact 2.10. Since
$C_i(\tau , \alpha ) \cap C_p(\tau , \alpha ) = \emptyset $
, either
$\alpha ^{(p-j\,\mod m)} = c^{\infty }$
or
$\alpha ^{(i-j\,\mod m)} = c^{\infty }$
for some
$c \notin \{a, b\}$
. Thus, either
$\#C_{i-j\,\mod m}(\tau ', \alpha ) \geq 2$
or
$\#C_{p-j\,\mod m}(\tau ', \alpha ) \geq 2$
. Then, there are at least
$\ell -3 \,\tau '$
-words over the residues that are constant over letters not equal to a, b, or c. Thus,
$\#F_{\alpha }(\tau ') \geq 2q + 2 + \ell - 1$
, which is a contradiction.
Thus, w must be trivial.
We now prove Theorem 1.2.
Proof of Theorem 1.2
Suppose that
$\alpha $
is an aperiodic sequence such that
$\#\mathcal {A}_{\alpha } = \ell \geq 3$
and
$p_{\alpha }^*(k) = 2k+\ell - 2$
for all
$k \in \mathbb {N}$
.
By Lemma 5.3, exactly one residue is aperiodic over a two-letter alphabet
$\{a, b\}$
. Let
$\alpha ^{(p)}$
,
$0 \leq p \leq m-1$
, be the aperiodic residue. Also by Lemma 5.3, for all
$c \in \mathcal {A}$
such that
$c \notin \mathcal {A}_{\alpha ^{(p)}}$
, c is singular for
$\alpha $
.
By Lemma 5.1, since
$2k+\ell -2 = p_{\alpha }^*(k) \geq p_{\alpha ^{(p)}}^*(k) + \ell - 2$
, then
$p_{\alpha ^{(p)}}^*(k) \leq 2k$
. Since
$\alpha ^{(p)}$
is aperiodic, by Theorem 2.8,
$p_{\alpha ^{(p)}}^*(k) \geq 2k$
, and so the aperiodic residue must have maximal pattern complexity
$2k$
and is a pattern Sturmian over two letters.
Since each letter not witnessed by
$\alpha ^{(p)}$
is singular, any residue on letters other than those in
$\mathcal {A}_{\alpha ^{(p)}}$
are constant. Thus, it suffices to show that there is no residue
$\alpha ^{(j)}$
,
$j \neq p$
, which is non-constant periodic over
$\mathcal {A}_{\alpha ^{(p)}}$
.
Suppose that there is some residue j,
$0 \leq j \leq m-1$
and
$j \neq p$
, such that
$\alpha ^{(j)}$
is non-constant periodic over
$\mathcal {A}_{\alpha ^{(p)}} = \{a, b\}$
. Let
$k> m$
,
$\tau $
be a maximal
$(k - (m-1))$
-sized window over
$\alpha ^{(p)}$
, and define a k-sized window
$\tau '$
over
$\alpha $
by
$\tau ' = m\tau \cup \{h_i : 1 \leq i \leq m-1\}$
for
$h_1> m\tau (k-(m-1) -1)$
,
$h_{i+1}> h_i$
for all
$1 \leq i \leq m-2$
, and
$h_i \equiv i \,\mod m$
for all
$1 \leq i \leq m-1$
. Then,
$\#C_p(\tau ', \alpha ) \geq 2k - 2(m-1)$
. Since
$\alpha ^{(j)}$
is non-constant and
$\alpha ^{(p)}$
is aperiodic, in every
$C_i(\tau ', \alpha )$
, there are at least three distinct words classified by the letters appearing in the parts of the window centered over
$\alpha ^{(p)}$
and
$\alpha ^{(j)}$
: two distinct words with an a or b in the
$\alpha ^{(j)}$
slot, and a third word since if the letter appearing in the
$\alpha ^{(j)}$
slot always forced the letter appearing in the
$\alpha ^{(p)}$
slot, then
$\alpha ^{(p)}$
would be periodic (either constant, equal to
$\alpha ^{(j)}$
, or the bit flip of
$\alpha ^{(j)}$
). Therefore,
$\#F_{\alpha }(\tau ) \geq 2k - 2(m-1) + 3(m-1) = 2k + m - 1$
. Furthermore,
$m \geq \ell $
since
$\alpha ^{(p)}$
and
$\alpha ^{(j)}$
only use the letters a and b, and there are at least
$\ell -2$
other constant residues over the letters not equal to a or b. Therefore,
$\#F_{\alpha }(\tau ) \geq 2k + \ell - 1$
, which is a contradiction.
The next part of this section is dedicated to proving Theorem 1.3.
Proof of Theorem 1.3
Suppose
$\alpha $
is aperiodic,
$\#\mathcal {A}_{\alpha } = \ell \geq 3$
, and there exists some
${m \geq 2}$
such that the decomposition of
$\alpha $
with respect to m has one residue that is a pattern Sturmian over two letters and all other residues are constant.
Let
$\alpha ^{(p)}$
,
$0 \leq p \leq m-1$
, be the aperiodic pattern Sturmian over two letters.
Let
$k \in \mathbb {N}$
and let
$\tau $
be any k-sized window. Define
$\tau _i = \{\tau (j) : j \equiv i \,\mod m\}$
for all
$0 \leq i \leq m-1$
. Then,
$\tau = \tau _0 \sqcup \tau _1 \sqcup \ldots \sqcup \tau _{m-1}$
.
Suppose that
$I = \{i: 0 \leq i \leq m-1 \text { and } \tau _i \neq \emptyset \}$
.
Since any word in
$C_i(\tau , \alpha )$
is completely determined by the letters of
$\alpha ^{(p)}$
which appear in it,
$\#C_i(\tau , \alpha ) \leq 2(\#\tau _{p-i \,\mod m})$
.
Furthermore,
$\#(\bigcup _{i \notin I} C_{p-i \,\mod m}(\tau , \alpha ))\leq m-1-\#I$
since for all
$i \notin I$
,
$C_{p-i \,\mod m}(\tau , \alpha )$
is a set of words that only see constant sequences, so each
$C_{p-i \,\mod m}(\tau , \alpha )$
can contribute at most one word.
Therefore, for all k-sized windows
$\tau $
,
$$ \begin{align*} \#F_{\alpha}(\tau) \leq \sum_{i \in I} 2(\#\tau_i) + \#\bigg(\!\bigcup_{i \notin I} C_i(\tau, \alpha)\bigg) \leq 2k + m-1-\#I \leq 2k + m-2. \end{align*} $$
Thus,
$p_{\alpha }^*(k) \leq 2k + m-2$
. By an application of Theorem 2.9,
${p_{\alpha }^*(k+1) \geq p_{\alpha }^*(k) + 2}$
for all
$k \in \mathbb {N}$
, and so for sufficiently large k,
$p_{\alpha }^*(k) = 2k+C$
for some
$\ell -2 \leq C \leq m-2$
.
6 Examples
This section introduces some relevant examples. Example 6.1 demonstrates the existence of strong pattern Sturmian sequences over any finite alphabet.
Example 6.1. We define a sequence
$\alpha \in \{0, 1, \ldots , \ell -1\}^{\mathbb {N}_0}$
that has maximal pattern complexity
$p_{\alpha }^*(k) = 2k + \ell - 2$
.
Let
$x \in \{0, 1\}^{\mathbb {N}_0}$
be a pattern Sturmian sequence over two letters. Define
$\alpha \in \{0, 1, \ldots , \ell -1\}^{\mathbb {N}_0}$
by
$$ \begin{align*} \alpha_{(\ell - 1)i + j} = \begin{cases} x_i & \text{if } j \equiv 0 \,\mod \ell -1,\\ j + 1 & \text{if } j\not\equiv 0 \,\mod \ell -1. \end{cases} \end{align*} $$
The singular decomposition cycle of
$\alpha $
is
$\ell - 1$
.
Let
$k \in \mathbb {N}$
and let
$\tau $
be any k-sized window. Define
$\tau _i = \{\tau (j) : j \equiv i \,\mod m\}$
for all
$0 \leq i \leq m-1$
. Then,
$\tau = \tau _0 \sqcup \tau _1 \sqcup \cdots \sqcup \tau _{m-1}$
.
Suppose that
$I = \{i : 0 \leq i \leq \ell -2 \text { and } \tau _i \neq \emptyset \}$
.
Since any two residues have disjoint alphabets, for all
$0 \leq i < j \leq \ell - 2$
,
$C_i(\tau , \alpha )\cap C_j(\tau , \alpha ) = \emptyset $
.
Furthermore,
$C_i(\tau , \alpha ) \leq \max \{2(\#\tau _{\ell -1 -i}), 1\}$
and so
$\#F_{\alpha }(\tau ) = \sum _{i = 0}^{\ell -2}\#C_i(\tau , \alpha ) \leq 2k + (\ell - 1 - \#I) \leq 2k + \ell - 2$
since
$\#I \geq 1$
.
Since this is true for any k-sized window
$\tau $
, we have that
$p_{\alpha }^*(k) \leq 2k + \ell -2$
and since by Theorem 2.11,
$p_{\alpha }^*(k) \geq 2k + \ell -2$
, for this sequence,
$p_{\alpha }^*(k) = 2k + \ell -2$
.
Example 6.2 is a sequence that has the structure outlined in Theorem 1.3, but has maximal pattern complexity strictly greater than
$2k+\ell -2$
demonstrating that the converse of Theorem 1.2 is not true.
Example 6.2. We will define a sequence
$\alpha \in \{0, 1, 2, 3\}^{\mathbb {N}_0}$
that is a pattern Sturmian sequence zippered with constant sequences, but has maximal pattern complexity
$p_{\alpha }^*(k) = 2k + C$
for
$C> \ell - 2$
.
Let
$x \in \{0, 1\}^{\mathbb {N}_0}$
be a pattern Sturmian over two letters. Define
$\alpha $
by
$$ \begin{align*} \alpha_{5i + j} = \begin{cases} x_i & \text{if } j \equiv 0 \,\mod 5,\\ 2 & \text{if } j\equiv 1, 4 \,\mod 5,\\ 3 &\text{if } j \equiv 2, 3 \,\mod 5. \end{cases} \end{align*} $$
Let
$\tau $
be a maximal window for x of size
$k - 1$
. Define
$\tau '$
by
$\tau '(i) = m\tau (i)$
for all
$0 \leq i \leq k-2$
and
$\tau '(k-1) = h$
for some
$h \equiv 1 \,\mod 5$
and
$h> m\tau (k-2)$
.
Then,
$\#C_0(\tau ', \alpha ) = 2(k-1)$
since
$\alpha ^{(0)}$
is a pattern Sturmian sequence over two letters and
$\tau $
is a maximal window for
$\alpha ^{(0)}$
. Furthermore,
$\#C_4(\tau , \alpha ) = 2$
since
$C_4(\tau , \alpha ) = \{2^{k-1}0, 2^{k-1}1\}$
, and
$\#C_1(\tau , \alpha ) = \#C_2(\tau , \alpha ) = \#C_3(\tau , \alpha ) = 1$
since
$C_1(\tau , \alpha ) = \{2^{k-1}3\}$
,
$C_2(\tau , \alpha ) = \{3^k\}$
, and
$C_3(\tau , \alpha ) = \{3^{k-1}2\}$
. Clearly,
$\{C_i(\tau ', \alpha ) : 0 \leq i \leq 4\}$
are pairwise disjoint. Thus,
$\#F_{\alpha }(\tau ') = 2k + 3> 2k + (4-2) = 2k + \ell - 2$
for
$\ell = 4$
.
Example 6.3 is a uniformly recurrent sequence with two aperiodic residues that has complexity
$4k$
, showing that the bound in Theorem 1.1 cannot be substantially improved below
$4k$
.
Example 6.3. We create a sequence that has complexity
$4k$
and is two recurrent pattern Sturmian sequences zippered together with a constant sequence.
Let
$x \in \{0, 1\}^{\mathbb {N}_0}$
be the Fibonacci substitution defined by
$0 \to 01$
and
$1 \to 0$
, a Sturmian and therefore pattern Sturmian sequence, which is also uniformly recurrent.
Define
$\alpha \in \{0, 1, 2\}^{\mathbb {N}_0}$
by
$$ \begin{align*} \alpha_{3i + j} = \begin{cases} x_i & \text{if } j \equiv 0, 1 \,\mod 3,\\ 2 &\text{if } j \equiv 2 \,\mod 3. \end{cases} \end{align*} $$
First, we must show that
$\alpha $
is uniformly recurrent. Let
$L> 0$
and let
$R_L(\alpha ) = \{M \in \mathbb {N} : \alpha _0\ldots \alpha _{L-1} = \alpha _{M}\ldots \alpha _{M+L-1}\}$
. Let
$L' = \lceil {L}/{3}\rceil $
and define
$R_L'(x) = \{M \in \mathbb {N} : x_0\ldots x_{L'-1} = x_{M}\ldots x_{M+L'-1}\}$
. Since x is uniformly recurrent,
$R_{L'}(x) = \{M_1, M_2, \ldots \}$
is syndetic and for any
$M_i$
,
$x_0x_1\ldots x_{L'-1} = x_{M_i}x_{M_i + 1}\ldots x_{M_i + L'-1}$
. Thus,
$x_0x_02x_1x_12\ldots x_{L'-1}x_{L'-1}2 = x_{M_i}x_{M_i}2x_{M_i + 1}x_{M_i + 1}2\ldots x_{M_i + L'-1}x_{M_i + L'-1}2$
, and so
$R_L(\alpha ) = \{3M_1, 3M_2, \ldots \}$
is syndetic and thus,
$\alpha $
is a uniformly recurrent sequence.
Now, we show that
$\alpha $
has maximal pattern complexity
$p_{\alpha }^*(k) = 4k$
for all
$k \geq 2$
.
First, we show that for any k-sized window
$\tau $
over
$\alpha $
,
$p_{\alpha }^*(k) \leq 4k$
.
Let
$k \in \mathbb {N}$
and let
$\tau $
be any k-sized window over
$\alpha $
. If for all
$0 \leq i \leq k-1$
,
$\tau (i) \equiv \, 0 \mod 3$
(i.e.
$\tau $
only sees one residue at a time), then
$C_0(\tau , \alpha ) = C_1(\tau , \alpha )$
since
$\alpha ^{(0)} =\, \alpha ^{(1)} = x$
, and
$\#C_0(\tau , \alpha ) \leq 2k$
since x is a pattern Sturmian sequence. Furthermore,
$C_2(\tau , \alpha ) = \{2^k\}$
. Thus,
$\#F_{\alpha }(\tau ) \leq 2k + 1$
.
Suppose now that
$\tau = \tau _1 \sqcup \tau _2 \sqcup \tau _3$
for
$\tau _i = \{\tau (j): \tau (j) \equiv i \,\mod 3\}$
and there exists
$i\neq j$
such that
$\#\tau _i, \#\tau _j> 0$
. Then, for all
$0 \leq i < j \leq 2$
,
$C_i(\tau , \alpha ) \cap C_j(\tau , \alpha ) = \emptyset $
since the position of the
$2$
s uniquely determines to which set
$C_i(\tau , \alpha )$
a
$\tau $
-word belongs.
Then,
$\#C_0(\tau ', \alpha ) \leq 2(\#\tau _0) + 2(\#\tau _1)$
,
$\#C_1 \leq 2(\#\tau _0) + 2(\#\tau _2)$
, and
$\#C_2 \leq 2(\#\tau _1) + 2(\#\tau _2)$
. Therefore,
Thus, for all
$k \in \mathbb {N}$
and any k-sized window
$\tau $
,
$\#F_{\alpha }(\tau ) \leq 4k$
, and so
$p_{\alpha }^*(k) \leq 4k$
for all
$k \in \mathbb {N}$
.
Now, we show that for all
$k \geq 2$
, there exists some window
$\tau '$
such that
$\#F_{\alpha }(\tau ') = 4k$
.
Let
$\tau $
be a maximal k-sized window for x such that the immediate restriction of
$\tau $
is a maximal
$(k-1)$
-sized window. Define
$\tau '$
by
$\tau '(i) = 3\tau (i)$
for all
$0 \leq i \leq k-2$
and
$\tau '(k-1) = 3\tau (k-1) + 1$
.
For all
$0 \leq i < j \leq 2$
,
$C_i(\tau , \alpha ) \cap C_j(\tau , \alpha ) = \emptyset $
since words in
$C_0(\tau ', \alpha )$
are of the form
$w_0$
for
$w_0 \in \{0, 1\}^{k}$
, words in
$C_1$
are of the form
$w_12$
for
$w_1 \in \{0, 1\}^{k-1}$
, and words in
$C_2$
are of the form
$2^{k-1}w_3$
for
$w_3 \in \{0, 1\}$
.
Then,
$\#C_0(\tau ', \alpha ) = 2k$
since
$C_0(\tau ', \alpha ) = F_{x}(\tau )$
,
$\#C_1(\tau ', \alpha ) = 2k-2$
since
$C_1(\tau ', \alpha ) = \{w_12 : w_1 \in F_{x}(\tau \setminus \{\tau (k-1)\})\}$
, and
$\#C_2(\tau ', \alpha ) = 2$
since
$C_2(\tau ', \alpha ) = \{2^{k-1}0, 2^{k-1}1\}$
.
Thus,
$\#F_{\alpha }(\tau ') = 4k$
and so
$p_{\alpha }^*(k) = 4k$
for all
$k \in \mathbb {N}$
.
Acknowledgements
C.S. would like to express her gratitude to her advisor Prof. Ronnie Pavlov for suggesting this question and for the numerous helpful comments, suggestions, and conversations during the formation of this paper.






