1 Main result
Let
$ \Sigma \subset \mathscr {A}^{\mathbb {N}} $
be any one-sided shift space defined over a countable alphabet
$ \mathscr {A} $
and let
$ \sigma : \Sigma \to \Sigma $
be the left shift map. To be concrete, we assume from this point forward that
$ \mathscr {A} $
is identified with the set of non-negative integers
$ \mathbb {Z}_{+} $
. As usual,
$ \mathscr {A} $
is endowed with the discrete topology and
$ \Sigma $
with the product topology.
We introduce the language
$ \mathscr {L} $
of
$ \Sigma $
as the set of allowed words, that is, the union of the following sets:
$$ \begin{align*} \mathscr{L}_{0} & = \{ \epsilon \} \quad \text{where } \epsilon \text{ is the empty word in } \mathscr{A}^{0}, \\ \text{and} \quad \mathscr{L}_{k} & = \{ ( x_0, \ldots, x_{k-1} ) \in \mathscr{A}^k : x \in \Sigma \} \quad \text{for all } k \geq 1. \end{align*} $$
As a notational convention, we use
$ x $
,
$ y $
or
$ z $
for (infinite) sequences and
$ u $
,
$ v $
or
$ w$
for (finite) words from
$ \mathscr {L} $
. In particular, the length of a word
$ w \in \mathscr {L}_{k} $
is indicated as
$ \ell (w) = k $
. We define the predecessor set as
and similarly, we introduce the follower sets as
$$ \begin{align*} \mathscr{F}_1 ( w ) & = \{ b \in \mathscr{L}_{1} : w b \in \mathscr{L} \} \quad \text{and} \\ \mathscr{F}_m ( w ) & = \{ b \in \mathscr{L}_{1} : w v b \in \mathscr{L} \text{ for some } v \in \mathscr{L}_{m-1} \} \quad \text{for any } m \geq 2. \end{align*} $$
For the empty word, we assume that
$ \mathscr {P} (\epsilon ) = \mathscr {F}_1 (\epsilon ) = \mathscr {L}_1 $
.
We call a potential any Borel function
$ A : \Sigma \to \mathbb {R} $
that is bounded from above. We introduce its ergodic maximizing constant as
$$ \begin{align*} \beta ( A ) = \sup \bigg\{\!\int_{\Sigma} A \,d \mu : \mu \text{ is a } \sigma \text{-invariant probability measure} \bigg\}. \end{align*} $$
Obviously,
$ \beta (A) \in (-\infty , \sup A ] $
. In these notes, we study the
$ \sigma $
-invariant probability measures
$ \mu $
that maximize the integral of
$ A $
over
$ \Sigma $
, that is,
$ \int A\, d \mu = \beta (A) $
. Our main objective is to provide sufficient conditions (on the space and on the potential) for the existence of these measures, called maximizing probabilities.
Existence Theorem. Let
$ \Sigma $
be a shift over a countable alphabet that satisfies both:
-
• finite cyclic predecessor assumption –
$ \mathscr {P} (a) \cap \mathscr {F}_m (a) $
is finite for every
$ a \in \mathscr {L}_1 $
and for all
$ m \geq 1 $
; -
• denseness of periodic measures – the set of ergodic probabilities supported on periodic orbits of
$ \Sigma $
is (weak
$^{\ast }$
) dense among the
$ \sigma $
-invariant measures.
Then, every upper semi-continuous potential
$ A $
fulfilling
has a maximizing probability. Furthermore, if
$ \sup A \leq \beta (A) $
, then there exists a common finite subshift that contains the support of any maximizing measure for
$ A $
.
To the best of our knowledge, this is an inaugural result for potentials defined on a non-compact shift that are less regular than summable variation ones, a typical scenario (see [Reference Bissacot and Freire3, Reference Bissacot and Garibaldi4, Reference Jenkinson, Mauldin and Urbański9]). Moreover, the key assumption about the potentials is the requirement that, except occasionally in a finite subset of the alphabet, the supremum of their restrictions to cylinders of size
$ 1 $
is strictly smaller than the respective maximizing ergodic constant. In particular, this embraces the coercive framework and the conditions of oscillations considered in previous works [Reference Bissacot and Freire3, Reference Bissacot and Garibaldi4, Reference Jenkinson, Mauldin and Urbański9]. With respect to the dynamics, the most comprehensive context addressed in the literature consists of topologically transitive countable alphabet Markov shifts. Our existence theorem also applies to more general countable alphabet shifts, taking into account situations beyond the Markovian case (see §7). Note that in the Markov setting, topological transitivity implies the denseness of periodic measures (see, for instance, [Reference Coudene and Schapira5]). An equivalent formulation of the finite cyclic predecessor assumption was previously used in [Reference Iommi and Velozo8] to study the set of invariant measures of topologically transitive countable Markov shifts. Here, this hypothesis has a central role in the proof of the (weak
$^{\ast }$
) compactness of the set of invariant probabilities for a compact extension
$ ( \hat {\Sigma }, \hat {\sigma } ) $
of
$ ( \Sigma , \sigma ) $
, known as the blur shift [Reference Almeida and Sobottka1], whose construction will be described in detail in the next section.
The rest of the paper is organized as follows. After reviewing the main properties and results of blur shifts in §2, we study their respective invariant probabilities in §3. We not only give a convex characterization (Proposition 3.4), but we also show that the finite cyclic predecessor assumption and the denseness of periodic measures are sufficient conditions for compactness of the set of blur invariant measures (Corollary 3.10). In §4, we describe a minimal upper semi-continuous extension of the potential on the blur shift space. At this point, we can clearly present in §5 the ergodic optimization problem for the blur shift and relate it to the initial one for the original countable alphabet shift. Section 6 is devoted to the proof of the existence theorem. In §7, we exhibit a countable alphabet non-Markovian shift that satisfies all the hypotheses required by our main result, demonstrating the applicability of our framework beyond the Markovian setting.
2 Blur shift space
The blur shift space was recently introduced in [Reference Almeida and Sobottka1] as a compactification scheme for classical shift spaces. It is a unified presentation and generalization of the Ott–Tomforde–Willis shift space [Reference Ott, Tomforde and Willis11] and the Gonçalves–Royer ultragraph shift space [Reference Gonçalves and Royer7], which were successfully used to study and extend results on the equivalence between the conjugacy of a subclass of Markov shift spaces and the isomorphism of the
$C^\ast $
-algebras associated with their graphs.
To the best of our knowledge, this note is a pioneer application of the blur shift compactification method to ergodic optimization theory.
Proofs of all the results just mentioned in this section may be found in [Reference Almeida and Sobottka1, Reference Ott, Tomforde and Willis11]. The reader interested in concrete examples of shift compactification may also consult these references.
Let
$ \Sigma $
be a shift space over a countable alphabet. A set
$ \mathscr {V} = \{ B_1, \ldots , B_s \} \subset 2^{ \mathscr {A} } $
is said to be a finite resolution of blurred sets if it verifies:
$ B_r $
is infinite for each
$ 1 \leq r \leq s $
;
$ B_i \cap B_j $
is finite for all
$ 1 \leq i \neq j \leq s $
; and
$ \mathscr {L}_{1} \setminus \bigcup _{r=1}^{s} B_r $
is finite. We define the blur shift space
$ \hat {\Sigma } \subset ( \mathscr {A} \sqcup \mathscr {V} )^{ \mathbb {N} } $
with resolution
$ \mathscr {V} $
associated with
$ \Sigma $
as
where
$\partial _{ B_r } \Sigma = \{ ( w, B_r, B_r, \ldots ) : w \in \mathscr {L} \text { and } \mathscr {F}_1 (w) \cap B_r \text { is infinite} \}$
for
$1 \leq r \leq s$
.
Note that each
$ \partial _{ B_r } \Sigma $
is non-empty, since the empty word
$ \epsilon {\kern-1pt}\in{\kern-1pt} \mathscr {L} $
fulfills
${ \mathscr {F}_1 ( \epsilon ) {\kern-1pt}\cap{\kern-1pt} B_r {\kern-1pt}={\kern-1pt} B_r }$
. In addition, we identify
$ ( \epsilon , B_r, B_r, \ldots ) $
with
$ ( B_r, B_r, \ldots ) $
.
Based on the original shift language, we consider the following decomposition of the set
$ \hat {\Sigma } \setminus \Sigma $
:
$$ \begin{align*} \hat{\mathcal{L}}_{0} & = \{ ( B_r, B_r, \ldots ) \in \hat{\Sigma} : 1 \leq r \leq s \} \\ \text{and} \quad \hat{\mathcal{L}}_{k} & = \{ ( w, B_r, B_r, \ldots ) \in \hat{\Sigma} : w \in \mathscr{L}_k \text{ and } 1 \leq r \leq s \} \quad\text{for } k \geq 1. \end{align*} $$
For notational convenience, we denote
$ \partial \Sigma = \hat {\Sigma } \setminus \Sigma = \bigsqcup _{r = 1}^{s} \partial _{ B_r } \Sigma = \bigsqcup _{k \geq 0} \hat {\mathcal {L}}_{k} $
.
Let us recall how we can interpret
$ \hat {\Sigma } $
as a compact space in which
$ \Sigma $
can naturally be seen as its topological subspace. First, consider on
$ \mathscr {A} \sqcup \mathscr {V} $
the topology generated by the family formed of singletons of
$ \mathscr {A} $
and of sets of the form
$ B_r \setminus S $
, where
$ S $
is some finite subset of
$ B_r $
and
$ 1 \leq r \leq s $
. Thus, the set
$ ( \mathscr {A} \sqcup \mathscr {V} )^{ \mathbb {N} } $
is equipped with the corresponding product topology. We say that
$ x $
,
$ y \in ( \mathscr {A} \sqcup \mathscr {V} )^{ \mathbb {N} } $
are equivalent and we write
$ x \sim y $
if:
-
• either
$ x $
,
$ y \in \mathscr {A}^{ \mathbb {N} } $
and
$ x = y $
; -
• or
$ x $
,
$ y {\kern-1pt}\in{\kern-1pt} ( \mathscr {A} \sqcup \mathscr {V} )^{ \mathbb {N} } \setminus \mathscr {A}^{ \mathbb {N} } $
,
$ x {\kern-1pt}={\kern-1pt} ( u , B_i, \ldots ) $
,
$ y {\kern-1pt}={\kern-1pt} ( v , B_j, \ldots ) $
with
$ {u {\kern-1pt}={\kern-1pt} v {\kern-1pt}\in{\kern-1pt} \bigsqcup _{ k \geq 0} \mathscr {A}^k }$
, and
$ B_i = B_j \in \mathscr {V} $
.
From this equivalence relation, we can provide the quotient topology on
$ ( \mathscr {A} \sqcup \mathscr {V} )^{ \mathbb {N} } / { \sim } $
. Hence, since
$ \hat {\Sigma } $
can be seen as a subset of
$ (\mathscr {A} \sqcup \mathscr {V})^{ \mathbb {N} } / { \sim } $
, the blur shift space
$ \hat {\Sigma } $
inherits this quotient topology as a subspace. It can be shown that
$ \hat {\Sigma } $
is therefore a compact Hausdorff metrizable space (see [Reference Almeida and Sobottka1, Theorem 3.24 and Corollary 3.18]). The topological structure on
$ \hat {\Sigma } $
has a countable basis of generalized cylinder sets, namely:
$$ \begin{align*} Z [ w ] & = \{ x \in \hat{\Sigma} : x_i = w_i \text{ for all } 0 \leq i < \ell(w) \} \quad\text{and} \\ Z [ w B_r; S ] & = \{ x \in \hat{\Sigma} : x_i = w_i \text{ for all } 0 \leq i < \ell(w), \ x_{ \ell (w) } = B_r \text{ or } x_{ \ell (w) } \in B_r \setminus S \}, \end{align*} $$
where
$ w \in \mathscr {L} $
is any finite word,
$ B_r \in \mathscr {V} $
is some blurred set, and
$ S \subset B_r $
is a finite set. In particular, the usual cylinder set
$ [ w ] = \{ x \in \Sigma : x_i = w_i \text { for all } 0 \leq i < \ell (w) \} \subset \Sigma $
coincides with the set
$ Z [ w ] \cap \Sigma $
. It follows from this fact that the topology of
$ \Sigma $
is the subspace topology from
$ \hat {\Sigma } $
on
$ \Sigma $
. Note also that
$ Z [ \epsilon ] = \hat {\Sigma } $
and
$ Z [ B_r; S ] = \{ x \in \hat {\Sigma } : x_0 = B_r \text { or } x_0 \in B_r \setminus S \} $
. The generalized cylinders sets are simultaneously closed and open (see [Reference Almeida and Sobottka1, Proposition 3.1]). The neighborhood basis for a point is
$$ \begin{align*} \{ Z [ x_0, \ldots, x_{n} ] : n \geq 0 \} \quad & \text{if } ( x_0, x_1, \ldots ) \in \Sigma, \\ \{ Z [ w B_r; S ] : S \text{ finite subset of } B_r \} \quad & \text{if } ( w, B_r, B_r, \ldots ) \in \partial \Sigma \text{ with } \ell(w) \geq 1, \\ \text{or} \quad \{ Z [ B_r; S ] : S \text{ finite subset of } B_r \} \quad & \text{if } ( B_r, B_r, \ldots ) \in \hat{\mathcal{L}}_{0} \end{align*} $$
(see [Reference Almeida and Sobottka1, Proposition 3.2]). The original shift space
$ \Sigma $
is a dense subset in
$ \hat {\Sigma } $
. Indeed, given
$ ( w, B_r, B_r, \ldots ) \in \partial \Sigma $
and some neighborhood
$ Z [ w B_r, S ] $
, since
$(\mathscr {F}_1 (w) \cap B_r)\setminus {S}$
is infinite, let
$ a \in B_r \setminus S $
such that
$ w a \in \mathscr {L} $
. In particular, there exists
$ y \in \Sigma $
fulfilling
$ ( y_0, \ldots , y_{\ell (w)} ) = w a $
and, hence,
$ y \in Z [ w B_r, S ] $
.
The convergence on
$ \hat {\Sigma } $
can be described as follows.
Lemma 2.1. [Reference Ott, Tomforde and Willis11, Corollary 2.17]
-
(i) The sequence
$ \{ x^n \} \subset \hat {\Sigma } $
converges to
$ x \in \Sigma $
if and only if for every positive integer
$ M $
, there exists an integer
$ N> 0 $
such that
$ n> N $
implies
$x^{n}_{i} = x_i $
for all
$ 1 \leq i \leq M $
. -
(ii) The sequence
$ \{ x^n \} \subset \hat {\Sigma } $
converges to
$ ( w, B_r, B_r, \ldots ) \in \partial \Sigma $
if and only if for every finite subset
$ S \subset B_r $
, there exists an integer
$ N> 0 $
such that
$ n> N $
implies
$ x^{n}_{i} = w_i $
for all
$ 0 \leq i < \ell ( w ) $
and
$ x^n_{ \ell (w) } = B_r $
or
$ x^n_{ \ell (w) } \in B_r \setminus S $
.
The blur shift space
$ \hat {\Sigma } $
obviously admits as measurable structure the Borel sigma-algebra generated by the generalized cylinder sets, which contains the Borel sigma-algebra of
$ \Sigma $
. Let
$ \mathcal {M} ( \Sigma ) $
,
$ \mathcal {M} (\hat {\Sigma }) $
denote respectively the sets of Borel probability measures on
$ \Sigma $
and on
$ \hat {\Sigma } $
. Both sets are convex and each one is considered equipped with the respective weak
$^{\ast }$
topology. The following result allows us to commit the abuse of notation
$ \mathcal {M} ( \Sigma ) \subset \mathcal {M} (\hat {\Sigma }) $
.
Lemma 2.2.
$ \mathcal {M} ( \Sigma ) $
is in a continuous bijective correspondence with the subset of
$ \mathcal {M} (\hat {\Sigma }) $
formed by those probabilities that gives mass
$ 1 $
to
$ \Sigma $
.
Proof. For
$ \mu \in \mathcal {M} (\Sigma ) $
, note that
$ \dot {\mu } ( \,\cdot \, ) = \mu ( \,\cdot \, \cap \Sigma ) $
defines a Borel probability on
$ \hat {\Sigma } $
. This remark allows us to consider the map
$ \mu \in \mathcal {M} (\Sigma ) \, \mapsto \Phi ( \mu ) = \dot {\mu } \in \mathcal {M} (\hat {\Sigma }) $
, which is injective due to standard arguments on extensions of measures (see, for instance, [Reference Billingsley2, Theorem 3.3]). Clearly, any
$ \hat {\mu } \in \mathcal {M} (\hat {\Sigma }) $
with
$ \hat {\mu } ( \Sigma ) = 1 $
may be seen as the image of its restriction on the Borel measurable sets of
$ \Sigma $
. By definition and linearity, we have that
$ \int _{ \Sigma } \hat {\varphi }|_{ \Sigma } \,d \mu = \int _{ \hat {\Sigma } } \hat {\varphi } \,d \dot {\mu } $
for every simple function
$ \hat {\varphi } $
on
$ \hat {\Sigma } $
, and thus by the monotone convergence theorem, for every continuous function
$ \hat {\varphi } $
on
$ \hat {\Sigma } $
. This fact shows that if
$ \hat {V} $
is a basis set of the weak
$^{\ast }$
topology on
$ \mathcal {M} (\hat {\Sigma }) $
, then
$ \Phi ^{-1} (\hat {V}) $
is a basis set of weak
$^{\ast }$
topology on
$ \mathcal {M} ( \Sigma ) $
. In other words,
$ \Phi $
is continuous.
As
$ \hat {\Sigma } $
is compact metrizable space, it is well known that
$ \mathcal {M} (\hat {\Sigma }) $
is a weak
$^{\ast }$
compact metrizable space (see, for instance, [Reference Walters13, Proposition 6.5]). As a subset of
$ \mathcal {M} (\hat {\Sigma }) $
,
$ \mathcal {M} (\Sigma ) $
is clearly not closed: for example,
$ \delta _{ ( k, 0, 0, \ldots ) } \, \stackrel {\ast }{\rightharpoonup } \delta _{ ( B_r, B_r, \ldots ) } $
as
$ k \in B_r $
tends to
$ \infty $
.
We now introduce the (blur) shift map
$ \hat {\sigma } : \hat {\Sigma } \longrightarrow \hat {\Sigma } $
as the usual left shift map. It is obvious that
$ \hat {\sigma } $
coincides with the shift map
$ \sigma $
on
$ \Sigma $
. Note that, in
$ \partial _{ B_r } \Sigma $
, there is a single fixed point
$ ( B_r, B_r, \ldots ) = \hat {\sigma } ( B_r, B_r, \ldots ) $
that absorbs all points of
$ \partial _{ B_r } \Sigma $
, that is,
$ \hat {\sigma }^{ \ell ( w ) } ( w, B_r, B_r, \ldots ) = ( B_r, B_r, \ldots ) $
. Consequently,
$ \hat {\mathcal {L}}_{0} $
absorbs
$ \partial \Sigma $
.
Note that the shift map
$ \hat {\sigma } $
is measurable, since
$$ \begin{align*} \hat{\sigma}^{-1} ( Z [ \epsilon ] ) & = \hat{\Sigma}, \\\hat{\sigma}^{-1} ( Z [ w ] ) & = \bigsqcup_{ i \in \mathscr{P} (w) } Z [ i w ], \\\hat{\sigma}^{-1} ( Z [ B_r; S ] ) & = \{ ( B_r, B_r, \ldots ) \} \sqcup \bigsqcup_{ i \in \mathscr{L}_1 } Z [ i B_r; S ], \text{ and}\\\hat{\sigma}^{-1} ( Z [ w B_r; S ] ) & = \bigsqcup_{ i \in \mathscr{P} (w) } Z [ i w B_r; S ], \end{align*} $$
where
$ w $
is any finite non-empty word and
$ 1 \leq r \leq s $
. The shift map
$ \hat {\sigma } $
is continuous only on
$ \hat {\Sigma } \setminus \hat {\mathcal {L}}_{0} $
(see [Reference Almeida and Sobottka1, Proposition 4.1]). In fact, the discontinuity of
$ \hat {\sigma } $
on
$ \hat {\mathcal {L}}_{0} $
can be observed from the convergences of the following sequences, as
$ k \in B_r $
tends to
$ \infty $
,
$$ \begin{align*} ( k, 0, 0, \ldots )& \to ( B_r, B_r, \ldots )\quad \text{and} \\ \hat{\sigma} ( k, 0, 0, \ldots ) & \to ( 0, 0, \ldots ) \neq \hat{\sigma} ( B_r, B_r, \ldots ). \end{align*} $$
3 Invariant probabilities
Let us introduce the following sets of invariant measures:
$$ \begin{align*} \mathcal{M} ( \Sigma, \sigma ) & = \bigg\{ \mu \in \mathcal{M} ( \Sigma ) : \begin{array}{c} \mu \text{ is } \sigma \text{-invariant probability, that is,} \\ \sigma_{\ast} \mu ( \cdot ) = \mu ( \sigma^{-1} ( \cdot ) ) = \mu ( \cdot ) \end{array}\!\! \bigg\}, \\ \text{and} \quad \mathcal{M} ( \hat{\Sigma}, \hat{\sigma} ) & = \bigg\{ \hat{\mu} \in \mathcal{M} ( \hat{\Sigma} ) : \begin{array}{c} \hat{\mu} \text{ is } \hat{\sigma} \text{-invariant probability, that is,} \\ \hat{\sigma}_{\ast} \hat{\mu} ( \cdot ) = \hat{\mu} ( \hat{\sigma}^{-1} ( \cdot ) ) = \hat{\mu} ( \cdot ) \end{array}\!\! \bigg\}. \end{align*} $$
Whenever we write
$ \overline { \mathcal {M} ( \Sigma , \sigma ) } $
or
$ \overline { \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) } $
, it will be understood that we consider the closure with respect to the weak
$^{\ast }$
topology of
$ \mathcal {M} ( \hat {\Sigma } ) $
.
In this section, we will study how these subsets are related, as well as their main properties. It is clear that
$ \mathcal {M} ( \Sigma , \sigma ) \subset \mathcal {M} (\hat {\Sigma }, \hat {\sigma }) $
and both sets are convex. We cannot expect that either
$ \mathcal {M} ( \Sigma , \sigma ) $
or
$ \mathcal {M} (\hat {\Sigma }, \hat {\sigma }) $
will be closed in general, as illustrated below.
Example 3.1. From item (ii) of Lemma 2.1, we observe that the sequence of invariant probabilities
as
$ k \in B_r $
tends to
$ \infty $
, converges to
However, this probability is not invariant:
It is important to highlight that under suitable hypotheses on the original shift, precisely the finite cyclic predecessor assumption and the denseness of periodic measures, we can guarantee the compactness of the set of blur invariant measures (see Corollary 3.10). A first step will be a convex decomposition of blur invariant probabilities (Proposition 3.4), which will hold for any shift space over a countable alphabet.
3.1 Convexity description
Our goal is to exhibit a convex characterization for
$ \mathcal {M} (\hat {\Sigma }, \hat {\sigma }) $
. To do that, we need to better understand the behavior of blur shift probabilities on
$ \partial \Sigma $
.
Lemma 3.2. If
$ \hat {\mu } \in \mathcal {M} (\hat {\Sigma }, \hat {\sigma }) $
, then
$ \hat {\mu } (\bigsqcup _{k \geq 1} \hat {\mathcal {L}}_{k}) = 0 $
. In particular,
Proof. Remember that
$ \hat {\mathcal {L}}_{0} $
absorbs
$ \partial \Sigma $
. Since
$ \hat {\mu } $
is
$ \hat {\sigma } $
-invariant measure, we have
$ \hat {\mu } ( \hat {\mathcal {L}}_{0} ) = \hat {\mu } ( \hat {\sigma }^{-k} ( \hat {\mathcal {L}}_{0} ) ) = \hat {\mu } ( \hat {\mathcal {L}}_{0} \sqcup \cdots \sqcup \hat {\mathcal {L}}_{k} ) $
for all
$ k \geq 1 $
. Hence,
$$ \begin{align*} \hat{\mu} \bigg( \bigsqcup_{k \geq 1} \hat{\mathcal{L}}_{k} \bigg) = \lim_{ k \to \infty } \hat{\mu} ( \hat{\mathcal{L}}_{0} \sqcup \cdots \sqcup \hat{\mathcal{L}}_{k} ) - \hat{\mu} ( \hat{\mathcal{L}}_{0} ) = 0.\\[-47pt] \end{align*} $$
Lemma 3.3. If
$ \hat {\nu } \in \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
and
$ \hat {\nu } ( \partial \Sigma ) = 1 $
, then
$ \hat {\nu } \in \mathrm {Conv} ( \{ \delta _{ ( B_r, B_r, \ldots ) } : 1 \leq r \leq s \} ) $
.
Proof. It is immediate from Lemma 3.2 that
$ \hat {\nu } ( \hat {\mathcal {L}}_{0} ) = \hat {\nu } ( \partial \Sigma ) = 1 $
. Hence,
$$ \begin{align*} \hat{\nu} ( \cdot ) = \hat{\nu} ( \cdot \cap \hat{\mathcal{L}}_{0} ) = \sum_{r=1}^{s} \hat{\nu} (\cdot \cap \{ ( B_r, B_r, \ldots ) \} ) = \sum_{r=1}^{s} \alpha_r \delta_{ ( B_r, B_r, \ldots ) } (\cdot), \end{align*} $$
where
$ \alpha _r : = \hat {\nu } ( \{ ( B_r, B_r, \ldots ) \} ) $
fulfills
$ \sum _{r=1}^{s} \alpha _r = \hat {\nu } ( \hat {\mathcal {L}}_{0} ) = 1 $
.
We obtain the following convex characterization for the
$ \hat {\sigma } $
-invariant probabilities (see Figure 1).

Figure 1 Graphical representations of convex decompositions of the set of
$ \hat {\sigma } $
-invariant measures for blur shifts with one-symbol and three-symbol resolutions.
Proposition 3.4.
$ \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) = \mathrm {Conv} ( \mathcal {M} ( \Sigma , \sigma ) \sqcup \{ \delta _{ ( B_r, B_r, \ldots ) } : 1 \leq r \leq s \} ) $
.
Proof. By the convexity of the set of blur invariant probabilities, it follows that
$ \mathrm {Conv} (\mathcal {M} (\Sigma , \sigma ) \cup \{\delta _{(B_r, B_r, \ldots )} : 1 \leq r \leq s \} ) \subset \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
. However, assume that
$ \hat {\mu } \in \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
and remember that
$ \hat {\Sigma } = \Sigma \sqcup \partial \Sigma $
. We will consider three cases.
Case 1:
$ \hat {\mu } ( \Sigma ) = 0 $
. It is immediate that
$ \hat {\mu } ( \partial \Sigma ) = 1 $
. Thus, Lemma 3.3 guarantees that
$ \hat {\mu } \in \mathrm {Conv} ( \{ \delta _{ ( B_r, B_r, \ldots ) } : 1 \leq r \leq s \} ) $
.
Case 2:
$ \hat {\mu } (\Sigma ) \in (0, 1) $
. Consider the decomposition
with constant
$ t = \hat {\mu } (\Sigma ) $
, consequently
$ 1 - t = \hat {\mu } ( \partial \Sigma ) $
, and
$ \hat {\sigma } $
-invariant probabilities
In particular,
$ \dot {\mu } ( \Sigma ) = 1 $
and
$ \hat {\nu } ( \partial \Sigma ) = 1 $
. By Lemma 2.2, we can identify
$ \dot {\mu } $
as a probability measure in
$ \mathcal {M} ( \Sigma , \sigma ) $
. Moreover,
$ \nu = \sum _{r=1}^{s} \alpha _r \delta _{ ( B_r, B_r, \ldots ) } $
with
$ \sum _{r=1}^{s} \alpha _r = 1 $
, due to Lemma 3.3. We conclude that
$$ \begin{align*} \hat{\mu} = t \dot{\mu} + \sum_{r=1}^{n} ( 1 - t ) \alpha_r \delta_{ ( B_r, B_r, \ldots ) } \quad\text{with }t + \sum_{r=1}^{n} ( 1 - t ) \alpha_r = 1. \end{align*} $$
Case 3:
$ \hat {\mu } (\Sigma ) = 1 $
. It is straightforward from Lemma 2.2.
Remark 3.5. From the previous result, for any
$ \hat {\mu } \in \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
, we have
$ \hat {\mu } ( \cdot \cap \Sigma ) = t \dot {\mu } ( \cdot ) $
with
$ t = \hat {\mu } ( \Sigma ) \in [0,1] $
and
$ \dot {\mu } \in \mathcal {M} ( \Sigma , \sigma ) $
. This fact induces a projection from the set of
$ \hat {\sigma } $
-invariant probabilities on
$ \hat {\Sigma } $
to the set of
$ \sigma $
-invariant subprobabilities on
$ \Sigma $
, which is a central notion used in [Reference Iommi and Velozo8] to study invariant measures for countable Markov shifts. The phenomenon of escape of mass can be translated as a transference of mass from
$ \Sigma $
to points of
$ \partial \Sigma $
.
3.2 Compactness
In this subsection, we show the compactness of the set of blur invariant probabilities, supposing the finite cyclic predecessor assumption and the denseness of periodic measures. Initially, we argue that these hypotheses on the original shift impose that any measure belonging to the closure of
$ \sigma $
-invariant probabilities fulfills the necessary condition to be a blur invariant probability described in Lemma 3.2, that is, the set
$ \partial \Sigma \setminus \hat {\mathcal {L}}_0 $
has null measure.
Lemma 3.6. Let
$ \Sigma $
be a shift over a countable alphabet verifying the finite cyclic predecessor assumption and the denseness of periodic measures. Then,
Proof. By Lemma 2.2 and by the denseness of periodic measures, the set of ergodic probabilities supported on periodic orbits of
$ \Sigma $
is also (weak
$^{\ast }$
) dense on
$ \overline { \mathcal {M} ( \Sigma , \sigma ) } $
as a subset of
$ \mathcal {M} ( \hat {\Sigma } ) $
. Therefore, given
$ \bar {\mu } \in \overline { \mathcal {M} ( \Sigma , \sigma ) } $
, let
denote a sequence of periodic points such that the respective periodic probabilities
$ \dot {\mu }_l = ({1}/{p_{l}}) \sum _{ k = 0 }^{ p_{l} - 1 } \delta _{ \sigma ( y^{l} ) } $
converge to
$ \bar {\mu } $
.
Note that the countable set
$ \hat {\mathcal {L}}_1 = \bigsqcup _{ 1 \leq r \leq s } \bigsqcup _{ j \in B_r } \bigsqcup _{ i \in \mathscr {P} (j) } \{ ( i, B_r, B_r, \ldots ) \} $
has null measure if and only if
$ \bar {\mu } ( \{ ( a, B_r, B_r, \ldots ) \} ) = 0 $
for all
$ a \in \bigsqcup _{ j \in B_r } \mathscr {P} (j) $
and for any
${ 1 \leq r \leq s }$
. We fix once for all
$ ( a, B_r, B_r, \ldots ) $
and we consider the following three possibilities.
Suppose that
$ \{ \max _{ 0 \leq j \leq p_l} y_{j}^{l} \}_{ l \geq 1 } $
is bounded. Take into account the finite subset of
$ B_r $
given as
and consider the clopen neighborhood
$ Z [ a B_r; S ] $
of
$ ( a, B_r, B_r, \ldots ) $
. Since
${ y_{j}^{l} \notin B_r \setminus S }$
, for every
$ 0 \leq j \leq p_l $
, it follows that
$ \mathrm {supp} \dot {\mu }_l = \mathrm {orb} ( y^l ) \subset Z [ a B_r; S ]^{\complement } $
. In particular,
$ \dot {\mu }_{l} ( Z [ a B_r; S ] ) = 0 $
for all
$ l \geq 1 $
. Due to the portmanteau theorem, we have that
Suppose that
$ \{ \max _{ 0 \leq j \leq p_l} y_{j}^{l} \}_{ l \geq 1 } $
is unbounded and
$ \{ p_{l} \}_{ l \geq 1 } $
is bounded. By taking a subsequence, we may assume that
$ p_{l'} = p $
is constant. Without loss of generality, we consider
$ \{ y_{0}^{l'} \} $
unbounded. Note that
$$ \begin{align*} \{ y_{0}^{l'} \} \subset \bigcup_{ l' \geq 1 } \mathscr{P} ( y_{1}^{l'} ) \cap \mathscr{F}_{ p - 1 } ( y_{1}^{l'} ). \end{align*} $$
By the finite cyclic predecessor assumption, we have a union of finite sets that contains an infinite set. This is only possible if
$ \{ y_{1}^{l'} \} $
is also an unbounded set. By a recursive argument, we obtain that
$ \{ y_{j}^{l'} \} $
is unbounded for any
$ 0 \leq j \leq p-1 $
. In particular, there exists an infinite subset of indexes
$ l" $
such that
$ y_{j}^{l"}> a $
for all
$ 0 \leq j \leq p - 1 $
. We thus obtain
$ \mathrm {supp} \dot {\mu }_{l"} = \mathrm {orb} ( y^{l"} ) \subset Z [ a B_r ]^{\complement } $
, so that
Suppose that
$ \{ \max _{ 0 \leq j \leq p_l} y_{j}^{l} \}_{ l \geq 1 } $
and
$ \{ p_{l} \}_{ l \geq 1 } $
are unbounded. Given a positive integer
$ m $
, consider the following finite sets:
$$ \begin{align*} R_{0} & = \{a\}, & R_{1} & = \bigcup_{ k = 1 }^{m} \mathscr{P} (a) \cap \mathscr{F}_{k} (a), \\ R_{2} &= \bigcup_{ b \in R_{1} } \bigcup_{ k = 1 }^{m} \mathscr{P} (b) \cap \mathscr{F}_{k} (b), & \ldots, \qquad R_{m} &= \bigcup_{ c \in R_{m-1} } \bigcup_{ k = 1 }^{m} \mathscr{P} (c) \cap \mathscr{F}_{k} (c). \end{align*} $$
Denote
$ S_m = B_r \cap ( R_{0} \cup R_{1} \cup R_{2} \cup \cdots \cup R_{m} ) $
. Due to the portmanteau theorem, we have that
To conclude, we will show that
$ \dot {\mu }_{l"} ( Z [ a B_r; S_m ] ) < {1}/({ m + 1 }) $
for infinitely many indexes
$ l" $
. Without loss of generality, let
$ \{ y_{0}^{l} \} $
be unbounded. We may consider a subsequence such that
$ p_{l'}> m + 1 $
for all
$ l' $
. From the finite cyclic predecessor assumption and a recursive argument as above, we obtain that
$ \{ y_{j}^{l'} \} $
is also unbounded for any
$ 0 \leq j \leq m $
. Passing to another subsequence, we may assume that
$ y_{j}^{l"}> a $
for all
$ l" $
and
$ 0 \leq j \leq m $
. Note that:
-
•
$ \sigma ^i ( y^{l"} ) \in Z [ a B_r; S_m ] $
implies
$ m < i < p_{l} $
, because
$ y_{j}^{l"}> a $
for all
$ 0 \leq j \leq m $
; -
•
$ \sigma ^i ( y^{l"} ) $
,
$ \sigma ^j ( y^{l"} ) \in Z [ a B_r; S_m ] $
, with
$ i < j $
, implies
$ j - i> m + 1 $
, because, otherwise, if
$ 1 < j - i = k \leq m + 1 $
, as
$ y_{i}^{l"} = y_{j}^{l"} = a $
and
$ y_{i+1}^{l"} \notin R_{k-1} $
, then
$ y_{j-1}^{l"} \in \mathscr {P} (a) $
and
$ y_{j-1}^{l"} \notin R_{1} $
, which means that
$ y_{j-1}^{l"} \notin \mathscr {F}_{k-1} (a) $
, which is a contradiction.
Hence, from a counting argument, it follows that
$$ \begin{align*} \!\!\!\!\dot{\mu}_{l"} ( Z [ a B_r; S_n ] ) < \!\dfrac{1}{ p_{l"} } \bigg\lceil \dfrac{ p_{l"} - m - 1 }{ m + 1 } \bigg\rceil \leq \dfrac{1}{ p_{l"} } \bigg(\!\dfrac{ p_{l"} - m - 1 }{ m + 1 } + 1 \!\bigg) = \dfrac{ 1 }{ m + 1 }\quad \text{for all } l". \end{align*} $$
Remark 3.7. For countable Markov shifts, we point out the condition
$ \bar {\mu } ( \hat {\mathcal {L}}_1 ) = 0 $
for all
$ \bar {\mu } \in \overline { \mathcal {M} ( \Sigma , \sigma ) } $
implies the finite cyclic predecessor assumption. In fact, assume that
$ \mathscr {P} (a) \cap \mathscr {F}_{m} (a) $
is infinite for some
$ a \in \mathscr {L}_1 $
and
$ m \geq 1 $
. For each
$ l \in \mathscr {P} (a) \cap \mathscr {F}_{m} (a) $
, thus, there exists a periodic point
$ y^l = ( a, u^{l}, l, a, u^{l}, l \ldots ) \in \Sigma $
, where
$ u^{l} \in \mathscr {L}_{m-1} $
is a word that connects
$ a $
to
$ l $
. Because
$ \mathcal {M} ( \hat {\Sigma } ) $
is compact, there is a subsequence of periodic measures
$ \dot {\mu }_{l'} = ({1}/({ m + 1 })) \sum _{ i = 0 }^{m} \delta _{ \sigma ^i ( y^{l'} ) } $
converging to some
$ \bar {\mu } \in \overline { \mathcal {M} ( \Sigma , \sigma ) } $
, as
$ l' \in \mathscr {P} (a) \cap \mathscr {F}_{m} (a) $
tends to
$ \infty $
. Note that there is a subsequence
$ y^{l"} $
converging to a point
$ ( a, v, B_r, B_r, \ldots ) \in \hat {\Sigma } $
, with
$ \ell (v) < m $
and
$ 1 \leq r \leq s $
. For any clopen neighborhood
$ Z [ a v B_r; S ] $
of
$ ( a, v, B_r, B_r, \ldots ) $
, it is easy to see that
$ \bar {\mu }_{l"} ( Z [ a v B_r; S ] ) \geq ({1}/({ m + 1 })) \delta _{ y^{l"} } ( Z [ a v B_r; S ] ) = {1}/({ m + 1 }) $
for
$ l" $
sufficiently large. By the portmanteau theorem, considering
$ S_n = B_r \cap \{ 1, \ldots , n \} $
, we obtain that
For
$ \ell (v) \geq 1 $
, the contrapositive statement is a consequence of the following lemma for general shifts.
Lemma 3.8. Let
$ \Sigma $
be a shift over a countable alphabet. If
$ \bar {\mu } \in \overline { \mathcal {M} ( \Sigma , \sigma ) } $
, then
$ \bar {\mu } ( \hat {\mathcal {L}}_{k+1} ) \leq \bar {\mu } ( \hat {\mathcal {L}}_{k} ) $
for all
$ k \geq 1 $
.
Proof. Let
$ \{ \mu _n \} \in \mathcal {M} ( \Sigma , \sigma ) \subset \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
be a sequence that converges to
$ \bar {\mu } \in \mathcal {M} ( \hat {\Sigma } ) $
. Fix a point
$ ( w, B_r, B_r, \ldots ) $
, with
$ \ell (w) \geq 1 $
, and consider any finite subset
$ S $
of
$ B_r $
. The clopen
$ Z [ w B_r; S ] $
that does not intersect
$ \hat {\mathcal {L}}_0 $
. Since
$ \hat {\sigma } $
is continuous outside
$ \hat {\mathcal {L}}_0 $
, the inverse image
$ \hat {\sigma }^{-1} ( Z [ w; B_r; S ] ) $
is an open neighborhood of
$ \hat {\sigma }^{-1} ( \{ ( w, B_r, B_r, \ldots ) \} ) $
. Due to the portmanteau theorem and
$ \hat {\sigma } $
-invariance of
$ \mu _n $
, we obtain
$$ \begin{align*} \bar{\mu} ( \hat{\sigma}^{-1} ( \{ ( w, B_r, B_r, \ldots) \} ) ) & \leq \bar{\mu} (\hat{\sigma}^{-1} ( Z [ w B_r; S ])) \leq \liminf_{ n \to \infty } \mu_n ( \hat{\sigma}^{-1} (Z[ w B_r; S])) \\ & = \lim_{ n \to \infty } \mu_n ( Z [ w B_r; S ] ) = \bar{\mu} ( Z [ w B_r; S ] ). \end{align*} $$
As this inequality holds for every finite subset
$ S $
of
$ B_r $
, we have that
$ \bar {\mu } ( \hat {\sigma }^{-1} ( \{ ( w, B_r, B_r,\ldots )\})) \leq \bar {\mu } ( \{ ( w, B_r, B_r, \ldots ) \} ) $
. To conclude the proof, note that
$ \hat {\mathcal {L}}_{k} $
is a countable union of singletons
$ \{ ( w, B_r, B_r, \ldots ) \} $
, with
$ \ell (w) = k $
and
$1 \leq r \leq s$
, and consequently,
$ \hat {\mathcal {L}}_{k+1} = \hat {\sigma }^{-1} ( \hat {\mathcal {L}}_{k} ) $
is a countable disjoint union of inverse images
$ \hat {\sigma }^{-1} ( \{ ( w, B_r, B_r, \ldots ) \} ) $
.
In the next result, we provide equivalent formulations for the compactness of the set of blur invariant probability measures.
Proposition 3.9. Let
$ \Sigma $
be a shift over a countable alphabet. The following statements are equivalent:
-
(i)
$ \bar {\mu } ( \hat {\mathcal {L}}_1 ) = 0 $
for every
$ \bar {\mu } \in \overline { \mathcal {M} ( \Sigma , \sigma ) } $
; -
(ii)
$ \bar {\mu } ( \bigsqcup _{ k \geq 1 } \hat {\mathcal {L}}_k ) = 0 $
for all
$ \bar {\mu } \in \overline { \mathcal {M} ( \Sigma , \sigma ) } $
; -
(iii)
$ \overline { \mathcal {M} ( \Sigma , \sigma ) } \subset \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
; -
(iv)
$ \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
is (weak
$^{\ast }$
) compact.
Proof. The equivalence (i)
$\Leftrightarrow $
(ii) is a straightforward consequence of Lemma 3.8. The implication (iii)
$\Rightarrow $
(ii) is immediate from Lemma 3.2. Moreover, (iv)
$\Rightarrow $
(iii) because, when assuming statement (iv),
$ \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
is closed. It remains to prove that (ii)
$ \Rightarrow $
(iii)
$ \Rightarrow $
(iv).
(ii)
$\Rightarrow $
(iii). Let
$ \{ \mu _n \} \in \mathcal {M} ( \Sigma , \sigma ) \subset \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
be a sequence that converges to a probability
$ \bar {\mu } \in \overline { \mathcal {M} ( \Sigma , \sigma ) } $
. To show that
$ \bar {\mu } \in \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
, it is sufficient to verify the
$ \hat {\sigma } $
-invariance of the measure on generalized cylinder sets.
Case 1:
$ Z [ \epsilon ] = \hat {\Sigma } $
. It is trivial.
Case 2:
$ Z [ u ] $
with
$ \ell (u) \geq 1 $
. Remember that
$ Z [ u ] $
is a the clopen set that does not intersect
$ \hat {\mathcal {L}}_0 $
. Since
$ \hat {\sigma } $
is continuous outside
$ \hat {\mathcal {L}}_0 $
, the inverse image
$ \hat {\sigma }^{-1} ( Z [ u ] ) $
is open. Due to the portmanteau theorem and
$ \hat {\sigma } $
-invariance of
$ \mu _n $
, we obtain
Note that, for each
$ k \geq 1 $
, we have the partitions
$$ \begin{align*} \hat{\Sigma} \setminus ( \hat{\mathcal{L}}_{0} \sqcup \cdots \sqcup \hat{\mathcal{L}}_{k-1} ) & = \bigsqcup_{ v \in \mathscr{L}_k } Z [ v ]\quad \text{and} \\ \hat{\Sigma} \setminus ( \hat{\mathcal{L}}_{0} \sqcup \cdots \sqcup \hat{\mathcal{L}}_{k} ) = \hat{\Sigma} \setminus \hat{\sigma}^{-1} ( \hat{\mathcal{L}}_{0} \sqcup \cdots \sqcup \hat{\mathcal{L}}_{k-1} ) & = \bigsqcup_{ v \in \mathscr{L}_k } \hat{\sigma}^{-1} (Z[v]). \end{align*} $$
From item (ii), it is easy to see that
Hence, for any
$ u $
with
$ \ell (u) = k \geq 1 $
, the inequality (3.1) allows us to obtain its opposite as follows:
$$ \begin{align*} \bar{\mu} (Z[u]) & = \bar{\mu} ( \hat{\Sigma} \setminus \hat{\mathcal{L}}_0 ) - \sum_{ v \in \mathscr{L}_k \setminus \{ u \} } \bar{\mu} (Z[v]) \\ & \leq \bar{\mu} ( \hat{\Sigma} \setminus \hat{\mathcal{L}}_0 ) - \sum_{ v \in \mathscr{L}_k \setminus \{ u \} } \bar{\mu} (\hat{\sigma}^{-1} (Z[v])) = \bar{\mu} (\hat{\sigma}^{-1} (Z[u])). \end{align*} $$
Case 3:
$ Z [ B_r; S ] $
. First, observe that
$$ \begin{align*} Z [ B_r; S ] = \{( B_r, B_r, \ldots)\} & \sqcup \,\bigsqcup_{ j \in B_r \setminus S } Z [ j ] \quad \text{and} \\ \hat{\sigma}^{-1} (Z[B_r; S]) = \hat{\sigma}^{-1} (\{( B_r, B_r, \ldots)\}) & \sqcup \bigsqcup_{ j \in B_r \setminus S } \hat{\sigma}^{-1} (Z[j]), \end{align*} $$
as well as
Due to item (ii), the measurable set
$ \{ ( i, B_r, B_r, \ldots ) : \mathscr {F}_{1} (i) \cap B_r \text { is infinite} \} \subset \hat {\mathcal {L}}_1 $
has null measure and, therefore,
$ \bar {\mu } ( \hat {\sigma }^{-1} ( \{ ( B_r, B_r, \ldots ) \} ) ) = \bar {\mu } ( \{ ( B_r, B_r, \ldots ) \} ) $
. Thanks to the previous case, we have
$ \bar {\mu } ( Z [ j ] ) = \bar {\mu } ( \hat {\sigma }^{-1} ( Z [ j ] ) ) $
for every
$ j \in B_r \setminus S $
, which shows that
$ \bar {\mu } ( Z [ B_r; S ] ) = \bar {\mu } ( \hat {\sigma }^{-1} ( Z [ B_r; S ] ) ) $
.
Case 4:
$ Z [ w B_r; S ] $
with
$ \ell (w) \geq 1 $
. Note that
$$ \begin{align*} Z[w B_r; S] & = \{(w, B_r, B_r, \ldots)\} \sqcup \bigsqcup_{j \in B_r \setminus S} Z[w j],\\ \hat{\sigma}^{-1} (Z[w B_r; S]) & = \bigsqcup_{i \in \mathscr{P} (w)} \{(i, w, B_r, B_r, \ldots)\} \sqcup \bigsqcup_{ j \in B_r \setminus S } \hat{\sigma}^{-1} (Z[w j]). \end{align*} $$
From item (ii), both
$ \{ ( w, B_r, B_r, \ldots ) \} \subset \hat {\mathcal {L}}_{ \ell (w) } $
and
$ \bigsqcup _{ i \in \mathscr {P} ( w ) } \{ ( i, w, B_r, B_r, \ldots ) \} \subset \hat {\mathcal {L}}_{ \ell (w) + 1 } $
have null measure. As we already know that
$ \bar {\mu } ( Z [ w j ] ) = \bar {\mu } ( \hat {\sigma }^{-1} ( Z [ w j ] ) ) $
for all
$ j \in B_r \setminus S $
, we conclude that
$ \bar {\mu } ( Z [ w B_r; S ] ) = \bar {\mu } ( \hat {\sigma }^{-1} ( Z [ w B_r; S ] ) ) $
.
(iii)
$ \Rightarrow $
(iv). Consider a sequence
$ \{ \hat {\mu }_n \} \in \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
. By the convex decomposition of Proposition 3.4, we may write
where
$ t_n \in [ 0, 1 ] $
,
$ \dot {\mu }_n \in \mathcal {M} ( \Sigma , \sigma ) $
and
$ \hat {\nu }_n \in \mathrm {Conv} ( \{ \delta _{ ( B_r, B_r, \ldots ) } : 1 \leq r \leq s \} ) $
. Note that:
-
• there exists a subsequence
$ \{ t_{n'} \} $
whose limit is
$ t \in [0,1] $
; -
• by compactness of the simplex
$ \mathrm {Conv} ( \{ \delta _{ ( B_r, B_r, \ldots ) } : 1 \leq r \leq s \} ) $
, there exists a subsequence
$ \{ \hat {\nu }_{n"} \} $
that converges to
$ \bar {\nu } \in \mathrm {Conv} ( \{ \delta _{ ( B_r, B_r, \ldots ) } : 1 \leq r \leq s \} ) $
; -
• since
$ \overline { \mathcal {M} ( \Sigma , \sigma ) } \subset \mathcal {M} ( \hat {\Sigma } ) $
is also compact, there exists a convergent subsequence
$ \{ \dot {\mu }_{n"'} \} $
such that, due to item (iii) and Proposition 3.4, its limit probability is given as
$ \tau \dot {\mu } + ( 1 - \tau ) \dot {\nu } \in \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
, with
$\tau \in [ 0, 1 ]$
,
$\dot {\mu } \in \mathcal {M} ( \Sigma , \sigma ) $
, and
$ \dot {\nu } \in \mathrm {Conv} ( \{ \delta _{ ( B_r, B_r, \ldots ) } : 1 \leq r \leq s\})$
.
Taking this last subsequence, we obtain that
If
$ t = \tau = 1 $
, we have
$ \bar {\mu } = \dot {\mu } \in \mathcal {M} ( \Sigma , \sigma ) \subset \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
. Otherwise, we may write
$$ \begin{align*} \bar{\mu}= (t\tau) \dot{\mu} + (1 - t\tau) \bigg[\dfrac{(t - t \tau)}{(1 - t \tau)} \dot{\nu} + \dfrac{(1 - t)}{(1 - t \tau)} \bar{\nu}\bigg], \end{align*} $$
with
$ t \tau \in [ 0, 1 ) $
,
$ \dot {\mu } \in \mathcal {M} ( \Sigma , \sigma ) $
and
Hence, by Proposition 3.4,
$ \bar {\mu } \in \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
. This shows that
$ \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
is (sequentially) compact.
We can close the discussion of this subsection by achieving as a consequence of the previous proposition, in view of Lemma 3.6, the compactness of the set of blur invariant probabilities.
Corollary 3.10. Let
$ \Sigma $
be a shift over a countable alphabet that satisfies both the finite cyclic predecessor assumption and denseness of periodic measures. Then,
$ \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
is (weak
$^{\ast }$
) compact.
4 Extensions of potentials
Given a potential
$ A : \Sigma \to \mathbb {R} $
, as usual, we say that
$ \bar {A} : \hat {\Sigma } \to \mathbb {R} \cup \{ \pm \infty \} $
is an extension of
$ A $
to
$ \hat {\Sigma } $
whenever
$ \bar {A}|_{\Sigma } = A $
. Our aim in this section is to find extension functions on
$ \hat {\Sigma } $
that inherit the regularity of
$ A $
. In this section,
$ \Sigma $
will always be a general shift space over a countable alphabet. We will also adopt the name potential for any Borel function
$ \bar {A} : \hat {\Sigma } \to \mathbb {R} \cup \{ -\infty \} $
that is bounded from above. Although the restriction to
$ \Sigma $
of any bounded continuous real-valued function defined on
$ \hat {\Sigma } $
is, in fact, a bounded continuous real-valued function on
$ \Sigma $
, we cannot expect a bounded continuous real-valued function on
$ \Sigma $
to always admit a continuous extension on
$ \hat {\Sigma } $
.
Example 4.1. Let
$ ( u, B_r, B_r, \ldots ) $
be a point of
$ \partial \Sigma $
and consider a countable partition of
$ B_r = \bigsqcup _{k \geq 1 } C_k $
into infinite subsets. Let
$ \{ q_k \}_{ k \geq 1 } $
be an enumeration of rational numbers of
$ ( 0, 1 ] $
. We introduce the locally constant function
$ A : [ u ] \to [0,1] $
defined on the cylinder set
$ [ u ] \in \Sigma $
as
$$ \begin{align*} A (u a x) = A (u a) = \begin{cases} 0 & \text{if } a \in \mathscr{A} \setminus B_r, \\ q_k & \text{if } a \in C_k, \end{cases} \end{align*} $$
where
$ a \in \mathscr {A} $
,
$ x \in \Sigma $
, and
$ u a x \in \Sigma $
. Due to Tietze extension theorem, there is a bounded continuous
$ A : \Sigma \to [0,1] $
, which extends this locally constant function to
$ \Sigma $
. However, for any open neighborhood
$ Z [ u B_r; S ] $
of
$ ( u, B_r, B_r, \ldots ) $
, the oscillation of
$ A $
on
$ Z [ u B_r; S ] \cap \Sigma $
is always equal to
$ 1 $
. We conclude that there is no continuous extension to
$ \hat {\Sigma } $
.
However, upper semi-continuity gives us a lot of freedom to obtain extension functions on
$ \hat {\Sigma } $
with the same regularity, as highlighted in the following proposition. This is a property resulting from the topology of
$ \hat {\Sigma } $
, especially from the fact that, according to item (ii) of Lemma 2.1, any sequence in
$ \partial \Sigma $
converging to a point of the form
$ ( v, B_r, B_r, \ldots ) \in \partial \Sigma $
is eventually constant.
Proposition 4.2. Let
$ A : \Sigma \to \mathbb {R} $
be an upper semi-continuous potential. Suppose that
$ \bar {A} : \hat {\Sigma } \to \mathbb {R} \cup \{ -\infty \} $
is an extension of
$ A $
fulfilling, for all
$ ( v, B_r, B_r, \ldots ) \in \partial \Sigma $
,
Then,
$ \bar {A} $
is an upper semi-continuous potential.
Proof. Obviously,
$ \bar {A} $
is bounded from above. Concerning its regularity, for any real number
$ R $
, we need to argue that
$ \bar {A}^{-1} [-\infty , R) $
is an open set of
$ \hat {\Sigma } $
. Our discussion will be divided into two situations, depending on whether a point of this set belongs to
$ \partial \Sigma $
or not.
First, consider a point
$ ( v, B_r, B_r, \ldots ) $
of
$ \partial \Sigma $
verifying
$ \limsup _{B_r \ni i \to \infty } \sup A|_{[vi]} \leq \bar {A} (v B_r, B_r,\ldots ) < R $
. From the definition of superior limit, there exists a finite subset
$ S \subset B_r $
such that
$ \sup A|_{[v j]} < R $
for all
$ j \in B_r \setminus S $
. Take the neighborhood
$ Z [ v B_r; S ] $
of
$ ( v, B_r, B_r, \ldots ) $
, whose points are of the form
where
$ j \in B_r \setminus S $
is a symbol,
$ w \in \mathscr {L} $
is a finite (or empty) word, and
$ ( y_0, y_1, \ldots ) \in \Sigma $
is an infinite sequence. In particular, we obtain
$$ \begin{align*} \bar{A} (v, j, w, B_r, B_r, \ldots) \leq \sup A|_{[v j w]} \leq \sup A|_{[v j]} & < R \quad\text{or}\\ \bar{A} (v,j, y_0, y_1,\ldots)=A (v, j, y_0, y_1,\ldots)\leq\sup A|_{[v j]} & < R \end{align*} $$
for every
$ j \in B_r \setminus S $
. Hence,
$ Z [ v B_r; S ] \subset \bar {A}^{-1} [-\infty , R) $
.
Now, let
$ x \in \Sigma $
be a point such that
$ A ( x ) = \bar {A} ( x ) < R $
. Since
$ A $
is upper semi-continuous on
$ \Sigma $
, given
$ \varepsilon> 0 $
, there exists a neighborhood
$ [ x_0, \ldots , x_{n} ] $
of
$ x $
such that
$ A (z) < R - \varepsilon $
for all
$ z \in [ x_0, \ldots , x_{n} ] $
. Consider the neighborhood
$ Z [ x_0, \ldots , x_{n} ] $
of
$ x $
in
$ \hat {\Sigma } $
, whose points are
where
$ w \in \mathscr {L} $
is a finite (or empty) word and
$ ( y_0, y_1, \ldots ) \in \Sigma $
is an infinite sequence. Note that
$$ \begin{align*} \bar{A} (x_0, \ldots, x_{n}, w, B_r, B_r, \ldots) \leq \sup A|_{[ x_0, \ldots, x_{n}, w ] } \leq &\sup A|_{[ x_0, \ldots, x_{n} ] } \leq\! R - \varepsilon <\! R \\\text{or}\quad\! \bar{A} ( x_0, \ldots, x_{n},{\kern-1pt} y_0,{\kern-1pt} y_1, \ldots) {\kern-1pt}={\kern-1pt} A (x_0, \ldots, x_{n}, y_0, y_1, \ldots) {\kern-1pt}\leq &\sup A|_{ [ x_0, \ldots, x_{n} ] } \leq\! R {\kern-1pt}-{\kern-1pt} \varepsilon <\! R. \end{align*} $$
Therefore,
$ Z [ x_0, \ldots , x_{n} ] \subset \bar {A}^{-1} [-\infty , R)$
.
We will take into account a natural choice of extension potential, which will play a key role in the prove of the existence theorem in §6.
Corollary 4.3. Let
$ A : \Sigma \to \mathbb {R} \cup \{ -\infty \} $
be an upper semi-continuous potential. Then, the potential
$ \hat {A} : \hat {\Sigma } \longrightarrow \mathbb {R} \cup \{ -\infty \} $
given as
$$ \begin{align*} x \in \Sigma & \longmapsto \hat{A} (x) = A (x) \quad \text{and}\\ (v, B_r, B_r, \ldots) \in \partial \Sigma & \longmapsto \hat{A} (v, B_r, B_r, \ldots) = \limsup_{B_r \ni i \to \infty} \sup A|_{[ v i ]} \end{align*} $$
is the minimal upper semi-continuous extension of
$ A $
to
$ \hat {\Sigma } $
.
Proof. Due to Proposition 4.2, we already know that
$ \hat {A} $
is an upper semi-continuous potential. Let
$ \bar {A} : \Sigma \to \mathbb {R} \cup \{ -\infty \} $
be any upper semi-continuous extension of
$ A $
. We claim that
$ \hat {A}|_{ \partial \Sigma } \leq \bar {A}|_{ \partial \Sigma } $
. As matter of fact, given any real number
$ R $
such that
$ \bar {A} ( v, B_r, B_r, \ldots ) < R $
, there exists a finite subset
$ S_R \subset B_r $
that verifies
$ Z [ v B_r; S_R ] \subset \bar {A}^{-1} [-\infty , R) $
. Since
$ [ v i ] \subset Z [ v B_r; S_R ] $
for
$ i \in B_r \setminus S_R $
, note that
$ \sup A|_{ [ v i ] } \leq \sup \bar {A}|_{ Z [ v B_r; S_R ] } \leq R $
whenever
$ i \in B_r \setminus S_R $
. By definition, we have
$ {\hat {A} ( v, B_r, B_r, \ldots ) = \limsup _{ B_r \ni i \to \infty } \sup A|_{ [ v i ] } \leq R }$
. Hence,
$ \hat {A} ( v, B_r, B_r, \ldots ) $
became a lower bound for any real number
$ R> \bar {A} ( v, B_r, B_r, \ldots ) $
. We conclude that
$ \hat {A} ( v, B_r, B_r, \ldots ) \leq \bar {A} ( v, B_r, B_r, \ldots ) $
.
A well-studied class of potentials in the literature [Reference Bissacot and Freire3, Reference Bissacot and Garibaldi4, Reference Garibaldi and Gomes6, Reference Jenkinson, Mauldin and Urbański9] is the coercive one, namely, those potentials
$ A $
fulfilling
$ \lim _{i \to \infty } \sup A|_{[i]} = - \infty $
. The corresponding minimal extension
$ \hat {A} $
exhibits in the coercive framework a convenient behavior as described in the following example.
Example 4.4. Assume that
$ A $
is a coercive upper semi-continuous potential and consider the minimal upper semi-continuous extension
$ \hat {A} $
. It is easy to see that
$$ \begin{align*} \int \hat{A}\, d \delta_{( B_r, B_r, \ldots )} = \hat{A} ( B_r, B_r, \ldots) = \limsup_{ B_r \ni i \to \infty } \sup A|_{[i]} = \lim_{ i \to \infty } \sup A|_{[i]} = -\infty \end{align*} $$
for every
$ ( B_r, B_r, \ldots ) \in \hat {\mathcal {L}}_0 $
. In particular, the coercive assumption prevents the invariant probabilities giving mass
$ 1 $
to
$ \partial \Sigma $
from being maximizing measures.
In §6, we will show that such a behavior may be observed with respect to more general potentials
$ A $
: those satisfying
$ \limsup _{ i \to \infty } \sup A|_{[i]} < \beta (A) $
, a key assumption in the existence theorem. To do that, we will study in §5 the main aspects of ergodic optimization theory on blur shifts.
5 Ergodic optimization
Throughout this section, let
$ \Sigma $
be any shift over a countable alphabet, assume that
${ A : \Sigma \to \mathbb {R} }$
is a potential and consider
$ \bar {A} : \hat {\Sigma } \to \mathbb {R} \cup \{ - \infty \} $
any extension potential of
$ A $
. We introduce the following ergodic maximizing constants:
$$ \begin{align*} \beta (A) & = \sup_{ \mu \in \mathcal{M} (\Sigma, \sigma)} \int_{ \Sigma } A\, d \mu, \\ \hat{ \beta } ( \bar{A} ) & = \sup_{ \hat{\mu} \in \mathcal{M} ( \hat{\Sigma}, \hat{\sigma} ) } \int_{ \hat{\Sigma} } \bar{A}\, d \hat{\mu}\quad \text{and} \\ \max \bar{A}|_{\hat{\mathcal{L}}_{0} } & = \max_{ 1 \leq r \leq s } \int \bar{A}\, d \delta_{ ( B_r, B_r, \ldots)}. \end{align*} $$
As a notational shorthand, we may also use
$ \vee $
for the maximum between values. The above constants are related as follows.
Lemma 5.1.
$ \hat {\beta } ( \bar {A} ) = \beta ( A ) \vee \max \bar {A}|_{ \hat {\mathcal {L}}_{0} }.$
Proof. Remember that
$ \mathcal {M} ( \Sigma , \sigma ) \subset \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
. In addition,
$ \bar {A} ( B_r, B_r, \ldots ) = \int \bar {A} \,d \delta _{ ( B_r, B_r, \ldots ) } $
and
$ \delta _{ ( B_r, B_r, \ldots ) } $
is a
$\hat {\sigma } $
-invariant probability measure for every
$ 1 \leq r \leq s $
. It is immediate from these facts that
$ \beta ( A ) \vee \max \bar {A}|_{\hat {\mathcal {L}}_{0} } \leq \hat {\beta } ( \bar {A} ) $
. However, the convex characterization of Proposition 3.4 implies
$$ \begin{align*} \hat{\beta} (\bar{A}) & = \sup \bigg\{t \int_{ \Sigma } A \,d \dot{\mu} \,+\, ( 1 - t ) \int_{ \partial \Sigma } \bar{A} \,d \hat{\nu}: \\ &\qquad\qquad t \in [0,1], \dot{\mu} \in \mathcal{M} (\Sigma, \sigma)\text{ and } \hat{\nu} \in \mathrm{Conv} (\{ \delta_{( B_r, B_r, \ldots)} : 1 \leq r \leq s\})\bigg\}\\ & \leq \beta (A) \vee \max \bigg\{\!\sum_{ r = 1 }^{ s } \alpha_r \bar{A} (B_r, B_r, \ldots) :\sum_{ r = 1 }^{ s } \alpha_r = 1 \bigg\}. \end{align*} $$
Clearly, the last term is less than or equal to
$ \max \bar {A}|_{ \hat {\mathcal {L}}_{0} }$
.
Henceforward, we take into account the maximizing sets
$$ \begin{align*} \mathcal{M}_{\max} ( A) & = \bigg\{\mu \in \mathcal{M} (\Sigma, \sigma) : \int_{\Sigma} A\, d \mu = \beta (A)\bigg\},\\ \hat{\mathcal{M}}_{\max} (\bar{A}) & = \bigg\{\hat{\mu}\in \mathcal{M} (\hat{\Sigma}, \hat{\sigma}) : \int_{ \hat{\Sigma} } \bar{A} \,d \hat{\mu} = \hat{\beta} (\bar{A})\bigg\} \quad \text{and}\\ \hat{\mathcal{D}}_{\max} (\bar{A}) & = \{\delta_{( B_r, B_r, \ldots)} : \bar{A} (B_r, B_r, \ldots) = \max \bar{A}|_{\hat{\mathcal{L}}_{0}}\}. \end{align*} $$
It is clear that
$ \hat {\mathcal {D}}_{\max } $
is a non-empty set. Nevertheless, the convex sets
$ \mathcal {M}_{\max } ( A ) $
and
$ \hat {\mathcal {M}}_{\max } ( \bar {A} ) $
may be empty or not. The maximizing sets are associated according to the following result.
Proposition 5.2.
-
(i) If
$ \max \bar {A}|_{ \hat {\mathcal {L}}_{0} } < \beta ( A ) $
, then
$ \hat {\mathcal {M}}_{\max } ( \bar {A} ) = \mathcal {M}_{\max } ( A ) $
. -
(ii) If
$ \max \bar {A}|_{ \hat {\mathcal {L}}_{0} } = \beta ( A ) $
, then
$ \hat {\mathcal {M}}_{\max } ( \bar {A} ) = \mathrm {Conv} ( \mathcal {M}_{\max } ( A ) \sqcup \hat {\mathcal {D}}_{\max } (\bar {A}))$
. -
(iii) If
$ \max \bar {A}|_{ \hat {\mathcal {L}}_{0} }> \beta ( A ) $
, then
$ \hat {\mathcal {M}}_{\max } ( \bar {A} ) = \mathrm {Conv} ( \hat {\mathcal {D}}_{\max } ( \bar {A} ) ) $
.
Proof. First, note that Lemma 5.1 and Proposition 3.4 guarantee that any maximizing measure
$ \hat {\mu } \in \hat {\mathcal {M}}_{\max } ( \bar {A} ) $
(when it exists) satisfies
$$ \begin{align} \beta (A) \vee \max \bar{A}|_{\hat{\mathcal{L}}_{0}} = \int_{ \hat{\Sigma} } \bar{A} \,d \hat{\mu} = t \int_{\Sigma} A\, d \dot{\mu} + (1-t) \sum_{r = 1}^{s} \alpha_r \bar{A} (B_r, B_r, \ldots), \end{align} $$
where
$ t \dot {\mu } + (1-t) \sum _{r=1}^{s} \alpha _r \delta _{ ( B_r, B_r, \ldots ) } $
, with
$ t \in [0,1] $
,
$ \dot {\mu } \in \mathcal {M} ( \Sigma , \sigma ) $
,
$ \alpha _r \geq 0 $
, and
$ \sum _{r} \alpha _r = 1 $
, is the convex representation of
$ \hat {\mu } $
.
Item (i): from Lemma 5.1, we have
$ \hat {\beta } ( \bar {A} ) = \beta ( A ) $
. It is straightforward that
$ \mathcal {M}_{\max } ( A ) \subset \hat {\mathcal {M}}_{\max } ( \bar {A} ) $
. However, assume that there exists a maximizing probability
$ \hat {\mu } \in \hat {\mathcal {M}}_{\max } ( \bar {A} ) $
(otherwise, the result is trivial). By (5.1), it follows that
$$ \begin{align*} \beta ( A ) = t \int_{\Sigma} A \,d \dot{\mu} + (1-t)\, \sum_{ r = 1 }^{ s } \alpha_r \bar{A} ( B_r, B_r, \ldots ) \leq t \beta ( A ) + (1-t) \max \bar{A}|_{ \hat{\mathcal{L}}_{0} }. \end{align*} $$
Due to the hypothesis, this inequality only holds when
$ t = 1 $
. Thus, the maximizing measure
$ \hat {\mu } = \dot {\mu } \in \mathcal {M} ( \Sigma , \sigma ) $
verifies
$ \int A \,d \dot {\mu } = \beta ( A ) $
, which means that
$ \hat {\mathcal {M}}_{\max } ( \bar {A} ) \subset \mathcal {M}_{\max } ( A ) $
.
Item (ii): from Lemma 5.1,
$ \hat {\beta } ( \bar {A} ) = \beta ( A ) = \max \bar {A}|_{ \hat {\mathcal {L}}_{0} } $
. These equalities imply that
$ \hat {\mathcal {M}}_{\max } ( \bar {A} ) $
contains both
$ \mathcal {M}_{\max } ( A ) $
and
$ \hat {\mathcal {D}}_{\max } ( \bar {A} ) $
. In particular,
$ \hat {\mathcal {M}}_{\max } ( \bar {A} ) $
is non-empty and, because of its convexity,
$ \mathrm {Conv} ( \mathcal {M}_{\max } ( A ) \sqcup \hat {\mathcal {D}}_{\max } ( \bar {A} ) ) \subset \hat {\mathcal {M}}_{\max } ( \bar {A} ) $
. Given
$ \hat {\mu } \in \hat {\mathcal {M}}_{\max } ( \bar {A} ) $
, since
$ \hat {\mu } = t \dot {\mu } + (1-t) \sum _{r=1}^{s} \alpha _r \delta _{ ( B_r, B_r, \ldots ) } $
, it remains to show that
$ \dot {\mu } \in \mathcal {M}_{\max } ( A ) $
and
$ \sum _{ r = 1 }^{ s } \alpha _r \delta _{ ( B_r, B_r, \ldots ) } \in \mathrm {Conv} ( \hat {\mathcal {D}}_{\max } ( \bar {A} ) ) $
. If either
$ t = 0 $
or
$ t = 1 $
, this is obvious. For
$ t \in ( 0, 1 ) $
, it follows from (5.1) that
$$ \begin{align*} \int_{\Sigma} A \,d \dot{\mu} = \dfrac{1}{t} \bigg[ \beta ( A ) - (1-t) \sum_{ r = 1 }^{ s } \alpha_r \bar{A} ( B_r, B_r, \ldots )\bigg] & \geq \beta (A) \\ \text{and} \quad \sum_{ r = 1 }^{ s } \alpha_r \bar{A} ( B_r, B_r, \ldots ) = \dfrac{1}{1-t} \bigg[\!\max \bar{A}|_{ \hat{\mathcal{L}}_{0}} - t \int_{\Sigma} A \,d \dot{\mu}\bigg] & \geq \max \bar{A}|_{ \hat{\mathcal{L}}_{0} }. \end{align*} $$
It is easy to see that the above inequalities are in fact equalities, from which we immediately conclude the discussion.
Item (iii): Lemma 5.1 implies
$ \hat {\beta } ( \bar {A} ) = \max \bar {A}|_{ \hat {\mathcal {L}}_{0} } $
. Consequently,
$ \hat {\mathcal {D}}_{\max } ( \bar {A} ) \subset \hat {\mathcal {M}}_{\max } ( \bar {A} ) $
and
$ \hat {\mathcal {M}}_{\max } ( \bar {A} ) $
is non-empty. By convexity,
$ \mathrm {Conv} ( \hat {\mathcal {D}}_{\max } ( \bar {A} ) ) \subset \hat {\mathcal {M}}_{\max } ( \bar {A} ) $
. Given any
${ \hat {\mu } \in \hat {\mathcal {M}}_{\max } ( \bar {A} ) }$
, it follows from (5.1) that
$$ \begin{align*} \max \bar{A}|_{ \hat{\mathcal{L}}_{0} } = t \int_{\Sigma} A \,d \dot{\mu} + (1-t) \sum_{ r = 1 }^{ s } \alpha_r \bar{A} ( B_r, B_r, \ldots ) \leq t \beta (A) + (1-t) \max \bar{A}|_{ \hat{\mathcal{L}}_{0} }, \end{align*} $$
which, thanks to the hypothesis, only holds if
$ t = 0 $
. Hence,
$ \bar {A} ( B_r, B_r, \ldots ) = \max \bar {A}|_{ \hat {\mathcal {L}}_{0} } $
whenever
$ \alpha _r> 0 $
, and we obtain that
$ \hat {\mu } = \sum _{ r = 1 }^{ s } \alpha _r \delta _{ ( B_r, B_r, \ldots ) } \in \mathrm {Conv} ( \hat {\mathcal {D}}_{\max } ( \bar {A} ) ) $
.
Remark 5.3. Since the set
$ \hat {\mathcal {D}}_{\max } $
is always non-empty, we can conclude that the converse of item (i) of Proposition 5.2 is also true.
6 Existence of maximizing measure
This section is dedicated to prove the existence theorem via the blur shift compactification method.
Proof of the existence theorem
Due to Corollary 3.10,
$ \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) $
is a compact set. Note that the upper semi-continuous extension potential
$ \hat {A} $
(guaranteed by Corollary 4.3) induces an upper semi-continuous application
$ \hat {\mu } \in \mathcal {M} ( \hat {\Sigma }, \hat {\sigma } ) \longmapsto \int _{ \hat {\Sigma } } \hat {A} \,d \hat {\mu } $
. Therefore,
$ \hat {\mathcal {M}}_{\max } ( \hat {A} ) $
is non-empty. From the key assumption on the potential, we obtain
Hence, item (i) of Proposition 5.2 implies that
$ \mathcal {M}_{\max } ( A ) = \hat {\mathcal {M}}_{\max } ( \hat {A} ) $
is non-empty. The last statement of the theorem will follow from Proposition 6.1.
Proposition 6.1. Let
$ A : \Sigma \to \mathbb {R} $
be an upper semi-continuous potential that admits a maximizing probability
$ \mu \in \mathcal {M} ( \Sigma , \sigma ) $
and fulfills both conditions
Then, there exists a common finite alphabet subshift of
$ \Sigma $
that contains the support of any maximizing measure for
$ A $
.
Proof. Since
$ \beta (A) \leq \sup A $
in general, the hypothesis means that
$ \beta (A) = \sup A $
. First, we claim that the support of any maximizing probability for
$ A $
lies in
$ A^{-1} ( \beta (A) ) $
. In fact, for
$ \mu \in \mathcal {M}_{\max } ( A ) $
, we clearly have
By the upper semi-continuity of
$ A $
, it is immediate that
$ A^{-1} ( \beta (A) ) $
is a closed subset of
$ \Sigma $
, which shows our claim.
Let us prove the desired result by contrapositive argument. Assume that there exists an infinite sub-alphabet
$ \mathscr {C} $
of
$ \mathscr {L}_1 $
such that each cylinder set
$ [i] $
, for
$ i \in \mathscr {C} $
, has non-empty intersection with the support of some maximizing measure for
$ A $
. By the previous claim,
$ [i] \cap {A}^{-1} ( \beta (A) ) $
is also non-empty, so that
$ \sup A|_{[i]} \geq \beta (A) $
for every
$ i \in \mathscr {C} $
. We conclude that
We end this section by showing that the converse of the above proposition is false; more precisely, it may happen that the supports of all maximizing measures lie in a certain finite alphabet subshift and yet
$ \limsup _{ i \to \infty } \sup A|_{[i]} = \beta ( A ) $
.
Example 6.2. Let
$ y \in \Sigma $
be a periodic point, with period
$ p $
, and consider the associated periodic probability
$ \mu _{y} = ({1}/{ p }) \sum _{ i = 0 }^{ p - 1 } \delta _{ \sigma ^i ( y ) } $
. Introduce the bounded continuous potential
$ A : \Sigma \to \mathbb {R} $
given as
where
$ d $
denotes the usual distance on
$ \Sigma $
, namely, two infinite sequences of
$ \Sigma $
are far from each other
$\tfrac {1}{2} $
raised to the power of their first disagreement. Obviously,
$ \beta (A) = 0 $
and
$ \mathcal {M}_{\max } ( A ) = \{ \mu _{y} \} $
. In particular,
$ \sup A \leq \beta ( A ) $
and the subshift over a finite alphabet
$ \mathrm {orb} (y) $
is the support of the unique maximizing probability for
$ A $
. However,
Note that, in this case, the minimal extension
$ \hat {A} $
fulfills
$ \hat {A}|_{ \hat {\mathcal {L}}_{0} } \equiv \beta ( A ) $
and, by item (ii) of Proposition 5.2,
$ \hat {\mathcal {M}}_{\max } ( \hat {A} ) = \mathrm {Conv} ( \{ \mu _{y} \} \sqcup \{ \delta _{ ( B_r, B_r, \ldots ) } : 1 \leq r \leq s \} ) $
.
7 Non-Markovian example
We exhibit in this section a countable alphabet non-Markovian shift to which we may apply the results of this paper. Recall that a countable alphabet shift is called Markovian if there exists countable zero-one matrix
$ M $
such that any point
$ ( x_0, x_1, \ldots ) $
obeys
$ {M (x_i, x_{i+1}) = 1 }$
for all
$ i \geq 0 $
(for a comprehensive study, see, for instance, [Reference Kitchens10]). A key aspect of our analysis will be the specification property, which plays a crucial role in establishing the denseness of periodic measures. Observe that a shift space has the specification property if, given any finite collection of cylinder sets and sufficiently large gaps between them, one can construct a point that visits each cylinder set at the prescribed times while filling the gaps with arbitrary transitional words.

Figure 2 Graph representation of conditions (C2) and (C3) in Definition 7.1.
Definition 7.1. Consider a partition
$ \mathbb {Z}_{+} = \bigsqcup _{k=1}^{\infty } Z_{k} $
such that
$ Z_1 = \{ 1 \} $
,
$ Z_2 = \{ 2, 3 \} $
, and
$ Z_k $
are finite sets with increasing cardinality, that is,
$ | Z_k | < | Z_{k+1} | $
for all
$ k \geq 1 $
. Let
$ \Lambda $
be a (one-sided) shift space defined over a countable alphabet
$ \mathbb {Z}_{+} $
that satisfies the following conditions.
-
(C1) For every point
$ x \in \Lambda $
, there exists an integer
$ N_x \geq 0 $
, such that
$ \sigma ^n (x) \notin [1]$
for any
$ n \geq N_x $
. In other terms, the symbol
$1$
appears a finite number of times for each sequence. -
(C2) For
$ a \in Z_k $
and
$ k> 1 $
, the predecessor set
$ \mathscr {P} (a) $
is included in
$ Z_k $
, with the exception of
$ a = \min Z_{k} $
for which one also admits
$ 1 $
as an extra predecessor. In particular, the follower set
$ \mathscr {F}_1 (1) = \{ \min Z_k : k> 1 \} $
. -
(C3) For
$ b \in Z_k $
and
$ k> 1 $
, the follower set
$ \mathscr {F}_1 (b) $
is included in
$ Z_k $
, with the exception of
$ b = \max Z_{k} $
for which one also admits
$ 1 $
as an extra follower. In particular, the predecessor set
$ \mathscr {P} (1) = \{\max Z_{k} : k> 1 \} $
. -
(C4) The subshift
$ \Lambda \cap Z_{2}^{ \mathbb {N} } $
is the sofic even shift and, for any
$ k> 2 $
, each finite alphabet subshift
$ \Lambda \cap Z_{k}^{ \mathbb {N} } $
verifies the specification property. In particular, the restriction
$ \sigma |_{ \Lambda \cap Z_{k}^{ \mathbb {N} } } $
is transitive for all
$ k> 1 $
. -
(C5) Any word of the language of
$ \Lambda $
with first symbol
$ \min Z_{k} $
and last symbol
$ \max Z_{k} $
must have length at least
$|Z_{k}|$
.
We can explicitly exhibit the predecessor and follower relations of conditions (C2) and (C3) as arrow connections on a graph given in Figure 2. Moreover, conditions (C4) and (C5) may be interpreted as the subshift
$ \Lambda \cap Z_{k}^{\mathbb {N}} $
verifying certain regularity properties that ensure well-behaved dynamics within each partition class while maintaining global non-Markovian structure. Specifically, condition (C4) establishes that the restriction to each
$Z_k$
with
$k \geq 2$
inherits the specification property from finite alphabet theory, thereby guaranteeing transitivity and enabling the construction of connecting words between arbitrary cylinder sets. Meanwhile, condition (C5) imposes a minimal length constraint that prevents trivial loops and ensures that transitions between the extremal elements of each partition class cannot occur too rapidly, thus preserving the hierarchical structure encoded in the increasing cardinalities
$|Z_k|$
. (Observe that if we were to impose only condition (C5) to define all forbidden words, we would be led to consider a subshift of finite type that clearly satisfies the specification property, which thus constitutes a natural example of finite alphabet shift to be considered at any level.) Together, these conditions create a delicate balance: while the global system exhibits non-Markovian behavior, the local dynamics within each class retains sufficient regularity to support the specification property and its associated consequences for the ergodic theory of the system.
In the following results, we will show that any shift
$ ( \Lambda , \sigma ) $
verifying the conditions in the above Definition 7.1 possesses the properties:
-
• it is non-Markovian (Lemma 7.1);
-
• it obeys the finite cyclic predecessor assumption (Lemma 7.2);
-
• it is non-locally compact and the set
$ \partial \Lambda \setminus \hat {\mathcal {L}}_{0} $
is non-empty (Lemma 7.3); -
• it is topologically transitive (Lemma 7.5);
-
• it satisfies the denseness of periodic measures (Corollary 7.8).
Lemma 7.1. The shift space
$ \Lambda $
is non-Markovian.
Proof. Since
$ \Lambda $
contains the sofic even shift (property (C4)), it is non-Markovian.
Lemma 7.2. For the shift space
$ \Lambda $
, we have:
-
(i)
$ \mathscr {P} (1) \cap \mathscr {F}_{m} (1) $
is finite for all
$ m \geq 1 $
; -
(ii)
$ \mathscr {P} (a) $
is finite for every
$ a \geq 2 $
.
In particular,
$ \Lambda $
obeys the finite cyclic predecessor assumption.
Proof. Item (i) is a direct consequence of properties (C2), (C3), and (C5), and item (ii) is a reinterpretation of property (C2).
Lemma 7.3. The shift space
$ \Lambda $
is non-locally compact. Moreover, for any associated blur shift
$ ( \hat {\Lambda }, \hat {\sigma } ) $
with finite resolution, the set
$ \bigsqcup _{ k \geq 1 } \hat {\mathcal {L}}_k $
is non-empty.
Proof. Consider any cylinder set
$ [ x_0, \ldots , x_{n} ] $
. Due to properties (C4) and (C3), there exists a word
$ w $
of the language of
$ \Lambda $
with first symbol
$ x_{n} $
and last symbol
$ 1 $
. We can define a sequence of points
which has no accumulation point on
$ \Lambda $
. Therefore, the neighborhood basis consisting of cylinder sets is not (sequentially) compact.
To prove the second assertion, we can view the previous sequence
$ \{ y^{k} \}_{ k> 1 } $
as being defined in a blur shift space
$ \hat {\Lambda } $
with resolution
$ \mathscr {V} = \{ B_1, \ldots , B_s \} $
associated with
$ \Lambda $
. In particular, there exists some
$ B_r $
such that
$ \{ \min Z_{k} : k> 1 \} \cap B_r $
is infinite. Using item (ii) of Lemma 2.1, there exists a limit point
$ ( x_0, \ldots , x_{n-1}, w, B_r, B_r, \ldots ) \in \bigsqcup _{ k \geq 1 } \hat {\mathcal {L}}_k $
for the subsequence
$ y^{k} = ( x_0, \ldots , x_{n-1}, w, \min Z_{k}, \ldots ) $
, as
$ \min Z_{k} \in B_r $
with
$ k \to \infty $
.
Remark 7.4. Note that any associated blur shift
$ ( \hat {\Lambda }, \hat {\sigma } ) $
with finite resolution is non-trivial, in the sense that the set
$ \bigsqcup _{ k \geq 1 } \hat {\mathcal {L}}_k $
is non-empty. Nevertheless, this set has always null measure, as a consequence of item (ii) of Proposition 3.9.
Lemma 7.5. The shift
$ ( \Lambda , \sigma ) $
is topologically transitive.
Proof. Let
$ [ x_0, \ldots , x_{n} ] $
and
$ [ y_0, \ldots , y_{m} ] $
be two non-empty cylinder sets in the shift
$ \Lambda $
. From properties (C2), (C3), and (C4), it is straightforward to guarantee the existence of the following words:
-
•
$ u $
, with first symbol
$ x_n $
and last symbol
$ 1 $
, such that the word
$ ( x_0, \ldots , x_{n-1}, u ) $
belongs to the language of
$ \Lambda $
– in particular,
$ u = 1 $
when
$ x_n = 1 $
; -
•
$ v $
, with first symbol
$ 1 $
and last symbol
$ y_0 $
, such that the word
$ ( v, y_1, \ldots , y_{m} ) $
belongs to the language of
$ \Lambda $
– in particular,
$ v = 1 $
when
$ y_0 = 1 $
.
Hence, there exists a point
$ z = ( x_0, \ldots , x_{n-1}, u, 2, 3, v, y_1, \ldots , y_{m}, \ldots ) \in [ x_0, \ldots , x_{n} ] \subset \Lambda $
that verifies
$ \sigma ^{ n + \ell (u) + 2 + \ell (v) - 1 } (z) \in [ y_0, \ldots , y_{m} ] $
.
We establish a sufficient condition to the denseness of periodic measures.
Proposition 7.6. Let
$ ( \Sigma , \sigma ) $
be a shift over a countable alphabet that satisfies: given
$ \delta> 0 $
and
$ \nu \in \mathcal {M} ( \Sigma , \sigma ) $
, there exists a finite alphabet subshift
$ X_{ \nu , \delta } \subset \Sigma $
fulfilling the specification property such that
$ \nu ( \Sigma \setminus X_{ \nu , \delta } ) < \delta $
. Then,
$ ( \Sigma , \sigma ) $
obeys the denseness of periodic measures.
Proof. Let
$ \nu $
be a
$ \sigma $
-invariant probability measure on
$ \Sigma $
and consider the (weak
$^{\ast }$
) neighborhood
$ V_{\Sigma } ( \nu; f_1, \ldots , f_m; \varepsilon ) $
for some continuous bounded functions
$ f_1, \ldots , f_k $
on
$ \Sigma $
and for any
$ \varepsilon> 0 $
. We must exhibit a probability measure
$ \mu _{\mathrm {per}} $
that is supported on a periodic orbit and verifies, for every
$ 1 \leq l \leq m $
,
$$ \begin{align*} \bigg|\!\int_{\Sigma} f_{l} \,d \nu - \int_{\Sigma} f_{l} \,d \mu_{\mathrm{per}} \bigg| < \varepsilon. \end{align*} $$
Let
and
$ X = X_{ \nu , \delta } $
be the subshift provided by the hypothesis. From
, we obtain
$$ \begin{align*} \bigg|\!\int_{\Sigma} f_{l} \,d \nu - \int_{X} f_{l}\, d \nu \bigg| < \dfrac{\varepsilon}{3} \end{align*} $$
for each
$ 1 \leq l \leq m $
. Since
$ ( X, \sigma |_{X} ) $
is a finite alphabet shift satisfying the specification property and, on the Borel sets of
$ X $
, the probability
$ \mu = ({1}/{ \nu (X) }) \nu $
is a
$ \sigma |_{X} $
-invariant measure, we may apply [Reference Sigmund12, Theorem 1], which guarantees the existence of a probability measure
$ \mu _{ \mathrm {per} } $
supported on a periodic orbit of
$ X $
in the (weak
$^{\ast }$
) neighborhood
. Clearly,
$ \mu _{ \mathrm {per} } \in \mathcal {M} ( \Sigma , \sigma ) $
and
$$ \begin{align*} \bigg|\!\int_{X} f_{l} \,d \nu - \nu (X) \int_{\Sigma} f_{l} \,d \mu_{\mathrm{per}} \bigg| = | \int_{X} \nu (X) f_{l} \,d \mu - \int_{X} \nu (X) f_{l} \,d \mu_{\mathrm{per}}| < \dfrac{\varepsilon}{3} \end{align*} $$
for all
$ 1 \leq l \leq m $
. It remains to estimate
$$ \begin{align*} \bigg| \nu ( X ) \int_{\Sigma} f_{l} \,d \mu_{ \mathrm{per} } - \int_{\Sigma} f_{l} \,d \mu_{ \mathrm{per} } \bigg| \leq | \nu ( \Sigma \setminus X ) | \| f_l \|_\infty < \dfrac{\varepsilon}{3} \end{align*} $$
for every
$ 1 \leq l \leq m $
.
Lemma 7.7. For each
$ K> 1 $
, the finite alphabet subshift
$ \Lambda \cap ( Z_1 \sqcup Z_2 \sqcup \cdots \sqcup Z_{ K } )^{ \mathbb {N} } $
satisfies the specification property.
Proof. Given any two words
$ u $
and
$ w $
of the language of
$ \Lambda \cap ( Z_1 \sqcup Z_2 \sqcup \cdots \sqcup Z_{ K } )^{ \mathbb {N} } $
, we show that there exists a positive integer
$ N = N(K) $
such that these words can be connected by some transitional word
$ v $
of length
$ N $
, meaning that
$ u v w $
belongs to the language of
$ \Lambda \cap ( Z_1 \sqcup Z_2 \sqcup \cdots \sqcup Z_{ K } )^{ \mathbb {N} } $
.
We can decompose
$ u $
and
$ w $
in the following forms:
-
•
$ u = \dot {u} \ddot {u} $
or
$ u = \dot {u} \ddot {u} 1 $
, where
$ \ddot {u} $
is the subword of
$ u $
with largest length whose symbols belong to the same
$ Z_{i} $
for
$ i> 1 $
; -
•
$ w = \ddot {w} \dot {w} $
or
$ w = 1 \ddot {w} \dot {w} $
, where
$ \ddot {w} $
is the subword of
$ w $
with largest length whose symbols belong to the same
$ Z_{j} $
for
$ j> 1 $
.
Thanks to property (C4), each subshift
$ \Lambda \cap Z_{k}^{ \mathbb {N} } $
satisfies the specification property for
$ k> 1 $
. Thus, when
$ u = \dot {u} \ddot {u} $
, there exists a positive integer
$ N_{i} $
such that one can find a word
$ \bar {u} $
, with
$ \ell ( \bar {u} ) = N_{i} $
for which
$ \ddot {u} \bar {u} \max Z_{i} $
belongs to the language of
$ \Lambda \cap Z_{i}^{ \mathbb {N} } $
. Similarly, when
$ w = \ddot {w} \dot {w} $
, there is a word
$ \bar {w} $
, with
$ \ell ( \bar {w} ) = N_{j} $
, for which
$ \min Z_{j} \bar {w} \ddot {w} $
belongs to the language of
$ \Lambda \cap Z_{j}^{ \mathbb {N} } $
.
Therefore, in any situation, we are able to find words
$ \check {v} $
and
$ \hat {v} $
, with lengths in
$ \{ 0, N_{2} + 2, \ldots , N_{K} + 2 \} $
, such that
$ 1 $
is the last symbol of
$ \check {v} $
and the first one of
$ \hat {v} $
, and both words
$ u \check {v} $
and
$ \hat {v} u $
belong to the language of
$ \Lambda \cap ( Z_1 \sqcup Z_2 \sqcup \cdots \sqcup Z_{ K } )^{ \mathbb {N} } $
. Note that words of the form
$ ( 2, \ldots , 2, 3 ) $
, where the symbol
$ 2 $
appears
$ m \geq 1 $
consecutive times, are admissible in this subshift. Hence, we can consider
$ N (K) = 2 \max _{2 \leq i \leq K} N_{i} + 6 $
to conclude the proof.
Corollary 7.8. The set of ergodic probabilities supported on periodic orbits of
$ \Lambda $
is (weak
$^{\ast }$
) dense among the
$ \sigma $
-invariant measures.
Proof. Due to property (C1), we can decompose the shift space
$ \Lambda = \Delta \sqcup \bigsqcup _{k \geq 0} \Lambda _{k} $
, where
$$ \begin{align*} \Delta & = \{(x_0, x_1, \ldots) \in \Lambda : x_n \neq 1 \text{ for all } n \geq 0\} \\ \text{and} \quad \Lambda_{k} & = \{( x_0, x_1, \ldots) \in \Lambda : x_k = 1 \text{ and } x_n \neq 1 \text{ for all } n> k \} \quad \text{for } k \geq 0. \end{align*} $$
First, we will prove that
$ \nu ( \bigsqcup _{k \geq 0} \Lambda _{k} ) = 0 $
for every
$ \nu \in \mathcal {M} ( \Lambda , \sigma ) $
. From the Birkhoff ergodic theorem, we have
, where
is the characteristic function of the set
$ \Lambda _k $
. If
$ x \in \Delta $
, then
for all
$ i \geq 0 $
. Moreover, if
$ x \in \Lambda _l $
, it follows that

Hence, the previous limit is zero everywhere.
It is straightforward from condition (C3) that
$ \Delta $
is a disjoint union of finite alphabet subshifts
$ \Delta _{k} = \Lambda \cap Z_{k}^{ \mathbb {N} } $
for
$ k \geq 2 $
. In particular,
$ \sum _{ k \geq 2 } \nu ( \Delta _{k} ) = \nu ( \Delta ) = \nu ( \Lambda ) = 1 $
for any
$ \nu \in \mathcal {M} ( \Lambda , \sigma ) $
.
Given
$ \delta> 0 $
and a
$ \sigma $
-invariant probability measure
$ \nu $
, there exists some integer
$ K = K ( \nu ,\delta ) $
such that
$ \sum _{ k> K } \nu ( \Delta _{k} ) < \delta $
. Note that, by conditions (C2), (C3), and (C4), the smallest finite alphabet subshift of
$ \Lambda $
that contains the subset
$ \bigsqcup _{ k = 2 }^{ K } \Delta _{k} $
is given as
which verifies
$ \nu ( X_{ \nu , \delta } ) \geq \sum _{ k = 2 }^{K} \nu ( \Delta _{k} ) \geq 1 - \delta $
. Thanks to Lemma 7.7, we conclude the result by applying Proposition 7.6.
Acknowledgements
E.G., J.G., and M.S. were partially supported by AMSUD240026, CAPES MathAmSud 88881.985536/2024-01. E.G. and J.G. were partially supported by ANR-22-CE40-3348 THERMOGAMAS, FAPESP 24/04685-2. J.G. was partially supported by PROEX 88881.973919/2024-01.


