Keywords

1 Introduction

Let \(X\subseteq A^\mathbb {Z}\) be a one-dimensional subshift over a finite symbol set A. A cellular automaton (CA) is a function \(F:X\rightarrow X\) defined by a local rule, and it endows the space X with translation invariant dynamics given by local interactions. It is natural to ask how the structure of the underlying subshift X affects the range of possible topological dynamics that can be achieved by CA on X. Our preferred approach is via the framework of directional dynamics of Sablik [16]. This framework is concerned with the possible space-time diagrams of \(x\in X\) with respect to F, in which successive iterations \(F^t(x)\) are drawn on consecutive rows (see Fig. 1 for a typical space-time diagram of a configuration with respect to the CA which shifts each symbol by one position to the left). Information cannot cross the dashed gray line in the figure so we say that the slope of this line is an almost equicontinuous direction. On the other hand, a slope is called a sensitive direction if information can cross over every line having that slope.

Fig. 1.
figure 1

A space-time diagram of the binary shift map \(\sigma \). White and black squares correspond to digits 0 and 1 respectively. The dashed gray line shows an almost equicontinuous direction.

It has been proven in Theorem 5.2.19 of [7] that every nontrivial mixing sofic subshift admits a reversible CA which is sensitive in all directions. On the other hand, Subsect. 5.4.2 of [7] presents a collection of non-sofic S-gap shifts \(X_S\), all of them synchronizing and many with specification property, such that every reversible CA on \(X_S\) has an almost equicontinuous direction. It would be interesting to extend the latter result to other natural classes of subshifts. The classical study of \(\text {Aut}(X)\), the group of reversible CA on X, is mostly not related to our line of inquiry. However, we highlight the result of [4] saying that is a periodic group if X is a transitive subshift that has subquadratic growth. This implies for such X that every \(F\in \text {Aut}(X)\) has an almost equicontinuous direction.

In this paper we consider two-sided beta-shifts, which form a naturally occurring class of mixing coded subshifts. We show in Theorem 3 that if \(S_\beta \) is a non-sofic beta-shift, then every reversible CA on \(S_\beta \) has an almost equicontinuous direction. As an application we use this result to show in Theorem 4 that non-sofic beta shifts are topologically direct prime, i.e. they are not conjugate to direct topological factorizations \(X\times Y\) of two nontrivial subshifts X and Y, thus answering a problem suggested in the presentation [13].

The proof of Theorem 4 relies on the observation that whenever X and Y are infinite transitive subshifts, then \(X\times Y\) has a very simple reversibe CA with all directions sensitive: it just shifts information into opposite directions in the X and Y components. Therefore the problem of determining whether a given subshift is topologically direct prime is closely related to the study of directional dynamics. In the last section of this paper we suggest a program of studying direct topological factorizations of sofic beta-shifts and accompany this suggestion with some preliminary remarks.

2 Preliminaries

In this section we recall some preliminaries concerning symbolic dynamics and topological dynamics in general. Good references to these topics are [8, 11].

Definition 1

If X is a compact metrizable topological space and \(T:X\rightarrow X\) is a continuous map, we say that (XT) is a (topological) dynamical system.

When there is no risk of confusion, we may identify the dynamical system (XT) with the underlying space or the underlying map, so we may say that X is a dynamical system or that T is a dynamical system.

Definition 2

We write \(\psi :(X,T)\rightarrow (Y,S)\) whenever (XT) and (YS) are dynamical systems and \(\psi :X\rightarrow Y\) is a continuous map such that \(\psi \circ T=S\circ \psi \) (this equality is known as the equivariance condition). Then we say that \(\psi \) is a morphism. If \(\psi \) is injective, we say that \(\psi \) is an embedding. If \(\psi \) is surjective, we say that \(\psi \) is a factor map and that (YS) is a factor of (XT) (via the map \(\psi \)). If \(\psi \) is bijective, we say that \(\psi \) is a conjugacy and that (XT) and (YS) are conjugate (via \(\psi \)).

A finite set A containing at least two elements (letters) is called an alphabet. In this paper the alphabet usually consists of numbers and thus for \(n\in {\mathbb {N}_+}\) we denote \(\varSigma _n=\{0,1,\dots ,n-1\}\). The set \(A^\mathbb {Z}\) of bi-infinite sequences (configurations) over A is called a full shift. The set \(A^\mathbb {N}\) is the set of one-way infinite sequences over A. Formally any \(x\in A^\mathbb {Z}\) (resp. \(x\in A^\mathbb {N}\)) is a function \(\mathbb {Z}\rightarrow A\) (resp. \(\mathbb {N}\rightarrow A\)) and the value of x at \(i\in \mathbb {Z}\) is denoted by x[i]. It contains finite, right-infinite and left-infinite subsequences denoted by \(x[i,j]=x[i]x[i+1]\cdots x[j]\), \(x[i,\infty ]=x[i]x[i+1]\cdots \) and \(x[-\infty ,i]=\cdots x[i-1]x[i]\). Occasionally we signify the symbol at position zero in a configuration x by a dot as follows:

$$\begin{aligned} x=\cdots x[-2]x[-1].x[0]x[1]x[2]\cdots . \end{aligned}$$

A configuration \(x\in A^\mathbb {Z}\) or \(x\in A^\mathbb {N}\) is periodic if there is a \(p\in {\mathbb {N}_+}\) such that \(x[i+p]=x[i]\) for all \(i\in \mathbb {Z}\). Then we may also say that x is p-periodic or that x has period p. We say that x is eventually periodic if there are \(p\in {\mathbb {N}_+}\) and \(i_0\in \mathbb {Z}\) such that \(x[i+p]=x[i]\) holds for all \(i\ge i_0\).

A subword of \(x\in A^\mathbb {Z}\) is any finite sequence x[ij] where \(i,j\in \mathbb {Z}\), and we interpret the sequence to be empty if \(j<i\). Any finite sequence \(w=w[1] w[2]\cdots w[n]\) (also the empty sequence, which is denoted by \(\epsilon \)) where \(w[i]\in A\) is a word over A. Unless we consider a word w as a subword of some configuration, we start indexing the symbols of w from 1 as we have done here. The concatenation of a word or a left-infinite sequence u with a word or a right-infinite sequence v is denoted by uv. A word u is a prefix of a word or a right-infinite sequence x if there is a word or a right-infinite sequence v such that \(x=uv\). Similarly, u is a suffix of a word or a left-infinite sequence x if there is a word or a left-infinite sequence v such that \(x=vu\). The set of all words over A is denoted by \(A^*\), and the set of non-empty words is \(A^+=A^*\setminus \{\epsilon \}\). The set of words of length n is denoted by \(A^n\). For a word \(w\in A^*\), \({\left| w \right| }\) denotes its length, i.e. \({\left| w \right| }=n\iff w\in A^n\). For any word \(w\in A^+\) we denote by \({}^\infty w\) and \(w^\infty \) the left- and right-infinite sequences obtained by infinite repetitions of the word w. We denote by \(w^\mathbb {Z}\in A^\mathbb {Z}\) the configuration defined by \(w^\mathbb {Z}[in,(i+1)n-1]=w\) (where \(n={\left| w \right| }\)) for every \(i\in \mathbb {Z}\).

Any collection of words \(L\subseteq A^*\) is called a language. For any \(S\subseteq A^\mathbb {Z}\) the collection of words appearing as subwords of elements of S is the language of S, denoted by L(S). For \(n\in \mathbb {N}\) we denote \(L^n(S)=L(S)\cap A^n\). For any \(L\subseteq A^*\), the Kleene star of L is

$$\begin{aligned} L^*=\{w_1\cdots w_n\mid n\ge 0,w_i\in L\}\subseteq A^*, \end{aligned}$$

i.e. \(L^*\) is the set of all finite concatenations of elements of L. If \(\epsilon \notin L\), define \(L^+=L^*\setminus \{\epsilon \}\) and if \(\epsilon \in L\), define \(L^+=L^*\).

To consider topological dynamics on subsets of the full shifts, the sets \(A^\mathbb {Z}\) and \(A^\mathbb {N}\) are endowed with the product topology (with respect to the discrete topology on A). These are compact metrizable spaces. The shift map \(\sigma :A^\mathbb {Z}\rightarrow A^\mathbb {Z}\) is defined by \(\sigma (x)[i]=x[i+1]\) for \(x\in A^\mathbb {Z}\), \(i\in \mathbb {Z}\), and it is a homeomorphism. Also in the one-sided case we define \(\sigma :A^\mathbb {N}\rightarrow A^\mathbb {N}\) by \(\sigma (x)[i]=x[i+1]\). Any topologically closed nonempty subset \(X\subseteq A^\mathbb {Z}\) such that \(\sigma (X)=X\) is called a subshift. A subshift X equipped with the map \(\sigma \) forms a dynamical system and the elements of X can also be called points. Any \(w\in L(X)\setminus {\epsilon }\) and \(i\in \mathbb {Z}\) determine a cylinder of X

$$\begin{aligned} \text { Cyl}_X(w,i)=\{x\in X\mid w \text{ occurs } \text{ in } x \text{ at } \text{ position } i\}. \end{aligned}$$

Definition 3

We say that a subshift X is transitive (or irreducible in the terminology of [11]) if for all words \(u,v\in L(X)\) there is \(w\in L(X)\) such that \(uwv\in L(X)\). We say that X is mixing if for all \(u,v\in L(X)\) there is \(N\in \mathbb {N}\) such that for all \(n\ge N\) there is \(w\in L^n(X)\) such that \(uwv\in L(X)\).

Definition 4

Let \(X\subseteq A^\mathbb {Z}\) and \(Y\subseteq B^\mathbb {Z}\) be subshifts. We say that the map \(F:X\rightarrow Y\) is a sliding block code from X to Y (with memory m and anticipation a for integers \(m\le a\)) if there exists a local rule \(f:A^{a-m+1}\rightarrow B\) such that \(F(x)[i]=f(x[i+m],\dots ,x[i],\dots ,x[i+a])\). If \(X=Y\), we say that F is a cellular automaton (CA). If we can choose m and a so that \(-m=a=r\ge 0\), we say that F is a radius-r CA.

Note that both memory and anticipation can be either positive or negative. Note also that if F has memory m and anticipation a with the associated local rule \(f:A^{a-m+1}\rightarrow A\), then F is also a radius-r CA for \(r=\max \{{\left| m \right| },{\left| a \right| }\}\), with possibly a different local rule \(f':A^{2r+1}\rightarrow A\).

The following observation of [6] characterizes sliding block codes as the class of structure preserving transformations between subshifts.

Theorem 1

(Curtis-Hedlund-Lyndon). A map \(F:X\rightarrow Y\) between subshifts X and Y is a morphism between dynamical systems \((X,\sigma )\) and \((Y,\sigma )\) if and only if it is a sliding block code.

Bijective CA are called reversible. It is known that the inverse map of a reversible CA is also a CA. We denote by \(\text {End}(X)\) the monoid of CA on X and by \(\text {Aut}(X)\) the group of reversible CA on X (the binary operation is function composition).

The notions of almost equicontinuity and sensitivity can be defined for general topological dynamical systems. We omit the topological definitions, because for cellular automata on transitive subshifts there are combinatorial characterizations for these notions using blocking words. We present these alternative characterizations below.

Definition 5

Let \(F:X\rightarrow X\) be a radius-r CA and \(w\in L(X)\). We say that w is a blocking word if there is an integer e with \({\left| w \right| }\ge e\ge r+1\) and an integer \(p\in [0,{\left| w \right| }-e]\) such that

$$\begin{aligned} \forall x,y\in {Cyl }_X(w,0), \forall n\in \mathbb {N}, F^n(x)[p,p+e-1]=F^n(y)[p,p+e-1]. \end{aligned}$$

The following is proved in Proposition 2.1 of [16].

Proposition 1

If X is a transitive subshift and \(F:X\rightarrow X\) is a CA, then F is almost equicontinuous if and only if it has a blocking word.

We say that a CA on a transitive subshift is sensitive if it is not almost equicontinuous. The notion of sensitivity is refined by Sablik’s framework of directional dynamics [16].

Definition 6

Let \(F:X\rightarrow X\) be a cellular automaton and let \(p,q\in \mathbb {Z}\) be coprime integers, \(q>0\). Then p/q is a sensitive direction of F if \(\sigma ^p\circ F^q\) is sensitive. Similarly, p/q is an almost equicontinuous direction of F if \(\sigma ^p\circ F^q\) is almost equicontinuous.

As indicated in the introduction, this definition is best understood via the space-time diagram of \(x\in X\) with respect to F, in which successive iterations \(F^t(x)\) are drawn on consecutive rows (see Fig. 1 for a typical space-time diagram of a configuration with respect to the shift map). By definition \(-1=(-1)/1\) is an almost equicontinuous direction of \(\sigma :A^\mathbb {Z}\rightarrow A^\mathbb {Z}\) because \(\sigma ^{-1}\circ \sigma ={{\,\mathrm{Id}\,}}\) is almost equicontinuous. This is directly visible in the space-time diagram of Fig. 1, because it looks like the space-time diagram of the identity map when it is followed along the dashed line. Note that the slope of the dashed line is equal to \(-1\) with respect to the vertical axis extending downwards in the diagram.

The notions of subshifts of finite type (SFT) and sofic subshifts are well known and can be found in Chapters 2 and 3 of [11]. Any square matrix A with nonnegative entries is an adjacency matrix of a directed graph with multiple edges. The set of all bi-infinite sequences of edges forming valid paths is an edge SFT (associated to A), whose alphabet is the set of edges.

Some other classes of subshifts relevant to the study of beta-shifts are the following.

Definition 7

Given a subshift X, we say that a word \(w\in L(X)\) is synchronizing if

$$\begin{aligned} \forall u,v\in L(X): uw,wv\in L(X)\implies uwv\in L(X). \end{aligned}$$

We say that a transitive subshift X is synchronizing if L(X) contains a synchronizing word.

Definition 8

A language \(L\subseteq A^+\) is a code if for all distinct \(u,v\in L\) it holds that u is not a prefix of v. A subshift \(X\subseteq A^\mathbb {Z}\) is a coded subshift (given by a code L) if L(X) is the set of all subwords of elements of \(L^*\).

3 Beta-Shifts

We recall some preliminaries on beta-shifts from Blanchard’s paper [2] and from Lothaire’s book [12].

For \(\xi \in \mathbb {R}\) we denote \(\mathrm {Frac}\left( \xi \right) =\xi -\left\lfloor \xi \right\rfloor \), for example \(\mathrm {Frac}\left( 1.2\right) =0.2\) and \(\mathrm {Frac}\left( 1\right) =0\).

Definition 9

For every real number \(\beta >1\) we define a dynamical system \((\mathbb {I},T_\beta )\), where \(\mathbb {I}=[0,1]\) and \(T_\beta (\xi )=\mathrm {Frac}\left( \beta \xi \right) \) for every \(\xi \in \mathbb {I}\).

Definition 10

The \(\beta \)-expansion of a number \(\xi \in \mathbb {I}\) is the sequence \(d(\xi ,\beta )\in \varSigma _{\left\lfloor \beta \right\rfloor +1}^\mathbb {N}\) where \(d(\xi ,\beta )[i]=\left\lfloor \beta T^{i}(\xi )\right\rfloor \) for \(i\in \mathbb {N}\).

Denote \(d(1,\beta )=d(\beta )\). By this convention \(d(2)=2000\dots \) If \(d(\beta )\) ends in infinitely many zeros, i.e. \(d(\beta )=d_0\cdots d_m 0^\infty \) for \(d_m\ne 0\), we say that \(d(\beta )\) is finite, write \(d(\beta )=d_0\cdots d_m\), and define \(d^*(\beta )=(d_0\cdots (d_m-1))^\infty \). Otherwise we let \(d^*(\beta )=d(\beta )\). Denote by \(D_\beta \) the set of \(\beta \)-expansions of numbers from [0, 1). It is the set of all infinite concatenations of words from the code

$$\begin{aligned} Y_\beta =\{d_0d_1\cdots d_{n-1} b\mid n\in \mathbb {N},0\le b<d_n\} \end{aligned}$$

where \(d(\beta )=d_0d_1d_2\dots \). For example, \(Y_2=\{0,1\}\). Let \(S_\beta \) be the coded subshift given by the code \(Y_\beta \). Since \(S_\beta \) is coded, it also has a natural representation by a deterministic automaton (not necessarily finite) [3, 17]. These representations allow us to make pumping arguments similar to those that occur in the study of sofic shifts and regular languages.

The subshift \(S_\beta \) is mixing. Namely, any \(u,v\in L(S_\beta )\) are subwords of \(u_1\cdots u_n\) and \(v_1\cdots v_m\) respectively for some \(n,m\in {\mathbb {N}_+}\) and \(u_i,v_i\in Y_\beta \). Because the code \(Y_\beta \) always contains the word 0, it follows that \(u_1\cdots u_n 0^i v_1\cdots v_m\in L(S_\beta )\) for all \(i\in \mathbb {N}\) and mixingness follows. The subshift \(S_\beta \) is sofic if and only if \(d(\beta )\) is eventually periodic and it is an SFT if and only if \(d(\beta )\) is finite.

There is a natural lexicographical ordering on \(\varSigma _n^\mathbb {N}\) which we denote by < and \(\le \). Using this we can alternatively characterize \(S_\beta \) as

$$\begin{aligned} S_\beta =\{x\in \varSigma _{\lfloor \beta \rfloor }^\mathbb {Z}\mid x[i,\infty ]\le d^*(\beta ) \text{ for } \text{ all } i\in \mathbb {Z}\}. \end{aligned}$$

We call \(S_\beta \) a beta-shift (with base \(\beta \)). When \(\beta >1\) is an integer, the equality \(S_\beta =\varSigma _\beta ^\mathbb {Z}\) holds.

4 CA Dynamics on Beta-Shifts

In this section we study the topological dynamics of reversible CA on beta-shifts, and more precisely the possibility of them having no almost equicontinuous directions. By Theorem 5.2.19 of [7] every nontrivial mixing sofic subshift admits a reversible CA which is sensitive in all directions, and in particular this holds for mixing sofic beta-shifts. In this section we see that this result does not extend to the class of non-sofic beta-shifts.

We begin with a proposition showing that a CA on a non-sofic beta-shift has to “fix the expansion of one in the preimage” in some sense.

Proposition 2

Let \(\beta >1\) be such that \(S_\beta \) is not sofic, let \(F\in \mathrm{End}(S_\beta )\), let \(x\in S_\beta \) be such that \(x[0,\infty ]=d(\beta )\) and let \(y\in F^{-1}(x)\). Then there is a unique \(i\in \mathbb {Z}\) such that \(y[i,\infty ]=d(\beta )\). Moreover, i does not depend on the choice of x or y.

Proof

Let \(r\in \mathbb {N}\) be such that F is a radius-r CA.

We first claim that i does not depend on the choice of x or y when it exists. To see this, assume to the contrary that for \(j\in \{1,2\}\) there exist \(x_j\in S_\beta \) with \(x_j[0,\infty ]=d(\beta )\), \(y_j\in F^{-1}(x_j)\) and \(i_j\in \mathbb {Z}\) such that \(i_1<i_2\) and \(y_1[i_1,\infty ]=d(\beta )=y_2[i_2,\infty ]\). Then in particular for \(M=\max \{i_2-i_1,i_2\}\) it holds that \(y_2[M,\infty ]=y_2[M-i_2+i_2,\infty ]=y_1[M-i_2+i_1,\infty ]\) and

$$\begin{aligned}&d(\beta )[M-i_2+i_1+r,\infty ]=x_1[M-i_2+i_1+r,\infty ]=F(y_1)[M-i_2+i_1+r,\infty ] \\&= F(y_2)[M+r,\infty ]=x_2[M+r,\infty ]=d(\beta )[M+r,\infty ]. \end{aligned}$$

Then \(d(\beta )\) would be eventually periodic, contradicting the assumption that \(S_\beta \) is not sofic.

For the other claim, let us assume that for some x and y as in the assumption of the proposition there is no \(i\in \mathbb {Z}\) such that \(y[i,\infty ]=d(\beta )\). We claim that the sequence \(y[-r,\infty ]\) can be written as an infinite concatenation of elements of \(Y_\beta \). This concatenation is found inductively. By our assumption \(y[-r,\infty ]<d(\beta )\), so \(y[-r,\infty ]\) has a prefix of the form \(w_1=d_0d_1\cdots d_{n-1}b\in Y_\beta \) for some \(n\in \mathbb {N}\), \(b<d_n\). We can write \(y[-r,\infty ]=w_1x_1\) for some \(x_1\in \varSigma _{\lfloor \beta \rfloor }^\mathbb {N}\). Because \(x_1\) is a suffix of y, then again from our assumption it follows that \(x_1<d(\beta )\) and we can find a \(w_2\in Y_\beta \) which is a prefix of \(x_1\). For all \(i\in \mathbb {Z}\) we similarly we find \(w_i\in Y_\beta \) such that \(y[-r,\infty ]=w_1w_2w_3\dots \).

Let \(r_i\) be such that \(y[-r,r_i]=w_1\cdots w_i\) for all \(i\in \mathbb {N}\). Fix some \(j,k\in \mathbb {N}\) such that \(0\le r_j<r_k\), \({\left| r_k-r_j \right| }\ge 2r\) and \(y[r_j-r,r_j+r]=y[r_k-r,r_k+r]\). Because x is not eventually periodic, it follows that \(x[r_j+1,\infty ]\ne x[r_k+1,\infty ]\).

Assume first that \(x[r_j+1,\infty ]<x[r_k+1,\infty ]\). Because \(S_\beta \) is coded, there is a configuration \(z\in S_{\beta }\) such that \(z[-r,\infty ]=w_1\cdots w_{j}w_{k+1}w_{k+2}\cdots \), i.e. this suffix can be found by removing the word \(w_{j+1}\cdots w_k\) from the middle of \(y[-r,\infty ]\). Then \(F(z)\in S_\beta \) but \(F(z)[0,\infty ]=x[0,r_j]x[r_k+1,\infty ]>x[0,r_j]x[r_j+1,\infty ]=d(\beta )\) contradicting one of the characterizations of \(S_\beta \).

Assume then alternatively that \(x[r_j+1,\infty ]>x[r_k+1,\infty ]\). Because \(S_\beta \) is coded, there is a configuration \(z\in S_{\beta }\) such that

$$\begin{aligned} z[-r,\infty ]=w_1\cdots w_{j}(w_{j+1}\cdots w_k)(w_{j+1}\cdots w_k)w_{k+1}w_{k+2}\cdots , \end{aligned}$$

i.e. this suffix can be found by repeating the occurrence of the word \(w_{j+1}\cdots w_k\) in the middle of \(y[-r,\infty ]\). Then \(F(z)\in S_\beta \) but

$$\begin{aligned}&F(z)[0,\infty ]=x[0,r_j]x[r_j+1,r_k]x[r_j+1,r_k]x[r_k+1,\infty ] \\&=x[0,r_j]x[r_j+1,r_k]x[r_j+1,\infty ] >x[0,r_j]x[r_j+1,r_k]x[r_k+1,\infty ]=d(\beta ) \end{aligned}$$

contradicting again the characterization of \(S_\beta \).    \(\square \)

To apply the previous proposition for a non-sofic \(S_\beta \) and \(F\in \text {End}(S_\beta )\), there must exist at least some \(x,y\in S_\beta \) such that \(x[0,\infty ]=d(\beta )\) and \(y\in F^{-1}(x)\). This happens at least when F is surjective, in which case we take the number \(i\in \mathbb {Z}\) of the previous proposition and say that the intrinsic shift of F is equal to i. If the intrinsic shift of F is equal to 0, we say that F is shiftless.

In the class of non-synchronizing beta-shifts we get a very strong result on surjective CA: they are all shift maps.

Theorem 2

If \(S_\beta \) is not synchronizing, then all surjective CA in \(End (S_\beta )\) are powers of the shift map.

Proof

Let \(F\in \text {End}(S_\beta )\) be an arbitrary surjective CA and let \(r\in \mathbb {N}\) be some radius of F. We may assume without loss of generality (by composing F with a suitable power of the shift if necessary) that F is shiftless. We prove that \(F={{\,\mathrm{Id}\,}}\).

Assume to the contrary that \(F\ne {{\,\mathrm{Id}\,}}\), so there is \(x\in S_\beta \) such that \(F(x)[0]\ne x[0]\). Let \(e={}^\infty 0.d(\beta )\) and let \(z\in F^{-1}(e)\) be arbitrary, so in particular \(z[0,\infty ]=d(\beta )\) by Proposition 2. Since \(S_\beta \) is not synchronizing, it follows that every word of \(L(S_\beta )\) occurs in \(d(\beta )\) (as explained by Kwietniak in [9], attributed to Bertrand-Mathis [1]). In particular it is possible to choose \(i\ge r+1\) such that \(\sigma ^i(z)[-r,r]=x[-r,r]\) and \(F(x)[0]=F(\sigma ^i(z))[0]=\sigma ^i(z)[0]=x[0]\), a contradiction.    \(\square \)

Next we consider only reversible CA. They do not have to be shift maps in the class of general non-sofic beta-shifts, and in fact the group \(\text {Aut}(X)\) contains a copy of the free product of all finite groups whenever X is an infinite synchronizing subshift by Theorem 2.17 of [5]. Nevertheless \(\text {Aut}(S_\beta )\) is constrained in the sense of directional dynamics.

Theorem 3

If \(S_\beta \) is not sofic and \(F\in \mathrm{Aut}(S_\beta )\) is shiftless then F admits a blocking word. In particular all elements of \(\mathrm{Aut}(S_\beta )\) have an almost equicontinuous direction.

Proof

Let \(r\in {\mathbb {N}_+}\) be a radius of both F and \(F^{-1}\). Since \(d(\beta )\) is not eventually periodic, it is easy to see (and is one formulation of the Morse-Hedlund theorem, see e.g. Theorem 7.3 of [15]) that there is a word \(u\in \varSigma _{\lfloor \beta \rfloor }^{3r}\) and symbols \(a<b\) such that both ua and ub are subwords of \(d(\beta )\). Let \(p=p'ub\) (\(p,p'\in L(S_\beta )\)) be some prefix of \(d(\beta )\) ending in ub. We claim that p is blocking. More precisely we will show that if \(x\in S_\beta \) is such that \(x[0,{\left| p \right| }-1]=p\) then \(F^t(x)[0,{\left| p \right| }-2]=p'u\) for all \(t\in \mathbb {N}\).

Assume to the contrary that \(t\in \mathbb {N}\) is the minimal number for which we have \(F^t(x)[0,{\left| p \right| }-2]\ne p'u\). We can find \(w,v,v'\in L(S_\beta )\) and \(c,d\in \varSigma _{\lfloor \beta \rfloor }\) (\(c<d\)) so that \(u=wdv\), \({\left| w \right| }\ge 2r\) and \(F^t(x)[0,{\left| p \right| }-2]=p'wcv'\). Indeed, \(F^{-1}\) is shiftless because F is, and therefore the prefix \(p'w\) still remains unchanged in \(F^t(x)[0,\infty ]\).

Now we note that x could have been chosen so that some of its suffixes is equal to \(0^\infty \) and in particular under this choice no suffix of \(F^t(x)\) is equal to \(d(\beta )\). As in the proof of Proposition 2 we can represent \(F^t(x)[0,\infty ]=w_1w_2w_3\dots \) where \(w_i\in Y_\beta \) for all \(i\in \mathbb {N}\) and in fact \(w_1=p'wc\).

Now let \(q=q'ua\) (\(q,q'\in L(S_\beta )\)) be some prefix of \(d(\beta )\) ending in ua. Then also \(q'wd\) is a prefix of \(d(\beta )\) and thus \(q'wc\in Y_\beta \). Because \(S_\beta \) is a coded subshift, there is a configuration \(y\in S_\beta \) such that \(y[0,\infty ]=(q'wc)w_2w_3\dots \). For such y it holds that \(F^{-t}(y)\in S_\beta \) but \(F^{-t}(y)[0,\infty ]=q'(ub) x[{\left| p \right| },\infty ]>d(\beta )\) contradicting the characterization of \(S_\beta \).    \(\square \)

5 Topological Direct Primeness of Beta-Shifts

We recall the terminology of Meyerovitch [14]. A direct topological factorization of a subshift X is a subshift \(X_1\times \cdots \times X_n\) which is conjugate to X and where each \(X_i\) is a subshift. We also say that each subshift \(X_i\) is a direct factor of X. The subshift X is topologically direct prime if it does not admit a non-trivial direct factorization, i.e. if every direct factorization contains one copy of X and the other \(X_i\) in the factorization contain just one point.

Non-sofic \(\beta \)-shifts turn out to be examples of topologically direct prime dynamical systems. This is an application of Theorem 3.

Theorem 4

If \(S_\beta \) is a non-sofic beta-shift then it is topologically direct prime.

Proof

Assume to the contrary that there is a topological conjugacy \(\phi :S_\beta \rightarrow X\times Y\) where X and Y are non-trivial direct factors of \(S_\beta \). The subshifts X and Y are mixing as factors of the mixing subshift \(S_\beta \), and in particular both of them are infinite, because a mixing finite subshift can only contain one point.

Define a reversible CA \(F:X\times Y\rightarrow X\times Y\) by \(F(x,y)=(\sigma (x),\sigma ^{-1}(y))\) for all \(x\in X\), \(y\in Y\). Because X and Y are infinite, it follows that F has no almost equicontinuous directions, i.e. \(\sigma ^r\circ F^s\) is sensitive for all coprime r and s such that \(s>0\). Then define \(G=\phi ^{-1}\circ F\circ \phi : S_\beta \rightarrow S_\beta \). The map G is a reversible CA on \(S_\beta \) and furthermore \((S_\beta ,G)\) and \((X\times Y,F)\) are conjugate via the map \(\phi \). By Theorem 3 the CA G has an almost equicontinuous direction, so we can fix coprime r and s such that \(s>0\) for which \(\sigma ^r\circ G^s\) is almost equicontinuous. But \(\sigma ^r\circ G^s\) is conjugate to \(\sigma ^r\circ F^s\) via the map \(\phi \), so \(\sigma ^r\circ F^s\) is also almost equicontinuous, a contradiction.    \(\square \)

In general determining whether a given subshift is topologically direct prime or not seems to be a difficult problem. Lind gives sufficient conditions in [10] for SFTs based on their entropies: for example any mixing SFT with entropy \(\log p\) for a prime number p is topologically direct prime. The paper [14] contains results on multidimensional full shifts, multidimensional 3-colored chessboard shifts and p-Dyck shifts with p a prime number.

In the class of beta-shifts the question of topological direct primeness remains open in a countable number of cases.

Problem 1

Characterize the topologically direct prime sofic beta-shifts.

Example 1

If \(\beta >1\) is an integer, then \(S_\beta =\varSigma _\beta ^\mathbb {Z}\) is topologically direct prime if and only if \(\beta \) is a prime number. Namely, if \(\beta =nm\) for integers \(n,m\ge 2\), then \(S_\beta \) is easily seen to be conjugate to \(S_n\times S_m\) via a coordinatewise symbol permutation. The case when \(\beta =p\) is a prime number follows by Lind’s result because the entropy of \(\varSigma _p^\mathbb {Z}\) is \(\log p\).

In this example the existence of a direct factorization is characterized by the existence of direct factorization into beta shifts with integral base. Therefore, considering the following problem might be a good point to start with Problem 1.

Problem 2

Characterize the numbers \(n,\gamma >1\) such that n is an integer and \(S_{n\gamma }\) is conjugate to \(S_n\times S_{\gamma }\).

In Theorem 5 we consider this simpler problem in the SFT case. We start with a definition and a lemma stated in [2].

Definition 11

Let \(n>1\) be an integer, \(a\in \varSigma _n\) and \(w\in \varSigma _n^*\). We say that aw is lexicographically greater than all its shifts if \(aw0^\infty >\sigma ^i(aw0^\infty )\) for every \(i>0\).

Lemma 1

(Blanchard, [2]). \(S_\beta \) is an SFT if and only if \(\beta >1\) is the unique positive solution of some equation \(x^d=a_{d-1}x^{d-1}+\dots +a_0\) where \(d\ge 1\), \(a_{d-1},a_0\ge 1\) and \(a_i\in \mathbb {N}\) such that \(a_{d-1}\cdots a_{0}\) is lexicographically greater than all its shifts. Then \(d(\beta )=a_{d-1}\cdots a_0\).

Proof

The polynomial equation is equivalent to \(1=a_{d-1}x^{-1}+\cdots +a_0 x^{-d}\), which clearly has a unique positive solution. If \(\beta \) satisfies such an equation then \(d(\beta )=a_{d-1}\cdots a_0\) and \(S_\beta \) is an SFT. On the other hand, if \(S_\beta \) is an SFT, then \(d(\beta )\) takes the form of a word \(a_{d-1}\cdots a_0\) which is lexicographically greater than all its shifts and \(\beta \) satisfies \(1=a_{d-1}x^{-1}+\cdots +a_0 x^{-d}\).    \(\square \)

For the following we also recall some facts on zeta functions. The zeta function \(\zeta _X(t)\) of a subshift X is a formal power series that encodes information about the number of periodic configurations in X and it is a conjugacy invariant of X (for precise definitions see Section 6.4 of [11]). Every SFT X is conjugate to an edge SFT associated to a square matrix A. Let I be an index set and let \(\{\mu _i\in \mathbb {C}\setminus \{0\}\mid i\in I\}\) be the collection of non-zero eigenvalues of A with multiplicities: it is called the non-zero spectrum of X. It is known that then \(\zeta _X(t)=\prod _{i\in I}(1-\mu _i t)^{-1}\). The number of p-periodic configurations in X is equal to \(\sum _{i\in I}\mu _i^p\) for \(p\in {\mathbb {N}_+}\). If Y is also an SFT with \(\zeta _Y(t)=\prod _{j\in J}(1-\nu _j t)^{-1}\), then the zeta function of \(X\times Y\) is \(\zeta _X(t)\otimes \zeta _Y(t)=\prod _{i\in I, j\in J}(1-\mu _i\nu _j t)^{-1}\) [10].

Theorem 5

Let \(S_\gamma \) be an SFT with \(\gamma \) the unique positive solution of some equation \(x^d=a_{d-1}x^{d-1}+\dots +a_0\) where \(d\ge 1\), \(a_{d-1},a_0\ge 1\) and \(a_i\ge 0\) such that \(a_{d-1}\cdots a_{0}\) is lexicographically greater than all its shifts. If \(n\ge 2\) is an integer such that also \((na_{d-1})\cdots (n^d a_0)\) is lexicographically greater than all its shifts, then \(S_{n\gamma }\) is topologically conjugate to \(S_n\times S_\gamma \). The converse also holds: if \((na_{d-1})\cdots (n^d a_0)\) is not lexicographically greater than all its shifts, then either \(S_{n\gamma }\) is not an SFT or \(S_{n\gamma }\) and \(S_n\times S_\gamma \) have different zeta functions. In particular they are not conjugate.

Proof

We have \(d(\gamma )=a_{d-1}\cdots a_0\). The roots of \(x^d=a_{d-1}x^{d-1}+\dots +a_0\) are \(\gamma _1=\gamma ,\gamma _2,\dots ,\gamma _d\). By multiplying both sides by \(n^d\) and by substituting \(y=nx\) we see that the roots of \(y^d=na_{d-1}y^{d-1}+\cdots +n^d a_0\) are \(n\gamma _i\) and \(n\gamma \) is the unique positive solution. Because multiplying \(\prod _i(x-\gamma _i)=0\) by \(n^d\) yields \(\prod _i(y-n\gamma _i)=0\), we also see that the multiplicities of the roots \(\gamma _i\) and \(n\gamma _i\) are the same in their respective equations. If \((na_{d-1})\cdots (n^d a_0)\) is lexicographically greater than all its shifts, then \(S_{n\gamma }\) is an SFT with \(d(n\gamma )=na_{d-1}\cdots n^d a_0\). As in [17], the shifts \(S_\gamma \) and \(S_{n\gamma }\) are conjugate to the edge shifts \(X_C\) and \(X_B\) respectively given by the matrices

$$C=\left( \begin{matrix} a_{d-1} &{} 1 &{} 0 &{} \cdots &{} 0 \\ a_{d-2} &{} 0 &{} 1 &{} \cdots &{} 0 \\ \vdots &{} \vdots &{} \vdots &{} &{} \vdots \\ a_0 &{} 0 &{} 0 &{} \cdots &{} 0\end{matrix}\right) \qquad B=\left( \begin{matrix} na_{d-1} &{} 1 &{} 0 &{} \cdots &{} 0 \\ n^2 a_{d-2} &{} 0 &{} 1 &{} \cdots &{} 0 \\ \vdots &{} \vdots &{} \vdots &{} &{} \vdots \\ n^d a_0 &{} 0 &{} 0 &{} \cdots &{} 0\end{matrix}\right) . $$

They are also the companion matrices of the polynomials \(x^d-a_{d-1}x^{d-1}-\dots -a_0\) and \(y^d-na_{d-1}y^{d-1}-\cdots -n^d a_0\), so the eigenvalues are the roots of these polynomials and the zeta functions of \(S_\gamma \) and \(S_{n\gamma }\) are

$$\begin{aligned} \zeta _{X_C}(t)=\prod _i(1-\gamma _i t)^{-1} \quad \text{ and } \quad \zeta _{X_B}(t)=\prod _i(1-n\gamma _i t)^{-1}. \end{aligned}$$

In any case \(\zeta _{S_n}=(1-nt)^{-1}\), so the zeta function of \(X=S_n\times S_\gamma \) is \(\zeta _X(t)=\prod _i(1-n\gamma _i t)^{-1}\), which is equal to \(\zeta _{X_B}\).

We will construct a conjugacy between \(S_n\times X_C\) and \(X_B\). We will choose the labels of the edges in \(X_C\) and \(X_B\) as in Figs. 2 and 3. The labels in the figures range according to \(0\le i_j<n\) and \(0\le k_j<a_{d-j}\) for \(1\le j\le d\).

Fig. 2.
figure 2

The choice of labels for the graph of \(X_C\).

Fig. 3.
figure 3

The choice of labels for the graph of \(X_B\).

The labeling has been chosen in a way that suggests the correct choice of the reversible sliding block code \(\phi :S_n\times X_C\rightarrow X_B\). For any \(x\in S_n\times X_C\) we make the usual identification \(x=(x_1,x_2)\) where \(x_1\in S_n\), \(x_2\in X_C\) and we denote \(\pi _1(x)=x_1\), \(\pi _2(x)=x_2\). Then \(\phi \) is defined by

$$\begin{aligned} \phi (x)[i]= \left\{ \begin{array}{l} *_j\text { when } \pi _2(x)[i]=*_j, \\ (i_1,k_1)\text { when } \pi _2(x)[i]=(1,k_1) \text{ and } \pi _1(x)[i]=i_1, \\ (i_1,i_2,\dots ,i_j,k_j) \text{ when } \pi _2(x)[i-(j-1),i]=*_1*_2\cdots *_{j-1}(j,k_j) \\ \qquad \text{ and } \pi _1(x)[i-(j-1),i]=i_1 i_2\cdots i_j \end{array} \right. \end{aligned}$$

The intuition here is that the sliding block code \(\phi \) attempts to deposit all the information at x[i] to \(\phi (x)[i]\). This is not possible when \(\pi _2(x)[x]=*_j\), so the remaining information is deposited to the nearest suitable coordinate to the right.

For the converse, assume that the word \((na_{d-1})\cdots (n^d a_0)\) is not lexicographically greater than all its shifts and that \(S_{n\gamma }\) is an SFT. Then \(n\gamma \) is the unique positive solution of some equation \(x^e=b_{e-1}x^{e-1}+\dots +b_0\) where \(e\ge 1\), \(b_{e-1},b_0\ge 1\) and \(b_i\ge 0\) such that \(b_{e-1}\cdots b_{0}\) is lexicographically greater than all its shifts. As above, \(S_{n\gamma }\) is conjugate to an edge shift Y given by a matrix with eigenvalues \(\beta _1,\beta _2,\dots ,\beta _e\) which are also the roots of the corresponding polynomial. By our assumption the polynomials \(x^e-b_{e-1}x^{e-1}-\dots -b_0\) and \(y^d-na_{d-1}y^{d-1}-\cdots -n^d a_0\) are different, so they have different sets of roots (with multiplicities taken into account) and

$$\begin{aligned} \zeta _Y(t)=\prod _j(1-\beta _i t)^{-1}\ne \prod _i(1-n\gamma _i t)^{-1}=\zeta _X(t), \end{aligned}$$

because \(\mathbb {C}[t]\) is a unique factorization domain.    \(\square \)

We conclude with an example concerning an SFT beta-shift \(S_{\beta _1\times \beta _2}\) where the assumption of either \(\beta _1\) or \(\beta _2\) being an integer is dropped.

Example 2

A beta-shift \(S_{\gamma \times \gamma }\) can be topologically direct prime even if \(S_{\gamma }\) and \(S_{\gamma \times \gamma }\) are SFTs (and then in particular \(S_{\gamma \times \gamma }\) is not conjugate to \(S_{\gamma }\times S_{\gamma }\)). Denote by \(\gamma \) the unique positive root of \(x^3-x^2-x-1\). By Lemma 1 we have \(d(\gamma )=111\) and in particular \(S_\gamma \) is an SFT. Denote \(\beta =\gamma ^2\). Its minimal polynomial is \(x^3-3x^2-x-1\) and by Lemma 1 \(d(\beta )=311\), so \(S_\beta \) is an SFT and it is conjugate to the edge SFT given by the matrix \(A=\left( {\begin{matrix} 3 &{} 1 &{} 0 \\ 1 &{} 0 &{} 1 \\ 1 &{} 0 &{} 0 \end{matrix}}\right) \). It has three distinct eigenvalues \(\beta _0=\beta \), \(\beta _1\) and \(\beta _2\).

We claim that \(S_\beta \) is topologically direct prime. To see this, assume to the contrary that \(S_\beta \) is topologically conjugate to \(X\times Y\) where X and Y are nontrivial direct factors for \(S_\beta \). Since \(X\times Y\) is a mixing SFT, it follows from Proposition 6 of [10] that X and Y are mixing SFTs and in particular they are infinite. The zeta functions of X and Y are of the form

$$\begin{aligned} \zeta _X(t)=\prod _i(1-\mu _i t)^{-1} \quad \text{ and } \quad \zeta _Y(t)=\prod _j(1-\nu _j t)^{-1} \end{aligned}$$

for some \(\mu _i,\nu _j\in \mathbb {C}\setminus \{0\}\). The zeta function of \(S_\beta \) is

$$\begin{aligned} \zeta _{S_\beta }(t)=(1-\beta t)^{-1}(1-\beta _1 t)^{-1}(1-\beta _2 t)^{-1}=\prod _{i,j}(1-\mu _i\nu _j t)^{-1}. \end{aligned}$$

Because \(\mathbb {C}[t]\) is a unique factorization domain and because X and Y are non-trivial SFTs, we may assume without loss of generality that \(\zeta _X(t)=(1-\mu t)\) and \(\zeta _Y(t)=(1-\nu _1 t)(1-\nu _2 t)(1-\nu _3 t)\) for some \(\mu ,\nu _1,\nu _2,\nu _3\in \mathbb {C}\setminus \{0\}\). The quantities \(\mu \) and \(\nu _1+\nu _2+\nu _3\) are the numbers of 1-periodic points of X and Y respectively and thus the number of 1-periodic points of \(S_\beta \) is equal to \(\mu (\nu _1+\nu _2+\nu _3)=3\) where \(\mu \) and \(\nu _1+\nu _2+\nu _3\) are nonnegative integers. In particular \(\mu \in \{1,3\}\).

Assume first that \(\mu =1\). Therefore X has the same zeta function as the full shift over the one letter alphabet and X has just one periodic point. As a mixing SFT X has periodic points dense so X only contains one point, contradicting the nontriviality of X.

Assume then that \(\mu =3\). Therefore X has the same zeta function as \(\varSigma _3^\mathbb {Z}\) and X has precisely \(3^n\) n-periodic points for all \(n\in {\mathbb {N}_+}\). In particular the number of 2-periodic points of \(S_\beta \) is divisible by \(3^2=9\). On the other hand the number of 2-periodic points of \(S_\beta \) is equal to \({{\,\mathrm{Tr}\,}}(A^2)=11\), a contradiction.