Abstract
We study global-local mixing for a family of accessible skew products with an exponentially mixing base and non-compact fibers, preserving an infinite measure. For a dense set of almost periodic global observables, we prove rapid mixing, and for a dense set of global observables vanishing at infinity, we prove polynomial mixing. More generally, we relate the speed of mixing to the “low frequency behavior” of the spectral measure associated to our global observables. Our strategy relies on a careful choice of the spaces of observables and on the study of a family of twisted transfer operators.
Similar content being viewed by others
1 Introduction
Dynamical systems within the category of skew products have a long history. They receive attention for many different reasons: in earlier ergodic theory, they were studied as mild generalizations of suspensions which cannot be factored [17, Chapter 10], providing examples of simple partially hyperbolic systems; in recent days, they are often used to model real-life situations where fast-slow dynamics can be observed, such as in the study of climate [4, 25]. As their name suggests, they are products of a base dynamics and a fiber dynamics. Here, we are concerned with the statistical properties of these systems.
One of the simplest examples of a partially hyperbolic skew product is given by circle extensions of Anosov diffeomorphisms on the two-dimensional torus \({\mathbb {T}}^2\), defined as follows. Given an Anosov diffeomorphism \(A:{\mathbb {T}}^2 \rightarrow {\mathbb {T}}^2\) and a smooth map \(f :{\mathbb {T}}^2 \rightarrow {\mathbb {T}}\), a skew product \(F :{\mathbb {T}}^3 \rightarrow {\mathbb {T}}^3\) over A induced by f is defined by \(F(x,r)= (Ax, r+f(x))\). The torus \({\mathbb {T}}^3\) is equipped with a product measure \(\mu _u \times \text {Leb}\), where \(\mu _u\) is any Gibbs measure with Hölder potential u and \(\text {Leb}\) is the Lebesgue measure on the fibersFootnote 1. In this case, Dolgopyat [18] proved that generic functions f induce skew products F with rapid decay of correlations, or rapid mixing, i.e., decorrelation of \(\mathscr {C}^{\infty }\)-observables is faster than any given polynomial. The speed of mixing may, in fact, be exponential, but this is still an open problem ([18, Problem 2]). Dolgopyat’s result holds in general for compact group extensions. The interested reader can also check the introductions of [11, 24, 41] for an overview of old and new results on skew products.
In this paper, we are interested in skew products with non-compact fibers. In particular, we will consider \({\mathbb {R}}\)-extensions of topologically mixing Anosov diffeomorphisms via their symbolic counterparts. For an introduction to infinite ergodic theory, we refer the reader to Aaronson’s book [1]. In our setting, Guivarc’h showed that any Hölder function f with zero integral which is not cohomologous to a constant induces an ergodic skew product [28] (see also [16]).
Concerning stronger statistical properties, an historical perspective on the various possible definitions of mixing in the infinite ergodic setting can be found in [34]. We will be interested in global-local mixing, a notion introduced by Lenci in [34], namely we will study the correlations between global and local observables (see Definition 2.1). Local observables are akin to compactly supported observables, while global observables are supported over most of the of phase space. One possible concern about the notion of global-local mixing may be the seemingly arbitrary choice of the averaging involved in the definition of global observables. In our setting, the infinite volume average is analogous to statistical infinite volume limits introduced and refined along the years by Van Hove, Fisher [37, Section 3.3] and Ruelle [46, Section 3.9], which built on the inspirational work of Bogoliubov [6]. Global-local mixing has been studied in different situations, for example random walks [34, 35], mechanical systems [19, 20] and one-dimensional parabolic systems [12].
The main ingredient needed to study our class of skew product is accessibility (see, among others, [13, 14, 47]). The skew product F is accessible if, roughly speaking, it is possible to reach any point in the space by moving along segments of stable and unstable manifolds.
From the measure-theoretic point of view, using Markov partitions, we can translate the problem to study a skew product over a subshift of finite type, keeping the same \({\mathbb {R}}\)-fibers. The question whether accessibility is preserved passing to symbolic dynamics appears to be delicate, and will be discussed in “Appendix A.” Our main result, Theorem 3.9, provides quantitative estimates for the decay of correlations of global and local observables for an accessible \({\mathbb {R}}\)-extension of a symbolic shift. To the best of our knowledge, this is the first quantitative result in the context of global-local mixing.
Contrary to the case of compact group extensions, we cannot expect exponential mixing in general, since, taking Fourier transforms, we have to deal with arbitrary low frequencies. Indeed, we will show in Theorem 3.9 that the speed of convergence of correlations depends on the behavior near zero of the spectral measure associated to the global observable (namely, its inverse Fourier–Stieltjes transform): if the support of this measure intersects a neighborhood of 0 only at 0, then mixing is rapid (Theorem 2.2) as we expect from Dolgopyat’s result; in other cases we obtain polynomial estimates (Theorems 2.3, 2.4), which correspond to the expected behavior (see Remark 2.5). Note that since the infinite volume average equals the value of the associated spectral measure at zero, our choice of infinite volume average is natural.
The main novelties of the work rely on a careful choice of the functional spaces involved. The accessibility hypothesis, when coupled with a standard central limit theorem for the underlying symbolic dynamics on the basis, allows for transfer operator bounds on suitable splitting of lower and higher frequency modes and the exploiting of cancellation effects due to accessibility.
1.1 Krickeberg Mixing
Another independent notion of mixing for infinite measure preserving transformation is known as Krickeberg (or local) mixing. An infinite measure preserving system \((X, \mu , T)\) is said to be Krickeberg mixing if there exists a sequence of positive numbers \(\rho _n \rightarrow \infty \) such that, for any pair of “nice” finite measure sets A, B (precisely, bounded sets whose boundary has zero measure), the rescaled correlations converge, that is
The first example of a system satisfying this property dates back to Hopf in 1937 [31], but it was overlooked for several years, until the 60s when Krickeberg proposed (1) as a definition of infinite measure mixing [33]. Since then, it has received considerable attention and Krickeberg mixing has been proved in several situations, see, e.g., [21, 27, 38,39,40, 42] to name a few.
In the language of this paper, Krickeberg mixing can be seen as a remarkably strong form of “local-local mixing”: the correlations of local observables converge to zero and the first-order term is the same up to a constant for any two sufficiently smooth compactly supported functions. On the other hand, global-local mixing is a softer property, which however gives information on the correlations for a much wider class of observables, in particular for those which are not supported on a compact subset but “see” the whole phase space.
Exploiting some variations on the methods developed in this paper, we are able to establish a strong form of Krickeberg mixing for the accessible skew products we consider here, namely we can prove a full asymptotic expansion of the correlations of Schwartz observables, in the spirit of the main theorem of [22]. This result will appear in a separate paper.
1.2 Outline of the Paper
The rest of the paper is organized as follows. In Sect. 2, we rigorously introduce our framework and state our main results. In Sect. 3, we describe in detail the classes of global and local observables we consider and we state our core result, Theorem 3.9. We then deduce Theorems 2.2, 2.3 and 2.4 from Theorem 3.9.
In Sect. 4, we present a preliminary result in the non-invertible case of skew products over one-sided subshifts, Theorem 4.5. We also describe a “collapsed accessibility” property, which constitutes the main working assumption on the skew product in this setting.
The main tool to prove Theorem 4.5 is a family of twisted transfer operators. In Sect. 5, we show how the collapsed accessibility property can be exploited to obtain some cancellations in the expression for the twisted transfer operators, as in the work of Dolgopyat [18]. In Sect. 6, we prove some estimates on the norm of the twisted transfer operators. For large twisting parameters, the estimates are obtained exploiting the results in Sect. 5; for small parameters, we apply some standard results in the theory of analytic perturbations of bounded linear operators.
Section 7 contains some technical results that will be applied to prove the main theorems. Section 8 is devoted to the proof of Theorem 4.5. In order to deduce Theorem 3.9 from Theorem 4.5, in Sect. 9 we deduce the collapsed accessibility property for a one-sided skew product from the accessibility property of the corresponding two-sided skew product. In Sect. 10, we prove Theorem 3.9.
In “Appendix A,” we discuss the problem whether the accessibility property for a skew product over an Anosov diffeomorphism is equivalent to the accessibility of the associated symbolic system. Appendices B and C contain the proofs of several technical results.
2 Setup and Main Results
Let \(\sigma :\Sigma \rightarrow \Sigma \) be a topologically mixing two-sided subshift of finite type, equipped with a Gibbs measure \(\mu = \mu _u\) with respect to a Hölder potential u (see, e.g., [10, §1] or [41, §3]). Explicitly, for \( x \in \Sigma \), \(x= \{ x_i \}_{i \in \mathbb {Z}}\), we have \((\sigma x)_i = x_{i+1}\). For \(0<\theta < 1\), define the distance
Let us denote by \(\mathscr {F}_{\theta }\) the space of Lipschitz continuous functions \(w:\Sigma \rightarrow {\mathbb {C}}\) equipped with the norm
We consider the skew product
where \(f:\Sigma \rightarrow {\mathbb {R}}\) is a Lipschitz continuous function with zero average, \(\int _\Sigma f \mathop {}\!\mathrm {d}\mu _u = 0\).
As mentioned in the introduction, we will assume that F is accessible: roughly speaking, this means that any two points can be connected by a path consisting of pieces of stable and unstable manifolds; see Sect. 3 for the precise definition.
We are interested in the mixing properties of the map F with respect to the infinite measure \(\nu =\mu \times {{\,\mathrm{Leb}\,}}\), where \({{\,\mathrm{Leb}\,}}\) is the Lebesgue measure on \({\mathbb {R}}\).
Definition 2.1
A local observable is any function \(\psi \in L^1(\nu )\). A global observable is any function \(\Phi \in L^{\infty }(\nu )\) such that the following limit exists
If \(\psi \) is a local observable, we will write \(\nu (\psi ) = \int _{\Sigma \times {\mathbb {R}}} \psi \mathop {}\!\mathrm {d}\nu \). We will show in Lemma 3.2 that if \(\Phi \) is a global observable, then so is \(\Phi \circ F\), and the average \(\nu _{{{\,\mathrm{av}\,}}}\) defined in (4) is invariant under F.
For any pair of global and local observables \((\Phi ,\psi )\), let us denote by \({{\,\mathrm{cov}\,}}(\Phi , \psi )\) the covariance
We are interested in showing “global-local mixing,” namely in proving that the correlations \({{\,\mathrm{cov}\,}}(\Phi \circ F^n, \psi )\) satisfy
and in studying the rate of convergence to such limit (also known as the rate of decay of correlations).
The main result of this paper, Theorem 3.9, establishes quantitative global-local mixing estimates. Since some preliminary work is needed, the statement of the main theorem is postponed to Sect. 3. We state here some corollaries which should give the reader a rather complete picture of the possible scenarios. Theorem 2.2 states that, for a dense class of almost periodicFootnote 2 global observables, we have rapid mixing, namely the decay of correlations is faster than any given polynomial, in analogy to Dolgopyat’s result [18] in the case of circle extensions. On the other hand, for a dense class of global observables which vanish at infinity, Theorems 2.3 and 2.4 state that the decay is polynomial. The bound in Theorem 2.4 is generically optimal, as we show in §3.6.
Let us fix some notation. Let \(\mathscr {C}^k({\mathbb {R}})\) be the space of k-times differentiable functions on \({\mathbb {R}}\). We will denote by \(\mathscr {P}\) the subspace of \(\mathscr {C}^2({\mathbb {R}})\) consisting of \(2\pi \)-periodic functions; by \(\mathscr {C}_0({\mathbb {R}})\) the space of continuous functions on \({\mathbb {R}}\) which vanish at infinity, and by \(\mathscr {C}^\infty _c({\mathbb {R}})\) the subspace of infinitely differentiable functions with compact support. The space \(\mathscr {C}^\infty _c({\mathbb {R}})\) has the structure of a Fréchet space, induced by the family of seminorms \(\Vert \cdot \Vert _{\mathscr {C}^k}\), for \(k \in {\mathbb {N}}\). We will say that a map \(\psi :\Sigma \rightarrow \mathscr {C}^\infty _c({\mathbb {R}})\) is Lipschitz if it is Lipschitz with respect to \(\Vert \cdot \Vert _{\mathscr {C}^k}\), for all \(k \in {\mathbb {N}}\).
In the rest of the paper, we will implicitly identify maps a from \(\Sigma \) to some space of complex-valued measurable functions over \({\mathbb {R}}\) with complex-valued measurable functions on \(\Sigma \times {\mathbb {R}}\) by setting \(a(x,r) = [a(x)](r)\).
Theorem 2.2
(Rapid global-local mixing) Assume that F, defined as in (3), is accessible. For any Lipschitz map \(\psi :\Sigma \rightarrow \mathscr {C}^\infty _c({\mathbb {R}})\), for any Lipschitz map \(\Phi :\Sigma \rightarrow \mathscr {P}\), and for every \(\ell \in {\mathbb {N}}\), there exists a constant \(C=C(\ell , \psi , \Phi )\ge 0\) such that for all \(n \in {\mathbb {N}}\),
Rapid mixing holds in many more situations besides the one in Theorem 2.2. In a precise sense that will become clear later, the key property of the global observable that ensures rapid mixing is the absence of “low frequencies components” in almost every fiber, a property which is clearly satisfied by smooth functions which are periodic on the fibers.
The situation is different when the global observable vanishes at infinity. Note that, in this case, the average \(\nu _{{{\,\mathrm{av}\,}}}\) defined in (4) is zero.
Theorem 2.3
(Polynomial global-local mixing I) Assume that F, defined as in (3), is accessible. There exists a space of bounded continuous functions \(\mathscr {D} \subset \mathscr {C}_0({\mathbb {R}})\), which is dense in \(\mathscr {C}_0({\mathbb {R}})\) with respect to \(\Vert \cdot \Vert _{\infty }\), and there exists \(\alpha >0\) such that the following holds. For any Lipschitz map \(\psi :\Sigma \rightarrow \mathscr {C}^\infty _c({\mathbb {R}})\), and for any map \(\Phi :\Sigma \rightarrow \mathscr {D}\) which satisfies some explicit Lipschitz condition, there exists a constant \(C=C(\psi , \Phi )\ge 0\) such that for all \(n \in {\mathbb {N}}\),
The Lipschitz conditions for the global observable in Theorem 2.3 will be stated explicitly in Sect. 3, as well as a bound on \(\alpha \) (see the second paragraph of the proofs at §3.5). If we further assume that the global observable takes values in \(W^1({\mathbb {R}})\), where \(W^1({\mathbb {R}})\) is the Sobolev space of \(L^2\) functions with weak derivative in \(L^2\), then the statement reads as follows.
Theorem 2.4
(Polynomial global-local mixing II). For any Lipschitz map \(\psi :\Sigma \rightarrow \mathscr {C}^\infty _c({\mathbb {R}})\), for any Lipschitz map \(\Phi :\Sigma \rightarrow W^1({\mathbb {R}})\), and for any \(\varepsilon >0\), there exists a constant \(C=C(\psi , \Phi , \varepsilon )\ge 0\) such that for all \(n \in {\mathbb {N}}\),
If moreover \(\Phi :\Sigma \rightarrow W^1({\mathbb {R}}) \cap L^p({\mathbb {R}})\) for some \(1 \le p \le 2\), then
Remark 2.5
The bound in Theorem 2.4 is optimal: we will provide an example in §3.6 of a pair of global and local observables \(\Phi , \psi \) for which the correlations are bounded below by \(| {{\,\mathrm{cov}\,}}(\Phi \circ F^n, \psi ) | \ge B n^{-\frac{1}{2}}\) for some constant \(B>0\), and, on the other hand, Theorem 2.4 implies that for any \(\varepsilon >0\) there exists a constant \(C>0\) such that \(| {{\,\mathrm{cov}\,}}(\Phi \circ F^n, \psi ) | \le C n^{-\frac{1}{2}+\varepsilon }\).
We stated our results on the symbolic systems, as it makes the statements easier to read. Thanks to the semiconjugacy between the diffeomorphism and the symbolic dynamics, analogous classes of observables give analogous results on the original dynamics, as their regularity is preserved (see Lemma 3.2 for the details).
3 The Main Result
In this section, we first recall the definition of accessibility for F as in (3); then, we describe the classes of global and local observables we consider, and we state our main result. We deduce Theorems 2.2, 2.3 and 2.4 from Theorem 3.9. To help the reader in following the flow of the proofs, some of the lemmas stated in this section are proved in “Appendix B.”
3.1 Accessibility
For each point \(x\in \Sigma \), we define the stable and unstable set at x by, respectively,
By definition, for any \(y \in W^s(x)\), \(d_\theta (\sigma ^nx, \sigma ^ny) \rightarrow 0 \) exponentially fast and, similarly, for \(y \in W^u(x)\), \(d_\theta (\sigma ^{-n}x, \sigma ^{-n}y) \rightarrow 0 \); moreover, note that \(d_\theta \) attains a discrete set of values \(\{ \theta ^{i} \}_{i \in {\mathbb {N}}}\).
The skew product (3) is partially hyperbolic in the following sense. Let us denote by \(f_n(x) = \sum _{i=0}^{n-1} f \circ \sigma ^i(x)\) the n-th Birkhoff sum at x. Let us define for any \((x,r) \in \Sigma \times {\mathbb {R}}\)
We equip \( \Sigma \times \mathbb {R}\) with the product distance given by
it is easy to see that
The sets \(W^s(x,r)\) and \(W^u(x,r)\) are called the (strong) stable and (strong) unstable manifold at \((x,r) \in \Sigma \times {\mathbb {R}}\). Vertical lines \(\{x\} \times {\mathbb {R}}\) constitute the center manifolds, namely they form an invariant fibration and the action of F on each line is isometric.
We now define the accessibility property. A su-path from (x, r) to (y, s) is a finite sequence \((x^i,r_i) \in \Sigma \times {\mathbb {R}}\), for \(0 \le i \le m\) for some \(m \in {\mathbb {N}}\), such that \((x^0,r_0) = (x,r)\), \((x^m,r_m)= (y,s)\), and \((x^i,r_i) \in W^s(x^{i-1},r_{i-1})\) or \((x^i,r_i) \in W^u(x^{i-1},r_{i-1})\) for all \(1 \le i \le m\). We say that F is accessible if for any two points \((x,r), (y,s) \in \Sigma \times {\mathbb {R}}\) there is a su-path from (x, r) to (y, s).
A consequence of the accessibility property is the following fact, which will be proved in “Appendix B.”
Lemma 3.1
If F is accessible, then f is not cohomologous to zero.
This lemma is used in Sect. 6 when analyzing the analytic perturbation of the transfer operator. However, we still need to directly use accessibility of F in other parts of the proof.
3.2 The Classes of Global and Local Observables
We now describe the classes of global and local observables we consider, which we will call good global and local observables. Let us start by observing that the average defined in (4) is invariant under F.
Lemma 3.2
If \(\Phi \) is a global observable according to Definition 2.1, then \(\Phi \circ F\) is a global observable and \(\nu _{{{\,\mathrm{av}\,}}}(\Phi \circ F)=\nu _{{{\,\mathrm{av}\,}}}(\Phi )\).
The proof of Lemma 3.2, which can be found in “Appendix B,” is a consequence of the invariance of the Gibbs measure with respect to the dynamics.
We will denote by \(\mathscr {S}\) the Fréchet space of Schwartz functions on \({\mathbb {R}}\), with the family of seminorms
We will say that a function \(\psi :\Sigma \rightarrow \mathscr {S}\) is Hölder if it is Hölder with respect to \(\Vert \cdot \Vert _{a,\ell }\) for all \(a,\ell \in {\mathbb {N}}\). Starting from definition (2.1), we restrict ourselves from now on to smaller classes of observables.
Definition 3.3
(Good local observables) We denote by \(\mathscr {L}\subset L^1(\nu )\) the space of Hölder functions \(\psi :\Sigma \rightarrow \mathscr {S}\).
Let \(\eta \) be a complex measure over \({\mathbb {R}}\). We will denote by \(|\eta |\) the variation of \(\eta \) and by \(\Vert \eta \Vert _{{{\,\mathrm{TV}\,}}} = |\eta |({\mathbb {R}})\) its total variation. We recall that the Fourier–Stieltjes transform \({\widehat{\eta }}(r)\) of a complex measure \(\eta \) of finite total variation is the \(L^\infty \) function defined by
Let \(\mathscr {A}\) be the set of such \({\widehat{\eta }}\)’s: the space \(\mathscr {A}\) of all Fourier–Stieltjes transforms is an algebra of functions, called the Fourier–Stieltjes algebra. We equip \(\mathscr {A}\) with the total variation norm, namely, for \({\widehat{\eta }}_1, {\widehat{\eta }}_2 \in \mathscr {A}\), we set
Definition 3.4
(Good global observables). We denote by \(\mathscr {G}\subset L^\infty (\nu )\) the space of Hölder functions \(\Phi :\Sigma \rightarrow \mathscr {A}\) which satisfy the following tightness condition:
where \(\widehat{\eta _x} = \Phi (x)\).
The tightness condition (TC) above ensures that one can control the large frequency behavior of \(\Phi (x,\cdot )\) uniformly in the point \(x \in \Sigma \). In particular, it will be exploited in the proofs of Lemma 10.2 and Proposition 10.3 in “Appendix C” to have some compactness property.
Remark 3.5
Any Hölder function from \(\Sigma \) into a metric space can be made Lipschitz by choosing a larger \(\theta \) and hence changing the metric \(d_\theta \) on \(\Sigma \). Since we are not imposing any condition on \(\theta \), here and henceforth we will assume that good local observables are Lipschitz maps \(\psi :\Sigma \rightarrow \mathscr {S}\) and good global observables are Lipschitz maps \(\Phi :\Sigma \rightarrow \mathscr {A}\) satisfying (TC).
The following lemma shows that elements of \(\mathscr {G}\) are indeed global observables; more precisely, the average \(\nu _{{{\,\mathrm{av}\,}}}\) of \(\Phi \) is the average of the values of the associated measures \(\eta _x\) at 0. See “Appendix B” for the short complex integration computation which leads to the result.
Lemma 3.6
If \(\Phi \in \mathscr {G}\), then \(\Phi \) is a global observable according to Definition 2.1 and
where as before, \(\widehat{\eta _x} = \Phi (x)\).
We conclude this section by providing a useful criterion to determine whether a given function is the Fourier–Stieltjes transform of a finite complex measure which satisfies (TC). A positive definite function is any function \(g :{\mathbb {R}}\rightarrow {\mathbb {C}}\) such that
for all \(n \ge 1\), \(x_i,x_j \in {\mathbb {R}}\) and \(z_i, z_j \in {\mathbb {C}}\). By Bochner’s theorem, a function g is continuous and positive definite if and only if it is the Fourier–Stieltjes transform \({\widehat{\eta }}\) of a finite positive measure \(\eta \) on \({\mathbb {R}}\) (see, e.g., [45, Theorem IX.9]). For example, it is easy to check that \(g(x) = e^{ix}\) or \(g(x)=\cos (x)\) are positive definite functions. A less trivial example is the function \(g(x)=\frac{1}{|x|+1}\); the fact that g is positive definite follows from Pólya’s Criterion: any positive, continuous, even function which, for positive x, is non-increasing, convex and tends to 0 for \(x\rightarrow \infty \) is the Fourier–Stieltjes transform of an \(L^1\) function, thus positive definite. We refer the reader to [36] and [49, Chapter 6] for more results concerning the Fourier–Stieltjes transform of measures.
Lemma 3.7
Any linear combination of Lipschitz positive definite functions is the Fourier–Stieltjes transform of a complex measure of finite total variation which satisfies (TC).
Since any complex measure of finite total variation is a linear combination of positive finite measures, the proof of the lemma above follows immediately from the following tail estimate, whose proof can be found in “Appendix B.”
Lemma 3.8
Let \(\eta \) be a finite positive measure on \({\mathbb {R}}\), let \(\Phi (r)\) be its Fourier–Stieltjes transform. Then, if \(\Phi (r)\) is Lipschitz of constant L, for all \(r >0\) we have
3.3 Statement of the Main Result
For any good global observable \(\Phi \in \mathscr {G}\) and for any \(r >0\), let us define the “low frequency variation” as
Notice that \({{\,\mathrm{LF}\,}}(\Phi , \cdot )\) is monotone and \({{\,\mathrm{LF}\,}}(\Phi , r) \rightarrow 0\) for \(r \rightarrow 0\). We are now ready to state our main result.
Theorem 3.9
(Quantitative global-local mixing). Assume that F, defined as in (3), is accessible. Then, for every \(\psi \in \mathscr {L}\), for every \(\Phi \in \mathscr {G}\), for any \(k \in {\mathbb {N}}\), and for every \(\varepsilon >0\), there exists a constant \(C=C(\Phi , \psi , k, \varepsilon )>0\) such that for every \(n \in {\mathbb {N}}\),
The bound in Theorem 3.9 is the sum of two terms, namely a superpolynomial term and the contribution given by the measures \(|\eta _x|\) close to 0. In particular, if the support of the measures \(\eta _x\) does not intersect some neighborhood of 0, then the decay of correlations is superpolynomial. On the other hand, for example under the assumptions of Theorem 2.4, the measures \(|\eta _x|\) are absolutely continuous and the decay is polynomial.
In the rest of the section, we prove Theorems 2.2, 2.3 and 2.4 from the result above.
3.4 Proof of Theorem 2.2
We deduce Theorem 2.2 from Theorem 3.9.
By the theory of Fourier series, any \(p \in \mathscr {P} \subset \mathscr {C}^2({\mathbb {R}})\), by periodicity, is the Fourier–Stieltjes transform of a discrete measure \(\eta \) of the form \(\eta =\sum _{n \in {\mathbb {Z}}} a_n \delta _n\), where \(a_n \in {\mathbb {C}}\) and \(\delta _n\) is the Dirac measure at n. We claim that any Lipschitz map \(\Phi :\Sigma \rightarrow \mathscr {P}\) is contained in \(\mathscr {G}\). Theorem 3.9 then immediately implies the result, since \(|\eta _x| ( (-1,1) \setminus \{0\} )=0\) (where, as usual, we write \(\Phi (x) = \widehat{\eta _x}\)).
We first check the Lipschitz condition. For \(x \in \Sigma \), let us write \(\eta _x = \sum _{n \in {\mathbb {Z}}} a_n(x) \delta _n\). Since \(\Phi (x) \in \mathscr {C}^2({\mathbb {R}})\), it follows that \(\lim _{n \rightarrow \infty }|n^2 a_n(x)| = 0\). In particular, the sequence \(|a_n(x)| \cdot (1+i|n|)\) is square-summable (notice that \(ina_n(x)\) are the Fourier coefficients of the derivative \(\Phi (x)'\)). Thus, for any \(x,y \in \Sigma \), by Cauchy–Schwartz, we have
Hence, by Plancharel formula, there exists a constant \(C >0\) such that
This shows that \(\Phi :\Sigma \rightarrow \mathscr {G}\) is Lipschitz.
We now show that \(\Phi \) satisfies the tightness condition (TC). Since \(\Phi (x) \in \mathscr {C}^2({\mathbb {R}})\), we can bound \(|a_n(x)| \le \Vert \Phi (x)''\Vert _\infty n^{-2}\). Thus, for any \(r \ge 2\) we have
which concludes the proof.
3.5 Proof of Theorems 2.3 and 2.4
Let us first prove Theorem 2.3. To this end, fix any \(p>1\) and consider as \(\mathscr {D}\) the space of Fourier transforms of functions \(f \in L^1 \cap L^p\) with power decay, namely, for which there exist constants \(A,a >0\) such that \(f(\xi ) \le A |\xi |^{-a}\) for all \(|\xi | \ge 1\). Then, since \(\mathscr {S} \subset \mathscr {D} \subset \mathscr {G}\), it is clear that \(\mathscr {D}\) is dense in \(\mathscr {C}_0({\mathbb {R}})\).
Consider \(\Phi :\Sigma \rightarrow \mathscr {D}\), and write \(\Phi (x) = \widehat{\eta _x}\) where \(\mathop {}\!\mathrm {d}\eta _x = f_x(\xi ) \mathop {}\!\mathrm {d}\xi \), with \(f_x \in L^1 \cap L^p\). The Lipschitz condition we need to impose on \(\Phi \) to be a good global observable is \(\Vert \eta _x - \eta _y\Vert _{{{\,\mathrm{TV}\,}}} = \Vert f_x - f_y\Vert _{L^1} \le L(\Phi ) \, d_{\theta }(x,y)\) for some constant \(L(\Phi ) >0\). In order to conclude, we show that \(|\eta _x|\big ( -n^{-\frac{1}{2}+ \varepsilon }, n^{-\frac{1}{2}+ \varepsilon }\big )\) decays as a power of n for all \(x \in \Sigma \). Let \(0<{{\tilde{\alpha }}} = \frac{1}{2} - \varepsilon < \frac{1}{2}\). Then, using Hölder inequality, with \(\frac{1}{p}+\frac{1}{q} = 1\),
Therefore, Theorem 2.3 holds for any \(\alpha \) of the form \({{\tilde{\alpha }}} (1\!-\!\frac{1}{p}) \!=\! (\frac{1}{2} \!-\! \varepsilon )(1\!-\!\frac{1}{p}) \), with \(\varepsilon >0\).
Let us now prove Theorem 2.4. It follows from [5, Theorem 4.2] that any function \(f \in W^1\) is the Fourier transform of a function \(g \in L^1 \cap L^2\), which satisfies \(\Vert g\Vert _2 = \Vert f\Vert _2\) and \(\Vert g\Vert _1 \le \Vert f\Vert _{W^1}\). This implies that any Lipschitz map \(\Phi :\Sigma \rightarrow W^1\) is Lipschitz also with respect to the total variation norm.
Let us check that \(\Phi \) satisfies the tightness condition (TC). Denote \(\mathop {}\!\mathrm {d}\eta _x(\xi ) = f_x \mathop {}\!\mathrm {d}\xi \), with \(f_x \in L^1 \cap L^2\). Again, it follows from [5, Theorem 4.2] that \(\xi f_x(\xi ) \in L^2\) and \( \Vert \xi f_x(\xi )\Vert _2 \le \Vert \Phi (x)'\Vert _2\). For any \(r \ge 2\), by Cauchy–Schwartz and by Plancharel formula, we have
The estimate \(|\eta _x|\big ( -n^{-\frac{1}{2}+ \varepsilon }, n^{-\frac{1}{2}+ \varepsilon } \big ) = O \big ( n^{-\frac{1}{4}+ \varepsilon }\big )\) follows from Cauchy–Schwarz inequality exactly as above. If in addition \(\Phi \) has range in \(W^1 \cap L^p\), then the functions \(f_x\) belong to \(L^q\), where \(\frac{1}{p} + \frac{1}{q} = 1\), and one can conclude using Hölder inequality again. This finishes the proof.
3.6 Example
We discuss a simple example, which shows that the bound in Theorem 2.4 cannot, in general, be improved. As good local observable, let us consider any nonnegative \(\psi (x,r) = \psi (r) \in \mathscr {C}^\infty _c({\mathbb {R}})\) which equals 1 in the interval \(\left[ -\frac{1}{2},\frac{1}{2} \right] \) and, as good global observable, let \(\Phi (x,r)= \Phi (r) = \frac{1}{1+|r|}\). Then, \(\Phi \in W^1({\mathbb {R}}) \cap L^p({\mathbb {R}})\) for any \(p >1\), so that Theorem 2.4 implies that for any \(\varepsilon >0\) there exists a constant \(C\ge 0\) such that
Let us show that there is a lower bound of order exactly \(O(n^{-\frac{1}{2}})\).
Lemma 3.1 implies that f is not cohomologous to zero. Moreover, by the Central Limit Theorem, there exists a constant \(C'>0\) such that for any \(n \in {\mathbb {N}}\) sufficiently large, on a subset \(Y_n \subset \Sigma \) of measure at least 1/2, the Birkhoff sums \(f_n (x) = f(x) + \cdots + f(\sigma ^{n-1}x)\) are bounded by \(|f_n(x)| \le C' \sqrt{n}\). In particular, for any \(x \in Y_n\) and \( r \in \left[ -\frac{1}{2},\frac{1}{2} \right] \), we have
Thus, for any \(n \in {\mathbb {N}}\) sufficiently large, it follows that
We have shown that there exists a constant \(C = (4C')^{-1}\) and, for any \(\varepsilon >0\), there exists a constant \(C_\varepsilon >0\) such that we can bound the correlations by \(C n^{-\frac{1}{2}} \le {{\,\mathrm{cov}\,}}(\Phi \circ F^n, \psi ) \le C_\varepsilon n^{-\frac{1}{2}+\varepsilon }\), hence the bound of Theorem 2.4 is, in this case, optimal.
4 Skew Products Over One-Sided Subshifts
To prove Theorem 3.9, we have to first prove analogous statements for one-sided subshifts. In this section, we discuss the case of skew products over topologically mixing one-sided subshifts of finite type.
Let \(\sigma :X \rightarrow X\) be a topologically mixing one-sided subshift of finite type, equipped with a Gibbs measure \(\mu = \mu _u\) with respect to the potential u. For \(0<\theta < 1\), the distance \(d^+_\theta \) and the space of Lipschitz functions \(\mathscr {F}^+_{\theta }\) are defined analogously to the case of the two-sided shift. Let \(f^+ \in \mathscr {F}^+_{\theta }\) be a real-valued Lipschitz function with zero average and consider the skew shift
Denote by \(\nu \) the infinite measure \(\mu \times {{\,\mathrm{Leb}\,}}\) on \(X \times {\mathbb {R}}\). For any pair of global and local observables \(\Phi , \psi \) over \(X \times {\mathbb {R}}\), define the analogous correlation function
4.1 Good Global and Local Observables for Skew Shifts Over One-Sided Subshifts
The class of global and local observables we consider in this case are described below. In this setting, we require less regularity of the observables than in the case of two-sided shifts.
Definition 4.1
(Good local observables—one-sided case). Let \(\mathscr {L}^+\subset L^1(\nu )\) be the space of functions \(\psi :X \rightarrow \mathscr {S}\) such that, for every \(\ell \in {\mathbb {N}}\), the function \(x \mapsto \partial ^{\ell } \psi (x)\) from X to \(L^1({\mathbb {R}})\) is Lipschitz. For every \(\psi \in \mathscr {L}^+\), denote by \({{\,\mathrm{Max}\,}}_\ell (\psi )\) and \({{\,\mathrm{Lip}\,}}_\ell (\psi )\) the minimum constants such that
Let us remark that, if \(\psi \in \mathscr {L}^+\), then, for every fixed \(x \in X\), the Fourier transform \(\widehat{\psi (x)}\) of \(\psi (x) \in \mathscr {S}\) is a Schwarz function as well. For any fixed \(\xi \in {\mathbb {R}}\), we denote by \({\widehat{\psi }}_\xi :X \rightarrow {\mathbb {C}}\) the function \({\widehat{\psi }}_\xi (x)=\widehat{\psi (x)}(\xi )\).
Lemma 4.2
Let \(\psi \in \mathscr {L}^+\). For every \(\xi \in {\mathbb {R}}\), we have \({\widehat{\psi }}_\xi \in \mathscr {F}^+_\theta \). Moreover, for every \(\ell \ge 0\), and for all \(\xi \ne 0\) we have
Proof
For any \(\xi \ne 0\), \(x \in X\), and \(\ell \ge 0\) we have, by assumption in equation (8),
hence \(\sup _x |{\widehat{\psi }}_\xi (x)| \le {{\,\mathrm{Max}\,}}_\ell (\psi ) \xi ^{-\ell }\). Similarly, for any \(x \ne y \in X\),
so that, for any fixed \(\xi \ne 0\), we have \(|{\widehat{\psi }}_\xi |_{\theta } \le {{\,\mathrm{Lip}\,}}_\ell (\psi ) \xi ^{-\ell }\). \(\square \)
Let us recall that \(\mathscr {A} \subset L^{\infty }\) denotes the space of Fourier–Stieltjes transforms of complex measures with finite total variation.
Definition 4.3
(Good global observables—one-sided case). Let \(\mathscr {G}^+ \subset L^{\infty }(\nu )\) be the space of bounded functions \(\Phi :X \rightarrow \mathscr {A}\). For \(\Phi \in \mathscr {G}^+\), we define
where as usual, \(\widehat{\eta _x} = \Phi (x)\).
In the next sections, we will deal only with the non-invertible case of the skew product \(F^+\) and we will often suppress the \(+\) in the notations introduced above, as it should not generate confusion. In Sect. 10, we will return to the invertible setting.
4.2 Collapsed Accessibility
The property we need in the case of one-sided shifts which will replace the accessibility assumption is the following notion of collapsed accessibility.
Definition 4.4
A Lipschitz function \(f :X \rightarrow {\mathbb {R}}\) has the collapsed accessibility property if there are constants C and N such that the following holds: for any \(x \in X, t \in [0,1],\) and \(n \ge 2 N\), there is a sequence of points
such that
-
(1)
\(m \le N\) and \(x_1 = x_{m+1} = x\);
-
(2)
\(\sigma ^n x_i = \sigma ^n y_i\);
-
(3)
\(d(y_i, x_{i+1}) \le C r^n\); and
-
(4)
\(t = \sum _{k=1}^m f_n(x_k) - f_n(y_k)\).
The adjective “collapsed” refers to the fact that local stable manifolds are collapsed to points when going from \(\Sigma \times {\mathbb {R}}\) to \(X \times {\mathbb {R}}.\)
In order to prove Theorem 3.9, we will see in Sect. 10 that we can reduce an accessible skew product F to a skew product \(F^+\) over a one-sided shift such that \(f^+\) enjoys the collapsed accessibility property.
4.3 The One-Sided Version of the Main Theorem
We state our main theorem in the case of skew products over one-sided subshifts which have the collapsed accessibility property. In Sect. 10, we will deduce Theorem 3.9 from Theorem 4.5.
Theorem 4.5
(Quantitative global-local mixing for one-sided subshifts). Assume that \(f^+\), defined as in (7), has the collapsed accessibility property. Then, for every \(\psi \in \mathscr {L}^+\), for every \(\Phi \in \mathscr {G}^+\), for any \(k \in {\mathbb {N}}\), and for every \(\varepsilon >0\), there exists a constant \(C=C(\Phi , \psi , k, \varepsilon )>0\) such that for every \(n \in {\mathbb {N}}\),
The “low frequency” term \({{\,\mathrm{LF}\,}}(\Phi , \cdot )\) in Theorem 4.5 is defined exactly as in (6), except that the integral is on X instead of \(\Sigma \).
4.4 An Expression for the Correlation Function
The main tool to study the correlations is the transfer operator. We recall the relevant definitions.
We denote by \(L = L_\sigma :L^1(\mu ) \rightarrow L^1(\mu )\) the transfer operator for the base dynamics \(\sigma :X \rightarrow X\), namely the operator on \( L^1(\mu )\) defined implicitly by
for \(v \in L^{\infty }(\mu )\) and \(w \in L^1(\mu )\). Similarly, we denote by \(L_{F^+}:L^1(\nu ) \rightarrow L^1(\nu )\) the transfer operator associated to \(F^+\), that is, the operator which, for every \(\Phi \in L^{\infty }(\nu )\) and \(\psi \in L^1(\nu )\), satisfies
Explicitly, for any \(n \in {\mathbb {N}}\), we can write
For any \(z \in {\mathbb {C}}\), we let us further define the twisted transfer operator \({\mathcal {L}}_{z} :L^1(\mu ) \rightarrow L^1(\mu )\) by
where u is the potential defining the Gibbs measure and \(u_n\) its cocycle. Notice that all the operators described above restrict to operators acting on \(\mathscr {F}^+_\theta \).
Proposition 4.6
Let \(\psi \in \mathscr {L}^+\) and \(\Phi \in \mathscr {G}^+\). Then, for every \(n \in {\mathbb {N}}\) we have
Proof
By definition of the transfer operator \(L_{F^+}\), we can write
where the applicability of the Fubini–Tonelli Theorem follows immediately from the definition of \(\mathscr {G}^+\) and \(\mathscr {L}^+\). Since \(\Phi (x)\) is the Fourier–Stieltjes transform of a measure \(\eta _x\) we get
For every \(x \in X\), we have
thus we can again apply the Fubini–Tonelli Theorem to get
The conclusion follows by construction due to the equality
\(\square \)
5 Cancellations for Twisted Transfer Operators
From Proposition 4.6, it is clear that, in order to estimate the correlations, we need to study the twisted transfer operators \(\mathcal {L}_\xi \), for real frequencies \(\xi \in {\mathbb {R}}\). The aim of this section is to show that the collapsed accessibility property can be exploited to obtain some cancellations in the expression of \(\mathcal {L}_\xi \).
Let us fix a complex-valued Lipschitz function \(\tilde{g}:X \rightarrow {\mathbb {C}}\), and let \(|\tilde{g}| = g\). In this section, we use a tilde to denote a complex-valued or “twisted” function, and the same letter without a tilde to denote its absolute value. We denote by \(\mathcal {L} :\mathscr {F}_\theta ^+ \rightarrow \mathscr {F}_\theta ^+\) the operator defined by
and by \(L:\mathscr {F}_\theta ^+ \rightarrow \mathscr {F}_\theta ^+\) the positive “untwisted” operator
Up to conjugating L with a suitable multiplication operator, we can assume that \(L 1 = 1\), namely
for all \(x \in X\).
Notice that the for the operator \(\mathcal {L}_\xi \) defined in the previous section, we have \(\tilde{g}= \exp (u + i \xi f)\), where u is the potential for the Gibbs measure \(\mu \).
One can easily see that \(|\mathcal {L} \tilde{v}(x)| \le Lv(x)\). Moreover, recall that \(| \cdot |_\theta \) is the Lipschitz seminorm defined in (2). Then, the following Lasota–Yorke inequality holds, see [41, Proposition 2.1].
Lemma 5.1
(Basic inequality). There exists a constant \(C_0>0\) such that
where \(R= C_0 |\tilde{g}|_\theta \).
By induction,
where \(g_n\) and \(\tilde{g}_n\) are the cocycles
It follows that for all \(n \ge 1\) we have
5.1 Collapsed Accessibility and Cancellation Pairs
Let us fix a positive constant \(\varepsilon > 0\), and an integer \(n \ge 1\). We assume \(\varepsilon < \frac{1}{2}\) and \(\varepsilon < 1 - \theta \). Define
A Lipschitz function \(\tilde{v}: X \rightarrow {\mathbb {C}}\) is a nice observable if \(|\tilde{v}|_\theta \le H\) and \(1 - \varepsilon< v(x) < 1\) for all \(x \in X\) (as always, \(|\tilde{v}| = v\)).
We say \(\mathcal {L}\) has \((\varepsilon ,n)\)-cancellation if for any observable \(\tilde{v}\) with \(|\tilde{v}|_\theta \le H\) and \(0 \le v(x) < 1\) for all \(x \in X\), there is an integer \(0 \le k \le n\) and a point \(x \in X\) such that \( |\mathcal {L}^k \tilde{v}(x)| \le 1 - \varepsilon \). We say \(\mathcal {L}\) has strong \((\varepsilon ,n)\)-cancellation if for every nice observable \(\tilde{v}\), there is a point \(x \in X\) such that \( |\mathcal {L}^n \tilde{v}(x)| \le 1 - \varepsilon \). One can see that strong \((\varepsilon ,n)\)-cancellation implies \((\varepsilon ,n)\)-cancellation.
A pair of points (x, y) in X is a stable pair if \(\sigma ^n x = \sigma ^n y\). We say a stable pair (x, y) is a cancellation pair for a nice observable \(\tilde{v}\) if
Lemma 5.2
If (x, y) is a cancellation pair for \(\tilde{v}\), then
where \(p = \sigma ^n x = \sigma ^n y\).
Proof
By definition of cancellation pair, we have
where we used that \(L^n1=1\). \(\square \)
For a stable pair (x, y) define the phase of (x, y) as
Here, arg is the complex argument and so the phase is the angle between \(\tilde{g}_n(x)\) and \(\tilde{g}_n(y)\) in the complex plane. For the most part, we can just think of this value as an angle. However, if we include it in an inequality, we will assume it is a real number between \(-\pi \) and \(\pi \).
Define the stable tolerance of (x, y) as the number \(0< \delta < \pi \) which satisfies
Note that the right-hand side must be less than two for this to be well defined. In practice, we will always choose \(\varepsilon \) small enough so that this is the case.
Proposition 5.3
Let (x, y) be a stable pair, and \(\tilde{v}\) a nice observable. If (x, y) is not a cancellation pair for \(\tilde{v}\), then
where \(\delta \) is the stable tolerance of (x, y).
In other words, if s is the phase of (x, y), then
ignoring issues of the angle only being defined up to a multiple of \(2\pi \).
To prove the proposition, we first establish the following lemma.
Lemma 5.4
Let \(z_1\) and \(z_2\) be non-zero complex numbers with \(\alpha = \arg (\frac{z_1}{z_2})\). If
then
Proof
Write \(z_0 = z_1 + z_2\) and \(r_k = |z_k|\) for \(k = 0,1,2\). We wish to show that \(r_0^2 \le (r_1 + r_2 - \varepsilon )^2\). The cosine rule implies that
and so it is enough to show
This may be rewritten as
\(\square \)
Proof of Proposition 5.3
We show the contrapositive. Define \(z_1 = \tilde{g}_n(x) \tilde{v}(x)\) and \(z_2 = \tilde{g}_n(y) \tilde{v}(y)\). Then
and a similar estimate holds for \(|z_2|\). Using \(\varepsilon < \frac{1}{2}\) and the definition of the stable tolerance, one sees that
Let \(\alpha \) be the angle between \(z_1\) and \(z_2\). If \(\delta < \alpha \), then \(1 - \cos (\delta ) < 1 - \cos (\alpha )\) and Lemma 5.4 shows that (x, y) is a cancellation pair for \(\tilde{v}\). \(\square \)
For an arbitrary pair (x, y) of points in X, define the unstable tolerance as \(0 \le \delta < \frac{\pi }{2}\) such that
Note that x and y must be reasonably close for this to be well defined.
Proposition 5.5
If (x, y) is a pair with unstable tolerance \(\delta \) and \(\tilde{v}\) is a nice observable, then
Again, we rely on a trigonometric lemma.
Lemma 5.6
Let \(z_1\) and \(z_2\) be non-zero complex numbers with angle \(\alpha = \arg (\frac{z_1}{z_2})\). If \(0< \alpha < \frac{\pi }{4}\) and \(1 - \varepsilon \le |z_i| \le 1\), then
Proof
Assume \(|z_1| > |z_2|\) and consider the acute triangle defined by the points 0, \(z_1\) and \(z_2\) in complex plane. Split this triangle into two right triangles by adding a line segment from \(z_2\) to the opposite side of the triangle. This new segment has length \(|z_2| \sin (\alpha ) \ge (1-\varepsilon ) \sin (\alpha )\) and so the line segment from \(z_1\) to \(z_2\) has length at least \((1 - \varepsilon ) \sin (\alpha )\). \(\square \)
Proof of Proposition 5.5
Let \(z_1 = \tilde{v}(x)\) and \(z_2 = \tilde{v}(y)\) and let \(\alpha \) be the angle between them. The above lemma and the definition of “nice” together imply that
Since \(\varepsilon < \frac{1}{2}\) by assumption, the result follows. \(\square \)
A us-cycle is a (finite) sequence of points in X:
where \(x_1 = x_{m+1}\) and each pair \((x_k, y_k)\) is a stable pair. The tolerance of the cycle is the sum of the stable tolerances of the pairs
and the unstable tolerances of the pairs
We only consider us-cycles for which this tolerance is well defined. The phase of the cycle is
That is, the phase of the cycle is the sum of the phases of the individual stable pairs (up to a multiple of \(2\pi \)). As defined, the phase is a number in \((-\pi , \pi ]\). We will only consider cycles where the phase is positive.
Proposition 5.7
If there is a us-cycle where the phase is greater than the tolerance, then \(\mathcal {L}\) has strong \((\varepsilon , n)\)-cancellation.
Proof
Let \(\tilde{v}\) be a nice observable. Our goal is to show that one of the stable pairs in the cycle is a cancelling pair for \(\tilde{v}\). We assume none of them is a cancelling pair and derive a contradiction.
Let S be the phase of the cycle and \(\delta = \delta _s + \delta _u\) be the tolerance, where \(\delta _s\) is the sum of the stable tolerances and \(\delta _u\) is the sum of the unstable tolerances. We are assuming \(0< \delta < S.\)
Proposition 5.3 implies that
and Proposition 5.5 implies that
Since \(x_1 = x_{m+1}\), the complicated product in the middle of each inequality is actually the same complex number and so we get \(S - \delta _s < \delta _u\), a contradiction. \(\square \)
5.2 Cancellation by Frequency
We now apply the results above to the specific case of the operators \(\mathcal {L}_\xi \) defined in the previous section, namely to the case
where \(\tilde{g}_\xi = \exp (u + i \xi f)\). To simplify the presentation we only consider positive \(\xi \), but analogous results will hold for negative frequencies.
One can show that the Lipschitz norm of \(\tilde{g}_\xi \) satisfies \(|\tilde{g}_\xi |_\theta \le |g|_\theta + \xi |f|_\theta \), and so each twisted operator satisfies a Lasota–Yorke inequality \( |\mathcal {L}_\xi \tilde{v}|_\theta \le \theta |\tilde{v}|_\theta \ + R_\xi \Vert \tilde{v}\Vert _\infty \) and by Lemma 5.1, \(R=R_\xi \) grows linearly in \(\xi \).
The notion of a “nice observable” will also depend on the frequency. In particular, the value H from the previous section depends on \(\xi \) and so \(H = H_\xi = \frac{2}{1-\theta } R_\xi \), which also grows linearly in \(\xi \). Define a constant \(G = \inf \{ \frac{1}{g(x)} : x \in X \}\) and an exponent \(\alpha > 0\) determined by \(\theta ^\alpha G = 1\). Note that G and \(\alpha \) are independent of the frequency.
We now show that accessibility of the skew product leads to cancellation of these twisted operators and we give quantitative estimates of the amount of cancellation.
Proposition 5.8
Suppose \(f^+\) has the collapsed accessibilty property and \(\xi _0 > 0\) is given. Then there are positive constants A and B such that if \(\xi \ge \xi _0\) and
then \(\mathcal {L}_\xi \) has strong \((\varepsilon _\xi ,n_\xi )\)-cancellation.
Remark 5.9
One can see from the definitions of \(\varepsilon _\xi \) and \(n_\xi \) that \(\varepsilon _\xi = \frac{1}{A} \theta ^{\alpha n_\xi } \ge \frac{\theta }{A B^\alpha } \xi ^{-\alpha }\).
Proof
Assume without loss of generality that \(0< \xi _0 < \pi \). The overall strategy of the proof is to use collapsed accessibility to show that, for any frequency \(\xi \ge \xi _0\), there is a us-cycle with phase equal to \(\xi _0\) and tolerance less than \(\xi _0\). Proposition 5.7 then gives cancellation.
Let C and N be as given in Definition 4.4 of collapsed accessibility . Then there is a constant \(0< a < 1\) such that any angle \(0< \delta < \pi \) which satisfies either \(1 - \cos (\delta ) \le a\) or \(\sin (\delta ) \le a\) also satisfies \(\delta < \frac{1}{2N} \xi _0\).
Since \(H_\xi \) grows linearly in \(\xi \), there is \(B > 0\) such that \(2 C H_\xi \le a B \xi \), for all \(\xi \ge \xi _0\).
Up to increasing the value of B, we can also ensure that \(n > 2N\) for any integer n which satisfies \(\theta ^n < \frac{1}{B \xi _0}\). Define \(A = \frac{2}{a}\).
Now consider a specific frequency \(\xi \ge \xi _0\) and use \(n = n_\xi \) and \(\varepsilon = \varepsilon _\xi \) defined as in the statement of the proposition. Using this n and \(t = \frac{\xi }{\xi _0}\), there is a sequence of points \(x_1, y_1, x_2, y_2, \ldots , y_m, x_{m+1}\) satisfying Definition 4.4. This sequence is a us-cycle for \(\mathcal {L}_\xi \) and has phase equal to \(\xi _0\). If \(\delta \) is the stable tolerance of a pair \((x_k, y_k),\) then
If instead \(\delta \) is the unstable tolerance of a pair \((y_k, x_{k+1}),\) then
Together, these estimates show that the total tolerance of the us-cycle is less than \(\xi _0\) and so Proposition 5.7 gives cancellation. \(\square \)
6 Contraction
In this section, we show how to obtain some estimates on the norm of the operator \(\mathcal {L}_\xi \). For high frequencies, we exploit the cancellations obtained in the previous section, while, for low frequencies, we apply some standard results from the perturbation theory of bounded linear operators.
6.1 High Frequencies
Recall that we defined \(H := \max \left\{ 1, \frac{2 R}{1-\theta } \right\} \). It will be convenient to define the following norm on \(\mathscr {F}^+_\theta \): let
Notice that the norms \(\Vert \cdot \Vert _{H}\) and \(\Vert \cdot \Vert _\theta \) are equivalent, namely
In this section, we will prove the following result.
Proposition 6.1
Suppose that \(f^+\) has the collapsed accessibility property, and let \(\xi _0>0\) be given. Then, there exists positive constants \(A, B>0\) and an exponent \(\beta > 0\) such that for all \(\xi \ge \xi _0\) we have
for all \(N \ge B |\log \xi |\).
We start by proving some simple preliminary results.
Lemma 6.2
For any given \(\xi >0\), if \({{\tilde{v}}} \in \mathscr {F}_\theta ^+\), then \( \Vert \mathcal {L}_\xi {{\tilde{v}}} \Vert _H \le \Vert {{\tilde{v}}} \Vert _H\).
Proof
Clearly, \(\Vert \mathcal {L}_\xi {{\tilde{v}}} \Vert _\infty \le \Vert {{\tilde{v}}} \Vert _\infty \le \Vert {{\tilde{v}}} \Vert _H\). From the Basic Inequality in Lemma 5.1 we also get
since \(H > {R}/(1-\theta )\). This completes the proof. \(\square \)
Let us recall that, from the definition of Gibbs measure, it follows that there exist constants \(C_u, d\) such that for any ball B(x, r) centered at \(x \in X\) with radius \(r \ge 0\) we can bound
We will also use the fact that the untwisted transfer operator L on \(\mathscr {F}^+_\theta \) has a spectral gap, namely the following well-known result, see, e.g., [41, Theorem 2.2].
Lemma 6.3
There exist a bounded operator \(\mathcal {N}:\mathscr {F}_{\theta }^+ \rightarrow \mathscr {F}_{\theta }^+\), a real number \(0<\delta <1\), and a constant \(C>0\) such that for all \(n \in {\mathbb {N}}\) we have \(\left\Vert \mathcal {N}^n\right\Vert _{\theta } \le C \delta ^n\), and for all \(\tilde{v}\in \mathscr {F}_{\theta }^+ \),
We have the following result.
Lemma 6.4
There exists a constant \(C_1>0\) such that the following holds. For any \(\varepsilon >0\), \(\ell \ge 1\), and \({\tilde{v}} \in \mathscr {F}^+_\theta \) with \(|{{\tilde{v}}}|_\theta \le \ell \), if \(| {{\tilde{v}}} ({{\bar{x}}})| \le 1-\varepsilon \) for a point \({{\bar{x}}} \in X\), then
Proof
Define \(w = 1-v\) and note that \(w({{\bar{x}}}) \ge \varepsilon \) and the Lipschitz semi-norm of w satisfies \(|w|_\theta = |v|_\theta \le |{{\tilde{v}}}|_\theta \le \ell \). If B is the ball centered at \({{\bar{x}}}\) of radius \(r = \frac{\varepsilon }{2 \ell }\), then \(w(x) \ge \frac{\varepsilon }{2}\) for all \(x \in B\), so
Since \(\int _X v \mathop {}\!\mathrm {d}\mu = 1- \int _X w \mathop {}\!\mathrm {d}\mu \), the result follows. \(\square \)
Lemma 6.5
There exist constants \({{\bar{A}}}, {{\bar{B}}} >0\) such that the following holds. Assume that \(\mathcal {L}= \mathcal {L}_\xi \) has \((\varepsilon , n)\)-cancellation. Then, for every \({{\tilde{v}}} \in \mathscr {F}_{\theta }^+\) with \(\Vert {{\tilde{v}}}\Vert _{H} \le 1\), and for any \(N \ge N_0 := \lfloor -{{\bar{B}}} \log (\varepsilon /H) \rfloor \), we have
Proof
Let \(N_1\) and \(N_2\) be the minimum integers which satisfy
and let \(N_0 = N_1 + N_2\). It is clear from the definition that there exists a constant \({ {\bar{B}}} >0\) such that \(N_0 = \lfloor -{ {\bar{B}}} \log (\varepsilon /H) \rfloor \).
For every \(x \in X\),
where the last inequality follows from the definition of \(N_1\). From Lemma 6.2, we have \(| \mathcal {L}^n {{\tilde{v}}} |_\theta \le H \Vert {{\tilde{v}}}\Vert _H \le H\). Thus, Lemma 6.4 applied to \(| \mathcal {L}^{n} {{\tilde{v}}}|\) gives us
Therefore, we obtain
Finally, the inequality (10) gives us
By the definition of \(N_2\) and (12), we conclude
The inequality above and (12) conclude the proof. \(\square \)
We are in position to complete the proof of Proposition 6.1
Proof of Proposition 6.1
By Proposition 5.8, \(\mathcal {L}_\xi \) has \((\varepsilon _\xi , n_\xi )\)-cancellations, with \(\varepsilon \ge A_0 \xi ^{-\alpha }\) and \(n_\xi \le B_0 |\log \xi |\), for some positive constants \(A_0, B_0\). Therefore, by Lemma 6.5, for every \(N \ge N_0 + n_\xi \) we have
for some constant \(A>0\). By the definitions of \(N_0\) and \(n_\xi \), there exists a constant \(B>0\) such that \(N_0 + n_\xi \le B |\log \xi |\). \(\square \)
6.2 Low Frequencies
We now want to estimate the norm of \(\mathcal {L}_\xi \) for small \(\xi \in {\mathbb {R}}\). Let us notice that there exists \(\xi _0 >0\) such that for all \( 0 \le \xi \le \xi _0\) we have \(H = \max \left\{ 1, \frac{2 R}{1-\theta } \right\} = 1\), so that \(\Vert \cdot \Vert _H \le \Vert \cdot \Vert _\theta \le 2 \Vert \cdot \Vert _H\). We will prove the following bound.
Proposition 6.6
There exist \(\kappa >0\) and a constant \(A_\kappa >0\) such that, for all \(0< \xi < \kappa \) and for all \(n \ge 0\), we have
Let us recall that the family of operators \(z \mapsto \mathcal {L}_z\) is analytic for \(z \in {\mathbb {C}}\). This ensures that we can apply classical results from analytic perturbation theory to study the spectrum of bounded linear operators, see in particular [32, Theorem VII.1.8]. In our case, since the operator \(\mathcal {L}_0 = L\) has a spectral gap, we can deduce the following result, see [41, Chapter 4], [26, Proposition 2.3] and [48, p.15].
Theorem 6.7
There exists a \(\kappa >0\) such that the twisted transfer operator \(\mathcal {L}_z\) on \(\mathscr {F}^+_\theta \) has a spectral gap for all \(|z|<\kappa \). Moreover, there exist \(\lambda _z \in {\mathbb {C}}\) and linear operators \(\mathcal {P}_z\) and \(\mathcal {N}_z\) such that \(\mathcal {L}_z = \lambda _z \mathcal {P}_z + \mathcal {N}_z\) and which satisfy the following properties:
-
(1)
\(\lambda _z\), \(\mathcal {P}_z\) and \(\mathcal {N}_z\) are analytic on the disk \(\{|z| < \kappa \}\),
-
(2)
\(\mathcal {P}_z\) is a projection and its range has dimension 1,
-
(3)
\(\mathcal {P}_z\mathcal {N}_z = \mathcal {N}_z\mathcal {P}_z = 0\),
-
(4)
the spectral radius \(\rho (\mathcal {N}_z)\) of \(\mathcal {N}_z\) satisfies \(\rho (\mathcal {N}_z) < \lambda _z - \delta \), for some \(\delta \) independent of z.
In our case, we restrict to real frequencies \(0< \xi < \kappa \). For the proof of the following lemma, see [41, Chapter 4] and [48, Section 4].
Lemma 6.8
With the notation of Theorem 6.7, there exist constants \(A_\kappa , B_\kappa >0\) such that for all \(0< \xi < \kappa \) we have
The fact that \(A_\kappa \) is strictly positive follows from the fact that \(f^+\) is not cohomologous to zero, see Lemma 3.1.
Proof of Proposition 6.6
By Theorem 6.7, up to choosing a smaller \(\kappa \), we can assume that \(\rho (\mathscr {N}_\xi ) < |\lambda _\xi | \le 1- A_\kappa \xi ^2\) for all \(0 < \xi \le \kappa \). Therefore, for any \({\tilde{v}} \in \mathscr {F}^+_\theta \), we have
which proves the result. \(\square \)
7 Rapid Decay
In Sect. 8, we will use the contraction results established for the twisted transfer operator \(\mathcal {L}_\xi \) in the previous section in order to prove rapid mixing. In this section, we give several technical propositions in an abstract setting which encapsulate most of the difficult inequalities involved in the proof.
Definition 7.1
Consider a function \(w : A \subseteq (0,\infty ) \rightarrow {\mathbb {R}}\). We say \(w(\xi )\) decays rapidly in \(\xi \) if for each \(\ell \ge 1,\) there is a constant C such that \(|w(\xi )| \le C \xi ^{-\ell }\) for all \(\xi \). We say a sequence \(\{s_n\}\) decays rapidly in n if for each \(\ell \ge 1\), there is a constant C such that \(|s_n| \le C n^{-\ell }\) for all n.
Proposition 7.2
Suppose A, B, \(\beta \), and \(\xi _0\) are positive constants and that \(\{w_n\}\) is a decreasing sequence of nonnegative functions of the form \(w_n :[\xi _0, \infty ) \rightarrow [0, 1]\). If \(w_0(\xi )\) decays rapidly in \(\xi \) and
then the sequence \(\{s_n\}\) defined by \(s_n = \sup _\xi w_n(\xi )\) decays rapidly in n.
In order to prove this, we first give a lemma which establishes for each fixed \(\xi \) an exponential rate of decay of the sequence \(\{w_n(\xi )\}\).
Lemma 7.3
In the setting of Proposition 7.2, there are constants D and \(\gamma \) such that \(w_{n+K}(\xi ) < \frac{1}{e} w_n(\xi )\) for all \(K > D \xi ^\gamma \).
Proof
Consider a specific \(\xi \) and let k and N be the smallest integers such that \(k > \frac{1}{A} \xi ^\beta \) and \(N > B \log (\xi )\). Then \((1 - A \xi ^{-\beta })^k< \exp (-k A \xi ^{-\beta }) < \exp (-1)\) and so
If we choose an exponent \(\gamma > \beta \), then there is a constant D such that
Moreover, this constant D may be chosen uniformly for all \(\xi \). If \(K > D \xi ^\gamma \), then \(K > k N\) and \(w_{n+K}(\xi ) \le w_{n + k N}(\xi )\). \(\square \)
Proof of Proposition 7.2
We will show for any \(q > 0\) that there is a constant Q such that \(s_n < \varepsilon \) for all \(0< \varepsilon < 1\) and \(n > Q \varepsilon ^{-q}\). One can see that this condition implies that \(\{s_n\}\) decays rapidly. Let D and \(\gamma \) be as in the above lemma and choose an integer \(\ell > \gamma / q\). As \(w_0(\xi )\) decays rapidly in \(\xi \), there is C such that \(w_0(\xi ) < C \xi ^{-\ell }\) for all \(\xi \) in the domain. For a given \(\varepsilon > 0\):
-
let \(a > 0\) be such that \(C a^{-\ell } = \varepsilon \),
-
let j be the smallest integer such that \(e^{-j} < \varepsilon \), and
-
let K be the smallest integer such that \(K > D a^\gamma \).
Now consider a frequency \(\xi \). If \(\xi > a\), then \(w_{j K}(\xi ) \le w_0(\xi ) < \varepsilon \). If instead \(\xi \le a\), then \(K > D \xi ^\gamma \) which implies \(w_{j K}(\xi ) \le e^{-j} w_0(\xi ) < \varepsilon \). Together, these imply that \(s_n < \varepsilon \) for all \(n > j K\). Since \(a^\gamma = \frac{1}{C} \varepsilon ^{-\gamma /\ell }\) and \(q > \gamma /\ell \), there is a constant Q such that
holds uniformly for all \(\varepsilon \). \(\square \)
Proposition 7.4
Suppose A and \(\xi _0\) are positive constants, \(0< \alpha < \frac{1}{2}\), and \(\{w_n\}\) is a decreasing sequence of nonnegative functions of the form \(w_n :(0, \xi _0] \rightarrow [0, 1]\). If \(w_{n+k}(\xi ) \le 4(1 - A \xi ^{2})^k w_n(\xi )\) for all \(\xi \), k and n, then the sequence \(\{s_n\}\) defined by
decays rapidly in n.
Proof
If \(n^{-\alpha } \le \xi \), then \(w_n(\xi ) \le 4(1 - A \xi ^2)^n \le 4\exp (- n A \xi ^2) \le 4\exp (- n^{1-2\alpha } A)\), and, since \(1 - 2\alpha > 0\), one can show that \(4\exp (- n^{1-2\alpha } A)\) decays rapidly in n. \(\square \)
Propositions 7.2 and 7.4 are enough to establish rapid mixing in the setting of skew products over one-sided shifts. However, to handle two-sided shifts, we will need the following more technical results.
Proposition 7.5
Let A, B, \(\beta \), \(\xi _0\) and \(\theta \) be positive constants. Let \(v_{n,m} :[\xi _0, \infty ) \rightarrow [0, \infty )\) be a collection of functions defined for all integers \(n \ge 0\) and \(m \ge 0\) and let \(w :[\xi _0, \infty ) \rightarrow [0, \infty )\) be a bounded function. Suppose for all \(n,m \ge 0\) and \(\xi \ge \xi _0\) that
-
(1)
\(v_{n,m}(\xi ) \le v_{n+1,m}(\xi )\),
-
(2)
\(v_{n+N,m}(\xi ) \le (1 - A \xi ^{-\beta })v_{n,m}(\xi )\) when \(N > B \log (\xi )\),
-
(3)
\(v_{0,m}(\xi ) \le \theta ^{-m} w(\xi )\), and
-
(4)
\(w(\xi )\) decays rapidly in \(\xi \).
Then, for any \(c > 0,\) the sequence \(\{t_n\}\) defined by
decays rapidly in n.
Proof
As \(w(\xi )\) is bounded, we may without loss of generality assume that \(w(\xi ) \le 1\) for all \(\xi \). Define \(w_n(\xi ) = \sup _m \theta ^m v_{n,m}(\xi )\). One can verify that \(\{w_n\}\) satisfies the hypotheses of Proposition 7.2. Hence \(\sup _\xi w_n(\xi )\) decays rapidly in n, meaning that for a given \(\ell \), there is C such that \(v_{n,m}(\xi ) < C \theta ^{-m} n^{-\ell }\) for all m and n. If \(m < c \log (n)\), then
From this, one can see that \(\{t_n\}\) decays rapidly in n. \(\square \)
Proposition 7.6
Let A, \(\beta \), \(\xi _0\) and \(\theta \) be positive constants, and \(0< \alpha < \frac{1}{2}\). Let \(v_{n,m} :(0, \xi _0] \rightarrow [0, \infty )\) be a collection of functions defined for all integers \(n \ge 0\) and \(m \ge 0\), and let \(w :(0, \xi _0] \rightarrow [0, \infty )\) be a bounded function. Suppose for all \(n,m,k \ge 0\) and \(\xi \le \xi _0\) that
-
(1)
\(v_{n,m}(\xi ) \le v_{n+1,m}(\xi )\),
-
(2)
\(v_{n+k,m}(\xi ) \le 4(1 - A \xi ^{2})^k v_{n,m}(\xi )\),
-
(3)
\(v_{0,m}(\xi ) \le \theta ^{-m} w(\xi )\).
Then for any \(c > 0,\) the sequence \(\{t_n\}\) defined by
decays rapidly in n.
Proof
This follows from Proposition 7.4 using the proof of Proposition 7.5.
\(\square \)
8 Proof of Theorem 4.5
This section is devoted to the proof of Theorem 4.5.
Let \(\psi \in \mathscr {L}^+\) and \(\Phi \in \mathscr {G}^+\) be given, and fix \(k \in {\mathbb {N}}\) and \(0< \alpha < 1/2\). Recall that the good global observable \(\Phi \) defines a complex measure \(\eta _x\) for each \(x \in X\) and that there is a uniform constant \(M = \Vert \Phi \Vert _{\mathscr {G}^+}\) such that \( \Vert \eta _x \Vert _{{{\,\mathrm{TV}\,}}} \le M\) for all \(x \in X\). The Fourier transform of the good local observable \(\psi \) is a function of the form \({\widehat{\psi }} :X \times {\mathbb {R}}\rightarrow {\mathbb {C}}\) where, for each frequency \(\xi \), the function \({\widehat{\psi }}_\xi :X \rightarrow {\mathbb {C}}\) defined by \({\widehat{\psi }}_\xi (x) = \widehat{\psi (x)}(\xi )\) is Hölder and lies in \(\mathscr {F}^+_\theta \).
By Proposition 4.6, we have
We will estimate the correlations by splitting the frequencies \(\xi \in {\mathbb {R}}\) into the cases \(\xi = 0\), \(0< |\xi | < n^{-\alpha }\), and \(|\xi | > n^{-\alpha }\). In fact, we only consider \(\xi \ge 0\) as the estimates for \(\xi < 0\) are analogous.
The proof of Theorem 4.5 follows from the next three lemmas.
Lemma 8.1
There exist constants \(C >0\) and \(0< \delta <1\) (depending from L, as given in Lemma 6.3) such that
for all n.
Proof
Recalling that \(\mathcal {L}_0 = L\) is the transfer operator associated to \(\sigma \), we have
By Lemma 6.3, there exists a constant \(C>0\) and \(0<\delta <1\) such that
where we used that \({\widehat{\psi }}_0(x) = \int _{\mathbb {R}}\psi (x,r) \mathop {}\!\mathrm {d}r\). Hence, by Lemmas 3.6 and 4.2, we conclude
\(\square \)
Lemma 8.2
We have that
for all n.
Proof
Since, by Lemma 4.2, we have \(\Vert \mathcal {L}_\xi ^n {\widehat{\psi }}_\xi \Vert _\infty \le \Vert {\widehat{\psi }}_\xi \Vert _\infty \le {{\,\mathrm{Max}\,}}_0(\psi )\), we obtain
which settles the proof. \(\square \)
Lemma 8.3
The sequence
decays rapidly in n.
Proof
For each n, define a function \(w_n :[0, \infty ) \rightarrow [0, \infty )\) by
By Lemma 6.2, \(w_n\) is a decreasing sequence of functions, and by Lemma 4.2, \(w_0(\xi )\) is a bounded function which (in the notation of Sect. 7) decays rapidly in \(\xi \). Up to rescaling \(\psi \), we may freely assume that \(w_0\) takes values in [0, 1].
Proposition 6.6 implies that \(w_n\) restricted to \((0,\kappa ]\) satisfies the hypotheses of Proposition 7.4. We then fix \(\xi _0 = \kappa \), so that Proposition 6.1 implies that \(w_n\) restricted to \([\xi _0, \infty )\) satisfies the hypotheses of Proposition 7.2. Hence, the sequence defined by
decays rapidly in n. Note that \( \Vert \mathcal {L}_\xi ^n {\widehat{\psi }}_\xi \Vert _\infty \le \Vert \mathcal {L}_\xi ^n {\widehat{\psi }}_\xi \Vert _H\) and so, for each n,
As \(\mu \) is a probability measure, it follows that
where M is the uniform bound on \( \Vert \eta _x \Vert _{{{\,\mathrm{TV}\,}}}\). \(\square \)
9 From Accessibility to Collapsed Accessibility
In this section, we relate the notion of accessibility for a skew product F as in (3) to the property of collapsed accessibility defined in Sect. 4.
For a two sided shift \(\sigma :\Sigma \rightarrow \Sigma \), let X be the corresponding one sided shift and let \(\pi :\Sigma \rightarrow X\) be the projection. Note that \(\pi \) is a continuous, surjective, open map. We also write \(x^+\) for \(\pi (x).\)
For \(x \in \Sigma \), define \(W^s_0(x) = \pi ^{-1} \pi (x)\). In other words, \(y \in W^s_0(x)\) if and only if \(x^+ = y^+\). For \(n \in {\mathbb {Z}}\), define \(W^s_n(x) = \sigma ^{-n} W^s_0( \sigma ^n x)\) and note that
is an increasing sequence whose union is \(W^s(x)\). For a subset \(U \subset \Sigma \), write
Lemma 9.1
If \(U \subset \Sigma \) is open, then \(W^s_n(U)\) is open for all n. If \(K \subset \Sigma \) is compact, then \(W^s_n(K)\) is compact for all n.
Proof
Since \(\pi \) is an open map, \(W^s_0(U) = \pi ^{-1} \pi (U)\) is an open set. Since \(\sigma ^n\) is a diffeomorphism \(W^s_n(U) = \sigma ^{-n}(W^s_0(\sigma ^n U))\) is also open. A similar proof holds for compact sets. \(\square \)
Lemma 9.2
For points x and y in \(\Sigma \) and \(n \in {\mathbb {Z}},\) the following are equivalent:
-
(1)
\(y \in W^s_n(x)\),
-
(2)
\({\text {dist}}(\sigma ^{n+k} x, \sigma ^{n+k} y) \le \theta ^k\) for all \(k \ge 0\).
Proof
One can show that each of these conditions is equivalent to the sequences of symbols for x and y satisfying \(x_m = y_m\) for all \(m \ge n\). \(\square \)
Instead of projecting onto the future \(x \mapsto x^+\), we can analogously project onto the past \(x \mapsto x^-\). Define local unstable manifolds by \(y \in W^u_0(x)\) if and only if \(x^- = y^-\), and for \(n \in {\mathbb {Z}}\) define \(W^u_n(x) = \sigma ^n W^s_0( \sigma ^{-n} x)\). Analogous versions of the above lemmas hold for these manifolds.
Let us now consider the skew product (3). Writing \(p = (x,s)\) and \(q = (y,t)\), we define local stable manifolds by \(p \in W^s_n(q)\) if and only if \(p \in W^s(q)\) and \(x \in W^s_n(y)\). Define local unstable manifolds analogously. For points p and q and an integer \(n > 0\), a us-N-path from p to q is a sequence
such that \(n \le N\) and for each \(0 \le k < n\) either \(p_{k+1} \in W^s_N(p_k)\) or \(p_{k+1} \in W^u_N(p_k)\).
For a point p, define \(AC_N(x)\) by \(q \in AC_N(x)\) if and only if there is a us-N-path from p to q. Note that \(AC_N(x)\) form an increasing sequence whose union is AC(p).
For a subset \(U \subset \Sigma \times {\mathbb {R}},\) define
Lemma 9.3
If \(U \subset \Sigma \times {\mathbb {R}}\) is open, then \(AC_N(U)\) is open for all \(n \ge 0\). If \(K \subset \Sigma \times {\mathbb {R}}\) is compact, then \(AC_N(K)\) is compact for all \(n \ge 0\).
Proof
This follows directly from Lemma 9.1. \(\square \)
Proposition 9.4
Let K be a compact subset of \(\Sigma \times {\mathbb {R}}\) such that \(\overline{{\text {int}}(K)} = K\). If \(p \in \Sigma \times {\mathbb {R}}\) is such that \(K \subset AC(p)\), then there is N such that \(K \subset AC_N(p)\).
Proof
Since \(AC_N(p)\) is an increasing sequence of compact sets and K is a Baire space, there is \(N_1\) such that \(AC_{N_1}(p)\) contains a non-empty open subset \(U \subset K\). Since \(AC_N(U)\) is an increasing sequence of open sets whose union contains the compact set K, there is \(N_2\) such that \(K \subset AC_{N_2}(U)\). Then \(K \subset AC_{N_1+N_2}(p)\). \(\square \)
We have the following result.
Proposition 9.5
Let \(f :X \rightarrow {\mathbb {R}}\) be a Lipschitz function. If the skew product
is accessible, then f has the collapsed accessibility property.
Proof
Let \(K = \Sigma \times [0,1]\). By Proposition 9.4, there is a uniform constant N such that \(AC_N(p)\) contains K for any \(p \in K\). With N fixed, let \(x \in X, t \in [0,1],\) and \(n \ge 1\) be given. Then there is a sequence of points \( p_1, q_1, p_2, q_2, \ldots , q_m, p_{m+1} \) such that
-
(1)
\(m \le N, p_1 = F^{N-n}(x,0)\), and \(p_{m+1} = F^{N-n}(x,t)\);
-
(2)
\(p_k \in W^s_N(q_k)\); and
-
(3)
\(p_{k+1} \in W^u_N(q_k)\).
Applying \(F^{N-n}\) to this sequence, we define \((a_k, s_k) = F^{N-n}(p_k)\) and \((b_k, t_k) = F^{N-n}(q_k)\) which satisfy \(b_k \in W^s_n(a_k)\) and \(a_{k+1} \in W^u_{2N-n}(b_k)\).
As \((a_k, s_k)\) and \((b_k, t_k)\) are on the same stable manifold in \(\Sigma \times {\mathbb {R}}\), it follows that
By the unstable analogue of Lemma 9.2, one can show that \(d(b_k, a_{k+1}) \le \theta ^{n-2n+1}\). That is, \(d(b_k, a_{k+1}) \le C \theta ^n\) where \(C = \theta ^{1-2N}\). One can then check that \(x_k = a_k^+\) and \(y_k = b_k^+\) satisfy all of the conditions in the definition of collapsed accessibility. \(\square \)
10 Proof of Theorem 3.9
We now prove Theorem 3.9. The strategy of the proof is to reduce the problem to the setting of Theorem 4.5.
10.1 Step 1: f Only Depends on Future Coordinates
Let us start with a preliminary step: we show that we can assume that the function f in (3) only depends on the future coordinates. From [41, Proposition 1.2], we inherit the following result.
Lemma 10.1
There exist \(h \in \mathscr {F}_{\sqrt{\theta }}\) and \(f^+ \in \mathscr {F}^+_{\sqrt{\theta }}\) such that \(f = f^+ + h - h\circ \sigma \).
When reducing to a one-sided shift, we will encounter some loss in regularity as in the previous lemma: the functions h and \(f^+\) are Holder with exponent 1/2. We can however replace \(\theta \) with \(\sqrt{\theta }\) in the definition of the distance \(d_\theta \) to make them Lipschitz. We remark that this is not an issue, and we will freely replace \(\theta \) with a suitable choice that makes the functions Lipschitz.
For any \(\Phi \in \mathscr {G}\) and \(\psi \in \mathscr {L}\), using Lemma 10.1, we can write
Let us define \(\Phi _h(x,r) = \Phi (x, r-h(x))\) and \(\psi _h(x,r)=\psi (x,r-h(x))\). We change variable \(s=r+h(x)\) and we get
where the skew product \(F_1\) is defined by \(F(x,r) = (\sigma x, r + f^+(x))\). The map \(H(x,r) = (x,r+h(x))\) used in the change of variable above is a conjugacy between F and \(F_1\), namely \(H \circ F = F_1 \circ H\). Moreover, H is uniformly continuous (more precisely, it is Lipschitz with respect to the distance \(d_{\sqrt{\theta }}\), exactly as h), hence it preserves stable and unstable manifolds. In particular, \(F_1\) is accessible.
The initial claim follows from the following lemma, whose proof is contained in “Appendix C.”
Lemma 10.2
With the notation above, \(\psi _h \in \mathscr {L}\) with \(\nu (\psi _h) = \nu (\psi )\), and \(\Phi _h \in \mathscr {G}\) with \(\nu _{{{\,\mathrm{av}\,}}}(\Phi _h) = \nu _{{{\,\mathrm{av}\,}}}(\Phi )\). Moreover, for every \(x \in \Sigma \), we have \(|(\eta _h)_x| = |\eta _x|\), where \(\widehat{(\eta _h)_x} = \Phi _h(x)\) and \(\widehat{\eta _x}=\Phi (x)\).
10.2 Step 2: Observables Only Depend on Future Coordinates
In the previous subsection, we have seen that we can assume that \(f=f^+ \in \mathscr {F}^+_{\theta }\) (up to replacing \(\theta \) with \(\sqrt{\theta }\)). We now show that we can replace the observables \(\Phi = \Phi _h \in \mathscr {G}\) and \(\psi = \psi _h \in \mathscr {L}\) with observables in \(\mathscr {G}^+\) and in \(\mathscr {L}^+\) respectively: this is the content of Proposition 10.3. The proof follows the same lines as in [18]; however in our case there are some additional difficulties in showing that the functions defined belong to \(\mathscr {G}^+\) and \(\mathscr {L}^+\). In particular, we will need to use the assumption (TC) to ensure some compactness property in \(\mathscr {A}\). We postpone the proof to “Appendix C.”
Proposition 10.3
Let \(\Phi \in \mathscr {G}\) and \(\psi \in \mathscr {L}\). There exist constants \(K, M(\Phi ) \ge 0\), sequences \(\{\Phi _m\}_{m \in {\mathbb {N}}} \subset \mathscr {G}^+\), \(\{\psi \}_{m \in {\mathbb {N}}} \subset \mathscr {L}^+\), and, for every \(\ell \in {\mathbb {N}}\), there exist constants \(M(\psi , \ell )\) and \(L(\psi , \ell )\) such that the following properties hold for all \(\ell , m, n \in {\mathbb {N}}\) and \(x \in X\):
-
(i)
\(\nu _{{{\,\mathrm{av}\,}}}(\Phi _m)=\nu _{{{\,\mathrm{av}\,}}}(\Phi )\) and \(\nu (\psi _m) = \nu (\psi )\),
-
(ii)
\(\Vert \Phi \circ F^m(x, \cdot ) - \Phi _m(x,\cdot ) \Vert \le M(\Phi ) \theta ^m\), and \(\Vert \Phi (x)\Vert \le M(\Phi )\),
-
(iii)
\({{\,\mathrm{Max}\,}}_\ell (\psi _m) \le M(\psi , \ell )\) and \({{\,\mathrm{Lip}\,}}_\ell (\psi _m) \le \theta ^{-m} L(\psi , \ell )\),
-
(iv)
\(|{{\,\mathrm{cov}\,}}(\Phi \circ F^n,\psi ) - {{\,\mathrm{cov}\,}}(\Phi _m \circ (F^+)^n,\psi _m)| \le K \theta ^m\).
From Proposition 9.5, it follows that the function \(f^+\) in the definition of the one-sided skew product \(F^+\) has the collapsed accessibility property.
10.3 Step 3: End of the Proof
We are now ready to prove Theorem 3.9. Let \(\Phi \in \mathscr {G}\) and \(\psi \in \mathscr {L}\), and fix \(k \in {\mathbb {N}}\) and \(0<\alpha <1/2\). Consider the sequence of functions \(\{\psi \}_{m \in {\mathbb {N}}} \subset \mathscr {L}^+\) given by Proposition 10.3. By Lemma 4.2, their Fourier transforms \((\widehat{\psi _m})_\xi \) satisfy
If we define a function \(w :(0, \infty ) \rightarrow [0, \infty )\) by \(w(\xi ) = \sup _m \theta ^{m}\Vert (\widehat{\psi _m})_\xi \Vert _H\), then these estimates imply that \(w(\xi )\) decays rapidly in \(\xi \) in the sense of Sect. 7. We further define functions \(v_{n,m}:(0, \infty ) \rightarrow [0, \infty )\) by \(v_{n,m}(\xi )=\Vert \mathcal {L}_\xi (\widehat{\psi _m})_\xi \Vert _H\), and we notice that \(v_{n,m}\) and w satisfy the hypotheses of Proposition 7.5 and Proposition 7.6. Consequently, for any \(c>0\), the sequence \(\{t_n\}_{n \in {\mathbb {N}}}\) defined by
decays rapidly in n.
Fix \(n \in {\mathbb {N}}\) and let m be the largest integer such that \(m < c \log (n)\), where \(c = k/(-\log (\theta ))\); in particular
By Proposition 10.3-(iv), we get
hence it suffices to bound the first summand in the right-hand side above. By Proposition 4.6, we have
The last summand in the right-hand side above is bounded by \(M t_n\), hence decays rapidly. The first term, by Lemma 8.1, is bounded by
which decays rapidly as well. Finally, for the second term, Lemma 8.2 implies
In order to coclude the proof of Theorem 3.9, it suffices to establish the following lemma.
Lemma 10.4
With the notation above, for any \(r>0\) we have
Proof
Note that the measure associated to \((\Phi \circ F^m)(x,\cdot )\) is \(e^{-i \xi f_m(x)} \mathop {}\!\mathrm {d}\eta _{\sigma ^m x}(\xi )\), whose variation is \(|\eta _{\sigma ^m x}|\). Let us denote \(R= (-r,r)\setminus \{0\} \subset {\mathbb {R}}\). Then, by Proposition 10.3-(ii),
\(\square \)
Notes
Recall that if one chooses the potential as the \(\det (DA|_{E_s})\), one recovers the usual SRB measure.
We recall that the space of almost periodic functions is the closure of the space of trigonometric polynomials with respect to the uniform norm.
References
Aaronson, J.: An Introduction to Infinite Ergodic Theory. Mathematical Surveys and Monographs, vol. 50. American Mathematical Society, Providence (1997)
Araujo, V., Melbourne, I.: Exponential decay of correlations for nonuniformly hyperbolic flows with a \(\mathscr {C}^{1+\alpha }\) stable foliation, including the classical Lorenz attractor. Ann. Henri Poincaré 17(11), 2975–3004 (2016)
Araujo, V., Butterley, O., Varandas, P.: Open sets of axiom A flows with exponentially mixing attractors. Proc. Am. Math. Soc. 144(7), 2971–2984 (2016)
Berger, A., Siegmund, S.: On the gap between random dynamical systems and continuous skew products. J. Dyn. Differ. Equ. 15(2–3), 237–279 (2003)
Bobkov, S.G.: Proximity of probability distributions in terms of their Fourier–Stieltjes transforms. Russ. Math. Surv. 71(6), 1021–1079 (2016)
Bogoliubov, N.N.: Problems of a dynamical theory in statistical physics. Stud. Stat. Mech. 1, 1–118 (1962)
Bowen, R.: Markov partitions for axiom A diffeomorphisms. Am. J. Math. 92, 725–747 (1973)
Bowen, R.: Symbolic dynamics for hyperbolic flows. Am. J. Math. 95, 429–460 (1973)
Bowen, R.: Markov partitions are not smooth. Proc. Am. Math. Soc. 71(1), 130–132 (1978)
Bowen, R.: Equilibrium States and the Ergodic Theory of Anosov Diffeomorphisms. Lecture Notes in Mathematics, vol. 470. Springer, Berlin (2008). ISBN 978-3-540-77605-5/pbk
Butterley, O., Eslami, P.: Exponential mixing for skew-products with discontinuities. Trans. Am. Math. Soc. 369(2), 783–803 (2017)
Bonanno, C., Giulietti, P., Lenci, M.: Infinite mixing for one-dimensional maps with an indifferent fixed point. Nonlinearity 31(11), 5180–5213 (2018)
Burns, K., Pugh, C., Shub, M., Wilkinson, A.: Recent results about stable ergodicity. Smooth ergodic theory and its applications. In: Proceedings of the AMS Summer Research Institute, Seattle, WA, USA, July 26–August 13 (1999)
Burns, K., Wilkinson, A.: On the ergodicity of partially hyperbolic systems. Ann. Math. 171, 51–489 (2010)
Butterley, O., War, K.: Open sets of exponentially mixing Anosov flows. J. Eur. Math. Soc. (JEMS) 22(7), 2253–2285 (2020)
Cirilo, P., Lima, Y., Pujals, E.: Ergodic properties of skew-products in infinite measure. Israel J. Math. 241(1), 43–66 (2016)
Cornfeld, I.P., Fomin, S.V., Sinai. Y.G.: Ergodic theory. Grundlehren der Mathematischen Wissenschaften, vol. 245. Springer, New York, Heidelberg, Berlin (1982)
Dolgopyat, D.: On mixing properties of compact group extensions of hyperbolic systems. Israel J. Math. 130, 157–205 (2002)
Dolgopyat, D., Lenci, M., Nandori, P.: Global observables for random walks: law of large numbers. Preprint arXiv:1902.11071 (2018)
Dolgopyat, D., Nandori, P.: Infinite measure mixing for some mechanical systems. Preprint arXiv:1812.01174 (2018)
Dolgopyat, D., Nandori, P.: Infinite measure renewal theorem and related results. Bull. Lond. Math. Soc. 51(1), 145–167 (2019)
Dolgopyat, D., Nandori, P., Pene, F.: Asymptotic expansion of correlation functions for \({{\mathbb{Z}}}^d\) covers of hyperbolic flows. Preprint arXiv:1908.11504 (2019)
Franks, J.: Anosov diffeomorphisms. In: Global Analysis (Proceedings of Symposia in Pure Mathematics). American Mathematical Society, Providence (1968)
Galatolo, S., Rousseau, J., Saussol, B.: Skew products, quantitative recurrence, shrinking targets and decay of correlations. Ergod. Theory Dyn. Syst. 35, 1814–1845 (2015)
Ghil, M., Lucarini, V.: The physics of climate variability and climate change. Preprint arXiv:1910.00583 (2019)
Gouezel, S.: Limit theorems in dynamical systems using the spectral method. Proc. Symp. Pure Math. 89, 161–193 (2015)
Gouezel, S.: Correlation asymptotics from large deviations in dynamical systems with infinite measure. Colloq. Math. 125, 193–212 (2011)
Guivarc’h, Y.: Propriétés ergodiques, en mesure infinie, de certains systèmes dynamiques fibrés. Ergod. Theory Dyn. Syst. 9(3), 433–453 (1989)
Hirsch, M., Palis, J., Pugh, C., Shub, M.: Neighborhoods of hyperbolic sets. Invent. Math. 9, 121–134 (1969/70)
Hirsch, M., Pugh, C., Shub, M.: Invariant Manifolds. Lecture Notes in Mathematics, vol. 583. Springer, Berlin (1977)
Hopf, E.: Ergodentheorie. Springer, Berlin (1937)
Kato, T.: Perturbation theory for linear operators. Grundlehren der Mathematischen Wissenschaften, vol. 132. Springer, Berlin (1984)
Krickeberg, K.: Strong mixing properties of Markov chains with infinite invariant measure. In: 1967 Proceedings of Fifth Berkeley Symposium on Mathematical Statistics and Probability (Berkeley, California, 1965/66), Vol. II: Contributions to Probability Theory, Part 2, pp. 431–446. University of California Press, Berkeley (1967)
Lenci, M.: On infinite volume mixing. Commun. Math. Phys. 298(2), 485–514 (2010)
Lenci, M.: Uniformly expanding Markov maps of the real line: exactness and infinite mixing. Discrete Contin. Dyn. Syst. 37(7), 3867–3903 (2017)
Liflyand, E., Samko, S., Trigub, R.: The Wiener algebra of absolutely convergent Fourier integrals: an overview. Anal. Math. Phys. 2, 1–68 (2012)
McCoy, B.M.: Advanced Statistical Mechanics. In: International Series of Monographs on Physics, vol. 146. Oxford Univ. Press, New York, NY (2010). https://doi.org/10.1093/acprof:oso/9780199556632.001.0001
Melbourne, I.: Mixing for invertible dynamical systems with infinite measure. Stoch. Dyn. 15(02), 1550012 (2015). https://doi.org/10.1142/S0219493715500124
Melbourne, I., Terhesiu, D.: Operator renewal theory and mixing rates for dynamical systems with infinite measure. Invent. Math. 189, 61–110 (2012)
Melbourne, I., Terhesiu, D.: Renewal theorems and mixing for non Markov flows with infinite measure. Ann. Inst. Henri Poincaré Probab. Stat. 56(1), 449–476 (2020)
Parry, W., Pollicott, M.: Zeta functions and the periodic orbit structure of hyperbolic dynamics. Asterisque 187–188 (1990)
Pène, F.: Mixing and decorrelation in infinite measure: the case of the periodic Sinai billiard. Ann. Inst. Henri Poincaré Probab. Stat. 55(1), 378–411 (2019)
Ratner, M.: Markov partitions for Anosov flows on \(n\)-dimensional manifolds. Israel J. Math. 15, 92–114 (1973)
Roy, A.: Symbolic dynamics and Markov partitions. Bull. Am. Math. Soc. 35(1), 1–56 (1998)
Reed, M., Simon, B.: Methods of Modern Mathematical Physics. II. Fourier Analysis, Self-adjointness. Academic Press, New York (1975)
Ruelle, D.: Thermodynamic formalism. The mathematical structures of equilibrium statistical mechanics, 2nd edn. (English). In: Cambridge Mathematical Library. Cambridge University Press, Cambridge (2004). ISBN 0-521-54649-4/pbk
Rodriguez-Hertz, F., Rodriguez-Hertz, M., Ures, R.: Accessibility and stable ergodicity for partially hyperbolic diffeomorphisms with 1D-center bundle. Invent. Math. 172, 353–381 (2008)
Sarig, O.: Introduction to the Transfer Operator Method. Winter School on Dynamics (2020). Available on the author webpage https://www.weizmann.ac.il/math/sarigo/lecture-notes
Trigub, R.M., Bellinsky, E.S.: Fourier Analysis and Approximation of Functions. Springer, Berlin (2004)
Acknowledgements
P.G. acknowledges the support of the Centro di Ricerca Matematica Ennio de Giorgi and of UniCredit Bank R&D group through the “Dynamics and Information Theory Institute” at the Scuola Normale Superiore. This research was partially funded by the Australian Research Council. We would like to thank Oliver Butterley, Dima Dolgopyat, Marco Lenci, Federico Rodriguez Hertz, Omri Sarig, and Khadim War for several useful discussions.
Funding
Open access funding provided by Universitá di Pisa within the CRUI-CARE Agreement.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Dmitry Dolgopyat.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A. Accessibility and Symbolic Dynamics
Let \(A :M \rightarrow M\) be a diffeomorphism and let \(\Omega \subset M\) be a transitive uniformly hyperbolic subset. One can construct a Markov partition on \(\Omega \) and use it to define symbolic dynamics. That is, there is a subshift of finite type \(\sigma :\Sigma \rightarrow \Sigma \) and a continuous surjective map \(\pi :\Sigma \rightarrow \Omega \) such that \(A \circ \pi = \pi \circ \sigma \).
Let \(f :M \rightarrow {\mathbb {R}}\) be a continuous function which defines a skew product
on \(M \times {\mathbb {R}}.\) This function then also defines a “symbolic skew product”
on \(\Sigma \times {\mathbb {R}}\) where \(f_{\text {sym}}:\Sigma \rightarrow {\mathbb {R}}\) is given by \(f_{\text {sym}}= f \circ \pi \).
Let \(u :\Omega \rightarrow {\mathbb {R}}\) be a Hölder continuous function, and let \(\mu = \mu _u\) be the unique equilibrium state for u. We want to obtain a quantitative mixing result for F with respect to \(\nu = \mu \times {{\,\mathrm{Leb}\,}}\) by applying Theorem 3.9 to the symbolic skew product \(F_{\text {sym}}\). In this “Appendix,” we discuss how the classes of good local and global observables and the accessibility property translate from the original system to the symbolic counterpart.
1.1 A.1. The Observables
The classes of good local and global observables on \(M \times {\mathbb {R}}\) we consider are defined, respectively, as Hölder functions \(\psi :M \rightarrow \mathscr {S}\) from M to the space of Schwartz functions \(\mathscr {S}\) and Hölder functions \(\Phi :M \rightarrow \mathscr {A}\) from M to the Fourier–Stieltjes algebra \( \mathscr {A}\) such that the tightness condition (TC) is satisfied (with \(x \in \Sigma \) replaced by \(p \in M\)). If we equip the symbolic system with the invariant measure \(\nu _{\text {sym}}= \mu _{u \circ \pi } \times {{\,\mathrm{Leb}\,}}\), where \(\mu _{u \circ \pi }\) is the Gibbs measure with potential \(u \circ \pi \), we obtain the following lemma.
Lemma A.1
If \(\psi \) is a good local observable on \(M \times {\mathbb {R}}\), the function \(\psi _{\text {sym}} := \psi \circ \pi \) is a good local observable for the symbolic system. Similarly, if \(\Phi \) is a good global observable on \(M \times {\mathbb {R}}\), then \(\Phi _{\text {sym}}:= \Phi \circ \pi \) is a good global observable for the symbolic system. Moreover,
where the reference measure is \(\nu \) on the left and \(\nu _{\text {sym}}\) on the right-hand side.
Proof
For any choice of \(\theta \in (0,1)\), the semiconjugacy \(\pi :\Sigma \rightarrow \Omega \) is Hölder continuous with respect to the distance \(d_{\theta }\) on \(\Sigma \) (see, e.g., [10, Lemma 4.2]). Hence, given any \(\psi \) and \(\Phi \) as in the statement, the functions \(\Phi _{\text {sym}}\) and \(\psi _{\text {sym}}\) are Hölder continuous and, up to taking a larger \(\theta \), they are actually Lipschitz. This shows that they are good local and global observables for the symbolic system.
By [10, Theorem 4.1], the equilibrium state \(\mu = \mu _u\) can be expressed as the push-forward \(\mu _u = \pi ^{*} \mu _{u \circ \pi }\). The final claim follows immediately from the semiconjugacy between \(F_{\text {sym}}\) and A. \(\square \)
1.2 A.2. Accessibility
In order to apply Theorem 3.9 to \(F_{\text {sym}}\), one needs to address the following question.
Question A.2
If \(F|_{\Omega \times {\mathbb {R}}}\) is accessible, does it follow that \(F_{\text {sym}}\) is accessible?
At first glance, the question might seem easy to answer, but there are some subtle issues here. As \(\pi \) is uniformly continuous, if points x and y lie on the same stable manifold in \(\Sigma \), then they project down to points \(\pi (x)\) and \(\pi (y)\) lying on the same stable manifold in M. However, one can construct examples where \(\pi (x)\) and \(\pi (y)\) lie on the same stable manifold, but \(\sigma ^n(x)\) and \(\sigma ^n(y)\) stay far apart for all \(n \in {\mathbb {Z}}\). Hence, not all us-paths in M lift to us-paths in \(\Sigma \). Despite this, we can establish accessibility of \(F_{\text {sym}}\) in certain settings. We will also discuss the difficulties involved in the general case later in this “Appendix.”
We will adopt the notation used in Bowen’s book on the subject [10]. In particular recall that if p and q are in the same rectangle, then [p, q] is the intersection of the local stable manifold of p with the local unstable manifold of q. If \(x = \{x_n\}\) and \(y = \{y_n \}\) are elements of the symbolic dynamics with the same “zeroth” symbol \(x_0 = y_0,\) then \([x, y] = z = \{ z_n \}\) is defined by \(z_n = x_n\) for \(n \ge 0\) and \(z_n = y_n\) for \(n \le 0\) and one can show that \(\pi ([x, y]) = [\pi (x), \pi (y)]\).
For p and q in \(\Omega \), if \(q \in W^s_A(p)\), define \(\Delta ^s(p,q) = \sum _{n=0}^\infty f(A^n q) - f(A^n p)\) and note that \((p,s) \in W^s_F(q,t)\) if and only if \(t - s = \Delta ^s(p,q)\). If \(q \in W^u_A(p),\) define \(\Delta ^u(p,q)\) analogously. For p and q in the same rectangle, define
That is, h(p, q) measures the height of the “Brin quadrilateral” that has p and q as two of its four vertices. Note that h is continuous and if q is on the local stable or manifold manifold of p, then \(h(p,q) = 0\).
Define \(\Delta ^s_{\text {sym}}, \Delta ^u_{\text {sym}},\) and \(h_{\text {sym}}\) using the same formulas, but with \(f_{\text {sym}}\) in the place of f. Then on the cylinder \(C_i \subset \Sigma \) consisting of the sequences whose zeroth symbol corresponds to \(R_i\), the function \(h_{\text {sym}}:C_i \times C_i \rightarrow {\mathbb {R}}\) is continuous and \(h_{\text {sym}}(x,y) = h(\pi (x), \pi (y))\).
Proposition A.3
If \(p \in R_i\) and \(\gamma :[0,1] \rightarrow R_i\) is a continuous curve such that \(h(p, \gamma (0))\) is zero and \(h(p, \gamma (1))\) is non-zero, then \(F_{\text {sym}}\) is accessible.
Proof
Since h and \(\gamma \) are continuous, \(I = h(p, \gamma ([0,1]))\) is a positive length interval containing zero. For any \(t \in [0,1],\) using the properties of symbolic dynamics we may find elements \(x, y \in C_i\) such that \(\pi (x) = p\) and \(\pi (y) = \gamma (t)\). This implies that \(h_{\text {sym}}(C_i \times C_i)\) contains I. From this, one can show that \(F_{\text {sym}}\) has an open accessibility class and then use this to conclude that \(F_{\text {sym}}\) is accessible. \(\square \)
Corollary A.4
If \(A :M \rightarrow M\) is an Anosov diffeomorphism and \(\Omega = M\), then for a \(\mathscr {C}^1\)-open and \(\mathscr {C}^r\)-dense \((1 \le r < \infty )\) set of choices of \(f :M \rightarrow {\mathbb {R}}\), the corresponding \(F_{\text {sym}}\) is accessible.
Proof
Choose a rectangle \(R_i\) in a partition. Since A is Anosov and \(R_i\) has interior, there is a periodic point p in the interior of \(R_i\). Adapting the proof of the “Unweaving Lemma,” that is [47, Lemma A.4.3], we can make a small perturbation to any starting f in a small neighborhood in order to find a point q with \(h(p,q) \ne 0\). We can then define a path \(\gamma :[0,1] \rightarrow R_i\) from p to q and apply Proposition A.3. \(\square \)
Corollary A.5
If \(A :{\mathbb {T}}^2 \rightarrow {\mathbb {T}}^2\) is an Anosov diffeomorphism, then F is accessible if and only if \(F_{\text {sym}}\) is accessible.
Proof
Here, A is topologically conjugate to a linear map [23], and so we assume A itself is linear. In this setting, we can find a Markov partition where the interior of each rectangle is homeomorphic to a disc and its boundary consists of two stable curves and two unstable curves. For such a construction, see for instance [44, Section 8]. If \(F_{\text {sym}}\) is not accessible, then for each rectangle \(R_i\) the stable and unstable directions of F are jointly integrable inside the region \(R_i \times {\mathbb {R}}\subset M \times {\mathbb {R}}\). This region is therefore foliated by \(\mathscr {C}^1\) surfaces (with boundary) tangent to \(E^s\oplus E^u\). As the rectangles meet along stable and unstable curves, we can “glue together” the leaves of neighboring rectangles to produce a foliation on all of \(M \times {\mathbb {R}}\) tangent to \(E^s\oplus E^u\). \(\square \)
Remark A.6
The proof above is specific to the 2-torus. Bowen showed in higher dimensions that the boundaries of the rectangles are not smooth [9]. There are several slightly different constructions of Markov partitions [7, 8, 10, 43]. In none of these constructions is connectedness of the rectangles discussed and it is not clear from the constructions if a rectangle is connected when the diffeomorphism is Anosov. Even if the construction begins with connected rectangles, complicated topological manipulations are necessary to turn this into a family rectangles satisfying all of the axioms of a Markov partition. It is not clear how these steps affect connectedness.
It may conceivably be the case that the interior of a rectangle \(R_i\) consists of infinitely many connected components. The union \(U = \bigcup _i {\text {int}}R_i\) of the interiors of all of the rectangles would then be an open and dense set with infinitely many connected components. Our reasoning above shows that, if \(F_{\text {sym}}\) is not accessible, then the height function h(x, y) is constant when x and y are in the same connected component of U. Suppose a small neighborhood V of a point \(p \in M\) intersects infinitely many connected components of U. Then the function \(V \rightarrow R, \ q \mapsto h(p,q)\) may be a Cantor function or a “Devil’s staircase” which is constant on an open and dense subset of V despite not being constant on all of V.
Here, the regularity of the stable and unstable foliations is important, as Cantor functions are possible with Hölder regularity, but not with Lipschitz regularity. Keeping p as above fixed, consider a point q near p. We analyze what happens as we move q along its unstable manifold until it intersects the stable manifold of p. In other words, we take q tending toward [p, q] while holding both p and [p, q] constant. In the formula defining h(p, q), the term \(\Delta ^u(p,[p,q])\) will then be constant. The term \(\Delta ^u([p,q],q)\) will depend in a \(\mathscr {C}^1\) manner on the point q since we are a moving along a single unstable manifold and this single manifold is \(\mathscr {C}^1\) regular. The term \(\Delta ^u([q,p],p)\) also depends in a \(\mathscr {C}^1\) manner on the point q for the same reason. The problematic term is \(\Delta ^s(q,[q,p]).\) As q moves along its path toward [p, q], it passes through different stable leaves. As a consequence, \(\Delta ^s(q,[q,p]),\) which measures the change in “height” of stable segments on \(M \times {\mathbb {R}},\) is only as regular as the stable foliation on \(M \times {\mathbb {R}}.\)
In general, the stable foliation is only Hölder continuous [30] and so we cannot make further conclusions. In certain settings, however, we know that the stable foliation is \(\mathscr {C}^1\) and we can conclude that \(\Delta ^s(q,[q,p])\) is \(\mathscr {C}^1\) and that its derivative is uniformly bounded. From this one can show that the function h is Lipschitz in the sense that h(x, y) is bounded by a constant times d(x, y). The pathology of Cantor functions may therefore be ruled out. Further, the \(\mathscr {C}^1\) nature of the stable foliations allows one to establish a property known as “uniform non-integrability” or UNI which is a quantitative form of accessibility and can be used to establish exponential mixing. This is, more or less, the argument applied in [2, 3, 15] where they establish exponential mixing in a setting where stable foliation is \(\mathscr {C}^1.\)
We now consider accessibility in the setting of hyperbolic attractors.
Corollary A.7
If \(A :M \rightarrow M\) is an Axiom A diffeomorphism and \(\Omega \subset M\) is an attractor or repellor, then for a \(\mathscr {C}^1\)-open and \(\mathscr {C}^r\)-dense (\(1 \le r < \infty \)) set of choices of \(f :M \rightarrow {\mathbb {R}}\), the corresponding \(F_{\text {sym}}\) is accessible.
Proof
We will assume \(\Omega \) is an attractor so that for each \(p \in \Omega \), the unstable manifold \(W^u_A(p)\) is a connected immersed submanifold lying entirely within \(\Omega \).
As shown in [29], there is a neighborhood of \(\Omega \) on which one can define invariant stable and unstable foliations. We may find a periodic point p and a small neighborhood U of p, such that every unstable manifold intersects U in a path-connected set.
As in the proof of Corollary A.4, one can adapt the “Unweaving Lemma” to perturb f and find p and q with \(h(p,q) \ne 0\). We define \(\gamma :[0,1] \rightarrow R_i\) to be a path from [p, q] to q and then apply Proposition A.3. \(\square \)
Appendix B. Proofs of Lemmas 3.1, 3.2, 3.6, and 3.8
1.1 B.1. Proof of Lemma 3.1
Assume that f is cohomologous to zero, namely there exists a measurable function w such that \(f= w \circ \sigma - w\). By Livs̆ic Theorem, we can assume that w is continuous. We claim that for any \(x \in \Sigma \), all the points that can be reached by an su-path from \((x, w(x)) \in \Sigma \times {\mathbb {R}}\) are contained in the graph of w, i.e. in the set \(\mathcal {G}(w):= \{ (y,w(y)): y \in \Sigma \}\), which will give a contradiction with the accessibility assumption.
Let \((x,w(x)) \in \mathcal {G}(w)\); we now show that the whole stable set \(W^s(x,w(x))\) is fully contained in \(\mathcal {G}(w)\). Let \((y,s) \in W^s(x,w(x))\); then, \(y \in W^s(x)\) and
that is \(s=w(y)\). This implies \(W^s(x,w(x)) \subset \mathcal {G}(w)\). An analogous argument shows that \(W^u(x,w(x)) \subset \mathcal {G}(w)\).
1.2 B.2. Proof of Lemma 3.2
For any fixed \(R>0\), by the invariance of the Gibbs Measure with respect to the dynamics, we have
therefore
The last term above converges to zero for \(R \rightarrow \infty \), hence the limit
exists and equals \(\nu _{{{\,\mathrm{av}\,}}}(\Phi )\).
1.3 B.3. Proof of Lemma 3.6
By definition, we can write
We have to show that the limit (4) exists; in order to do this, we prove that for any \(x \in \Sigma \) we have
from which the claim follows. For any \(R >0\),
We have that \(|\sin (R\xi )/(R\xi )|\le 1\) (and \(1 \in L^1( |\eta _x| )\), since the total variation of \(\eta _x\) is finite) and \(\sin (R\xi )/(R\xi )\) converges to 0 for all \(\xi \ne 0\). Lebesgue theorem yields (13), which completes the proof.
1.4 B.4. Proof of Lemma 3.8
Let us show that for all \(\varepsilon >0\) we have
Then, simply by choosing \(\varepsilon = 2/K\), we conclude
Fix \(\varepsilon >0\); then
Since \( \sin (x)/x \le \max \{1,1/|x|\}\) for all \(x \in {\mathbb {R}}\), we get
Clearly, \( {\mathbb {R}}\setminus \left[ -\frac{2}{\varepsilon },\frac{2}{\varepsilon } \right] = \left\{ \xi \in {\mathbb {R}}: 1 - \frac{1}{|\varepsilon \xi |} > \frac{1}{2}\right\} \), thus, by Chebyshev inequality,
which proves the initial claim.
Appendix C. Proofs of Lemma 10.2 and Proposition 10.3
1.1 C.1. Proof of Lemma 10.2
In this proof, we will encounter another loss in regularity, namely we will need to replace \(\theta \) with a different one to make the observables Lipschitz: again, this is not an issue and we will freely do so.
Let \(\psi \in \mathscr {L}\) and define \(\psi _h(x,r) = \psi (x, r - h(x))\), where \(h \in \mathscr {F}_{\theta }\). By definition, it is clear that \(\psi _h\) is well-defined from \(\Sigma \) to \(\mathscr {S}\), and, by the Fubini–Tonelli Theorem, \(\nu (\psi ) = \nu (\psi _h)\). The only thing to check is that it is a Lipschitz map.
Let \(a,\ell \) be non negative integers, and fix \(x,y \in \Sigma \). We need to estimate
By definition, for any \(r \in {\mathbb {R}}\), we have
Let us consider the two summands separately. For the first one, we have
for some \(|u| \le h(x)-h(y) \le 2\Vert h\Vert _\infty \). If \(|r|\le 4\Vert h\Vert _\infty \), we have
otherwise, if \(|r| > 4\Vert h\Vert _\infty \), then
In both cases, the first term satisfies a Lipschitz bound, independent of r. For the second term, if \(|r|\le 4\Vert h\Vert _\infty \), then
and the Lipschitz bound follows from the assumption on \(\psi \); otherwise, if \(|r| > 4\Vert h\Vert _\infty \), then
and again the conclusion follows from the assumption on \(\psi \). This concludes the proof of the claims on \(\psi \).
Let now \(\Phi \!\in \! \mathscr {G}\). Then, for any \(x \!\in \! \Sigma \), the function \(\Phi _h(x)(r) \!=\! \Phi (x, r\!-\!h(x))\) is the Fourier–Stieltjes transform of the measure \(\mathop {}\!\mathrm {d}(\eta _h)_x = e^{i \xi h(x)}\mathop {}\!\mathrm {d}\eta _x\). In particular, the variation are the same \(|(\eta _h)_x| = |\eta _x|\) and, from Lemma 3.6, it also follows that \(\nu _{{{\,\mathrm{av}\,}}}(\Phi _h) = \nu _{{{\,\mathrm{av}\,}}}(\Phi )\). Again, the only claim left to be shown is the Lipschitz assumption. We will exploit here the tail condition (TC).
Fix \(x,y \in \Sigma \). Then,
The second summand in the right-hand side above satisfies a Lipschitz bound by assumption. We need to verify for the first term. If \(h(x) = h(y)\), the term is 0 and the proof is complete. Assume \(h(x) \ne h(y)\) and let \(I = [-|h(x) - h(y)|^{-1/2}, |h(x) - h(y)|^{-1/2}]\). Then we have
the first term can be bound by
and the second by
hence the proof follows from the fact that h is Lipschitz (again, up to possibly replacing \(\theta \) with \(\theta ^{a/2}\) if \(a<2\)).
1.2 C.2. Proof of Proposition 10.3
This section is devoted to the proof of Proposition 10.3. The strategy follows the same lines as in [18] and [41, Proposition 1.2], although there are additional difficulties in showing that the functions \(\Phi \) and \(\psi \) belong to \(\mathscr {G}^+\) and \(\mathscr {L}^+\) respectively.
Let \(\alpha :\Sigma \times {\mathbb {R}}\rightarrow {\mathbb {R}}\) be any Lipschitz function, with Lipschitz constant \(L(\alpha )\). Fix \(m \in {\mathbb {N}}\). The first step is to prove that there exists a function \(\beta :\Sigma \times {\mathbb {R}}\rightarrow {\mathbb {R}}\) such that the function
depends only on the future coordinates. In other words, we want to show that \(\alpha \circ F^m\) is cohomologous to a function defined on \(X \times {\mathbb {R}}\). We recall the construction of \(\beta \) for the reader’s convenience. For any cylinder \(\mathscr {C}_{n,j} := \mathscr {C}_{-n,o}(x^j)\), choose an element \(\omega ^{n,j} \in \mathscr {C}_{n,j}\). Define the element \(\omega ^n(x) \in \Sigma \) in the following way:
Notice that by definition \(d_{\theta }(\omega ^n(x), x)\le \theta ^n\) for all \(x \in \Sigma \). Define
Since \(\left\Vert \delta _n\right\Vert _{\infty } \le |f|_{\theta } (1-\theta )^{-1} \theta ^n\), we have
for a constant \(C_\theta = 1 + |f|_{\theta } (1-\theta )^{-1}\). Define
which, by (14), is well-defined as a function from \(\Sigma \times {\mathbb {R}}\rightarrow {\mathbb {R}}\). Then, we can define
and from its definition, it is clear that \(\alpha ^+\) depends only on the future coordinates; namely, it follows that \(\alpha \circ F^m\) is cohomologous to a function defined on \(X \times {\mathbb {R}}\).
We will show in Lemmas C.1 and C.2 that if \(\alpha \) belongs to \(\mathscr {L}\) or \(\mathscr {G}\), then \(\alpha ^+\) belongs to \(\mathscr {L}^+\) or \(\mathscr {G}^+\) respectively. Assuming these claims, let us first finish the proof, namely we show (i) and (iii). There exist \(\Phi ^+ \in \mathscr {G}^+\) and \(\psi ^+ \in \mathscr {L}^+\) which are cohomologous to \(\Phi \circ F^m\) and \(\psi \circ F^m\) respectively. In particular, we have
Moreover, from the definitions of \(\psi ^+\) and \(\Phi ^+\) and from (14), it follows that
and similarly for \(\psi \). Therefore, by invariance of the measure \(\nu _u\) under \(F^m\), we obtain
which concludes the proof.
Lemma C.1
If \(\alpha \in \mathscr {L}\), then \(\alpha ^+ \in \mathscr {L}^+\). For any \(\ell \in {\mathbb {N}}\), there exist constants \(M(A,\ell )\) and \(L(A,\ell )\) depending on \(\alpha \) and \(\ell \) only such that \({{\,\mathrm{Max}\,}}_\ell (\alpha ^+) \le M(\alpha ,\ell )\) and \({{\,\mathrm{Lip}\,}}_\ell (\alpha ^+) \le \theta ^{-m}L(\alpha ,\ell )\).
Proof
In order to prove the result, it is sufficient to show that for every \(x \in \Sigma \), the function \(\beta (x, \cdot )\) is Schwartz and the functions \(x \mapsto \partial ^\ell \beta (x,\cdot )\) are lipschitz with respect to the \(L^1\)-norm.
Since, for every \(x \in \Sigma \) and \(\ell \in {\mathbb {N}}\), the series
converges uniformly, the derivative \(\partial ^\ell \beta (x,\cdot )\) exists and equal the series above. We now show that \(\beta (x,\cdot )\) is a Schwartz function for every \(x \in \Sigma \).
Let \(a, \ell \in {\mathbb {N}}\). We need to show that \(r^a\partial ^\ell \beta (x,r)\) is uniformly bounded in r. To save notation, let us write \(x_n = \sigma ^nx\), \(y_n = \omega ^n(\sigma ^n x)\) and \(\delta _n = \delta _n(\sigma ^n x)\). We have
Each term in the sum above satisfies a Lipschitz bound exactly as in the proof of Lemma 10.2. Since the terms \(d(x_n,y_n)\) and \(\delta _n\) can be bounded by \(O(\theta ^n)\), the series above converge. Therefore, \(|r^a\partial ^\ell \beta (x,r)|\) is uniformly bounded for all \(a,\ell \in {\mathbb {N}}\), hence \(\beta (x,\cdot )\) is a Schwartz function.
The lipschitz bounds on the functions \(x \mapsto \partial ^\ell \beta (x,\cdot )\) with respect to the \(L^1\)-norm can be proved in a similar way and is left as an exercise to the reader: we remark that if \(\alpha \) is a lipschitz function with constant \(L(\alpha )\), then \(\alpha \circ F^m\) is lipschitz with constant \(L(\alpha \circ F^m) \le \theta ^{-m}L(\alpha )\). \(\square \)
We now show the analogous result if we assume that \(\alpha \) is a global observable.
Lemma C.2
If \(\alpha \in \mathscr {G}\), then \(\alpha ^+ \in \mathscr {G}^+\), and \(\Vert \alpha ^+\Vert _{\mathscr {G}^+} \le \Vert \alpha \Vert _\mathscr {G}\).
Proof
It suffices to show that for any \(x \in \Sigma \), \(\beta (x, \cdot )\) is the Fourier–Stieltjes transform of a complex measure \(\eta _x\) and moreover the total variation \(\Vert \eta _x\Vert _{{{\,\mathrm{TV}\,}}}\) is uniformly bounded.
By definition,
Fix \(x \in \Sigma \) and again let us denote \(x_n = \sigma ^nx\), \(y_n = \omega ^n(\sigma ^n x)\) and \(\delta _n = \delta _n(\sigma ^n x)\). For \(N\ge m\), let \(\zeta _N = \zeta _N^{(1)}+\zeta _N^{(2)} \) be the complex measure defined by
From the expression for \(\beta \) above, by definition, the Fourier–Stieltjes transforms \(\widehat{\zeta _N}\) of \(\zeta _N\) converge uniformly to \(\beta (x,\cdot )\). In order to conclude that \(\beta (x, \cdot )\) is the Fourier–Stieltjes transform of a complex measure, it is enough to show that the family of measures \(\zeta _N\) is contained in a weakly compact set. We will proceed as in the proof of Lemma 10.2 and use the tightness condition (TC).
We will show this separately for \(\zeta _N^{(1)}\) and \(\zeta _N^{(2)}\). For \(\zeta _N^{(1)}\), by the lipschitz assumption on \(\alpha \), we have
The total variation norm is stronger that the weak-convergence topology, hence the sequence of measures \(\zeta _N^{(1)}\) converges weakly (since the tails are exponentially small). For the second term, we apply Prokhorov theorem: it suffices to show that the sequence \(\zeta _N^{(2)}\) is uniformly bounded in total variation norm and is tight. Notice that the variation of \(\zeta _N^{(2)}\) can be bounded by
Using the tightness condition (TC),
This shows that \(\zeta _N^{(2)}\) is uniformly bounded in total variation. Similarly we prove tightness: let us fix \(\varepsilon >0\), and let \(K = [-\varepsilon ^{-1/a}, \varepsilon ^{-1/a}] \). Then,
This proves tightness and hence concludes the proof. \(\square \)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Giulietti, P., Hammerlindl, A. & Ravotti, D. Quantitative Global-Local Mixing for Accessible Skew Products. Ann. Henri Poincaré 23, 923–971 (2022). https://doi.org/10.1007/s00023-021-01102-8
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00023-021-01102-8