A couple of days ago (on November 12th, 2014 to be more precise), Giovanni Forni gave a talk at the “flat seminar / séminaire plat” on the ergodicity of billiards on non-rational polygons, and, by following the suggestion of two friends, I will transcript in this post my notes from Giovanni’s talk.
[Update (November 20, 2014): Some phrases near the statement of Theorem 3 below were edited to correct an inaccuracy pointed out to me by Giovanni.]
Let be a polygon with sides and denote by its interior angles.
The billiard flow associated to is the following dynamical system. A point-particle in follows a linear trajectory with unit speed until it hits the boundary of . At such an instant, the point-particle is reflected by the boundary of (according to the usual laws of a specular reflection) and then it follows a new linear trajectory with unit speed. (Of course, this definition makes no sense at the corners of , and, for this reason, we leave the billiard flow undefined at any orbit going straight into a corner)
The phase space of the billiard flow is naturally identified with the three-dimensional manifold : indeed, we need an element of to describe the position of the particle and an element of the unit circle to describe the velocity vector of the particle.
Alternatively, the billiard flow associated to can be interpreted as the geodesic flow on a sphere with a flat metric and conical singularities (whose cone angles are ) with non-trivial holonomy (see Section 2 of Zorich’s survey): roughly speaking, one obtains this flat sphere with conical singularities by taking two copies of (one on the top of the other), gluing them along the boundaries, and by thinking of a billiard flow trajectory on as a straight line path going from one copy of to the other at each reflection.
This interpretation shows us that billiard flows on polygons are a particular case of geodesic flows on the unit tangent bundle of compact flat surfaces whose subsets of conical singularities were removed.
Remark 1 In the case of a rational polygon (i.e., are rational multiples of ), it is often a better idea (see this survey of Masur and Tabachnikov) to take several copies of obtained by applying the finite group generated by the reflections through the sides of and then glue by translation the pairs of parallel sides of the resulting figure. In this way, one obtains that the billiard flow associated to is equivalent to translation (straightline) flow on a translation surface (an object that has trivial holonomy and, hence, is more well-behaved that a flat metric on with conical singularities) and this partly explains why the Ergodic Theory of billiards on rational polygons is well-developed. However, let us not insist on this point here because in what follows we will be mostly interested in billiard flows on irrational polygons.
A basic problem concerning the dynamics of billiards flows on polygons, or, more generally, geodesic flows on flat surfaces with conical singularities is to determine whether such a dynamical system is ergodic.
In view of Remark 1, we can safely skip the case of rational polygons: indeed, this setting one can use the relationship to translation surfaces to give a satisfactory answer to this problem (see the survey of Masur and Tabachnikov for more explanations). So, from now on, we will focus on billiard flows associated to non-rational polygons.
Kerckhoff, Masur and Smillie proved in 1986 that the billiard flow is ergodic for a -dense subset of polygons. Their idea is to consider the -dense subset of “Liouville polygons” admitting fast approximations by rational polygons (i.e., the subset of polygons whose interior angles admit fast approximations by rational multiples of ). Because the ergodicity of the billiard flow on rational polygons is well-understood, one can hope to “transfer” this information from rational polygons to any “Liouville polygon”.
Remark 2 The -dense subset of polygons constructed by Kerckhoff, Masur and Smillie has zero measure: indeed, this happens because they require the angles to be “Liouville” (i.e., admit fast approximations by rational multiples of ), and, as it is well-known, the subset of Liouville numbers has zero Lebesgue measure.
A curious feature of the argument of Kerckhoff, Masur and Smillie is that it is hard to extract any sort of quantitative criterion. More precisely, it is difficult to quantify how fast the quantities must be approximated by rationals in order to ensure that the ergodicity of the billiard flow on the corresponding polygon. This happens because the genera of translation surfaces associated to the rational polygons approximating usually tend to infinity and it is a non-trivial problem to control the ergodic properties of translation flows on families of translation surfaces whose genera tend to infinity.
Nevertheless, Vorobets obtained in 1997 (by other methods) a quantitative version of Kerckhoff, Masur and Smillie by showing the ergodicity of the billiard flow on a polygon whose interior angles verify the following fast approximation property: there exist arbitrarily large natural numbers such that
for some rational numbers , , with denominators , .
In summary, the works of Kerckhoff-Masur-Smillie and Vorobets allows to solve the problem of ergodicity of the billiard flow on Liouville polygons.
Of course, this scenario motivates the question of ergodicity of billiard flows on Diophantine polygons (i.e., the “complement” of Liouville polygons consisting of those which are badly approximated by rational polygons).
In his talk, Giovanni announced a new criterion for the ergodicity of the billiard flow on polygons (and, more generally, the geodesic flow on a flat surface with conical singularities) with potential applications to a whole class (of full measure) of Diophantine polygons.
Before stating Giovanni’s results, let us introduce some notation. Consider a flat surface with a finite subset of conical singularities (e.g., obtained by reinterpretation of the billiard flow on a polygon). The infinitesimal structure of the unit tangent bundle is described by vector fields:
- is the generator of the geodesic flow;
- is the “perpendicular geodesic flow”;
- is the generator of the rotation on the circle fibers of .
These vector fields satisfy the following commutation relations:
- (because is a flat surface, and, hence, has zero curvature);
Note that the knowledge of allows us to recover the natural Riemannian metric on induced by the flat structure on : indeed, is completely determined by the fact that is an orthonormal frame.
By analogy with the case of rational polygons (see this survey of Masur), we would like to apply renormalization methods to get an ergodicity criterion for the geodesic flow on based on the properties of the renormalization dynamics.
Logically, a naive implementation of this idea does not work: the Teichmüller geodesic flow on the moduli space of flat surfaces with arbitrary conical singularities has poor dynamical behavior (in comparison with the case of rational polygons) because these moduli spaces are usually very big and, for example, this is a serious obstruction to any recurrence property of the corresponding Teichmüller flow (which is a key ingredient in the so-called Masur’s ergodicity criterion).
Nevertheless, Giovanni noticed that one can still implement this renormalization method by introducing the following deformations of (playing the role of “fake Teichmüller geodesic flow”):
for . By declaring that the vector fields form an orthonormal frame, we obtain a Riemannian metric on .
Remark 3 Note that , and satisfy the following commutation relations:
Furthermore, the volume of is . In particular, as , we see that and , i.e., is very close to a Heisenberg group as (i.e., its geometry becomes nilpotent in the limit). In particular, we see that the deformations of do not exhibit any sort of recurrence property (in whatever moduli space they live).
Remark 4 In the definition of , resp. , the scaling factors of , resp. , for , resp. are motivated by direct analogy with the Teichmüller geodesic flow. On the other hand, the scaling factor for is more subtle to explain: Giovanni said that he found this scaling (which is convenient for his ergodicity criterion of billiards on polygons) from an analytical argument (see Remark 9 below). Also, Giovanni observed that, a posteriori, this scaling is “justified” from the dynamical point of view because the orbits of the geodesic flow of stay fairly close (i.e., they do not “diverge”) after applying the deformation , and, in particular, one has nice “rectangles” of heights and width (and, as it turns out, the presence of such nice rectangles is an important ingredient in Masur’s ergodicity criterion for rational polygons). However, he insisted that this “dynamical justification” was not the initial motivation to define (but rather the arguments from Analysis sketched below).
In this setting, Giovanni’s ergodicity criterion for geodesic flows on flat surfaces (such as billiard flows on polygons) is:
Theorem 1 (Forni) Let be a flat surface with a finite subset of conical singularities. Suppose that there exist a subset with positive lower density (i.e., ) and a real number such that for each and one can find a connected subset with the following properties:
- (i) for all , where denotes the Cheeger constant of with respect to (see below for the definitions);
- (ii) uniformly on .
Then, the geodesic flow on is ergodic.
Remark 5 Recall that the Cheeger constant of a domain with respect to a Riemannian metric on is
where and are the connected components of .
Intuitively, Giovanni’s ergodicity criterion can be thought as saying that if we can find a suitable subset of good renormalization times in the sense that the complement of “adequate small neighborhoods” of the subset of conical singularities has bounded geometry (i.e., a controlled Cheeger constant, cf. the condition (i) above) and almost full volume (cf. the condition (ii) above), then we can exploit these renormalization times to conclude the ergodicity of the geodesic flow.
Remark 6 For the sake of comparison with the case of rational polygons/translation surfaces, let us observe that for a translation surface (with flat metric ) one has
where is a constant depending only on the genus of and denotes the systole of (that is, the length of the shortest saddle connection). In particular, since the systole of a translation surface on a compact region of the moduli space admits an uniform lower bound, the analog of the condition (i) in Giovanni’s ergodicity criterion in the setting of translation surfaces is satisfied by most translation surfaces thanks to the recurrence properties of the Teichmüller geodesic flow (that is, of the deformation , and ).
Remark 7 Still for the sake of comparison, it is worth to observe that after more recent works of Cheung-Eskin and Treviño we know that the ergodicity criterion can be substantially improved in the context of translation surfaces: indeed, one can ensure the ergodicity (and even unique ergodicity) of the flow generated by whenever the systole of the flat metric associated to the Teichmüller deformation , (and ) verifies the non-integrability condition
(Note that this non-integrability condition is automatic for recurrent Teichmüller deformations as for such deformations the quantity admit uniform lower bounds on a countable family of disjoint subintervals of definite sizes) Evidently, these results of Cheung-Eskin and Treviño motivate the following question: is it possible to weaken the condition (i) in Theorem 1 in order to allow Cheeger constants that could approach slowly (maybe in a similar spirit of the non-integrability condition above)? In fact I asked this question to Giovanni after his talk and he pointed out that it is not very clear that this possible with his current argument because of the subtle nature of the proof of the estimate (1) appearing below (especially the estimate of the term ).
Before discussing some elements of the proof of Theorem 1, let us quickly comment on the potential applications of Giovanni’s ergodicity criterion. At first sight, it is not obvious at all how to decide whether a given polygon with interior angles (or, more generally, a flat surface with conical singularities with cone angles ) verify the requirements of Theorem 1 (especially the condition (i)).
In this direction, even though Giovanni said that he has not fully checked his arguments yet, Giovanni is confident that the following Diophantine conditions on are sufficient to apply his ergodicity criterion.
Theorem 2 (Forni (in progress)) Let be a polygon with sides and interior angles .Denote by (see Remark 8 below for the reason why we exclude ). Suppose that satisfies the following Diophantine conditions:
- (1) there exists a constant such that for all
- (2) there exists a constant such that for all (non-trivial) integer vectors one has
Then, the conditions (i) and (ii) in Theorem 1 hold, and, a fortiori, the billiard flow on is ergodic.
Even though we are not going to sketch the proof of Theorem 2 today, let us now make two comments on the Diophantine conditions (1) and (2).
First, these conditions do not seem totally independent (even though it is not easy to figure out their relationship): for example, for , the condition (2) becomes , that is, for all , and this latter condition resembles the condition (1).
Secondly, the condition (1) is a full Lebesgue measure condition on only for . In other terms, one can use Theorem 2 to deduce the ergodicity of the billiard flow on almost every polygon with sides, but the analogous statement for the case of triangles remains still open.
Closing this post, let us give a brief sketch of the proof of Giovanni’s ergodicity criterion (Theorem 1).
The argument starts in the same way as in Giovanni’s proof of the spectral gap property (“”) for the Lyapunov exponents of the Kontsevich-Zorich cocycle via variational formulas for the Hodge norm (in Section 2 of this paper here). More concretely, we consider the foliated Cauchy-Riemann operators
associated to the deformation . (We said “foliated” because the distribution is integrable in and are the usual and along the leaves of this foliation)
Next, given a -function , we consider its decomposition
in terms of the image and the kernel of the Cauchy-Riemann operators . (Here, there is a subtle point: contrary to the case of translation surfaces, it is not known that the image of is closed; in particular, one should replace and by adequate elements in the closure of the images of , but we will skip this technical detail by pretending that the decomposition above can always be made)
Recall that, under the assumptions of Theorem 1, our task is to show that the geodesic flow is ergodic, that is, we want to show that any real -function with (i.e., is invariant) is actually constant.
For this sake, by mimicking the proof of Lemma 2.1′ of his paper, Giovanni shows the following variational formula:
From this formula, we can deduce that as , (where is the subset of positive lower density of “good renormalization times”, cf. the statement of Theorem 1). Indeed, since is obtained by orthogonal projection of with respect to the (closure of the) image of , we have that is uniformly bounded for all . By plugging this information into the variational formula above, we obtain that
for all and the claim that as , follows.
In other terms, we have just shown that converges (in ) to as , .
Next, we observe that the functions are harmonic (since are meromorphic, resp. anti-meromorphic), and, thus, we can apply Cauchy’s estimate to obtain that
where is the gradient in the metric associated to the deformation , and is a -neighborhood of in (that is, is essentially equal to the subset by condition (ii) of Theorem 1).
Using the facts that has “bounded geometry” (by condition (i) of Theorem 1), and as , , we (
get that is constant along the leaves of the foliation associated to ) see that one is getting closer to show that is constant.
Nevertheless, the information obtained in the previous paragraph is not quite sufficient to conclude that is constant because the leaves of the foliation associated to (sometimes called Loch Ness monsters in the flat surfaces literature, see, e.g, this paper here) might not have bounded geometry. For this reason, Giovanni needs also
At this point, it remains only to control the behavior of in the -direction. Here, after replacing by an adequate truncation of its Fourier series in the -direction still called by a slight abuse of notation, Giovanni told us (without giving the proof because he ran out of time) that a computation based on arguments from Harmonic Analysis reveals that
Because , the bounded geometry condition (i) in Theorem 1 allows us to conclude that (
is also constant along the -direction. Therefore, we deduce that) is constant on , and, hence the geodesic flow (generated by ) is ergodic (so that the sketch of proof of Theorem 1 is complete).
Remark 9 As we mentioned in Remark 4 above, Giovanni’s choice of deformation in the -direction was purely guided by the arguments from Harmonic Analysis in the proof of Theorem 3 which “impose” the factor of in his control of the growth of .