Posted by: matheuscmss | May 4, 2009

Brazilian president Luis Inacio Lula da Silva visits IMPA

In last April 15 (two weeks ago), the Brazilian president Luis Inacio “Lula” da Silva visited IMPA – Instituto de Matematica Pura e Aplicada to discuss some aspects of mathematical research and teaching in Brazil (this was the first time a Brazilian president visited IMPA). The reader can find several photos of this event here.

Being both a Brazilian and a former student at IMPA (where I got my PhD), I’m happy to see that the Brazilian government is really paying the deserved attention to the fundamental sciences, specially Mathematics (although there are still several weak points concerning the distribuition of permanent positions in public universities, etc.)

In any case, I enjoyed a lot Lula’s speech while I got a little upset with some demagogic phrases of Sergio Cabral, the governor of the state of  Rio de Janeiro, who said:

“… Impa already organized a Math. Olympiad focused to private schools and what the president did? He moves IMPA, a world-wide recognized mathematical institution, from its academic pedestal to show it the real life of the Brazilian population by realizing the OBMEP [Math. Olympiads for Public Schools]…”

Let me say that you can see a resume (in Portuguese) of Lula and Cabral speechs here. Also, let me make two comments about the three points I marked (in bold) in Cabral’s phrase

  • concerning the organization of Math. Olympiads for private schools: firstly, IMPA’s main goal is the mathematical research and the training of PhD students; of course, IMPA supported other mathematical activities such as Math. Olympiads by providing physical space, etc., but evidently this isn’t its main objective; secondly, the Math. Olympiad organized at IMPA (called OBM – Brazilian Math. Olympiad) is completely open: anyone is able to make the inscription to participate the competitions; however, the fact that most of the “successful” students (meaning the ones making their way to the IMO) comes from private schools has nothing to do with the Cabral’s claim that OBM is focused on private schools! It has to do with the simple fact that public schools are fairly weak in Brazil!
  • concerning “academic pedestal” and “real life”: this is a repetition of the classical comparision between Mathematics and an Ivory Tower -  the image of an austere science closed into itself, etc., so that I think this absurd point doesn’t need any further explanations.

Finally, let me mention that Lula’s visit to IMPA takes place in an important moment for the Brazilian mathematical community: for instance, Brazilian mathematicians are consolidating their presence in the international community as the invitation of my friends Fernando Coda Marques and Artur Avila (two young Brazilian mathematicians) to give talks in the ICM 2010 confirms (Fernando was invited for the Geometry Session and Artur was invited for a Plenary Talk).

Hi! Firstly, let me apologize for the 1 month and half hiatus: while I plan to close up my series on the talks and courses of the Dynamical Systems conference (held at ICTP, Italy), I wasn’t able to do so during April because I was learning some interesting facts about square-tiled surfaces and Teichmuller dynamics from my new friends Pascal Hubert and Anton Zorich. Of course, I plan to share these facts in some future posts, although I can’t promise to do so in the next few months. On the other hand, I can give a clue to the curious reader by providing a copy of the slides of my recent talk at Orsay entitled: Sur le cocycle de Kontsevich-Zorich au-dessus de deux origamis spéciaux.

Warning. While the title is written in French (and the actual talk was my third presentation speaking in French), the slides are written in English (because I don’t feel comfortable to write in French yet).

Before giving the link to the slides, let me make a brief description of the talk.

The basic objective was the discussion of the Teichmuller geodesic flow and the Kontsevich-Zorich cocycle. Nowadays, after the works of Forni (2002), Avila and Viana (2006), we know that the Kontsevich-Zorich conjecture is true: the Lyapunov spectrum of the Kontsevich-Zorich (KZ) cocycle with respect to the absolutely continuous SL(2,\mathbb{R})-invariant ergodic probability is non-uniformly hyperbolic (i.e., 0 doesn’t belong to the spectrum) and simple (i.e., all exponents have multiplicity one). In other words, the ergodic behavior of generic points with respect to the Lebesgue measure is more or less well-understood.

However, one can follow Veech and pose the question: what about the ergodic behavior of non-generic points? In this direction, Forni (2005) and Forni, Matheus (2008) showed that there are examples of orbits with totally degenerate Kontsevich-Zorich  spectrum (see this previous post). An interesting feature of the KZ cocycle over totally degenerate orbits is the fact that it is isometric (this follows from Forni’s 2002 work). In the talk, I discussed an improvement of this fact (obtained jointly with Jean-Christophe Yoccoz): by computing the action of the group of affine diffeomorphisms on the relative homology of these examples, we are able to show that the Kontsevich-Zorich cocycle is represented by a subgroup of the Weyl group of the root system D_4. A nice consequence is the fact that the KZ cocycle acts by a finite group of rigid matrices, so that this gives a new proof of Forni and Matheus results by more explicit methods.

Finally, the slides can be found here.

Posted by: matheuscmss | March 16, 2009

R. Kaufmann’s proof of J. M. Marstrand’s theorem

After my previous post on a Fourier-analysis approach to Furstenberg’s problem, I would like to present another application of the Fourier transform of measures: a simple proof of Marstrand theorem about projections of Cantor sets. Before entering the details, let me tell you what are my motivations on this topic: besides the beauty of the Fourier-analysis approach to a relevant geometrical problem, this particular proof of Marstrand theorem was generalized by C. Moreira and J. C. Yoccoz in their work on stable intersections of regular Cantor sets so that it could be applied to the study of homoclinic tangencies (Dynamical Systems) and the Markov/Lagrange spectrum (Number Theory) (as we briefly described in this post). In particular, since I’m planning to make a series of posts around these works of C. Moreira and J. C. Yoccoz, I think that a discussion of Kaufmann’s proof of Marstrand theorem is the best place to start this project.

-Statement of Marstrand’s theorem-

Define {v_\theta=(\cos\theta,\sin\theta)\in\mathbb{R}^2}, {\theta\in\mathbb{R}} and denote by {\pi_\theta} the orthogonal projection of the plane {\mathbb{R}^2} onto the line {L_\theta=\mathbb{R}\cdot v_\theta}. In other words, if we identify {L_\theta} with the real line {\mathbb{R}} (via {x\in\mathbb{R}\mapsto x\cdot v_\theta\in L_\theta}, then, given {u=(u_1,u_2)\in\mathbb{R}^2}, we have {\pi_\theta(u)=u\cdot v_\theta = u_1\cos\theta + u_2\sin\theta}.

Theorem 1 (J. Marstrand’s theorem) Let {K\subset\mathbb{R}^2} be a subset with Hausdorff dimension {HD(K)>1}. Then, its projection {\pi_\theta(K)} has positive Lebesgue measure for (Lebesgue) almost every {\theta}.

An interesting consequence of this theorem (in the context of stable intersections of Cantor sets of the real line) is the following fact:

Corollary 2 Let {K_1} and {K_2} be two Cantor sets of the real line {\mathbb{R}} such that

\displaystyle HD(K_1)+HD(K_2)>1.

Then, {K_1-\lambda K_2} has positive Lebesgue measure for almost every {\lambda\in\mathbb{R}}.

Proof: Firstly, we recall the following well-known property of the Hausdorff dimension: {HD(K_1\times K_2)\geq HD(K_1)+HD(K_2)>1} (see K. Falconer’s book for more details). Secondly, we notice that the projection {\pi_\theta(K_1\times K_2)} of {K_1\times K_2} is {\pi_\theta(K_1\times K_2)=\cos\theta\cdot (K_1-\lambda K_2)} where {\lambda=-\tan\theta}. Since {\cos\theta\neq 0} for {-\pi/2<\theta<\pi/2} and we can parametrize {\lambda\in\mathbb{R}} via {\lambda=-\tan\theta} with {-\pi/2<\theta<\pi/2}, we get the desired corollary from Marstrand’s theorem. \Box

Now we turn to the proof of Marstrand’s theorem following the exposition of the book of Palis and Takens (pages 64, 65, 66 and 67).

Read More…

Posted by: matheuscmss | February 19, 2009

Furstenberg’s 2x, 3x (mod 1) problem

During my conversations with my friend Artur Avila around the 2x, 3x mod 1 problem and its applications, Artur explained me a cute argument (using the Fourier transform of measures) to show a weak version of Rudolph’s theorem (which is a partial answer to this problem). Of course, I believe it is a good idea to share it here, but, in order to keep this post more self-contained, I’ll not proceed directly to the argument. In fact, I’ll divide the exposition into 3 sections: the first one is a general introduction to Furstenberg problem, Rudolph’s theorem and the weak version of Rudolph’s theorem, the second section contains the proof of this weak version of Rudolph result and the third section briefly discusses Einsiedler, Katok and Lindenstrauss’ application of a (highly non-trivial) analog of Rudolph’s theorem to the so-called Littlewood conjecture (about simultaneous Diophantine approximations).

Disclaimer. While Artur’s explanation was clear and correct, any possible mistakes and errors appearing below are my responsability (of course).

-Furstenberg’s problem: measure rigidity of expanding rank-two semi-group actions-

Consider the following (uniformly expanding) dynamical system on the circle \mathbb{T}=\mathbb{R}/\mathbb{Z}:

f_d(x)=dx (\textrm{mod} \, 1),

where d\geq 2 is an integer. Despite its simple definition, f_d is rich from the dynamical point of view: f_d possesses plenty of ergodic invariant measures, e.g., the Lebesgue measure and the Dirac measures supported on its periodic points (in fact, due to the expanding features of f_d, one can use the so-called thermodynamical formalism [explained in Bowen's book] to construct a whole family of ergodic invariant measure “interpolating” the Lebesgue measure and the Dirac measures); since, by Birkhoff’s theorem, the ergodic invariant measures captures the statistical features of most orbits in its support, we see that the existence of several distinct ergodic measures for f_d indicates the presence of several distinct statistical behaviours of the orbits of f_d (which justifies the adjective “rich”). Of course, the dynamics of an individual uniformly expanding (and, more generally, hyperbolic) dynamical systems is well-understood nowadays (after the works of Anosov, Smale, Sinai, Ruelle and Bowen among several other authors) and we do not plan to discuss further this issue today. Instead, let us investigate the dynamics of the joint action of two expanding maps, say f_2 and f_3. In other words, denoting by \Sigma_d:=\{d^n: n\in\mathbb{N}\} and \Sigma_{2,3}:=\{2^n3^m: n,m\in\mathbb{N}\}, we are replacing the action

F_d:\Sigma_d\times \mathbb{T}\to\mathbb{T}, \, F_d(k,x)=k x (\textrm{mod}\, 1)

of the rank-one semi-group \Sigma_d (i.e., the individual action of f_d) by the action

F_{2,3}:\Sigma_{2,3}\times \mathbb{T}\to \mathbb{T}, \, F_{2,3}(k,x) = k x (\textrm{mod}\, 1)

of the rank-two semi-group \Sigma_{2,3} (i.e., the joint action of f_2 and f_3).

Remark. In general, a similar action by a rank-two semi-group can be obtained replacing 2, 3 by any pair n,m of multiplicatively independent non-negative integers.

The basic topological result about the dynamics of F_{2,3} is:

Theorem (H. Furstenberg). The sole infinite closed F_{2,3}-invariant subset of the circle \mathbb{T} is the circle itself.

This theorem should be constrasted with the rank-one situation: while F_{2,3} has only one closed invariant set, it is not hard to see that F_d has a lot of invariant Cantor sets. In fact, since f_d is semi-conjugated to a full Bernoulli shift with d symbols (via a codification of the orbits using the Markov partition \left[ i/d,(i+1)/d\right], i=0,\dots, k-1), any subshift gives you an invariant Cantor set. More concretely, you can take a finite collection of disjoint closed intervals I_1,\dots, I_\ell whose union doesn’t coincide with the whole circle and define K the set of points of the circle whose f_d-orbit never enters \bigcup_{1\leq j\leq\ell} \textrm{int}(I_j). Of course, K is F_d-invariant and, from the expanding features of f_d, it follows that K is a Cantor set (exercise).

In resume, Furstenberg’s result says that F_{2,3} is more rigid than F_d (dynamically speaking), which is quite unexpected because F_{2,3} is the combination of the dynamics of two strongly chaotic endomorphisms, namely, f_2 and f_3. Taking this theorem as a motivation, Furstenberg posed the following problem of the measure (i.e., ergodic-theoretical) rigidity of F_{2,3}:

Furstenberg’s 2x,3x (mod 1) problem. Is it true that the Lebesgue measure is the sole non-atomic ergodic F_{2,3}-invariant measure? Equivalently, is it true that the Lebesgue measure is the unique non-atomic ergodic measure which is both f_2 and f_3-invariant?

In other words, this problem asks whether the intersection of two enormous sets of measures (the set f_2-invariant probabilities and the set of f_3-invariant probabilities) is relatively small (namely, the Lebesgue measure and the Dirac measures supported on periodic orbits).

While Furstenberg’s problem is still open (to the best of my knowledge), some important partial results are available nowadays. In particular, let me quote the following theorem of D. Rudolph:

Theorem (D. Rudolph). Let \mu be a f_2 and f_3 invariant ergodic measure such that

\max\{h_\mu(f_2),h_\mu(f_3)\}>0.

Then, \mu is the Lebesgue measure. Equivalently, if a F_{2,3} invariant measure \mu is not the Lebesgue measure, then the semigroup

\Sigma_{2,3}(x):=\{2^n3^mx (\textrm{mod}\, 1): n,m\in\mathbb{N}\}

is a group for \mu almost every x\in\mathbb{T}.

In this statement, h_\mu(f) denotes the metric (Kolmogorov-Sinai) entropy of the f-invariant measure \mu.

Remark. The motivation of the metric entropy condition in Rudolph’s theorem comes from the fact that the metric entropy of Dirac measures supported on periodic orbits is zero. Of course, Rudolph theorem is just a partial answer to Furstenberg question because non-atomic invariant measures can have zero entropy.

While we do not pretend to give a complete proof of Rudolph’s theorem (which is not very hard but involves some amount of abstract ergodic theory), we do plan to show in the next section the following fact:

Theorem 1 (weak version of Rudolph’s theorem). Let \mu be a f_2 and f_3 invariant measure. Assume that h_\mu(f_2)>0 and \mu is f_3 ergodic. Then, \mu is the Lebesgue measure.

-Weak Rudolph theorem: proof of theorem 1-

Take \mu a probability measure verifying the assumptions of theorem 1 and let T(x)=x+1/2 be the translation of 1/2 on the circle \mathbb{T}.

Note that f_3 and T are commuting maps. In particular, since \mu is f_3 invariant, it follows that T_*\mu is f_3 invariant. Moreover, the assumption of ergodicity of \mu with respect to f_3 implies that \mu and T_*\mu are f_3-ergodic.

On the other hand, the positive entropy assumption h_\mu(f_2)>0 implies that the restriction of f_2 to the support of \mu is not invertible. Equivalently, we can always find some pairs of \mu generic points with the same image under f_2. However, such a pair of points is always permuted by the translation T. In particular, we have that the probability measures \mu and T_*\mu aren’t mutually singular.

Putting all these facts together, we have two f_3 invariant ergodic measures (\mu and T_*\mu) such that \mu and T_*\mu aren’t mutually singular. It follows that T_*\mu=\mu, i.e., \mu is T-invariant.

At this stage, one has enough information to conclude that \mu is the Lebesgue measure. Indeed, we claim that all of the non-zero Fourier modes of \mu vanish, that is,

(1) \widehat{\mu}(k):=\int_\mathbb{T} e^{ikx}d\mu(x)=0 for any k\in\mathbb{Z}-\{0\}.

In fact, let us start with the odd Fourier modes (i.e., \widehat{\mu}(k) where k\notin 2\mathbb{Z}). Using the T-invariance of \mu, we obtain

\widehat{\mu}(k) = \widehat{T_*\mu}(k) = \int e^{ikx}dT_*\mu(x) = \int e^{ikT(x)} d\mu(x) .

Since \int e^{ikT(x)} d\mu(x) = \int e^{ik(x+1/2)} d\mu(x) = e^{ik/2}\widehat{\mu}(k), we get

\widehat{\mu}(k)=e^{ik/2}\widehat{\mu}(k).

In particular, it follows that \widehat{\mu}(k)=0 whenever k\in\mathbb{Z} is odd. Finally, we can use the f_2-invariance of \mu (with a similar argument) to obtain

\widehat{\mu}(k)=\widehat{\mu}(2k) for every k\in\mathbb{Z},

that is, the even Fourier modes can be deduced from the odd ones (more precisely, writing a non-zero even number k as k=2^n m where m\neq 0 is odd, we have \widehat{\mu}(k)=\widehat{\mu}(m)). Consequently, our claim (1) about the vanishing of the Fourier modes of \mu is proved. As we know (1) forces \mu to be the Lebesgue measure, so that the proof of theorem 1 is complete.

-Measure rigidity of higher-rank hyperbolic actions: Einsiedler-Katok-Lindenstrauss theorem-

In this final section, we briefly outline a recent application of the measure rigidity of higher-rank hyperbolic actions to a partial solution of a number-theoretical problem.

A well-known fruitful interaction in Mathematics occurs between Dynamical Systems and Number Theory. Among the several applications of dynamical ideas to number-theoretical problems, one finds Margulis’ solution to Oppenheim conjecture (see e.g. this blog post of Terence Tao). Here, the basic idea is to convert the study of the values of indefinite quadratic forms in k variables (k\geq 3) into the study of the dynamics of an action of a higher-rank group H in the space SL(k,\mathbb{R})/SL(k,\mathbb{Z}) of unimodular lattices in \mathbb{R}^k. Another (more recent) application of these ideas was performed by M. Einsiedler, A. Katok and E. Lindenstrauss in the study of the so-called Littlewood conjecture:

Littlewood conjecture. For every \alpha,\beta\in\mathbb{R}, it holds

\liminf\limits_{n\to\infty} n\langle n\alpha\rangle \langle n\beta\rangle=0.

Here \langle x\rangle denotes the fractional part of x.

Again, the basic idea is to convert this problem into the study of the dynamics of the action of the group A of positive diagonal SL(k,\mathbb{R}) matrices in SL(k,\mathbb{R})/SL(k,\mathbb{R}).

However, although the basic strategy of Margulis and Einsiedler-Katok-Lindenstrauss is the same, there is a subtle and important difference: while in Margulis’ context the acting group H contains only unipotent (parabolic) elements, in Einsiedler-Katok-Lindenstrauss’s setting the acting group A contains only hyperbolic elements.

In particular, at the present moment, we have the following picture: one can completely classify the invariant measures of the H-action (by Ratner’s theorems), but one can say something interesting about the invariant measures of the A-action only when they have positive entropy. In other terms, it is “morally” more easy to show measure rigidity statements (in the spirit of Furstenberg’s problem) for the H-action than A-action because the individual elements of the group H are already parabolic, i.e., rigid. Here, the positive entropy condition for the measure rigidity statement (for the A action) is certainly motivated by Rudolph’s result.

In any case, the moral philosophy here is the following: the number-theoretical problems quoted above (Oppenheim and Littlewood conjectures) can be reduced to the complete classification of the invariant measures of adequate higher-rank group actions. Since such a complete classification is possible for the H-action, Margulis completely solved Oppenheim conjecture; but because our current technology only allows us to classify A invariant measures of positive entropy, Einsiedler-Katok-Lindenstrauss partially (almost) solved Littlewood conjecture. More precisely, they proved the following result:

Theorem (Einsiedler, A. Katok and E. Lindenstrauss). The set of (possible) exceptions (\alpha,\beta) to Littlewood conjecture has Hausdorff dimension 0. 

Closing this post, let me say that this last result deserves further explanation and I plan to discuss it a little bit more in a future post (probably after the series of posts around Trieste’s conference). By the way, the curious reader may consult (besides the original article) these expository notes of Akshay Venkatesh for a nice introduction.  Ciao!

Today we’ll close our current discussion of the standard map with the proof of Duarte’s theorem. As we mentioned earlier, the basic strategy consists into three steps: the construction of a dynamically increasing family of hyperbolic basic sets of saddle-type (“horseshoes”), the existence of a dense set of parameters where  a quadratic tangency is generically unfolded and the construction of elliptic islands from the bifurcation of quadratic tangencies via a conservative version of Newhouse phenomena. More precisely, we are going to show the statements of theorems 1, 2 and 3 of the previous post.

-An “increasing” family of hyperbolic basic sets-

In order to find hyperbolic sets, we’ll use the well-known invariant cone-field criterion:

Theorem (invariant cone-field criterion). Let f:M\to M be a C^1-diffeomorphism. Consider \Lambda a compact f-invariant set. Assume that there are \mu>1>\lambda, a decomposition T_{\Lambda}M=E\oplus F (not necessarily Df-invariant) and two families C^u\supset F, C^s\supset E of (closed) cones such that

  • Df(x)(C^u(x))\subset int(C^u(x)) and \|Df(x)v^u\|\geq\mu\|v^u\| for all x\in\Lambda and v^u\in C^u(x);
  • Df^{-1}(f(x))(C^s(f(x)))\subset int(C^s(x)) and \|Df^{-1}(f(x))v^s\|\geq\lambda^{-1}\|v^s\| for all x\in\Lambda and v^s\in C^s(f(x)).

Then, \Lambda is a hyperbolic set.

Proof. See corollary 6.4.8 of Katok-Hasselblat book. A sketch of proof goes as follows. Fix a point x\in\Lambda and consider the following two sequences of cones on the tangent space T_xM: Df^n(x)\cdot C^u(f^{-n}(x)) and Df^{-n}(x)\cdot C^s(f^n(x)). Our assumptions implies that these two sequences are nested sequences of closed cones, so that the intersections

E^u=\bigcap\limits_{n\geq 0}Df^n(x)\cdot C^u(f^{-n}(x))

and

E^s=\bigcap\limits_{n\geq 0}Df^{-n}(x)\cdot C^s(f^n(x))

are closed Df-invariant closed cones. In fact, one can work a bit more (with the facts Df(x)(C^u(x))\subset int(C^u(x)) and Df^{-1}(f(x))(C^s(f(x)))\subset int(C^s(x))) to see that E^u and E^s are vector subspaces with dimensions \dim F and \dim E resp.. Finally, once we know that E^u and E^s are Df-invariant subspaces, our assumptions of expansion (resp. contraction) of vectors belonging to the cone C^u\supset E^u (resp. C^s\supset E^s), we see that E^u is the unstable subspace and E^s is the stable subspace. This completes the sketch. \square

Proposition 1. Consider the invertible area-preserving map

f(x,y)=(-y+\phi(x),x)

of the 2-torus T^2 and let \Lambda be a f-invariant compact set. Assume that there exists \lambda>2 such that |\phi'(x)|\geq\lambda for all (x,y)\in\Lambda. Then, \Lambda is a hyperbolic set (of saddle type).

Proof. Note that

Df = \left(\begin{array}{cc}\phi'(x) & -1 \\ 1 & 0\end{array}\right)\in SL(2,\mathbb{R}).

In particular, the trace of Df verifies |tr Df|\geq \lambda>2 so that the matrices Df are uniformly hyperbolic. In fact, this follows from the fact that the constant cone-field

C^u(p)\equiv C_{a}^u:=\{(u,v)\in\mathbb{R}^2: |v|\leq a |u|\}

is an unstable cone-field whenever 1/(\lambda-1)<a<1 (note that such a choice is possible since \lambda>2). Indeed, if we write Df(u,v):=(u',v'), we see that

|v'|= |u|\leq (\lambda-a)^{-1}|\phi'(x)u-v|=(\lambda-a)^{-1}|u'|

so that Df(C_a^u)\subset C_{(\lambda-a)^{-1}}^u = C_{\theta a}^u where \theta = \left(a\cdot(\lambda-a)\right)^{-1}<1 by the choice of the parameter a, i.e., C_a^u is Df-invariant. Furthermore, denoting by \|(u,v)\|=\max\{|u|,|v|\}, we get, for any (u,v)\in C^u_a,

\|Df(u,v)\| = |u'|\geq (\lambda-a)|u| = (\lambda-a)\|(u,v)\|

with (\lambda-a)>1, i.e., Df (uniformly) expands any vector inside C^u_a. On the other hand, it is not hard to see that Df\in SL(2,\mathbb{R}) implies that the same argument can be applied to Df^{-1} in order to get a stable cone-field. Using the invariant cone-field criterion, the proof is complete. \square

An immediate consequence of this proposition is the following result:

Corollary 1. For the standard family

f_k(x,y)=(-y+2x+k\sin(2\pi x),x):=(-y+\phi_k(x),x),

given any \lambda>2, the maximal invariant set

\Lambda_k=\bigcap\limits_{n\in\mathbb{Z}}f_k^{-n}(\{(x,y)\in\mathbb{T}^2: |\phi_k'(x)|\geq\lambda \})

is hyperbolic.

Remark. It is worth to note that this result gives a clue about the location of the critical region of non-hyperbolicity: for a given \lambda>2, the set of points \{(x,y): |\phi_k'(x)|\leq\lambda\}  converges to the union of the two circles \{x=\pm 1/4\} when k\to\infty.

Of course, this corollary says that the hyperbolic sets \Lambda_k are a family of dynamically increasing basic sets. In fact, it turns out that this can be checked by hand (see section 4.2 of Duarte’s paper), but we’ll skip this fact for sake of brevity of the exposition.

-Global dynamical foliations and their tangency lines-

After the description of the (“big”) hyperbolic sets \Lambda_k of the standard map f_k, we proceed to the study of the tangencies between their invariant foliations. In order to do so, we need to extend these foliations to some uniform neighborhood of \Lambda_k (since we want to perform an analysis for several large parameters) while keeping good estimates of distortion of the holonomy maps. At this point, our first technical problem arises: from the general theory of uniformly hyperbolic sets (see the book of Palis-Takens), we know that \Lambda_k admits some neighborhood U_k so that the stable and unstable foliations of \Lambda_k can be extended to U_k (while keeping good estimates), but a priori the region U_k where the good estimates are ensured can deteriorate when k\to\infty. To overcome this problem, Duarte takes the following point of view. Near the critical region \{x=\pm 1/4\}, he replaces \phi_k(x)=2x+k\sin(2\pi x) by a function \psi_k(x) having two poles at x=\pm 1/4 and he tries to compare the dynamics of f_k(x,y)=(-y+\phi_k(x),x) with the dynamics of the singular diffeomorphism g_k(x,y) = (-y+\psi_k(x),x).

More precisely, \psi_k(x)=\phi_k(x)+\rho_k(x) where \rho_k(x) = 0 outside a 2/k^{1/3}-neighborhood of x=\pm 1/4 and \rho_k(\pm 1/4)=\infty. Then, after the somewhat tedious work of redoing the theory of invariant manifolds (following the exposition of Hirsch-Pugh-Shub), he checks that \psi_k has global stable and unstable foliations \mathcal{F}^s, \mathcal{F}^u on \mathbb{T}^2 verifying uniform distortion estimates (i.e., their holonomy maps have uniformly bounded C^2-norm). Here, the uniform control of distortion comes from the choice of \rho_k: indeed, assuming that \psi_k coincides with \phi_k outside a 1/k^{\epsilon}-neighborhood of x=\pm 1/4, it is not hard to see that the Schwartzian derivate of \psi_k is bounded from below (in the critical region |x\mp 1/4|\leq k^{-\epsilon}) by

2\left|\frac{\psi_k''(x)^2}{\psi_k'(x)^3}\right|+\left|\frac{\psi_k'''(x)}{\psi'(x)^2}\right|\geq \frac{\pi}{k|\cos(2\pi x)|^3} - \frac{\pi}{k\cos(2\pi x)}\geq k^{3\epsilon-1}.

In particular, since we want to take \epsilon>0 the largest possible so that \psi_k coincides with \phi_k in |x\mp 1/4|\geq k^{-\epsilon} and \psi_k with bounded Schwartzian derivative (because it is well-known that bounded Schwartzian derivative implies bounded distortion), it is natural to take \psi_k=\phi_k outside |x\mp 1/4|\geq 2k^{-1/3} (i.e., \epsilon=1/3).

Of course, once we performed this work (which takes 21 pages of Duarte’s paper), we have to compare the dynamics of f_k and g_k. However, this is not hard: the maximal invariant set \Lambda_k is the same for both f_k and g_k, and, using the strong hyperbolic features of the singular diffeomorphism g_k, it is possible to show that the dynamics of f_k and g_k on \Lambda_k are conjugated to a 2n_k full (Bernoulli) shift (where 2k(1-32\pi^2/k^{2/3})\leq n_k\leq 2k); furthermore, B_{4/k^{1/3}}(\Lambda_k)=\mathbb{T}^2,  its stable and unstable thickness \tau^s(\Lambda_k),\tau^u(\Lambda_k)\geq k^{1/3}/9 (where \tau^s(\Lambda_k), resp. \tau^u(\Lambda_k), is the thickness of the Cantor set obtained by projection of \Lambda_k along the stable, resp. unstable, foliation on an arbitrarily fixed transversal section) and, as a consequence, its Hausdorff dimension HD(\Lambda_k) satisfies HD(\Lambda_k)\geq 2 \log 2/\log(2+9/k^{1/3}).

Next, we analyse the relative positions of the (g_k-invariant) foliations \mathcal{F}^s and \mathcal{F}^u. Applying f_k to \mathcal{F}^u, we obtain a new foliation \mathcal{G}^u := (f_k)_*(\mathcal{F}^u) (recall that \mathcal{F}^u is g_k-invariant but it is not f_k-invariant). It is not hard to see that the set of tangencies between \mathcal{G}^u and \mathcal{F}^s are two circles close to \{x=\pm 1/4\}. Moreover, the projection of \Lambda_k along \mathcal{G}^u and \mathbb{F}^s into these two circles gives rise to two Cantor sets K^u and K^s satisfying \tau(K^u)\geq k^{1/3}/10 and \tau(K^s)\geq k^{1/3}/9 (here we are using the previous thickness estimate and the fact that the application of f_k to the foliation \mathcal{F}^u doesn’t change very much the thickness). The picture below (borrowed from Duarte’s paper) summarizes our discussion about the relative position of \mathcal{G}^u and \mathcal{F}^s:

duarte-fig1Here \mathcal{F}^s is the almost vertical foliation and \mathcal{G}^u is the foliation folding along the two dotted circles. Now, using the fact that \tau(K^u)\tau(K^s)\geq k^{2/3}/90\gg 1 (for any large k), we can apply Newhouse’s gap lemma to obtain that K^u\cap K^s\neq\emptyset. In other words, we get that \mathcal{G}^u and \mathcal{F}^s exhibits persistent tangencies.

Remark. In proposition 16 of Duarte’s paper, a version of Newhouse’s gap lemma in the circle is wrongly stated: indeed, Duarte claims that the fact that the two Cantor sets are contained in the circle automatically implies that the two Cantor sets are linked. However, this is not correct (as the example of two thick Cantor sets supported by two disjoint compact intervals shows), although this is not a serious problem for this argument: from a careful checking of the geometry of \Lambda_k (via the features of the singular diffeomorphism g_k), it is not hard to see that K^u and K^s are linked (this follows from Duarte’s argument in section 4.2 of his paper).

Finally, closing this section, we claim that these persistent tangencies are quadratic and unfold generically with the parameter k (as the picture above indicates). While a complete proof of this result takes 7 pages of technical calculations, we’ll provide a convincingly enough (I hope! :) ) heuristic argument. We know that the \psi_k-invariant foliations \mathcal{F}^s and \mathcal{F}^u are almost vertical and horizontal (resp.). In particular, it is reasonable to expect that the circles of tangencies between \mathcal{F}^s and \mathcal{G}^u = (f_k)_*(\mathcal{F}^u) are close to the circles of tangencies between the horizontal foliation and the image of the vertical foliation under f_k. On the other hand, since f_k(x,y) = T_k\circ R where R(x,y)=(-y,x) is the \pi/2 counterclockwise rotation and T_k(x,y)=(x+\phi_k(y),y) is a shear (of variable intensity) along the horizontal foliation, we can compute the image of the vertical foliation under f_k as follows: the image of the horizontal foliation by R is the vertical foliation and the image of the vertical foliation by T_k is a foliation by the family of curves which are parallel to the graph \{(\phi_k(y),y):y\in\mathbb{T}\}. In particular, the circle of tangencies between these two foliations are exactly the critical circles \{(x,\nu_{\pm}): x\in\mathbb{T}, \phi'_k(\nu_{\pm})=0\}. At such points, the difference between the curvatures is measured by \phi_k''(\nu_{\pm})=4\pi^2 k, so that the tangencies between the horizontal foliation and the f_k-image of the vertical foliation are quadratic (i.e., locally you are seeing the intersections between straight lines and parabolas) and, a fortiori, the same holds for the tangencies between \mathcal{G}^u and \mathcal{F}^s. Also, when the parameter k increases, the g_k-invariant foliations \mathcal{F}^u and \mathcal{F}^s doesn’t change very much (they are almost constant), while the x-coordinates of the tangency points between \mathcal{G}^u=(f_k)_*(\mathcal{F}^u) and \mathcal{F}^s (which are close to (x+\phi_k(\nu_{\pm}),\nu_{\pm})) move with velocity (close to) 1 (indeed, \nu_{\pm}=\pm 1/4 implies \frac{d}{dk}(x+\phi_k(\nu_{\pm}))= \sin(2\pi \nu_{\pm})=1). Hence, these tangencies are unfolded generically.

At this point, the reader noticed that this discussion gives the theorems 1 and 2 of the previous post. Now, we proceed to discuss the conservative version of the Newhouse phenomena.

-Conservative version of Newhouse phenomena: proof of theorem 3-

Before entering into the proof of the abundance of elliptic islands close to a generically unfolded quadratic tangency, let me review a little bit some facts around the proof of the “classical” Newhouse phenomena.

Given f_\mu a 1-parameter family of surface diffeomorphisms generically unfolding a quadratic homoclinic tangency q associated to a hyperbolic periodic point p of saddle-type (at the parameter \mu=0 say), Newhouse manage to define a renormalization scheme near q as follows: for every large n\in\mathbb{N}, one can select small boxes near q which are mapped by f_\mu^n near itself  with the shape of a parabola so that their relative positions resembles a horseshoe; next, we compose this dynamics with appropriate rescalings h_n of these boxes in order to put these very small boxes into a fixed scale (e.g., a unit square) so that we obtain the families of dynamical systems given by h_n\circ f_\mu^n\circ h_n^{-1} (these are called the successive renormalizations of the dynamics near the homoclinic tangency). The usefulness of idea is more or less clear: assuming that there exists some limiting object h_n\circ f_\mu\circ h_n^{-1}\to t_{\mu}, any stable dynamical property of t_\mu will be shared by h_n\circ f_\mu\circ h_n^{-1} and a fortiori f_\mu.  It turns out that Newhouse showed that this renormalization scheme converges (i.e., the limiting object exists) when the periodic point p is dissipative (i.e., \det|Df_\mu(p)|<1).  Moreover, the limit t_\mu in this case is the quadratic family

t_\mu(x,y)=(\mu-x^2,x).

Using this information, the existence of sinks near the homoclinic tangency follows directly. A detailed exposition of Newhouse’s argument can be found in the excelent book of Palis and Takens.

After this quick review of Newhouse arguments, let us consider again the situation of the standard map: in the conservative setting, given a family of area-preserving diffeomorphisms generically unfolding a quadratic homoclinic tangency associated to a hyperbolic periodic point, one can repeat the renormalization scheme of Newhouse to get as a limit object the conservative Hénon family

t_\mu(x,y)=(-y+\mu-x^2,x).

Next, it is possible (exercise) to show the presence of elliptic fixed points for this family when -1<\mu<3 (with the eigenvalue of this point running from 1 to -1). Because generic (i.e., non-resonant) elliptic periodic point is stable by conservative perturbations (this follows from the so-called KAM theory; see e.g. this monograph of J. Moser), we conclude the existence of (generic) elliptic periodic points nearby the homoclinic tangency. Combining this result with the theorems 1 and 2 proved in the previous sections, we see that, similarly to the proof of Newhouse theorem, the proof of theorem 3 is complete!

My next topic on my series of posts about some lectures of the School and Workshop of Dynamical held at the ICTP, Trieste (Italy) is a beautiful theorem of Pedro Duarte quoted by Anton Gorodestki during his talk ”On the size of the stochastic layer of the standard map“. I should say that I am not planning to discuss Gorodetski’s theorem here but I will focus only on Duarte’s result. My main reason to do so is the fact that Gorodetski result is strongly based on Duarte’s techniques, so that it would be very hard for me to explain the former theorem without the latter one.

The plan of this post is the following:

  • in next section, we introduce the standard map and briefly discuss the some of its well-known dynamics (due to the KAM theory);
  • after that, we recall the positive entropy conjecture for the standard map;
  • finally, we state Duarte’s theorem showing that the positive entropy conjecture for the standard map is a subtle problem due to the presence of ‘elliptic islands‘.

Closing this introduction, let me point out that I will postpone the proof of Duarte’s theorem to a subsequent discussion.

-Standard map: a discrete-time model of the pendulum dynamics-

During our undergratude studies in Mathematics (or Physics perhaps), we certainly faced at some point the so-called pendulum equation:

x''(t)=K\sin(2\pi x(t)).

This ODE models an idealized pendulum: the chord is massless and inextensible, the system moves on a 2-dimensional plane due only to the gravitational force and the total energy of the system is conserved (i.e., there is no friction). For more details, see the link above (for the Wikipedia article on the mathematical pendulum).

It is known that the dynamics of the pendulum is very rich. In order to approach it, one can look at the discrete-in-time model provided by the solutions of the following difference equation:

(1) \Delta^2 x_n = (x_{n+1}-x_n)-(x_n-x_{n-1}) = k\sin(2\pi x_n)

obtained by the the substitution of the second order differential operator x''(t) by the second order difference operator \Delta^2 x_n. A simple way to encode the pairs (x_n, x_{n-1}) satisfying the difference equation (1) is provided by the orbits of the so-called standard family (of area-preserving maps acting on the 2-torus \mathbb{T}^2=\mathbb{R}^2/\mathbb{Z}^2):

f_k(x,y)=(-y+2x+k\sin(2\pi x),x).

In fact, it is easy to check that f_k(x_n,x_{n-1})=(x_{n+1},x_n) (exercise).

Remark. Observe that the maps f_{-k} and f_k are conjugated via the translation (x,y)\mapsto (x+\tfrac{1}{2}, y+\tfrac{1}{2}). Hence, we can restrict ourselves to the standard family f_k with parameters k\geq 0.

On the other hand, we should point out that the dynamics of the standard map f_k is a good aproximation of the pendulum dynamics x''(t)=k\sin(2\pi x(t)) only for small values of the parameter k. Indeed, the pendulum’s flow is always integrable (since the total energy, i.e., the kinetic energy plus the potential energy is a conserved quantity for this flow), the same is not exactly true for the standard family: while the standard map f_0 is integrable, that is, \mathbb{T}^2 is foliated by completely invariant (KAM) circles (whose dynamics is conjugated to a rotation), these circles starts to break up as the parameter k increases and the orbit behavior becomes “chaotic”. In fact, this informal description of the dynamics of the standard family for large k is strongly supported by the computer experiments. For instance, one can see below a computer simulation ( borrowed from this Wikipedia article about the standard map) of the 10 orbits of the standard map f_2:

The orbit space of the standard map with parameter k=2. The green region is the ``chaotic'' part of the dynamics (while in the complement of this region the reader can find some invariant KAM circles).

The orbit space of the standard map with parameter k=2. The green region is the ``chaotic

The reader can found further beautiful pictures (for other choices of parameters) in this Scholarpedia article. Of course, one can ask (motivated by this conjectural description) the following question:

Question (Positive entropy conjecture). Is the Kolmogorov-Sinai entropy h_\mu(f_k) of the Lebesgue measure \mu positive for large values of k?

While the computer experiments seem to show a lower bound h_{\mu}(f_k)\geq \log(|k|/2), the positive entropy conjecture is not known to be true even for a single value of k (despite the efforts of many mathematicians, e.g., Oliver Knill who tried an operator theory strategy based on the so-called Herman’s subharmonicity method). Anatole Katok describes the positive entropy conjecture as one of the five most resistant problems in dynamics!

The positivity of the (metric) entropy of the standard map f_k for typical large values of k has several interesting consequences:

  • from the Pesin’s entropy formula, we know that the positivity of the entropy of the Lebesgue measure (area) implies the existence of an invariant set A_k of positive area such that every x\in A_k has a positive (maximal) Lyapounov exponent (and, due to the conservation of mass,  a negative minimal Lyapounov exponent also).
  • in particular, from Pesin’s theory, the non-vanishing of the (two) Lyapounov exponents of f_k on A_k (i.e., its non-uniform hyperbolicity) implies that the dynamics of f_k restricted to the set A_k is chaotic (for instance, the system exhibits sensitivity to initial conditions and Katok’s theorem allows us to infer the existence of ‘large’ horseshoes contained inside A_k).

At this point, we are ready to state the main theorem of this post:

Theorem (Duarte). There exists a large parameter k_0>0 and a residual subset R\subset [ k_0,\infty ) such that set E of elliptic periodic points of the standard map f_k is 4/k^{1/3}-dense set (i.e., d(x,E)<4/k^{1/3} for any x\in\mathbb{T}^2) for all k\in R.

It is worthwhile to point out that Duarte’s theorem goes into the opposite direction of the positive entropy conjecture. In fact, the presence of elliptic periodic points is potentially dangerous for the positivity of the entropy because the KAM theory says that a typical elliptic periodic point (i.e., an elliptic point satisfying a certain non-resonance condition) can be locally linearized (that is, the initial diffeomorphism can be locally conjugated to the its derivative at the periodic point). Since the derivative at an elliptic point is essentially a rotation, the entropy of the initial diffeomorphism (restricted to the basin of the elliptic point) is zero. In other words, the presence of these ‘elliptic islands’ (i.e., open invariant regions where the diffeomorphism behaves like a rotation) are a serious obstruction to the positivity of the entropy.

Remark. Observe that Duarte’s theorem doesn’t contradicts the positive entropy conjecture. In fact, this follows from the ‘usual reason’: residual sets (in Baire sense) may have zero Lebesgue measure. However, this result certainly shows how subtle the positive entropy conjecture is.

Ending this post, let me outline the strategy of the proof of Duarte’s theorem. Roughly speaking, the idea is to get the result from a conservative version of Newhouse phenomena. More precisely, one begins with the following result:

Theorem 1. For each parameter k, there exists a basic set \Lambda_k of f_k (i.e., a compact f_k-invariant transitive hyperbolic set) with the following properties:

  • the family \{\Lambda_k\}_{0\leq k<\infty} is dynamically increasing: given k_0, for any sufficiently small \epsilon>0, the set \Lambda_{k_0+\epsilon} contains the dynamical continuation of \Lambda_{k_0} (that is, the unique compact f_{k_0+\epsilon}-invariant hyperbolic set close to \Lambda_{k_0} in the Hausdorff topology);
  • the thickness of \Lambda_k grows to infinity as k\to\infty: for all sufficiently large parameter k, the local stable and unstable thickness are

\tau^s_{loc}(\Lambda_k), \tau^u_{loc}(\Lambda_k)\geq k^{1/3}/9;

HD(\Lambda_k)\geq 2\log2/\log(2+9k^{-1/3});

  • \Lambda_k is topologically conjugated to a (full) Bernoulli shift in 2n_k symbols where

2n_k/4k\to 1 as k\to\infty;

  • for all large k, the set \Lambda_k is 4/k^{1/3}-dense in the torus \mathbb{T}^2.

Starting with this theorem, Duarte shows that the basic set \Lambda_k generically unfolds a quadratic homoclinic tangency for a dense set of (large) parameters:

Theorem 2. There exists k_0>0 such that, for any k\geq k_0 and any periodic point p\in\Lambda_k, the set \mathcal{T} of parameters \ell\in\left[k,\infty\right) such that the invariant manifolds W^s(p(\ell)) and W^u(p(\ell)) of the continuation p(\ell) of p generically unfold a quadratic homoclinic tangency is dense in \left[ k,\infty \right).

In resume, the theorems 1 and 2 put us in the framework of the (conservative version of) Newhouse phenomena: for a large parameter in the dense set \mathcal{T}, we have a thick basic set unfolding (generically) a quadratic tangency; this permits to use some of the ideas employed by Newhouse (as discussed in a previous post): of course, it is impossible to obtain the same conclusion of Newhouse (namely, the coexistence of infinetely many sinks/sources for a generic nearby system) because we are dealing exclusively with area-preserving diffeomorphisms; but, the techniques of Newhouse gives us the coexistence of many elliptic periodic points. Indeed, this is the content of the following result of Duarte:

Theorem 3. There are k_0>0 large and a residual subset R\subset \left[k_0,\infty\right) such that, for every k\in R, the closure of the elliptic periodic points of f_k contains the 4/k^{1/3}-dense basic set \Lambda_k.

Of course, Duarte’s theorem is an immeadiate consequence of theorem 3. In the next post, we will concentrate on the proofs of theorems 1, 2 and 3. Ja ne!

Posted by: matheuscmss | October 22, 2008

Asaoka’s lecture on Verjovsky conjecture – part II

Firstly, I would like to apologize for the long hiatus between the first and the second post on Asaoka’s theorem. Basically, my blog schedule was somewhat affected by the World Chess Championship! By the way, the third and fifth matches between V. Anand and V. Kramnik were really interesting: Anand won the third match with a variant of the semi-slav defense and he managed to use the SAME move (ok not exactly the same because he interposed the rook before the bishop) three days after (in the fifth match) but Kramnik (and his mates) were still unable to find an adequate defense! :)

Anyway, since I’m not an expert in chess, let me return to the main purpose of this post. Following the plan of the previous post, today we’ll discuss the proof of Asaoka’s theorem:

Theorem (Asaoka). Any transitive codimension-one Anosov flow is topologically conjugated to a smooth volume-preserving Anosov flow.

Remark. It is not hard to show that volume-preserving Anosov systems are transitive (exercise). Thus, Asaoka’s theorem is a converse of this fact in the case of codimension-one Anosov flows. In fact, it is a good open problem to know whether a similar statement is true in general.

Roughly speaking, the proof of this theorem has two steps:

  • 1st step: a generalization of Elise Cawley’s work in order to show that one can deform any codimension-one Anosov flow inside its topological conjugacy class so that the derivative along the stable and unstable fits any ‘coherently’ prescribed dynamical behaviour; in particular, one can deform the flow in order to get a topologically equivalent flow preserving a Hölder continuous volume form;
  • 2nd step: by a C^1-small perturbation of a flow preserving a Hölder continuous volume form, one can get a flow preserving a smooth volume form (via the so-called ‘pasting lemma‘ technique developed by Alexander Arbieto and myself);

Since it is well-known that Anosov flows are structurally stable (i.e., they are topologically conjugated to any C^1-nearby flow), these two steps complete the proof of Asaoka’s theorem.

Remark 1. Concerning the 1st step, it turns out that the resulting flow can’t be expected to preserve a smooth volume form in general. Indeed, this happens due to the lack of regularity of invariant foliations of Anosov flows: typically, they are only C^{1+} even when the codimension-one Anosov flow is C^{\infty}.

Now let us turn to the details.

-Generalization of Cawley’s deformation arguments-

Consider a codimension-one Anosov flow \phi^{t} with a hyperbolic splitting TM = E^s\oplus T\phi\oplus E^u. For sake of concreteness, we assume that the unstable direction E^u is one-dimensional (of course this can always be achieved by a time change t\mapsto -t). Denote by \mathcal{F}^{uu} the associated strong-unstable foliation and let h_{pq}:\mathcal{F}^{uu}(p)\to\mathcal{F}^{uu}(q) the holonomy map of the weak stable foliation \mathcal{F}^s (i.e., the foliation tangent to the subbundle E^s\oplus T\phi).

Definition 1. We denote by \mathcal{M}(\phi) the set of families of measures \nu= \{\nu\}_{p\in M} such that

  • \nu_p is a Borel, non-atomic, positive on open sets, locally finite measure on \mathcal{F}^{uu}(p) for all p\in M;
  • \mathcal{F}^{uu}(p)=\mathcal{F}^{uu}(q) implies \nu_p=\nu_q;
  • the Radon-Nikodym derivative \frac{d(\nu_p\circ h_{pq})}{d\nu_q}(q) at q is well-defined for any p\in M and q close to p; morever \frac{d(\nu_p\circ h_{pq})}{d\nu_q}(q) is Hölder continuous on both variables p and q.

The next result is a precise statement of the generalization of Cawley’s work (quoted in the 1st step of the outline) about the realization of any dynamical behaviour along the unstable direction:

Theorem 1 (Radon-Nikodym realization theorem). Given any positive Hölder continuous function f:M\to\mathbb{R}, there are \nu\in\mathcal{M}(\phi) and \rho>0 such that

\log\left(\frac{d(\nu_{\phi^t(p)}\circ \phi^t)}{d\nu_p}\right)(p) = \rho\cdot\int_0^t f\circ\phi^{\tau}(p)d\tau.

Remark 2. In fact, the reader should notice that we are not deforming the flow in order to obtaining the desired Radon-Nikodym derivative (although we promised to do so in the 1st step); of course, we could do it, but this would lead to technical problems which we are going to avoid in the following way: instead of deforming directly the flow (in order to get a desired dynamical behaviour), we keep the same flow at the cost of using the family of measures \nu to deform the differentiable structure of the manifold. In other words, we make a little change of point of view: a deformed flow on a given manifold corresponds to a fixed flow on a deformed manifold (with the advantage that the deformation of the differentiable structure is easier to control than the deformation of the flow).

Proof of theorem 1. The basic idea here is the following: Anosov flows can be accurately modeled by the suspension flow over a subshift of finite type: more precisely, one can consider a Markov partition \{R_1,\dots, R_n\} so that the dynamics of any point (outside the boundaries of R_j) can be ‘codified’ by its itinerary with respect to the elements of this partition (i.e., at each time we look at the number j indexing the rectangle R_j containing our point). For more details on the suspension flow (over an arbitrary discrete time dynamical system) the reader can consult my previous post and for more explanations about Markov partitions for hyperbolic systems the reader can see the excelent classical book of R. Bowen. It follows that the original Anosov flow and the suspension of the subshift are Hölder conjugated, so that the replacement of the Anosov system by this ‘toy model’ is harmless for our purposes.

Next, one look at the toy model provided by the suspension of the subshift \sigma_A with a mixing transition matrix A and it turns out that the similar statement is true by the theory of equilibrium states (essentially the measure we are searching is the Gibbs measure obtained by maximizing the pressure of a certain potential). I.e., given a Hölder continuous function \ell, take the reference measure \mu_{\ell} for the Gibbs (\sigma_A-invariant) measure \nu_{\ell}\ll\mu_{\ell} attaining the topological pressure P_{\sigma_A}(\ell)=\sup\{h_{\nu}(\sigma_A)+\int k_{\ell}d\nu\} of \ell among all \sigma_A-invariant measures (that is, P_{\sigma_A}(\ell) = h_{\nu_{\ell}}+\int k_{\ell}d\nu_{\ell}). It holds \frac{d(\mu_{\ell}\circ \sigma_A)}{d\mu_{\ell}}(x)=-\ell(x)+P_{\sigma_A}(\ell). Moreover, if \ell verifies \inf\limits_{x\in\Sigma_A}\sum\limits_{j=0}^{m-1}\ell\circ\sigma^j(x)>0, we can find a constat \rho_{\ell} such that the Bowen’s equation P_{\sigma_A}(-\rho_{\ell}\cdot \ell)=0 holds. On the other hand, after doing the translation of the Radon-Nikodym problem for \phi to the Radon-Nikodym problem for the subshift \sigma_A with transition matrix A that it suffices to find a measure \mu such that

\log\frac{d(\mu\circ\sigma_A)}{d\mu} = \rho\cdot f_A

where f_A is a Hölder continuous function with \inf\limits_{x\in\Sigma_A}\sum\limits_{j=0}^{m-1}f_A\circ\sigma^j(x)>0. As we saw above, it suffices to use the theory of equilibrium states with the potential f_A to obtain the desired measure.

Unfortunately, since the theory of equilibrium states for subshifts of finite type takes an entire post by itself, I’m unable to give further details of this argument here. I refer the reader to the sections 2 and 3 of Asaoka’s paper for further details. \square

Remark 3. Although it was not explicitly stated in the ‘proof’ of theorem 1, we are strongly using the codimension-one hypothesis here (as Cawley did when dealing with Anosov diffeomorphisms of T^2).

Now we proceed to investigate the deformation of the Anosov flow versus the deformation of the differentiable structure of the manifold. Recall that the weak stable foliation \mathcal{F} is C^{1+}. So, if we fix a (Hölder) Riemannian metric g, it makes sense to consider an orthogonal splitting TM=T\phi\oplus E_{\phi}\oplus E_{\phi}^{\perp} and the Hölder continuous flows N_{\mathcal{F}}\phi^t=\pi_{\phi}\circ D_{\mathcal{F}}\phi^t|_{E_{\phi}} and N_{\mathcal{F}}^{\perp}\phi^t=\pi_{\phi}^{\perp}\circ D_{\mathcal{F}}\phi^t|_{E_{\phi}} where \pi_{\phi} (resp. \pi_{\phi}^{\perp}) denotes the orthogonal projection onto E_{\phi} (resp. E_{\phi}^{\perp}). Also, we introduce the cocycles \alpha_{\phi}(p,t,\mathcal{F},g) = \log\det_g(N_{\mathcal{F}\phi^t})_p and \alpha_{\phi}^{\perp}(p,t,\mathcal{F},g) = \log\det_g(N_{\mathcal{F}^{\perp}\phi^t})_p over \phi^t: the name cocycle comes from the property

\alpha_{\phi}(p,t+s,\mathcal{F},g)=\alpha_{\phi}(p,t,\mathcal{F},g)+ \alpha_{\phi}(\phi^t(p),s,\mathcal{F},g)

and

\alpha_{\phi}^{\perp}(p,t+s,\mathcal{F},g)=\alpha_{\phi}^{\perp}(p,t,\mathcal{F},g)+ \alpha_{\phi}^{\perp}(\phi^t(p),s,\mathcal{F},g)

We apply the Radon-Nikodym realization theorem with f(p)=-\frac{\partial\alpha_{\phi}}{\partial t}(p,0,\mathcal{F},g), so that we get a constant \rho_{\phi}>0 and a family of measures \nu\in \mathcal{M}(\phi) verifying

\log\left(\frac{d(\nu_{\phi^t(p)}\circ \phi^t)}{d\nu_p}\right)(p) = \rho_{\phi}\cdot\int_0^t f\circ\phi^{\tau}(p)d\tau=-\rho_{\phi}\cdot\alpha_{\phi}(p,t,\mathcal{F},g).

Lemma 1. \rho_{\phi}=1.

Proof. Take \mathcal{L} a C^{2+} one-dimensional foliation transverse to \mathcal{F}. Note that this is possible since \phi is a codimension-one Anosov flow. Fix an atlas a_p:U_p\to\mathbb{R}^n on M and a C^{1+} parametrization L_p:(-1,1)\to \mathcal{F}^{uu}(p) such that

  • a_p(p)=0 and a_p(U_p)=(-1,1)^{n};
  • a_p^{-1}((-1,1)^{n-1}\times \{y\})\subset\mathcal{F}(q) and a_p^{-1}(\{x\}\times (-1,1))\subset\mathcal{L}(q) (where a_p(q)=(x,y));
  • L_p(0)=p and L_p(y)\in\mathcal{F}^{uu}(p)\cap a_p^{-1}((-1,1)^{n-1}\times \{y\}).

Using this atlas a_p, the parametrizations L_p and the family of measures \nu provided by theorem 1, we can change the differentiable structure of M in the following way: we define \eta_p(y)=\int_0^y d(\nu_p\circ L_p) and we declare that the atlas \widehat{a}_p given by

\widehat{a}_p\circ a_p^{-1}(x,y):=(x,\eta_p(y))

is our new differentiable structure. In order to avoid some confusion, we denote by \widehat{M} the manifold M equipped with this new atlas. It is not hard to check that \widehat{M} is a C^{1+} manifold (although the ‘identity’ map i:M\to\widehat{M} is only a bi-Hölder homeomorphism). See lemmas 4.2 and 4.3 of Asaoka’s paper. In any case, it turns out that the flow \widehat{\phi}^t=i\circ \phi^t\circ i^{-1} is C^{1+} (despite the fact that it is generate by a Hölder continuous vector field). Furthermore, one can directly check that

\alpha_{\widehat{\phi}}(i(p),t,\widehat{\mathcal{F}},\widehat{g})=\alpha_{\phi}(p,t,\mathcal{F},g) and \alpha_{\widehat{\phi}}^{\perp}(i(p),t,\widehat{\mathcal{F}},\widehat{g})=-\rho_{\phi}\cdot\alpha_{\phi}(p,t,\mathcal{F},g),

where \widehat{\mathcal{F}}=i(\mathcal{F}) and \widehat{g}=i^*g (observe that \widehat{g} is a Hölder continuous metric).

Assume momentarily that \widehat{\phi} is generated by a C^{1+} (Anosov) vector field (recall that it is only generated by a Hölder vector field at this point). Then, we have \widehat{\phi}^t(\widehat{M})=\widehat{M}, so that

vol_{\widehat{g}}(\widehat{\phi}^t(M))=\int_{\widehat{M}} |\det_{\widehat{g}}D\widehat{\phi}^t|=\int_{\widehat{M}} 1 = vol_{\widehat{g}}(\widehat{M}).

In particular, there exists p_0\in\widehat{M} such that |\det_{\widehat{g}}D\widehat{\phi}^t(p_0)|=1, i.e., \log|\det_{\widehat{g}}D\widehat{\phi}^t(p_0)|=0. On the other hand, we know that \log|\det_{\widehat{g}}D\widehat{\phi}^t|=\alpha_{\widehat{\phi}} + \alpha_{\widehat{\phi}}^{\perp}=(1-\rho_{\phi})\alpha_{\widehat{\phi}}. Hence, by computing this last expression at p_0, we get

0=(1-\rho_{\phi})\alpha_{\widehat{\phi}}.

Since \widehat{\phi} is a C^{1+} Anosov flow, it follows that \sup\limits_{p\in M} \alpha_{\widehat{M}}(p)<0 (because \widehat{\phi} contracts any direction tangent to \widehat{\mathcal{F}} which is transverse to the flow direction). Hence, \rho_{\phi}=1.

Finally, it remains to get rid of the assumption that the vector field generating \widehat{\phi} is C^{1+}. This can be accomplish by perturbation of the vector field (in the C^{1+} local coordinates provided by a_p). It follows that this nearby (deformed) vector field is Anosov (and topologically conjugated to the initial system) so that the previous argument applies: indeed, although the initial vector field associated to \widehat{\phi} is only Hölder (and it does not make sense to say that it is Anosov), it ‘remembers’ that it was constructed from the Anosov smooth system \phi. In particular, after a little bit of technical work (see lemma 4.4 of Asaoka’s paper), we can show that the initial Hölder vector field behaves as an Anosov system (in the sense that nearby smooth systems are Anosov and topologically conjugated to \widehat{\phi}). This completes the proof. \square

Theorem 2. \phi is topologically conjugated to an Anosov flow \psi preserving a Hölder continuous volume form.

Proof. We have that \phi is topologically conjugated to an Anosov system \psi such that \alpha_{\widehat{\psi}} + \alpha_{\widehat{\psi}}^{\perp} = 0. In particular, it follows that \widehat{\psi} preserves the volume form associated to the Hölder metric \widehat{g}. Sending back this structure (from \widehat{M} to M) via the Hölder homeomorphism i:M\to \widehat{M} gives the desired conclusion. \square

At this point, the proof of Asaoka’s theorem is essentially complete: up to the somewhat boring fact that the volume form is Hölder continuous (although one would like to get smooth volume form for the application to Verjovsky conjecture because S. Simic’s method needs some regularity). We overcome this technical problem in the next section.

-‘Pasting lemma’ technique and the regularization of the volume form-

A basic problem in order to get a smooth volume form is the following: since the differentiable structure on \widehat{M} is only C^{1+}, our volume forms can’t be more regular than C^{1+}. However, one can bypass this problem with the following result of Hart:

Theorem 3 (Hart). For any C^{r+\alpha} foliation \mathcal{G} on a C^{\infty} manifold M, there exists a C^{r+\alpha} diffeomorphism H on M such that T H(\mathcal{G})=DH(T\mathcal{G}) is a C^{r+\alpha} subbundle of TM. Moreover, H can be taken arbitrarily C^r close to the identity.

The proof of this result is not very difficult (for a short proof [of 1 page] of it see the appendix A of Asaoka’s paper), but we will skip it here. An interesting direct corollary of this theorem is the following fact:

Corollary 1. Given a C^{1+} manifold \widehat{M} and an oriented C^{1+} foliation \widehat{O}, we can find a C^{\infty} differentiable structure on \widehat{M} which is compatible with the initial C^{1+} structure such that \widehat{O} is generated by a C^{1+} vector field.

We apply this corollary in the context of the theorem 2. We consider the orbit foliation \widehat{O} of \widehat{\psi} on the manifold \widehat{M}. Since \widehat{O} is C^{1+}, the corollary 1 gives a smooth structure on \widehat{M} compatible with the initial C^{1+} structure and a C^{1+} nearby Anosov vector field \widehat{X_0} generating \widehat{O} such that \widehat{X_0} preserves a Hölder volume form.

In order to conclude the proof of Asaoka’s theorem, it suffices to construct a vector field X preserving a smooth volume form such that X is nearby to X_0 (in the C^1 topology). To do so, we fix a C^{\infty} Riemannian metric g_0 on M and we consider h_0 a Hölder function such that \det_{e^{h_0}g_0}D\phi_0^t(p)=1 for all p\in M (where \phi_0 is the flow generated by X_0). Note that this implies

0=\log\det_{e^{h_0}g_0}D\phi_0^t(p) = \log\det_{g_0}D\phi_0^t(p)+dim(M)\cdot(h_0(\phi_0^t(p)) - h_0(p)).

Thus, although h_0 is only Hölder, we have that h_0 is differentiable along X_0 and

dim(M)\cdot X_0 h_0 = -div_{g_0}X_0.

In particular, X_0 h_0 is a Hölder function. Now, we consider a C^{\infty} function $h$ such that $h$ is C^{0+} close to h_0 and X_0h is C^{0+} close to X_0h_0, and a C^{\infty} vector field $Y$ nearby X_0 in the C^{1+\alpha} topology.  Of course, this can be achieved by a standard use of mollifiers. It follows that div_{e^h g_0}Y is C^{0+} close to div_{e^h g_0}X_0.

Next, we recall the formula

div_{e^h g} X(p) = dim(M)\cdot Xh + div_g X

for any C^1 vector field X, any (continuous) Riemannian metric g and any function h differentiable along the direction X. Using this formula and the fact dim(M)\cdot X_0 h_0 = -div_{g_0}X_0, we obtain

div_{e^h g_0} Y = (div_{e^h g_0} Y-div_{e^h g_0} X_0)+dim(M)\cdot(X_0 h -X_0 h_0).

Therefore, div_{e^h g_0} Y is C^{0+} close to 0. In other words, we are almost done since the condition of the type div_{g}X=0 means that the vector field X preserves the volume form associated to g.

At this point, we ‘adjust’ the vector field by a technique called ‘Pasting Lemma‘ (due to my coauthor A. Arbieto and myself in this paper). Initially, we developed the Pasting Lemma to deal with questions related to volume preserving diffeomorphisms f (where the corresponding equation is \det_{g} Df=1): the basic idea going back to the seminal paper of Dacorogna and Moser is the simple and powerful remark that one can adjust the diffeomorphism (or vector field) via the resolution of a very well-know PDE, namely, \Delta a = b (perhaps with Dirichlet or Neumann boundary condition), but it turns out that this pasting lemma has some applications by other authors, e.g., Bochi, Fayad and Pujals, Araujo and Bessa (besides Asaoka himself as we are going to see).

In the present case, the application is quite direct because the PDE div_g X=0 is linear on X (while the case of diffeomorphisms is technically more subtle since \det_g Df is highly non-linear). In any case, we fix a reference point p_0\in M and we take the unique solution f of the PDE \Delta f = -div_{e^h g_0}Y with f(p_0)=0. Note that f exists because div_{e^h g_0} Y is C^{\infty} (and, a fortiori, C^{0+}) and \int_M div_{e^h g_0} Y=0 (since M is compact and boundaryless). Moreover, the Schauder estimates say that f is C^{2+} close to 0 because div_{e^h g_0}Y is C^{0+} close to 0. In particular, the vector field X=Y+grad_{e^h g_0} f is a vector field $C^{1+}$ close to Y such that

div_{e^h g_0}X = div_{e^h g_0}Y + div_{e^h g_0}grad_{e^h g_0}f=div_{e^h g_0}Y+\Delta_{e^hg_0} f=0,

i.e., X preserves the (smooth) volume form of e^{h}g_0. This ends the proof of Asaoka’s theorem.

My main motivation for my posts in this blog is certainly the exposition of mathematical topics of my own interest. In particular, I try to avoid the discussion of political issues here. However, two friends and Iranian mathematicians working in Brazil (Hossein Movasati and Ali Tahzibi) were unable to get short visa in order to take part into scientific activities (a conference in Germany and an one-year post-doc in France, resp.) due to the fragile political relationship between Iran and other countries. Of course, each country is free to control its borders, but in the specific case of these two mathematicians, we can clearly say that these decisions were completely arbitrary at least for two reasons:

  • both of them have permanent visa and job in Brazil (so that, as Hossein wrote in this post, they could prove that they are ‘good men’);
  • both of them had the support of the local mathematicians (German and French, resp.) for these scientific visits;

As Amie pointed out in this comment here, there are certainly stories of this flavor involving the US government, but I was shocked to heard Ali’s history because I was unaware of such positions from the French government (specially because I’m living in France now).

In any case, if you want to express your opinion on these shameful political decisions and/or share similar histories, I invite you to joint us in this blog

http://iranianmath.blogspot.com/

begun by Hossein and Ali. I hope we can succeed against this kind of arbitrary politicial decision which is so harmful for the scientific (and in particular mathematical) community. In this direction, Ali proposed here the interesting idea of asking some help of important mathematical organizations such as IMU (and TWAS perhaps).

Thanks in advance for any support you can give in this issue! I promise my next topic will be on Asaoka’s proof of Verjovsky conjecture! :)

Yesterday, I gave a talk on the “Seminaire de Theorie Ergodique” at the Universite de Paris 13 (Villetaneuse) about a short note entitled “An example of a Teichmuller disk in genus 4 with degenerate Kontsevich-Zorich spectrum” that G. Forni and me just posted on the arXiv.

The Teichmuller geodesic flow acts (as any respectable geodesic flow) on the unit cotangent bundle of the Teichmuller space by parallel transport of points and vectors along geodesics of the Teichmuller metric (here one should be careful because the Teichmuller metric is not a Riemannian metric but only a Finsler metric).

Using the results of Alhfors and Bers in Riemann surface theory, it is possible to identify the unit cotangent bundle of the Teichmuller space with the space Q_g^{(1)} of holomorphic quadratic differentials on a Riemann surface M of genus g with area 1 modulo the group Diff^+_0(M) of orientation-preserving diffeomorphisms of M which are isotopic to the identity. Note that SL(2,\mathbb{R}) acts naturally on Q_g^{(1)} by linear transformations on the pairs (\Re(q^{1/2}), \Im(q^{1/2})) of Abelian differentials. The interesting consequence of this point of view is that it turns out that the Teichmuller geodesic flow G_t on Q_g^{(1)} is simply the action of the diagonal subgroup \textrm{diag}(e^t,e^{-t}) on Q_g^{(1)}.

Now let me make a few general remarks about the fine structure of Q_g^{(1)}:

  • Q_g is stratified into analytic spaces Q_{\kappa}^{(1)} obtained by fixing the multiplicities \kappa=(k_1,\dots,k_{\sigma}) of the zeroes \Sigma_\kappa=\{p_1,\dots,p_{\sigma}\} of the quadratic differentials (here \sum k_i=4g-4 in view of the Riemann-Hurwitz theorem);
  • Q_g^{(1)} can be endowed with a natural notion of ‘Lebesgue measure’: for each stratum Q_\kappa^{(1)} there is a SL(2,\mathbb{R})-invariant probability measure \mu_\kappa^{(1)} in the same class of the Lebesgue measure on the local charts Q_\kappa^{(1)}\to H^1(M,\Sigma_\kappa,\mathbb{C}) given by the period map;

Once we know that Q_g^{(1)} has a good structure, we can start doing some Ergodic Theory. We consider the Teichmuller flow G_t on a stratum Q_\kappa^{(1)}. Although Veech showed that the strata are not always connected (see the works of Kontsevich, Zorich and Lanneau for the complete classification of the connected components), Masur and Veech (independently) managed to show that G_t is ergodic on each connected component of Q_\kappa and, more recently, Avila, Gouezel and Yoccoz proved that G_t is exponentially mixing.

Next, we ask about the Lyapounov spectrum (i.e., the collection of Lyapounov exponents) of G_t. In order to adress properly this question, Kontsevich and Zorich introduced the so-called Kontsevich-Zorich cocycle G_t^{KZ}:Q_g^{(1)}\times H^1(M,\mathbb{C})\to Q_g^{(1)}\times H^1(M,\mathbb{C}) given by the quotient of the trivial cocycle G_t\times id by the mapping class group. It is know that G_t^{KZ} is symplectic so that its Lyapounov spectrum with respect to any ergodic measure \mu is symmetric:

1=\lambda_1^{\mu}\geq \dots\geq \lambda_g^{\mu}\geq 0\geq -\lambda_g^{\mu}\geq \dots \geq -\lambda_1^{\mu}=-1.

Moreover, one can show that the Lyapounov exponents of G_t are determined by the first g (non-negative) Lyapounov exponents 1=\lambda_1^{\mu}\geq \dots\geq \lambda_g^{\mu} of G_t^{KZ}.

After several numerical experiments, Kontsevich and Zorich conjectured that the Lyapounov exponents of G_t^{KZ} with respect to the canonical ‘Lebesgue’ measure \mu_\kappa^{(1)} are all non-zero (i.e., G_t is non-uniformly hyperbolic) and simple (i.e., multiplicity 1). Nowadays, we know that this conjecture is true due to the results of G. Forni (who proved the non-uniform hyperbolicity of G_t^{KZ}) and A. Avila, M. Viana (who showed that the simplicity of the Lyapounov spectrum). In particular, the Kontsevich-Zorich spectrum (i.e., the Lyapounov spectrum of G_t^{KZ}) of a \mu_\kappa^{(1)} generic point is well-understood.

A natural question (posed by Veech) related to this result concerns the Kontsevich-Zorich spectrum of non-generic points: how bad (or ‘degenerate’) can it be? This question was firstly answered by G. Forni who showed the existence of a Veech surface of genus 3 so that any SL(2,\mathbb{R})-invariant measure \mu supported on the SL(2,\mathbb{R})-orbit of this surface has \lambda_2^{\mu}=\lambda_3^{\mu}=0 (moreover, it seems that there are no such examples in genus 2).

At this point, we are ready to state the following result:

Theorem(G. Forni, –). Any SL(2,\mathbb{R})-invariant measure \mu supported on the SL(2,\mathbb{R})-orbit of the genus 4 Riemann surface associated to the algebraic equation

w^6=(z-x_1)(z-x_2)(z-x_3)(z-x_4)^3

has ‘degenerate’ Kontsevich-Zorich spectrum: \lambda_2^{\mu}=\lambda_3^{\mu}=\lambda_4^{\mu}=0.

Roughly speaking, the basic idea here is: this Riemann surface is ’sufficiently symmetric’ (i.e., it has a ‘good’ automorphism group); on the other hand, by Forni’s method, the presence of symmetries to show some cancellations of the Lyapounov exponents (and the more symmetries you have, the more cancellations you get); finally, in the case of this surface, it turns out that the cyclic group of automorphisms generated by the symmetry T(z,w)=(z,\varepsilon_6\cdot w) (where \varepsilon_6 is a 6-th root of unity) suffices to completely annilihate the Lyapounov spectrum (except for the ‘trivial’ exponent \lambda_1^{\mu}=1).

Remark 1. It turns out that Forni’s method automatically implies that the cocycle G_t^{KZ} along the SL(2,\mathbb{R})-orbit of this surface is isometric (and moreover the cocycle is trivial in the sense that it is conjugated to constant). This is an interesting phenomenon if you compare with the ‘chaotic‘ behavior exhibited by generic points.

Remark 2. It is not hard to see (via simple arithmetic arguments) that the method of our paper does not produce any new examples of the type

w^N=\prod\limits_{n=1}^{4}(z-x_n)^{a_n}

where 0<a_n<N, \sum\limits_{n=1}^4 a_n\equiv 0 (mod N) and gcd(a_1,\dots,a_4,N)=1 (the first two conditions are imposed to guarantee good symmetries while the third condition is necessary to get a connected Riemann surface). On the other hand, M. Möller told us that there are no further such examples among Veech surfaces (in any genus).

Of course, this theorem is just the tip of the iceberg: for instance, it would be interesting to know whether one can find examples of surfaces with a prescribed number of non-zero Lyapounov exponents (that is, given any 0<r<g, there is a surface with \lambda_r^{\mu}>0=\lambda_{r+1}^{\mu}=\dots=\lambda_g^{\mu}?). It is worth to observe that Forni’s method has a weak point: while we can detect surfaces with all (but one) Lyapounov exponents equal to zero, we can’t prove that a prescribed part of the spectrum vanishes for a given surface and the basic reason is the lack of an explicit formula for the sum of the first r<g exponents although Forni has an explicit formula for the sum of all exponents. Currently, I’m working with my post-doc advisor J.-C. Yoccoz in order to come around this problem by the usage of other methods, but this is still a work in progress… So, I think here is a good point to end this post!

I hope to see you soon in the proof of Asaoka’s theorem! Bye!

Posted by: matheuscmss | September 25, 2008

Asaoka’s lecture on Verjovsky conjecture – part I

My third post on the series of discussions of lectures held at ICTP (Trieste, Italy) concerns Asaoka’s recent solution of a conjecture of Verjovsky.

Let me start by recalling some basic definitions and facts.

Definition 1. A flow \phi^t on a compact manifold M is called an Anosov flow whenever there are a D\phi^t-invariant splitting TM= E^s\oplus T\phi\oplus E^u of the tangent bundle TM of M and a constant \lambda>0 such that

  • T\phi is the one-dimensional subbundle of TM of tangents to the orbits of \phi^t,
  • E^s is uniformly contracted: \|D\phi^t|_{E^s(p)}\|\leq e^{-\lambda\cdot t} for any p\in M and t>0,
  • E^u is uniformly expanded: \|D\phi^{-t}|_{E^u(p)}\|\leq e^{-\lambda\cdot t} for any p\in M and t>0.

We say that \phi^t is a codimension-one Anosov flow if \phi^t is an Anosov flow with \textrm{dim}(E^u)=1.

For sake of comparision, let me remind you the definition of Anosov diffeomorphisms:

Definition 2. A C^r-diffeomorphism f:M\to M is Anosov if there are a $Df$-invarinat splitting TM=E^s\oplus E^u and some constants C>0, \lambda<1 such that

  • E^s is uniformly contracted: \|Df^n|_{E^s(p)}\|\leq e^{-\lambda\cdot n} for any p\in M and n\in\mathbb{N},
  • E^u is uniformly expanded: \|Df^{-n}|_{E^s(p)}\|\leq e^{-\lambda\cdot n} for any p\in M and n\in\mathbb{N}.

Remark 1. As one can expect, there are some “dynamical” differences between these two definitions: Anosov diffeomorphism are discrete-time systems while Anosov flows are continuous-time systems. At a first glance this seems to be of little relevance, but there are fundamental distinct dynamical behaviors hidden here. In fact, given an Anosov flow \phi^t, one can infer some of its properties by the so-called time-one diffeomorphism \phi^1 associated to this flow. It turns out that \phi^1 is not an Anosov diffeomorphism: indeed, although we can decompose the tangent bundle TM=E^s\oplus T\phi\oplus E^u into three D\phi^1-invariant subbundles, since \phi^1 is an isometry along the direction T\phi (exercise), \phi^1 can’t be Anosov due to the presence of this ”neutral” (or central) direction. We will come back to this point later in the discussion.

Now, let us see some basic examples of Anosov systems:

Example 1 (Hyperbolic linear tori automorphisms). Take A a n\times n matrix with integer coefficients and determinant 1, i.e., A\in SL(n,\mathbb{Z}). It is not hard to see that A induces a diffeomorphism f_A on the torus \mathbb{T}^n:=\mathbb{R}^n/\mathbb{Z}^n. Moreover, one can show that f_A is an Anosov diffeomorphism whenever the spectrum \sigma(A) of A doesn’t intersect the unit circle S^1 (i.e., there are no eigenvalues with modulus 1): indeed, E^s is the sum of the eigenspaces associated to eigenvalues of modulus <1 and E^u is the sum of the eigenspaces associated to eigevalues of modulus >1. In this situation, f_A is called a hyperbolic toral automorphism.

Example 2 (Suspension of an Anosov diffeomorphism). Given a map T:M\to M and a “roof” function r:M\to\mathbb{R}^+, one can define a suspension flow associated to (T,r) in the following way: consider the space

\widetilde{M}=\{(x,s)\in M\times\mathbb{R}: 0\leq s<r(x)\}

and the flow

T^t(x,s)=(T^n(x), s+t-r^{(n)}(x))

where n\in\mathbb{N} is the unique integer such that

r^{(n)}(x):=\sum\limits_{j=1}^n r(T^j(x))\leq s+t<\sum\limits_{j=1}^{n+1} r(T^j(x)):=r^{(n+1)}(x).

An interesting feature of this construction is the following fact: the suspension flow of an Anosov diffeomorphism is an Anosov flow (exercise). In particular, we see that the suspension flow of an Anosov diffeomorphism shares the good dynamical properties of original diffeomorphism.

From the example 2, we see that the study of Anosov flows would be significantly more easy provided that a general Anosov flow possesses the dynamical structure of a suspension of an Anosov diffeomorphism. More precisely, our life would be easier if every Anosov flow is topologically conjugated to a suspension of an Anosov diffeomorphism since the dynamics of Anosov diffeomorphisms are well-studied, e.g.:

Theorem (Franks and Newhouse). A codimension-one Anosov diffeomorphism is always topologically conjugated to a hyperbolic toral automorphism.

However, our life is not always easy: Franks and Williams constructed an Anosov flow in a 3-manifold whose non-wandering set is not the whole manifold. In particular, this flow is not conjugated to a suspension flow. Indeed, since our manifold is 3-dimensional, the Anosov flow associated to the suspension is forced to be of codimension one. By the result of Newhouse, the diffeomorphism and, a fortiori, the suspension flow is transitive and, therefore, it can’t be topologically conjugated to a non-transitive system.

Roughly speaking, the idea of Franks and Williams is the following: Anosov flows possesses a central direction where the dynamics is isometric; taking advantage of this neutral direction, they construct a Reeb component preserved by the flow; since the 2-torus of the Reeb foliation separates the 3-torus into two distinct connected components, the flow isn’t transitive.

On the other hand, Verjovsky showed that this phenomenon is purely 3-dimensional: any codimension-one Anosov flow on a n-manifold with n\geq 4 is transitive. Motivated by this result, he made the following conjecture:

Verjovsky conjecture. Any codimension-one Anosov flow on a compact manifold M^n of dimension n\geq 4 is topologically equivalent to a suspension flow over an Anosov diffeomorphism of the torus.

As we already mentioned, Asaoka solved this conjecture based on a preprint of Simic (which is still under revision). More precisely, Asaoka’s strategy is the following:

  • by the theorem of Verjovsky, we know that any codimension-one Anosov flow is transitive (when n\geq 4);
  • by the results of Simic, a codimension-one volume-preserving Anosov is topologically equivalent to the suspension of a hyperbolic toral automorphism;

Thus, the proof of the Verjovsky conjecture is complete once we show the following theorem:

Theorem (Asaoka). Any transitive codimension-one Anosov flow is topologically conjugated to a smooth codimension-one volume-preserving Anosov flow.

Of course, the details of the proof of this result deserves another entire post, so that this is probably the best place to close the current discussion! Ja ne!

Older Posts »

Categories