254A, Lecture 1: Overview

8 January, 2008 in 254A - ergodic theory, math.DS | Tags: course overview, ergodic theory, Ratner's theorem, recurrence theorem | by Terence Tao

In this lecture, I define the basic notion of a dynamical system (as well as the more structured notions of a topological dynamical system and a measure-preserving system), and describe the main topics we will cover in this course.

We’ll begin abstractly. Suppose that X is a non-empty set (whose elements will be referred to as points), and $T: X \to X$ is a transformation. Later on we shall put some structures on X (such as a topology, a $\sigma$ -algebra, or a probability measure), and some assumptions on T, but let us work in total generality for now. (Indeed, a guiding philosophy in the first half of the course will be to try to study dynamical systems in as maximal generality as possible; later on, though, when we turn to more algebraic dynamical systems such as nilsystems, we shall exploit the specific structure of such systems more thoroughly.)

One can think of X as a state space for some system, and T as the evolution of some discrete deterministic (autonomous) dynamics on X: if x is a point in X, denoting the current state of a system, then Tx can be interpreted as the state of the same system after one unit of time has elapsed. (In particular, evolution equations which are well-posed can be viewed as a continuous dynamical system.) More geometrically, one can think of T as some sort of shift operation (e.g. a rotation) on the space X.

Given X and T, we can define the iterates $T^n: X \to X$ for every non-negative integer n; if T is also invertible, then we can also define $T^n$ for negative integer n as well. In the language of representation theory, T induces a representation of either the additive semigroup ${\Bbb Z}^+$ or the additive group ${\Bbb Z}$ . (From the dynamical perspective, this representation is the mathematical manifestation of time.) More generally, one can consider representations of other groups, such as the real line ${\Bbb R}$ (corresponding the dynamics $t \mapsto T^t$ of a continuous time evolution) or a lattice ${\Bbb Z}^d$ (which corresponds to the dynamics of d commuting shift operators $T_1,\ldots,T_d: X \to X$ ), or of many other semigroups or groups (not necessarily commutative). However, for simplicity we shall mostly restrict our attention to ${\Bbb Z}$ -actions in this course, though many of the results here can be generalised to other actions (under suitable hypotheses on the underlying semigroup or group, of course).

Henceforth we assume T to be invertible, in which case we refer to the pair $(X,T)$ as a cyclic dynamical system, or dynamical system for short. Here are some simple examples of such systems:

Finite systems. Here, X is a finite set, and $T: X \to X$ is a permutation on X.
Group actions. Let G be a group, and let X be a homogeneous space for G, i.e. a non-empty space with a transitive G-action; thus X is isomorphic to $G/\Gamma$ , where $\Gamma := \hbox{Stab}(x)$ is the stabiliser of one of the points x in X. Then every group element $g \in G$ defines a dynamical system $(X, T_g)$ defined by $T_g x := gx$ .
Circle rotations. As a special case of Example 2, every real number $\alpha \in {\Bbb R}$ induces a dynamical system $({\Bbb R}/{\Bbb Z}, T_\alpha)$ given by the rotation $T_\alpha x := x + \alpha$ . This is the prototypical example of a very structured system, with plenty of algebraic structure (e.g. the shift map $T_\alpha$ is an isometry on the circle, thus two points always stay the same distance apart under shifts).
Cyclic groups. Another special case of Example 2 is the cyclic group ${\Bbb Z}/N{\Bbb Z}$ with shift $x \mapsto x+1$ ; this is the prototypical example of a finite dynamical system.
Bernoulli systems. Every non-empty set $\Omega$ induces a dynamical system $(\Omega^{\Bbb Z}, T)$ , where T is the left shift $T (x_n)_{n \in {\Bbb Z}} := (x_{n+1})_{n \in {\Bbb Z}}$ . This is the prototypical example of a very pseudorandom system, with plenty of mixing (e.g. the shift map tends to move a pair of two points randomly around the space).
Boolean Bernoulli system. This is isomorphic to a special case of Example 5, in which $X = 2^{\Bbb Z} := \{ A: A \subset {\Bbb Z} \} \equiv \{0,1\}^{\Bbb Z}$ is the power set of the integers, and $TA := A-1 := \{ a-1: a \in A \}$ is the left shift. (Here we endow $\{0,1\}$ with the discrete topology.)
Baker’s map. Here, $X := [0,1)^2$ , and $T(x,y) := (\{2x\}, \frac{y + \lfloor 2x \rfloor}{2})$ , where $\lfloor x \rfloor$ is the greatest integer function, and $\{x\} := x - \lfloor x \rfloor$ is the fractional part. This is isomorphic to Example 6, as can be seen by inspecting the effect of T on the binary expansions of x and y.

The map $T^n$ can be interpreted as an isomorphism in several different categories:

as a set isomorphism (i.e. a bijection) $T^n: X \to X$ from points $x \in X$ to points $T^n x \in X$ ;
as a Boolean algebra isomorphism $T^n: 2^X \to 2^X$ from sets $E \subset X$ to sets $T^n E := \{ T^n x: x \in E \}$ ; or
as an algebra isomorphism $T^n: {\Bbb R}^X \to {\Bbb R}^X$ ) from real-valued functions $f: X \to {\Bbb R}$ to real-valued functions $T^n f: X \to {\Bbb R}$ , defined by
$T^n f(x) := f( T^{-n} x );$ (1)
as an algebra isomorphism $T^n: {\Bbb C}^X \to {\Bbb C}^X$ of complex valued functions, defined exactly as in 3.

We will abuse notation and use the same symbol $T^n$ to refer to all of the above isomorphisms; the specific meaning of $T^n$ should be clear from context in all cases. Our sign conventions here are chosen so that we have the pleasant identities

$T^n \{x\} = \{ T^n x \}; \quad T^n 1_E = 1_{T^n E}$ (2)

for all points x and sets E, where of course $1_E$ is the indicator function of E.

One of the main topics of study in dynamical systems is the asymptotic behaviour of $T^n$ as $n \to \infty$ . We can pose this question in any of the above categories, thus

For a given point $x \in X$ , what is the behaviour of $T^n x$ as $n \to \infty$ ?
For a given set $E \subset X$ , what is the behaviour of $T^n E$ as $n \to \infty$ ?
For a given real or complex-valued function $f: X \to {\Bbb R}, {\Bbb C}$ , what is the behaviour of $T^n f$ as $n \to \infty$ ?

These are of course very general and vague questions, but we will formalise them in many different ways later in the course. (For instance, one can distinguish between worst-case, average-case, and best-case behaviour in x, E, f, or n.) The answer to these questions also depends very much on the dynamical system; thus a major focus of study in this subject is to seek classifications of dynamical systems which allow one to answer the above questions satisfactorily. (In particular, ergodic theory is a framework in which our understanding of the dichotomy between structure and randomness is at is most developed.)

One can also ask for more quantitative versions of the above asymptotic questions, in which n ranges in a finite interval (e.g. ${}[N] := \{1,\ldots,N\}$ for some large integer N), as opposed to going off to infinity, and one wishes to estimate various numerical measurements of $T^n x$ , $T^n E$ , or $T^n f$ in this range.

In this very general setting, in which X is an unstructured set, and T is an arbitrary bijection, there is not much of interest one can say with regards to these questions. However, one obtains a surprisingly rich and powerful theory when one adds a little bit more structure to X and T (thus changing categories once more). In particular, we will study the following two structured versions of a dynamical system:

Topological dynamical systems $(X,T) = (X,{\mathcal F},T)$ , in which $X = (X,{\mathcal F})$ is a compact metrisable (and thus Hausdorff) topological space, and T is a topological isomorphism (i.e. a homeomorphism); and
Measure-preserving systems $(X,T) = (X,{\mathcal X},\mu,T)$ , in which $X = (X,{\mathcal X},\mu)$ is a probability space, and $T$ is a probability space isomorphism, i.e. T and $T^{-1}$ are both measurable, and $\mu(TE) = \mu(E)$ for all measurable $E \in {\mathcal X}$ . [In this course we shall tilt towards a measure-theoretic perspective rather than a probabilistic one, thus it might be better to think of $\mu$ of as a normalised finite measure rather than as a probability measure. On the other hand, we will rely crucially on the probabilistic notions of conditional expectation and conditional independence later in this course.] For technical reasons we also require the measurable space $(X, {\mathcal X})$ to be separable (i.e. ${\mathcal X}$ is countably generated).

Remark 1. By Urysohn’s metrisation theorem, a compact space is metrisable if and only if it is Hausdorff and second countable, thus providing a purely topological characterisation of a topological dynamical system. $\diamond$

[It is common to add a bit more structure to each of these systems, for instance endowing a topological dynamical system with a metric, or endowing a measure preserving system with the structure of a standard Borel space; we will see examples of this in later lectures.] The study of topological dynamical systems and measure-preserving systems is known as topological dynamics and ergodic theory respectively. The two subjects are closely analogous at a heuristic level, and also have some more rigorous connections between them, so we shall pursue them in a somewhat parallel fashion in this course.

Observe that we assume compactness in 1. and finite measure in 2.; these “boundedness” assumptions ensure that the dynamics somewhat resembles the (overly simple) case of a finite dynamical system. Dynamics on non-compact topological spaces or infinite measure spaces can be quite nasty, and there does not appear to be a useful general theory in these cases is a more complicated topic; see for instance this book of Aaronson. [Updated, Jan 9; thanks to Tamar Ziegler for the link.]

Note that the action of the isomorphism $T^n$ on sets E and functions f will be compatible with the topological or measure-theoretic structure:

If $(X,T) = (X,{\mathcal F},T)$ is a topological dynamical system, then $T^n: {\mathcal F}\to {\mathcal F}$ is a topological isomorphism on open sets, and $T^n: C(X) \to C(X)$ is also a $C^*$ -algebra isomorphism on the space $C(X)$ of real -valued (or complex-valued) continuous functions on X.
If $(X,T) = (X, {\mathcal X}, \mu, T)$ is a measure-preserving system, then $T^n: {\mathcal X} \to {\mathcal X}$ is a $\sigma$ -algebra isomorphism on measurable sets, and $T^n: L^p({\mathcal X},\mu) \to L^p({\mathcal X},\mu)$ is a Banach space isomorphism on $p^{th}$ -power integrable functions for $1 \leq p\leq \infty$ . (For $p=\infty$ , $T^n$ is a von Neumann algebra isomorphism, whilst for $p=2$ , $T^n$ is a Hilbert space isomorphism (i.e. a unitary transformation).)

We can thus see that tools from the analysis of Banach spaces, von Neumann algebras, and Hilbert spaces may have some relevance to ergodic theory; for instance, the spectral theorem for unitary operators is quite useful.

In the first half of this course, we will study topological dynamical systems and measure-preserving systems in great generality (with few assumptions on the structure of such systems), and then specialise to specific systems as appropriate. This somewhat abstract approach is broadly analogous to the combinatorial (as opposed to algebraic or arithmetic) approach to additive number theory. For instance, we will shortly be able to establish the following general result in topological dynamics:

Birkhoff recurrence theorem. Let (X,T) be a topological dynamical system. Then there exists a point $x \in X$ which is recurrent in the sense that there exists a sequence $n_j \to\infty$ such that $T^{n_j} x \to x$ as $j \to\infty$ .

As a corollary, we will be able to obtain the more concrete result:

Weyl recurrence theorem. Let $P: {\Bbb Z} \to {\Bbb R}/{\Bbb Z}$ be a polynomial (modulo 1). Then there exists a sequence $n_j \to \infty$ such that $P(n_j) \to P(0)$ .

This is already a somewhat non-trivial theorem; consider for instance the case $P(n) := \sqrt{2} n^2 \hbox{ mod } 1$ .

In a similar spirit, we will be able to prove the general topological dynamical result

Topological van der Waerden theorem. Let $(U_\alpha)_{\alpha \in A}$ be an open cover of a topological dynamical system (X,T), and let $k\geq 1$ be an integer. Then there exists an open set U in this cover and a shift $n \geq 1$ such that $U \cap T^n U \cap \ldots \cap T^{(k-1)n} U \neq \emptyset$ . (Equivalently, there exists U, n, and a point x such that $x, T^n x, \ldots, T^{(k-1)n} x \in U$ .)

and conclude an (equivalent) combinatorial result

van der Waerden theorem. Let ${\Bbb N} = U_1 \cup \ldots \cup U_m$ be a finite colouring of the integers. Then one of the colour classes $U_j$ contains arbitrarily long arithmetic progressions.

More generally, topological dynamics is an excellent tool for establishing colouring theorems of Ramsey type.

Analogously, we will be able to show the following general ergodic theory result

Furstenberg multiple recurrence theorem. Let $(X,T)$ be a measure-preserving system, let $E \in {\mathcal X}$ be a set of positive measure, and let $k \geq 1$ . Then there exists $n\geq 1$ such that $E \cap T^n E \cap \ldots \cap T^{(k-1)n} E \neq \emptyset$ (or equivalently, there exists $x \in X$ and $n \geq 1$ such that $x, T^n x, \ldots, T^{(k-1)n} \in E$ ).

Similarly, if $f: X \to {\Bbb R}^+$ is a bounded measurable non-negative function which is not almost everywhere zero, and $k \geq 1$ , then

$\liminf_{N \to \infty} \frac{1}{N} \sum_{n=1}^N \int_X f T^n f \ldots T^{(k-1) n} f > 0$ . (3)

and deduce an equivalent (and highly non-trivial) combinatorial analogue:

Szemerédi’s theorem. Let $E \subset {\Bbb Z}$ be a set of positive upper density, thus $\limsup_{N \to \infty} \frac{|E \cap [-N,N]|}{2N+1} > 0$ . Then E contains arbitrarily long arithmetic progressions.

More generally, ergodic theory methods are extremely powerful in deriving “density Ramsey theorems”. Indeed, there are several theorems of this type which currently have no known non-ergodic theory proof. [From general techniques in proof theory, one could, in principle, take an ergodic theory proof and mechanically convert it into what would technically be a non-ergodic proof, for instance avoiding the use of infinitary objects, but this is not really in the spirit of what most mathematicians would call a genuinely new proof.]

The first half of this course will be devoted to results of the above type, which apply to general topological dynamical systems or general measure-preserving systems. One important insight that will emerge from analysis of the latter is that in many cases, a large portion of the measure-preserving system is irrelevant for the purposes of understanding long-time average behaviour; instead, there will be a smaller system, known as a characteristic factor for the system, which completely controls these asymptotic averages. A deep and powerful fact is that in many situations, this characteristic factor is extremely structured algebraically, even if the original system has no obvious algebraic structure whatsoever. Because of this, it becomes important to study algebraic dynamical systems, such as the group actions on homogeneous spaces described earlier, as it allows one to obtain more precise results. (For instance, this algebraic structure was used to show that the limit in (3) actually converges, a result which does not seem accessible purely through the techniques used to prove the Furstenberg recurrence theorem.) This study will be the focus of the second half of the course, particularly in the important case of nilsystems – group actions arising from a nilpotent Lie group with discrete stabiliser. One of the key results here is Ratner’s theorem, which describes the distribution of orbits $\{ T^n x: n \in {\Bbb Z} \}$ in nilsystems, and also in a more general class of group actions on homogeneous spaces. It is unlikely that we will end up proving Ratner’s theorem in full generality, but we will try to cover a few special cases of this theorem.

In closing, I should mention that the topics I intend to cover in this course are only a small fraction of the vast area of ergodic theory and dynamical systems; for instance, there are parts of this field connected with complex analysis and fractals, ODE, probability and information theory, harmonic analysis, group theory, operator algebras, or mathematical physics which I will say absolutely nothing about here.

[Update, Jan 28: Definition of measure-preserving system modified to add the separability condition.]

30 comments

Comments feed for this article

9 January, 2008 at 12:11 am

Tamar Ziegler

Hi Terry,

Just a comment regarding infinite measure preserving systems. There
is a rather developed general theory in this case as well. There is an overview
paper on Jon Aaronson’s web page (he has also written a book on the subject).

Best,
Tammy

9 January, 2008 at 12:38 am

Anonymous

Dear Terry,

What text book would you recommend for this topic? Thanks.

–Anon

9 January, 2008 at 8:18 am

Anonymous 2

Dear Prof. Tao, are you going to have the notes available in a more “printable” format such as pdf? I hope so.

Thank you.

9 January, 2008 at 10:20 am

Terence Tao

Dear Tammy: Thanks for the heads-up! I’ve updated the relevant paragraph.

Dear Anonymous: Right now I am using Furstenberg’s “Recurrence in Ergodic Theory and Combinatorial Number Theory” and Glasner’s “Ergodic Theory via Joinings”, together with a large number of miscellaneous on-line resources such as papers, course notes, and even Wikipedia. Some of these are linked from my web page for this course.

Dear Anonymous 2: I am planning to compile several of the blog posts here into a book format; more information to follow soon.

9 January, 2008 at 6:25 pm

Anonymous

Dear Prof.Tao:
Anonymous 2’s suggestion may be worth consideration. Your blog is excellent and I benefit a lot from it. But sometimes when I want to read the post carefully, I have to view online. Could you also provide pdf or dvi version of “short story article”? May it add to some technical difficulties? Many thanks.

9 January, 2008 at 6:46 pm

darren

Since Furstenberg’s book is out of print and quite hard to find, you might also want to point people to “Ergodic Theory” by Petersen. It covers the ergodic theoretic proof of Szemeredi’s Theorem and everything that leads up to it but unfortunately does not cover Ratner’s theorem.

9 January, 2008 at 7:05 pm

Richard

You can find used copies of Furstenberg’s book at ABE book search:

http://www.abebooks.com/

This web site is a great source for locating out of print math books.

They also have multiple copies of Ellis’ Lectures on Topological Dynamics.

10 January, 2008 at 5:46 am

Anonymous

‘Dynamical Systems and Ergodic Theory’ by Pollicott and Yuri may be useful. It covers Szemerédi’s theorem and is available electronically:

http://www.warwick.ac.uk/~masdbl/book.html

10 January, 2008 at 8:26 pm

254A, Lecture 2: Three categories of dynamical systems « What’s new

[…] dynamical systems, topological dynamical systems, and measure-preserving systems (as defined in the previous lecture), it is convenient to give these three classes the structure of a category. One of the basic […]

10 January, 2008 at 11:46 pm

anon

Pollicott and Yuri’s book has good chapters on recurrence theorems, but if I recall correctly, you should watch out for typos!

13 January, 2008 at 5:35 am

Nilay

Sir

Can you explain the meaning of the notation \Omega^\mathbb{Z} that has been used in the Bernoulli System example for representing the set of the states of such a system?

Nilay

13 January, 2008 at 4:44 pm

Terence Tao

Dear Nilay,

When A and B are sets, then ${}A^B$ refers to the space of all functions from B to A, or equivalently the space of all sequences $(a_b)_{b \in B}$ in A indexed by B. See

http://en.wikipedia.org/wiki/Exponentiation#Exponentiation_over_sets

15 January, 2008 at 9:44 am

Anon 9

Regarding anonymous 2’s suggestion, I wish someone should make a tool that does automatically what anonymous 2 suggested. That tool would take any wordpress math blog article and produce a tex file or a pdf file.

23 August, 2017 at 10:16 pm

Krishna Bhogaonker

@Anon 9 Actually, I believe that Pandoc can do this. Pandoc can convert between latex, html, markdown, and any variety of syntaxes.

28 January, 2008 at 4:11 am

Matas

Furstenberg multiple recurrence theorem: why should not “E \cap T^n E \cap \ldots \cap T^{(k-1)n} E \neq \emptyset” be changed to “E \cap T^{-n} E \cap \ldots \cap T^{-(k-1)n} E \neq \emptyset” ?

28 January, 2008 at 10:40 am

Terence Tao

Dear Matas,

The two statements are equivalent (if you apply $T^{-(k-1)n}$ to the former and rearrange, you obtain the latter, and similarly one can obtain the latter from the former).

20 February, 2008 at 1:56 pm

Jose Brox

Dear prof. Tao:

Thank you for making these notes available for people who can not possibly attend your classes (I’m from Spain!).

I have one doubt: Does “mod 1” refer to taking the fractional part? Then, which are the purpose and the advantage of writting it as “modulo arithmetic” instead of using a fractional operator? I looked for it on the net, but I couldn’t find any source! (Any links would make for it)

Thanks!

20 February, 2008 at 2:42 pm

Terence Tao

Dear Jose,

I use $x \mapsto x \mod 1$ to denote the quotient map from the real line ${\Bbb R}$ to the unit circle ${\Bbb R}/{\Bbb Z}$ (thus, if one views ${\Bbb R}/{\Bbb Z}$ as the space of cosets $\{ x + {\Bbb Z}: x \in {\Bbb R} \}$ , then $x \mod 1 = x + {\Bbb Z}$ ). The fractional part map $x \mapsto \{x\}$ is essentially a branch of the quotient map, formed by identifying the unit circle ${\Bbb R}/{\Bbb Z}$ with a unit interval such as ${}[0,1)$ or ${}(-1/2,1/2]$ (depending on one’s conventions), and so $\{x\}$ is a real number rather than an element of the circle ${\Bbb R}/{\Bbb Z}$ . Thus, for instance, the fractional part of 3.4 would be 0.4, but $3.4 \hbox{ mod } 1$ would be the coset $3.4 + {\Bbb Z} = 0.4 + {\Bbb Z}$ . The two concepts are of course very closely related, but not quite the same: for instance, the map $x \mapsto x \hbox{ mod } 1$ is continuous and a homomorphism (thus $x \hbox{ mod } 1 + y \hbox{ mod } 1 = (x+y) \hbox{ mod } 1$ ), while the fractional part map $x \mapsto \{x\}$ is neither.

20 February, 2008 at 3:32 pm

Jose Brox

Thanks, it’s crystal clear now :-)

6 April, 2008 at 12:44 am

Luca Goldoni

Dear prof Tao,

thank you so many for your very deep and very clear lectures.

27 September, 2008 at 11:50 am

What is a gauge? « What’s new

[…] 2: circle extensions of a dynamical system. Recall (see e.g. my lecture notes) that a dynamical system is a pair X = (X,T), where X is a space and is an invertible map. (One […]

28 October, 2008 at 3:03 am

Nara

Dear Prof. Tao,

Thanks so much to make your lecture note to be publicly available.
I totally appreciate it.

I have a silly question since I’m kind of new with the ergodic theory . I wonder if there is a dynamical system that does not leave any measure preserved.
And it would be very generous of you if you may give several examples for different cases, that is, for X is finite, sigma-finite, or infinite.

Thanks

28 October, 2008 at 2:16 pm

Terence Tao

Dear Nara,

Actually, every topological dynamical system has at least one invariant measure; this is known as the Krylov-Bogolyubov theorem and appears as Corollary 1 in Lecture 7:

254A, Lecture 7: Structural theory of topological dynamical systems

	Jianfeng Li on An introduction to measure…
	Jianfeng Li on An introduction to measure…
	Jianfeng Li on An introduction to measure…
	Jianfeng Li on An introduction to measure…
	Anonymous on 275A, Notes 3: The weak and st…
	Anonymous on 275A, Notes 3: The weak and st…
	Anonymous on Dense sets of natural numbers…
	Anonymous on Dense sets of natural numbers…
	Anonymous on 275A, Notes 3: The weak and st…
	Anonymous on Dense sets of natural numbers…
	Fabrice Imparato on Dense sets of natural numbers…
	Anonymous on A computation-outsourced discu…
	Anonymous on Dense sets of natural numbers…
	Anonymous on 245A, Notes 3: Integration on…
	Samuel German on Analysis I

254A, Lecture 1: Overview

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

30 comments

Leave a comment Cancel reply

For commenters

254A, Lecture 1: Overview

Share this:

Recent Comments

Articles by others

Diversions

Mathematics

Selected articles

Software

The sciences

Top Posts

Archives

Categories

The Polymath Blog

30 comments

Leave a comment Cancel reply

For commenters