Lex Fridman PodcastTerence Tao: Hardest Problems in Mathematics, Physics & the Future of AI | Lex Fridman Podcast #472
EVERY SPOKEN WORD
150 min read · 30,046 words- 0:00 – 0:49
Introduction
- LFLex Fridman
The following is a conversation with Terence Tao, widely considered to be one of the greatest mathematicians in history, often referred to as the Mozart of Math. He won the Fields Medal and the Breakthrough Prize in Mathematics, and has contributed groundbreaking work to a truly astonishing range of fields in mathematics and physics. This was a huge honor for me, for many reasons, including the humility and kindness that Terry showed to me throughout all our interactions. It means the world. This is the Lex Fridman Podcast. To support it, please check out our sponsors in the description or at lexfridman.com/sponsors. And now, dear friends, here's Terence Tao.
- 0:49 – 6:16
First hard problem
- LFLex Fridman
What was the first really difficult research-level math problem that you encountered? One that gave you pause maybe?
- TTTerence Tao
Well, I mean, in your undergraduate, um, education, you learn about the really hard impossible problems, like the Riemann hypothesis, the twin primes conjecture. You can make problems arbitrarily difficult, uh, that's not really a problem. In fact, there's even problems that we know to be unsolvable. What's really interesting, uh, are the problems just at the... on the boundary between what we can do perfectly easily and what are hopeless. Um, but what are problems where, like, existing techniques can do like 90% of the job and then y- you just need that remaining 10%. Um, I think as a PhD student, the Kakeya problem certainly caught my eye, and it just got solved actually. It's a problem I've worked on a lot in my early research. Historically, it came from a little puzzle by the Japanese, uh, mathematician Soichi Kakeya, uh, in like 1918 or so. Um, so, the puzzle is that you, you have, um, a needle, um, in s- on the plane, um, or- or think like- like- uh, like driving on- on- on a road or something, and you want to- to execute a U-turn, you want to turn the needle around, um, but you want to do it in as little space as possible, uh, so you want to use this little area, uh, in order to turn it around. So, um, but the needle is infinitely maneuverable. So you can imagine just spinning it around its, um... uh, it's a unit needle. You can spin it around its center, um, and I think, um, that gives you a disc of- of area, I think, pi over 4. Um, or you can do a three-point U-turn, which is what they- we teach people in- in the driving schools to do, uh, and that actually takes area pi over 8. Um, so it's- it's a little bit more efficient than, um, uh, rotation. And so for a while people thought that was the most efficient, uh, way to turn things around. But Bazhukovich, uh, showed that in fact, uh, you could actually, uh, turn the needle around using as little area as you wanted.
- LFLex Fridman
Mm.
- TTTerence Tao
So .001, there was some really fancy multi, um, uh, um, back and forth U-turn thing that you could- you could do, that- that you could turn the needle around. And in so doing, it would pass through every intermediate direction.
- LFLex Fridman
Is this in the two-dimensional plane?
- TTTerence Tao
This is in the two-dimensional plane. Yeah, so we understand everything in two dimensions. So the next question is, what happens in three dimensions? So suppose like the Hubble Space Telescope is a tube in space, and you want to observe every single star in the universe, so you want to rotate the telescope to reach every single direction. And here's the unrealistic part. Suppose that space is at a premium, which it totally is not.
- LFLex Fridman
(laughs) Yeah.
- TTTerence Tao
Uh, you want to occupy as little volume as possible in order to rotate your- your needle around, in order to see every single star in the sky. Um, how small a volume do you need to do that? And so you can modify Bazhukovich's construction. And so if your telescope has zero thickness, then y- you can use as little volume as you need. That's a- a simple modification of the two-dimensional construction. But the question is that if your telescope is not zero thickness but- but just very, very thin, some thickness delta, what is the minimum volume needed to be able to see every single direction as a function of delta? So as delta gets smaller, as the needle gets- gets thinner, the volume should go down, but- but how fast does it go down? Um, and the conjecture was that it goes down very, very slowly, um, like logarithmically, um, uh, roughly speaking. And that was proved after a lot of work. So this seems like a puzzle. Why is it interesting? So it turns out to be surprisingly connected to a lot of problems in partial differential equations, in number theory, in geometry, combinatorics. For example, in- in wave propagation, if you splash some- some water around, um, you create water waves and they- they travel in various directions. Um, but waves exhibit both w- both particle and wave type behavior. So you can have what's called a wave packet, which is like a- a very localized wave that is localized in space and moving a certain direction in time. And so if you plot it in both space and time, it occupies a region which looks like a tube. And so what can happen is that you can have a wave which initially is- is very dispersed, but it all comes... it all focuses at a single point later in time. Like, you can imagine dropping a- a pebble into a pond and ripples spread out. But then if you time reverse that- that, um, that scenario, and the equations of wave motion are time reversible, you can imagine ripples that are converging, um, to a single point, and then a- a big splash occurs, um, maybe even a singularity. Um, and so it's possible to do that, uh, and geometrically what's going on is that there's also light rays. Um, so like if- if- if this wave represents light, for example, um, you can imagine this wave as a superposition of photons, um, all traveling at the speed of light, they- they all travel on these light rays and they're all focusing at this one point. So you can have a very dispersed wave focus into a very concentrated wave at one point in- in space and time, but then it defocuses again and it's- it separates. But potentially, if the conjecture had a negative solution, so what I mean is that there's- there's a very efficient way to pack, um, tubes pointing in different directions into a very, very narrow region of- of- of... uh, very narrow volume. Then you would also be able to create waves that start out... some... there'll be some arrangement of waves that start out very, very dispersed, but they would concentrate not just at a single point but, um, um, there'll be a- a large, um... uh, there'll be a lot of concentrations in space and time. And, uh, um, and you could create what's called a blowup, where these waves, their amplitude becomes so great that the laws of physics that they're governed by are no longer wave equations, but something more complicated and nonlinear. Um-And so in mathematical physics, we care a lot about whether certain equations in- in- in wave equations are stable or not, uh, whether they can create, um, these singularities.
- 6:16 – 26:26
Navier–Stokes singularity
- TTTerence Tao
There's a famous unsolved problem called the Navier-Stokes regularity problem. So the Navier-Stokes equations, the equations that govern the fluid flow for incompressible fluids, like water, the question asked, if you start with a smooth velocity field of water, can it ever concentrate so much that, like, the velocity becomes infinite at some point? Uh, that's called a singularity. We don't see that, um, in real life. You know, if you splash around water on- on the bathtub, it won't explode on you, um, or- or have p- have water leaving at the speed of light or anything, but potentially it is possible. Um, and in fact, in recent years, the- the consensus has- has drifted towards the, uh, the- the belief that, uh, that in fact for certain very special con- initial con- uh, configurations of- of, say, water, that singularities can form. But people have not yet been able to, uh, to actually establish this. The Clay Foundation has these seven Millennium Prize problems, has a million dollar prize for solving one of these problems, uh, this is one of them, of these seven, only one of them has been solved, uh, uh, the Poincaré conjecture by Perlmutter. So the Kakeya conjecture is not directly, directly related to the Navier-Stokes problem, but understanding it would help us understand some aspects of things like wave concentration, which would indirectly probably help us, uh, understand the Navier-Stokes problem better.
- LFLex Fridman
Can you speak to the Navier-Stokes, so the existence and smoothness, like you said, Millennial Prize problem-
- TTTerence Tao
Right.
- LFLex Fridman
...you've made a lot of progress on this one. In 2016, you published a paper, Finite Time Blow-Up For An Averaged Three-Dimensional Navier-Stokes Equation.
- TTTerence Tao
Right. (laughs)
- LFLex Fridman
(laughs) So- so we're trying to figure out if this thing usually doesn't blow up-
- TTTerence Tao
Right.
- LFLex Fridman
...but can we say for sure it never blows up?
- TTTerence Tao
Right, yeah. So yeah, that is literally the- the- the million dollar question, yeah. So this is what distinguishes mathematicians from pretty much everybody else, like if- if- if something ho- holds 99.99% of the time, um, that's good enough for most, you know, uh, for- for- for- for most things. But mathematicians are one of the few people who really care about whether every, like 100%, really 100% of all, um, situations are covered by- by, um... Yeah, so most fluids, most of the time, um, s- water d- does not blow up, but could you design a s- very special initial state that does this?
- LFLex Fridman
And maybe we should say that this is a, this is a set of equations that govern in the field of fluid dynamics-
- TTTerence Tao
Yes.
- LFLex Fridman
...yeah, trying to understand how a fluid behaves and it's actually turns out to be a really compli-... You know, fluid is-
- TTTerence Tao
Yeah.
- LFLex Fridman
...extremely complicated thing to try to model.
- TTTerence Tao
Yeah, so it has practical importance. So this Clay Prize problem concerns what's called the incompressible Navier-Stokes, which governs things like water. There's something called the compressible Navier-Stokes, which governs things like air, and that's particularly important for weather prediction. Weather prediction, it's just a lot of computational fluid dynamics, a lot of it is actually just trying to solve the Navier-Stokes equations as- as best they can. Um, also gathering a lot of data so that they can get, they can ini- initialize the equation. There's a lot of moving parts. So it's very important problem practically.
- LFLex Fridman
Why is it difficult to prove general things about the set of equations like it not not blowing up?
- TTTerence Tao
The short answer is Maxwell's demon.
- LFLex Fridman
(laughs) .
- TTTerence Tao
Uh, so Maxwell's demon is a concept in thermodynamics, like if you have a box with two gases in, o- uh, oxygen and nitrogen, uh, and maybe you start with all the oxygen on one side and nitrogen on the other side, but there's no barrier between them, right? Then they will mix, um, and they should stay mixed, right? There's- there's- there's no reason why they should unmix. But in principle, because of all the collisions between them, there could be some sort of weird conspiracy that- that, um, like maybe there's a microscopic demon called Maxwell's demon that will, um, every time a oxygen and nitrogen atom collide, they will bounce off in such a way that the oxygen sort of drifts onto one side and then nitrogen goes to the other, and, uh, you could have an extremely improbable configuration emerge, uh, which we never see, um, and- and wh- which statistically it's extremely unlikely. But mathematically, it's possible that this can happen and we can't rule it out. Um, and this is a s- situation that shows up a lot in mathematics, um, uh, basically this example is the digits of pi, 3.14159 and so forth, the digits look like they have no pattern, and we believe they have no pattern. On the long term, you should see as many ones and twos and threes as fours and fives and sixes, uh, that there should be no preference in the digits of pi to favor, let's say, seven over eight. Um, but maybe there's some demon in the digits of pi that- that, like, every time you compute more and more digits, it sort of biases one digit to another, um, and this is a conspiracy that should not happen. There's no reason it- it should happen, but, um, there's- there's- there's no way to prove it, uh, with our current technology. Okay, so getting back to Navier-Stokes, a fluid has a certain amount of energy, and because the fluid is in motion, the energy gets transported around, and, uh, water is also viscous. So if the energy is spread out over many different locations, the natural viscosity of the fluid will just damp out the energy and it will- it will- it will go to zero. Um, and this is what happens, um, in, um, uh, when we actually experiment with water, like it, you splash around, there's- there's some turbulence and- and waves and so forth, but eventually it- it settles down and- and- and the- the lower the amplitude, the- the smaller the velocity, the- the more calm it gets. Um, but potentially, there is some sort of demon that keeps pushing the, uh, the energy of the fluid into a smaller and smaller scale and it will move faster and faster. And at faster speeds, the effect of viscosity is relatively less, and so it could happen that- that it- it creates a s- some sort of, um, um, what's called a self-similar blob scenario where, you know, um, the energy of the fluid sta- starts off at some, um, large scale and then it all sort of, um, transfers its energy into a smaller, um, region of- of- of the fluid, which then at a much faster rate, um, moves into, um, an even s- smaller, uh, region and so forth. Um, and- and each time it does this, uh, it takes maybe half as- as- as long as- as- as the previous one, and then s- uh, you could- you could- you could actually, uh, converge to- to all the energy, um, concentrating in one point in a- a finite amount of time. Um, and that- that- that's, uh, that scenario is called finite time blow-up. Um, so in practice, this doesn't happen.Um, so water is what's called turbulent. Um, so it is true that, um, if you have a big eddy of water, it will tend to break up into smaller eddies, but it won't transfer all the s- the energy from one big eddy into one small eddy. It will transfer it into maybe three or four. And then those ones split up into maybe three or four small eddies of their own. And so the energy gets dispersed to the point where the viscosity can, can then k- keep everything under control. Um, but if it can somehow, um, concentrate, um, all the energy, keep it all together, um, and do it fast enough that, that the viscous effects, uh, don't have enough time to calm everything down, then this blowup can occur. So there are papers who had claimed that, "Oh, you just need to take into account conservation of energy and just carefully use the viscosity and you can keep everything under control for not just in Navier-Stokes, but for many, many types of equations like this." And so like in, in the past, there had been many attempts to try to obtain what's called global regularity for Navier-Stokes, which is the opposite of final time blowup, that velocity stays smooth, and it all failed. There was always some sign error or some subtle mistake and, and it c- it couldn't be salvaged. Um, so what I was interested in doing was trying to explain why we were not able to disprove, um, final time blowup. I couldn't do it for the actual equations of fluids, which were too complicated, but if I could average the equations of motion of Navier-Stokes, so physically if, if, um, if I could turn off certain types of, of ways in which water interacts and only keep the ones that I want. Um, so in particular, um, if, um, if it's a fluid and it could transfer its energy from a large eddy into this small eddy or this other small eddy, I would turn off the energy channel that would transfer energy to this, this one and, and direct it only into, um, this smaller eddy while still preserving the law of conservation of energy.
- LFLex Fridman
So you're trying to make it blow up?
- TTTerence Tao
Yeah. Yeah, so I, I, I basically engineer, um, a blowup by changing the laws of physics, which is one thing that mathematicians are allowed to do. We can change the equation.
- LFLex Fridman
How does that help you get closer to the proof of something?
- TTTerence Tao
Right. So it provides what's called an ob- obstruction in mathematics. Um, so, so what I did was that, uh, basically if I turned off the, um, certain parts of the equation so that... which usually when you turn off certain interactions make it less nonlinear, it makes it more regular and less likely to blow up. But I found that by turning off a very well designed set of, of, of, of, of, of interactions, I could force all the energy to blow up, uh, in finite time. So what that means is that if you wanted to prove, um, global regularity for Navier-Stokes, um, for the actual equation, you had... you must use some feature of the true equation which, which my artificial equation, um, does not satisfy. So it, it rules out certain, um, certain approaches. So, um, the thing about math is, is it's not just about finding or taking a technique that is gonna work and applying it, but you, you need to not take the techniques that don't work. Um, and for the problems that are really hard, often there are dozens of ways that y- you might think might apply to solve the problem, but, uh, it's only after a lot of experience that you realize there's no way that these methods are going to work. So having these counterexamples for nearby problems, um, kind of rules out, um, uh... it, it saves you a lot of time because you, you, you're not wasting, um, energy on, on things that you now know cannot possibly ever work.
- LFLex Fridman
How deeply connected is it to that specific problem of fluid dynamics, or is it some more general intuition you build up about mathematics?
- TTTerence Tao
Right. Yeah. So the key phenomenon that, uh, was, uh, my technique exploits is what's called supercriticality. So in partial differential equations, often these equations are like a tug of war between different forces. So in Navier-Stokes there's the dissipation, um, force coming from viscosity and, and it's very well understood. It's linear, it calms things down. If, if viscosity was all there was, then, then nothing bad would ever happen. Um, but there's also transport, um, that, that energy from in one location of space can get transported because the fluid is in motion to, to other locations. Um, and that's a nonlinear effect and that causes all the, all the problems. Um, so there are these two competing terms in the Navier-Stokes equation, uh, the dissipation term and the transport term. If the dissipation term dominates, if it's, if it's large then basically you get regularity and if, um, if the transport term dominates then, uh, then we don't know what's going on. It's a very nonlinear situation. It's unpredictable. It's turbulent. So sometimes these forces are in balance at small scales but not in balance at large scales or, or vice versa. Um, so Navier-Stokes is what's called supercritical. So at, at smaller and smaller scales the transport terms are much stronger than the viscosity terms. So the viscosity terms are the things that calm things down, um, and so this is, um, um, this is why the problem is hard. In two dimensions, so the Soviet mathematician, uh, Ledochinskaya, she, in the '60s showed in two dimensions there was no blowup. And in two dimensions, the Navier-Stokes equations is what's called critical. The effect of transport and the effect of viscosity are about the same strength even at very, very small scales. And we have a lot of technology to handle critical and also subcritical equations and prove, um, regularity. But for supercritical equations it was not clear what was going on, and I did a lot of work and then there's been a lot of follow-up showing that for many other types of supercritical equations you can create all kinds of blowup examples. Once the nonlinear effects dominate the linear effects at small scales you can have all kinds of bad things happen. So this is sort of one of the main insights of this, this line of work is that supercriticality versus criticality and subcriticality, this, this makes a big difference. I mean, the, that's a key qualitative feature that distinguishes some equations from being sort of nice and predictable and, and, you know, like, like planetary motion and... I mean, there, there's certain equations that, that you can predict for millions of years and, uh, or, or thousands at least. Again, it's not really a problem. But, but there's a reason why we can't predict the weather past two weeks into the future because it's a supercritical equation. Lots of really strange things are going on at, at very fine scales.
- LFLex Fridman
So whenever there is some huge source of nonlinearity-
- TTTerence Tao
Yeah.
- LFLex Fridman
... that can create a huge problem for predicting what's gonna happen?
- TTTerence Tao
Yeah. And if the nonlinearity is somehow more and more featured and interesting at, at small scales.Um, I mean there's- there's many equations that are nonlinear, but, um, in many- in- in many equations you can approximate things by the bulk. Um, so for example, planetary motion. You know, if you wanted to, uh, understand the orbit of th- the moon or Mars or something, you don't really need the microstructure of, uh, like the seismology of the moon or- or like exactly how the mass is distributed, um, you just... basically, you- you can almost approximate these planets by point masses. Uh, and you... it's just the aggregate behavior is important. Um, but if you want to model a fluid, um, like- like the weather, you can't just say, "In Los Angeles, the temperature is this, the wind speed is this." For supercritical equations, the finest-grain information is- is really important.
- LFLex Fridman
If we can just linger on the Navier–Stokes, uh, equations a little bit. So you've suggested... maybe you can describe it, that one of the ways to, uh, solve it, or to negatively resolve it, would be to... sort of to construct a liquid, a kind of liquid computer-
- 26:26 – 33:01
Game of life
- TTTerence Tao
there's a lot of work previously on what are called c- cellular automata.
- LFLex Fridman
Mm-hmm.
- TTTerence Tao
Um, the most famous of which is Conway's Game of Life. There's this infinite discrete grid, and any given time, the grid is either occupied by a cell or it's empty. And there's a very simple rule that, uh, that tells you how these cells evolve. So s- sometimes cells live, and sometimes they, they die. Um, and there's a, um, you know, um, when I was a, a, a student, uh, it was a very popular screensaver to actually just have these, these animations l- going. And, and they look very chaotic. In, in fact they look a little bit like turbulent flow sometimes. But at some point, people discovered more and more interesting structures within this Game of Life. Um, so for example they discovered this thing called a glider. So a glider is a v- very tiny configuration of like four or five cells, which evolves, and it just moves at a certain direction. And that's like this, this vortex rings, this analog. Um, yeah, so this is an a- analogy. The Game of Life is kinda like a discrete equation and, and, um, the fluid Navier-Stokes is, is a continuous equation. But mathematically, they have some similar features. Um, and, um, so over time people discovered more and more interesting things that you could build within the Game of Life. The Game of Life is a very simple system. It only has like three or four rules, um, to, to do it. But, but you can design all kinds of interesting configurations inside it. Um, there's something ca- called a glider gun that does nothing but spit out gliders one at a t- one, one at a time. Um, and then after a lot of effort people managed to, to create, um, AND gates and OR gates for gliders. Like there's this massive ridiculous structure which if you, if a, if a, if you have a stream of gliders, um, coming in here and a stream of gliders coming in here, then you may produce extreme gliders coming out if, if maybe, maybe if both of, of the, um, streams, um, have gliders then there'll be an- an ou- output stream. But if only one of them does, then nothing comes out.
- LFLex Fridman
Mm-hmm.
- TTTerence Tao
So they could build something like that. And once you could build an, um, these basic gates then just from software engineering you can build almost anything. Um, uh, you can build a Turing machine. I mean, it, it's, it's like an enormous steampunk type things. They look ridiculous, but then people also generated self-replicating objects in the Game of Life, a massive machine, a Bonomo machine, which over a lo- huge period of time and it always looked like glider guns inside doing these very steampunk calculations. It would create another version of itself which could replicate-
- LFLex Fridman
That's so incredible.
- TTTerence Tao
A lot of this was like community crowdsourced by, uh, like amateur mathematicians actually. Um, so I knew about that, that, that work, and so th- that is part of what inspired me to propose the same thing for Navier-Stokes. Um, that really just... analog is much... I should say analog is much worse than digital. Like, uh, it's gonna be, um... you can't just directly take the constructions in the Game of Life and plunk them in, but again, it just, it shows it's possible.
- LFLex Fridman
You know, there's a kinda emergence that happens with these cellular automata local rules.
- TTTerence Tao
Mm-hmm.
- LFLex Fridman
Maybe it's similar to fluids, I don't know, but local rules operating at scale can create these incredibly complex dynamic structures. Do you think any of that is amenable to mathematical analysis? Do we have the tools to say something profound about that?
- TTTerence Tao
The thing is you can get this emergent very complicated structures but only with very carefully prepared initial conditions, yeah? So, so these, these, these glider guns and, and gates and, and software machines, if you just plunk down randomly some cells and you run the Game of Life you will not see any of these. Um, and th- that's the analogous situation with Navier-Stokes again, you know, that, that with, with typical initial conditions you will n- you will not have any of this weird computation going on. Um, but basically through engineering, you know, by, by, by s- specially designing things in a very special way you can make clever constructions.
- LFLex Fridman
I wonder if it's possible to prove the sort of the negative of like basically prove that only through engineering can you ever create-
- TTTerence Tao
Yeah. Yeah, yeah, yeah.
- LFLex Fridman
... something interesting.
- TTTerence Tao
This, this is a recurring challenge in mathematics that, um-... I call it the dichotomy between structure and randomness, that most objects that you can generate in mathematics are random. Uh, they look like ran- like, the digits of pi, well, we believe is a good example. Um, but there's a very small number of things that have patterns. Um, but, um, now, you can prove something has a pattern by just constructing, you know, like if, if something has a simple pattern and you have a proof that it, it does something like repeat itself every so often, you can do that. But, um, and you c- you can prove that, that, for example, you can, you can prove that most sequences of, of digits have no pattern. Um, so like if, if you just pick digits randomly, there's something called the law of large numbers that tells you that you're going to get as many oneses as, as twos in the long run. Um, but, um, we have a lot fewer tools to, to, to... If I give you a specific pattern like the digits of pi, how can I show that this doesn't have some weird pattern to it? Some other work that I spend a lot of time on is to prove what are called structure theorems or inverse theorems that give tests for when something is, is very structured. So some functions are what's called additive, like if you have a function that maps the natural numbers to the natural numbers, so maybe, um, you know, two maps to four, three maps to six and so forth. Um, some functions are what's, what's called additive, which means that if you add, if you add two inputs together, the output gets, gets added as well. Uh, for example, uh, multiplying by a constant, if you multiply a number by 10, um, if you, i- i- if you multiply A plus B by 10, that's the same as multiplying A by 10 and B by 10 and then adding them together, so some, um, functions are additive. Some functions are kind of additive but not completely additive. Um, so for example, if I take a number N, I multiply by the square root of two and I take the integer part of that, so 10 by the square root of two is like 14 point something, so 10 will map to 14. Um, 20 will map to 28. Um, so in that case, a- additivity is true then, so 10 plus 10 is 20 and 14 plus 14 is 28, but because of this rounding, uh, sometimes there's round off errors and, and sometimes when you, um, add A plus B, this function doesn't quite give you the sum of, of the two individual outputs, but the sum plus or minus one. Um, so it's almost additive, but not quite additive. Um, so there's a lot of useful results in mathematics and I've worked a lot on developing things like this, to the effect that if, if a function exhibits some structure like this, then, um, it's basically, th- there's a reason for why it's true and the reason is because there's, there's some other nearby function which is actually, um, com- completely structured which is explaining this sort of partial pattern that you have. Um, and so if you have these sort of inverse theorems, it, um, it creates this sort of dichotomy that, that either the objects that you study are either have no structure at all or they are somehow related to something that is structured. Um, and in either way, in either, um, uh, in either case you can make progress. Um,
- 33:01 – 38:07
Infinity
- TTTerence Tao
a good example of this is that there's this old theorem in mathematics called Szemerédi's theorem, uh, proven in the 1970s. It concerns trying to find a certain type of pattern in a set of numbers and the, the pattern is arithmetic progression, things like 3, 5, and 7, or, or, or 10, 15, and 20. And Szemerédi, Endre Szemerédi proved that, um, any set of, of numbers that are sufficiently big, um, what's called, what's called positive density, has, um, arithmetic progressions in it of, of any length you wish. Um, so for example, um, the odd numbers have a set of density one half, um, and they contain arithmetic progressions of any length. Um, so in that case it's obvious because the, the od- the odd numbers are really, really structured. I can just take, uh, uh, 11, 13, 15, 17, I can just, I can, I can easily find arithmetic progressions in, in, in that set. Um, but, um, Szemerédi's theorem also applies to random sets. If, if I take the set of all numbers and I flip a coin, um, and I, uh, for each number and I only keep th- the numbers which, for which I got a heads, okay, so I just flip coins, I just randomly take out half the numbers, I keep one half, so that's a set that has no, no patterns at all, but just from random fluctuations you will still get a lot of, um, um, of arithmetic pro- progressions in that set.
- LFLex Fridman
Can you prove that there's arithmetic progressions of arbitrary length within a random-
- TTTerence Tao
Yes. Um, have you heard of the infinite monkey theorem? Usually mathematicians give boring names to theorems, but occasionally-
- LFLex Fridman
Yeah.
- TTTerence Tao
... they, they give colorful names.
- LFLex Fridman
Yes.
- TTTerence Tao
The popular version of the infinite monkey theorem is that if you have an infinite number of monkeys in a room with each a typewriter, they type out, uh, text randomly. Almost surely one of them is going to generate the entire score of Hamlet or any other finite string of text. Uh, it will just take some time, uh, quite a lot of time actually. But if you have an infinite number, then it happens. Um, so, um, basically the theorem says that if you take an infinite string of, of digits or whatever, um, eventually any finite pattern you wish will emerge. Uh, it may take a long time, but it will eventually happen. Um, in particular, arithmetic progressions of any length will eventually happen. Okay, but you need a, b- but you need an extremely long random sequence for this to happen.
- LFLex Fridman
I suppose that's intuitive, it's just infinity.
- TTTerence Tao
Yeah, infinity absorbs a lot of sins.
- LFLex Fridman
Yeah. How are we humans supposed to deal with infinity?
- TTTerence Tao
Well, you can think of infinity as, as, as an abstraction of, um, a finite number for which you, you do not have a bound full, um, that, uh, you know, I mean, so nothing in real life is truly infinite, um, but, you know, you can, um, you know, you can ask these sort of questions like, "What if I had as much money as I wanted?" You know, or, "What if I could go as fast as I wanted?" And a way in which mathematicians formalize that is, mathematics has found a formalism to idealize instead of something being extremely large or extremely small, to actually be exactly infinite or zero, um, and often the, the mathematics becomes a, a lot cleaner when you do that. I mean, in, in physics we, we joke about, uh, assuming spherical cows. Um, you know, like, real world problems have got all kinds of real world effects but you can idealize s- send some things to infinity, send so- some things to zero, um, and, uh, and the mathematics becomes a lot simpler to work with then.
- LFLex Fridman
I wonder how often using infinity, uh, forces us to deviate from, um, the physics of reality.
- TTTerence Tao
Yeah, so there's, there's a lot of pitfalls, um, so, you know, we, we spend a lot of time in, uh, undergraduate math classes teaching analysis, um, and analysis is often about how to take limits and, and, and whether you... You know, so for example, A plus B is always B plus A.Um, so when, when you have a finite number of terms and you add them, you can swap them and there's n- there's no problem. But when you have an infinite number of terms, there are these sort of show games you can play where you can have a series which converges to one value, but you rearrange it and suddenly it converges to another value, and so you can make mistakes. You have to know what you're doing when you allow infinity. Um, you have to introduce these epsilons and deltas and, and, and there's, there's a certain type of way of reasoning that helps you avoid mistakes. Um, in more recent years, um, uh, people have started taking results that are true in, in infinite limits and try to
- LFLex Fridman
(laughs)
- TTTerence Tao
... what's ca- what's called finitizing them. Um, so you know that something's true eventually, but, um, you don't know when. Now, give me a rate. Okay, so such that if I have, don't have an infinite number of monkeys but, but a large finite number of monkeys, how long do you have to wait for Hamlet to come out? Um, and, uh, that's a more qua- quantitative question. Um, and this is something that you can, you can, um, attack by purely finite methods and you can use your finite intuition. Um, and i- in this case it turns out to be exponential in the length of the text that you're, you're trying to generate. Um, so if... um, and so this is why you never see the monkeys create Hamlet. You can maybe see them create a four-letter word, but nothing that big. And so I personally find once you finitize an infinite statement, it's, it does become much more intuitive a- and it's no longer so, so weird. Um...
- LFLex Fridman
So even if you're working with infinity, it's good to finitize so that you can have some intuition.
- TTTerence Tao
Yeah. The downside is that the finitized proofs are just much, much messier.
- LFLex Fridman
Yeah.
- TTTerence Tao
And, and, uh, yeah, so the, so the infinite ones are found first usually, like decades earlier, uh, and then later on people finitized
- 38:07 – 44:26
Math vs Physics
- TTTerence Tao
them.
- LFLex Fridman
So since we mentioned a lot of the math and a lot of physics-
- TTTerence Tao
Mm-hmm.
- LFLex Fridman
... uh, what to you is the difference between mathematics and physics as disciplines, as ways of understanding, of seeing the world? Maybe we can throw in engineering in there. You mentioned your wife is an engineer, give a new perspective on circuits.
- TTTerence Tao
Right.
- LFLex Fridman
So there's a different way of looking at the world given that you've done mathematical physics, so you, you've, you've worn all the hats.
- TTTerence Tao
Right. So I think science in general is an interaction between three things. Um, there's the real world, um, there's what we observe of the real world, our observations, and then our mental models as to h- how we think the world w- works. Um, so, um, we can't directly access reality, okay? Uh, all we have are the, uh, observations, which are incomplete and they, they have errors, um, and, um, there are many, many cases where we would, um, uh, we want to know, for example, what is the weather like tomorrow and we don't yet ha- have the observation and we'd like to make a prediction. Um, and then we have these simplified models, sometimes making unrealistic assumptions, you know, spherical cow type things. Those are the mathematical models.
- LFLex Fridman
Mm-hmm.
- TTTerence Tao
Mathematics is concerned with the models. Science collects the observations and it proposes the models that might explain these observations. What mathematics does i- uh, we, we stay within the model and we ask, "What are the consequences of that model? What observations, what, what predictions would the model make of the, of future observations? Uh, or past observations, does it fit observed data?" Um, so there, there's definitely a symbiosis. Um, it's... matha- I guess mathematics is, is unusual among other disciplines is that we start from hypotheses, like the axioms of a model, and ask what conclusions come up from that m- model. Um, in almost any other discipline, uh, you start with the conclusions. You know, "I want to do this. I want to build a bridge," you know, "I want, I want to, to make money. I want to do this." Okay, and then you, you, you find the paths to get there. Um, a lot... there's, uh, there's a lot less sort of speculation about, "Suppose I did this, what would happen?" Um, you know, planning and, and, and modeling. Um, uh, speculative fiction maybe is, is one other place, uh, but, uh, that's about it actually. Most of the things we do in life is conclusions driven, including physics and sci... you know, I mean, they want to know, you know, where is this asteroid gonna go, you know, or what, what, "What is the weather gonna be tomorrow?" Um, but, um, mathematics also has this other direction of, of going from the, uh, the axioms.
- LFLex Fridman
W- what do you think... there is this tension in physics between theory and experiment.
- TTTerence Tao
Mm-hmm.
- LFLex Fridman
What do you think is a more powerful way of discovering truly novel ideas about reality?
- TTTerence Tao
Well, you need both, top-down and bottom-up. Um, yeah, it's just a, it's, it's really an interaction between all these things. So over time, the observations and the theory and the modeling should g- both g- get closer to reality. But initially, and it is, I mean, uh, this is, um, this is always the case, you know, they're, they're always far apart to begin with. Um, but you need one to figure out, uh, where, where to push the other, you know? So, um, if your model is predicting anomalies, um, that are not picked up by experiment, that tells experimenters where to look, you know, um, to, to, to, to, to find more data to refine the models. Um, you know, so it, it, it goes, it goes back and forth. Um, within mathematics itself, there's, there's also a theory and experimental component. It's just that until very recently, theory has dominated almost completely. Like 99% of mathematics is theoretical mathematics, and there's a very tiny amount of experimental mathematics. Um, I mean, people do do it, you know. Like if they want to study prime numbers or whatever, they can just generate large datasets and with a com-... so once we had computers, um, we began to do it a little bit. Um, although even before... well, like Gauss for example, he discovered or he conjectured the most basic theorem in, in number theory, it's called the prime number theorem, which predicts how many primes that... up to a million, up to a trillion. It's not an obvious question. And basically what he did was that he computed, uh, I mean, mostly u- um, by himself, but also hired human computers, um, people who, whose professional job it was to do arithmetic, um, to compute the first 100,000 primes or something and made tables and made a prediction. Um, that was an early example of experimental mathematics. Um, but until very recently, it was not, um... yeah, I mean, theoretical mathematics was just much more successful. I mean, of course, doing complicated mathematical computations is, uh, was just not, not feasible, uh, until very recently.... and even nowadays, you know, even though- though we have powerful computers, only some mathematical things can be, um, explored numerically. There's something called the combinatorial explosion. If you want us to study, for example, Zermelo's theorem, we want to study all possible subsets of the numbers one to 1,000, there's only 1,000 numbers. How bad could it be? It turns out the number of different subsets of- of one to 1,000 is two to the power of 1,000, which is way bigger than- than- than any computer can currently can- can enu- in fact, any computer ever will ever, um, enumerate. Um, so if you have to be, um... There are certain math problems that very quickly become just in- intractable to attack by direct brute force computation. Uh, chess is another, um, uh, famous example. Uh, it's the number of chess positions, uh, we can't get a c- computer to fully explore. But now we have AI, um, um, we have tools to explore the space not with 100% guarantees of s- success, but with experiment, you know. So like, um, we can empirically solve chess now, right? For example, uh, we have, we have, uh, very, very good AIs that they can... You know, they don't explore every single position in- in the game tree, but they have found some very good approximation. Um, and people are using, actually, these chess engines, uh, to make... uh, to do experimental chess. Um, that, uh, they're- they're revisiting old chess theories about, oh, you know, when you... this type of opening, you know, that- that this is good... this is a good type of move, this is not, and then using chess engines to actually, uh, refine, and- and in some cases overturn, um, um, conventional wisdom about chess. And I- I do hope that, uh, that mathematics will- will have a larger experimental component in the future, perhaps powered by AI.
- LFLex Fridman
We'll, of course, ta- talk about that, but in the case of chess, and there's a similar thing in mathematics, the... I don't believe it's providing a kind of formal explanation of the different positions.
- TTTerence Tao
No.
- LFLex Fridman
It's just saying which position is better or not, and then you can intuit as a human being. And then from that, we humans can construct-
- TTTerence Tao
Yes.
- LFLex Fridman
... a theory of the matter.
- 44:26 – 1:07:09
Nature of reality
- LFLex Fridman
- TTTerence Tao
Right.
- LFLex Fridman
You've mentioned the Plato's cave allegory.
- TTTerence Tao
Mm-hmm.
- LFLex Fridman
So, in case people don't know, it's where people are observing shadows of reality, not reality itself, and they believe what they're observing to be reality. Is that, in some sense, what mathematicians and maybe all humans are doing, is, um, looking at shadows of reality? Do... Is it possible for us to truly access reality?
- TTTerence Tao
Well, there are these three ontological things. There's actual reality, there's our observations, and our- our models. Um, and- and technically, they are distinct, and I think they will always be distinct, um...
- LFLex Fridman
Right.
- TTTerence Tao
... but they can get closer, um, over time. Um, you know, so, um... And the process of getting closer often means that you- you have to discard your initial intuitions, um, so, um, a- astronomy pro- provides great examples, you know, like, you know, like, you'll... An initial model of the world is that it's flat because it's- it looks flat, you know, uh, and, um, and that it's... and it's big, you know, and the rest of the universe, the skies is not... You know, like, the sun, for example, looks really tiny-
- LFLex Fridman
Mm-hmm.
- TTTerence Tao
... um, and so you- you start off with a model which is actually really far from reality, um, but it fits kind of the observations that you have, um, you know. So, you know, so things look good for... you know. But- but over time, as you make more and more observations, bringing it closer to- to- to reality, um, the model gets dragged along with it, you know. And so, over time, we had to realize that the Earth was round, that it spins, it goes around the solar system, the solar system goes around the galaxy, and so on and so forth, and the- the guy's part of the universe is expanding. Um, the expansions are self-expanding, accelerating, and in fact, very recently, in- in this year or so, there's a... even the explosion of the universe itself is, uh, there's evidence now that it's- it's non-constant.
- LFLex Fridman
And, uh, the explanation behind why that is, is-
- TTTerence Tao
It's catching up. Um... (laughs)
- LFLex Fridman
(laughs) it's catching up. I mean, it's still, you know, the dark matter, dark energy-
- TTTerence Tao
Yeah.
- LFLex Fridman
... this- this kind of thing.
- TTTerence Tao
Mm-hmm. Yes. We have a- we have a model that sort of explains... that fits the data really well. It just has a few parameters that, um, uh, you have to specify. Um, but so, you know, people say, "Oh, that's fudge factors," you know, with- with enough fudge factors, you can- you can explain anything. Um, yeah, but, uh, the mathematical point of the model is that, um, you want to have fewer parameters in your model than data points in your observational set. So, if you have a model with 10 parameters that explains 10 out- 10 observations, that is a completely useless model. It's what's called over-fitted. But, like, if you have a model with- with, you know, two parameters and it explains a- a trillion observations, which is basically... uh... so, uh, yeah, the- the- the dark matter model, I think, has like 14 parameters and it explains petabytes of data, um, that- that- that- that the astronomers have. Um, you can think of a- of a theory, uh, like one way to think about, um, uh, a physical mathematical theory... a- a theory is- it's- it's a compression of- of the universe, um, and, uh, uh, data compression. So, you know, you have these petabytes of observations, you like to compress it to a model which you can describe in five pages and specify a certain number of parameters, and if it can fit to reasonable accuracy, you know, a- almost all of- of the observations, that... I mean, the more compression that you make, the better your theory.
- LFLex Fridman
In fact, one of the great surprises of our universe and of everything in it is that it's compressible at all.
- TTTerence Tao
Yeah.
- LFLex Fridman
It's the unreasonable effectiveness of mathematics.
- TTTerence Tao
Yeah. Einstein had a quote like that. The- the most incomprehensible thing about the universe is that it is comprehensible.
- LFLex Fridman
Right. And not just comprehensively. You can do an equation like E equals MC squared.
- TTTerence Tao
There is actually a- some mathematical possible explanation for that. Um, so there's this phenomenon in mathematics called universality. So, many complex systems at the macroscale are coming out of lots of tiny interactions at- at the macroscale, and normally, because of the combinatorial explosion, you would think that, uh, the macroscale equations must be like infinitely, exponentially more complicated than- than the, uh, the microscale ones. And they are, if you want to solve them completely exactly. Like if you want to model, um, all the atoms in a box of- of air, uh, that's like Avogadro's number is humongous, right? There's a huge number of particles. If you actually had to track each one, it would be ridiculous. But certain laws emerge at the microscopic scale that almost don't depend on what's going on at the macroscale or only depend on a very small number of parameters. So, if you want to model a gas, um, of, you know-... quintillion particles in, in a box. You just need to know its temperature and pressure and volume and a few parameters, like five or six, and it models almost everything you, you need to know about these 10 to the 23 or whatever particles. Um, so we, we have, um, we, we don't understand universality anywhere near as we would like mathematically, but there are much simpler toy models where we do, um, have a good understanding of why univers- universality occurs. Um, uh, most basic one is, is the central limit theorem. That explains why the bell curve shows up everywhere in nature, that so many things are distributed by, uh, what's called a Gaussian distribution, uh, the famous bell curve. Uh, there's now even a meme with this curve.
- LFLex Fridman
And even the meme applies broadly.
- TTTerence Tao
Yeah.
- LFLex Fridman
There's a universality to the meme.
- TTTerence Tao
Yeah, yes, you can call meta-
- LFLex Fridman
(laughs)
- TTTerence Tao
... uh, if you like. But there are many, many processes. For example, you c- you can take lots and lots of independent, um, random variables and average them together, um, uh, in, in various ways. You basically... you could take a simple average or more complicated average, and we can prove in various cases that, that these, these bell curves, these Gaussians emerge, and it is a satisfying, satisfying explanation. Um, sometimes they don't. Um, so, so if you have many different inputs and they're all correlated in some s- systemic way, then y- you can get something very far from a bell curve to show up. Uh, and this is also important to know when this fails. So universality is not a 100% reliable thing to rely on. In fact, um, um, the, the global financial crisis was a, a famous example of this. Uh, people thought that, uh, um, mortgage de- defaults, um, um, had this sort of, um, Gaussian-type behavior that, that if you, if you ask... if a population of, of, of, uh, you know, 100,000 Americans with mortgages, ask what, what proportion of them would default on their mortgages, um, if everything was de-correlated, it would be a nice bell curve and, and like you can, you can, you can manage risk with options and derivatives and so forth and, um, and it is a very beautiful theory. Um, but if there are systemic shocks in the economy (laughs) , uh, that can push everybody to default at the same time, uh, that's very non-Gaussian behavior. Um, and, uh, this wasn't fully accounted for (laughs) in, uh, 2008. Uh, now I think there's some more awareness that this is a... systemic risk is actually a, a much bigger issue. And, uh, just because the model is pretty, uh, and nice, uh, it may not match reality. And so, so the mathematics of working out what models do is really important, um, but, um, also the, the, the science of validating when the models, uh, fit reality and when they don't, um, I mean, that... you need both. Um, and... but mathematics can help because it c- it can, uh... for example, these central limit theorems, it, it tells you that, that if you have certain axioms like, like, like, uh, non-correlation, that if all the inputs were not correlated to each other, um, then you have this Gaussian behavior so things are fine. It, it tells you where to look for weaknesses in the model. So if you have a mathematical understanding of central limit theorem and someone proposes to use these Gaussian copulas or whatever to mo- to model, um, default risk, um, if you're mathematically, um, trained, you would say, "Okay, but what are the systemic correlation between all your inputs?" And so then the... then you can ask the economists, you know, "How, how, how much of a risk is that?" Um, and then you can, you can, you can go look for that. So there's always this, this, this synergy between science and, and mathematics.
- LFLex Fridman
A little bit on the topic of universality...
- TTTerence Tao
Mm-hmm.
- 1:07:09 – 1:13:10
Theory of everything
- TTTerence Tao
- LFLex Fridman
Does your gut say that there is a theory of everything, so this is po- even possible to unify, to find this language that unifies general relativity and quantum mechanics?
- TTTerence Tao
I believe so. I mean, the history of physics has been that unification, much like mathematics, um, over the years. You know, electricity and magnetism were, were separate theories, and then Maxwell unified them. You know, Newton unified the, the motions of the heavens with the motions on, of, of objects on the earth and so forth. So it should happen. Uh, it's just that the, um, uh... again, to go back to this model of the, of the observations and, and, and theory, part of our problem is that physics is a victim of its own success. That, uh, two big theories of, of, of physics, general relativity and quantum mechanics, are so, are so good now that together they cover 99.9% of sort of all the observations we can make. Um, and you have to, like, either go to extremely insane particle accelerations or, or the early universe or, or, or things that are really hard to measure, um, in order to get any deviation from either of these two theories to the point where you can actually figure out how to, how to c- c- combine them together. Um, but I have faith that we... you know, we've, we've, we've been doing this for centuries, and we've made progress before. There, there's no reason why we should stop.
- LFLex Fridman
Do you think it will be a mathematician that develops a theory of everything?
- TTTerence Tao
What often happens is that when the physicists need, uh, um, some theory of math- mathematics, there's often some precursor that the mathematicians, um, worked out earlier. So when Einstein started realizing that space was curved, he went to some mathematician and asked, you know, "Is there, is there some theory of curved space that mathematicians already came up with that could be useful?" And he said, "Oh, yeah. There's a... I think, uh, Riemann came up with something." Um, and so, yeah, Riemann had developed, you know, Riemannian geometry, um, which is precisely, um, you know, a, a, a, a theory of spaces that are curved in, in various general ways, which turned out to be almost exactly what was needed, um, for Einstein's theory. This is going back to, to Wigner's unreasonable effectiveness of mathematics. I think the theories that work well to explain the universe tend to also involve the same mathematical objects that work well to solve mathematical problems.
- LFLex Fridman
Mm-hmm.
- TTTerence Tao
Ultimately, they just have both ways of organizing data, um, in, in, in, in useful ways.
- LFLex Fridman
It just feels like you might need to go to some weird land that's very hard to, to intuit, like-
- TTTerence Tao
Yeah, yeah.
- LFLex Fridman
... you know, you have, like, string theory.
- TTTerence Tao
Yeah. That, that's, that was, that was a leading candidate for many decades. It's, I think it's slowly falling out of fashion because it's, it's not matching experiment, uh...
- LFLex Fridman
So one of the big challenges, of course, like you said, is experiment is very tough-
- TTTerence Tao
Yes.
- LFLex Fridman
... because of the how effective-
- TTTerence Tao
Yeah.
- LFLex Fridman
... both theories are. But the other is, like, just, you know, you're talking about you're not just deviating from spacetime. You're going into, like, some crazy number of dimensions.
- TTTerence Tao
Yeah. Yeah.
- LFLex Fridman
You're doing all kinds of weird stuff that to us... We've gone so far from this flat Earth that we started at-
- TTTerence Tao
Yes.
- LFLex Fridman
... like you mentioned. (laughs)
- TTTerence Tao
Yeah, yeah, yeah, yeah.
- LFLex Fridman
And now we're just... it's, it's very hard to use our limited ape descendants of, uh, uh, cognition to intuit what that reality really is like.
- TTTerence Tao
This is why analogies are so important, you know. I mean, so yeah, when... the round Earth is not intuitive because we're, we're stuck on it, um, but, you know, but y- y- y- you know, but round objects in general, we have pretty good intuition, a little bit. Uh, and we have intuition about light works and so forth. And, like, it's, it's actually a good exercise to actually work out how eclipses and phases of, of the sun and the moon and so forth can be really easily explained by, by, by, by round Earth and round moon, you know, um, and models. Um, and, and i- and you can just take, you know, a basketball and a golf ball and, and a, and, and a light source and actually do these things yourself. Um, so the intuition is there, um, but yeah, you have to transfer it.
- LFLex Fridman
That is a big leap intellectually for us to go from flat to round Earth-
- TTTerence Tao
(laughs)
- LFLex Fridman
... because, you know, our life is mostly lived in flat land.
- TTTerence Tao
Yeah.
- LFLex Fridman
To load that information, and we all, like, take it for granted. We take so many things for granted because science has established a lot of evidence-
- TTTerence Tao
Mm-hmm.
- LFLex Fridman
... for this kind of thing, but, you know, we're on a r- r- ro- round rock (laughs) -
Episode duration: 3:14:33
Install uListen for AI-powered chat & search across the full episode — Get Full Transcript
Transcript of episode HUkBz-cdB-k
Get more out of YouTube videos.
High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.
Add to Chrome