How a reasoning model cracked an 80-year-old math problem — the OpenAI Podcast Ep. 20

Last month AI found something mathematicians had missed for decades. Reasoning researchers Alexander Wei, Hongxun Wu, and Lijie Chen join the podcast to discuss how a general-purpose model helped disprove an 80-year-old conjecture from famed mathematician Paul Erdős. They walk through the moment the result started looking real, what it took to verify the proof, and what’s happened since sharing the discovery with the world. They also explore what this means for the future of math and for researchers learning to work with AI. Chapters 0:44 AI and the International Math Olympiad and International Olympiad of Informatics 6:35 An OpenAI model disproves the Erdős unit distance conjecture 8:33 Running the model and checking the proof 11:04 Why general models matter for discovery 15:55 Creativity, tools, and how the proof worked 18:25 Why AI should feel empowering for mathematicians 22:31 Advice for researchers using AI 27:24 What comes next for math and AI research 37:30 Cryptography, quantum computing, and the future

Andrew MaynehostLijie ChenguestHongxun WuguestAlexander Weiguest

Jun 4, 202641mWatch on YouTube ↗

EVERY SPOKEN WORD

40 min read · 7,922 words

0:00 – 0:44
Intro
1. AMAndrew Mayne
  Hello, I'm Andrew Mayne, and welcome to the OpenAI Podcast. On today's episode, we're speaking with Alexander Wei, Hongxing Wu, and Lijie Chen from the reasoning research team behind a recent math breakthrough from an OpenAI model. They'll tell us the story behind the discovery and what stood out to them about the reaction.
2. LCLijie Chen
  Everyone had a hard time sleeping because it's so [laughs] so exciting, yeah.
3. HWHongxun Wu
  Okay, this model is something that's really amazing.
4. LCLijie Chen
  I mean, this is something that can be published in the best journal of maths.
5. AWAlexander Wei
  Maybe this is the one in 100 times where it's too good to be true, but it's, it's actually true.
6. AMAndrew Mayne
  Lijie, tell me what you work on.
7. LCLijie Chen
  Oh, w- I work on reasoning with Alex.
8. AMAndrew Mayne
  Okay. How did you find your way into reasoning?
0:44 – 6:35
AI and the International Math Olympiad and International Olympiad of Informatics
1. LCLijie Chen
  Last summer, Alex had this breakthrough in, like, IOI and IMO.
2. AMAndrew Mayne
  Mm-hmm.
3. LCLijie Chen
  You know, I used to be a parti- participant in IOI.
4. AMAndrew Mayne
  Okay.
5. LCLijie Chen
  And, uh, that was like, oh, that's crazy, you know, a model can already win medals, uh, um, gold medals. At that time, I was a profes- assistant professor at UC Berkeley, but then I'm thinking, like, maybe I should try to rethink my, [laughs] my career.
6. AMAndrew Mayne
  [laughs]
7. LCLijie Chen
  It seems like making the model smarter will have, maybe have some bigger impact on the, on the world. And that, and then I just kind of had a conversation with Alex, like, back in n- last October, and then I got super excited about this thing, and eventually I just joined OpenAI.
8. AMAndrew Mayne
  We hear IOI and IMO come up a lot. Alex, you wanna unpack those for everybody?
9. AWAlexander Wei
  So IMO and IOI are, are these two competitions for high schoolers. Uh, they stand for the International Math Olympiad and In- International Olympiad of Informatics respectively, and these are just devilishly hard math problems. Uh, you get two sessions for each of these exams that are, like, four and a half to five hours, and you just have to do three problems. And so for a long time, uh, these were sort of an implicit, like, grand challenge in AI. Like, when would we be able to get models that could perform as best, as well as the best humans on these exams?
10. AMAndrew Mayne
  That was a pretty interesting starting point, I think, for measuring the success of the model, and we're here to talk about how far things have gone since then, which is pretty incredible. But how did you find your way into reasoning?
11. AWAlexander Wei
  So I did my PhD in ML.
12. AMAndrew Mayne
  Mm.
13. AWAlexander Wei
  Um, and towards the end of my PhD, I got excited about this idea of spending more compute at inference time-
14. AMAndrew Mayne
  Mm
15. AWAlexander Wei
  ... to solve, uh, you know, harder and harder reasoning problems. At the time, I was playing with, uh, GPT 3.5 Turbo in the API.
16. AMAndrew Mayne
  Mm-hmm.
17. AWAlexander Wei
  And, uh, I, I didn't really get any interesting results, but there was this team at OpenAI that was, seemed to be doing something pretty similar, and so I got super excited about it, and, you know, was lucky enou- enough to be able to join.
18. AMAndrew Mayne
  So probably the simplest way to describe that is, like, letting the inference time is basically letting the model think longer about it.
19. AWAlexander Wei
  Yes, that's right. So basically, um, before this era of test-time compute, models sort of answered immediately without, like, right off the cuff-
20. AMAndrew Mayne
  Mm
21. AWAlexander Wei
  ... without thinking. And what inference-time compute, test-time compute does is you now g- let the model, give the model a chance to think and improve its answer and, like, try different things before having to finally output something.
22. AMAndrew Mayne
  Mm.
23. AWAlexander Wei
  That obviously just h- helps make the model smarter, let- lets them do things that they wouldn't otherwise be able to do instantly.
24. AMAndrew Mayne
  When you started to work on reasoning, did you have an idea of where you wanted to see this go? Like, what your expectations were? Were you looking at it purely from, "Hey, this is very cool from an academic point of view," or did you have some sort of other vision?
25. AWAlexander Wei
  Uh, I think for me, the draw of reasoning when I first got excited about it was that this was something that, you know, models just obviously can't do right now.
26. AMAndrew Mayne
  Mm-hmm.
27. AWAlexander Wei
  Um, so this was like end of 2023, start of 2024. Models were, you know, struggling with grade school math problems.
28. AMAndrew Mayne
  Mm-hmm.
29. AWAlexander Wei
  And so at that time it was just like, can we just get the, these models to do something reasonable on m- math at all, let alone, like, have them be like, you know, like, you know, much, much better than I am at it? Uh, I remember when I s- the first, my first day at OpenAI, uh, Nolan Brown asked me, uh, you know, when I thought models would get IMO gold. Uh, that, that was, uh, just a benchmark we talked about. I think at the time, a lot of people, even within research, thought that, like, you know, uh, IMO gold was out of reach this year, but maybe, like, 2026.
30. AMAndrew Mayne
  Mm.
6:35 – 8:33
An OpenAI model disproves the Erdős unit distance conjecture
1. AMAndrew Mayne
  You had a model that was able to basically disprove one of the Erdős conjectures. Would you, could you explain that just a little bit?
2. AWAlexander Wei
  Yeah, so our models last week, uh, they were able to produce a proof of the u- or a disprove rather, of the unit distance conjecture due to Erdős, and this was an 80-year-old open problem in the field of, field of, uh, combinatorial geometry, where basically the question concerns, uh, if you have endpoints, let's say on a piece of paper-
3. AMAndrew Mayne
  Mm-hmm
4. AWAlexander Wei
  ... how many of them can be one inch apart, uh, exactly? And h- how, how many pairs can be one inch apart exactly? And how does this number grow asymptotically with the number of points on the piece of paper?
5. AMAndrew Mayne
  This wasn't a, a trivial problem. When Erdős put this together, the idea was to say that it could, you know, like, I think ideally was had to be only done on a plane or something like this, but there was, you know, the idea that maybe there was no better way. And this has been out there because it's a very interesting problem, and the fact that a model solved this is pretty profound, and also this model was a general-purpose model, correct?
6. AWAlexander Wei
  Yes, that's right. So Erdős' original conjecture was essentially that, uh, the optimal cons- the optimal solution to, uh, having as many, uh, distance one points on the plane was to arrange them in a square grid.
7. AMAndrew Mayne
  Mm-hmm.
8. AWAlexander Wei
  And what the model proved was that the square grid was not actually, uh, close to optimal at all, and that you can do much better, uh, with a different construction using a lot of, uh, high-powered number theory.
9. AMAndrew Mayne
  Hongxun, how did you, how did you choose these problems?
10. HWHongxun Wu
  I, I guess we didn't really choose the problem. We, what happened was, uh, we want to test the upper bound of our model's capability.
11. AMAndrew Mayne
  Mm-hmm.
12. HWHongxun Wu
  So we just used a selected subset of Erdős problems and to test the capability of the model.
8:33 – 11:04
Running the model and checking the proof
1. AMAndrew Mayne
  I would love to know, one, who is the one that hit Enter and asked the model the question?
2. LCLijie Chen
  I guess both of us, like I and Hongxun.
3. AMAndrew Mayne
  You guys at the same time, like pressed? [laughs]
4. LCLijie Chen
  [laughs] Yeah, maybe.
5. HWHongxun Wu
  I, I think what happened was actually we were testing, like, two side different internal models.
6. AMAndrew Mayne
  Mm-hmm.
7. HWHongxun Wu
  And we both saw, like, some correct solutions.
8. AMAndrew Mayne
  Mm-hmm.
9. HWHongxun Wu
  Uh, it was really, really exciting for us.
10. AMAndrew Mayne
  How did you know that it worked?
11. LCLijie Chen
  Of course, you first ask the model to check it.
12. AMAndrew Mayne
  Okay. [laughs]
13. LCLijie Chen
  Uh, but of course, you know, models sometimes they are not reliable, so like-
14. AMAndrew Mayne
  [laughs] I got it. It's good. Don't worry about it.
15. LCLijie Chen
  [laughs] Yeah, so then we just, uh, after we check it with the model, it seems plausible, then we just ask a bunch of, you know, other, our mathematics friends in the company, you know, um, Mattab and, uh, Makselki.
16. AMAndrew Mayne
  Mm-hmm.
17. LCLijie Chen
  And at first they were like, "Oh, there's no way this can be true."
18. AMAndrew Mayne
  [laughs]
19. LCLijie Chen
  "This is a major open problem." And, uh, but after, you know, just they think about it for five days, they couldn't figure out any mistake, then they become more convinced, then eventually they are like, "A- actually this may be correct." Yeah, then, like, everyone had a hard time sleeping because it's so, [laughs] so exciting, yeah.
20. AMAndrew Mayne
  What was the conversation like when you started getting, you know, people saying that this was accurate?
21. HWHongxun Wu
  For me, I was not that surprised because I guess when Mattab first say... Okay, first, what happened was first Mattab say, "This is definitely wrong."
22. AMAndrew Mayne
  Okay. [laughs]
23. HWHongxun Wu
  But I actually knew that he probably just spent like five minute, 10 minute-
24. AMAndrew Mayne
  Right
25. HWHongxun Wu
  ... looking at it. [laughs] So, like, in my heart, I don't really believe that. Um, but later he told me it's 50%. I was thinking, "Okay, if we extrapolate the trend, then maybe next night it will be 8- 100%." [laughs] Um, so yeah, I guess it, it's a little bit dreamlike, um, but also was, like, it feels a bit, a little bit natural that, um, that this model would do something amazing. Later it, it just become more and more, um, more and more real that, um, this might actually be correct. This might actually be a big deal, the first time the model can publish something that get into like, uh, top math journals.
26. AMAndrew Mayne
  Mm-hmm.
27. HWHongxun Wu
  It's, like, we knew this day is gonna come but never knew that it's gonna become, it's gonna become reality so fast. It's like living a dream.
28. LCLijie Chen
  I mean, this is something, like, can be published in, like, the best journal of maths. It is way beyond like a, you know, IMO level. [laughs]
29. AMAndrew Mayne
  Mm-hmm.
30. LCLijie Chen
  So I, I only expect something to happen at some time, but at some, at some point, but maybe not this, just not this May, yeah.
11:04 – 15:55
Why general models matter for discovery
1. AMAndrew Mayne
  One of the things I think that we, we've seen emphasized at OpenAI is that OpenAI doesn't try to train to specific benchmarks and stuff, that OpenAI tries to build really good general overall models, and I think sometimes people say like, "Well, we just try to build a generally smart model, and we find these things a lot of way, and when it comes to reasoning, it's the same thing. Something that's really good reasoning overall, you find these capabilities." Does that ring true for you or?
2. AWAlexander Wei
  Yeah, I think, yeah. I think for, for this model in particular, I think it's one that, um, I think all of us have also just used, um, like in lieu of, uh, the current model, uh, current model in, uh, Codex, and it, it, it works quite well as just a general purpose model. Having the capabilities to do this, uh, Erdős, uh, unit distance result, I think people will be able to do this at home-
3. AMAndrew Mayne
  [laughs]
4. AWAlexander Wei
  ... in the, uh, in the near future.
5. AMAndrew Mayne
  It's been exciting to see people react to this and pay attention to this. Uh-We went from just a very short period of time ago where people said models weren't good at math, and now models are doing this. What have been some of the more fun things you've seen online or reactions from people?
6. HWHongxun Wu
  Ever since we announced the results, my friend, uh, Ying, he says, start to, uh, asking me to try to- try out their pro- open problems.
7. AMAndrew Mayne
  Mm.
8. HWHongxun Wu
  Uh, including my advisor gave me like two, three open problems to try out. Um, I think the reaction was, um, very positive.
9. AWAlexander Wei
  I think people really get a sense that the frontier of AI today can really, uh, come up with, you know, research output that I think many human mathematicians would be proud to achieve.
10. AMAndrew Mayne
  Mm.
11. AWAlexander Wei
  Um, and I think it's really great that we're able to communicate this like, you know, that this is the frontier of progress to the rest of the world. I've seen people like, you know, m- make these, uh, make these like designs of trying to like sketch out, um, like the model's construction and if you plot it on a grid, it's actually like this very like pretty like symmetric geometric design.
12. LCLijie Chen
  Yeah, I guess we are thinking maybe try to make one of the design, you know, put them in a frame and, uh, put them on a desk or something. [laughs]
13. AWAlexander Wei
  [laughs]
14. LCLijie Chen
  Kind of to celebrate this like, you know, moment.
15. AMAndrew Mayne
  Yeah, I think it's gonna be fun when we start seeing it, things like tiling problems and other stuff where we can actually just look at the, the artifacts we need. So we've been hearing more about Erdős problems lately, and some seemed like they weren't as challenging to solve as perhaps as people thought. They just needed some attention, yet this one seems to be a little bit more complicated. Where would you rank this?
16. LCLijie Chen
  Oh, yes. I think he proposed like a thousand question or more, right? So like he, you know, Er- Erdős problem is just collection of all the problems he has asked.
17. AMAndrew Mayne
  Mm.
18. LCLijie Chen
  You know, some problem he has offered some money for, for solution.
19. AMAndrew Mayne
  Mm.
20. LCLijie Chen
  Some problem he could just, you know, note it. And this, this problem he, you know, he offered I think $500-
21. AMAndrew Mayne
  Mm
22. LCLijie Chen
  ... uh, which is for last century, so you know, it's, it was a little bit... Yeah. And also like this is one of the central question in this field of, uh, discrete geometry, and, you know, and this is, has been, you know, heavily discussed by mathematician in like many discrete geometry papers. And so it's kind of one of the question people have thought about a lot and really want to see the answer. So I would say this is more like a major open problem in a concrete field of mathematics instead of some just like, you know, um, many other Erdős question which may be just some, you know, um, something like Erdős ask after lunch or something. [laughs]
23. AMAndrew Mayne
  So how do you collect that $500? Did it, did it, did it, did it disappear when he passed away? [laughs]
24. LCLijie Chen
  Uh, I think there's a special agency for that-
25. AMAndrew Mayne
  Oh
26. LCLijie Chen
  ... but, uh, you usually people just frame the, the check.
27. AMAndrew Mayne
  Yeah.
28. LCLijie Chen
  Yeah, so maybe we'll just frame the check in Sam's office. I don't know. [laughs]
29. AMAndrew Mayne
  Yeah. [laughs] How do you feel this proves that reasoning is effective?
30. LCLijie Chen
  Well, I think the biggest, uh, proof is that if you look at the plot, the, in the official blog-
15:55 – 18:25
Creativity, tools, and how the proof worked
1. AWAlexander Wei
  what can it do?
2. AMAndrew Mayne
  When you go through the proof and you look at what it came up with, were there things that surprised you, things that you would describe as creative?
3. AWAlexander Wei
  So for some context, like the proof is like well above my own mathematical pay grade.
4. AMAndrew Mayne
  Mm-hmm.
5. AWAlexander Wei
  Um, but like just at, at a high level, um, my understanding was that, you know, this, this idea of, uh, taking class field theory and applying it to, uh, problems in, in combinatorial geometry hadn't really been done before, though this was, though there were like, you know, though some people like knew that there was, there could be this bridge between these two fields. Being able to do that and execute it requires like to, first of all, to make the connection requires, uh, quite a bit of like insight and creative, creativity, and then to execute the proof is also like, you know, a very like delicate, careful affair that very few people would be able to do.
6. HWHongxun Wu
  I think the most surprising thing for me is you tell model do something-
7. AMAndrew Mayne
  Mm
8. HWHongxun Wu
  ... and you, you went to have a lunch, and when you come back, you see that it actually does much better than you thought.
9. AMAndrew Mayne
  Hmm.
10. HWHongxun Wu
  And at that moment you feel like, okay, this model is something that's really amazing.
11. AMAndrew Mayne
  So going back to GPT-3.5 Turbo and working with that, and looking at a model that was doing automatically instant sort of, uh, inference and fre- figuring these things out, to now a model that's able to do incredible mathematical proofs, is it using tools? Is it using Lean? Is it using some other things like that, or is this doing purely inside the model?
12. LCLijie Chen
  So for this particular case, the model basically is like Codex. It can code-
13. AMAndrew Mayne
  Mm
14. LCLijie Chen
  ... it can, um, look at the website and, and find information.
15. AMAndrew Mayne
  Mm-hmm.
16. LCLijie Chen
  Yeah, so it's, it's basically a general ChatGPT setup.
17. AMAndrew Mayne
  Okay.
18. LCLijie Chen
  ChatGPT can also write Python and execute them, but I don't think the mo- the, the model write any Lean. Yeah.
19. AMAndrew Mayne
  Mm.
20. HWHongxun Wu
  I think Lijie has a story about the Cambridge, uh, dictionary. [laughs]
21. LCLijie Chen
  Oh, okay.
22. HWHongxun Wu
  Yeah.
23. LCLijie Chen
  So okay, the first thing the model do when it gets the website is to check what unit means-
24. AMAndrew Mayne
  [laughs]
25. LCLijie Chen
  ... in the Cambridge dictionary. It's a little bit [laughs] ridiculous. Yeah.
26. AMAndrew Mayne
  So it like looked up the word unit?
27. LCLijie Chen
  Yeah. It wants to make sure it has the absolute correct u- understanding of what is unit.
28. AMAndrew Mayne
  Have you seen it do other things like that where you're seeing like, oh, it's trying to ground itself to make sure it understands the question?
29. HWHongxun Wu
  And definitely. A lot of time in the model answer it will actually, uh, explain the definition again-
30. AMAndrew Mayne
  Yeah
18:25 – 22:31
Why AI should feel empowering for mathematicians
1. AMAndrew Mayne
  people who are very knowledgeable about computer science, people who, uh, know a lot about mathematics, is it intimidating to all of a sudden see this happen?
2. HWHongxun Wu
  I think it should not be intimidating.
3. AMAndrew Mayne
  Mm.
4. HWHongxun Wu
  I just think it should be empowering instead.
5. AMAndrew Mayne
  Okay.
6. HWHongxun Wu
  After the proof actually come out, uh, like mathematician has improved, first improved the bound they proved.
7. AMAndrew Mayne
  Mm-hmm.
8. HWHongxun Wu
  Uh, and second, they use the intuition, the motivation of the construction to, um, knock down other open problems as well.
9. AMAndrew Mayne
  Mm-hmm.
10. HWHongxun Wu
  So I think the trend is gonna continue. Um, like model can make good breakthrough on some very hard questions we don't know how to solve. But then how to digest that idea, how to, uh, use that method for other good things, uh, I think human still, uh, has a role in this.
11. AMAndrew Mayne
  So what do you think the role of somebody working in mathematics is going to be like five years from now?
12. LCLijie Chen
  I think there'll be a lot of AI and the human collaboration.
13. AMAndrew Mayne
  Mm-hmm.
14. LCLijie Chen
  Yeah, because AI... And now, you know, AI they know a lot, right? They can connect distant ideas, but human can also think for longer. Like currently it seems AI cannot build a new theory for maths-
15. AMAndrew Mayne
  Mm-hmm
16. LCLijie Chen
  ... for example. But, uh, I, I guess human, once they have the help of AI, they can just grab all the ideas from distant field of maths. I think they can empower human way more. Yeah.
17. AMAndrew Mayne
  Do you see this working into other fields? Are we gonna see discoveries in physics?
18. AWAlexander Wei
  So I, I can't speak for, uh, physics, but um, I mean, I guess, like we're, we're all researchers in AI, and I think definitely for me, like my day-to-day looks completely different than when I s- first started, uh, doing res- research in this field. Um, I think like so much of my work is now done by coding agents. Um, like I can, I can just like do so much more, and I think that, that's been a sort of like magical feeling that like with AI you, you can... You, you're really starting now to f- feel like you can use AI to build AI faster.
19. AMAndrew Mayne
  How much has AI changed the way you do these sorts of things?
20. HWHongxun Wu
  I think changed completely. Like even when I just joined, uh, half a year ago, uh, I was hand-coding the codes.
21. AMAndrew Mayne
  Mm-hmm.
22. HWHongxun Wu
  Uh, looking up the Slack channels for like directions, uh, but now the default is just ask Codex.
23. AMAndrew Mayne
  Mm.
24. HWHongxun Wu
  And I ask Codex do a lot of things and, um, then I just go to lunch, I just go to, uh, you know, talk to people.
25. AMAndrew Mayne
  Mm-hmm.
26. HWHongxun Wu
  Um, the, the work completely changes.
27. AMAndrew Mayne
  And now you use Codex on your phone, and you can check on it.
28. HWHongxun Wu
  Yeah.
29. AMAndrew Mayne
  It's, uh, it's interesting how much more I want to do things now that you have this sort of tool that can work all the time and do stuff. Lijie, how do you explain this to your friends who are sort of trying to understand what this means and how it's gonna impact other fields?
30. LCLijie Chen
  So I mean, I have some math- mathematician friends, and I have some, you know, friends in, in other fields. Yeah. So I think the way I wo- I would tell them is that I feel like, you know, some, some, some maybe are afraid that, you know, AI will replace them. You know, AI will just replace mathematician. Yeah, but I think it's really about, you know, empowering like every theoretical researcher. Yeah, because you know, AI really have this advantage of knowing so many stuff and connect things. Currently it seems like the problem hard for human may not be hard for AI.
22:31 – 27:24
Advice for researchers using AI
1. AMAndrew Mayne
  I was a researcher, how would I get started? What advice would you have to say, "Okay, try this first"? We'll start with you, Hongxun.
2. HWHongxun Wu
  Get GPT Pro subscription.
3. AMAndrew Mayne
  [laughs]
4. LCLijie Chen
  Yeah, of course. Of course.
5. HWHongxun Wu
  It's really, really much better than, uh, thinking without Pro, uh, and because it think longer.
6. AMAndrew Mayne
  Yeah.
7. HWHongxun Wu
  Uh, and try to, um, ask the boldest question you can ask.
8. AMAndrew Mayne
  Hmm.
9. HWHongxun Wu
  I had experience that sometimes I try to decompose a problem into smaller problem and ask the model, and turns out that it was not as good as just directly ask the question-
10. AMAndrew Mayne
  Hmm
11. HWHongxun Wu
  ... because my decomposition was not the best way. [laughs]
12. AMAndrew Mayne
  Why do you think that was?
13. HWHongxun Wu
  I think because as human we have all kinds of priors on-
14. AMAndrew Mayne
  Mm-hmm
15. HWHongxun Wu
  ... how problem should be solved, and they are very helpful in reducing the thinking time. Um, but, uh, very often the prior are wrong and there are blind spots.
16. AMAndrew Mayne
  Mm-hmm.
17. HWHongxun Wu
  And AI models, they sometimes just can, you know, surprise us with-
18. AMAndrew Mayne
  Mm
19. HWHongxun Wu
  ... uh, discovering these hidden things.
20. AMAndrew Mayne
  When I spoke to Alex Olchawska, he talked about how kind of treating it like a graduate student.
21. HWHongxun Wu
  Mm-hmm.
22. AMAndrew Mayne
  You know, not, not talking down too low, but not too high, but at the right level so you could just understand that it knew the terms and it worked for you. Alex, how about for you? What advice would you give somebody who's a researcher who's, wants to try to figure out how to be more effective with this?
23. AWAlexander Wei
  Yeah, I think a lot of it is actually like, I think these days learning to trust the model-
24. AMAndrew Mayne
  Mm-hmm
25. AWAlexander Wei
  ... and like figuring out like, you know, how far you can go in trusting the model, and also learning like, you know, what's beyond what the model can do. Because if you don't have a sense of that, you don't like, you know, maximally-
26. AMAndrew Mayne
  Mm
27. AWAlexander Wei
  ... uh, use, uh, the full capabilities of the model. I think Lijie has taught me a lot about how to use these tools better. I, I, I feel like I'm, I'm like sort of a dinosaur in some respects-In terms of a- adoption. 'Cause I think I, I started, like, working at OpenAI well before these tools existed, and so I think I have a lot of old bad habits-
28. AMAndrew Mayne
  Hmm
29. AWAlexander Wei
  ... where I, I don't trust the models enough. I still think it's, like, the models of six months ago or something.
30. AMAndrew Mayne
  That's an interesting paradigm. Okay. So Lijie, what advice would you give?
27:24 – 37:30
What comes next for math and AI research
1. AMAndrew Mayne
  how long before there are no more unsolved Erdős problems?
2. LCLijie Chen
  Some of them are very, very hard.
3. AMAndrew Mayne
  Yeah.
4. LCLijie Chen
  Yeah, so yeah, I don't know.
5. AMAndrew Mayne
  Do you foresee us, maybe Alex, needing to come up with a new category of problems? [laughs]
6. AWAlexander Wei
  I think probably, like, the hardest problems on that list, I think that list includes, like, the Collatz conjecture.
7. AMAndrew Mayne
  Mm-hmm.
8. AWAlexander Wei
  Like, these are problems that feel like very, very far out of reach of, like, the mathematical technology of today-
9. AMAndrew Mayne
  Mm-hmm
10. AWAlexander Wei
  ... even though many of them are, like, quite simple to state.
11. AMAndrew Mayne
  So we'll still have some more things to work on and continuously move things through. That's good to know. It's exciting though too to start to think about what happens when you do start applying this to other areas, in physics, in astronomy, and start looking at data sets and stuff, and what kind of discoveries are gonna be in store. Do you have any particular area that you're hoping to see?
12. HWHongxun Wu
  Oh, I hope they just solve P versus NP. [laughs]
13. AMAndrew Mayne
  Okay. How about you, Alex?
14. AWAlexander Wei
  I think the next milestone in my head is really, like, AI that can, like, do AI research.
15. AMAndrew Mayne
  Mm-hmm.
16. AWAlexander Wei
  Um, I think there are so many, like, u- unsolved problems here. We're, in a, in a sense, like, in many ways, like, limited by, like, the, you know, all the limitations of just, you know, our own intelligences. I, I'm optimistic about, like, you know, just having AI broadly available as a technology because there's just, like, so much more demand for intelligence in the world that, you know, like, humans can supply.
17. AMAndrew Mayne
  Lijie?
18. LCLijie Chen
  Oh, I wanted to say P versus NP too, but Hongxun said it.
19. AWAlexander Wei
  Yeah. [laughs]
20. LCLijie Chen
  Uh, so I guess beyond that, like, one concrete thing I'm very interested in is, like, you know, like, currently it seems AI is trying to combine ideas from different fields and of, of course, in a very novel and, uh, you know, sophisticated way. But, like, can, can AI actually generate completely new ideas from scratch? I mean, that's something, like, we haven't really seen concretely in AI-
21. AMAndrew Mayne
  Mm-hmm
22. LCLijie Chen
  ... and, uh, that's something I maybe want to see next happening, and that can be very cool.
23. AMAndrew Mayne
  Have you seen traces of that yet?
24. LCLijie Chen
  I think so. Like, um, you know, even in this, like, Erdős problem, I mean, I think some... if you look at, like, the chain of thought, which is like 125 pages, I think some, some of the thoughts are pretty creative-
25. AMAndrew Mayne
  Mm
26. LCLijie Chen
  ... although they didn't work out. Yeah. [laughs] I mean, the, the final idea is more, like, combining all the stuff, but some of... it has some creative thoughts.
27. AMAndrew Mayne
  Well, it is interesting. You know, early on, arguments were like these models weren't creative, but you could give it two ideas that had never been connected before and say, "What's the relationship?" And that would be something very, very new and felt like something different, and I feel like we'll probably be seeing more of that. Do you see us coming up with new forms of mathematics?
28. HWHongxun Wu
  Um, I think that actually be, will be a further way down the line-
29. AMAndrew Mayne
  Mm-hmm
30. HWHongxun Wu
  ... um, because-
37:30 – 41:16
Cryptography, quantum computing, and the future
1. AMAndrew Mayne
  Do you foresee things applying to like cryptography and there's also some debate too about do these models get so good that we kind of surpass even where quantum computing goes, which sounds kind of crazy.
2. LCLijie Chen
  Yeah. I think cryptography is really a im- im- important topic these days because, you know, the foundation of cryptography is, is really about some problems like factoring.
3. AMAndrew Mayne
  Mm.
4. LCLijie Chen
  It's hard to solve by, by computers, right? But basically we only have conjecture. There's no mathematical proof of this fact.
5. AMAndrew Mayne
  Mm.
6. LCLijie Chen
  And suppose the model gets really good at the, at, at, you know, algorithms, maybe they will prove, you know, some of the cryptography conjecture and saying, "Okay, those, those, those protocol, they're actually secure. We don't have to conjecture them to be secure." Or maybe they'll have s- they'll find some loophole, and that's also very important. Like, you know, w- I think we need to make sure, you know, the, the foundation of all, of all security is good, so the model can stress test the, the foundation of the cryptography to make sure like we have better security.
7. AMAndrew Mayne
  What about quantum computing?
8. LCLijie Chen
  Uh, I think that's a very different territory, right?
9. AMAndrew Mayne
  Mm.
10. LCLijie Chen
  Like quantum computing like... Okay, actually, I used to study quantum computing. Like the, my first paper is, uh, is on quantum advantage [laughs]
11. AMAndrew Mayne
  Mm.
12. LCLijie Chen
  Which shows like for some tasks quantum computer can do better than classical, um, com- computers. But so far I think the models, I mean, they are just classical computers. I mean, they are, they do what human can do.
13. AMAndrew Mayne
  Mm.
14. LCLijie Chen
  I mean, maybe a bit better. The quantum computer they, they can sometimes do like more fancy stuff-
15. AMAndrew Mayne
  Mm
16. LCLijie Chen
  ... like simulating some quantum, um, effect in chemistry, which we probably not su- okay, I'm not a, an expert on, on that-
17. AMAndrew Mayne
  Mm
18. LCLijie Chen
  ... but, uh, that might not... It's unclear. Like it is just two different paradigms.
19. AMAndrew Mayne
  Mm.
20. LCLijie Chen
  So yes, I'm not super sure how they like, uh, compare to each other.
21. HWHongxun Wu
  But I think AI's gonna greatly accelerate the s- the pace that we develop quantum computers.
22. AMAndrew Mayne
  Mm.
23. HWHongxun Wu
  Uh, like in recent, just in these years, uh, there's improvement in like error correcting.
24. AMAndrew Mayne
  Mm.
25. HWHongxun Wu
  Uh, like you have error correcting code- quantum error correcting codes that, uh, only uses like simpler type of operations.
26. AMAndrew Mayne
  Mm.
27. HWHongxun Wu
  And that really speed up the like physical implementation. So, uh, I expect more of these to come from like, uh, collaboration with AI, uh, that, uh, AI can propose new like quantum error correction algorithms-
28. AMAndrew Mayne
  Mm
29. HWHongxun Wu
  ... and then we can develop the quantum computers much faster.
30. LCLijie Chen
  Once you ask the model to solve a question, you can of, of course follow up with, you know, "How did you solve it? Uh, can you explain this part of the proof to me?" And, uh, then the model will patiently try to t- teach you how, how everything goes ab- line by line. Yeah. So it's like, it's actually not just, you know, one, one-shot problem solving. You can ask it a lot of question to, you know, to learn the how the proof works, and I, I really like that.