Dwarkesh Podcast@Asianometry & Dylan Patel — How the semiconductor industry actually works
EVERY SPOKEN WORD
150 min read · 30,292 words- 0:00 – 5:05
Xi’s path to AGI
- J(Jon Y (Asianometry)
Leong Mong-sung is a nut. He's like, "We will make Samsung into this monster." He does not care about people, he does not care about business. He wants to take it to the limit, the only thing.
- DPDylan Patel
There's no fucking way you can pay for the scale of clusters that are being planned to be built next year for OpenAI unless they raise, like, $50 to $100 billion.
- J(Jon Y (Asianometry)
(laughs) Hold on, hold on. We've already lost John.
- DPDylan Patel
(laughs)
- J(Jon Y (Asianometry)
We've already accepted GPT-5 will be good.
- DPDwarkesh Patel
But yeah, y- you got it, you know? (laughs)
- DPDylan Patel
You got it. You got it. It's like, bro, like, life is so much more fun when you just, like, are delusionally, like, you know?
- J(Jon Y (Asianometry)
We're just ripping bong hits, are we?
- DPDylan Patel
We're not even close to the dot-com bubble. Why wouldn't this one be bigger? We're gonna rip, baby.
- DPDwarkesh Patel
(laughs)
- J(Jon Y (Asianometry)
You could-
- DPDylan Patel
Rip that bong, baby. (laughs)
- J(Jon Y (Asianometry)
You could raise AI for another two decades.
- DPDylan Patel
If you are Xi Jinping and scale pilled, you must now centralize the compute resources, right? They could have a bigger model than any of the labs next year.
- DPDwarkesh Patel
Today, I'm chatting with Dylan Patel, who runs SemiAnalysis, and John, who runs the Asianometry YouTube channel.
- DPDylan Patel
Does he have a last name?
- J(Jon Y (Asianometry)
No, I do not.
- DPDwarkesh Patel
(laughs)
- J(Jon Y (Asianometry)
No, I'm just kidding. John Y.
- DPDwarkesh Patel
That's right, is it?
- J(Jon Y (Asianometry)
John Y.
- DPDylan Patel
Wait, wh- wh- why is it only one letter?
- J(Jon Y (Asianometry)
Because Y is the best letter.
- DPDwarkesh Patel
(laughs)
- DPDylan Patel
(laughs) Why is your face covered?
- DPDwarkesh Patel
(laughs)
- J(Jon Y (Asianometry)
Why not?
- DPDwarkesh Patel
(laughs) No, seriously, why is it covered? (laughs)
- J(Jon Y (Asianometry)
Because I'm afraid of looking at myself get older and fatter over the years.
- DPDylan Patel
(laughs)
- 5:05 – 9:10
Liang Mong Song
- J(Jon Y (Asianometry)
And I think, like, back in the 2000s prior to TSMC, before SMIC got big, it was actually much more kinda open, more flat. I think after that, there was like, after Leong Mong-sung and after all the Samsung issues and after all the, the SMIC's rise, when they're u- literally saw-
- DPDylan Patel
I think, I think you should tell that story, actually. The, the, the TSMC guy that went to Samsung and SMIC and all that. I think you should tell that story.
- J(Jon Y (Asianometry)
There are two stories. There's a guy, he ran a semiconductor company in Taiwan called Worldwide Semiconductor. And this guy, Richard Chang, was very religious. I mean, all the TSMC people are pretty religious. But, like, he in particular was very fervent and he wanted to bring religion to China. So after he sold his company to TSMC, huge coup for TSMC, he worked there for about eight or nine months and he was like, "All right, I'll go to China." Because back then there was... The relations between China and m- uh, Taiwan were much more different. And so he goes over there. Shanghai says, "We'll give you a bunch of money." And then-Richard Chang basically recruits half of, like, a whole bunch, it's like a conga line of, like, Taiwanese that go live-
- DPDylan Patel
(laughs)
- J(Jon Y (Asianometry)
Just, like, they- th- they get on the plane and they fly over. And generally, that's actually a lot of, like, a lot of, like, acceleration points within China's semiconductor industry, it's from talent flowing from Taiwan. And then the second thing was, like, My- Liang Mong-Song. Liang Mong-Song was a, is a nut. And I've met him-
- DPDylan Patel
(laughs)
- J(Jon Y (Asianometry)
I've not met him.
- DPDylan Patel
Oh. (laughs)
- J(Jon Y (Asianometry)
I've met people who work with him, and they say he is a nut. He is a, he is probably on the spectrum, and he's, he does not care about people, he does not care about business, he does not care about anything. He wants to take it to the limit, the only thing. That's the only thing he cares about. He worked from TSMC, literal genius, 300 patents or whatever, 285. Goes, works all the way to, like, the top, top tier, and then one day, uh, he decides, he loses out on some sort of power game within TSMC and gets demoted.
- DPDylan Patel
And he was, like, head of R&D, right, or something? Before the-
- J(Jon Y (Asianometry)
He was, like, one of the top R&D, he was, like, second or third place and he-
- DPDylan Patel
And it was for the head of R&D position, basically.
- J(Jon Y (Asianometry)
Correct. More of the head of R&D position. He's like, "I can't deal with this." And he goes to Samsung, and he steals a whole bunch of talent from TSMC, literally, again, conga line, goes and just emails people and say, "We will pay..." At some point, some of these people were getting paid more than the Samsung chairman, which, uh, not really comparable, but, like, you know what I mean. So they're going-
- DPDylan Patel
Isn't the Samsung chairman usually, like, like, part of the family that owns Samsung?
- J(Jon Y (Asianometry)
Correctamundo.
- DPDylan Patel
Okay, yeah, so it's, like, kinda irrelevant. Yeah, fair enough.
- J(Jon Y (Asianometry)
So it's a bit, but he goes over there, and he's like, "Well, I'm, like, we will make Samsung into this monster. We forget everything. Forget all of the stuff you've been trying to do at, like, incremental."
- DPDylan Patel
Uh-huh.
- J(Jon Y (Asianometry)
"Toss that out. We are going to the leading edge, and that is it." They go to the leading edge. The guys, like-
- DPDylan Patel
They win Apple's business.
- J(Jon Y (Asianometry)
They win Apple's business. They win it back from TSMC. Or did they win it back from TSMC? They win-
- DPDylan Patel
They had a portion of the-
- J(Jon Y (Asianometry)
They had a big portion of it. And then TSMC, Morris Chang is, like, at this time was running the company, and he's like, "I'm not letting this happen." 'Cause that guy, toxic to work for as well, but also goddamn brilliant, and also, like, very good at motivating people. He's like, "We will work literally day or night." Sets up what is called the nightingale army, where you have, they split a bunch of people and they say, "You are working R&D night shift. There is no rest at the TSMC fab. You will go in. There is, as you go in, there'll be a day shift going out." They called it the, it, uh, it's like you're burning your liver. 'Cause in Taiwan, they said, like, if you get old, like, as you work, you, you're roo- sacrificing your liver. They called it the liver buster. So they basically did this nightingale armory for, like, a year, two years. They finished FinFET. They, they basically just blow away Samsung. And at the same time, they sue Liang Mong-Song directly for stealing seek- trade secrets. Samsung s- basically separates from Liang Mong-Song, and Liang Mong-Song goes to SMIC.
- DPDylan Patel
And, and, and so Samsung, like, at one point was better than TSMC.
- J(Jon Y (Asianometry)
Mm-hmm.
- DPDylan Patel
And then, yeah, he goes to SMIC, and SMIC is now better than ... Well, or not better, but they caught up rapidly as well after.
- J(Jon Y (Asianometry)
Very rapid. That guy's a genius. That's a guy who's a genius. I mean, I, I, I don't even, I don't even know what to say about him. He's, like, 78 and he's, like, beyond brilliant, does not care about
- 9:10 – 12:01
How semiconductors get better
- J(Jon Y (Asianometry)
people.
- DPDylan Patel
So-
- J(Jon Y (Asianometry)
Like, what, yeah, what- what is research to make the next process node look like? C- is it just a matter of, like, a hundred researchers go in, they do, like, the next N plus one, then the next morning, the next, uh, a hundred researchers go in? It's experiments.
- DPDylan Patel
Okay, interesting.
- J(Jon Y (Asianometry)
They have a recipe, and they're, what they'll do, every recipe, a TSMC recipe, uh, is the culmination of a long, long years of, like, research, right? It's highly secret. And the idea is that you're, what you're gonna do is that you go, you look at one particular part of it and you say, "Experiment, run an experiment. Is it better? Is it not? Is it better or not?" Kind of a thing like that.
- DPDylan Patel
Yeah, you're basically, it's, it's, it's, it's multivariable problem that each, every single tool, sequentially you're processing the whole thing. You, you turn up knobs up and down on every single tool. You can increase the pressure on this one specific deposition tool, or-
- J(Jon Y (Asianometry)
And, and what are you trying to measure? Is it, like, does it increase the yield? Or, like, what do- what is it that-
- DPDylan Patel
It's not, it's yield, it's performance, it's power. It's not just a one, you know, it's not just better or worse, right? It's a multivariable search space.
- J(Jon Y (Asianometry)
And what do these people know such that they can do this? Is it they understand the chemistry and physics?
- DPDylan Patel
So it's a lot of intuition, but yeah, it's, it's PhDs in chemistry, PhDs in physics, PhDs in, uh, EE.
- J(Jon Y (Asianometry)
Brilliant geniuses people, and they all just-
- DPDylan Patel
And they don't even know about, like, the end chip a lot of times.
- J(Jon Y (Asianometry)
Mm-hmm.
- DPDylan Patel
It's like, "Oh, I am an etch engineer, and all I focus on is how hydrogen fluoride f- etches this," right? "And that's all I know. And, like, if I do it at different pressures, if I do it at different temperatures, if I do it with a slightly different recipe of chemicals, it changes everything."
- J(Jon Y (Asianometry)
I remember, like, uh, someone told me this when I was speaking, like, "How did America lose the ability to do this sort of thing? Like etch in hydrofluoric and acid, all of that." I told them, like, the, he told me basically it was like, it's, it's very apprentice, master-apprentice.
- DPDylan Patel
Mm-hmm.
- J(Jon Y (Asianometry)
Like, you know, in Star Wars Sith, there's only one, right? Master-apprentice, master-apprentice. In, it used to be that there is a master, there's apprentice, and they pass on this secret knowledge. This guy knows nothing but etch, nothing but etch. Over time, the apprentices stop coming. And then in the end, the, the apprentices moved to Taiwan. And that's the same way it's still run. Like, uh, you have the NTU and NTHU, uh, Tsing Hua University, National Tsing Hua University. There's a bunch of masters, they teach apprentices, and they just pass this secret, sacred knowledge down. Mm-hmm. Who are the most AGI-pilled people in the supply chain? Is there anybody that's in, like, the hardware supply chain?
- DPDylan Patel
I g- I gotta, I gotta have my phone call with Colette right now.
- J(Jon Y (Asianometry)
Okay, go for it. (laughs)
- DPDylan Patel
(laughs) Sorry, sorry.
- J(Jon Y (Asianometry)
Can, can, could we mention on the podcast that NVIDIA has got a guy who's calling Dylan for the (laughs) for, to update him on the earnings call?
- DPDylan Patel
Well, it's not this, not exactly that, but ...
- J(Jon Y (Asianometry)
Go for it. Go for it.
- DPDylan Patel
Yeah.
- J(Jon Y (Asianometry)
So, Dylan is back from his call with Jensen Huang. (laughs)
- DPDylan Patel
It was not with Jensen, Jesus.
- J(Jon Y (Asianometry)
What did they tell you, huh? What did they tell you about next year's earnings? (laughs)
- DPDylan Patel
No, it was just color around, like, uh, Hopper, Blackwell-
- J(Jon Y (Asianometry)
Uh-huh.
- DPDylan Patel
... and, like, margins. It's, like, quite boring stuff-
- 12:01 – 19:35
China can centralize compute
- DPDylan Patel
there.
- DPDwarkesh Patel
All right, all right. We covered the chips themselves. How, how do they get, like, the, the 10 gigawatt, uh, data center up? What else do they need?
- DPDylan Patel
So I, I think there is a true, like, question of how decentralized do you go versus centralized, right? And if you look in the US, right, uh, as far as, like, labs and such, uh, the, you know, OpenAI, xAI, you know, uh, Anthropic, and then Microsoft having their own effort, Anthropic having their own efforts despite having their partner, and then Meta. Um, and, you know, you go down the list, it's like there's a... quite a decentralization of... And then all the startups, like, interesting startups that are out there doing stuff.
- DPDwarkesh Patel
Yep.
- DPDylan Patel
There's quite a decentralization of efforts. Uh, today in China, it is still quite decentralized, right? It's not like Alibaba, Baidu, you are the champions, right? You have, like, DeepSeek. Like, who the hell are you, does government even support you, like, doing amazing stuff, right? If you are S- Xi Jinping and scale pilled-
- DPDwarkesh Patel
Interesting.
- DPDylan Patel
... you must now centralize the compute resources, right?
- DPDwarkesh Patel
Yeah, yeah.
- DPDylan Patel
Because you have, you have sanctions on how many Nvidia GPUs you can get in now. They're still north of a million a year, right, even post, uh, October last year's sanctions. We still have more than a million H20s, um, and, and other Hopper GPUs getting in through, you know, other means, but legally, like, the H20s. And then on top of that, you have, um, you have your domestic chips, right? But those, that's less than a million chips. So then when you look at it, it's like, Oh, well, we're still talking about a million chips. The scale of data centers people are training on today/over the next six months is 100,000 GPUs, right?
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
Uh, OpenAI, xAI, right? These are, like, quite well-documented, um, and others. But in China, they have no individual system of that scale yet, right? So then the question is like, how do we get there? Um, you know, no, no company has had the centralization push to have a cluster that large and train on it yet, at least publicly, like, well known. And the best models seem to be from a company that has got like 10,000 GPUs, right, or 16,000 GPUs, right?
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
So it's not, it's not quite a, quite as centralized as the US companies are. And the US companies are quite decentralized. If you're Xi Jinping and you're scale pilled, do you just say, "XYZ Company is now in charge-"
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
... and every GPU goes to one place? And then, then you don't have the same issues as the US, right? In the US we have a big problem with, like, being able to build big enough data centers, being able to build substations and transformers and all this-
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
... that are large enough in a dense area. China has no issue with that at all because their supply chain adds, like, as much power as, like, half of Europe every year, right? Like, or some th- some absurd statistics, right? Um, so they're building transformer substations, they're building new power plants constantly. Um, so they have no problem with, like, getting power density. And you go look at, like, Bitcoin mining, right? Um, around the Three Gorges Dam, at one point at least, there was like 10 gigawatts of, like, Bitcoin mining estimated, right? Um, which, you know, we're talking about, you know, gigawatt data centers are coming over, you know, '26, '27 in the U- or '26ish in the US, or '27, right?
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
You know, sort of this is an absurd scale relatively, right? We don't have gigawatt data centers, you know, ready, but like, China could just build it in six months, I think, around the Three Gorges Dam or many other places, right? Because they have, they have the ability to do the substations, they have the, they have the power generation capabilities. Everything can be, like, done like a flip of a switch, but they haven't done it yet. And then they can centralize the chips like crazy, right? Now, oh, all million chips that Nvidia is shipping in Q3 and Q4, the H20, um, let's just put them all in this one data center. They just haven't had that centralization effort, right?
- J(Jon Y (Asianometry)
Well, you can argue that, like, the more you centralize it, the more you start building this monstrous thing within the industry, you start getting attention to it, and then suddenly, you know, lo and behold, you have a little bit of a, you have a little worm in there suddenly while you're doing your big training run.
- DPDwarkesh Patel
Ah.
- J(Jon Y (Asianometry)
Oh, this GPU off. Oh, this GPU. Oh, no. Oh, no. Oh, no.
- DPDwarkesh Patel
(laughs)
- DPDylan Patel
(laughs) I don't know if it's like that easy to hack.
- DPDwarkesh Patel
Wait, was that a Chinese accent, by the way?
- J(Jon Y (Asianometry)
(laughs)
- DPDylan Patel
Just to be clear, John is, is East Asian. He's Chinese.
- J(Jon Y (Asianometry)
I am of East Asian descent.
- DPDwarkesh Patel
Mm-hmm.
- DPDylan Patel
Half Taiw- Taiwanese, half Chinese.
- 19:35 – 33:36
Export controls & sanctions
- DPDylan Patel
- DPDwarkesh Patel
To the extent that you could just take out a five-gigawatt aluminum refining center and, like, build a giant data center there, then I guess the way to control Chinese AI has to be the chips, because like everything else they, they... So like, uh, how do you like... Just, like, walk me through h- h- how many chips do they have now? How many will they have in the future? What will the r- like how many is that in comparison to US and the rest of the world?
- DPDylan Patel
Yeah, so, so in the world, I mean, the world we live in is they are not restricted at all in like the physical infrastructure side of things in terms of power, data centers, et cetera, because their supply chain is built for that, right?
- DPDwarkesh Patel
Mm-hmm.
- DPDylan Patel
And, and it's pretty easy to pivot that. Whereas the US adds so little power each year, and Europe loses power every year. The, the Western sort of industry for, uh, power is non-existent in comparison, right? But on the flip side is, uh, quote unquote "Western," including Taiwan, manufacture- chip manufacturing is way, way, way, way larger than China's, especially on leading edge where China theoretically has, you know, depending on the way you look at it, either zero or a very small percentage share, right? And so there, um, you have, you have wafer... You have, you have equipment, wafer manufacturing, and then you have, uh, advanced packaging capacity, right? And where the US can control China, right? So advanced packaging capacity is kind of a shot, because the vast majority, the, the largest advanced packaging company in the world was Hong Kong headquartered. They just moved to Singapore, but like, that's effectively like, you know, in a realm where the US can't sanction it, right? Um, a majority of these other companies are in similar places, right? So advanced packaging capacity is very hard, right? Advanced packaging is useful for stacking memory, stacking chips on, uh, COAS, right? Things like that. Then, then the step down is wafer fabrication. Uh, there's immense capability to restrict China there. Uh, and despite the US making some sanctions, China in the most recent quarters was like 48% of ASML's revenue, right? So, you know, and, and, and like 45% of like Applied Materials, and you just go down the list. So it's like, obviously it's not being controlled that effectively. Um, but it could be on the equipment side of things. The chip side of things is actually being controlled quite effectively, I think, right? Like yes, there is like shipping GPUs through Singapore and Malaysia and, and other countries in Asia to China, but you know, the amount you can smuggle is quite small, and then the sanctions have limited the chip performance to a point where it's like, you know, this is actually kind of fair. Uh, but there is a problem with how everything is restricted, right? Um, 'cause you want to be able to restrict China from building their own domestic chip manufacturing industry that is better than what we ship them. Um, you want to prevent them from having chips that are better than what we have.
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
And then... Or, and then you want to prevent them from having AIs better. The ultimate goal being-
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
... you know, and if you read the restrictions, like very clear, it's about AI.
- DPDwarkesh Patel
Yeah, yeah.
- DPDylan Patel
Um, even in 2022, which is amazing, like at least the Commerce Department was kind of AI-pilled. It was like, is, is you want to restrict them from having AIs worse than us, right? So starting on the right end, it's like, okay, well, if you want to restrict them from having better AIs than us, you have to restrict chips. Okay. If you want to restrict them from having chips, uh, you have to let them have at least some level of chip that the West also that is good, better than what they can build internally. Um, but currently, the, the restrictions are flipped the other way, right? They can build better chips in China, then, uh, we restrict them in terms of chips that NVIDIA or AMD or an Intel can sell to China. Um, and so there's sort of a problem there in terms of the equipment that is shipped can be used to build chips that are better than what the Western companies can actually ship them.
- DPDwarkesh Patel
Uh, John, Dylan seems to think the export controls are k- kind of a failure. Do you, do you, do you agree with him or...?
- J(Jon Y (Asianometry)
That is a very interesting question, because I think it's like-
- DPDwarkesh Patel
Why, thank you (laughs) .
- J(Jon Y (Asianometry)
Like, what do you...
- DPDylan Patel
Darkish, you're so good.
- J(Jon Y (Asianometry)
Yeah, Darkish, you're the best. I think it's, I think failure is a tough word to say, because I think it's like, what are we trying to achieve, right? Like, and say they're talking about AI, right?
- DPDwarkesh Patel
Yeah.
- J(Jon Y (Asianometry)
When you do sanctions like that, it's you need, like, such deep knowledge of the technologies.
- DPDylan Patel
You know, just taking lithography, right? If your goal is to restrict China from building chips and you just like boil it down to like, "Hey, lithography is 30% of making a chip, so are 25%. Cool, let's, let's sanction lithography." Okay, where do we draw the line? Okay, let me ask, let me ask? Let me figure out what, where the line is, and if I'm a bureaucrat or if I'm a lawyer at the Commerce Department or what have you, well, obviously I'm gonna go talk to ASML.
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
And ASML is gonna tell me, "This is the line." Because they know like, hey, well, th- you know, this, this, this is-
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
You know, there's like some blending over.
- J(Jon Y (Asianometry)
There's like... They're, they're like looking at like, "What's going to cost us the most money," right?
- DPDylan Patel
And then they all constantly say like, "If you restrict us, then China will have their own industry," right? And, and the way I like to look at it is like...... chip manufacturing is like, like 3D chess or like, you know, a, a massive jigsaw puzzle in that if you take away one piece, China can be like, "Oh, yeah, that's the piece. Let's put it in," right? And currently this, i- export, say, uh, restrictions, year by year by year they keep updating them, ever since like 2018 or so, '19, right, when Trump started, and now Biden's, you know, accelerated them. They've been like... They haven't just like take a bat to the table and like brick it, right? Like, it's like, let's take one jigsaw puzzle out, walk away. Oh, shit. Let's take two more out. Oh, shit. Right? Like, you know, it's like instead if they like... They, you either have to go kind of like full bat to the fricking like table/wall or, or chill out, right? Like, and like, you know, let them, let them do whatever they want.
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
'Cause the alternative is everything is focused on this thing and they make that. And then now when you take out another two pieces, it's like, "Well, I have my domestic industry for this. I can also now make a domestic industry for these." Like, you go deeper into the tech tree or what have you.
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
It's a very... It's art, right? In the sense that there are technologies out there that can compensate. Like, if you believe... The belief that lithography is a linchpin within the system is, it's not exactly true, right? At some point, if you keep pulling, keep pulling a thread, other things will start developing to kind of close that loop. And like, I think it's a, it's, uh, it is... That's why I say it's an art, right? I don't think you can stop Chinese semiconductor industry- the semiconductor industry from progressing. I think that's basically impossible. So the question is, the Chinese government believes in the primacy of semiconductor manufacturing. They u- they've believed it for a long time, but now they really believe it, right? To, to some extent, these sanctions have made China believe in the importance of the semiconductor industry-
- 33:36 – 39:36
Huawei’s intense culture
- DPDylan Patel
five nanometer here," right?
- DPDwarkesh Patel
And are they doing it because they believe in AI or because they wanna make Huawei phones?
- DPDylan Patel
Uh, you know, Huawei was the largest TSMC customer for, like, a few quarters actually-
- DPDwarkesh Patel
Mm-hmm. Yeah.
- DPDylan Patel
... before they got sanctioned. Uh, Huawei makes most of the telecom equipment in the world, right? Uh, you know, phones, of course, modems, but of course accelerators, networking equipment. You know, you go down the whole, like, video surveillance chips, right? Like, you kinda, like, go through the whole gambit.
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
A lot of that could use seven and five nanometer.
- DPDwarkesh Patel
So-
- J(Jon Y (Asianometry)
Do you think the dominance of Huawei is actually a bad thing for the rest of the Chinese tech industry?
- DPDylan Patel
I think Huawei is so fucking cracked that, like, it's, it's hard to say that, right? Like, Huawei outcompetes Western firms regularly with two hands tied behind their back. Like, you know, like, what, what the hell is Nokia and, like, uh, Sony Ericsson? Like, trash, right? Like, compared to Huawei. And Huawei is not allowed to ship, sell to, like, European companies or American companies, and they don't have TSMC, and yet they still destroy them, right?
- DPDwarkesh Patel
Mm-hmm.
- DPDylan Patel
Um, and, and same applies to, like, the new phone, right? It's like, oh, it's, like, as good as, like, a year old Qualcomm phone on a process node that's equivalent to, like, four years old, right, or three years old. So it's like, wait, so they actually out-engineered us with a worse process node, you know? So it's like, oh, wow. Okay. Like, you know, Huawei is, Huawei is, like, crazy cracked.
- J(Jon Y (Asianometry)
Where do you think that culture comes from?
- DPDylan Patel
The military 'cause it's the PLA.
- J(Jon Y (Asianometry)
It is the, we, we... It is generally seen as an arm of the PLA. But, like, how do you square that with the fact that sometimes the PLA seems to mess stuff up?
- DPDylan Patel
Oh, like filling water in rockets?
- DPDwarkesh Patel
Yeah. (laughs)
- J(Jon Y (Asianometry)
I don't know if that was true. I'm denying, I'm not denying-
- DPDylan Patel
So there, so there is, there is, like, that, like, like, like, crazy conspir- not conspiracy. It was like, you, you don't know what the hell to believe in China, especially as a not-Chinese person, but, like-
- J(Jon Y (Asianometry)
Like, nobody knows... Even Chinese people don't know what's going on in China.
- DPDylan Patel
There's, like, you know, like, all sorts of stuff. Like, "Oh, they're filling water in their rockets. Clearly they're, like, incompetent." It's like, look, if I'm the Chinese military, I want the Western world to, like, believe I'm completely incompetent because one day I can just, like, destroy the fuck out of everything, right? With all these hypersonic missiles and all this shit, right? Like drones and... Like, no, no, no, no. Uh, we're filling water in our missiles. These are all fake. We don't actually have 100,000 missiles that we manufacture in a facility that's, like, super hyper advanced and Raytheon is stupid as shit because they can't make, you know, missiles nearly as fast, right? Like, I think, like, that's also, like, a flip side is, like, how much false propaganda is there, right? 'Cause there's a lot of, like, "No, SMIC could never. SMIC could never. They had... They don't have the best tools," blah, blah, blah. And then it's like, motherfucker, they just shipped 60 million phones last year with this chip that performs only one year worse than, like, what Qualcomm has. It's like proof is in the pudding, right? Like, you know, there's, there's a lot of, like, cope, if you will.
- J(Jon Y (Asianometry)
I just wonder where it comes from. I do really do just wonder where that culture comes from. Like, there's something crazy about them, where they're kinda like everything they touch they seem to succeed in. And, like, I, I kinda wonder why.
- DPDylan Patel
They're making cars. I wonder if it's going on there. (laughs)
- J(Jon Y (Asianometry)
I think, like, if, like, supposedly, like, if we kind of imagine, like, historically, like, do you think they're getting something from somewhere?
- DPDwarkesh Patel
What do you mean?
- DPDylan Patel
Espionage, you mean?
- J(Jon Y (Asianometry)
Yeah, like-
- DPDylan Patel
Well, obviously.
- J(Jon Y (Asianometry)
Like, like, East Germany and the Soviet industry was basically, was just a, it was like a conveyor belt of, like, secrets coming in. And they're just, used that to run everything. But the Soviets were never good at it.
- DPDwarkesh Patel
Yeah.
- 39:36 – 41:43
Why the semiconductor industry is so stratified
- DPDylan Patel
uh-
- DPDwarkesh Patel
Why isn't there more vertical integration in the semiconductor industry? Well, like, why are there, like, this subcomponent requires this other subcomponent from this other company, which requires a subcomponent from another company? Like, why, why is more of it not done in-house?
- DPDylan Patel
The way to look at it today is it's super, super stratified. And every industry has anywhere from one to three competitors. And pretty much the most competitive it gets is like 70% share, 25% share, 5% share.
- DPDwarkesh Patel
Mm.
- DPDylan Patel
Um, in any layer of, like, manufacturing chips, anything, anything, chemicals, different types of chips. But it used to, used to be vertically integrated, like, and-
- J(Jon Y (Asianometry)
Well, in the very beginning it was integrated, right? And-
- DPDwarkesh Patel
Wh- why did that stop?
- J(Jon Y (Asianometry)
What happened was, you know, the funniest thing was that like, you know, you had companies that used to do it all in the one, and then suddenly sometimes a guy would be like, "I hate this. I think I know, I know th- how to do better." Spins off, does his own thing, starts his company. Goes back to his old company, says, "I can sell you a product-"
- DPDwarkesh Patel
Mm.
- J(Jon Y (Asianometry)
"... that's better." Right? And that's the beginning of what we call the semiconductor manufacturing in- equipment industry. Like basically-
- DPDylan Patel
Like in the '70s, right?
- J(Jon Y (Asianometry)
... it was spinoff.
- DPDylan Patel
Like everyone made their own equipment in the '60s.
- J(Jon Y (Asianometry)
'60s and '70s. Like they spin off all these people. And then what happened was that the companies that accepted, you know, these outside products and equipment got better stuff. They did better. Like, you can talk about a whole bunch ... Like, there are companies that were totally vertically integrated in semiconductor manufacturing for decades. And they are n- they're still good, but they're nowhere near competitive.
- DPDwarkesh Patel
One thing I'm confused about is, like, the actual foundries themselves. There's, like, fewer and fewer of them-
- J(Jon Y (Asianometry)
Yeah.
- DPDwarkesh Patel
... every year, right? So there's, like, more, maybe more companies overall, but like the, like the final people, like, who, uh, make the m- m- make the wafers, there's less and less. Uh, and then i- I- it's interesting in a way similar to like the AI foundation models where, um, you need to use like the revenues from like a previous model in order ... Or like the, your, um, market share to, like, fund the next round of ever more expensive development.
- J(Jon Y (Asianometry)
When TSMC launched the foundry industry, right, and when they started, there was a whole wave of, like, Asian companies that funded semiconductor foundries-
- DPDwarkesh Patel
Mm-hmm.
- J(Jon Y (Asianometry)
... of their own. You had Malaysia with Siltera. You have Singapore with Chartered. You had, uh, there was one, there's Worldwide, there's Worldwide Semiconductor where I talked about earlier. There's one from Hong Kong.
- DPDylan Patel
Bunch in Japan.
- J(Jon Y (Asianometry)
Bunch in Japan. Like, they
- 41:43 – 46:38
N2 should not exist
- J(Jon Y (Asianometry)
all sort of did this thing, right? And I think the thing was that when you go into leading edge when the thing is that, like, it got harder and harder, which means that you had to aggregate more demand from all the customers to fund the next node, right? So technically, in the sense that what this kind of do is aggregating all this money-
- DPDwarkesh Patel
Yeah.
- J(Jon Y (Asianometry)
... all this profit, to kind of fund this next node to the point where now, like, there's no room in the market for an N2 and, or N3. Like, there, technically you could argue that ... economically, you can make an argument that, like, N2 is a monstrosity that doesn't make sense economically and would, should not exist in some ways without the immense single concentrated spend of, like, five players in the market.
- DPDylan Patel
I'm sorry to, like, completely derail you, but, like, there's this video where it's like, uh, there's this unholy concoction of meat slurry.
- J(Jon Y (Asianometry)
Yes.
- DPDylan Patel
(laughs)
- DPDwarkesh Patel
What? (laughs)
- J(Jon Y (Asianometry)
(laughs) Sorry, there's like a video that's like, "Ham is disgusting. It's an unholy concoction of, like-
- DPDwarkesh Patel
(laughs)
- J(Jon Y (Asianometry)
... meat with no bones or collagen."
- DPDwarkesh Patel
Uh-huh.
- J(Jon Y (Asianometry)
And like, I don't know, like two... He was like, the way he was describing two-nanometer is kind of like that, right?
- DPDwarkesh Patel
It's like the guy who pumps his right arm so much, and he's like mo- super muscular. The human body was not meant to be so muscular.
- J(Jon Y (Asianometry)
(laughs)
- DPDwarkesh Patel
Like- Wh- wh- what, what's the point? Like why, why is two nanometer not justified?
- J(Jon Y (Asianometry)
I'm not saying N2 as like, N2 specifically, but say N2 as a concept. The next node should technically... Like right now, there is a p- there will come a point where economically, the next node will not be possible, like at all, right?
- DPDylan Patel
Unless, unless more, you know, technologies spawn. Like AI now makes, you know-
- J(Jon Y (Asianometry)
Yeah. AI, yeah.
- DPDylan Patel
... one nanometer or whatever, you know-
- J(Jon Y (Asianometry)
Yeah. There was a long period of time-
- DPDylan Patel
... where 16A-
- J(Jon Y (Asianometry)
Yeah, yeah.
- DPDylan Patel
... viable, right? So, so like right before AI-
- J(Jon Y (Asianometry)
Ra- and, and makes it viable in what's, uh, as in like, uh, it makes it-
- DPDylan Patel
Like money. Money.
- J(Jon Y (Asianometry)
... worth it or like make-
- DPDylan Patel
So every, every two years you get a shrink, right?
- J(Jon Y (Asianometry)
Yeah.
- DPDylan Patel
Uh, like clockwork, Moore's Law. Um, and then five nanometer happened, it took three years. Holy shit. And then three nanometer happened, it took three... Or no, sorry, is it three nanometer or five? Uh, took three years. Holy shit. Like is Moore's Law dead, right? Like 'cause TSMC didn't... And then what did Apple do? Even on the third year of three... of, of, uh, uh... or sorry, when three nanometer finally launched, they still only... Apple only moved half of the iPhone volume to three nanometer. So this is like... Now that they had a fourth year of I- of five nanometer for a big chunk of iPhones, right? And it's like, oh, is the mobile industry petering out? Then you look at two nanometer, and it's like gonna be a similar, like very difficult thing for the, for the industry to pay for this, right? Apple, of course, they have... you know, because they get to make the phone, they have so much profit, they can funnel into the more and more expensive chips. But finally, like that was, that was really running out, right? It was two... H- how economically viable is two nanometer just for one player, TSMC? You know, ignore Intel, ignore Samsung. Just in, you know, because in... Samsung is paying for it with memory, not with their actual profit. And then Intel is paying it from it from their former CPU monopoly, um-
- J(Jon Y (Asianometry)
Private equity money-
- 46:38 – 50:06
Taiwan invasion hypothetical
- DPDylan Patel
- J(Jon Y (Asianometry)
(laughs)
- DPDwarkesh Patel
Okay. The, uh, uh, suppose Taiwan is invaded or Taiwan has an earthquake, nothing is shipped out of Taiwan in the... from now on. What happens next? The rest of the world, how would it feel its impact a day in, a week in, a month in, a year in?
- J(Jon Y (Asianometry)
I mean, it's, it's a terrible thing. It's a terrible thing to talk about. I think it's like... Can you just say it's all terrible? Everything's terrible. Because it's not just like leading edge. Leading edge, people were focused on leading edge, but there's a lot of trailing edge stuff that, like, people depend on every day. I mean, we all worry about AI. The reality is you're not gonna get your fridge, you're not gonna get your cars, you're not gonna get everything. It's terrible. And then there's the human part of it, right?
- DPDwarkesh Patel
Yeah.
- J(Jon Y (Asianometry)
It's all terrible. Can we... Like it's, it's depressing.
- DPDylan Patel
I think that-
- J(Jon Y (Asianometry)
And I live there. Yeah.
- DPDylan Patel
I think day one, market crashes a lot, right? You gotta think about like... I think, I think the big, like big six, six mo-... Six biggest companies, magnificent seven, whatever the heck it's called are like 60, 75% of the S&P 500, and their entire business relies on chips, right? Google, Microsoft, Apple, Nvidia. Uh, you know, you go down the list, right? They're, they're all... Meta, right? They all entirely rely on AI. Um, you, and you would have a tech reset, like extremely insane tech reset, by the way, right? Like, so market would crash a week, a day in, a couple weeks in, right? Like, people are preparing now. People are like, "Oh, shit." Like, "Let's start building fabs. Fuck all the environmental stuff." Like war is probably happening.
- J(Jon Y (Asianometry)
Yeah, yeah.
- DPDylan Patel
Um, but, uh, but like the supply chain is trying to like figure out what the hell to do to refix it. But six months in, the supply of chips for making new cars, gone or sequestered to make military shit, right? Um, you can no longer make cars. Um, and we don't even know how to make non-semiconductor in- induced cars, right? Like this unholy concoction with all these like chips, right? Uh, you- you, uh-
- J(Jon Y (Asianometry)
Car's like 40% chips now.
- DPDylan Patel
Yeah. (laughs)
- J(Jon Y (Asianometry)
It's like it's just chips on, in the tires.
- DPDwarkesh Patel
(laughs)
- DPDylan Patel
There's like, there's like 2000 plus chips. Really, like every Tesla door handle has like four chips in it. It's like, what the fuck?
- DPDwarkesh Patel
(laughs)
- DPDylan Patel
Like why? Um, like, like but like it's like, it's like shitty like microcontrollers and stuff-
- J(Jon Y (Asianometry)
Yeah.
- DPDylan Patel
... but like there's like 2000 plus chips even in an, in, in an ICE vehicle, like internal combustion engine vehicle, right? And every engine has dozens of dozens of chips, right? Um, anyways-This all shuts down because, uh, not all of the production. There's some in Europe, there's some in the US, there's some in Japan, there's some in Singapore-
- J(Jon Y (Asianometry)
Yeah, they're gonna, they're gonna bring in a guy to work in, on Saturday until 4:00 (laughs) .
- DPDylan Patel
Well, yeah, yeah. I mean, yeah (laughs) . So, so you have like, TSMC always builds new fabs. That old fab, they, like, tweak production up a little bit more and more, and th- new designs move to the next, next, next node. And, and old stuff fills in the old nodes, right? So, you know, ever since TSMC has been the most important player, and not just TSMC, there's UMC there, there's PSMC there, there's a number of other companies there. Taiwan's share of, like, total manufacturing has grown every single process node. So in, like, 130 nanometer, there's a lot, and including, like, many chips from, like, Texas Instruments or Analog Devices or, like, NXP, like, all these companies. 100% of it is manufactured in Taiwan, right? By, you know, either PS- TSMC or UMC or whatever. But then you, like, step forward and forward and forward, right, like 28 nanometer. Like, 80% of the world's production of 28 nanometers is in Taiwan. Oh, fuck, right? Like, you know, and everything in 28 nanometers, like, what's made on 28 nanometer today? Tons of microcontrollers and stuff, but also, like, every display driver IC. Like, cool, like even if I can make my Mac chip, I can't make the chip that drives the display. Like, you know, you just go down the list, like, everything, no fridges, no, no automobiles, no, no weed whackers because that shit has... My toothbrush has fucking Bluetooth in it, right? Like, why? I don't know. But like, you know, there's, like, so many things that, like, just like, poof, we're tech
- 50:06 – 59:58
Mind-boggling complexity of semiconductors
- DPDylan Patel
reset.
- DPDwarkesh Patel
We were supposed to do this i- interview, like, many months ago, and then I, like, kept, like, delaying 'cause I'm like, "Ah, I don't understand any of this shit." (laughs) But like, it is like a very difficult thing to understand where I feel like with AI, it's like, I, it's not that, like-
- DPDylan Patel
No, you've just spent time. You've spent-
- DPDwarkesh Patel
Sure, sure.
- DPDylan Patel
... the time to-
- DPDwarkesh Patel
But like, I also feel like it's, like, less compli- Uh, it, it, it feels like it's the kind of thing where, like, in an amateur kind of way, you can, like, you know, pick up what's going on in the field.
- DPDylan Patel
Yes, es-
- DPDwarkesh Patel
In this field, like, I, I, the thing that cares about is, like how, how does one learn the layers of the stack? Because the layers of the stack are like, there's not just the papers online. You can't just, like, look up the, the tutorial on how the transformer works or whatever. It's like-
- DPDylan Patel
Yes. I mean, like-
- DPDwarkesh Patel
It's like many layers of really difficult shit.
- DPDylan Patel
There are, like, 18 year olds who are just cracked at AI, right?
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
Already, right? And like, there's high school dropouts that get, like, jobs at OpenAI. This existed in the past, right? Pat Gelsinger, current CEO of Intel, went straight to work. He was, he, like, grew up in the Amish area of Pennsylvania, and he went straight to work at Intel, right? Because he's just cracked, right? That is not possible in semiconductors today. You can't even get, like, a job at, like, a tool company without, like, a, at least, like, a freaking master's in chemistry, right? And probably a PhD, right? Like, like, of the, like, 75,000 TSMC workers, it's like 50,000 have a PhD or something insane, right? It's like, okay, this is like, there's like some, there's like a next level amount of like, how specialized everything's gotten. Whereas today, like, you can take like, you know, Sholto, you know, he, he, when did he start working on AI? Not that long ago. Right?
- J(Jon Y (Asianometry)
Not to say anything bad about Sholto.
- DPDwarkesh Patel
(laughs)
- DPDylan Patel
No, no, no, he's c- but he's cracked, he's like omega cracked-
- DPDwarkesh Patel
Yeah, yeah.
- DPDylan Patel
... at like what he does. What he does, you could pick him up and drop him into another part of the AI lay- stack. First of all, he understands it already. And then second of all, he could probably become cracked at that too, right? Um, whereas that is not the case in semiconductors, right? You ca- you, one, you, like, specialize like crazy. Two, you can't just pick it up. Um, you know, like Sholto, I think, what did he say? He, like, just started, like-
- DPDwarkesh Patel
Uh, he was a consultant in McKinsey, and at, like, night, he would, like, read papers about robotics-
- DPDylan Patel
Right.
- DPDwarkesh Patel
... and like, run experiments and whatever.
- DPDylan Patel
Yeah, and then, and then, like, he, like, was like, like people noticed, it was like, "Who the hell is this guy?"
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
"And why is he posting this?" Like-
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
... I thought everyone who knew about this was at Google already, right?
- DPDwarkesh Patel
Yeah, yeah, yeah.
- DPDylan Patel
And it's like, "Come to Google," (laughs) right? That can't happen in semiconductors, right? Like, it's just not, like, conducively, like, it's not possible, right? One, arXiv is, like, a free thing. Um, the paper publishing industry is, like, ou- abhorrent everywhere else. And you just, like, cannot download IEEE papers or, like, SPIE papers or, like, other organizations. And then two, at least up until, like, late 2022 or really early 2023 in the case of Google, right? I think, what, the PaLM inference paper, up until the PaLM inference paper, before that, all the good, best stuff was just posted on the internet. After that, you know, it's kind of a little bit clamping down by the labs, but there's also still all these other companies making innovations in the public. That, and, and like, what is state of the art is public. That is not the case in semiconductors.
- J(Jon Y (Asianometry)
Semiconductors have been shut down since 1960s, 1970s, basically. I mean, like, it's kinda crazy how little information has been formally transmitted from one, one country to another. Like, the last time you could really think of this was like 19, maybe the Samsung era, right?
- DPDwarkesh Patel
So then how do you guys keep up with it?
- 59:58 – 1:05:21
Chip architecture design
- DPDylan Patel
I think, um, you know, it's first, it's important to state that semiconductor manufacturing and design is the largest search base of any problem that h- humans do because it is the most complicated, uh-
- DPDwarkesh Patel
Mm-hmm.
- DPDylan Patel
... industry that anything that humans do. And so, you know, when you think about it, right, there's, there's, uh, 1E10, 1E11, right? Uh, 100 billion transistors-
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
... uh, on, on leading edge chips, right? Blackwell has, uh, two 20 billion transistors or something like that. So what is... And those are just on-off switches. And then think about every permutation of putting those together, contact ground, et cetera, drain source, blah, blah, blah, with wires, right? There's 15 metal layers, right? Connecting every single transistor in every possible arrangement. This is a search base that is literally almost infinite, right? You could like... The search base is much larger than any other search base that humans know of.
- DPDwarkesh Patel
And the nature of the search, like what are you trying to optimize over?
- DPDylan Patel
Well, useful compute, right? What is, you know, if your- if the, if the goal is optimize intelligence per picajoule, right? Um, and- and intelligence is some nebulous nature of like the, what the model architecture is.
- DPDwarkesh Patel
Yeah, yeah.
- DPDylan Patel
Uh, but, and then, and then picajoule is like a unit of energy, right?
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
How do you optimize that? So there's humongous innovations possible in architecture, right? Because vast majority of the power on a H100 does not go to compute. And there are more efficient, like-... uh, compute our, you know, ALUs, arithmetical unit like designs, right? But even then, the vast majority of the power doesn't go there, right? The vast majority of the power goes to moving data around, right? And then when you look at what is the movement of data? It's either networking or memory. You know, you have, you have a humongous amount of movement relative to compute, and a hu- humongous amount of power consumption relative to compute. And so the, so how can you minimize that data movement, m- and then maximize the compute? There are 100X gains from architecture. Even if we, like, literally stopped shrinking, I think we could have 100X gains from architectural advancements.
- DPDwarkesh Patel
Over what time period?
- DPDylan Patel
Um, the, the, the question is how much can we advance the architecture, right?
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
The, the challenge, the other challenge is like, the number of people designing chips has not necessarily grown in a long time, right? Um, yeah, like, company to company it shifts, but like, within like the semiconductor industry in the US, and the US makes, you know, designs the vast majority of leading edge chips, the number of people designing chips has not grown much.
- DPDwarkesh Patel
Mm-hmm.
- DPDylan Patel
Um, what has happened is the output per individual has soared because of EDA, electronic design assistance tooling, right? Now, this is all still, like, classical tooling. There's just a little bit of inkling of AI in there yet, right? What, what happens when we bring this in, is the question. And, and how you can solve this search base somehow, uh, with humans and AI working together to optimize this so it's not most of the power is mo- data movement, and then the er- logic, the, the compute is actually very small, to flip side, um, the compute is... First of all, compute can get like 100X more efficient just with, like, design changes, and then you could minimize that data movement massively, right? So, you can get a humongous gain in efficiency just from architecture itself. And then process node helps you innovate that there, right? And, uh, power delivery helps you innovate that. Uh, system design, chip-to-chip networking helps you innovate that, right? Like, memory technologies, there's so much innovation there, and there's so many different vectors of innovation that people are pursuing-
- DPDwarkesh Patel
Mm-hmm.
- DPDylan Patel
... simultaneously, uh, to where, like, Nvidia gen to gen to gen will do more than 2X performance per dollar. Uh, I, I think that's very clear. And then, like, hyperscalers are probably gonna try and shoot above that, but we'll see if they can execute.
- DPDwarkesh Patel
The, there's, like, uh, two narratives you can tell here of how this happens. One is that these AI c- companies who are training the foundation models, who understand the trade-offs of, like, how much is the marginal increase in compute versus memory worth to them, and what trade-offs do they want between different kinds of memory, they understand this. And so therefore, the accelerators they build, uh, they can make these sort of trade-offs in a way that's like most optimal or, and also design, like, the architecture of the, the model itself in a way that, uh, uh, uh, reflects, like, w- what are the hardware trade-offs. Another is NVIDIA, because it has, like... M- I, I, I don't know how this works, but presumably they have some sort of, like, know-how, like, they're accumulating all this, like, uh, knowledge about how to better design this architecture and like also better search tools for so on. Um, who has basically, like, better moat here in terms of... W- will NVIDIA keep getting better at design, getting this 100X improvement? Or will it be like OpenAI and Microsoft and, uh, Amazon and Anthropic who are designing their accelerators, who will keep getting better at, like, designing the accelerator?
- DPDylan Patel
I, I think that there's a few vectors to go here, right? One is, you mentioned, and I think it's important to note, is that hardware has a huge influence on the model architecture that's optimal. And so it's not a one-way street that better chip equals, you know... The m- the optimal model for Google to run on TPUs, given a given amount of-
- DPDwarkesh Patel
Mm-hmm.
- DPDylan Patel
... dollars, a given amount of, uh, compute, is different architecturally than what it is for OpenAI with NVIDIA g- stuff, right? It is, like, absolutely different. And then, like, even down to, like, networking decisions that different companies do and data center design decisions that people do, the optimal, like, if you were to say, you know, X amount of compute of TPU versus GPU, compute optimally what is the best thing, you will d- diverge in what the architecture is. And I think that's important to know, right?
- DPDwarkesh Patel
Wait,
- 1:05:21 – 1:10:57
Architectures lead to different AI models? China vs. US
- DPDwarkesh Patel
can, can I ask about that real quick? Uh, the, um... So earlier we were talking about how China has the, uh, uh, H20s or, uh, B20s, um-
- DPDylan Patel
Yeah.
- DPDwarkesh Patel
Uh, and there, there's like much less compute per memory bandwidth and, like, the amount of memory, right? Does that mean that Chinese models will actually have like very different architecture and characteristics than American models in the future?
- DPDylan Patel
So, so you can take this to like a very, like, large conclu- like, leap, and it's like all, you know, neuromorphic computing or whatever is, like, the optimal path, and that looks very different than, like, what a transformer does, right? Um, or you could take it to, like, a simple thing, which is like the level of sparsity, eh, th- like f- well, coarse-grained sparsity, i.e. like, experts and all this sort of stuff.
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
Um, the arrangement of, w- like, what exactly the attention mechanism is, because there are a lot of tweaks.
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
Uh, it's not just like pure transformer attention, right? Or like, hey, D mod, like how wide versus tall the model is, right? That's like very important. Like, D mod versus, you know, number of layers, right? Um, these are all, like, things that, like, would be different, like... And I, and like, I know they're different between, like, say-
- DPDwarkesh Patel
Yeah, yeah.
- DPDylan Patel
... a Google and an OpenAI in what is optimal.
- DPDwarkesh Patel
Yeah.
- DPDylan Patel
But what really s- it really starts to get like, hey, if you are limited on a number of different things, like, uh, like China invests humongously in compute and memory. Um, you know, which is, like, basically the memory cell is directly coupled... Or is the, uh, the compute cell, right? So these are, like, things that, like, China's investing hugely, and you go to conferences like, oh, there's 20 papers from Chinese companies/universities about compute and memory. Or like, you know, hey, like, because the flop limitation is here, maybe NVIDIA pumps up the on-chip memory and, like, changes the architecture, because they s- they still stand to benefit tens of billions of dollars by selling chips to China, right? Today, it's just like neutered American chips, right? Uh, uh, neutered chips that go to the US, but like, it'll start to diverge more and more architecturally, because they'd be stupid not to make chips for China, right? Um-And Huawei, obviously again, like has like their constraints, right? Like where are they limited on memory? Oh, they have a sh- lot of networking capabilities, and they could move to like certain optical, like, networking technologies directly onto the chip much sooner than we could, right? Because that is what's optimal for them within their search base of solutions, right? Because this whole area is like blocked off. Right?
- NANarrator
It's kind of really interesting to see, to think about like the development of how Chinese AI models will differ b- from American AI models because of the, 'cause of these changes or these constraints.
- DPDylan Patel
And it applies to use cases, it applies to data, right? Like American models are very important about like, "Let me learn from you," right? "Let me be able to use you directly as a random consumer," right? That is not the case for a Chinese model, I assume, right? Uh, because there's probably very different use cases for them. Uh, China's crushes the West at video and image recognition, right? Uh, at ICML, like Albert Gu at, you know, of Cartesia, like state-space models, like every single Chinese person was like, "Can I take a selfie with you?" Man was harassed. In the US, like you see Albert, and he's like, it's awesome, he invented state-space models, but it's not like state-space models are like, like here. But that's because state-space models potentially have like a huge advantage in like video and image and audio, which is like stuff that China does more of and that is further along and has better capabilities in, right?
Episode duration: 2:10:53
Install uListen for AI-powered chat & search across the full episode — Get Full Transcript
Transcript of episode pE3KKUKXcTM
Get more out of YouTube videos.
High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.
Add to Chrome