Skip to content
No PriorsNo Priors

No Priors Ep. 13 | With Jensen Huang, Founder & CEO of NVIDIA

So much of the AI conversation today revolves around models and new applications. But this AI revolution would not be possible without one thing – GPUs, Nvidia GPUs. The Nvidia A100 is the workhorse of today’s AI ecosystem. This week on No Priors, Sarah Guo and Elad Gil sit down with Jensen Huang, the founder and CEO of NVIDIA, at their Santa Clara headquarters. Jensen co-founded the company in 1993 with a goal to create chips that accelerated graphics. Over the past thirty years, NVIDIA has gone far behind gaming and become a $674B behemoth. Jensen talks about the meaning of this broader platform shift for developers, making very long term bets in areas such as climate and biopharma, their next-gen Hopper chip, why and how NVIDIA chooses problems that are unsolvable today, and the source of his iconic leather jackets. 00:00 - Introduction 01:26 - The early days when Jensen Co-founded NVIDIA 04:58 - Why NVIDIA started to expand its aperture to artificial intelligence use cases 10:42 - The moment in 2012 Jensen realized AI was going to be huge 13:52 - How we’re in a broader platform shift in computer science 17:48 - His vision for NVIDIA’s future lines of business 18:09 - How NVIDIA has two motions: Shipping reliable chips and solving new use cases 25:41 - Why no one should assume they’re right for the job of CEO and why not every company needs to be architected as the US military 31:39 - What’s next for NVIDIA’s Hopper 32:57 - Durability of Transformers 35:08 - What Jensen is excited about in the future of AI & his advice for founders

Sarah GuohostJensen HuangguestElad Gilhost
Apr 25, 202352mWatch on YouTube ↗

EVERY SPOKEN WORD

  1. 0:001:26

    Introduction

    1. SG

      Thank you for doing this with us, Jensen.

    2. JH

      Delighted to do it, Sarah. Delighted.

    3. SG

      Why don't we start at the beginning? Uh, you worked at LSI and AMD before starting a company. How did that happen?

    4. JH

      They gave me a job. (laughs)

    5. EG

      (laughs)

    6. NA

      (laughs)

    7. JH

      Let's see, uh, let... I was at Oregon State University, and, uh, uh, it was, um, a campus company day, and, uh, I interviewed at a lot of companies. And, and, um, uh, uh, two companies really, really, um, uh, connected with me. I, I love designing chips and designing computers, and, uh, at the time, in our lab, in the computer science lab, there was a, a poster of a 29,000, um, 32-bit, uh, CPU from AMD. And, uh, uh, it... you know, I always thought it'd be kinda cool to build that. Uh, on the other hand, uh, there was a- another company that was a startup at the time, uh, built by, uh, one of the legends of Silicon Valley, Wulf Corrigan. And, uh, they started a company to design chips using software, uh, to design chips not by hand, but by, by, uh, uh, using programmable, uh, logic. And, and you would describe it in language, and it would, it would synthesize it to chips. And of course,

  2. 1:264:58

    The early days when Jensen Co-founded NVIDIA

    1. JH

      uh, I chose to go to AMD. Uh, it turned out, it turned out that I went there to design microprocessors and, and, uh, my lab partner... not my lab, but my office mate, uh, ended up going to LSI. And, and she, uh, uh, insisted that I, I go there, uh, after I was there and after, after she went there. And, and, um, uh, the LSI team, uh, said, "Hey, we were recruiting this kid from, from Oregon State, and we really wanted him to come work at LSI Logic." And, and it turned out to have been her office mate. And so they all reached out to me, and, and I decided to go there because, uh, it was at the beginning of the EDA industry. It was at the beginning of, uh, designing, uh, chips using computers. And, uh, it was probably one of the... one of the best things that's ever, ever happened to me. And, um, uh, it was in the beginning of the, the, uh, ability for every company to build their own chips, and it's the reason why, uh, I met some really great computer architects. Uh, uh, Andy Bechtolsheim was the founder of Sun. I got to work with, uh, uh, a bunch of great architects at Silicon Graphics and, uh, Jon Rubinstein, who, who, um, uh, was at a company called Dana Computer, who became the vice president of engineering for Apple. And, and so I... uh, in, in a very early age, I got the opportunity to work... And then, of course, uh, the two founders of NVIDIA, uh, Chris Malkowski and, uh, Curtis Payne, myself. And, and so I got a chance to work with some really amazing computer architects, and I learned a lot about, uh, about building computers with chips. And so that's, you know, my early days.

    2. SG

      And you were a star at LSI with your co-founders. At what point did you know, "I have to start a company"?

    3. JH

      Uh, it wasn't my idea. It was theirs. Uh, Chris and Curtis wanted to, wanted to leave Sun. Uh, they had their own reasons. And, uh, I was doing really well at, at LSI Logic, and, and I enjoyed my job, and we had two kids, Laurie and Ian. And, um, I... and, you know, just like, just like you, they, they wouldn't stop hounding me.

    4. EG

      (laughs)

    5. JH

      And, and they said, "Hey, you know, we, we wanna start this company, and we really need you to come along." And, and, uh, and I told them that I really needed to have a job, and, and, um... and so anyways, uh, they needed to figure out what to do. Uh, at the time, the, the S- the valley was, was, uh, um... the way of designing computers was rather s- rather split between, uh, general purpose computing, uh, versus using accelerators. And, um, about 99% of, of, uh, of, uh, the valley was, uh, believed in general purpose computing, and about 1% believed in acceleration. And, um, and, and for 25 years, 99% was right. So we s- we, uh, we decided to start a company on, on accelerated computing. Um, and, you know, at the time, the only thing you could really do with accelerated computing is, is, uh, find applications or find problems that were barely solvable or unsolvable by general purpose computing. So then that's k- kinda what we dedicated our company to do, to solve problems that normal computers can't. And if you, if you, uh, follow that mission to its limit, um, it led us to, uh, self-driving cars. It led us to robotics. It led us to climate science problems, uh, digital biology, and of course, you know, one of the most famous ones is artificial intelligence.

    6. SG

      Uh, so you were working on this huge set of applications before the, you know, current ra- wave of artificial intelligence. What was the original, um, technical advantage of NVIDIA in, in artificial intelligence, and when did you begin to realize that,

  3. 4:5810:42

    Why NVIDIA started to expand its aperture to artificial intelligence use cases

    1. SG

      um, this was gonna be a por- important use case for you guys?

    2. JH

      Uh, so we, we had... we had, um, uh, expanded the, the flexibility of our, of our, um, of our accelerators to, to be more general purpose, and we invented a, a new computing model, uh, called CUDA. And, um... we're doing this podcast, like, at 4:00 or something like that in the afternoon. It was like, at the lowest point of energy. Isn't that right?

    3. EG

      Yeah.

    4. JH

      So this, this is... we need some...

    5. EG

      That's why we need-

    6. JH

      We need some nerds.

    7. EG

      ... nerds.

    8. SG

      We need nerds.

    9. JH

      That's right, we need some nerds, guys.

    10. NA

      (laughs)

    11. SG

      Thank you, nerds.

    12. EG

      Now with gummy clusters as well, so it's very exciting new technology.

    13. JH

      We need, we need, we need some energy.

    14. EG

      (laughs)

    15. JH

      We need some accelerated... we need some accelerated computing in our life.

    16. SG

      (laughs)

    17. NA

      (laughs)

    18. JH

      And my, my voice is hoarse. (clears throat) So we wanted to... we wanted to make our, our, um, our graphics processors more and more general. And the reason for that in the beginning was because some of the effects that we had to do related to general purpose image processing, post-effects. You render an image, and you do post, post, um, image effects.... and other applications, of course, we wanted to bring the scene to life, and so we had to do physics processing. Um, in order to do physics, you have to do particle physics, fluid dynamics, so on and so forth. And so we, we expanded the aperture of our, of our accelerated computing platform to be more, and more, and more general purpose. The problem with general purposeness is that the more general purpose you are, the less, um, acceleration you get in any particular domain. And so you've got, you've got to find that, that, that line really, really carefully. And that's one of the gifts of our company, to find that line between, on the one hand, every single generation bringing enormous amounts of acceleration well beyond what CPU could do, um, to the application. And so, if you become too general purpose, you're like, just like a CPU. How can you accelerate a CPU with a CPU? And so you... So you have to find a way to f- walk that line. On the other hand, if you don't expand the aperture of the applications that you serve, the R&D dollars that you're able to generate wouldn't be enough to stay ahead of the CPU, which had the largest R&D budget of any chip on the planet. So if you think about this problem, it's actually really nearly impossible, because you have a small application, let's call it, you know, a billion-dollar market at the time, and, and out of that billion-dollar market you're investing 115- $150 million a year. Out of that $150 million a year, how do you keep up with a few hundred billion dollar industry? It's not even sensible. And so you have to find that niche very, very carefully where $150 million would accelerate this particular application abnormally and insanely. And then over time, you could expand your application space so that it's, goes from a billion dollars, to five billion dollars, to ten billion dollars, so on and so forth, without falling off that cliff. That is the fine line that we w- we walk, and... And so we kept expanding the, the general purposeness, and it led us to, uh, molecular dynamics simulation, which is what this image seems to look like. And, and, um, uh, seismic processing was another, uh, industry. And, uh, just slowly but surely, uh, we, we expanded our aperture. But one of the things that we did well was to make sure that, that irrespective of whether somebody used our platform for general purpose computing, accelerated computing, we always maintained the architecture compatibility. And the reason for that is because we wanted, uh, a platform that would attract developers. If every single NVIDIA chip in the world was incompatible, then how would a developer be able to pick one up, even if they learned that, that CUDA was going to be incredible for them? How would they pick that up and say, "I'm going to develop an application that's going to run on that"? Wh- h- which chip would they, would they have to go figure out? And, and nobody could figure that out. And so we said, "If we're gonna... If we believe in an architecture, and we want this to be a new computing platform, then let's make sure that every one of our chips, uh, perform exactly the same way, just like an x86, just like ARM, just like any computing platform." And so for the first five, ten years, you know, we had very few customers for CUDA, but we made every chip CUDA-compatible. And you can go back in history and looked at, looked at our gross margins. Um, it started out, it started out poor, and it got worse, you know? (laughs) So... Uh, because we were, we were in a really competitive industry and, and we were still trying to figure out how to do our job and build cost-effective things, so... So you know, i- i- it was already challenging as it is, and then we layered on top of this this architecture that was called CUDA that had no applications that nobody paid for.

    19. EG

      Yeah.

    20. JH

      And so-

    21. EG

      It's kind of amazing, because now when I talk to people in the AI world in terms of one of the reasons that they really love using NVIDIA GPUs is because of CUDA.

    22. JH

      Mm-hmm.

    23. EG

      And then because of the ability to scale interconnect.

    24. JH

      Yeah.

    25. EG

      And so you can really, like, highly parallelize these things as well, which you can't necessarily do with other approaches or architectures that are in the market today.

    26. JH

      Yeah, and s- so this, this computing platform, Elad, it, it's, uh, i- i- it's strange in a sense that it does, it performs these miraculous things, um, and, and, uh, we carried it to, to the world on the backs of GeForce, which is a gaming card.

    27. EG

      Mm-hmm. Mm-hmm.

    28. JH

      Uh, the, the first GPU that Geoff Hinton got for his lab-

    29. EG

      Mm-hmm.

    30. JH

      ... you know, uh, E- Elad would tell you that, that, uh, Geoff came in and said, "Here's a couple of, uh, uh, GPUs."

  4. 10:4213:52

    The moment in 2012 Jensen realized AI was going to be huge

    1. EG

      and sort of mining, and then in the context of AI. And it seemed like those were the two markets where a bunch of people were organically just adopting you. Was it... Were you marketing to those communities? Was it just people started realizing that-

    2. JH

      That's the beauty-

    3. EG

      ... they needed linear algebra and... (laughs)

    4. JH

      That's the beauty of a computing platform, right?

    5. EG

      Yeah.

    6. JH

      Uh, in the beginning, you have to target the applications, and in the beginning we did. Uh, one of the first applications was NAMD.

    7. EG

      Mm-hmm.

    8. JH

      Uh, seismic processing. Um, uh, it was, uh, both of them, both of them are, are, are... Uh, one of them is kind of particle physics, the other one is, uh, image processing, if you will, so... And so inverse physics, if you will.

    9. EG

      Mm-hmm.

    10. JH

      And, and so, uh, one particular domain, uh, you know, we just, we just went out to hire, uh, to, to, um, uh, research, we went to scientific computing, um, centers and, and we said, "What kind of pro- problems are just beyond your reach?"

    11. EG

      Mm-hmm.

    12. JH

      And, uh, um, the list of applications are, include-

    13. EG

      Yeah.

    14. JH

      ... quantum chemistry and quantum physics and, you know, so on and so forth.

    15. EG

      What was the moment when you said, "Wow, this AI thing is really important for us?"

    16. JH

      Uh, it happened, it happened, uh, uh, around 2012, I guess, and it was because, uh, simultaneously, Andrew Ng, uh, reached out to Bill Dally, our, our chief scientist, um, to, uh, work on a way to get the neural network model that they were working on-

    17. EG

      Mm-hmm, mm-hmm.

    18. JH

      ... uh, onto GPU so that they could, instead of using, uh, thousands of, uh, CPU servers, they could use a few GPUs, uh, to, uh, to do training. So, that was one.... simultaneously, almost— uh, l- simultaneously, uh, Geoff Hinton reached out to us, and we started-

    19. SG

      Hm.

    20. JH

      ... hearing about that, and, uh, same thing was happening with Yann LeCun in his lab, and-

    21. SG

      Mm-hmm.

    22. JH

      ... and so simultaneously in several different labs, we're starting to feel that there's this, this, uh, this neural network-

    23. SG

      Mm-hmm.

    24. JH

      ... uh, emergence that, that is ... and, and that attracted our attention.

    25. EG

      Yeah, I guess 2012 was also the year when AlexNet came out.

    26. JH

      Yeah. Right.

    27. EG

      So it felt like that was a year of transition for deep learning in general, in terms of really ... that was the moment in time, at least, that I remember thinking, "Wow, this, this really exciting wave of AI coming."

    28. JH

      Yeah.

    29. EG

      And then I feel like for 10 years nothing really happened for startups, but a lot of incumbents started adopting this technology at scale. Yeah.

    30. JH

      Yeah, we started feeling it. We started hearing about it before that, and then ImageNet kind of-

  5. 13:5217:48

    How we’re in a broader platform shift in computer science

    1. JH

      I mean, just, just if it was just computer vision, we could've used it for all kinds of int- interesting applications, like self-driving cars and robotics, and, and we did. Uh, but w- we observed that this might be a new way of writing software altogether and asking ourselves, "What's the implication to chip design, system design, interconnect the algorithm, the system software?" Uh, to, to really reason about not just, "Why is this exciting, why was it so effective," which was impl- i- it was ... j- that alone was plenty, miraculous.

    2. SG

      Mm-hmm.

    3. JH

      That ImageNet, uh, without, without, uh, a specifically any human engineered algorithm would, would reach the level of effectiveness, you know, compared to 30 years of computer vision al- algorithms, uh, overnight, you know, and it wasn't, it wasn't by a small amount. And so the first question, of course, is, why is it so effective, and was this going to be scalable? And if it was gonna be scalable, what's the implication to the rest of computer science? What, what problems can't this universal function approximator, if you will, that can solve problems of dimensionality extraordinarily high, and yet you could, um, uh, learn the function using enough data, which, at the time, we were starting to believe we can get plenty of, and to, to, uh, systematically train this model into existence? Because, you know, it's, it's, uh, you train 'em one layer at a time.

    4. EG

      Can you talk a little bit more about ... I've, I've heard you be very articulate in terms of how you view this as a broader platform shift, just even in terms of, like, how pages are served versus, you know, generated or other aspects of that. Could you talk a little bit more about, you know, what's really happening right now more broadly in computer science with this shift to AI?

    5. JH

      Yeah, so you fast-forward now a decade. The first, the first five years was, was by reasoning the impact of computer science altogether. Um, th- at the same time, we're developing, uh, new models of all kinds, right? And so CNNs to ResNets to RNNs to LSTMs to, you know, all kinds of new models, and, and scaling 'em larger and larger, making great strides in perception models particularly. And of course, the, uh, the transformer was a big deal, BERT was a big deal, um, a- all, all of you know that story r- well, and, um-

    6. SG

      Did you guys see, like a, like a step change in, in volume growth, right, uh, with transformers and, and BERT and such? Um, because it feels like having a architecture and me- like, an attention mechanism that allowed for scaling of these models really was also a kickstart in the industry.

    7. JH

      Well, the, the ability for you to learn, um, patterns and relationships from spatial as well as sequential data, uh, must be an architecture that's very effective, right? And so, so I think on its, on its first principles, you, you can kinda think transformer's gonna make ... it's gonna be a big, big deal.

    8. SG

      Mm-hmm.

    9. JH

      Not only that, you could train it in parallel, and you can really scale this model up. And so that's very, very exciting. Um, I ... and so I, I think that, that when transformers came, first came out, uh, we realized that, uh, that there's a model now that overcame the limitations of RNNs and LSTMs, and, and, um, uh, we can now learn sequential data in, in a very large way. So, that was very exciting. Uh, BERT was very exciting. Um, uh, we trained some of the early language models ourselves, and th- and we saw very good results. Uh, but it, it wasn't until, it wasn't until, um, uh, uh, the combination of, uh, reinforcement learning/human feedback, uh, wasn't ... and, and of course some of the breakthrough work that was done with retrieval models, um, uh, dialogue managers that, that does the g- guardrailing. Um, s- it wasn't until some of, all of those kind of pieces started to come together that, of course, that we all enjoy ChatGPT. And, and, Eli, the, the point that you're trying to make

  6. 17:4818:09

    His vision for NVIDIA’s future lines of business

    1. JH

      is, is, um, uh, the, the observation that computer programming has now, uh, been completely disrupted, that for the very first time in the history of computing, the language of programming a computer is human.

    2. EG

      Mm-hmm.

    3. JH

      You know? Th- any human language. And, and, uh, it doesn't even have to be grammatically correct.

    4. SG

      (laughs)

    5. JH

      And, and it's, it's,

  7. 18:0925:41

    How NVIDIA has two motions: Shipping reliable chips and solving new use cases

    1. JH

      uh, fairly in- incredible that, that, um, anyone can program a computer now.

    2. EG

      Mm-hmm.

    3. JH

      And so that's a, that's a, that's a big deal. Uh, the fact that you program it differently, it writes different applications.... um, what is the reach of this new computing, uh, computing model, uh, apparently quite, quite large, and it's, it's the reason why ChatGPT is the fastest-growing application in history.

    4. EG

      Mm-hmm.

    5. SG

      We had, um, Alex Graveley, who is the chief architect for Copilot-

    6. JH

      Uh-huh.

    7. SG

      ... on the show as well, and, uh, his, his favorite... Like, obviously it's, you know, very powerful to have sort of, um, uh, sequential code prediction, but his favorite use cases of Copilot have been, like, people telling him that they don't code, but now they do.

    8. JH

      I... Yeah, right.

    9. SG

      Which is, which I think is very democratizing, as you said.

    10. JH

      It's quite amazing that you could-

    11. SG

      Yeah.

    12. JH

      ... give, uh, um, ChatGPT a, a problem to solve, and it reasons through it step by step, um, but yet it, it arrives at the wrong answer on the one hand. On the other hand, you could tell it to write a program to solve the same problem, and it writes a program that solves the problem perfectly. And so, the, the fact that, that there's an application that, on the one hand, reasons and tries to solve a problem and does a fairly good job at it, it's almost there. Um, uh, on the other hand, it can write a program altogether to solve that same problem. You know, it's, it's, uh, you gotta really w- wrap your head around the implication of this, um, uh, yeah.

    13. EG

      So do you view it as, like, the future world is some form of machine sentience?

    14. JH

      Well, uh, first of all, I don't even know what that word means to... Uh, uh, in a-

    15. EG

      Yeah.

    16. JH

      ... in a technical way.

    17. EG

      Yeah.

    18. SG

      Yeah.

    19. JH

      Um, I'm, uh, you know, I'm fairly sure that I'm sentient, less so today.

    20. EG

      (laughs)

    21. JH

      I'm, I'm kind of under the weather, but-

    22. EG

      That's what we have nerves for.

    23. JH

      Yeah. Yeah, that's why I need-

    24. SG

      Keep eating. (laughs)

    25. JH

      That's why I need nerds to, to, to, to crank me up here.

    26. EG

      (laughs)

    27. SG

      I'll try to.

    28. JH

      Yeah, I know. I know, today was a tough day. No. Um, but, but, uh, I, I, I don't, I don't know. But, uh, do I, do I believe that, uh, we now have a, uh, a software that can reason through a problem, uh, for many, many types of problems, reason through a problem and solve... and provide a solution, uh, or a program to systematically provide a solution on an ongoing basis? The answer is yeah.

    29. EG

      Mm-hmm. Yeah. And then as you look forward to that world, how do you think about, um, where you wanna take NVIDIA's lines of business, but also, you mentioned in the past that NVIDIA has done things like train models, and you've done some really interesting things there. Is that gonna be an increasing part of what you do in the future, or are you mainly focused on the chip side? Or how do you think about that mix of helping to push forward some research as well as, you know, being the underlying platform for the industry?

    30. JH

      Well, we're a computing platform company, and we have to go u- go up the stack as far as we need to so that developers can use it.

  8. 25:4131:39

    Why no one should assume they’re right for the job of CEO and why not every company needs to be architected as the US military

    1. JH

      Investing in the future and being, um, sustainable now are, are not in conflict with each other. And, and so the, the, the challenge for, for all startup CEOs and for all CEOs is to find a way to, to be able to do what you believe in, the, the fundamental core belief of the institution, and to be able to afford doing it. That is the purpose of the company.

    2. SG

      Mm-hmm.

    3. JH

      And, and so you... The... It's part, it's part, um, conviction, it's part skill. You know, make me... Making money is not a, a matter of conviction. Making money is a matter of skill, and it's a learnable skill. And it took me a long time to learn it, you know, I'll, I'll admit that. Um, and, you know, I've been, I've been at this for 30 years, and, and, um, uh, for the first... For... Well, apparently, for the first 20 years, since you went back 10 years, uh, for the first 20 years, I was still trying to figure it out. And so... But it's a skill. Learning, learning how to make money, learning how to, how to, um, uh, uh, run a company, uh, efficiently, those are all skills, and the company has to develop the skills. And, and so, so I, I think the, the, um, the, the way that we, we ultimately do it, uh, is y-... We ask ourselves, "Do we... Do we really believe it or not?" And if we really d-... Believe in doing something, then it, it is the purpose of the enterprise, it's the singular purpose of the institution to go pursue its beliefs. And the rest of it is up to, um, all of the cleverness of the company and, you know, try to do our jobs well and build things that people want to buy and, um, try to make it as cost-effective as possible, make the company as efficient. Those are all skills. You know, the hard part, the hard part, uh, as, as it turns out, is not the skill part. Uh, you know, it took me a long time, but, but, um, uh, you know, a lot of companies know how to make money, obviously. And so, so the fact that there are more than one company that makes money suggests it's not that hard. You know, if somebody else can do it, Harcard can be.

    4. SG

      (laughs)

    5. JH

      (laughs) And, and so the... But to, to be singularly advancing, um, a new computing model we call accelerated computing, and we believe that someday, that on the one hand, accelerated computing can help us solve problems and tackle problems that normal computers can't, and it exposed us to all of these amazing applications, like digital biology, that I'm excited about today, like climate change, that we're excited about, like robotics and self-driving cars. If not for the fact that we're pursuing applications that were impossible with normal computers, why would we have discovered all of those things? You know, why would we have discovered artificial intelligence? Why, why would we be, um, the workhorse of large language models? Because large language models are barely possible. And if you are doing something that's barely possible, you call us, you know. We're the... You know, we're, we're the horse you call- (laughs)

    6. SG

      (laughs)

    7. JH

      ... to, to, to, uh, solve those problems. And, and so I, I love that aspect. I love the fact that we get to discover those future. Um, on the other hand, uh, w- we deeply believe that, that someday everything will be accelerated. And the reason for that is, is, um, very clearly that, that the CPU will run its course, and there's a limit to how far you could scale general purpose computing. And, and, um, uh, you'll always need it. You'll always need CPUs. But, uh, the, the type of applications that you... that we're, we're all gonna run, um, acceleration is really the best way forward. And so, so I, I... We just... A- at our core, we believe that from day one. 30 years ago, that's the reason why we started the company. And so it's the true conviction.

    8. SG

      You have been, uh, enormously vindicated on this, you know, 30-year belief.

    9. JH

      Mm-hmm.

    10. SG

      Um, uh, you must have felt like, you know, that conviction challenged at some point in, you know, 30 years of running the company and learning the skills to run the company.

    11. JH

      Mm-hmm.

    12. SG

      What was the... I, I guess, what was the nearest death experience or the most concerned where you're like, "Maybe I'm not right," or has that ever happened?

    13. JH

      That I'm not right for the job?

    14. SG

      No, you're not right about accelerated computing and how important it will be.

    15. JH

      Th- the second one's yes. (laughs)

    16. SG

      (laughs)

    17. JH

      (laughs) Uh, I, first of all, I, I don't think anybody, anybody should assu- assume that they're, they're right for the job. And so, so, uh, y- you should be gut-checking on that almost every day. And so-

    18. SG

      To be clear, that wasn't the question, but-

    19. JH

      But I'll, I'm more than happy to answer that.

    20. SG

      Very humble of you. Yeah.

    21. JH

      (laughs) ... answer that question. Um, uh, d- did I ever believe that, that was... it was wrong? No.

    22. SG

      Mm-hmm.

    23. JH

      I, I believe that accelerated computing is the... uh, absolutely the only way to solve problems that are impossible by definition if it's not-

    24. SG

      Mm-hmm. Okay. Axiomatically.

    25. JH

      Right? Mm-hmm.

    26. SG

      Yeah.

    27. JH

      And on the other hand, if you can solve problems that are impossible today, and someday you need that promp-... You need that application to be broad-based-... would accelerated computing be the best approach? The answer is yes. Yeah.

    28. EG

      When do you think the CPU hits its limits? You mentioned that, you know, eventually you think everything will move over, or at least, you know, big chunks of the future will move over. Is that five years away? Ten years away?

    29. JH

      For certain applications, it happened-

    30. EG

      It's already there.

  9. 31:3932:57

    What’s next for NVIDIA’s Hopper

    1. JH

      be generic. You know, every company in the world should not be built like the US military.

    2. EG

      Mm-hmm.

    3. JH

      And in fact, if you look at every company's org chart in the world, they kinda look like the US military.

    4. EG

      Mm-hmm.

    5. JH

      They, there, there's somebody on top and then it comes, you know, comes down and, um, uh, and yet, yet, uh, the number of direct reports of CEOs are very few and, and the direct reports of the people who are just learning how to manage-

    6. EG

      Mm-hmm.

    7. JH

      ... first-level managers, are very large. It's exactly the opposite of how it should probably be architected.

    8. EG

      Mm-hmm.

    9. JH

      You would think that the people that report to the CEO repor- requires no management at all and in fact, it's generally true. My direct reports are sophi- sophisticated, they're really talented, they're incredibly good at their job, they're excellent lead- leaders.

    10. EG

      Mm-hmm.

    11. JH

      They have great business acumen, they have excellent vision. They're incredible.

    12. EG

      Mm-hmm.

    13. JH

      Every single one of them. And, and, uh, why-

    14. SG

      I guess that, that means, uh, you have more than like, you know, the sort of management book accepted, like, six or seven or whatever.

    15. JH

      Yeah, I have 40-somewhat direct reports. And no one-on-ones, um, uh, no career coaching.

    16. EG

      Mm-hmm.

    17. JH

      You know?

    18. SG

      (laughs)

    19. JH

      (laughs)

    20. EG

      (laughs)

    21. JH

      "So, what would you like to do with your life?"

  10. 32:5735:08

    Durability of Transformers

    1. JH

    2. EG

      Yeah.

    3. JH

      Tho- those are conversations you have with, with new college grads and, and early career, and we love those conversations, of course, in, in helping them shape their career and mentor them, give them access to, to, uh, new experiences. Um, but, eh, but at the executive staff level, um, we're organized so that we can, we can pursue a whole lot of different things at the same time. However, one of the most important things about, about a software company is you have to understand, um, computer architecture and one of the most important thing about computer architecture is you can only afford one.

    4. EG

      Mm-hmm.

    5. JH

      Just as, just as, um, uh, some of the largest companies in the world only have two operating systems, you know, the single largest company on the planet only has two. How is it possible that, that, that, um, uh, you know, so many companies have so many different computer architectures and they have, uh, seven or eight or nine instruction sets that they're keeping around? We have one instruction set. We have one computer architecture and we're super disciplined about that. And so where we need to be focused, we are. Where we, uh, allow for, for, um, uh, innovation and discovery, uh, at the senior level, uh, we allow that. And so, so I think the company's tapered, uh, and organized in a way that is consistent with the nature of our work, you know? So, that's the most important thing and, and that's probably the, the takeaway for, for what I've learned, um, uh, building our company is, there is no one generic architecture for every company. It should, it should fit, uh, the function of the company, its purpose, and then, of course, the leadership style of the, of the leaders.

    6. EG

      Mm-hmm. Yeah, I think that's a really important, um, note that most people don't really realize is that a company should almost be a bespoke structure supporting the CEO and their staff, and the way the company... What the company's delivering to customers versus it's always the same thing, and I think-

    7. JH

      Exactly.

    8. EG

      ... that gets lost a lot.

    9. JH

      Exactly.

    10. EG

      Yeah.

    11. JH

      Yeah. There's some particular, um, chief that, that you need and a chief that, that you need-

    12. EG

      Mm-hmm.

    13. JH

      ... and a chief this, that, you know, there's some chiefs that you do need.

    14. EG

      Yeah.

    15. JH

      And, um, but aside from that, uh, you should start from first principles and architect something that, that makes sense for the, for the, the leader and, as well as the, the, uh,

  11. 35:0852:11

    What Jensen is excited about in the future of AI & his advice for founders

    1. JH

      the function.

    2. EG

      Yeah.

    3. JH

      Yeah.

    4. EG

      Yeah. When I, when I was at Google, they had the famous 80/20/10.

    5. JH

      Mm-hmm.

    6. EG

      Where it was like 80% is core, 20% is, like, core adjacent/new stuff, and then 10% was hyper-experimental.

    7. JH

      Mm-hmm.

    8. EG

      Do you have any frameworks or ways to think about that stuff or it's just kind of like let's see what organically is used in terms of this generic platform that we've built with CUDA and other things that are, you know, built in to help support a lot of use cases, and as they emerge, we, we say, "Okay, let's go support that new thing?"

    9. JH

      Our str- yeah, I don't, I don't, h- I, I don't have any wise framework like that. Um, uh, there, there, there are a couple of things that, that our company has shaped, um, and structured to do. There's one part, uh, a very large part of our company is designed to, uh, build very, very complicated computers perfectly.

    10. EG

      Mm-hmm.

    11. JH

      And so that's, that is, um, uh, one of its missions. Okay? And, and, uh, that kind of architecture, that kinda organization, uh, i- is a, is a, um, uh, invention and refinement organization. And then we have, we have, um, uh, a, a whole bunch of, of, um, uh, skunk works, if you will. And the reason for that is because we're trying to invent things 10 years out that we're not exactly sure whether it's gonna work or not and, and there's a lot of adaptation, a lot of pivoting, and, um, and so, so y- you know, our company actually has, has two different ways of working. One of them is rather organic, shape-shifting all the time. If a particular investment's not working out, we give up on it, move the resources somewhere else-And so that's the agile part of the company, and then there's a part of the company that's not rigid, but it's really refined.

    12. EG

      Mm-hmm.

    13. JH

      And so these two, these two systems have to work side-by-side.

    14. SG

      Can you talk a little bit about the H100?

    15. JH

      Mm.

    16. SG

      Next workhorse? And, like, what, um, what the most important innovations are and, like, what the, like, design and ship process for that looks like?

    17. JH

      I would say the, the big breakthrough for Hopper is, uh, recognizing that quantization, the numerical quantization, the numerical formats, um, has, has a fair amount of, uh, innovation and, um, uh, uh, ability to, to reduce, uh, because it's, it's statistical in the first place. And now the question is, what kind of models could be created and trained and, and, um, uh, we believe that 8-bit floating point, uh, i- i- rather than, if you look at scientific computing today, 64-bit floating point. And so j- just by, uh, breaking up 64 into eight, you could increase the performance of an AI supercomputer just by a factor of eight, by not doing 64-bit. And so, so that's, that's almost a factor of, uh, if you will, a factor of 10 almost, in, in just a couple of generations, just by recognizing that 64-bit floating point wasn't necessary.

    18. SG

      Mm-hmm.

    19. JH

      And so, the, one of the big things is that. The second thing is, is Transformer. The Transformer engine is so universal and so useful that it's possible for us to design a, um, pipeline that is shaped for, uh, learning and, and, uh, inferencing Transformers. And so those are probably the two biggest things. Otherwise, it's the largest chip the world's ever made, it's, you know, the fastest chip the world's ever made, and, you know, super energy efficient and uses the fast memories that the world's ever made, and then we connect a whole bunch of these things together so that it's, it's fast and energy efficient. But those are all, you know, kind of brute force-y things, but the archi- the big architecture idea is, uh, FP8 and Floe- and Transformer engine.

    20. SG

      And when you think about then, so that's the, you know, big project refinement part of the company.

    21. JH

      Mm-hmm.

    22. SG

      We think about the more agile piece, like, what's the impossible application you're working on today that's 10 years out you think is likely to be important?

    23. JH

      M- tu-

    24. SG

      I'm sure there are a ton of them, but...

    25. JH

      The- th- there's a whole bunch we're working on that, that don't work at the moment-

    26. SG

      (laughs) .

    27. JH

      ... um, but, but I've got a lot of confidence it will work, okay? So for example, uh, uh, you know, autonomous driving is still, uh, making progress, uh, but I have every confidence that it will work. Um, uh, I have every confidence that a robotic foundation model will be discovered, and that, that, um, uh, through, uh, through, uh, expressing yourself, uh, using human language, um, uh, you could, you could, uh, cause a, um, uh, mechatronic system of almost different types of limbs and, you know, agility to be able to, uh, figure out how to bend itself, articulate itself, uh, to do a particular task. And, and, um-

    28. SG

      What do you think the blockers are to that today?

    29. JH

      Uh, I have no idea. Uh, I have no idea. Um, but I, I can't tell you. I'm just (laughs) -

    30. EG

      (laughs)

Episode duration: 52:11

Install uListen for AI-powered chat & search across the full episode — Get Full Transcript

Transcript of episode ZFtW3g1dbUU

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome