This video isn’t embeddableWatch on YouTube →

Stanford Online

Stanford CS153 Frontier Systems | Jensen Huang from NVIDIA on the Compute Behind Intelligence

For more information about Stanford's online Artificial Intelligence programs, visit: https://stanford.io/ai Follow along with the course schedule and syllabus, visit: https://cs153.stanford.edu/ In a CS153 Frontier Systems lecture, the class hosts Jensen Huang, CEO of NVIDIA, who argues computing is being reinvented for the first time in 64 years as software shifts from prerecorded execution to real-time generation, with NVIDIA's extreme co-design across chips, compilers, networks, and systems delivering a million-fold speedup over the past decade versus Moore's Law's 100x. He walks through the architectural logic of Hopper (pre-training), Grace Blackwell NVLink72 (inference and decode), Vera Rubin (agents), and the upcoming Feynman generation built for swarms of agents and sub-agents, while pushing back on MFU as a misleading metric in favor of tokens-per-watt and real evals. Huang also defends open models like Nemotron, BioNemo, and Alpamayo as essential for safety, transparency, and democratizing AI across underserved languages and scientific domains, and forecasts compute energy demand growing roughly a thousandfold, making this the strongest market moment in history to invest in sustainable energy and grid upgrades. Guest Speaker: Jensen Huang founded NVIDIA in 1993 and has served since its inception as president, chief executive officer, and a member of the board of directors. Since its founding, NVIDIA has pioneered accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI. NVIDIA is now driving the platform shift of accelerated computing and generative AI, transforming the world's largest industries and profoundly impacting society. Huang has been elected to the National Academy of Engineering and in 2026 was appointed to the President’s Council of Advisors on Science and Technology. He is a recipient of the Semiconductor Industry Association’s highest honor, the Robert N. Noyce Award; the IEEE Founder’s Medal; the Dr. Morris Chang Exemplary Leadership Award; and honorary doctorate degrees from Taiwan’s National Chiao Tung University, National Taiwan University, Oregon State University, Huazhong University of Science and Technology, and Linköping University. He has been named the world’s best CEO by Fortune, the Economist, and Brand Finance, as well as one of TIME magazine’s 100 most influential people. Prior to founding NVIDIA, Huang worked at LSI Logic and Advanced Micro Devices. He holds a BSEE degree from Oregon State University and an MSEE degree from Stanford University. Follow the playlist: https://youtube.com/playlist?list=PLoROMvodv4rN447WKQ5oz_YdYbS74M5IA&si=DOJ5amlyRdyMJBhG

Jensen Huangguest

May 13, 20261h 8mWatch on YouTube ↗

WHAT IT’S REALLY ABOUT

Jensen Huang explains AI-era co-design, agents, and compute bottlenecks ahead

Huang argues computing is being reinvented for the first time since the IBM System/360 era as systems shift from pre-recorded software to continuously running, generative, context-aware AI and agents.
He frames “co-design” as optimizing algorithms, compilers/frameworks, and hardware (CPU/GPU/network/storage) together, claiming this approach produced performance leaps far beyond Moore’s Law for AI workloads.
He advocates integrating AI into education both as subject matter and as a learning tool, while still grounding students in enduring first principles that don’t change as quickly as the frontier.
On open source, he supports using best-in-class closed models for productivity but argues open/transparent models are essential to democratize domain foundation models and to make AI safety and security defensible.
He challenges narratives equating GPUs with weapons or AI with instant singularity, and contends compute scarcity at universities is primarily an organizational/budgeting problem requiring centralized, large-scale shared infrastructure investment.

IDEAS WORTH REMEMBERING

5 ideas

AI changes the computer at every layer, not just the app layer.

Huang describes a shift from pre-recorded, on-demand computing to generated, context-aware, continuously running agentic systems, forcing rethinks in software development, systems architecture, cloud services, and organizational workflows.

Co-design beats isolated optimization by aligning the whole stack to the workload.

Using RISC as an analogy (compiler + ISA harmony), he argues AI-era performance comes from jointly optimizing algorithms, frameworks/compilers, hardware architecture, networking, and storage rather than treating them as separate disciplines.

Workload-relevant metrics matter more than headline FLOPS or MFU.

He calls MFU (model FLOPs utilization) potentially misleading because real bottlenecks shift among compute, memory bandwidth/capacity, and network; he prefers outcome metrics like tokens-per-watt tied to user-perceived performance.

Inference, especially decoding, is a bandwidth problem that reshapes system design.

He explains why NVIDIA built rack-scale NVLink72 systems: decoding token generation demands aggregate memory bandwidth beyond a single chip, enabling large gains even when FLOPS utilization looks “low.”

Each GPU generation is aimed at the next dominant compute pattern (training → inference → agents).

He positions Hopper around pre-training, Grace Blackwell around inference/decoding at rack scale, Vera Rubin around agent workflows (tool use, low-latency CPU needs, storage-to-fabric integration), and hints Feynman will target multi-agent swarms.

WORDS WORTH SAVING

5 quotes

This is a great time to be in computer science, and obviously the reason is because computing is being reinvented for the first time as dramatically as, as it is for the first time really in about 60-plus years.

— Jensen Huang

In the case of NVIDIA and co-design, we got 1 million X over 10 years—1 million X.

— Jensen Huang

I can't learn anymore without AI.

— Jensen Huang

If you want, if you care to have AI be safe and secure, it has to be open. And the reason for that is you can't defend against a black box, and you can't secure a black box.

— Jensen Huang

You're gonna have abundance of problems. They're gonna come in different types. And you just have to learn how to condition yourself to want to get to a better state, no matter how hard. To get better, no matter how hard. And that's suffering.

— Jensen Huang

AI as a new computing paradigm (generative + continuous)Co-design across the full stackMoore’s Law, Dennard scaling, and performance discontinuitiesAgentic systems and “tokens per watt” vs MFURack-scale systems (NVLink72) and memory bandwidth for decodingOpen models: democratization, safety transparency, domain foundationsCompute access, policy, export controls, and university budgeting

High quality AI-generated summary created from speaker-labeled transcript.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.