Skip to content
Stanford OnlineStanford Online

Stanford CS153 Frontier Systems | Jensen Huang from NVIDIA on the Compute Behind Intelligence

For more information about Stanford's online Artificial Intelligence programs, visit: https://stanford.io/ai Follow along with the course schedule and syllabus, visit: https://cs153.stanford.edu/ In a CS153 Frontier Systems lecture, the class hosts Jensen Huang, CEO of NVIDIA, who argues computing is being reinvented for the first time in 64 years as software shifts from prerecorded execution to real-time generation, with NVIDIA's extreme co-design across chips, compilers, networks, and systems delivering a million-fold speedup over the past decade versus Moore's Law's 100x. He walks through the architectural logic of Hopper (pre-training), Grace Blackwell NVLink72 (inference and decode), Vera Rubin (agents), and the upcoming Feynman generation built for swarms of agents and sub-agents, while pushing back on MFU as a misleading metric in favor of tokens-per-watt and real evals. Huang also defends open models like Nemotron, BioNemo, and Alpamayo as essential for safety, transparency, and democratizing AI across underserved languages and scientific domains, and forecasts compute energy demand growing roughly a thousandfold, making this the strongest market moment in history to invest in sustainable energy and grid upgrades. Guest Speaker: Jensen Huang founded NVIDIA in 1993 and has served since its inception as president, chief executive officer, and a member of the board of directors. Since its founding, NVIDIA has pioneered accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, and ignited the era of modern AI. NVIDIA is now driving the platform shift of accelerated computing and generative AI, transforming the world's largest industries and profoundly impacting society. Huang has been elected to the National Academy of Engineering and in 2026 was appointed to the President’s Council of Advisors on Science and Technology. He is a recipient of the Semiconductor Industry Association’s highest honor, the Robert N. Noyce Award; the IEEE Founder’s Medal; the Dr. Morris Chang Exemplary Leadership Award; and honorary doctorate degrees from Taiwan’s National Chiao Tung University, National Taiwan University, Oregon State University, Huazhong University of Science and Technology, and Linköping University. He has been named the world’s best CEO by Fortune, the Economist, and Brand Finance, as well as one of TIME magazine’s 100 most influential people. Prior to founding NVIDIA, Huang worked at LSI Logic and Advanced Micro Devices. He holds a BSEE degree from Oregon State University and an MSEE degree from Stanford University. Follow the playlist: https://youtube.com/playlist?list=PLoROMvodv4rN447WKQ5oz_YdYbS74M5IA&si=DOJ5amlyRdyMJBhG

Jensen Huangguest
May 13, 20261h 8mWatch on YouTube ↗

CHAPTERS

  1. Why computing is being reinvented: from pre-recorded to generative and continuous

    Huang frames the moment as the biggest computing-model shift since IBM System/360, arguing that AI changes not just applications but the entire software and systems stack. He contrasts “pre-recorded” computing with real-time generated outputs that are context-aware and intention-driven.

  2. From GPT to agentic systems: what comes after generative AI

    He explains why GPT-style models made “thinking” and tool use an obvious next step, and why agentic systems change the economics and design assumptions of computing. The key transition is from on-demand usage to continuously running systems.

  3. What “co-design” really means (and why it outperforms siloed optimization)

    Huang defines co-design as simultaneous optimization across algorithms, compilers/frameworks, chip architecture, and full systems. He uses RISC as a Stanford-rooted example of compiler–architecture harmony, then generalizes it to modern accelerated computing.

  4. Co-design at NVIDIA: beyond Moore’s Law to 100,000x–1,000,000x gains

    He claims extreme co-design enabled NVIDIA to far outpace classic scaling expectations, reframing what becomes feasible for AI research and product development. The punchline: massive performance leaps create a sense of “compute abundance” that changes what problems people attempt.

  5. How education should evolve: AI as both subject and learning tool

    Huang argues curricula must integrate AI not only as content but as an everyday research and learning assistant. He highlights the mismatch between slow textbook cycles and fast-moving AI knowledge, while defending timeless first principles.

  6. Open source at the frontier: why NVIDIA builds open models despite using closed tools

    He distinguishes using best-in-class proprietary models for productivity from building open models to seed ecosystems and new scientific domains. Open models, in his view, help democratize access and enable domain-specific foundation models where market incentives are weak.

  7. Safety, security, and transparency: why “you can’t defend a black box”

    Huang argues open/transparent systems are essential for AI security and robust defense. He suggests defending against powerful attackers requires swarms of cheap specialized AIs rather than an arms race of ever-larger proprietary models.

  8. Coalition scaling, compute bottlenecks, and why MFU can mislead

    The discussion turns to utilization and scarcity: Huang critiques MFU (model FLOPs utilization) as an incomplete metric because bottlenecks shift across memory, bandwidth, and networking. He pushes for performance tied to real evals—like tokens-per-watt—rather than chasing a single utilization number.

  9. Designing platforms for many evals: balancing specialization vs generality

    Huang explains the core product challenge: customers optimize for different tasks and metrics, so the platform must serve many domains without becoming bland general-purpose hardware. He calls the balance “artistry” involving vision, iteration, and strategic trade-offs.

  10. Architecture roadmap: Hopper → Grace Blackwell NVLink72 → Vera Rubin → (future) Feynman

    He walks through NVIDIA’s generational design logic as compute patterns evolve: pre-training (Hopper), inference/decode bandwidth at rack scale (Grace Blackwell NVLink72), and agentic workloads (Vera Rubin) with tool use, low-latency CPUs, and fabric-attached storage. Feynman is positioned as a likely next step for swarms of agents and sub-agents.

  11. Energy and infrastructure: tokens-per-watt, grid upgrades, and sustainable power

    Huang frames energy as the next constraint and emphasizes efficiency as the controllable lever, alongside ecosystem mobilization and investment in generation and grids. He predicts compute energy needs could rise by ~1000x (or more), driven by continuous generative computing.

  12. Career advice: resilience, suffering, and doing hard things well

    He challenges the “only do what you love” career heuristic, arguing many people don’t yet know their passions and that competence is built through struggle. He reframes suffering as disciplined effort through unpleasant tasks, producing resilience needed for leadership and adversity.

  13. Geopolitics and chip access: rejecting the ‘atomic bomb’ analogy and defending competition

    Huang argues GPUs are general-purpose and widely used, so comparisons to weapons are flawed. He warns that policies conceding large global markets could hollow out U.S. tech leadership, and criticizes “sudden singularity” rhetoric as irresponsible fear-mongering.

  14. Compute scarcity at universities: ‘place orders’ vs structural budgeting, and the case for campus supercomputers

    Pressed on why startups/universities lack compute, Huang insists supply isn’t withheld—institutions must plan and fund at the necessary scale. He argues universities’ decentralized grant structures prevent pooling resources for billion-dollar-class shared AI infrastructure.

  15. Being CEO: best/worst parts, early NVIDIA mistakes, and strategic learning loops

    Huang contrasts the creative joy of vision/strategy/execution with the vulnerability and fear of responsibility during near-failure periods. He recounts early technical wrong turns that forced strategic adaptation, plus a later strategic misstep in mobile—then repurposing that expertise into robotics.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.