Skip to content
AcquiredAcquired

Nvidia Part III: The Dawn of the AI Era (2022-2023) (Audio)

It’s a(nother) new era for Nvidia. We thought we’d closed the Acquired book on Nvidia back in April 2022. The story was all wrapped up: Jensen & crew had set out on an amazing journey to accelerate the world’s computing workloads. Along the way they’d discovered a wondrous opportunity (machine learning powered social media feed recommendations). They forged incredible Power in the CUDA platform, and used it to triumph over seemingly insurmountable adversity — the stock market penalty-box. But, it turned out that was only the precursor to an even wilder journey. Over the past 18 months Nvidia has weathered one of the steepest stock crashes in history ($500B+ market cap wiped away peak-to-trough!). And, it has of course also experienced an even more fantastical rise — becoming the platform that’s powering the emergence of perhaps a new form of intelligence itself… and in the process becoming a trillion-dollar company. Today we tell another chapter in the amazing Nvidia saga: the dawn of the AI era. Tune in! Sponsors: Thanks to our fantastic partners, any member of the Acquired community can now get: Scalable, clean and low-cost cloud AI compute from Crusoe, and listen to our recent ACQ2 interview with CEO Chase Lochmiller https://bit.ly/acquiredcrusoe https://bit.ly/CrusoeACQ2 Your product growth powered by Statsig https://bit.ly/statsigacquired Free access to our episode research on Blinkist, plus our favorite books on Ben & David’s Bookshelf https://bit.ly/BlinkistNvidia https://bit.ly/BlinkistBookshelf More Acquired!: Get email updates with hints on next episode and follow-ups from recent episodes https://www.acquired.fm/email Join the Slack http://acquired.fm/slack Subscribe to ACQ2 https://pod.link/acquiredlp Become an LP and support the show. Help us pick episodes, Zoom calls and more https://acquired.fm/lp ACQ hats are back in stock in the ACQ Merch Store! https://www.acquired.fm/store Links: Asianometry on AI Hardware https://youtu.be/5tmGKTNW8DQ?si=m4PJpgnrERddk99E Episode sources https://docs.google.com/document/d/1d5vDY0pFcLGeZEg9t0vNMVnMvFTwMwcv9QhQHca4Aus/edit?usp=sharing Carve Outs: Alias https://www.imdb.com/title/tt0285333/ Moana https://movies.disney.com/moana Note: Acquired hosts and guests may hold assets discussed in this episode. This podcast is not investment advice, and is intended for informational and entertainment purposes only. You should do your own research and make your own independent decisions when considering any financial transactions. © Copyright ACQ, LLC

Ben GilberthostDavid Rosenthalhost
Sep 6, 20232h 54mWatch on YouTube ↗

CHAPTERS

  1. 0:00 – 1:30

    Nvidia Part III setup: why an extra episode was necessary

    Ben and David set the context: the last 18 months reshaped Nvidia’s story and made “AI era” the central narrative. They contrast the 2022 tech downturn with the late-2022 inflection when LLMs became mainstream with ChatGPT.

    • April 2022 episodes never said “generative”—how fast the landscape changed
    • 2022 macro + crypto crash hit tech and Nvidia (inventory write-off)
    • Fall 2022: LLMs break into public consciousness
    • This episode will go deep from silicon-level constraints to Nvidia’s platform advantage
  2. 1:30 – 4:36

    From bleak 2022 to AI’s ‘Netscape/iPhone moment’

    They describe the precise timing of the AI breakthrough: just as markets were at their bleakest, LLMs became useful and viral. The chapter frames ChatGPT’s launch as a watershed consumer moment and a catalyst for massive compute demand.

    • ChatGPT’s growth to 100M users signals a new platform shift
    • Microsoft, Google, and others rapidly follow with competing launches
    • Jensen’s framing: AI’s Netscape moment, potentially even iPhone-like
    • Sets up the question: why did this revolution run on Nvidia?
  3. 4:36 – 7:59

    Revisiting Nvidia’s old $1T TAM—and the accidental prophecy

    David revisits Nvidia’s 2021 slide claiming a $1T TAM via capturing 1% of $100T of industry, which they previously critiqued as overly top-down. Ironically, the trillion-dollar outcome may be arriving through a different path: the digital world and a new foundational compute layer.

    • Prior skepticism: robotics/autonomy/Omniverse felt speculative and slow-moving
    • Key question floated in Part II: could a new digital foundational layer drive Nvidia?
    • Most physical-world TAM items haven’t arrived—yet revenue growth did
    • Sets up the historical rewind to how AI compute demand was born
  4. 7:59 – 10:30

    2012’s AlexNet: the Big Bang for GPU-accelerated AI

    They rewind to the ImageNet competition and explain why AlexNet was a step-change breakthrough. The pivotal detail: convolutional neural nets became practical when trained on consumer Nvidia GPUs using CUDA.

    • ImageNet’s 14M labeled images and Mechanical Turk labeling scale
    • AlexNet slashed error rate dramatically versus incremental prior progress
    • CNNs existed since the 1960s but were too compute-intensive on CPUs
    • Two GTX 580s + CUDA unlocked feasible training on cheap hardware
  5. 10:30 – 12:39

    Why GPUs win: parallelism as a lever on Moore’s Law

    The hosts explain the architectural reason GPUs transformed ML: massive parallel execution compared to CPU sequential processing. They tie Nvidia’s original graphics parallelism to broader accelerated-computing workloads like linear algebra and AI.

    • CPUs execute few instructions at a time; GPUs execute thousands in parallel
    • Accelerated computing ‘leverages’ transistor gains by huge multipliers
    • Graphics proved parallel compute’s value before AI discovered it
    • AI/crypto/linear algebra became the second frontier for GPU acceleration
  6. 12:39 – 26:02

    The people behind the breakthrough—and the path to OpenAI

    They follow the AlexNet team’s careers: Hinton, Krizhevsky, and Ilya Sutskever. After being acquired by Google, Ilya later becomes the key researcher to leave for OpenAI, seeded by a 2015 Musk/Altman dinner to break the Google/Facebook AI research lockup.

    • Jeff Hinton’s lineage trivia: descendant of George & Mary Boole
    • Google acquires the AlexNet startup and consolidates top researchers
    • Google Brain + DeepMind and Facebook’s Yann LeCun create an AI duopoly
    • 2015 dinner: most won’t leave—except Ilya, who co-founds OpenAI
  7. 26:02 – 30:42

    Pre-transformer ambitions: early language-model intuition and next-word prediction

    They highlight that the idea of training chatbots via next-word prediction predated transformers. Karpathy’s 2015-2016 commentary anticipates today’s approach: language structure emerges from large unlabeled text, not manual labeling.

    • RNN-era state-of-the-art still limited by short context and sequential training
    • Karpathy’s ‘Unreasonable Effectiveness’ and early chatbot vision
    • Shift from labeled datasets to learning patterns from raw text
    • Ilya’s detective-novel analogy: next-word prediction implies understanding
  8. 30:42 – 44:10

    2017 Transformer paper: attention, context windows, and GPU-friendly parallelism

    They explain the core of “Attention Is All You Need” and why it changed everything. Attention is computationally expensive (O(n²)), but critically parallelizable—making it a perfect fit for GPUs and enabling much larger sequence models.

    • Attention lets the model weigh the whole input while generating output
    • Positional encoding preserves word order while attending broadly
    • Quadratic compute cost with context length, but comparisons parallelize well
    • Transformers replace sequential RNN/LSTM training with scalable parallel training
  9. 44:10 – 51:24

    OpenAI’s pivot and capitalization: from nonprofit lab to Microsoft-backed compute engine

    OpenAI initially pursued scattered narrow research; transformers and scale changed the cost curve. Elon exits in 2018; OpenAI restructures in 2019 to raise capital, leading to major Microsoft investments and cloud exclusivity that later fuel ChatGPT’s rise.

    • Transformers open ‘endless improvement’ via more data/compute—expensive for a nonprofit
    • 2019: creates capped-profit for-profit entity to fund compute needs
    • Microsoft invests $1B, later $2B and $10B; exclusive cloud relationship forms
    • GPT-3 → Copilot → ChatGPT → GPT-4 sequence drives demand for GPU clusters
  10. 51:24 – 53:28

    Why Nvidia was uniquely prepared: re-architecting the data center into ‘the computer’

    They connect the generative AI opportunity to Nvidia’s years-long preparation to replace CPU-centric x86 data centers with GPU-accelerated platforms. The central thesis emerges: AI is the workload that finally forces broad GPU adoption in data centers.

    • Three ingredients: generative AI traction + massive GPU training needs + cloud distribution
    • Nvidia’s long bet: GPU platform for the data center, not just niche accelerators
    • AI becomes the clear driver of data center GPU adoption
    • Sets up the deep technical detour into architecture and memory constraints
  11. 53:28 – 1:00:49

    Computer architecture detour: von Neumann bottleneck and why memory/networking now dominates

    Ben explains the classic von Neumann model and its bottleneck: compute often waits on memory loads/stores. For modern LLMs, the limiting factor shifts to on-chip memory and interconnects, forcing multi-GPU, multi-rack systems to behave like one machine.

    • Load-load-add-store illustrates memory traffic dominating CPU cycles
    • von Neumann bottleneck worsens as CPUs outpace memory bandwidth
    • LLMs require massive memory footprints (hundreds of GB)
    • Reticle limits and packaging constraints push scaling toward networking GPUs together
  12. 1:00:49 – 1:11:52

    Nvidia’s data center ‘three-legged stool’: Mellanox/InfiniBand, Grace CPU, Hopper + CoWoS

    David lays out the three pillars that make Nvidia’s full-stack data center solution: Mellanox networking, a purpose-built CPU to orchestrate GPU clusters, and a separate GPU architecture optimized for AI with advanced packaging and HBM proximity.

    • 2020 Mellanox acquisition brings InfiniBand high-bandwidth, low-latency fabric
    • Grace CPU (2022) positions Nvidia to deliver a fully integrated system
    • Hopper architecture splits from consumer Lovelace; built for AI workloads
    • CoWoS/2.5D packaging + HBM proximity becomes a key capacity bottleneck at TSMC
  13. 1:11:52 – 1:27:40

    Productization and monetization: H100 economics, DGX systems, SuperPODs, and DGX Cloud

    They explain how Nvidia sells chips and integrated systems across hyperscalers, GPU clouds, and enterprises—and why integration expands margins. DGX Cloud extends this by renting Nvidia-managed DGX experiences hosted inside partner clouds, giving Nvidia more direct customer relationships.

    • H100 ~$40K each; DGX H100 boxes start around ~$500K
    • Hyperscalers buy chips; enterprises often buy turnkey DGX systems
    • SuperPOD/GH200 ‘AI wall’ positions Nvidia as mainframe-like systems vendor
    • DGX Cloud ($37K/month entry) creates high utilization economics and direct enterprise touchpoints
  14. 1:27:40 – 1:37:01

    2023 financial shockwave and the new $1T narrative: data center CapEx as the real TAM

    They cover Nvidia’s historic 2023 earnings acceleration, driven overwhelmingly by the data center segment. Jensen reframes the $1T opportunity as the installed base and annual refresh spend of global data centers, hinging on whether AI delivers enduring user value.

    • Q1 guide shock and Q2 results: data center revenue surges to ~$10.3B in a quarter
    • Growth at massive scale: +141% QoQ in data center
    • Jensen’s updated TAM: $1T of data center assets, $250B annual spend
    • Core bet: GPT-like interfaces create durable value and justify sustained CapEx shifts
  15. 1:37:01 – 2:54:09

    Moats and competitive dynamics: CUDA as platform, TSMC capacity as cornered resource, and the bull/bear debate

    In analysis, they argue Nvidia is best understood as a platform (Microsoft/IBM-like), not a cyclical hardware vendor (Cisco/Intel-like). CUDA’s ecosystem, data center switching costs, and packaging/networking advantages create defensibility—while hyperscalers, open source, and shifting workloads form the core risks.

    • CUDA stack: compiler/runtime/tools/libraries + 4M developers; massive accumulated investment
    • Data center switching costs lock architectures for years; ‘nobody fired for buying Nvidia’
    • Cornered resource: leading-edge packaging (CoWoS) capacity and supply constraints
    • Bear: hyperscaler custom silicon + open ecosystems (PyTorch), AI hype cycle, inference commoditization, China controls; Bull: accelerated computing expansion, AI added to everything, Nvidia speed/culture and integrated systems

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.