Skip to content
a16za16z

Dylan Patel on the AI Chip Race - NVIDIA, Intel & the US Government vs. China

Nvidia’s $5 billion investment in Intel is one of the biggest surprises in semiconductors in years. Two longtime rivals are now teaming up, and the ripple effects could reshape AI, cloud, and the global chip race. To make sense of it all, Erik Torenberg is joined by Dylan Patel, chief analyst at SemiAnalysis, joins Sarah Wang, general partner at a16z, and Guido Appenzeller, a16z partner and former CTO of Intel’s Data Center and AI business unit. Together, they dig into what the deal means for Nvidia, Intel, AMD, ARM, and Huawei; the state of US-China tech bans; Nvidia’s moat and Jensen Huang’s leadership; and the future of GPUs, mega data centers, and AI infrastructure. Timecodes: 0:00 Introduction 0:29 Nvidia and Intel: Unlikely Allies 2:11 Investment and Capital in Semiconductors 4:27 The Impact on AMD and ARM 5:21 China’s AI Chip Race: Huawei’s Rise 14:01 The HBM Bottleneck and Manufacturing 19:00 Nvidia’s Global Competition: The Huawei Threat 22:32 Jensen’s Next Move: Nvidia’s Strategy 29:44 Nvidia’s Moat: How They Built It 36:15 How Jensen Has Changed Over the Years 39:40 Jensen Huang’s Leadership and Company Culture 46:37 The Future of Nvidia: Cash, Data Centers, and AI Infrastructure 56:11 The Hyperscalers: Amazon, Oracle, and the Cloud Wars 1:03:01 The Era of Mega Data Centers 1:07:40 Hardware Cycles: GB200, Blackwell, and the Next Generation 01:16:03 xAI’s Colossus 2 01:22:06 Recommendations to Start-Ups 1:34:49 The State of the GPU Market Today Resources: Find Dylan on X: https://x.com/dylan522p Find Sarah on X: https://x.com/sarahdingwang Find Guido on X: https://x.com/appenz Learn more about SemiAnalysis: https://semianalysis.com/dylan-patel/ Stay Updated: If you enjoyed this episode, be sure to like, subscribe, and share with your friends! Find a16z on X: https://x.com/a16z Find a16z on LinkedIn: https://www.linkedin.com/company/a16z Listen to the a16z Podcast on Spotify: https://open.spotify.com/show/5bC65RDvs3oxnLyqqvkUYX Listen to the a16z Podcast on Apple Podcasts: https://podcasts.apple.com/us/podcast/a16z-podcast/id842818711 Follow our host: https://x.com/eriktorenberg Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details, please see a16z.com/disclosures.

Dylan PatelguestGuido AppenzellerguestSarah WangguestErik Torenberghost
Sep 22, 20251h 38mWatch on YouTube ↗

CHAPTERS

  1. Nvidia invests in Intel: why the partnership makes sense (and who it hurts)

    The episode opens with the surprising news that Nvidia is investing $5B in Intel and collaborating on custom data center and PC products. The hosts and Dylan unpack why this “unlikely alliance” is strategically rational, potentially great for consumers, and uniquely problematic for competitors like AMD and ARM.

    • Nvidia’s $5B Intel investment framed as confidence-building ahead of Intel’s larger capital raise
    • Intel-Nvidia collaboration could enable compelling x86 + integrated Nvidia graphics/AI PC designs
    • Historical irony: Intel and Nvidia were once adversaries (chipsets/graphics disputes), now cooperating
    • Competitive fallout: AMD faces a tougher landscape; ARM loses a key “anti-Intel” partnership narrative
    • Intel may de-prioritize internal graphics/AI efforts (e.g., Gaudi) if Nvidia becomes the partner path
  2. Semiconductor capital intensity and the role of governments and mega-customers

    The discussion broadens to semiconductor funding mechanics: how Intel needs far more capital than headline investments suggest, and why customer participation (plus government incentives) can change market perception. The panel considers how political pressure and strategic signaling can pull more corporate capital into US chip manufacturing.

    • Intel’s funding gap is still massive (Dylan cites needs on the order of ~$50B)
    • Early investments (Nvidia, SoftBank, US government) are confidence signals more than full solutions
    • Customer buy-in helps Intel approach capital markets on better terms (debt/equity)
    • Speculation about political influence nudging large tech firms to invest
    • “Jensen/Buffett effect”: Jensen’s involvement can move markets and sentiment quickly
  3. China’s AI chip push: Huawei’s trajectory from 7nm leadership to export-control workarounds

    Dylan walks through Huawei’s technical capabilities and the timeline from 2020 onward, arguing Huawei has long been a top-tier systems company. He explains how sanctions forced Huawei to shift manufacturing, stockpile, and use intermediaries—while still accumulating meaningful chip volume.

    • Huawei’s 2020 Ascend entry: early 7nm AI chips and public benchmarks with a narrow gap vs Nvidia
    • Sanctions cut off Huawei from TSMC, driving a pivot to SMIC and supply-chain workarounds
    • Claims Huawei used intermediaries to source millions of chips fabricated at TSMC before the clampdown
    • China’s internal demand vs best-in-class performance: firms like ByteDance still prefer Nvidia if possible
    • Export controls change the market structure more than pure competitiveness does
  4. The H20 ban, China’s domestic alternatives, and the “stockpile-to-ramp” transition risk

    The conversation covers Nvidia’s China revenue exposure and the dynamics created by banning China-specific Nvidia SKUs like H20. Dylan argues China can temporarily rely on prior stockpiles, but the critical question is whether domestic production can ramp fast enough to avoid a gap.

    • H20 restrictions forced Nvidia write-downs; Dylan cites very large implied China revenue impact
    • China promotes Huawei/Cambricon alternatives, but many inputs remain foreign (logic wafers, memory)
    • Key inflection: running down existing inventories vs scaling new domestic supply
    • Potential backtracking: commercial players may pressure regulators if domestic chips can’t meet needs
    • Smuggling/re-export persists but at limited volumes relative to demand
  5. HBM as the chokepoint: equipment imports, yields, and why memory is harder than logic

    Dylan explains why high-bandwidth memory (HBM) remains the hardest bottleneck for Huawei/China despite bold roadmaps. He discusses the specialized equipment needs (notably etch for TSVs), the yield learning curve, and why scaling HBM production takes years of sustained capital and process maturity.

    • HBM requires specialized steps like TSV etch and high-stacking (12H/16H), stressing tooling needs
    • China import data suggests shifts toward etch equipment, consistent with HBM ramp attempts
    • China is behind on HBM3-class manufacturing; sampling is not the same as volume production
    • Even if designs are credible, manufacturing scale + yield ramp determine real competitiveness
    • Global HBM capacity reflects years of capex in Korea/elsewhere—hard to replicate quickly
  6. Huawei’s roadmap hype as strategy: negotiating leverage and “playing chess”

    Guido and Dylan explore whether Huawei’s aggressive announcements are partly aimed at influencing US export policy negotiations. Dylan argues hyping domestic capability can push US stakeholders to loosen restrictions to avoid losing a strategic market—turning public signaling into leverage.

    • Roadmap announcements can be timed to shape export-control boundaries and negotiations
    • “We don’t need Nvidia” messaging can pressure the US to allow more exports to retain market share
    • Domestic lobbying + geopolitical bargaining intersect with technical claims
    • Core question for US policy: sell at/above China’s capability tier, factoring volume and ramp speed
    • Strategic risk: bans may accelerate China’s self-sufficiency over time
  7. If you’re Jensen: framing Huawei as the real threat and the Galapagos China debate

    Asked what Jensen should do next, Dylan argues Nvidia’s best move is to treat Huawei’s competitiveness as real—especially outside the US—and emphasize that manufacturing catch-up is “when, not if.” The discussion introduces the “Galapagos China” concept: isolating China could trap it in a local optimum—or push it to a better global one.

    • Jensen publicly calling Huawei “formidable” is consistent with Huawei’s track record across industries
    • Nvidia’s narrative option: stress near-term constraints (capacity/yields) while acknowledging long-run risk
    • Huawei’s expansion threat could be global (Middle East, SE Asia, Europe, LATAM)
    • “Galapagos China” theory: isolation could limit export success—or drive divergent innovation to a superior path
    • Policy/strategy uncertainty: isolation can create unintended winners
  8. Nvidia’s moat: repeated ‘bet-the-company’ moves and supply-chain aggression

    Dylan details how Nvidia built its moat through risk-taking, rapid execution, and bold capacity commitments—often ordering ahead of confirmed demand. He contrasts Nvidia’s approach with more cautious competitors and highlights how Nvidia repeatedly captured upside in cyclical moments (e.g., crypto, data center ramps).

    • Nvidia repeatedly took existential bets early (capacity orders, major design pivots)
    • Aggressive supply-chain commitments (NCNR) helped capture demand spikes ahead of peers
    • Crypto-era dynamics: Nvidia pushed suppliers to ramp; competitors were more conservative
    • Jensen’s decision style: intuition-driven, willing to accept occasional write-downs for asymmetric upside
    • Strategy prioritizes winning each generation and staying in the game for the next cycle
  9. Execution advantage: first-pass silicon, fast stepping, and hardware-software coordination

    The panel dives into Nvidia’s operational excellence: getting chips right with fewer steppings, managing mask-set risk, and shipping faster than peers. They also highlight the difficulty of keeping software and drivers in lockstep with rapid hardware cadence—yet Nvidia largely succeeds.

    • Semiconductor reality: most designs need revisions; Nvidia often ships at A0/A1 while others slip
    • Mask sets are expensive; Nvidia historically optimized for “works first time” to avoid delays
    • Holding wafers before metal layers to preserve optionality when late tweaks are needed
    • Example: Volta/Tensor Cores allegedly added late—critical to seizing the AI moment
    • Fast silicon success forces strong software readiness (drivers, libraries, infra) to match hardware launch speed
  10. Jensen’s evolution and Nvidia culture: rock-star CEO, loyal lieutenants, and shipping discipline

    Dylan reflects on how Jensen’s public persona and influence have grown, while internal culture remains focused on shipping. He describes long-tenured leaders who enforce pragmatism—cutting features to meet schedules—and a company-wide bias toward execution over perfection.

    • Jensen’s charisma and “CEO as rock star” presence has intensified over the last decade
    • Founder-led memory: willingness to keep betting because survival once depended on it
    • Nvidia has deeply loyal senior operators who balance visionary ambition with ruthless shipping priorities
    • Cultural emphasis: cut features, ship, and iterate—especially crucial in silicon timelines
    • Internal tension between innovation and schedule is managed by strong execution-oriented leadership
  11. What does Nvidia do with all that cash? Infrastructure, power, and customer-neutral investing

    The discussion turns to Nvidia’s future strategy: how to deploy enormous free cash flow without triggering customer backlash or regulatory blocks. Dylan argues Nvidia must be careful “picking winners,” and suggests investing in data centers and power—bottlenecks that expand the market—without competing directly with customers in cloud services.

    • Nvidia’s balance sheet could reach massive scale; big M&A is constrained by regulation (e.g., ARM)
    • Directly picking AI model winners risks alienating Nvidia’s broad customer base
    • Nvidia already nudges the ecosystem (neo-cloud investments, allocations), but in comparatively small checks
    • Best leverage point may be market-enabling investments: power generation, substations, data center build-outs
    • Bottlenecks for growth shift from chips to physical infrastructure (sites, power, cooling, permitting)
  12. Cloud wars and hyperscaler dynamics: AWS re-accelerates, Trainium remains hard, Oracle’s bet

    Dylan explains why AWS stumbled early in the AI shift (scale-up vs scale-out infra) but is poised to re-accelerate due to sheer data center capacity and key customers like Anthropic. He then outlines why Oracle is “winning AI compute” by being hardware-agnostic, balance-sheet strong, and willing to underwrite OpenAI-scale demand.

    • AWS critique: strong for prior era; initially weaker for AI scale-up networking/architecture
    • Re-acceleration thesis: Amazon’s unmatched data center footprint can convert capacity into AI revenue
    • Trainium: powerful but difficult; workable when serving a small set of models with deep low-level optimization
    • Oracle advantage: non-dogmatic hardware/networking, strong cluster software, and willingness to finance growth
    • Oracle’s OpenAI deals: risk is OpenAI’s ability to pay long-term, but Oracle can time GPU purchases close to revenue start
  13. Mega data centers and ‘Colossus 2’: the gigawatt era and Elon’s speed advantage

    The episode highlights the escalating scale of AI infrastructure, shifting from “impressive at 100MW” to “only exciting at gigawatts.” Dylan describes xAI’s rapid Memphis build and the strategic move to leverage regulatory boundaries across states to secure power and keep pace.

    • Industry scale shift: 100K GPUs used to be headline-worthy; now multiple massive clusters exist
    • xAI’s Colossus build-out: rapid timelines, liquid cooling at scale, generators/turbines, creative power sourcing
    • Regulatory arbitrage: siting near state borders to exploit differing power and permitting regimes
    • Social/political friction (local protests) becomes part of infrastructure strategy
    • Operational competence in power + construction emerges as a competitive advantage, not just model quality
  14. Hardware cycle realities: GB200/Blackwell TCO, reliability, and the GPU market’s new ‘tightness’

    Closing out, Dylan explains Blackwell’s economics and operational tradeoffs: GB200 can be compelling on certain workloads, but the reliability and failure-domain “blast radius” of NVL72 changes how customers must run clusters. He ends with a market update: Hopper capacity tightened again as inference demand surged and Blackwell rollouts faced ramp friction.

    • GB200 vs H100/H200: TCO uplift (~1.6x) must be justified by workload-specific performance gains
    • Inference splits (prefill vs decode) and quantization can make Blackwell far more advantageous for some models
    • NVL72 reliability challenges: a single GPU failure impacts a much larger coherent domain than 8-GPU servers
    • Operational workaround: reserve GPUs for spares/low-priority workloads; cloud SLAs adjust accordingly
    • GPU market status: not as dire as 2023, but large-block capacity is hard; inference demand and Blackwell ramp issues tightened supply again

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.