Skip to content
The Twenty Minute VCThe Twenty Minute VC

Cerebras CEO on the Future of Data Centres, Token Costs & Memory | Should US Companies Sell to China

Andrew Feldman is the co-founder and CEO of Cerebras Systems. This month, Cerebras went public achieving a market cap of $70BN, the largest semiconductor IPO in history. Cerebras has a massive commercial backlog with a monumental, multi-year $20 billion compute agreement from OpenAI. ---------------------------------------------- In Today’s Episode We Discuss: 00:00 Intro 02:18 Is There an AI Infrastructure Bubble? 07:35 Memory Shortages Will Last Years 09:35 2025: The Year AI Became Actually Useful 11:33 Will Frontier Models Commoditize Like Cloud Did? 16:34 Can Google Win by Owning the Full Stack From TPUs to Tokens? 32:53 Data Centers & Local Communities 33:10 AI Layoffs 38:15 The Real Blocker to Enterprise AI Adoption 44:04 Should the US Be Selling Chips to China? 47:00 Why Europe Can't Build Great Tech Companies 53:48 Timing the Cerebras IPO: Luck or Strategy? 57:10 Is the Trump Administration Better for Business? 58:53 Quick-Fire Round ----------------------------------------------- Subscribe on Spotify: https://open.spotify.com/show/3j2KMcZTtgTNBKwtZBMHvl?si=85bc9196860e4466 Subscribe on Apple Podcasts: https://podcasts.apple.com/us/podcast/the-twenty-minute-vc-20vc-venture-capital-startup/id958230465 Follow Harry Stebbings on X: https://twitter.com/HarryStebbings Follow Andrew Feldman on X: https://twitter.com/andrewdfeldman Follow 20VC on Instagram: https://www.instagram.com/20vchq Follow 20VC on TikTok: https://www.tiktok.com/@20vc_tok Visit our Website: https://www.20vc.com Subscribe to our Newsletter: https://www.thetwentyminutevc.com/contact ----------------------------------------------- #20vc #harrystebbings #andrewfeldman #cerebras #ceo #founder #ai #nvidia #chips #china #ailayoffs #ipo

Andrew FeldmanguestHarry Stebbingshost
May 26, 20261h 7mWatch on YouTube ↗

CHAPTERS

  1. Cerebras’ IPO moment and the bigger agenda: chips, geopolitics, and energy

    Harry sets the stage with Cerebras’ blockbuster public debut and frames the conversation as a forward-looking exploration of AI infrastructure, compute economics, and US–China dynamics. Feldman positions the discussion around where constraints and leverage points really are in the AI stack.

  2. AI infrastructure bubble—or infrastructure chasing demand?

    Feldman rejects the “AI infra bubble” analogy by contrasting today with rail/fiber buildouts where supply led demand. He argues the defining feature now is the opposite: demand is already here and infrastructure is lagging, creating persistent backlogs across the ecosystem.

  3. Why data-center “metering” might actually stabilize the market

    They discuss permitting and build delays as a form of ‘metering’ that can smooth adoption and prevent overbuild whiplash. Feldman highlights OpenAI’s early recognition of exponential compute demand as a competitive advantage, while noting not all compute deals are equal.

  4. Memory (HBM) as the chokepoint—and why it won’t clear quickly

    Feldman explains that explosive demand is stressing every part of the supply chain, with HBM memory a major bottleneck due to limited suppliers. Because fab capacity is added in huge, slow ‘step functions,’ he expects shortages and elevated pricing to persist for years if demand remains strong.

  5. 2025 as the inflection: inference demand explodes when AI becomes truly useful

    Feldman argues that around 2025 models crossed a threshold from novelty to daily utility, shifting the center of gravity from training to inference usage at massive scale. He attributes sustained demand growth to AI adoption spreading across demographics and problem types.

  6. Will frontier models commoditize like cloud—or segment like every other market?

    The discussion shifts to whether AI model providers will become utilities. Feldman emphasizes market segmentation: hyperscalers win where security, software layers, and credibility matter, while ‘cheap compute’ buyers may prefer leaner providers without enterprise overhead.

  7. Token economics and why compute gets cheaper (but speed still dominates)

    Feldman forecasts ongoing reductions in cost per unit compute as architectures improve, even amid near-term supply constraints. He argues speed has compounding value—slow inference has ‘zero market’—and positions performance gains as decisive in competitive workflows like coding and agents.

  8. Full-stack control: can Google become the lowest-cost token producer?

    They examine the thesis that owning everything from silicon to power procurement makes Google the cheapest token supplier. Feldman notes the countervailing risk: if you only sell chips to yourself, you may lose volume benefits and constrain economies of scale—though Google is testing ways to broaden reach.

  9. Cerebras’ differentiation: supply-chain advantages and ‘proof by benchmark’ moments

    Feldman highlights how Cerebras avoids key GPU bottlenecks (HBM, CoWoS, oversubscribed nodes), turning industry constraints into opportunity. He also describes the strategic value of public benchmark-style proof points (e.g., Kimi K2 speed claims) to counter skepticism and win trust.

  10. Scaling to massive customers: delivery muscle, concentration concerns, and giga-scale thinking

    Feldman discusses what it takes to serve very large customers and why early wins build operational capability for subsequent deals. The conversation broadens into the changing mindset of infrastructure scale—from megawatts to multi-gigawatts—and whether electricity becomes the ultimate limiting factor.

  11. Data centers vs local communities: delays are normal, but neighbor relations matter

    Feldman downplays schedule slippage as inherent to large construction, but criticizes the industry for poor community engagement. He argues data centers can be clean, job-creating assets if builders are transparent, pay their own way on grid upgrades, and invest locally to earn legitimacy.

  12. AI layoffs, tool spend, and the jobs that will emerge next

    Feldman separates ‘AI-washed’ layoffs from true AI-driven displacement, arguing many cuts reflect earlier overhiring and ongoing automation. He expects software tool spend per engineer to rise dramatically (as in hardware EDA), while new governance roles will emerge as AI becomes core to enterprise operations.

  13. The real enterprise blocker: lawyers, security, and open-source risk—especially from China

    Feldman argues the main constraint on enterprise AI adoption is organizational risk management—legal and security teams incentivized to say ‘no’ when precedent is unclear. Open source intensifies concerns, particularly when leading models originate from Chinese firms, even as cost pressure pushes adoption forward.

  14. Should US firms sell advanced chips to China—and what ‘chokepoints’ matter

    Feldman takes a firm stance against selling leading-edge tech to China, arguing it will be used for military and industrial competition. He acknowledges counterarguments about keeping China in the ecosystem but believes the US and allies retain meaningful choke points via advanced manufacturing and tooling dependencies.

  15. Europe’s innovation gap, Cerebras IPO timing, and leadership lessons from going public

    Feldman argues Europe’s broader pattern—fear, regulation, and slower adoption—reduces breakout tech creation, though pockets thrive at the application layer. He describes Cerebras’ IPO as driven by persistence through regulatory obstacles (including CFIUS), and closes with reflections on leadership pressure, relationships, and empathy from boards and partners.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.