Skip to content
The Twenty Minute VCThe Twenty Minute VC

Cerebras CEO on the Future of Data Centres, Token Costs & Memory | Should US Companies Sell to China

Andrew Feldman is the co-founder and CEO of Cerebras Systems. This month, Cerebras went public achieving a market cap of $70BN, the largest semiconductor IPO in history. Cerebras has a massive commercial backlog with a monumental, multi-year $20 billion compute agreement from OpenAI. ---------------------------------------------- In Today’s Episode We Discuss: 00:00 Intro 02:18 Is There an AI Infrastructure Bubble? 07:35 Memory Shortages Will Last Years 09:35 2025: The Year AI Became Actually Useful 11:33 Will Frontier Models Commoditize Like Cloud Did? 16:34 Can Google Win by Owning the Full Stack From TPUs to Tokens? 32:53 Data Centers & Local Communities 33:10 AI Layoffs 38:15 The Real Blocker to Enterprise AI Adoption 44:04 Should the US Be Selling Chips to China? 47:00 Why Europe Can't Build Great Tech Companies 53:48 Timing the Cerebras IPO: Luck or Strategy? 57:10 Is the Trump Administration Better for Business? 58:53 Quick-Fire Round ----------------------------------------------- Subscribe on Spotify: https://open.spotify.com/show/3j2KMcZTtgTNBKwtZBMHvl?si=85bc9196860e4466 Subscribe on Apple Podcasts: https://podcasts.apple.com/us/podcast/the-twenty-minute-vc-20vc-venture-capital-startup/id958230465 Follow Harry Stebbings on X: https://twitter.com/HarryStebbings Follow Andrew Feldman on X: https://twitter.com/andrewdfeldman Follow 20VC on Instagram: https://www.instagram.com/20vchq Follow 20VC on TikTok: https://www.tiktok.com/@20vc_tok Visit our Website: https://www.20vc.com Subscribe to our Newsletter: https://www.thetwentyminutevc.com/contact ----------------------------------------------- #20vc #harrystebbings #andrewfeldman #cerebras #ceo #founder #ai #nvidia #chips #china #ailayoffs #ipo

Andrew FeldmanguestHarry Stebbingshost
May 26, 20261h 7mWatch on YouTube ↗

At a glance

WHAT IT’S REALLY ABOUT

Cerebras CEO on AI compute bottlenecks, costs, geopolitics, and adoption

  1. Feldman argues today’s AI buildout is not a classic bubble because data-center and chip supply are trailing real, current demand, creating persistent backlogs across the industry.
  2. He predicts multi-year memory (HBM) shortages because capacity additions are lumpy, capital-intensive, and slow, while AI usefulness has reached a tipping point that sustains demand growth.
  3. He expects long-run token costs to fall via architectural and efficiency improvements across the whole chip industry, while emphasizing speed as a decisive competitive advantage for agentic workflows and search-like experiences.
  4. He claims the biggest near-term blocker to enterprise AI adoption is organizational risk control—security teams and lawyers—before data cleanliness becomes the next constraint.
  5. On geopolitics and industrial policy, he opposes selling leading-edge chips to China, stresses rebuilding US fab/packaging capability, and criticizes Europe’s regulate-first posture as a drag on innovation and adoption.

IDEAS WORTH REMEMBERING

5 ideas

AI infrastructure is constrained by supply, not speculative oversupply.

Feldman contrasts AI with past bubbles (rail/fiber) where capacity got built ahead of demand; he says today’s reality is backlogs everywhere because data centers and supporting infrastructure can’t be built fast enough.

Memory shortages are structurally hard to fix quickly.

HBM is supplied by only a few vendors and expanding fab capacity is a multi-year, tens-of-billions step function; if demand stays high, he expects shortages for “the next several years.”

A 2025 usefulness inflection is what turns training hype into inference-driven demand.

He claims models crossed a threshold where everyday users (across demographics) get real value, so inference usage explodes and keeps pulling forward demand for compute and power.

Speed is a moat because many AI products have near-zero tolerance for latency.

He argues there’s effectively “zero market for slow” AI experiences (search, coding agents, workflows), so large speedups (not just 10–20%) can change competitive outcomes by enabling many more tasks per day.

Token costs should decline over time even if near-term components spike.

Despite HBM-driven GPU cost inflation, Feldman expects the industry’s historical trend—more performance per dollar and per watt through better designs—to continue, pushing compute unit costs down over a 3–5 year horizon.

WORDS WORTH SAVING

5 quotes

The infrastructure build-out is behind demand. We can't build data centers fast enough to keep up with demand. We have a twenty-five billion dollar backlog.

Andrew Feldman

Somewhere in twenty twenty-five, the models got smart enough to be really useful. Before that, Harry, these were sort of, sort of a novelty.

Andrew Feldman

For hard problems, there is no upper bound to how much faster you wanna be, nor the value of speed.

Andrew Feldman

Really. H-It's zero. How big is the market for dial-up, for slow internet? ... That's how impossible it is to engage with an important technology slowly. Why do we believe the inference will be any different? There'll be zero market for slowing them.

Andrew Feldman

No. No, the biggest are, are lawyers.

Andrew Feldman

AI infrastructure “bubble” vs real demand backlogData-center build constraints and permitting delaysHBM memory supply chain and pricing powerToken economics, COGS, and performance-per-wattFull-stack ownership (Google TPUs) vs open-market volumeEnterprise AI adoption: legal/security friction and data readinessUS–China chip export policy and US onshoring fabs

High quality AI-generated summary created from speaker-labeled transcript.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.