No Priors

Baseten CEO Tuhin Srivastava on Custom Models, and Building the Inference Cloud

Baseten CEO and co-founder Tuhin Srivastava sits down with Sarah Guo and Elad Gil to discuss the rapid growth of AI inference demand, Baseten’s 30x growth, and why inference is becoming the strategic “last market.” Srivastava argues the application layer will persist because companies with unique user signals can encode value into workflows and post-train specialized models, citing examples like Abridge and support workflows. The conversation covers GPU capacity constraints, Baseten’s multi-cloud fabric across 18 clouds and 90 clusters, long-term contracting dynamics, the importance of the software layer for stickiness, evolving workloads, multichip possibilities, and operational lessons at scale.

Sign up for new podcasts every week. Email feedback to show@no-priors.com

Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @Tuhinone

Chapters:
00:31 Baseten growth
01:55 Why the app layer wins
05:57 Serving frontier customers
07:55 Open source model mix
09:21 Chinese models and geopolitics
13:07 Custom inference dominates
14:22 Post training acquisition
17:10 When to invest in custom models
18:35 Supply crunch and data centers
22:25 Longer GPU Contracts
24:09 What Makes a Winner
26:07 Multi Chip Future
28:19 Runtime Roadmap
31:08 Scaling Edge Cases
33:48 Hiring and Leadership
36:44 Operations Pager Culture
38:19 Efficiency Drives Demand
40:41 Concierge Everything Future
42:34 Conclusion

Sarah Guo (host) · Tuhin Srivastava (guest) · Elad Gil (host)
May 1, 2026 · 42m

Episode Details

EPISODE INFO

Released: May 1, 2026
Duration: 42m
Channel: No Priors

SPEAKERS

  • Sarah Guo

    host

    Co-host of the No Priors podcast and investor focused on AI and technology startups.

  • Tuhin Srivastava

    guest

    Founder and CEO of Baseten, an AI inference cloud platform.

  • Elad Gil

    host

    Co-host of the No Priors podcast and technology investor/entrepreneur.

EPISODE SUMMARY

In this episode of No Priors, hosts Sarah Guo and Elad Gil talk with Baseten CEO Tuhin Srivastava about custom models, scaling inference, and compute constraints. Baseten’s growth is driven by the rapid expansion of the application layer and the mainstreaming of post-training/RL techniques that let companies “own” and specialize inference.

RELATED EPISODES

Re-engineering the Semiconductor Supply Chain with Intel CEO Lip Bu Tan

We Need An Ecosystem in AI, And Every Company Can Win A Place In It

“Curing All Disease by next century is too conservative” - Mark Zuckerberg

SAP: Bringing the ‘Operating System’ of a Company into the AI Era with CTO Philipp Herzig

Scaling Global Organizations in the Age of AI with ServiceNow Chairman and CEO Bill McDermott

How AI Agents Will Transform the Financial System with Circle Co-Founder and CEO Jeremy Allaire
