Skip to content
a16za16z

How OpenAI Builds for 800 Million Weekly Users: Model Specialization and Fine-Tuning

In this episode, a16z GP Martin Casado sits down with Sherwin Wu, Head of Engineering for the OpenAI Platform, to break down how OpenAI organizes its platform across models, pricing, and infrastructure, and how it is shifting from a single general-purpose model to a portfolio of specialized systems, custom fine-tuning options, and node-based agent workflows. They get into why developers tend to stick with a trusted model family, what builds that trust, and why the industry moved past the idea of one model that can do everything. Sherwin also explains the evolution from prompt engineering to context design and how companies use OpenAI’s fine-tuning and RFT APIs to shape model behavior with their own data. Highlights from the conversation include: • How OpenAI balances a horizontal API platform with vertical products like ChatGPT • The evolution from Codex to the Composer model • Why usage-based pricing works and where outcome-based pricing breaks • What the Harmonic Labs and Rockset acquisitions added to OpenAI’s agent work • Why the new agent builder is deterministic, node based, and not free roaming Timestamps: 00:00 Introduction 8:36 Horizontal vs vertical OpenAI 12:18 Why you can’t “disintermediate” the model 15:11 People build relationships with models 17:30 Not one AGI model, but many 20:10 Fine-tuning, RFT, and customer data choices 24:44 Prompt engineering isn’t the point anymore 28:06 What an “agent” really is 31:55 How OpenAI thinks about pricing 36:46 Why open-weights don’t kill the API 42:57 Different stacks for text, images, video 45:47 How the agent builder actually works Stay Updated: If you enjoyed this episode, be sure to like, subscribe, and share with your friends! Find a16z on X: [https://x.com/a16z](https://x.com/a16z) Find a16z on LinkedIn: [https://www.linkedin.com/company/a16z](https://www.linkedin.com/company/a16z) Listen to the a16z Podcast on Spotify: [https://open.spotify.com/show/5bC65RDvs3oxnLyqqvkUYX](https://open.spotify.com/show/5bC65RDvs3oxnLyqqvkUYX) Listen to the a16z Podcast on Apple Podcasts: [https://podcasts.apple.com/us/podcast/a16z-podcast/id842818711](https://podcasts.apple.com/us/podcast/a16z-podcast/id842818711) Follow our host: [https://x.com/eriktorenberg](https://x.com/eriktorenberg) Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details, please see [a16z.com/disclosures](http://a16z.com/disclosures).

Sherwin WuguestMartin Casadohost
Nov 28, 202553mWatch on YouTube ↗

Episode Details

EPISODE INFO

Released
November 28, 2025
Duration
53m
Channel
a16z
Watch on YouTube
▶ Open ↗

EPISODE DESCRIPTION

In this episode, a16z GP Martin Casado sits down with Sherwin Wu, Head of Engineering for the OpenAI Platform, to break down how OpenAI organizes its platform across models, pricing, and infrastructure, and how it is shifting from a single general-purpose model to a portfolio of specialized systems, custom fine-tuning options, and node-based agent workflows. They get into why developers tend to stick with a trusted model family, what builds that trust, and why the industry moved past the idea of one model that can do everything. Sherwin also explains the evolution from prompt engineering to context design and how companies use OpenAI’s fine-tuning and RFT APIs to shape model behavior with their own data. Highlights from the conversation include:

  • How OpenAI balances a horizontal API platform with vertical products like ChatGPT
  • The evolution from Codex to the Composer model
  • Why usage-based pricing works and where outcome-based pricing breaks
  • What the Harmonic Labs and Rockset acquisitions added to OpenAI’s agent work
  • Why the new agent builder is deterministic, node based, and not free roaming

Timestamps: 00:00 Introduction 8:36 Horizontal vs vertical OpenAI 12:18 Why you can’t “disintermediate” the model 15:11 People build relationships with models 17:30 Not one AGI model, but many 20:10 Fine-tuning, RFT, and customer data choices 24:44 Prompt engineering isn’t the point anymore 28:06 What an “agent” really is 31:55 How OpenAI thinks about pricing 36:46 Why open-weights don’t kill the API 42:57 Different stacks for text, images, video 45:47 How the agent builder actually works Stay Updated: If you enjoyed this episode, be sure to like, subscribe, and share with your friends! Find a16z on X: [https://x.com/a16z](https://x.com/a16z) Find a16z on LinkedIn: [https://www.linkedin.com/company/a16z](https://www.linkedin.com/company/a16z) Listen to the a16z Podcast on Spotify: [https://open.spotify.com/show/5bC65RDvs3oxnLyqqvkUYX](https://open.spotify.com/show/5bC65RDvs3oxnLyqqvkUYX) Listen to the a16z Podcast on Apple Podcasts: [https://podcasts.apple.com/us/podcast/a16z-podcast/id842818711](https://podcasts.apple.com/us/podcast/a16z-podcast/id842818711) Follow our host: [https://x.com/eriktorenberg](https://x.com/eriktorenberg) Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details, please see [a16z.com/disclosures](http://a16z.com/disclosures).

SPEAKERS

  • Sherwin Wu

    guest

    OpenAI product/API leader discussing model specialization, fine-tuning, and developer platform strategy.

  • Martin Casado

    host

    a16z general partner and tech investor hosting the interview and asking product/strategy questions.

EPISODE SUMMARY

In this episode of a16z, featuring Sherwin Wu and Martin Casado, How OpenAI Builds for 800 Million Weekly Users: Model Specialization and Fine-Tuning explores openAI scales platform and apps through specialized models, tuning, agents OpenAI intentionally operates as both a vertical app company (ChatGPT) and a horizontal platform (API), accepting inherent ecosystem tension in service of broad distribution.

RELATED EPISODES

The Golden Age Thesis | Marc Andreessen on MTS

The Golden Age Thesis | Marc Andreessen on MTS

The Investor Behind Costco, Starbucks, and Blackstone | Tony James on The a16z Show

The Investor Behind Costco, Starbucks, and Blackstone | Tony James on The a16z Show

Digital Freedom, AI Regulation, and the Fight for the Western Internet | The a16z Show

Digital Freedom, AI Regulation, and the Fight for the Western Internet | The a16z Show

Crypto Experts Explain Stablecoins & the Future Financial System w/ Ali Yahya & Arianna Simpson

Crypto Experts Explain Stablecoins & the Future Financial System w/ Ali Yahya & Arianna Simpson

Why Scale Will Not Solve AGI | Vishal Misra - The a16z Show

Why Scale Will Not Solve AGI | Vishal Misra - The a16z Show

Emil Michael: The Department of War Is Moving Faster Than Silicon Valley on AI | The a16z Show

Emil Michael: The Department of War Is Moving Faster Than Silicon Valley on AI | The a16z Show

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome