Skip to content
Lenny's PodcastLenny's Podcast

Kevin Weil: Why evals are the new core skill in AI products

Through fine-tuning runs and writing evals against the fuzzy outputs; OpenAI builds at the edge of capabilities, betting on better models every two months.

Kevin WeilguestLenny Rachitskyhost
Apr 10, 20251h 31mWatch on YouTube ↗

Episode Details

EPISODE INFO

Released
April 10, 2025
Duration
1h 31m
Channel
Lenny's Podcast
Watch on YouTube
▶ Open ↗

EPISODE DESCRIPTION

Kevin Weil is the chief product officer at OpenAI, where he oversees the development of ChatGPT, enterprise products, and the OpenAI API. Prior to OpenAI, Kevin was head of product at Twitter, Instagram, and Planet, and was instrumental in the development of the Libra (later Novi) cryptocurrency project at Facebook. In this episode, you’ll learn:

  1. How OpenAI structures its product teams and maintains agility while developing cutting-edge AI
  2. The power of model ensembles—using multiple specialized models together like a company of humans with different skills
  3. Why writing effective evals (AI evaluation tests) is becoming a critical skill for product managers
  4. The surprisingly enduring value of chat as an interface for AI, despite predictions of its obsolescence
  5. How “vibe coding” is changing how companies operate
  6. What OpenAI looks for when hiring product managers (hint: high agency and comfort with ambiguity)
  7. “Model maximalism” and why today’s AI is the worst you’ll ever use again
  8. Practical prompting techniques that improve AI interactions, including example-based prompting

Find the transcript at: https://www.lennysnewsletter.com/p/kevin-weil-open-ai Brought to you by:

Where to find Kevin Weil:

Where to find Lenny:

In this episode, we cover: (00:00) Kevin’s background (05:16) OpenAI’s new image model (08:13) The role of chief product officer at OpenAI (11:42) His recruitment story and joining OpenAI (15:59) Working at OpenAI (18:44) The importance of evals in AI (24:40) Opportunities in the space (26:34) Shipping quickly and consistently (29:47) Product reviews and iterative deployment (32:53) Winning consumer awareness (36:03) Designing thoughtful experiences (40:56) Chat as an interface for AI (45:21) Collaboration between researchers and product teams (48:05) Hiring product managers at OpenAI (53:06) How OpenAI uses AI: vibe coding, AI prototyping, and more (01:04:34) Raising kids in an increasingly intelligent AI world (01:08:07) Why Kevin feels optimistic about our AI future (01:14:20) The AI model you're using today is the worst AI model you'll ever use (01:17:58) Reflections on the Libra project (01:21:51) Lightning round and final thoughts Referenced:

...References continued at: https://www.lennysnewsletter.com/p/kevin-weil-open-ai Recommended books:

Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email podcast@lennyrachitsky.com. Lenny may be an investor in the companies discussed.

SPEAKERS

  • Kevin Weil

    guest
  • Lenny Rachitsky

    host
  • Narrator

    other

EPISODE SUMMARY

In this episode of Lenny's Podcast, featuring Kevin Weil and Lenny Rachitsky, Kevin Weil: Why evals are the new core skill in AI products explores openAI’s CPO on building products atop rapidly evolving AI foundations Kevin Weil, Chief Product Officer at OpenAI, explains how building on AI is fundamentally different from past tech shifts because the underlying capabilities improve dramatically every few months. This forces product teams to plan loosely, ship quickly, and design around fuzzy, probabilistic model behavior instead of deterministic software. He highlights the rising importance of evals, fine-tuning, and ensembles of models, and argues that every serious product team will eventually embed ML researchers as core members. Weil also reflects on missed opportunities like Facebook’s Libra, the transformative potential of AI tutoring, and the skills he’s encouraging his kids (and future builders) to develop in an AI-first world.

RELATED EPISODES

Tony Fadell: How to build real taste (and why AI makes it matter more)

Tony Fadell: How to build real taste (and why AI makes it matter more)

The most rational take on AI you’ll hear this year

The most rational take on AI you’ll hear this year

AI predictions: Job markets, Codex beats Claude, and the death of org charts | Dan Shipper

AI predictions: Job markets, Codex beats Claude, and the death of org charts | Dan Shipper

Why the next AI boom is physical AI | Caitlin Kalinowski (ex-OpenAI, Meta, Apple)

Why the next AI boom is physical AI | Caitlin Kalinowski (ex-OpenAI, Meta, Apple)

How Anthropic, Costco, and Patagonia all build incorruptible companies  | Eric Ries

How Anthropic, Costco, and Patagonia all build incorruptible companies | Eric Ries

AI era skills: Why cultivating agency matters more than job titles | Max Schoening (Notion)

AI era skills: Why cultivating agency matters more than job titles | Max Schoening (Notion)

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.