Skip to content
Lenny's PodcastLenny's Podcast

Kevin Weil: Why evals are the new core skill in AI products

Through fine-tuning runs and writing evals against the fuzzy outputs; OpenAI builds at the edge of capabilities, betting on better models every two months.

Kevin WeilguestLenny Rachitskyhost
Apr 10, 20251h 31mWatch on YouTube ↗

Episode Details

EPISODE INFO

Released
April 10, 2025
Duration
1h 31m
Channel
Lenny's Podcast
Watch on YouTube
▶ Open ↗

EPISODE DESCRIPTION

Kevin Weil is the chief product officer at OpenAI, where he oversees the development of ChatGPT, enterprise products, and the OpenAI API. Prior to OpenAI, Kevin was head of product at Twitter, Instagram, and Planet, and was instrumental in the development of the Libra (later Novi) cryptocurrency project at Facebook. In this episode, you’ll learn:

  1. How OpenAI structures its product teams and maintains agility while developing cutting-edge AI
  2. The power of model ensembles—using multiple specialized models together like a company of humans with different skills
  3. Why writing effective evals (AI evaluation tests) is becoming a critical skill for product managers
  4. The surprisingly enduring value of chat as an interface for AI, despite predictions of its obsolescence
  5. How “vibe coding” is changing how companies operate
  6. What OpenAI looks for when hiring product managers (hint: high agency and comfort with ambiguity)
  7. “Model maximalism” and why today’s AI is the worst you’ll ever use again
  8. Practical prompting techniques that improve AI interactions, including example-based prompting

Find the transcript at: https://www.lennysnewsletter.com/p/kevin-weil-open-ai Brought to you by:

Where to find Kevin Weil:

Where to find Lenny:

In this episode, we cover: (00:00) Kevin’s background (05:16) OpenAI’s new image model (08:13) The role of chief product officer at OpenAI (11:42) His recruitment story and joining OpenAI (15:59) Working at OpenAI (18:44) The importance of evals in AI (24:40) Opportunities in the space (26:34) Shipping quickly and consistently (29:47) Product reviews and iterative deployment (32:53) Winning consumer awareness (36:03) Designing thoughtful experiences (40:56) Chat as an interface for AI (45:21) Collaboration between researchers and product teams (48:05) Hiring product managers at OpenAI (53:06) How OpenAI uses AI: vibe coding, AI prototyping, and more (01:04:34) Raising kids in an increasingly intelligent AI world (01:08:07) Why Kevin feels optimistic about our AI future (01:14:20) The AI model you're using today is the worst AI model you'll ever use (01:17:58) Reflections on the Libra project (01:21:51) Lightning round and final thoughts Referenced:

...References continued at: https://www.lennysnewsletter.com/p/kevin-weil-open-ai Recommended books:

Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email podcast@lennyrachitsky.com. Lenny may be an investor in the companies discussed.

SPEAKERS

  • Kevin Weil

    guest
  • Lenny Rachitsky

    host
  • Narrator

    other

EPISODE SUMMARY

In this episode of Lenny's Podcast, featuring Kevin Weil and Lenny Rachitsky, Kevin Weil: Why evals are the new core skill in AI products explores openAI’s CPO on building products atop rapidly evolving AI foundations Kevin Weil, Chief Product Officer at OpenAI, explains how building on AI is fundamentally different from past tech shifts because the underlying capabilities improve dramatically every few months. This forces product teams to plan loosely, ship quickly, and design around fuzzy, probabilistic model behavior instead of deterministic software. He highlights the rising importance of evals, fine-tuning, and ensembles of models, and argues that every serious product team will eventually embed ML researchers as core members. Weil also reflects on missed opportunities like Facebook’s Libra, the transformative potential of AI tutoring, and the skills he’s encouraging his kids (and future builders) to develop in an AI-first world.

RELATED EPISODES

How to build a company that withstands any era | Eric Ries, Lean Startup author

How to build a company that withstands any era | Eric Ries, Lean Startup author

Head of Claude Code: What happens after coding is solved | Boris Cherny

Head of Claude Code: What happens after coding is solved | Boris Cherny

Building product at Stripe: craft, metrics, and customer obsession | Jeff Weinstein (Product lead)

Building product at Stripe: craft, metrics, and customer obsession | Jeff Weinstein (Product lead)

Building a world-class data org | Jessica Lachs (VP of Analytics and Data Science at DoorDash)

Building a world-class data org | Jessica Lachs (VP of Analytics and Data Science at DoorDash)

What most people miss about marketing | Rory Sutherland (Vice Chairman of Ogilvy UK, author)

What most people miss about marketing | Rory Sutherland (Vice Chairman of Ogilvy UK, author)

5 essential questions to craft a winning strategy | Roger Martin (author, advisor, speaker)

5 essential questions to craft a winning strategy | Roger Martin (author, advisor, speaker)

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome