Lenny's Podcast

Edwin Chen: Why optimizing for benchmarks creates AI slop

How Surge bootstrapped past $1B revenue with fewer than 100 people; Chen argues benchmark gaming pushes AI toward dopamine, emojis, and slop, not truth.

Lenny Rachitsky (host), Edwin Chen (guest)
Dec 7, 2025 · 1h 10m

CHAPTERS

  1. Introduction to Edwin Chen (0:00 – 4:48)
  2. AI’s role in business efficiency (4:48 – 7:08)
  3. Building a contrarian company (7:08 – 8:55)
  4. An explanation of what Surge AI does (8:55 – 9:36)
  5. The importance of high-quality data (9:36 – 13:31)
  6. How Claude Code has stayed ahead (13:31 – 17:37)
  7. Edwin’s skepticism toward benchmarks (17:37 – 21:54)
  8. AGI timelines and industry trends (21:54 – 28:33)
  9. The Silicon Valley machine (28:33 – 33:07)
  10. Reinforcement learning and future AI training (33:07 – 39:37)
  11. Understanding model trajectories (39:37 – 41:11)
  12. How models have advanced and will continue to advance (41:11 – 42:55)
  13. Adapting to industry needs (42:55 – 44:39)
  14. Surge’s research approach (44:39 – 48:07)
  15. Predictions for the next few years in AI (48:07 – 50:43)
  16. What’s underhyped and overhyped in AI (50:43 – 52:55)
  17. The story of founding Surge AI (52:55 – 1:02:18)
  18. Lightning round and final thoughts (1:02:18 – 1:10:31)
