Lenny's Podcast

Edwin Chen: Why optimizing for benchmarks creates AI sloth

How Surge bootstrapped past $1B revenue with fewer than 100 people; Chen argues benchmark gaming pushes AI toward dopamine, emojis, and slop, not truth.

Lenny RachitskyhostEdwin Chenguest

Dec 7, 20251h 10mWatch on YouTube ↗

CHAPTERS

0:00 – 4:48
Introduction to Edwin Chen
4:48 – 7:08
AI’s role in business efficiency
7:08 – 8:55
Building a contrarian company
8:55 – 9:36
An explanation of what Surge AI does
9:36 – 13:31
The importance of high-quality data
13:31 – 17:37
How Claude Code has stayed ahead
17:37 – 21:54
Edwin’s skepticism toward benchmarks
21:54 – 28:33
AGI timelines and industry trends
28:33 – 33:07
The Silicon Valley machine
33:07 – 39:37
Reinforcement learning and future AI training
39:37 – 41:11
Understanding model trajectories
41:11 – 42:55
How models have advanced and will continue to advance
42:55 – 44:39
Adapting to industry needs
44:39 – 48:07
Surge’s research approach
48:07 – 50:43
Predictions for the next few years in AI
50:43 – 52:55
What’s underhyped and overhyped in AI
52:55 – 1:02:18
The story of founding Surge AI
1:02:18 – 1:10:31
Lightning round and final thoughts

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.