Skip to content
Y CombinatorY Combinator

Paul Buchheit: Why Evals, Not Code, Are the Real AI Moat

Predicting the next token dissolved the paperclip maximizer fear; now eval sets, not codebases, are the moat, as Jerry shows with 50% growth post-GPT-4.

Garry TanhostPaul BuchheitguestHarj TaggarhostJared FriedmanhostDiana Huhost
Jan 24, 202539mWatch on YouTube ↗

Episode Details

EPISODE INFO

Released
January 24, 2025
Duration
39m
Channel
Y Combinator
Watch on YouTube
▶ Open ↗

EPISODE DESCRIPTION

In this special episode of Lightcone, we’re joined by YC partner and creator of Gmail Paul Buchheit to dig into some of the latest trends in the world of AI startups. We recorded our conversation at a recent retreat where 300 of the top AI founders in the world gathered to share expertise and make predictions about how this technology will shape our future. In the discussion we cover a wide range of topics including: The future of work, the power of agency and taste in an AI world and why this is the absolute best time to be building a startup. Learn more about the YC startups mentioned in this episode: Tavus: https://www.tavus.io Jerry: https://getjerry.com Apply to Y Combinator: https://ycombinator.com/apply Chapters (Powered by https://bit.ly/chapterme-yc) - 00:00 - Intro 01:17 - Retreat. 05:04 - Demand for AI. 08:00 - Evals. 10:00 - Product iteration. 14:37 - Balance. 16:16 - Automation. 19:31 - Predictive models. 22:21 - OpenAI. 24:58 - AI tools. 26:23 - Cursor. 29:30 - Scaling. 33:16 - ROI. 36:37 - Startup success. 38:30 - Outro

SPEAKERS

  • Garry Tan

    host
  • Paul Buchheit

    guest
  • Harj Taggar

    host
  • Jared Friedman

    host
  • Diana Hu

    host

EPISODE SUMMARY

In this episode of Y Combinator, featuring Garry Tan and Paul Buchheit, Paul Buchheit: Why Evals, Not Code, Are the Real AI Moat explores aI Agents Explode Startup Growth, Redefining Work, Wealth, And Software The episode covers how AI, especially AI agents for businesses, is radically accelerating startup growth, with many YC companies hitting revenue milestones in months that once took years. The hosts describe unprecedented enterprise demand for AI, where buyers already want solutions and technical founders win by simply delivering products that actually work. They argue that the true moats are now eval datasets, prompting expertise, rapid iteration, and willingness to constantly rebuild on the latest models. The conversation broadens into how AI can increase human agency, reshape labor and wealth (machine money vs human money), and why we may be on a “good timeline” for AI development rather than a dystopian one.

RELATED EPISODES

The CEO Must Be the Chief AI Officer

The CEO Must Be the Chief AI Officer

Inside YC's AI Playbook

Inside YC's AI Playbook

Tokenmaxxing: How Top Builders Use AI To Do The Work Of 400 Engineers

Tokenmaxxing: How Top Builders Use AI To Do The Work Of 400 Engineers

The GPT Moment for Robotics Is Here

The GPT Moment for Robotics Is Here

AI Is Unlocking Millions Of New Builders

AI Is Unlocking Millions Of New Builders

The Powerful Alternative To Fine-Tuning

The Powerful Alternative To Fine-Tuning

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.