Skip to content
Y CombinatorY Combinator

Alexandr Wang: Why Data Quality Decides the AI Frontier

Through hard evals against real customer tasks rather than benchmarks; Scale AI proves labeled data quality determines the frontier model performance ceiling.

Garry TanhostAlexandr WangguestJared FriedmanhostHarj Taggarhost
Jun 18, 20251h 1mWatch on YouTube ↗

Episode Details

EPISODE INFO

Released
June 18, 2025
Duration
1h 1m
Channel
Y Combinator
Watch on YouTube
▶ Open ↗

EPISODE DESCRIPTION

Alexandr Wang started Scale AI to help machine learning teams label data faster. It started as a simple API for human labor, but behind the scenes, he was tackling a much bigger problem: how to turn messy, real-world data into something AI could learn from. Today, that early idea powers a multi-hundred-million-dollar engine behind America's AI infrastructure—fueling everything from Fortune 500 workflows to real-time military planning. Just last week, Meta agreed to invest over $14 billion in Scale, valuing the company at $29 billion. Alexandr joined us on the Lightcone to share how Scale evolved from a scrappy YC startup into the backbone of some of the world's most advanced AI systems, how he thinks about competition with Chinese AI labs, and what it takes to build infrastructure that shapes the frontier. Apply to Y Combinator: https://ycombinator.com/apply Work at a startup: https://workatastartup.com Chapters (Powered by https://ChapterMe.co): 00:00 Intro 01:15 Alexandr’s early days at YC 07:25 Dialing in on what worked 10:24 Model improvements, evals 19:18 The techno optimist view of work 27:47 The turning points for Scale AI 37:37 Agentic workflows 41:55 “Humanity’s Last Exam” 47:48 U.S. vs China in AI and hard tech 56:57 How to be hardcore

SPEAKERS

  • Garry Tan

    host
  • Alexandr Wang

    guest
  • Jared Friedman

    host
  • Harj Taggar

    host

EPISODE SUMMARY

In this episode of Y Combinator, featuring Garry Tan and Alexandr Wang, Alexandr Wang: Why Data Quality Decides the AI Frontier explores alexandr Wang on Scale AI, Agentic Workflows, and U.S.–China AI Rivalry Alexandr Wang recounts Scale AI’s evolution from a YC-era “API for human labor” into a core infrastructure and applications provider for frontier AI labs, enterprises, and the U.S. Department of Defense. He explains how focusing early on self‑driving car data, then shifting to foundation model data and agentic applications, positioned Scale as the “NVIDIA of data.” Wang outlines a future of work where humans increasingly manage swarms of AI agents rather than being replaced by them, and describes how reinforcement learning and hard evaluations like Humanity’s Last Exam are driving model capabilities. He also warns about China’s rapid progress in AI—especially in data, manufacturing, and espionage—and argues that U.S. strategic advantage will hinge on compute, energy, and maintaining frontier models.

RELATED EPISODES

Tokenmaxxing: How Top Builders Use AI To Do The Work Of 400 Engineers

Tokenmaxxing: How Top Builders Use AI To Do The Work Of 400 Engineers

Make Something Agents Want

Make Something Agents Want

Boris Cherny: How We Built Claude Code

Boris Cherny: How We Built Claude Code

AI Revolution: What Nobody Else Is Seeing

AI Revolution: What Nobody Else Is Seeing

How To Get AI Startup Ideas

How To Get AI Startup Ideas

How AI Is Changing Enterprise

How AI Is Changing Enterprise

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome