Alexandr Wang: Why Data Quality Decides the AI Frontier

Through hard evals against real customer tasks rather than benchmarks; Scale AI proves labeled data quality determines the frontier model performance ceiling.

Garry TanhostAlexandr WangguestJared FriedmanhostHarj Taggarhost

Jun 18, 20251h 1mWatch on YouTube ↗

EPISODE INFO

Released: June 18, 2025
Duration: 1h 1m
Channel: Y Combinator
Watch on YouTube: ▶ Open ↗

EPISODE DESCRIPTION

Alexandr Wang started Scale AI to help machine learning teams label data faster. It started as a simple API for human labor, but behind the scenes, he was tackling a much bigger problem: how to turn messy, real-world data into something AI could learn from. Today, that early idea powers a multi-hundred-million-dollar engine behind America's AI infrastructure—fueling everything from Fortune 500 workflows to real-time military planning. Just last week, Meta agreed to invest over $14 billion in Scale, valuing the company at $29 billion. Alexandr joined us on the Lightcone to share how Scale evolved from a scrappy YC startup into the backbone of some of the world's most advanced AI systems, how he thinks about competition with Chinese AI labs, and what it takes to build infrastructure that shapes the frontier. Apply to Y Combinator: https://ycombinator.com/apply Work at a startup: https://workatastartup.com Chapters (Powered by https://ChapterMe.co): 00:00 Intro 01:15 Alexandr’s early days at YC 07:25 Dialing in on what worked 10:24 Model improvements, evals 19:18 The techno optimist view of work 27:47 The turning points for Scale AI 37:37 Agentic workflows 41:55 “Humanity’s Last Exam” 47:48 U.S. vs China in AI and hard tech 56:57 How to be hardcore

SPEAKERS

Garry Tan
host
Alexandr Wang
guest
Jared Friedman
host
Harj Taggar
host

EPISODE SUMMARY

In this episode of Y Combinator, featuring Garry Tan and Alexandr Wang, Alexandr Wang: Why Data Quality Decides the AI Frontier explores alexandr Wang on Scale AI, Agentic Workflows, and U.S.–China AI Rivalry Alexandr Wang recounts Scale AI’s evolution from a YC-era “API for human labor” into a core infrastructure and applications provider for frontier AI labs, enterprises, and the U.S. Department of Defense. He explains how focusing early on self‑driving car data, then shifting to foundation model data and agentic applications, positioned Scale as the “NVIDIA of data.” Wang outlines a future of work where humans increasingly manage swarms of AI agents rather than being replaced by them, and describes how reinforcement learning and hard evaluations like Humanity’s Last Exam are driving model capabilities. He also warns about China’s rapid progress in AI—especially in data, manufacturing, and espionage—and argues that U.S. strategic advantage will hinge on compute, energy, and maintaining frontier models.

RELATED EPISODES