The Twenty Minute VCTuring CEO Jonathan Siddharth: Who Wins in Data Labelling & Why 99% of Knowledge Work Will Disappear
At a glance
WHAT IT’S REALLY ABOUT
Turing CEO predicts AI agents will automate nearly all knowledge work
- Jonathan Siddharth, CEO of Turing, argues that traditional data-labeling firms are obsolete and being replaced by "research accelerators" that build complex reinforcement-learning (RL) environments to train agentic AI systems. He describes how Turing powers the “data pillar” for 7 of 8 frontier labs, creating synthetic but realistic workflows across industries so models can learn to perform economically valuable, multi-step knowledge work. Siddharth predicts that virtually all digital knowledge work—about $30 trillion worth—will be automated over time, with slow but steady AI capability takeoff and a massive shift of budget from human labor to AI systems. He also foresees the decline of classic SaaS, the rise of custom fine-tuned small models inside enterprises and governments, and a future where individuals are 100x more productive through fleets of agentic AI “exoskeletons.”
IDEAS WORTH REMEMBERING
5 ideasData-labeling is giving way to research accelerators that build RL environments.
Turing no longer focuses on simple annotations but on constructing rich reinforcement-learning ‘mini-worlds’ that mimic real workflows across industries, enabling models to learn tool use, multi-step reasoning, and agentic behavior.
AI training data needs have shifted from simple tasks to complex, domain‑expert workflows.
As models become smarter, the marginal value comes from high-skill, vertically specific data (e.g., complex coding tasks, legal workflows, underwriting), which requires expert humans and realistic environments, not commodity labelers.
Custom small models fine-tuned on proprietary data will be a durable enterprise pattern.
Siddharth argues many enterprise use cases (e.g., insurance underwriting) are best served by smaller on-prem models fine-tuned on decades of internal decisions, offering better accuracy, speed, and data control than trillion-parameter general models.
Nearly all digital knowledge work will be automated, but via slow takeoff.
He believes any job done on a computer using tools, keyboard, and mouse will be automated over time, yet adoption will be gradual, especially in back-office functions, giving society and enterprises time to adapt and redesign workflows.
Moats will come from data-driven feedback loops and deployment, not raw tech.
Similar to Google’s search advantage, winners will continuously collect real usage data, see where models fail in production, and use that feedback to generate targeted new training data, creating compounding performance gaps.
WORDS WORTH SAVING
5 quotesI think the era of data labeling companies is over. It’s now the era of research accelerators.
— Jonathan Siddharth
All knowledge work is going to be automated. It’s only a matter of time.
— Jonathan Siddharth
SaaS, as we know it, I think is over.
— Jonathan Siddharth
There is a very significant model capability overhang… the models are capable of X, but what we are getting out of the models is X minus delta.
— Jonathan Siddharth
Whoever wins the superintelligence race will probably win search, consumer devices, operating systems, and cloud. You’re playing for everything.
— Jonathan Siddharth
High quality AI-generated summary created from speaker-labeled transcript.
Get more out of YouTube videos.
High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.
Add to Chrome