
Turing CEO Jonathan Siddharth: Who Wins in Data Labelling & Why 99% of Knowledge Work Will Disappear
Jonathan Siddharth (guest), Harry Stebbings (host), Harry Stebbings (host)
In this episode of The Twenty Minute VC, featuring Jonathan Siddharth and Harry Stebbings, Turing CEO Jonathan Siddharth: Who Wins in Data Labelling & Why 99% of Knowledge Work Will Disappear explores turing CEO predicts AI agents will automate nearly all knowledge work Jonathan Siddharth, CEO of Turing, argues that traditional data-labeling firms are obsolete and being replaced by "research accelerators" that build complex reinforcement-learning (RL) environments to train agentic AI systems. He describes how Turing powers the “data pillar” for 7 of 8 frontier labs, creating synthetic but realistic workflows across industries so models can learn to perform economically valuable, multi-step knowledge work. Siddharth predicts that virtually all digital knowledge work—about $30 trillion worth—will be automated over time, with slow but steady AI capability takeoff and a massive shift of budget from human labor to AI systems. He also foresees the decline of classic SaaS, the rise of custom fine-tuned small models inside enterprises and governments, and a future where individuals are 100x more productive through fleets of agentic AI “exoskeletons.”
Turing CEO predicts AI agents will automate nearly all knowledge work
Jonathan Siddharth, CEO of Turing, argues that traditional data-labeling firms are obsolete and being replaced by "research accelerators" that build complex reinforcement-learning (RL) environments to train agentic AI systems. He describes how Turing powers the “data pillar” for 7 of 8 frontier labs, creating synthetic but realistic workflows across industries so models can learn to perform economically valuable, multi-step knowledge work. Siddharth predicts that virtually all digital knowledge work—about $30 trillion worth—will be automated over time, with slow but steady AI capability takeoff and a massive shift of budget from human labor to AI systems. He also foresees the decline of classic SaaS, the rise of custom fine-tuned small models inside enterprises and governments, and a future where individuals are 100x more productive through fleets of agentic AI “exoskeletons.”
Key Takeaways
Data-labeling is giving way to research accelerators that build RL environments.
Turing no longer focuses on simple annotations but on constructing rich reinforcement-learning ‘mini-worlds’ that mimic real workflows across industries, enabling models to learn tool use, multi-step reasoning, and agentic behavior.
Get the full analysis with uListen AI
AI training data needs have shifted from simple tasks to complex, domain‑expert workflows.
As models become smarter, the marginal value comes from high-skill, vertically specific data (e. ...
Get the full analysis with uListen AI
Custom small models fine-tuned on proprietary data will be a durable enterprise pattern.
Siddharth argues many enterprise use cases (e. ...
Get the full analysis with uListen AI
Nearly all digital knowledge work will be automated, but via slow takeoff.
He believes any job done on a computer using tools, keyboard, and mouse will be automated over time, yet adoption will be gradual, especially in back-office functions, giving society and enterprises time to adapt and redesign workflows.
Get the full analysis with uListen AI
Moats will come from data-driven feedback loops and deployment, not raw tech.
Similar to Google’s search advantage, winners will continuously collect real usage data, see where models fail in production, and use that feedback to generate targeted new training data, creating compounding performance gaps.
Get the full analysis with uListen AI
Classic SaaS is structurally threatened by easy app-building and agentic models.
He predicts many companies will replace generic SaaS with custom AI-powered workflows, while foundation model providers may ‘sonic boom’ into application layers as agents directly operate existing GUIs and internal tools.
Get the full analysis with uListen AI
AI will massively expand who can build software and start companies.
By turning ‘intelligence into an API’ for $20/month, Siddharth expects non-technical experts (e. ...
Get the full analysis with uListen AI
Notable Quotes
“I think the era of data labeling companies is over. It’s now the era of research accelerators.”
— Jonathan Siddharth
“All knowledge work is going to be automated. It’s only a matter of time.”
— Jonathan Siddharth
“SaaS, as we know it, I think is over.”
— Jonathan Siddharth
“There is a very significant model capability overhang… the models are capable of X, but what we are getting out of the models is X minus delta.”
— Jonathan Siddharth
“Whoever wins the superintelligence race will probably win search, consumer devices, operating systems, and cloud. You’re playing for everything.”
— Jonathan Siddharth
Questions Answered in This Episode
If nearly all digital knowledge work becomes automated, how should individuals and companies strategically reskill and reposition over the next decade?
Jonathan Siddharth, CEO of Turing, argues that traditional data-labeling firms are obsolete and being replaced by "research accelerators" that build complex reinforcement-learning (RL) environments to train agentic AI systems. ...
Get the full analysis with uListen AI
What concrete metrics or signals can distinguish a true ‘research accelerator’ from a traditional data-labeling vendor in this new ecosystem?
Get the full analysis with uListen AI
How will regulators and governments balance the need for sovereign models and data control with the global race toward superintelligence?
Get the full analysis with uListen AI
In a world where foundation models can directly operate software as agents, what types of SaaS products (if any) remain defensible?
Get the full analysis with uListen AI
How should enterprises practically approach the ‘first-mile’ and ‘last-mile’ schlep to turn powerful frontier models into reliable, production-grade agents?
Get the full analysis with uListen AI
Transcript Preview
I think the era of data labeling companies is over, and it's now the era of research accelerators.
Today, I do not pull any punches with Jonathan Siddharth, founder and CEO of Turing, a company that he has scaled to over 350 million in annual recurring revenue.
All knowledge work is going to be automated. It's only a matter of time. I don't see an AI bubble. These models are incredibly powerful today. SaaS, as we know it, I think is over. I think it's completely over.
Ready to go? Jonathan, I've been so looking forward to this. Thank you so much for joining me in person. It's such a treat to do it in person while you're in London.
Thank you for having me, Harry.
Now, I want to start with a little bit of definitions because everyone thinks they're talent marketplaces, and then everyone pushes back on talent marketplaces. How do you describe it, and why are we not dealing with talent marketplaces anymore?
So, I think of a talent marketplace as something that's basically matching talent to something, maybe it's an opportunity. So, so Turing is not a talent marketplace. The, um... what we do at Turing is we're training superintelligence. We work with seven out of the eight frontier labs. To get to superintelligence, you need research, compute, and data. Research, the labs do in-house with OpenAI, Anthropic, DeepMind, et cetera. For compute, we have, uh, Jensen to thank, and maybe NVIDIA as well. But, uh, on the data side, Turing power is the data pillar. On the data side, there's been a significant shift in the last couple of years. So a few years back, uh, the models weren't quite smart enough. And as the models have gotten increasingly smarter, the data needed to improve them has become harder to generate.
And this is because it's more sophisticated data that's required to improve the models. It's like vertically specific people in task and workflows that isn't so obvious, like cat pictures.
That's correct. That's correct. And it's, uh, there's a shift in the data going from simple to complex. I mean, let's take coding, for example. Uh, a few years ago, the kind of dataset a, a contractor who gen- could generate might look like, "Hey, write a Python program to sort some numbers." Today, the data that's generated might be, "Write a, uh, a B2B marketplace app that connects, uh, doctors with patients. And write it on, for Android, uh, with Kotlin/Java, write it for iOS with, uh, Swift, and write it on the web, like with Next.js or something." Right? That's the complexity. So there's a shift in going from simple to complex. So it's no longer the kind of data that low-skilled, medium-skilled contractors c- can generate. You need expert humans in every domain.
Yeah.
The second shift, uh, is, uh, we've gone from teaching AI to take tests and pass tests to teaching AI to do real work. It's less about having AI pass the bar. It's more about, can AI do the job of a lawyer? Can it do the job of a privacy lawyer, a compliance lawyer, a paralegal? So having AI be good at doing economically valuable work. So that's, that's a shift. Uh, the third, uh, shift is we've gone from chatbots to agents, right? Like, the, we started off with ChatGPT where you're asking questions, getting answers, which is great. But now it's about the models becoming agentic, where they can execute complex, multi-step workflows in a real-world business setting. And the type of data you need for that is totally different.
Install uListen to search the full transcript and get AI-powered insights
Get Full TranscriptGet more from every podcast
AI summaries, searchable transcripts, and fact-checking. Free forever.
Add to Chrome