Turing CEO Jonathan Siddharth: Who Wins in Data Labelling & Why 99% of Knowledge Work Will Disappear

The Twenty Minute VCDec 1, 20251h 17m

Jonathan Siddharth (guest), Harry Stebbings (host), Harry Stebbings (host)

Shift from data-labeling companies to research accelerators and RL environmentsTraining agentic AI systems to perform real-world, multi-step knowledge workCustom fine‑tuned small models for enterprises and future government/sovereign modelsAutomation of $30T of digital knowledge work and labor-to-AI budget transferFuture of SaaS, software engineering, and moats in an AGI-centric worldAI adoption dynamics in incumbents vs startups; front-office vs back-office useLong-term AI trajectory: slow takeoff, superintelligence, new interfaces and devices

In this episode of The Twenty Minute VC, featuring Jonathan Siddharth and Harry Stebbings, Turing CEO Jonathan Siddharth: Who Wins in Data Labelling & Why 99% of Knowledge Work Will Disappear explores turing CEO predicts AI agents will automate nearly all knowledge work Jonathan Siddharth, CEO of Turing, argues that traditional data-labeling firms are obsolete and being replaced by "research accelerators" that build complex reinforcement-learning (RL) environments to train agentic AI systems. He describes how Turing powers the “data pillar” for 7 of 8 frontier labs, creating synthetic but realistic workflows across industries so models can learn to perform economically valuable, multi-step knowledge work. Siddharth predicts that virtually all digital knowledge work—about $30 trillion worth—will be automated over time, with slow but steady AI capability takeoff and a massive shift of budget from human labor to AI systems. He also foresees the decline of classic SaaS, the rise of custom fine-tuned small models inside enterprises and governments, and a future where individuals are 100x more productive through fleets of agentic AI “exoskeletons.”

Turing CEO predicts AI agents will automate nearly all knowledge work

Jonathan Siddharth, CEO of Turing, argues that traditional data-labeling firms are obsolete and being replaced by "research accelerators" that build complex reinforcement-learning (RL) environments to train agentic AI systems. He describes how Turing powers the “data pillar” for 7 of 8 frontier labs, creating synthetic but realistic workflows across industries so models can learn to perform economically valuable, multi-step knowledge work. Siddharth predicts that virtually all digital knowledge work—about $30 trillion worth—will be automated over time, with slow but steady AI capability takeoff and a massive shift of budget from human labor to AI systems. He also foresees the decline of classic SaaS, the rise of custom fine-tuned small models inside enterprises and governments, and a future where individuals are 100x more productive through fleets of agentic AI “exoskeletons.”

Key Takeaways

Data-labeling is giving way to research accelerators that build RL environments.

Turing no longer focuses on simple annotations but on constructing rich reinforcement-learning ‘mini-worlds’ that mimic real workflows across industries, enabling models to learn tool use, multi-step reasoning, and agentic behavior.

Get the full analysis with uListen AI

AI training data needs have shifted from simple tasks to complex, domain‑expert workflows.

As models become smarter, the marginal value comes from high-skill, vertically specific data (e. ...

Get the full analysis with uListen AI

Custom small models fine-tuned on proprietary data will be a durable enterprise pattern.

Siddharth argues many enterprise use cases (e. ...

Get the full analysis with uListen AI

Nearly all digital knowledge work will be automated, but via slow takeoff.

He believes any job done on a computer using tools, keyboard, and mouse will be automated over time, yet adoption will be gradual, especially in back-office functions, giving society and enterprises time to adapt and redesign workflows.

Get the full analysis with uListen AI

Moats will come from data-driven feedback loops and deployment, not raw tech.

Similar to Google’s search advantage, winners will continuously collect real usage data, see where models fail in production, and use that feedback to generate targeted new training data, creating compounding performance gaps.

Get the full analysis with uListen AI

Classic SaaS is structurally threatened by easy app-building and agentic models.

He predicts many companies will replace generic SaaS with custom AI-powered workflows, while foundation model providers may ‘sonic boom’ into application layers as agents directly operate existing GUIs and internal tools.

Get the full analysis with uListen AI

AI will massively expand who can build software and start companies.

By turning ‘intelligence into an API’ for $20/month, Siddharth expects non-technical experts (e. ...

Get the full analysis with uListen AI

Notable Quotes

“I think the era of data labeling companies is over. It’s now the era of research accelerators.”
— Jonathan Siddharth

“All knowledge work is going to be automated. It’s only a matter of time.”
— Jonathan Siddharth

“SaaS, as we know it, I think is over.”
— Jonathan Siddharth

“There is a very significant model capability overhang… the models are capable of X, but what we are getting out of the models is X minus delta.”
— Jonathan Siddharth

“Whoever wins the superintelligence race will probably win search, consumer devices, operating systems, and cloud. You’re playing for everything.”
— Jonathan Siddharth

Questions Answered in This Episode

If nearly all digital knowledge work becomes automated, how should individuals and companies strategically reskill and reposition over the next decade?

Get the full analysis with uListen AI

What concrete metrics or signals can distinguish a true ‘research accelerator’ from a traditional data-labeling vendor in this new ecosystem?

Get the full analysis with uListen AI

How will regulators and governments balance the need for sovereign models and data control with the global race toward superintelligence?

Get the full analysis with uListen AI

In a world where foundation models can directly operate software as agents, what types of SaaS products (if any) remain defensible?

Get the full analysis with uListen AI

How should enterprises practically approach the ‘first-mile’ and ‘last-mile’ schlep to turn powerful frontier models into reliable, production-grade agents?

Get the full analysis with uListen AI

Transcript Preview

Jonathan Siddharth

I think the era of data labeling companies is over, and it's now the era of research accelerators.

Harry Stebbings

Today, I do not pull any punches with Jonathan Siddharth, founder and CEO of Turing, a company that he has scaled to over 350 million in annual recurring revenue.

Jonathan Siddharth

All knowledge work is going to be automated. It's only a matter of time. I don't see an AI bubble. These models are incredibly powerful today. SaaS, as we know it, I think is over. I think it's completely over.

Harry Stebbings

Ready to go? Jonathan, I've been so looking forward to this. Thank you so much for joining me in person. It's such a treat to do it in person while you're in London.

Jonathan Siddharth

Thank you for having me, Harry.

Harry Stebbings

Now, I want to start with a little bit of definitions because everyone thinks they're talent marketplaces, and then everyone pushes back on talent marketplaces. How do you describe it, and why are we not dealing with talent marketplaces anymore?

Jonathan Siddharth

So, I think of a talent marketplace as something that's basically matching talent to something, maybe it's an opportunity. So, so Turing is not a talent marketplace. The, um... what we do at Turing is we're training superintelligence. We work with seven out of the eight frontier labs. To get to superintelligence, you need research, compute, and data. Research, the labs do in-house with OpenAI, Anthropic, DeepMind, et cetera. For compute, we have, uh, Jensen to thank, and maybe NVIDIA as well. But, uh, on the data side, Turing power is the data pillar. On the data side, there's been a significant shift in the last couple of years. So a few years back, uh, the models weren't quite smart enough. And as the models have gotten increasingly smarter, the data needed to improve them has become harder to generate.

Harry Stebbings

And this is because it's more sophisticated data that's required to improve the models. It's like vertically specific people in task and workflows that isn't so obvious, like cat pictures.

Jonathan Siddharth

That's correct. That's correct. And it's, uh, there's a shift in the data going from simple to complex. I mean, let's take coding, for example. Uh, a few years ago, the kind of dataset a, a contractor who gen- could generate might look like, "Hey, write a Python program to sort some numbers." Today, the data that's generated might be, "Write a, uh, a B2B marketplace app that connects, uh, doctors with patients. And write it on, for Android, uh, with Kotlin/Java, write it for iOS with, uh, Swift, and write it on the web, like with Next.js or something." Right? That's the complexity. So there's a shift in going from simple to complex. So it's no longer the kind of data that low-skilled, medium-skilled contractors c- can generate. You need expert humans in every domain.

Harry Stebbings

Yeah.

Jonathan Siddharth

The second shift, uh, is, uh, we've gone from teaching AI to take tests and pass tests to teaching AI to do real work. It's less about having AI pass the bar. It's more about, can AI do the job of a lawyer? Can it do the job of a privacy lawyer, a compliance lawyer, a paralegal? So having AI be good at doing economically valuable work. So that's, that's a shift. Uh, the third, uh, shift is we've gone from chatbots to agents, right? Like, the, we started off with ChatGPT where you're asking questions, getting answers, which is great. But now it's about the models becoming agentic, where they can execute complex, multi-step workflows in a real-world business setting. And the type of data you need for that is totally different.

Install uListen to search the full transcript and get AI-powered insights

Get Full Transcript

Get more from every podcast

AI summaries, searchable transcripts, and fact-checking. Free forever.

Add to Chrome