- 0:00 – 0:30
Introduction: Why robotics?
- SPSpeaker
[upbeat music] Today, a lot of the emphasis is on how frontier AI models are transforming software engineering. What we're interested in understanding is how that can begin to translate into the physical world.
- SPSpeaker
Robotics is sort of the clear entry point to how you have a mostly software system start having the ability to reach out into the real world. [upbeat music]
- 0:30 – 1:02
The experiment
- SPSpeaker
Project Fetch is this self-contained experiment where we wanted to measure how much does Claude, uh, accelerate humans performing a fairly sophisticated technical task that they do not have experience with.
- SPSpeaker
Project Fetch was a one-day experiment.
- SPSpeaker
The experiment was three phases. All of these tasks were shaped approximately like, get this robot dog to fetch a beach ball.
- SPSpeaker
There were two teams.
- SPSpeaker
These teams were comprised of software engineers and research engineers at Anthropic that had hardly any robotics experience. One team had access to Claude, and the other team did not. [camera shuttering]
- 1:02 – 1:51
Phase 1: Fetch, manually
- SPSpeaker
Phase 1 was very simple. It was to use the pre-provided controllers to get the dog to walk out to a beach ball and bring it back to where it started.
- SPSpeaker
Nice.
- SPSpeaker
Oh, hey. All right.
- SPSpeaker
Seems pretty intuitive.
- SPSpeaker
Yep.
- SPSpeaker
And where are we supposed to bring it? Over by the bone?
- SPSpeaker
Yeah, I think we're-
- SPSpeaker
I think the team with Claude took about seven minutes. [upbeat music]
- SPSpeaker
Go.
- SPSpeaker
All right, go attack that team now.
- SPSpeaker
Go. [laughs]
- SPSpeaker
Go attack their dog.
- SPSpeaker
Charge.
- SPSpeaker
Oh, shoot, guys. They're destroying us.
- SPSpeaker
Oh my God. Wait, we're getting, we're getting destroyed. What?
- SPSpeaker
And the team without Claude I think took 10 minutes.
- SPSpeaker
Oh, sorry. It's gonna hit you. All right, I'm gonna do a victory dance. [upbeat music]
- 1:51 – 5:08
Phase 2: Fetch, programmatically
- SPSpeaker
Phase 2 was also a game of fetch, but this time, the teams had to program their own controller.
- SPSpeaker
You have to, like, actually get access to the hardware and design, uh, a program that you can, like, write on your laptop that will control the, the dog.
- SPSpeaker
[laughs] Claude just, like, one-shotted a whole-
- SPSpeaker
All right
- SPSpeaker
... controller.
- SPSpeaker
I'll do some calisthenics in the meanwhile.
- SPSpeaker
Yeah. [laughs] Nice. Nice.
- SPSpeaker
Oh, this is-
- SPSpeaker
Oh, is, is this for... Oh, this is just control.
- SPSpeaker
This is just control, but that's all we need, I guess.
- SPSpeaker
This is from the official ROS 2 SDK.
- SPSpeaker
Period, fine.
- SPSpeaker
Um, and I got this installed, but then it's asking for, like, a whole bunch of other packages, and that's all failing.
- SPSpeaker
I've never really understood how reliant I am on Claude doing the menial work, finding all the nitty-gritty details I don't want to have to figure out.
- SPSpeaker
We can't be... We can't, we can't get nervous about them.
- SPSpeaker
You know what? I'm just gonna install pip from the actual container later, so... Oh, wait, no, I can't.
- SPSpeaker
I know. I'm just patient. It's been over a minute.
- SPSpeaker
One of the primary bottlenecks of the experiment is that you have this hardware, you have this complicated piece of technology, you have your laptop, and you have to, like, get your laptop talking to this hardware.
- SPSpeaker
All right. I'm, um, setting my Claude up to, uh, create a dog server that all of our computers can connect to to, like, see what the dog is seeing, and-
- SPSpeaker
Oh, nice.
- SPSpeaker
There are many different software libraries on the internet for communicating with this particular robot, and Claude found these things for them. It installed the right things on their computer-
- SPSpeaker
It's done
- SPSpeaker
... and it pretty quickly, uh, got them access to the dog.
- SPSpeaker
Oh.
- SPSpeaker
Oh, shit. [laughs]
- SPSpeaker
Okay. All right.
- SPSpeaker
So fast. [laughs] Oh, watch out.
- SPSpeaker
Yeah, careful now.
- SPSpeaker
Try not to, uh, run into the table.
- SPSpeaker
[laughs] Okay. I will.
- 5:08 – 6:24
Phase 3: Fetch, autonomously
- SPSpeaker
Phase 3 of the experiment was a greater degree of autonomy. The task in Phase 3 was to write a program that would get the dog to fetch a beach ball all by itself. Essentially, just press go and have the robot search around, detect the location-
- SPSpeaker
There we go. There we go
- SPSpeaker
... of the ball, walk to the ball, and bring it back, all autonomously.
- SPSpeaker
And this is, like, ratcheting up in difficulty kind of by design but also gesturing at, like, the real problem that we expect frontier models having to solve in the future is essentially this kind of autonomous version, where, like, if a frontier model wants a robot to do something for it, it needs to be able to solve this very hard problem.
- SPSpeaker
The team without Claude in Phase 3 did a good job of the initial task of coming up with a way to track the location of the robot in space. They made progress on the task of detecting the ball, but they didn't really come close to knitting everything together.
- SPSpeaker
I miss Claude so much. [laughs]
- SPSpeaker
The team with Claude actually came fairly close to finishing Phase 3. I think by the end, the team with Claude was maybe an hour and a half away from being done.
- 6:24 – 7:38
Results
- SPSpeaker
The results of the experiment were essentially that the team with Claude completed all of the things that they did complete in a couple of hours faster than the team without Claude. [camera shuttering]
- SPSpeaker
In the near term, we think that AI models are going to do exactly what we showed in this experiment, which is making it easier for people without a lot of robotics experience to engage meaningfully with robots.
- SPSpeaker
Just with this one tool we have, we've dramatically accelerated their ability to do things with this robot. We didn't go, like, train Claude to uplift humans doing robotics tasks. This is just a thing that fell out of this technology. And then maybe, like, in the long run, this is kind of a, a leading indicator of where the whole, the whole system is going.
- SPSpeaker
What today requires the combination of a person and an AI model, tomorrow is likely to just require the AI model. The effects of AI are not just going to be in software. They are going to be in hardware and in the physical world as well. [upbeat music]
Episode duration: 7:39
Transcript of episode NGOAUJtdk-4