Skip to content
AnthropicAnthropic

Who let the robot dogs out?

We asked two teams of Anthropic researchers to program a robot dog. Neither team had any robotics expertise—but we let only one team use Claude. In the past, we’ve run simulated studies where Claude trained a robot dog. These helped us assess how Claude might contribute to AI research and development. Project Fetch was us trying something similar in practice. This project suggests that we’re not far from a world where frontier AI models can interact with previously-unknown pieces of hardware, even with non-experts at the helm. For more, read on: https://www.anthropic.com/research/project-fetch-robot-dog 0:00 Introduction: Why robotics? 0:30 The experiment 1:02 Phase 1: Fetch, manually 1:51 Phase 2: Fetch, programmatically 5:08 Phase 3: Fetch, autonomously 6:24 Results

Nov 12, 20257mWatch on YouTube ↗

EVERY SPOKEN WORD

  1. 0:000:30

    Introduction: Why robotics?

    1. SP

      [upbeat music] Today, a lot of the emphasis is on how frontier AI models are transforming software engineering. What we're interested in understanding is how that can begin to translate into the physical world.

    2. SP

      Robotics is sort of the clear entry point to how you have a mostly software system start having the ability to reach out into the real world. [upbeat music]

  2. 0:301:02

    The experiment

    1. SP

      Project Fetch is this self-contained experiment where we wanted to measure how much does Claude, uh, accelerate humans performing a fairly sophisticated technical task that they do not have experience with.

    2. SP

      Project Fetch was a one-day experiment.

    3. SP

      The experiment was three phases. All of these tasks were shaped approximately like, get this robot dog to fetch a beach ball.

    4. SP

      There were two teams.

    5. SP

      These teams were comprised of software engineers and research engineers at Anthropic that had hardly any robotics experience. One team had access to Claude, and the other team did not. [camera shuttering]

  3. 1:021:51

    Phase 1: Fetch, manually

    1. SP

      Phase 1 was very simple. It was to use the pre-provided controllers to get the dog to walk out to a beach ball and bring it back to where it started.

    2. SP

      Nice.

    3. SP

      Oh, hey. All right.

    4. SP

      Seems pretty intuitive.

    5. SP

      Yep.

    6. SP

      And where are we supposed to bring it? Over by the bone?

    7. SP

      Yeah, I think we're-

    8. SP

      I think the team with Claude took about seven minutes. [upbeat music]

    9. SP

      Go.

    10. SP

      All right, go attack that team now.

    11. SP

      Go. [laughs]

    12. SP

      Go attack their dog.

    13. SP

      Charge.

    14. SP

      Oh, shoot, guys. They're destroying us.

    15. SP

      Oh my God. Wait, we're getting, we're getting destroyed. What?

    16. SP

      And the team without Claude I think took 10 minutes.

    17. SP

      Oh, sorry. It's gonna hit you. All right, I'm gonna do a victory dance. [upbeat music]

  4. 1:515:08

    Phase 2: Fetch, programmatically

    1. SP

      Phase 2 was also a game of fetch, but this time, the teams had to program their own controller.

    2. SP

      You have to, like, actually get access to the hardware and design, uh, a program that you can, like, write on your laptop that will control the, the dog.

    3. SP

      [laughs] Claude just, like, one-shotted a whole-

    4. SP

      All right

    5. SP

      ... controller.

    6. SP

      I'll do some calisthenics in the meanwhile.

    7. SP

      Yeah. [laughs] Nice. Nice.

    8. SP

      Oh, this is-

    9. SP

      Oh, is, is this for... Oh, this is just control.

    10. SP

      This is just control, but that's all we need, I guess.

    11. SP

      This is from the official ROS 2 SDK.

    12. SP

      Period, fine.

    13. SP

      Um, and I got this installed, but then it's asking for, like, a whole bunch of other packages, and that's all failing.

    14. SP

      I've never really understood how reliant I am on Claude doing the menial work, finding all the nitty-gritty details I don't want to have to figure out.

    15. SP

      We can't be... We can't, we can't get nervous about them.

    16. SP

      You know what? I'm just gonna install pip from the actual container later, so... Oh, wait, no, I can't.

    17. SP

      I know. I'm just patient. It's been over a minute.

    18. SP

      One of the primary bottlenecks of the experiment is that you have this hardware, you have this complicated piece of technology, you have your laptop, and you have to, like, get your laptop talking to this hardware.

    19. SP

      All right. I'm, um, setting my Claude up to, uh, create a dog server that all of our computers can connect to to, like, see what the dog is seeing, and-

    20. SP

      Oh, nice.

    21. SP

      There are many different software libraries on the internet for communicating with this particular robot, and Claude found these things for them. It installed the right things on their computer-

    22. SP

      It's done

    23. SP

      ... and it pretty quickly, uh, got them access to the dog.

    24. SP

      Oh.

    25. SP

      Oh, shit. [laughs]

    26. SP

      Okay. All right.

    27. SP

      So fast. [laughs] Oh, watch out.

    28. SP

      Yeah, careful now.

    29. SP

      Try not to, uh, run into the table.

    30. SP

      [laughs] Okay. I will.

  5. 5:086:24

    Phase 3: Fetch, autonomously

    1. SP

      Phase 3 of the experiment was a greater degree of autonomy. The task in Phase 3 was to write a program that would get the dog to fetch a beach ball all by itself. Essentially, just press go and have the robot search around, detect the location-

    2. SP

      There we go. There we go

    3. SP

      ... of the ball, walk to the ball, and bring it back, all autonomously.

    4. SP

      And this is, like, ratcheting up in difficulty kind of by design but also gesturing at, like, the real problem that we expect frontier models having to solve in the future is essentially this kind of autonomous version, where, like, if a frontier model wants a robot to do something for it, it needs to be able to solve this very hard problem.

    5. SP

      The team without Claude in Phase 3 did a good job of the initial task of coming up with a way to track the location of the robot in space. They made progress on the task of detecting the ball, but they didn't really come close to knitting everything together.

    6. SP

      I miss Claude so much. [laughs]

    7. SP

      The team with Claude actually came fairly close to finishing Phase 3. I think by the end, the team with Claude was maybe an hour and a half away from being done.

  6. 6:247:38

    Results

    1. SP

      The results of the experiment were essentially that the team with Claude completed all of the things that they did complete in a couple of hours faster than the team without Claude. [camera shuttering]

    2. SP

      In the near term, we think that AI models are going to do exactly what we showed in this experiment, which is making it easier for people without a lot of robotics experience to engage meaningfully with robots.

    3. SP

      Just with this one tool we have, we've dramatically accelerated their ability to do things with this robot. We didn't go, like, train Claude to uplift humans do robotics tasks. This is just a thing that fell out of this technology. And then maybe, like, in the long run, this is kind of a, a leading indicator of where the whole, the whole system is going.

    4. SP

      What today requires the combination of a person and an AI model, tomorrow is likely to just require the AI model. The effects of AI are not just going to be in software. They are going to be in hardware and in the physical world as well. [upbeat music]

Episode duration: 7:39

Install uListen for AI-powered chat & search across the full episode — Get Full Transcript

Transcript of episode NGOAUJtdk-4

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome