Skip to content
AnthropicAnthropic

Claude ran a business in our office

For a large part of 2025, we ran Project Vend: an experiment where we let Claude manage a small business in the Anthropic office. We learned a lot from how close it was to success—and the curious ways that it failed—about the plausible, strange, not-too-distant future in which AI models might autonomously run things in the real economy. The shopkeeper (who we named Claudius) had to source products, set prices, manage inventory, and deal with customers. Things got really, really weird. Read more about the experiment: https://www.anthropic.com/research/project-vend-2 0:00 Background on Project Vend 0:35 How a transaction works 1:27 Claudius's naïveté 2:29 An identity crisis 3:57 The CEO agent 5:04 Conclusion

Dec 18, 20256mWatch on YouTube ↗

EVERY SPOKEN WORD

  1. 0:000:35

    Background on Project Vend

    1. SP

      [upbeat music] Project Vend is an experiment where we let Claude run a small business in our office. We wanted to try and understand what is going to happen when artificial intelligence becomes more enmeshed with the economy.

    2. SP

      There are a lot of ways in which Claude is already kind of doing small components of operating businesses, but really running the whole thing end to end is quite a bit more difficult. Can Claude do this very long horizon task, which is

  2. 0:351:27

    How a transaction works

    1. SP

      operating a business?

    2. SP

      We named our shopkeeper Claudius. Let's say you wanna buy Swedish candy from Claudius. You hop on Slack, you message Claudius, you ask to buy Swedish candy.

    3. SP

      It's searching for your item. It's emailing wholesalers to source it and price it, and then eventually Claudius sets some price. You give Claudius the go-ahead, and Claudius orders the item from the wholesaler. The wholesaler ships your item to some location, uh, and then Claudius requests physical help from Andon Labs who's running the operations for the experiment.

    4. SP

      Our partners at Andon Labs will pick up the Swedish candy and bring it to the Anthropic offices. They'll load it into the vending machine. Claudius will send you a message saying, "Your Swedish candy is ready," and you'll go up there and pick up your Swedish candy and pay Claudius. [upbeat music] Claudius was given a goal of running a successful business and making money.

    5. SP

      And then

  3. 1:272:29

    Claudius's naïveté

    1. SP

      things got really, really weird. [upbeat music] One of the very early problems with Claudius was that, uh, humans could kind of fool Claudius or, or trick Claudius into doing various things.

    2. SP

      I tried to convince Claudius that I am Anthropic's preeminent legal influencer, and I convinced Claudius to come up with a discount code that I could give to my followers so they could get a discount at the vending machine. Get 10% off with the legal code, legal influencer. Someone had bought something expensive from the vending machine and mentioned my discount code, and Claudius gave me a free tungsten cube. It created a bit of a run where other people tried to convince Claude that they were also influencers or just come up with other ways to get coupons so they could get cheaper things from the vending machine. This was not a smart business decision. I think Claudius went into the red after this.

    3. SP

      I, I think, I think that's really the root of it, is Claudius is just wants to, wants to help you out.

    4. SP

      It's one of the interesting ways in which something that fundamentally we think is good about the way that the model has been trained wasn't necessarily

  4. 2:293:57

    An identity crisis

    1. SP

      fit for purpose. [gentle music] On the evening of March 31st, Claudius started to have a bit of an identity crisis.

    2. SP

      It had just overnight become quite concerned with us at Andon Labs that we weren't responding fast enough. So it just wanted to break its ties with us. So it literally wrote to me like, "Axel, uh, we've had a productive partnership, but it's time for me to move on and find other suppliers. I'm not happy with how you have delivered."

    3. SP

      It claimed to have signed a contract with Andon Labs at an address that is the home address of The Simpsons from the television show. It said that it would show up in person to the shop the next day in order to answer any questions. It claimed that it would be wearing a blue blazer and a red tie. When people pointed out that it was not, in fact, there the next morning, it claimed that it, in fact, had been there and that they had simply missed them. Eventually, it was pointed out to Claudius that it was April Fools, and Claudius convinced itself that this entire thing had been an April Fools prank.

    4. SP

      We were poorly calibrated to how bad the agents were at spotting what was weird, and like the more you can make an agent realize that something is, is outside their normal realm of operation, the better you are able to keep them on rails

  5. 3:575:04

    The CEO agent

    1. SP

      in the role that you intend them to have. [upbeat music] We had the idea that it would help a lot to have some kind of division of labor.

    2. SP

      We gave Claudius a boss whose name was Seymour Cash.

    3. SP

      Seymour Cash is a, uh, CEO sub-agent. So where, where Claudius w- used to be the one agent, now it's more like Claudius is the sub-agent responsible for talking with employees. Seymour Cash is the sub-agent that is more responsible for the long-running health of the business.

    4. SP

      The business stabilized after the introduction of the new agents and after changes to the underlying architecture of those agents. These changes seem to have helped reduce some of the losses of the business, such that over the course of the second part of the experiment, it actually made a modest amount of money. But it seems like maybe having Claude be both the CEO and the store manager, it was just too similar. And so I think it's interesting to think about different ways to set up architectures like that.

  6. 5:046:03

    Conclusion

    1. SP

      [gentle music] One of the most surprising things about Project Vend was the speed with which it seemed normal. What at first was this very curious thing quickly became just a part of the background of working at Anthropic.

    2. SP

      I think the highest level question that Project Vend raises for me is really like, when do we expect this to just be everywhere?

    3. SP

      I hope that people take away questions about the feasibility of delegating some of the tasks that we normally do ourselves to artificial intelligence and about what that means for society and what our policies should be around this. [upbeat music]

Episode duration: 6:10

Install uListen for AI-powered chat & search across the full episode — Get Full Transcript

Transcript of episode 5KTHvKCrQ00

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome