Skip to content
ClaudeClaude

Live coding session with Boris Cherny and Jarred Sumner

Ever wonder how the Claude Code team uses Claude Code themselves? Watch Boris Cherney, Head of Claude Code, and Jarred Sumner, creator of Bun, livestream their everyday workflows together.

Jarred SumnerguestBoris Chernyhost
May 6, 202632mWatch on YouTube ↗

EVERY SPOKEN WORD

  1. SP

    [upbeat music] Please welcome to the stage Head of Claude Code of Anthropic, Boris Cherny, and creator of Bun at Anthropic, Jarred Sumner. [upbeat music]

  2. JS

    [applause] Hello. All right, so this is a developer conference. We're gonna be doing a little bit of talking, but mostly we're just gonna be, like, coding.

  3. BC

    Woo! [applause]

  4. JS

    So this is for the developers in the room.

  5. BC

    [applause]

  6. JS

    I'm gonna start by talking a little bit about how, uh, how Bun uses Claude Code to, to build and, and maintain Bun, and also kind of how our setup, uh, works and s- 'cause it's kind of a, a slightly more advanced setup than, uh, what's common today. Um, but first I'm gonna get a few agents running to just fix some GitHub issues. Uh...

  7. BC

    This is classic Jarred doing work during a talk. [laughs]

  8. JS

    Um, so, so in Bun's repo, uh, every time somebody submits an issue, uh, we have a Claude bot automatically run and try to reproduce the issue. So you can see this person has, uh, this, this side effects, and this is like one of the most recent issues. Um, and we, and we can see that Robobun, which is our, our, our bot, uh, uh, went and managed to reproduce the issue, uh, and submitted a PR automatically. Um, and s- and this PR is like, it has, uh, it, it... all these PRs always have tests. Um, it's one of the actual hard requirements before it can submit a PR. Um, uh, and so, so the challenge here is, like, does this code look correct? Um, and one of the things we do to, to check that is does the test fail in the previous version of Bun and pass, uh, in the, in this debug branch? And it, and the bot actually can't submit a PR without that, uh, being the case.

  9. BC

    And so this is... Just to make sure I understand, so this is, like, every single issue that goes up in the Bun issue tracker, you have Robobun automatically try to reproduce it before anyone looks at it?

  10. JS

    Yeah. And this saves a lot of time because we have so many open GitHub issues.

  11. BC

    Mm.

  12. JS

    Um, it, it really moves the challenge from is, uh, from just fixing and debugging the issue to is this the right thing to merge? Like, is this the right fix?

  13. BC

    Mm.

  14. JS

    Um-

  15. BC

    How, how, how good is it? Is it, is this doing like 100% of PRs? Is it, like, 10%?

  16. JS

    We can go to the Insights, uh, uh, and go to Contributors, and then if we go, uh, last three months, uh, uh, and this is specifically to main, we can see that Robobun is now a bigger contributor to Bun than I am. [laughs]

  17. BC

    [laughs] [clapping]

  18. JS

    And that's with merging not all of its PRs for sure. You can see we have a lot of PRs open right now. Um, the, the challenge is really how do we know can we merge the PR?

  19. BC

    Mm.

  20. JS

    Um, and that's the test. Um, and then the other thing that's really interesting about this is, uh, uh, we have automatic code review bots that, that run, um, and then they're going back and forth. So, like, CodeRabbit leaves a comment, and then Robobun leaves a comment. And then they go back and forth, and they, and Ro- CodeRabbit did the-

  21. BC

    [laughs]

  22. JS

    [laughs]

  23. BC

    I love this.

  24. JS

    Yeah. And, and it also marks the comments as resolved when it's done. Um, and you can see they actually went a lot. Like, there's a lot of back and forth here. There was, like, 30 comments or something.

  25. BC

    And so you're using, like, a combination of agents. So this is, like, code review. This is, like, Claude code review, and then also CodeRabbit, and, like, you're using them together.

  26. JS

    Yeah. And I, I think basically, like, CodeRabbit is, is good for, like, kind of stylistic issues and things that are like make sure that it follows the Claude MD. And then the, the Claude code review is really good at here's this really subtle edge case that would've taken me, like, 30 minutes of reading all the code and having all the context to, to, like, figure out.

  27. BC

    Mm.

  28. JS

    And, and so, so it's really good at surfacing bugs that you, you need the full context to really understand. Um, and I think basically it's, it's really hard to, to actually have all this automation without having code review that i- is in the loop with the, with Claude. They're replying, or replying is, like, very performative, but, like, at, like, fixing.

  29. BC

    Mm.

  30. JS

    Um, and that's also a big part of, like, what used to take so much time, uh, w-when, uh, w- like, why PRs would take so much time to merge is because you'd have to, like, like, check out the branch locally, fix a lint error, then run the linter locally, then push it back up, and there's all this switching cost that's constantly there.And so when you-- so I think this is, like, an especially good use case for, for LLMs because otherwise, like, it just takes up so much time to, to ship.

Episode duration: 32:00

Install uListen for AI-powered chat & search across the full episode — Get Full Transcript

Transcript of episode DlTCu_pNDHE

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome