EVERY SPOKEN WORD
25 min read · 4,734 words- 0:00 – 0:30
Introductions
- BABrad Abrams
'Cause as a developer, like, my creativity ends at some point.
- AAAlex Albert
Yeah.
- BABrad Abrams
I can only think of so many use cases.
- AAAlex Albert
Right.
- BABrad Abrams
But the model, like, anything, anything somebody comes with, the model will figure out a way-
- AAAlex Albert
Right
- BABrad Abrams
... to go do that thing. [upbeat music]
- AAAlex Albert
Hey, I'm Alex. I lead Claude Relations here at Anthropic. Today we're talking about building the future of agents with Claude, and I'm joined by my colleagues.
- BABrad Abrams
I'm Brad. I run the PM team on the Claude Developer Platform.
- KLKatelyn Lesse
I'm Katelyn. I lead the Engineering team for the Claude Developer
- 0:30 – 2:30
What is the Claude Developer Platform?
- KLKatelyn Lesse
Platform.
- AAAlex Albert
Let's talk about the Claude Developer Platform. [chuckles]
- BABrad Abrams
Yeah, let's start with that.
- AAAlex Albert
Uh, let's start with that.
- KLKatelyn Lesse
Start there.
- AAAlex Albert
It used to be called the Anthropic API.
- BABrad Abrams
Yeah.
- AAAlex Albert
We just went through a big name change.
- BABrad Abrams
Yeah.
- AAAlex Albert
Can you walk me through why we made that change, and also what this new platform is and what it encompasses?
- KLKatelyn Lesse
Yeah, totally. So the Claude Developer Platform really encompasses our APIs, our SDKs, our documentation, all of our experiences within the console, and really everything that a developer needs to actually build on top of Claude. We're really humbled, proud to serve, um, some really awesome customers around the world who are trying to, what we like to say, raise the ceiling of intelligence-
- AAAlex Albert
Right
- KLKatelyn Lesse
... um, using Claude. Um, and the platform really enables them to do that. Um, and I would say one of my favorite parts about it is the platform doesn't just serve customers, uh, externally, the platform actually serves our internal product.
- AAAlex Albert
Mm.
- KLKatelyn Lesse
So, um, we love telling people, like Claude Code, for example, is actually built directly on our public platform.
- AAAlex Albert
I see.
- BABrad Abrams
Yeah. I mean, uh, I think when we started, we were just the Anthropic API, and very simple access to the model. But over the last year or so, we've added so many features to it. Um, we added prompt caching, we added a whole separate batch of API, uh, we added web search, a web fetch, we have this context management support, the code execution. So all these tools-
- AAAlex Albert
Yeah.
- BABrad Abrams
Now, you know, now it's kind of we feel like, yeah, it's aspirationally where it's re- it's a platform now.
- AAAlex Albert
I see. So there's just a lot more to it now.
- BABrad Abrams
Yeah.
- AAAlex Albert
It's evolved in pretty drastic way over the past year, and-
- BABrad Abrams
Yeah. Yeah, I think so
- AAAlex Albert
... better naming.
- BABrad Abrams
Yeah.
- AAAlex Albert
Um-
- BABrad Abrams
And I think that's what developers were sort of calling it anyway.
- AAAlex Albert
Yeah. Yeah.
- BABrad Abrams
You know, so it's always natural to just sort of go with what developers are saying.
- AAAlex Albert
Right. We were a little late to the game there.
- 2:30 – 3:15
What is an AI agent
- BABrad Abrams
Yeah. I mean, agents is... It's almost sort of a buzzword, right?
- AAAlex Albert
Yeah.
- BABrad Abrams
Like, everybody you talk to now is building agents, and, and whenever a industry tech term gets to that level, you know, the definition gets very gray, everything everybody builds is an agent. But Anthropic, what we really think about a agent is where the model is taking some, uh, autonomy to be able to choose what tools to call, to call those tools, to handle the results, and how to choose the next step. So, uh, as a, as a foundational research lab, like leaning into the model and be, and, and what it, its reasoning, how it decides what to do, we think that's a really important element of what an agent is.
- AAAlex Albert
Mm. So it's kind of like the, the aspect of it being autonomous in some sense.
- BABrad Abrams
Yeah.
- 3:15 – 4:00
Building frontier intelligence for AI agents
- BABrad Abrams
Yeah. Yeah.
- AAAlex Albert
Charting its own course.
- BABrad Abrams
I mean, I think there's also re- I mean, we have customers doing really useful workflows where they're sort of predefining the path that Claude-
- AAAlex Albert
Mm-hmm
- BABrad Abrams
... should walk, and that, that is a super useful thing to do. But what's nice about the agentic thing is as the model gets better, every couple of months, you know, we release a new model, and it- with a true agentic pattern, you know, those services are just gonna get better. Where, where if you build a workflow with a lot of scaffolding in it, you kinda put bounds on the model-
- AAAlex Albert
Mm
- BABrad Abrams
... which is maybe okay in some use cases, but that means that you're- you may not take advantage of the next level of intelligence that a next model release gets.
- AAAlex Albert
Yeah. So it seems like there's this interesting trend with agents, at least over the past 6 to 12 months-
- BABrad Abrams
Mm-hmm
- AAAlex Albert
... where, like you've said, the scaffolding
- 4:00 – 5:05
Reducing model scaffolding to build better agents
- AAAlex Albert
has been-
- BABrad Abrams
Yeah
- AAAlex Albert
... a bit of a hindrance, and maybe we're dropping some of that.
- BABrad Abrams
Mm-hmm.
- AAAlex Albert
Um, can you explain the intuitions behind that around is, is this actually the future? Is like we give less and less things to the model?
- BABrad Abrams
Yeah. I mean, I think over time what we're seeing is the scaffolding the model needs to be able to accomplish tasks is, is it's needing, it's needing less. As the, as the level of intelligence of the model goes up, and we have every... we believe is gonna keep going up-
- AAAlex Albert
Yeah
- BABrad Abrams
... um, that basically the model has more contextual understanding of the high level task that it's trying to accomplish, so therefore it doesn't need as many sort of guardrails.
- AAAlex Albert
Right.
- BABrad Abrams
And in fact, those guardrails in some cases become some, uh, like a liability to have. Uh, we've had customers try out new models and say, "Oh, well, it's actually only just a little bit better." And then we kinda look into it with them about what's going on, and it turns out, well, yeah, they were constraining it in ways that makes it harder for them to see the intelligence of the model.
- AAAlex Albert
Ah. Does this match what we see in the field with, like, our customers where they're also following these
- 5:05 – 6:40
The evolution of agentic frameworks
- AAAlex Albert
same trends? I know at the limit we have customers exploring all sorts of innovative techniques for managing Claude.
- KLKatelyn Lesse
Yeah, totally. And there's actually a lot of, like, discourse about this right now, right?
- AAAlex Albert
Mm-hmm.
- KLKatelyn Lesse
Like, what is an agent and, and what does it need, what do you need to build? And, and there are people saying, you know, "It's just a while loop."
- AAAlex Albert
Yeah.
- BABrad Abrams
Right.
- KLKatelyn Lesse
Like, you don't have to try that hard.
- AAAlex Albert
Yeah. Right.
- KLKatelyn Lesse
And, um, I think ultimately, uh, there's a lot of, there's been a lot of evolution of frameworks that-
- AAAlex Albert
Mm
- KLKatelyn Lesse
... people are putting around the model that are helping them orchestrate their agents, try to get the most out of the model. And, um, I think what, uh, the industry is maybe kind of circling around is a lot of that has become maybe too heavy-
- AAAlex Albert
Mm
- KLKatelyn Lesse
... and maybe too opinionated, um, and which is why you kinda get the, the people coming back to, like, "It's just a while loop, and that is all you need." Um, and I think what we're trying to, to do there is to say-
- BABrad Abrams
Maybe in a lot of ways it is a while loop, but the things we can more uniquely do to help people get the most out of the model is a lot of those tools, those features, and otherwise.
- AAAlex Albert
Mm-hmm.
- BABrad Abrams
And so what we wanna do is put, um, you know, frameworks and tools and platform out there that is opinionated to some extent-
- AAAlex Albert
Mm-hmm
- BABrad Abrams
... on how people should use those tools. Um, but it's not this like super heavy framework that really like, to Brad's point, gets in the way-
- AAAlex Albert
Mm-hmm
- BABrad Abrams
... of what the model's ultimately trying to do. So to strike the right balance, it's like, you know, we've, we've seen what a lot of people have tried to do, so we know we can be opinionated there, um, but we wanna be lightweight [chuckles] in the way that we're doing that-
- AAAlex Albert
Right
- BABrad Abrams
... and make sure that the real thing we're doing is helping you get the most out of the model, um, without, you know, bogging you down in some super heavy framework.
- AAAlex Albert
Right. So would you describe part
- 6:40 – 8:35
Unhobbling the model with tools like web fetch
- AAAlex Albert
of the strategy here then as providing these auxiliary tools and things that we can give to the model-
- BABrad Abrams
Mm-hmm
- AAAlex Albert
... but we're not necessarily, like, placing the bumper-
- BABrad Abrams
Right
- AAAlex Albert
... spawn, like the model itself or-
- BABrad Abrams
Yeah. We think about, we think about it as like, how do you unhobble the model?
- AAAlex Albert
Unhobble, yeah.
- BABrad Abrams
Like, the model already has a lot of capabilities, and in fact, I'm convinced that even if you take a current generation of models, there's way more intelligence in there than we've been able to unlock.
- AAAlex Albert
Mm.
- BABrad Abrams
But anyway, that intuition is if you just give the model, like, the tools it needs, and like set it, set it free-
- AAAlex Albert
Mm-hmm
- BABrad Abrams
... let it be able to use those in the right way, um, you'll get great results.
- AAAlex Albert
Mm.
- BABrad Abrams
And, and I think like a good example of that is we launched this server side, uh, web search tool and we- and web fetch tools, and it's been interesting to watch customers use those. And, you know, all we did real... I mean, it's a very minimal prompt that we have. We just give it the web search tool, and, like, all of a sudden, deep research tasks are, like, almost completely done-
- AAAlex Albert
Mm
- BABrad Abrams
... with, with just turning on that, uh, switch on the API because the model will call that tool, it'll look at its results, it'll say, consider it, and say, "Okay, maybe I need to call, you know, do these other searches."
- AAAlex Albert
Right.
- BABrad Abrams
And then, "Oh, that fourth link you returned, that's the great one."
- AAAlex Albert
Right.
- BABrad Abrams
It'll do a web fetch on that link and bring that data back, and really all that very autonomously on its own kind of deciding.
- AAAlex Albert
Right. I, I think it's almost kind of like an interesting shift in, like, where the intelligence of a system is being applied.
- BABrad Abrams
Exactly, yeah.
- AAAlex Albert
From like the developer having to apply their intelligence to guiding-
- BABrad Abrams
Right
- AAAlex Albert
... towards like the model now-
- BABrad Abrams
Right
- AAAlex Albert
... figuring it out.
- BABrad Abrams
And it's so exciting when the model does it because as a developer, like, my creativity ends at some point.
- AAAlex Albert
Yeah.
- BABrad Abrams
I can only think of so many use cases.
- 8:35 – 10:50
Building agents with the Claude Agent SDK (formerly the Claude Code SDK)
- AAAlex Albert
the developer platform, what do you recommend? What are some best practices or ways for me to get started?
- BABrad Abrams
Yeah. So, um, super tactically, actually, the number one thing that we recommend right now is the Claude Code SDK.
- AAAlex Albert
Mm-hmm.
- BABrad Abrams
Um-
- AAAlex Albert
Okay
- BABrad Abrams
... and what's really, really interesting about the Claude Code SDK is we essentially built an agent harness, an agentic harness, um, around the model to run that loop, right, and automate a lot of that tool calling and otherwise feature use. And obviously, originally was built for coding purposes. Um, and what, uh, the team really quickly figured out was like, actually, this is like an excellent general purpose- [chuckles]
- AAAlex Albert
Mm-hmm
- BABrad Abrams
... agentic harness. Um, and so what the SDK does is it gives people a perfect out-of-the-box solution to actually just start prototyping agents-
- AAAlex Albert
Mm
- BABrad Abrams
... um, without having to go and build, you know, the loop with all the tool calling and otherwise. It's built on top of the messages API and all those same tools that, um, we're mentioning, but it kinda gives you that really great starting place right out of the box.
- AAAlex Albert
Right. I feel like this is a pretty common misconception, at least when I talk to developers, about the Claude Code SDK.
- BABrad Abrams
Mm-hmm.
- AAAlex Albert
So like, I'm not building a coding application.
- BABrad Abrams
Mm-hmm, mm-hmm.
- AAAlex Albert
Like, why would I wanna use this?
- BABrad Abrams
Yeah.
- AAAlex Albert
Mm-hmm. But you can kind of remove the coding-specific-
- BABrad Abrams
Yeah
- AAAlex Albert
... parts, right?
- BABrad Abrams
Yeah. I mean, I think that's a great example of what we were talking about, removing scaffolding on the model. It's like once we got done removing things-
- AAAlex Albert
Mm-hmm
- BABrad Abrams
... from the Claude, from Claude Code to really unhobble the model, it turns out there was [chuckles] nothing coding left.
- AAAlex Albert
Right.
- BABrad Abrams
When the, when you remove everything else, then it's just an agentic loop, and you're, you're really a minimalistic thing to give, uh, Claude access to, to a file system-
- AAAlex Albert
Yeah
- BABrad Abrams
... to a set of like Linux command line tools, um, to the ability to, you know, write code and execute that code. So those are all very generic kind of-
- AAAlex Albert
Right
- BABrad Abrams
... capabilities that turns out could solve a wide variety of problems.
- AAAlex Albert
Right. Yeah. I feel like something I've been running up to in like my own side projects and also seeing with-
- BABrad Abrams
Mm-hmm
- 10:50 – 14:35
Best practices for identifying agentic use cases
- AAAlex Albert
think that's super interesting.
- BABrad Abrams
Yeah. I mean, I think, I think the other really interesting to- thing to think about, especially for businesses looking at a- agents, is like, what use case to go target.
- AAAlex Albert
Right.
- BABrad Abrams
So, uh, thinking beyond the technology, like, what is the actual problem to go solve? And I, I, I think, you know, we've s- we see a lot of customers and like doing a lot of things. We love all of it, but where, you know, the biggest impacts are is where the customer has thought hard about what's the business value of this.
- AAAlex Albert
Mm.
- BABrad Abrams
Like, will it actually save this many engineering hours, or will it help us remove this much, uh, uh, manual work or whatnot? And being able to articulate, like, what you expect the outcome of the agent project to be, I think is really helpful in, in defining the scope, uh, of the agent.
- AAAlex Albert
Right. And t- tying back one more time to the SDK, so it seems like-
- BABrad Abrams
Yeah
- AAAlex Albert
... it's been really, really useful for, like, individual developers like myself-
- BABrad Abrams
Yep, yep, yep
- AAAlex Albert
... you know, starting out-
- BABrad Abrams
Yep
- AAAlex Albert
... and just wanting to get hacking on something really fast. For these customers, for enterprises that are actually trying-
- BABrad Abrams
Right
- AAAlex Albert
... to get real business value-
- BABrad Abrams
Right
- AAAlex Albert
... out of these things, should they be using the SDK? Is it ready for them? Is it ready for scaled use like that?
- KLKatelyn Lesse
Yeah. So, um, I think in a lot of ways it is. In a lot of ways, if you are in a spot where you can... Like, you can deploy that runtime. Essentially, that's what you get out of the SDK-
- AAAlex Albert
Yeah.
- KLKatelyn Lesse
-is an agentic loop runtime. Um, you can go and deploy that runtime wherever you want, [chuckles] whenever you're ready to do so. But I think what we're really trying to do is take the spirit of what the SDK unlocks for people, like go kind of up to that higher order abstraction where we give you the loop, we give you a lot of the tool calling in an automated way, um, and say, "How can we learn from that and give people out-of-the-box solutions that, like, at scale-"
- AAAlex Albert
Mm-hmm.
- KLKatelyn Lesse
-um, will really be able to solve for their use cases?" And I think that's a lot of where we're kind of trying to go with our roadmap throughout the rest of the year. Um, and one really important bit when we think about that is if the entire goal here is to help our users, like, really raise that ceiling of intelligence, get the absolute best outcome out of the models, then higher order abstractions are not just make it easier because you don't have to write all that code yourself. It's actually, like, how can we, like, really truly help you get the best outcome-
- AAAlex Albert
Mm-hmm.
- KLKatelyn Lesse
-because we, uh, we're in the room with research, we're in the room with inference. Like, we know how to make sure that our abstractions, our agentic loop is going to be, uh, extremely powerful-
- AAAlex Albert
Right.
- KLKatelyn Lesse
-and extremely good at working with Claude. Um, and the last thing that I would add in there is especially as these things get longer running, and as we provide more and more tooling to help people get at those longer running tasks, um, another big problem that our users we know are gonna keep trying to solve is, uh, observability within-
- AAAlex Albert
Mm-hmm. Mm-hmm.
- KLKatelyn Lesse
-those longer running tasks. Um, and so that's, that's one of the most common things that comes up for folks is, you know, I, I have these long-running tasks, I'm trying to get, um, these really great outcomes. But, um, you know, I might need to do some steering, or I might need to tune my prompt, or I might need to think about tool calling a little differently. And, um, that's something that we know we can give people that observability through the platform over time-
- AAAlex Albert
Mm-hmm. Mm-hmm.
- KLKatelyn Lesse
-and that's another big area of focus for us.
- 14:35 – 19:00
Best practices for managing context and memory with Claude
- AAAlex Albert
of how-
- BABrad Abrams
Hmm.
- AAAlex Albert
-we're gonna address that. Um, before I do, uh, is there other tools that exist right now that folks should be aware of when they're getting started with the developer platform? Things have... you've found helpful or useful?
- BABrad Abrams
Yeah. I mean, I think there's a c- so we mentioned, uh, web search and web fetch. Uh, I think an-another big thing that we're seeing is, um, customers, m- uh, have to do... right now have to do a lot of work to manage the context window. So by default, Claude has 200k tokens of context. We have a mil-million token available now in beta on Sonnet, which is great, but even a million, there's a limit there. Uh, and what, what many customers have told us is that, um, they get better outputs, higher intelligence if they, uh, even use a smaller part of the context. And so we've done... We have a, a couple of cool features that are just coming out to help developers manage that context.
- AAAlex Albert
Hmm.
- BABrad Abrams
So in these agentic loops, a lot of times you're doing ten, 15, 100 tool calls, and you edit this file or look up data in this database or, uh, you know, send this email, and each of those tool calls takes up like a, you know, 100, 200, 1,000 tokens.
- AAAlex Albert
Right.
- BABrad Abrams
Uh, and so we have this cool feature that lets you, uh, lets the model actually remove some of the older tool calls that are not needed anymore.
- AAAlex Albert
Interesting.
- BABrad Abrams
Uh, and that give, that gives just, just like you, if you declutter your desk and declutter your notebook-
- AAAlex Albert
Yeah.
- BABrad Abrams
-like, you can focus a little bit better. So if you declutter the prompt, actually, the model can actually focus a little bit better.
- AAAlex Albert
Oh, interesting. So okay, we're removing unnecessary context.
- BABrad Abrams
Mm-hmm.
- AAAlex Albert
Is there a risk that we remove necessary context?
- BABrad Abrams
Yeah.
- AAAlex Albert
How does that work?
- BABrad Abrams
Yeah, yeah. Yeah, yeah. So, um, we have some, some guardrails and some-
- AAAlex Albert
Okay.
- BABrad Abrams
-bounds around it, so you don't... But g-the general rule is if you, um, remo- we try to remove the tools that are, like, several turns back.
- AAAlex Albert
Okay.
- BABrad Abrams
That the model's already made decisions based on those tools. But if you... AI was playing with it, uh, recently, and r-I removed the tools that it was just called-
- AAAlex Albert
Mm.
- BABrad Abrams
-and it's, "Oh, my tool results are gone. I don't know what to do."
- AAAlex Albert
Right.
- BABrad Abrams
And then the... but the mo- the Sonnet doesn't give up. Like, it's like, "I'm just gonna call this tool again."
- AAAlex Albert
Yeah.
- BABrad Abrams
You know?
- AAAlex Albert
Yeah, yeah, yeah.
- BABrad Abrams
Um, but yeah. So generally, we have put some bounds on that because of that experience. So we, we do preserve the most recent set of tools.
- 19:00 – 22:10
The future of the Claude Developer Platform (observability, computer use, and other ways to unhobble the model)
- AAAlex Albert
here. So it sounds like there's a ton of new features that we've recently launched. There's a lot of momentum, and now there's other offerings as well, like the Claude Code SDK-
- BABrad Abrams
Mm-hmm
- AAAlex Albert
... and things coming out soon. Um, what are you most excited about, Katelyn? What's the, what's the future looking like here in the next six to twelve months?
- KLKatelyn Lesse
Yeah. So we talked a little bit about these higher orders of abstraction where we can, um, really just make it, uh, as, as simple as possible for you to get the absolute best outcomes out of Claude. Um, and we wanna pair that with the observability that we talked about, um, so that you can really like, you know, see the data and take those insights from those longer running tasks. Um, and if you combine these things together and start to think about some of the capabilities like memory that Brad just talked about, you can really start to see this flywheel where over time, we're not just able to help you get the best outcomes out of Claude, but we can help you get self-improving and continuously-
- BABrad Abrams
Mm. Mm-hmm
- KLKatelyn Lesse
... improving outcomes out of Claude. And that to me is kind of the, the like galaxy brain magic of the roadmap, is get to a point where, um, you know, we, we have people coming to us, they're building on Claude, they're, they have their tasks, they know what they're trying to do, um, and they get these like really like aha moments where over time it's getting better and better and better. Um, and you know, that is... that's kind of the biggest thing that in everything that we're doing, we're trying to make sure we're going after.
- AAAlex Albert
That's awesome.
- BABrad Abrams
Yeah, I mean, I guess I'd have to say I'm a- I, I'm always excited about model launches.
- AAAlex Albert
Yeah.
- BABrad Abrams
Like, it's like Christmas, like what, how, what will, will it, what will be possible now? So I love playing with the model launches as they come out, just unlocks more use cases. Some use cases that, you know, we've been working hard on and, and trying to improve, which is satisfying to see, but also some things, oh, we had no idea the model would be able to do this thing.
- AAAlex Albert
Right.
- BABrad Abrams
You know, now it draws ASCII pictures so much better.
- AAAlex Albert
Yeah [laughs] .
- BABrad Abrams
Or what, you know, whatever.
- AAAlex Albert
The important things [laughs] .
- BABrad Abrams
The very important things. But beyond that, the other thing I'm really excited about is, um, we're, we're in the early stages of giving Claude a computer. You know, I think about if we, uh, hire an employee here at Anthropic, and we, we welcome them, "Here's your first day," but we don't give them a computer.
- AAAlex Albert
Yeah.
- BABrad Abrams
Like, they would not be very successful [laughs] at Anthropic.
- AAAlex Albert
Right.
- BABrad Abrams
So like right now, essentially everybody u- is using Claude and, and it doesn't have a computer. So I'm, I'm really excited about giving Claude a computer, and I... you see like the very baby steps of that-
- AAAlex Albert
Mm-hmm
- BABrad Abrams
... with the code execution tool-
- AAAlex Albert
Mm-hmm
- BABrad Abrams
... uh, where the, the model can write code, execute it on the VM, and get the results back. So like it can like zoom in on images or take a, a Excel spreadsheet and create like amazing data analysis with charts and graphs, and that's just the baby step. Like, what if it had a persistent computer-
- AAAlex Albert
Right
- BABrad Abrams
... that was always there and it could like organize the files in there the way it needed-
- AAAlex Albert
Yeah
- BABrad Abrams
... and get the tools set up the way it wanted and, um, I just think there's a lot of headroom-
- AAAlex Albert
Right
- BABrad Abrams
... to, to that scenario.
Episode duration: 22:10
Install uListen for AI-powered chat & search across the full episode — Get Full Transcript
Transcript of episode XuvKFsktX0Q
Get more out of YouTube videos.
High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.
Add to Chrome