Getting more out of the Claude Platform

Cut cost, manage context, boost intelligence. In this session, we'll show you how to put our latest platform capabilities to work. Through live demos you'll see what great prompt caching looks like, learn to keep context lean for long-running agents with tool search, programmatic tool calling, and compaction, and use the advisor strategy for a cost-effective intelligence boost. Together, they're a set of patterns you can apply to your agents today to get more from every token.

May 7, 202628mWatch on YouTube ↗

EPISODE INFO

Released: May 7, 2026
Duration: 28m
Channel: Claude
Watch on YouTube: ▶ Open ↗

EPISODE DESCRIPTION

Cut cost, manage context, boost intelligence. In this session, we'll show you how to put our latest platform capabilities to work. Through live demos you'll see what great prompt caching looks like, learn to keep context lean for long-running agents with tool search, programmatic tool calling, and compaction, and use the advisor strategy for a cost-effective intelligence boost. Together, they're a set of patterns you can apply to your agents today to get more from every token.

EPISODE SUMMARY

In this episode of Claude, Getting more out of the Claude Platform explores production agent performance: prompt caching, context engineering, and advisor models Prompt caching is presented as the highest-impact optimization, delivering major cost reductions, faster time-to-first-token, and relief from API rate-limit pressure for repeated prompt segments.

RELATED EPISODES