Caching, harnesses, and advisors: Building on Claude at GitHub scale
Episode Details
EPISODE INFO
- Released: May 6, 2026
- Duration: 26m
- Channel: Claude
EPISODE DESCRIPTION
GitHub's Copilot team ships Claude to millions of developers across chat, CLI, coding agent, and code review, and has become one of the most demanding users of the Claude Platform. GitHub CPO Mario Rodriguez and Anthropic's Brad Abrams break down how the team pushes quality up and costs down at scale, from caching and evaluation to the new Advisor strategy. Walk away with patterns you can apply to your own Claude-powered product.
SPEAKERS
Brad Abrams
Host: Anthropic team member and on-stage presenter discussing Claude models, evaluation feedback, and GitHub Copilot integrations.
Mario Rodriguez
Guest: Chief Product Officer of GitHub, speaking about building GitHub Copilot features at scale using Claude (caching, harnesses, and advisor models).
EPISODE SUMMARY
This episode of Claude, featuring Brad Abrams and Mario Rodriguez, explores GitHub's playbook for scaling Claude: caching, routing, and evaluation loops. GitHub treats prompt caching as its primary cost-and-latency lever, targeting cache hit rates of ~94–96% and treating a rate near ~70% as a likely bug or a prompt/tooling regression.
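The hit-rate thresholds in the summary can be read as a simple monitoring rule. The sketch below is illustrative only and is not GitHub's implementation: the `RequestUsage` type, its field names, and the `classify` labels are hypothetical. It also assumes the hit rate is measured over input tokens (the summary does not say whether it is per token or per request), and the middle "below_target" band is an assumption, since the summary gives only the ~94–96% target and the ~70% warning level.

```python
from dataclasses import dataclass

# Thresholds taken from the episode summary; everything else is assumed.
TARGET_HIT_RATE = 0.94   # lower end of the ~94-96% target
SUSPECT_HIT_RATE = 0.70  # around this level, suspect a bug or regression


@dataclass
class RequestUsage:
    """Hypothetical per-request token counts (field names are illustrative)."""
    cached_input_tokens: int    # input tokens served from the prompt cache
    uncached_input_tokens: int  # input tokens processed without a cache hit


def cache_hit_rate(requests):
    """Return the fraction of input tokens that were read from the cache."""
    cached = sum(r.cached_input_tokens for r in requests)
    total = cached + sum(r.uncached_input_tokens for r in requests)
    return cached / total if total else 0.0


def classify(rate):
    """Map a hit rate onto the bands described in the summary."""
    if rate >= TARGET_HIT_RATE:
        return "on_target"
    if rate <= SUSPECT_HIT_RATE:
        return "likely_regression"
    return "below_target"  # assumed middle band, not stated in the source


window = [RequestUsage(950, 50), RequestUsage(940, 60)]
status = classify(cache_hit_rate(window))
```

In a setup like this, a sudden move from "on_target" to "likely_regression" would be a cue to check for recent changes to the prompt prefix or tool definitions before assuming a traffic change.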