Skip to content
ClaudeClaude

The thinking lever

Adaptive thinking and effort controls give developers a new decision: how much should Claude reason for a given task? This session covers thinking budgets, effort levels, and the cost, latency, and quality tradeoffs involved.

May 8, 202624mWatch on YouTube ↗

Episode Details

EPISODE INFO

Released
May 8, 2026
Duration
24m
Channel
Claude
Watch on YouTube
▶ Open ↗

EPISODE DESCRIPTION

Adaptive thinking and effort controls give developers a new decision: how much should Claude reason for a given task? This session covers thinking budgets, effort levels, and the cost, latency, and quality tradeoffs involved.

EPISODE SUMMARY

In this episode of Claude, The thinking lever explores how Claude scales inference compute with effort, budgets, adaptive thinking The talk defines test-time compute as spending more inference-time tokens to improve results, showing performance gains both from larger models and from higher effort on the same model.

RELATED EPISODES

Tag Claude in, right where you already work

Tag Claude in, right where you already work

Coding is no longer the constraint: Scaling devex to teams and agents at Spotify

Coding is no longer the constraint: Scaling devex to teams and agents at Spotify

How to get to production faster with Claude Managed Agents

How to get to production faster with Claude Managed Agents

Caching, harnesses, and advisors: Building on Claude at GitHub scale

Caching, harnesses, and advisors: Building on Claude at GitHub scale

How Slack uses Claude for AI search and summaries

How Slack uses Claude for AI search and summaries

Building AI-native at enterprise scale: monday.com, Doctolib, and Delivery Hero

Building AI-native at enterprise scale: monday.com, Doctolib, and Delivery Hero

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.