Demis Hassabis: Agents, AGI & The Next Big Scientific Breakthrough

Demis Hassabis has had one of the most extraordinary careers in tech. He started as a chess prodigy and video game designer at 17 before getting a PhD in neuroscience and going on to found DeepMind. His lab cracked Go, solved protein structure prediction with AlphaFold, and then gave it away free to every scientist on earth. That work won him the 2024 Nobel Prize in Chemistry. Today he leads Google DeepMind, pushing toward the same goal he set as a teenager: AGI. On this special live episode of How to Build the Future, he sat down with YC's Garry Tan to talk about what still needs to happen to get us to AGI, his advice for founders on how to stay ahead of the curve and what the next big scientific breakthroughs might be. Chapters: 00:00 — Intro 00:46 — Demis Hassabis: From Chess Prodigy to DeepMind 01:48 — What’s Missing Before We Get To AGI? 03:36 — Why Memory Is Still Unsolved 06:14 — How AlphaGo Shaped Gemini 08:06 — Why Smaller Models Are Getting So Powerful 10:46 — The 1000x Engineer 12:40 — Continual Learning and the Future of Agents 13:32 — Why AI Still Fails at Basic Reasoning 15:33 — Are Agents Overhyped or Just Getting Started? 18:31 — Can AI Become Truly Creative? 20:26 — Open Models, Gemma, and Local AI 22:26 — Why Gemini Was Built Multimodal 24:08 — What Happens When Inference Gets Cheap? 25:24 — From AlphaFold to the Virtual Cells 28:24 — AI as the Ultimate Tool for Science 30:43 — Advice for Founders 33:30 — The AlphaFold Breakthrough Pattern 35:20 — Can AI Make Real Scientific Discoveries? 37:59 — What to Build Before AGI Arrives Apply to Y Combinator: https://www.ycombinator.com/apply Work at a startup: https://www.ycombinator.com/jobs

Demis HassabisguestGarry Tanhost

Apr 29, 202640mWatch on YouTube ↗

WHAT IT’S REALLY ABOUT

Demis Hassabis on agents, memory gaps, and AI-driven science breakthroughs

Hassabis argues current foundation-model paradigms will remain core to AGI, but continual learning, long-horizon reasoning, and better memory systems are still unsolved requirements.
He frames agents as the practical path to AGI—active systems that plan and act—while noting today’s agent workflows are still early and often don’t justify heavy autonomous “swarms” yet.
DeepMind’s RL/search heritage (AlphaGo/AlphaZero) is portrayed as directly relevant to modern “thinking modes,” with older ideas like tree search re-emerging at larger scale in more general settings.
He expects rapid capability diffusion via distillation, making small/edge models nearly as strong as frontier models and enabling fast iteration, privacy-preserving local AI, and robotics assistants.
He describes AI as a general-purpose scientific instrument: AlphaFold as the archetype, virtual-cell simulations as a ~10-year goal, and a near-term frontier of AI that can generate genuinely novel hypotheses (the “Einstein test”).

IDEAS WORTH REMEMBERING

5 ideas

AGI likely needs a few missing capabilities, not a full paradigm reset.

Hassabis expects pretraining + post-training techniques to stay central, but highlights continual learning, robust long-horizon reasoning, and more principled memory as the main gaps, with a “one or two big ideas” possibility.

Context windows are not a real solution to memory.

Even huge windows are brute-force working memory; the hard part is storing the right things, filtering wrong/irrelevant info, and retrieving what matters with low latency—especially for long-running, video-rich personal assistants.

Agents are essential, but the ecosystem is still in the experimentation phase.

He sees agents as required for “active problem solving,” yet observes many multi-agent runs don’t yet produce outputs that justify the compute/time; he expects clearer killer workflows in the next 6–12 months.

Reasoning failures look like missing self-monitoring and control, not lack of raw capability.

He describes models that “overthink,” loop, or choose known-blunder moves (e.g., in chess), suggesting improvements may come from better oversight of intermediate reasoning and interventions during thought, not just larger models.

DeepMind’s game-era planning ideas are re-entering foundation models.

He connects chain-of-thought and “thinking modes” to AlphaGo-style methods and expects more explicit planning/search (e.g., Monte Carlo tree search-like augmentation) to drive near-term progress.

WORDS WORTH SAVING

5 quotes

Continual learning, long-term reasoning, uh, some aspects of memory, these are still unsolved. I think all of these are gonna be required for AGI.

— Demis Hassabis

You have to have an active system, uh, that can actively solve problems for you to get to AGI. So agents are that path, and I think we're just getting going.

— Demis Hassabis

It's not enough to come up with move 37, like that's pretty cool, very useful. Um, but can it invent Go? That's what I w- I want a system that can invent Go if you give it a high-level description.

— Demis Hassabis

One was, step one was solve intelligence, i.e., build AGI, and then step two was use it to solve everything else.

— Demis Hassabis

Given Life's very short, and y- you know, you only have so much time and energy. You might as well put your life force into something that will really make a, a, a difference if you hadn't done it, if you hadn't been there to push it.

— Demis Hassabis

What’s missing for AGI (continual learning, memory, long-term reasoning)Memory vs context windows and retrieval costsAgents as the route to AGI and current limitationsReinforcement learning, planning, and tree search revisitedDistillation and small/edge model performance (Gemini Flash, Gemma)Multimodality for world modeling, assistants, and roboticsAI-for-science: AlphaFold pattern, virtual cells, hypothesis generation

High quality AI-generated summary created from speaker-labeled transcript.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.