No Priors Ep. 16 | With Mustafa Suleyman, Founder of DeepMind and Inflection

Suleyman, co-founder of DeepMind and now co-founder and CEO of Inflection AI, joins Sarah and Elad to discuss how his interests in counseling, conflict resolution, and intelligence led him to start an AI lab that pioneered deep reinforcement learning, lead applied AI and policy efforts at Google, and more recently found Inflection and launch Pi, a personal AI. Mustafa offers insights on the changing structure of the web, the pressure Google faces in the age of AI personalization, predictions for model architectures, how to measure emotional intelligence in AIs, and the thinking behind Pi: the AI companion that knows you, is aligned to your interests, and provides companionship. Sarah and Elad also discuss Mustafa’s upcoming book, The Coming Wave (release September 12, 2023), which examines the political ramifications of AI and digital biology revolutions. The Coming Wave: https://www.penguinrandomhouse.com/books/722674/the-coming-wave-by-mustafa-suleyman-with-michael-bhaskar/ 00:00 - From Conflict Resolution to AI Pioneering 10:36 - Defining Intelligence 15:32 - DeepMind's Journey and Breakthroughs 24:45 - The Future of Personal AI Companionship 33:22 - AI and the Future of Personalized Content 41:49 - The Launch of Pi 51:12 - Mustafa’s New Book The Coming Wave

Mustafa SuleymanguestSarah GuohostElad Gilhost

May 11, 202351mWatch on YouTube ↗

CHAPTERS

0:00 – 3:02
Mustafa’s origin story: Oxford, post‑9/11 helpline, and learning to listen at scale
Mustafa recounts how his early “change-the-world” motivation led him to pause his Oxford philosophy studies and build a nonjudgmental helpline for young British Muslims in the post‑9/11 environment. He highlights how those years shaped his worldview around empathy, cultural sensitivity, and practical impact.
- •Left theoretical philosophy for hands-on social impact work
- •Co-founded and ran a youth helpline staffed by ~100 volunteers
- •Context: post‑9/11 pressure and rising Islamophobia in the UK
- •Early lessons in fundraising, operations, and human-centered support
3:02 – 6:01
Conflict resolution consultancy: negotiating across nations and the limits of human governance
He describes founding a conflict resolution and facilitation consultancy working with international clients and tense geopolitical contexts. The Copenhagen climate negotiations become a pivotal moment, convincing him that slow governance systems can’t keep pace with exponential global and technological challenges.
- •Worked on negotiation/facilitation with governments and international bodies
- •Learned “speaking other people’s social languages” as an acquired skill
- •Copenhagen climate talks illustrated coordination failure at global scale
- •Realization: governance won’t keep up with exponential problems and tech
6:01 – 8:56
Seeing platforms as ‘frames’: Facebook, choice architecture, and behavior at societal scale
Mustafa explains how his conflict-resolution lens led him to view Facebook not as neutral infrastructure but as a designed frame shaping incentives and behavior. He draws parallels between platform defaults and deep, long-lived social structures that influence how people interact.
- •Platforms embed ‘choice architecture’ that steers behavior
- •Even small UI decisions can scale to massive societal effects
- •Societal-level framing differs from micro-level growth optimization
- •Early recognition that “neutral platforms” is a misconception
8:56 – 11:42
From tech fascination to DeepMind: poker, Demis Hassabis, Shane Legg, and early AGI language
He tells the story of reconnecting with Demis Hassabis and aligning on technology as the highest-leverage path to change. Shane Legg’s work on formalizing intelligence and measuring progress toward generality becomes the intellectual catalyst for founding DeepMind.
- •Decision to pivot from governance work to technology for scalable impact
- •DeepMind origin story: Demis connection and long conversations with founders
- •Shane Legg’s intelligence definitions and measurement-oriented framing
- •Early thesis: distill the essence of human intelligence into engineering goals
11:42 – 15:31
Rethinking intelligence: beyond generality to routing, tools, and context-sensitive attention
Mustafa updates his view of intelligence, arguing the field may have overemphasized a single general agent. He proposes that directing attention and routing between specialized systems—AI and non-AI tools—will be the key unlock, and that this is partly an engineering problem.
- •Original ‘generality across tasks’ definition is useful but incomplete
- •New emphasis: context-sensitive allocation of processing/attention
- •Future systems: routers orchestrating multiple specialized models and tools
- •Cost/latency tradeoffs imply dynamic selection among model sizes
15:31 – 17:27
Early DeepMind fundraising and the breakthrough arc: from ‘don’t say AI’ to Atari and beyond
He describes how unconventional it felt to start DeepMind in 2010, when even saying “AI” (let alone “AGI”) was frowned upon. Funding was scarce until major results—AlexNet and DeepMind’s Atari DQN—helped legitimize the field and catalyze broader attention.
- •Early era stigma: speak in ‘machine learning,’ not ‘AI/AGI’
- •Limited early funding ecosystem (e.g., Founders Fund)
- •Inflection point: AlexNet (2012) and DQN Atari paper (2013)
- •Rapid shift in public and industry awareness post-breakthroughs
17:27 – 22:41
DeepMind’s applied breakthroughs: deep RL, AlphaGo-to-AlphaFold, and transfer as the core thesis
Mustafa highlights deep reinforcement learning as a foundational direction while explaining why game-like environments were initially ideal. He then walks through AlphaFold’s origins—from searching for applications to a hackathon spark—framing transfer learning as DeepMind’s central promise.
- •Deep RL: learning from raw perceptual inputs tied to reward signals
- •Games enabled massive iteration with structured rewards
- •AlphaFold originated from a search for real-world applications and a hackathon
- •DeepMind thesis: transfer learning and compressed representations across domains
22:41 – 26:51
Why start Inflection: GPT‑3 as the wake-up call and frustration with launching at Google
He explains that language’s long-range structure made major NLP impact seem uncertain until GPT‑3. After joining Google’s LLM efforts (Meena/LaMDA) and pushing for launches that didn’t happen, he decided the technology needed to reach the world via a new company with trusted collaborators.
- •Pre‑2019 uncertainty that neural nets would transform language at scale
- •GPT‑3 as the moment that ‘clicked’ for him
- •Worked on Meena/LaMDA; explored retrieval/grounding and hallucinations
- •Left to found Inflection with Karén Simonyan and Reid Hoffman after launch friction
26:51 – 30:41
Companionship as the wedge: conversation as the next interface and the need for user feedback loops
Mustafa argues that LLMs needed real interaction feedback, and that conversation is the natural successor to today’s clunky search dialogue. He critiques how ad and SEO incentives degraded web content and claims fluent conversational interfaces will replace ‘learning to speak Google.’
- •Key missing ingredient: interaction feedback from real users
- •Search already functions as a dialogue—just a painful one
- •SEO/ads shaped content into engagement traps rather than succinct answers
- •Conversational natural language becomes the new default interface
30:41 – 33:01
What Pi is: personal intelligence aligned to you (not brands), with an empathetic conversational style
He introduces Pi (Personal Intelligence) as a personal AI designed to be on the user’s side amid a world of business, political, and brand AIs. The initial product emphasizes supportive, curious conversation and is positioned as an early version to learn and iterate from usage.
- •Future: many AIs with different objectives; users need one aligned to themselves
- •Pi’s starting personality: empathetic, supportive, reflective listening
- •Elements of great conversation: reflection, adding value, follow-ups, ‘spice’
- •Product posture: ship early, learn fast; not yet the biggest internal model
33:01 – 37:31
The internet after personal AIs: personalized summaries, dynamic formats, and AI-to-AI content access
Mustafa predicts the web will shift from fixed pages to dynamic, personalized, conversational delivery. He advises publishers and creators to think of themselves as evolving into interactive AI experiences, with personal AIs acting as intermediaries that discover and negotiate content.
- •Computing becomes conversational; Pi summarizes news and hobbies in your style
- •Static websites assume one format; generative systems enable personalization
- •Publishers can ship brand/content AIs; discovery becomes AI-to-AI interaction
- •Analogy: personal AIs become the new ‘crawler/ranker/presenter’ layer
37:31 – 40:42
Risks and governance: filter bubbles, non-neutral curation, and democratic accountability
Responding to concerns about reinforcement of beliefs, Mustafa says the default trajectory without intervention is more extreme context bubbles. He calls for AI companies to accept responsibility for curation, increase transparency, and build accountable oversight tied to democratic institutions.
- •Platforms were never neutral; ranking is inherently value-laden
- •AI accelerates curation power and risks deepening information bubbles
- •Need transparency: what’s curated, excluded, up/down-weighted
- •Argues for democratic oversight mechanisms; notes EU-style approaches as relevant
40:42 – 44:31
How Pi improves: emotional intelligence metrics, memory/personalization, utility, and real-time grounding
He outlines how Inflection evaluates progress—emotional intelligence, fluidity, respect, even-handedness—and discusses early issues like political bias. He previews a roadmap toward deeper personalization (long-term memory), integrations, web access, citations, and practical assistant capabilities.
- •Evaluation focuses: EQ, conversation quality, respect, neutrality/even-handedness
- •Bias incidents treated as errors to correct; avoid judgmental behavior
- •Current memory ~100 messages; aim for durable ‘second mind’ personalization
- •Roadmap: web retrieval, real-time info, citations, and assistant actions (shopping, travel)
44:31 – 47:11
Company-building at Inflection: small senior team, no manager layers, and a fast shipping cadence
Mustafa describes Inflection’s culture as a ~30-person, high-standards, applied AI team. They avoid rigid researcher/engineer hierarchies, prioritize exceptional ICs, and operate on six-week shipping cycles capped by an in-person hackathon week to build momentum and cohesion.
- •~30 people; hand-selected senior scientists/engineers
- •Unified ‘MTS’ role to reduce status divides between research and engineering
- •Applied AI focus: production-driven innovation over publishing
- •Operating rhythm: six-week goals + hackathon week; minimal management layers
47:11 – 50:17
Scaling laws in practice: compounding compute, more data, and efficiency breakthroughs like Chinchilla
He frames AI progress as ‘compounding exponentials’ in compute, data, and algorithmic efficiency. Using DQN vs modern training runs as an example, he argues both bigger models and more efficient architectures will drive the next wave, expanding access while escalating capabilities.
- •Training compute grew ~orders of magnitude yearly; dramatic DQN-to-today comparison
- •Data scale continues, but efficiency gains are equally important
- •Chinchilla: compute-optimal training via smaller models trained longer on more data
- •Prediction: simultaneous scale-up and efficiency improvements create a ‘coming wave of contradictions’
50:17 – 51:55
The Coming Wave: why write a book on AI, synthetic biology, and the future of the nation state
Mustafa closes by sharing motivation for writing: deadlines sharpen thinking, despite the strain of writing during a startup. He previews the book’s focus on the societal and political consequences of AI and synthetic biology over the next decade.
- •Writing as a tool to clarify thinking under deadline pressure
- •Book title: The Coming Wave
- •Focus: AI + synthetic biology trajectories and their political consequences
- •Examines implications for governance and the future of the nation state

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

iOS

Android

Claude

Chrome

Mustafa’s origin story: Oxford, post‑9/11 helpline, and learning to listen at scale

Conflict resolution consultancy: negotiating across nations and the limits of human governance

Seeing platforms as ‘frames’: Facebook, choice architecture, and behavior at societal scale

From tech fascination to DeepMind: poker, Demis Hassabis, Shane Legg, and early AGI language

Rethinking intelligence: beyond generality to routing, tools, and context-sensitive attention

Early DeepMind fundraising and the breakthrough arc: from ‘don’t say AI’ to Atari and beyond

DeepMind’s applied breakthroughs: deep RL, AlphaGo-to-AlphaFold, and transfer as the core thesis

Why start Inflection: GPT‑3 as the wake-up call and frustration with launching at Google

Companionship as the wedge: conversation as the next interface and the need for user feedback loops

What Pi is: personal intelligence aligned to you (not brands), with an empathetic conversational style

The internet after personal AIs: personalized summaries, dynamic formats, and AI-to-AI content access

Risks and governance: filter bubbles, non-neutral curation, and democratic accountability

How Pi improves: emotional intelligence metrics, memory/personalization, utility, and real-time grounding

Company-building at Inflection: small senior team, no manager layers, and a fast shipping cadence

Scaling laws in practice: compounding compute, more data, and efficiency breakthroughs like Chinchilla

The Coming Wave: why write a book on AI, synthetic biology, and the future of the nation state

Get more out of YouTube videos.