Noam Brown: AI vs Humans in Poker and Games of Strategic Negotiation | Lex Fridman Podcast #344
At a glance
WHAT IT’S REALLY ABOUT
AI Masters Poker and Diplomacy, Redefining Strategy, Trust, and Negotiation
- Noam Brown discusses his work building superhuman AI systems for complex strategic games: heads‑up and six‑player no‑limit Texas Hold’em (Libratus, Pluribus) and the negotiation-heavy board game Diplomacy (Cicero).
- He explains core ideas like Nash equilibrium, self‑play, counterfactual regret minimization, and the critical role of search, arguing that poker’s imperfect information makes it even more challenging than games like chess or Go.
- In Diplomacy, Brown’s team combines large language models with reinforcement learning and human game data to create an AI that can negotiate, form alliances, and build trust with humans in natural language at roughly top‑human level.
- They explore how such systems illuminate human irrationality, trust, deception, and the limits of self‑play, and how these ideas may transfer to future NPCs, training tools, and even real‑world negotiation and decision support.
IDEAS WORTH REMEMBERING
Game‑theoretic ‘balanced’ play can outperform human psychological exploitation.
Libratus crushed elite heads‑up poker pros by approximating a Nash equilibrium strategy that didn’t adapt to specific opponents or do ‘mind games’, undermining the belief that reading people always beats theory.
Search is at least as important as raw neural network strength.
Across chess, Go, and poker, planning ahead via search dramatically boosts performance; removing Monte Carlo tree search from Go AIs drops them from far‑superhuman to roughly human‑grandmaster strength.
Imperfect‑information games require optimizing action probabilities, not just actions.
In poker (and rock‑paper‑scissors), the value of a move depends on how often you do it; balancing bluffing and value bets so you are unpredictable is central, and Libratus explicitly optimizes these frequencies.
Six‑player poker shows equilibrium‑style methods can generalize beyond two‑player zero‑sum.
Although theory gives no guarantees, Pluribus uses depth‑limited search and equilibrium‑inspired self‑play to achieve superhuman performance in six‑player games, where cooperation and more complex dynamics appear.
Self‑play alone fails in social, cooperative settings; you must learn from humans.
In Diplomacy, a self‑play‑only bot develops an alien ‘robot language’ and inhuman conventions and is quickly ostracized and crushed by humans; Cicero instead anchors its policies and language to large human datasets.
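The point above about optimizing action frequencies rather than single actions can be sketched with regret matching, the building block behind counterfactual regret minimization. The following is an illustrative self-play loop for rock‑paper‑scissors only, not Libratus's actual implementation; all function names here are my own:

```python
# Regret matching in self-play for rock-paper-scissors: a minimal
# illustration of the idea behind counterfactual regret minimization
# (CFR). This is a sketch, not how Libratus is implemented.
import random

ACTIONS = ["rock", "paper", "scissors"]

def payoff(a, b):
    """+1 if action a beats action b, -1 if it loses, 0 on a tie."""
    if a == b:
        return 0
    wins = {("rock", "scissors"), ("paper", "rock"), ("scissors", "paper")}
    return 1 if (a, b) in wins else -1

def strategy_from_regrets(regrets):
    """Mix actions in proportion to positive regret; uniform if none."""
    pos = [max(r, 0.0) for r in regrets]
    total = sum(pos)
    if total == 0:
        return [1.0 / 3.0] * 3
    return [p / total for p in pos]

def train(iterations=100_000, seed=0):
    rng = random.Random(seed)
    regrets = [[0.0] * 3 for _ in range(2)]       # cumulative regret per player
    strategy_sum = [[0.0] * 3 for _ in range(2)]  # running sum of strategies
    for _ in range(iterations):
        strats = [strategy_from_regrets(r) for r in regrets]
        picks = [rng.choices(range(3), weights=s)[0] for s in strats]
        for p in (0, 1):
            my, opp = picks[p], picks[1 - p]
            got = payoff(ACTIONS[my], ACTIONS[opp])
            for a in range(3):
                # Regret: how much better action a would have done this round.
                regrets[p][a] += payoff(ACTIONS[a], ACTIONS[opp]) - got
                strategy_sum[p][a] += strats[p][a]
    # The *average* strategy, not the last one, converges toward equilibrium.
    total = sum(strategy_sum[0])
    return [s / total for s in strategy_sum[0]]

avg = train()
print(avg)  # each probability close to 1/3, the unexploitable mix
```

The averaged strategy settles near the uniform Nash equilibrium: no single action is played too often, so no opponent pattern-reading can exploit it. This mirrors, in miniature, the frequency balancing of bluffs and value bets described above.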
WORDS WORTH SAVING
In any finite two‑player zero‑sum game, there is an optimal strategy that, if you play it, you are guaranteed to not lose in expectation, no matter what your opponent does.
— Noam Brown
One of the key strategies in poker is to put the other person into an uncomfortable position, and if you’re doing that, then you’re playing poker well.
— Noam Brown
We played our bot against four top heads‑up no‑limit hold’em poker players, and the bot wasn’t trying to adapt to them… it was just trying to approximate the Nash equilibrium, and it crushed them.
— Noam Brown
Diplomacy is a game about trust and being able to build trust in an environment that encourages people to not trust anyone.
— Noam Brown
War is an inherently negative‑sum game. There’s always a better outcome than war for all the parties involved.
— Noam Brown
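The first quote above is von Neumann's minimax theorem. In standard notation (mine, not from the episode), for a finite two‑player zero‑sum game with payoff matrix A and mixed strategies x, y over the players' action simplices:

```latex
\max_{x \in \Delta_m} \; \min_{y \in \Delta_n} \; x^{\top} A y
\;=\;
\min_{y \in \Delta_n} \; \max_{x \in \Delta_m} \; x^{\top} A y
```

Any maximizing x* guarantees the row player at least the game's value in expectation against every opponent strategy, which is the "guaranteed to not lose" property Brown describes (in a symmetric game like heads‑up poker over both seat assignments, that value is zero).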
High quality AI-generated summary created from speaker-labeled transcript.