This video isn’t embeddableWatch on YouTube →

David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86

David Silver leads the reinforcement learning research group at DeepMind and was lead researcher on AlphaGo, AlphaZero and co-lead on AlphaStar, and MuZero and lot of important work in reinforcement learning. Support this podcast by signing up with these sponsors: - MasterClass: https://masterclass.com/lex - Cash App - use code "LexPodcast" and download: - Cash App (App Store): https://apple.co/2sPrUHe - Cash App (Google Play): https://bit.ly/2MlvP5w EPISODE LINKS: Reinforcement learning (book): https://amzn.to/2Jwp5zG PODCAST INFO: Podcast website: https://lexfridman.com/podcast Apple Podcasts: https://apple.co/2lwqZIr Spotify: https://spoti.fi/2nEwCF8 RSS: https://lexfridman.com/feed/podcast/ Full episodes playlist: https://www.youtube.com/playlist?list=PLrAXtmErZgOdP_8GztsuKi9nrraNbKKp4 Clips playlist: https://www.youtube.com/playlist?list=PLrAXtmErZgOeciFP3CBCIEElOJeitOr41 OUTLINE: 0:00 - Introduction 4:09 - First program 11:11 - AlphaGo 21:42 - Rule of the game of Go 25:37 - Reinforcement learning: personal journey 30:15 - What is reinforcement learning? 43:51 - AlphaGo (continued) 53:40 - Supervised learning and self play in AlphaGo 1:06:12 - Lee Sedol retirement from Go play 1:08:57 - Garry Kasparov 1:14:10 - Alpha Zero and self play 1:31:29 - Creativity in AlphaZero 1:35:21 - AlphaZero applications 1:37:59 - Reward functions 1:40:51 - Meaning of life CONNECT: - Subscribe to this YouTube channel - Twitter: https://twitter.com/lexfridman - LinkedIn: https://www.linkedin.com/in/lexfridman - Facebook: https://www.facebook.com/LexFridmanPage - Instagram: https://www.instagram.com/lexfridman - Medium: https://medium.com/@lexfridman - Support on Patreon: https://www.patreon.com/lexfridman

Lex FridmanhostDavid Silverguest

Apr 3, 20201h 48mWatch on YouTube ↗

EPISODE INFO

Released: April 3, 2020
Duration: 1h 48m
Channel: Lex Fridman Podcast
Watch on YouTube: ▶ Open ↗

EPISODE DESCRIPTION

David Silver leads the reinforcement learning research group at DeepMind and was lead researcher on AlphaGo, AlphaZero and co-lead on AlphaStar, and MuZero and lot of important work in reinforcement learning. Support this podcast by signing up with these sponsors:
MasterClass: https://masterclass.com/lex
Cash App - use code "LexPodcast" and download:
Cash App (App Store): https://apple.co/2sPrUHe
Cash App (Google Play): https://bit.ly/2MlvP5w
EPISODE LINKS: Reinforcement learning (book): https://amzn.to/2Jwp5zG PODCAST INFO: Podcast website: https://lexfridman.com/podcast Apple Podcasts: https://apple.co/2lwqZIr Spotify: https://spoti.fi/2nEwCF8 RSS: https://lexfridman.com/feed/podcast/ Full episodes playlist: https://www.youtube.com/playlist?list=PLrAXtmErZgOdP_8GztsuKi9nrraNbKKp4 Clips playlist: https://www.youtube.com/playlist?list=PLrAXtmErZgOeciFP3CBCIEElOJeitOr41 OUTLINE: 0:00 - Introduction 4:09 - First program 11:11 - AlphaGo 21:42 - Rule of the game of Go 25:37 - Reinforcement learning: personal journey 30:15 - What is reinforcement learning? 43:51 - AlphaGo (continued) 53:40 - Supervised learning and self play in AlphaGo 1:06:12 - Lee Sedol retirement from Go play 1:08:57 - Garry Kasparov 1:14:10 - Alpha Zero and self play 1:31:29 - Creativity in AlphaZero 1:35:21 - AlphaZero applications 1:37:59 - Reward functions 1:40:51 - Meaning of life CONNECT:
Subscribe to this YouTube channel
Twitter: https://twitter.com/lexfridman
LinkedIn: https://www.linkedin.com/in/lexfridman
Facebook: https://www.facebook.com/LexFridmanPage
Instagram: https://www.instagram.com/lexfridman
Medium: https://medium.com/@lexfridman
Support on Patreon: https://www.patreon.com/lexfridman

SPEAKERS

Lex Fridman
host
David Silver
guest

EPISODE SUMMARY

In this episode of Lex Fridman Podcast, featuring Lex Fridman and David Silver, David Silver: AlphaGo, AlphaZero, and Deep Reinforcement Learning | Lex Fridman Podcast #86 explores david Silver on AlphaGo, self-play, and the path to intelligence Lex Fridman and David Silver trace Silver’s journey from childhood programming to leading DeepMind’s landmark work on AlphaGo, AlphaZero, and MuZero. They explain why Go was such a hard AI challenge, how deep reinforcement learning and self-play enabled systems to exceed human world champions, and what these results suggest about intuition, creativity, and general intelligence. Silver details the transition from hand-crafted knowledge and search to learning-based systems that discover their own strategies, and how removing human priors made the algorithms both stronger and more general. The conversation closes with reflections on future real-world applications, the nature of goals and reward in AI, and layered views on the “meaning” of intelligence and life.

RELATED EPISODES

Garry Kasparov: Chess, Deep Blue, AI, and Putin | Lex Fridman Podcast #46

Leonard Susskind: Quantum Mechanics, String Theory and Black Holes | Lex Fridman Podcast #41

Kai-Fu Lee: AI Superpowers - China and Silicon Valley | Lex Fridman Podcast #27

David Ferrucci: IBM Watson, Jeopardy & Deep Conversations with AI | Lex Fridman Podcast #44

Bjarne Stroustrup: C++ | Lex Fridman Podcast #48

Yann LeCun: Deep Learning, ConvNets, and Self-Supervised Learning | Lex Fridman Podcast #36

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

iOS

Android

Claude

Chrome

Episode Details