Lex Fridman Podcast

Oriol Vinyals: Deep Learning and Artificial General Intelligence | Lex Fridman Podcast #306

Oriol Vinyals is the Research Director and Deep Learning Lead at DeepMind.

Please support this podcast by checking out our sponsors:
- Shopify: https://shopify.com/lex to get 14-day free trial
- Weights & Biases: https://lexfridman.com/wnb
- Magic Spoon: https://magicspoon.com/lex and use code LEX to get $5 off
- Blinkist: https://blinkist.com/lex and use code LEX to get 25% off premium

EPISODE LINKS:
Oriol's Twitter: https://twitter.com/oriolvinyalsml
Oriol's publications: https://scholar.google.com/citations?user=NkzyCvUAAAAJ
DeepMind's Twitter: https://twitter.com/DeepMind
DeepMind's Instagram: https://instagram.com/deepmind
DeepMind's Website: https://deepmind.com

Papers:
  1. Gato: https://deepmind.com/publications/a-generalist-agent
  2. Flamingo: https://deepmind.com/blog/tackling-multiple-tasks-with-a-single-visual-language-model
  3. Language Models are Few-Shot Learners: https://arxiv.org/abs/2005.14165
  4. Emergent Abilities of Large Language Models: https://arxiv.org/abs/2206.07682
  5. Attention Is All You Need: https://proceedings.neurips.cc/paper/2017/file/3f5ee243547dee91fbd053c1c4a845aa-Paper.pdf

PODCAST INFO:
Podcast website: https://lexfridman.com/podcast
Apple Podcasts: https://apple.co/2lwqZIr
Spotify: https://spoti.fi/2nEwCF8
RSS: https://lexfridman.com/feed/podcast/
Full episodes playlist: https://www.youtube.com/playlist?list=PLrAXtmErZgOdP_8GztsuKi9nrraNbKKp4
Clips playlist: https://www.youtube.com/playlist?list=PLrAXtmErZgOeciFP3CBCIEElOJeitOr41

OUTLINE:
0:00 - Introduction
0:34 - AI
15:31 - Weights
21:50 - Gato
56:38 - Meta learning
1:10:37 - Neural networks
1:33:02 - Emergence
1:39:47 - AI sentience
2:03:43 - AGI

SOCIAL:
- Twitter: https://twitter.com/lexfridman
- LinkedIn: https://www.linkedin.com/in/lexfridman
- Facebook: https://www.facebook.com/lexfridman
- Instagram: https://www.instagram.com/lexfridman
- Medium: https://medium.com/@lexfridman
- Reddit: https://reddit.com/r/lexfridman
- Support on Patreon: https://www.patreon.com/lexfridman

Lex Fridman (host), Oriol Vinyals (guest)
Jul 26, 2022, 2h 10m

At a glance

WHAT IT’S REALLY ABOUT

DeepMind’s Oriol Vinyals on Scaling Toward AGI, Not Replacing Humans

  1. Lex Fridman and Oriol Vinyals explore how large neural networks, trained on sequences across text, images, and actions, are taking us toward general-purpose AI systems while still falling short of human-like lifetime learning and memory.
  2. They discuss DeepMind models such as Gato and Flamingo, modular vs. from‑scratch training, meta‑learning, and emergent abilities that only appear once models reach sufficient scale.
  3. Vinyals argues that current systems are nowhere near sentience, emphasizes the importance of data, benchmarks, engineering, and human teams, and predicts human‑level general intelligence within his lifetime, though ‘beyond human’ is less clear.
  4. They close by reflecting on ethics, future civil rights for AI‑like entities, human roles in a multi‑planetary future, and why biology and consciousness are inspirations rather than current design targets.

IDEAS WORTH REMEMBERING

5 ideas

Generalist agents show promise, but are still early and under‑scaled.

Gato unifies text, vision, and actions into a single transformer that can chat and act in diverse environments, but it underperforms specialized agents mainly because it’s relatively small and naively trained; scaling and better data/context handling are expected to unlock more synergy.

Modularity and weight reuse will be crucial to sustainable progress.

Today’s habit of retraining huge networks from random initialization is wasteful; work like Flamingo shows you can freeze a powerful language model (Chinchilla), bolt on vision modules, and get strong multimodal performance, hinting that systematically growing and composing models is a key research direction.
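The freeze-and-extend idea can be illustrated with a deliberately tiny sketch. This is not Flamingo's architecture: the single `frozen_weight` merely stands in for a pretrained model, the single `adapter_weight` stands in for a newly added module, and the data and learning rate are made up for illustration. The only point is that gradient updates touch the new parameter while the pretrained one is reused unchanged.

```python
# Toy model (hypothetical): prediction = frozen_weight * (adapter_weight * x)
frozen_weight = 2.0   # stands in for the pretrained model; never updated
adapter_weight = 0.0  # stands in for the new module; the only trained parameter

# Made-up data generated by target = 6 * x, so the ideal adapter_weight is 3.0
data = [(x, 6.0 * x) for x in (-2.0, -1.0, 0.5, 1.0, 2.0)]


def mean_squared_error(adapter: float) -> float:
    """Loss of the combined model for a given adapter value."""
    return sum((frozen_weight * adapter * x - y) ** 2 for x, y in data) / len(data)


initial_loss = mean_squared_error(adapter_weight)
learning_rate = 0.01  # assumed value, chosen so the toy problem converges

for _ in range(200):
    # Gradient of the loss with respect to adapter_weight only;
    # frozen_weight appears in the formula but is never modified.
    grad = sum(
        2 * (frozen_weight * adapter_weight * x - y) * frozen_weight * x
        for x, y in data
    ) / len(data)
    adapter_weight -= learning_rate * grad

final_loss = mean_squared_error(adapter_weight)
print(f"adapter_weight={adapter_weight:.4f}, loss {initial_loss:.3f} -> {final_loss:.6f}")
```

In a real system the frozen part would be a full language model and the trained part a set of vision modules, but the division of labor is the same: reuse existing weights, train only what is new.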

Meta‑learning is shifting from narrow benchmarks to natural interaction.

Early meta‑learning focused on few‑shot classification; now large language and vision‑language models can be ‘taught’ new tasks via prompts and examples in natural language, and Vinyals expects the next step to be interactive teaching—models asking for feedback, clarifications, and guidance in complex domains like games.

Emergent abilities appear abruptly once models cross task‑specific thresholds.

For some benchmarks (especially multi‑step reasoning), performance stays near random and then jumps at a certain scale, suggesting phase transitions in capability; while smooth scaling laws help plan model/data sizes, not all behaviors can be extrapolated from small models.
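One toy way to see how a smooth underlying improvement can still look like an abrupt jump on a multi-step benchmark is sketched below. It rests on assumptions that are not from the episode: every step of a task must be correct, steps succeed independently, and the per-step accuracies are made-up numbers standing in for successively larger models.

```python
def multi_step_accuracy(per_step_accuracy: float, num_steps: int) -> float:
    """Probability of getting every step right, assuming independent steps."""
    return per_step_accuracy ** num_steps


# Hypothetical per-step accuracies for successively larger models (made-up numbers).
per_step = [0.50, 0.60, 0.70, 0.80, 0.90, 0.95, 0.99]
NUM_STEPS = 10  # assumed length of a multi-step reasoning task

task_accuracy = [multi_step_accuracy(p, NUM_STEPS) for p in per_step]

for p, acc in zip(per_step, task_accuracy):
    print(f"per-step {p:.2f} -> {NUM_STEPS}-step task {acc:.3f}")
```

Under these assumptions the whole-task score stays close to zero across the first few models and climbs steeply only once per-step accuracy is high, which is why measuring small models alone may say little about where the jump occurs.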

Current models are powerful pattern imitators, not sentient beings.

Vinyals is unequivocal that systems like LaMDA or Gato are mathematical functions trained on internet‑scale data, with no lifetime learning, rich memory, or biological complexity; he sees a gap of orders of magnitude between neural nets and biological systems and views sentience claims as premature, though public perceptions must be taken seriously.

WORDS WORTH SAVING

5 quotes

It certainly feels like action is a necessary condition to be more alive, but probably not sufficient either.

Oriol Vinyals

Gato is not the end. Gato is the beginning. Meow.

Oriol Vinyals

We should not be training models from scratch every few months. There should be some sort of way in which we can grow models.

Oriol Vinyals

To create these models, if we had the right software, it would be 10 lines of code and then just a dump of the internet.

Oriol Vinyals

I definitely think it’s possible that we’ll reach human‑level intelligence in my lifetime.

Oriol Vinyals

AI interviewers, agency, and the value of keeping humans in the loop
DeepMind’s Gato, Flamingo, Chinchilla and the “generalist agent” paradigm
Tokenization, multimodal transformers, and modular vs. monolithic models
Meta‑learning, prompting, and interactive teaching of large models
Scaling laws, emergent abilities, and the “bitter lesson” of computation
Sentience, consciousness, and anthropomorphism in language models
Long‑term AGI prospects, ethics, safety, and human–AI coexistence

High quality AI-generated summary created from speaker-labeled transcript.
