François Chollet: Measures of Intelligence | Lex Fridman Podcast #120
At a glance
WHAT IT’S REALLY ABOUT
François Chollet Redefines Intelligence, Critiques Deep Learning’s True Limits
- Lex Fridman and François Chollet discuss what intelligence really is, arguing it should be defined as the *efficiency of acquiring new skills in novel situations*, not the accumulation of skills themselves.
- They contrast human cognitive abilities and priors with current machine learning systems, criticizing trends like scale-only language models (e.g., GPT‑3) and end‑to‑end deep learning for lacking robust, out-of-distribution generalization.
- Chollet presents his ARC (Abstraction and Reasoning Corpus) benchmark as a psychometrics-inspired test for machine intelligence, built on explicit human core knowledge priors and designed to measure genuine abstraction and generalization rather than memorization.
- They explore broader themes including developmental psychology, language as an operating system for the mind, limits of compression-as-cognition, the structure of human intelligence (g-factor), and the cultural, ripple-like meaning of human life.
IDEAS WORTH REMEMBERING
5 ideas
Intelligence is about learning efficiency, not raw skill.
Chollet defines intelligence as the efficiency with which a system acquires new skills at tasks it was not prepared for. A specific skill (e.g., playing chess) is merely the crystallized output of an intelligent process, not evidence of general intelligence itself.
You must distinguish the intelligent process from its artifacts.
A static chess program or a hand-engineered driving system encodes the *results* of human intelligence, not intelligence itself. True machine intelligence would autonomously generate such abstractions and skills for new domains without human hand-holding.
Human cognition rests on powerful innate priors missing in machines.
Humans come equipped with core knowledge systems—objectness/physics, agentness/goals, space/topology, and basic number sense—which underlie rapid learning and abstraction. Most AI benchmarks ignore or conflate these priors, making comparisons to humans misleading.
Scaling deep learning hits hard limits without genuine abstraction.
Models like GPT‑3 are impressive at generating plausible text but mainly perform sophisticated pattern matching over massive data. They lack constraints like factuality, consistency, and robust adaptation to truly novel situations, and are ultimately data‑limited rather than compute‑limited.
A good intelligence test must control for priors and experience.
To fairly compare humans and machines, a test must make explicit which priors are allowed and tightly control exposure to training data. Otherwise, engineers can “buy” performance via rules or massive datasets, confounding true generalization with brute-force skill.
WORDS WORTH SAVING
5 quotes
Intelligence is the efficiency with which you acquire new skills at tasks that you did not previously know about.
— François Chollet
We should not confuse a road-building company with one specific road.
— François Chollet
Language is a kind of operating system for the mind.
— François Chollet
You are not a very good source of unfakeable novelty.
— François Chollet
Our actions today create ripples, and these ripples basically sum up the meaning of life.
— François Chollet
High quality AI-generated summary created from speaker-labeled transcript.