Skip to content
Lex Fridman PodcastLex Fridman Podcast

Leslie Kaelbling: Reinforcement Learning, Planning, and Robotics | Lex Fridman Podcast #15

Lex Fridman and Leslie Kaelbling on leslie Kaelbling on uncertainty, abstraction, and truly intelligent robots.

Lex FridmanhostLeslie Kaelblingguest
Mar 12, 20191h 1mWatch on YouTube ↗

CHAPTERS

  1. 0:00 – 1:13

    GEB, philosophy, and the early spark for AI

    Leslie Kaelbling describes how reading Gödel, Escher, Bach in high school shaped her fascination with intelligence emerging from simple primitives. She and Lex connect that early interest to logic, reasoning, and the broader question of what kinds of programs can generate intelligent behavior.

  2. 1:13 – 3:05

    From philosophy at Stanford to robotics at SRI

    Kaelbling recounts studying philosophy at Stanford (before CS was a standard major there) and then transitioning into computer science. Her first job at SRI, working on a Shakey follow-on robot, pulled her into robotics through hands-on engineering needs.

  3. 3:05 – 5:42

    What philosophy contributes to AI (and what it doesn’t)

    The discussion turns to whether AI researchers should also be philosophers. Kaelbling emphasizes the formal parts of philosophy (belief, knowledge, denotation) as directly relevant, while rejecting the idea that human-level robotics requires anything beyond technical advances.

  4. 5:42 – 7:22

    Shakey the robot: foundational ideas in planning and navigation

    Kaelbling explains why Shakey remains iconic and urges people to read the Shakey Technical Report. She details Shakey’s capabilities—planning, replanning, perception, localization, and multiple abstraction levels—highlighting how many modern ideas were already present.

  5. 7:22 – 9:12

    Flaky and “situated computation”: learning robotics by reinventing wheels

    Moving from Shakey to Flaky, Kaelbling describes building a new robot at SRI essentially from scratch. She introduces situated automata as a design philosophy and argues that reinventing wheels can be a valuable learning process for developing real robotic systems.

  6. 9:12 – 11:43

    AI’s oscillating history: cybernetics, expert systems, and shifting problems

    Kaelbling sketches a cyclical history of AI where methods and even problem definitions go in and out of fashion. She contrasts early cybernetics/control with expert systems, and explains how communities often “shelve” hard problems or reframe them as malformed.

  7. 11:43 – 15:17

    Why expert systems hit a wall—and what “symbolic” should really mean

    Kaelbling argues expert systems failed largely because humans can’t truly articulate the knowledge they use to make decisions. She separates that critique from the usefulness of formal reasoning, pivoting toward the central role of abstraction rather than ideology about “symbolic vs neural.”

  8. 15:17 – 18:04

    Abstractions for planning: shrinking state, horizon, and complexity

    The conversation dives deeper into abstraction as the mechanism that makes long-horizon planning possible. Kaelbling explains how agents reason at coarse levels (rooms, afternoons) and then switch to fine-grained continuous control when needed, advocating a pragmatic mix of representations.

  9. 18:04 – 21:45

    MDPs and POMDPs: modeling stance, uncertainty, and belief updates

    Kaelbling frames MDPs and POMDPs as modeling choices (a “stance”), not literal descriptions of reality. She defines MDPs via full state observability and Markov sufficiency, then explains POMDPs as reasoning from histories of actions/observations to infer hidden state.

  10. 21:45 – 23:22

    Planning under uncertainty is intractable—so approximate intelligently

    Kaelbling notes that optimal POMDP planning can be undecidable/intractable, but argues that doesn’t excuse ignoring it. Instead, she advocates using the formalism for clarity and then applying layered approximations in modeling and computation to reach workable solutions.

  11. 23:22 – 26:30

    From engineering to science: bounded optimality, theory gaps, and what guarantees mean

    The discussion shifts to methodology: modern AI’s engineering success is outpacing its scientific understanding. Kaelbling argues the field lacks strong approximate-solution concepts for very hard problems, and calls for principles that predict when/why systems work (like bridge engineering).

  12. 26:30 – 29:20

    Belief space control: acting to change what you know, not just the world

    Kaelbling explains belief space as treating the agent’s belief (a distribution over states) as the object being controlled. This enables deliberate information-gathering actions and risk-aware behavior based on uncertainty, illustrated with driving and navigation examples.

  13. 29:20 – 35:01

    Hierarchical planning in the real world: airports, feasibility leaps, and generalization

    Returning to long-horizon behavior, Kaelbling explains temporal hierarchy: high-level plans depend on learned expectations that subgoals are feasible without detailing every step. She uses navigating airports to motivate generalization—predicting effort/time in unfamiliar environments.

  14. 35:01 – 41:10

    Model-based vs model-free, why perception is harder than planning, and building useful structure

    Kaelbling argues intelligence won’t come from one algorithm but from multiple representations and reasoning styles, with model-based/model-free tradeoffs as time–space computation choices. She states perception is harder than planning because the key challenge is representational—what perception should output and what inductive biases (like convolution, objects, relational structure) should be built in.

  15. 41:10 – 1:01:23

    Human-level robotics, benchmarks, and AI futures: value alignment and research incentives

    In the final stretch, Kaelbling downplays consciousness as a prerequisite while acknowledging practical “self-monitoring” is necessary. She critiques benchmark/competition culture as sometimes hack-driven, discusses founding JMLR and rethinking peer review, and closes with thoughts on AI cycles, overselling, and near-term importance of objective functions and value alignment.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.