Boris Sofman: Waymo, Cozmo, Self-Driving Cars, and the Future of Robotics | Lex Fridman Podcast #241

Boris Sofman is the Senior Director Of Engineering and Head of Trucking at Waymo, formerly the Google Self-Driving Car project. He was also the CEO and co-founder of Anki, a home robotics company. Please support this podcast by checking out our sponsors: - LMNT: https://drinkLMNT.com/lex to get free sample pack - Athletic Greens: https://athleticgreens.com/lex and use code LEX to get 1 month of fish oil - ROKA: https://roka.com/ and use code LEX to get 20% off your first order - Indeed: https://indeed.com/lex to get $75 credit - BetterHelp: https://betterhelp.com/lex to get 10% off EPISODE LINKS: Boris's Twitter: https://twitter.com/bsofman Boris's LinkedIn: https://www.linkedin.com/in/bsofman Waymo's Twitter: https://twitter.com/waymo Waymo's YouTube: https://www.youtube.com/waymo Waymo's Website: https://waymo.com/ PODCAST INFO: Podcast website: https://lexfridman.com/podcast Apple Podcasts: https://apple.co/2lwqZIr Spotify: https://spoti.fi/2nEwCF8 RSS: https://lexfridman.com/feed/podcast/ Full episodes playlist: https://www.youtube.com/playlist?list=PLrAXtmErZgOdP_8GztsuKi9nrraNbKKp4 Clips playlist: https://www.youtube.com/playlist?list=PLrAXtmErZgOeciFP3CBCIEElOJeitOr41 OUTLINE: 0:00 - Introduction 1:08 - Robots in science fiction 6:49 - Cozmo 32:04 - AI companions 38:59 - Anki 1:04:33 - Waymo Via 1:36:10 - Sensor suites for long haul trucking 1:46:06 - Machine learning 2:04:03 - Waymo vs Tesla 2:14:38 - Safety and risk management 2:23:42 - Societal effects of automation 2:34:47 - Amazon Astro 2:39:12 - Challenges of the robotics industry 2:43:39 - Humanoid robotics 2:50:42 - Advice for getting a PhD in robotics 2:58:13 - Advice for robotics startups 3:09:19 - Advice for students SOCIAL: - Twitter: https://twitter.com/lexfridman - LinkedIn: https://www.linkedin.com/in/lexfridman - Facebook: https://www.facebook.com/lexfridman - Instagram: https://www.instagram.com/lexfridman - Medium: https://medium.com/@lexfridman - Reddit: https://reddit.com/r/lexfridman - Support on Patreon: https://www.patreon.com/lexfridman

Lex FridmanhostBoris Sofmanguest

Nov 16, 20213h 14mWatch on YouTube ↗

CHAPTERS

0:00 – 1:00
Boris Sofman’s robotics journey: Anki, Cozmo, and Waymo trucking
Lex introduces Boris Sofman and frames the conversation around two themes: social robotics (Cozmo/Anki) and autonomous trucking (Waymo Via). The episode is positioned as a complement to a prior discussion on the human side of trucking, focusing here on the “robotic side.”
- •Boris’s roles: Anki co-founder/CEO; Waymo head of trucking engineering
- •Cozmo highlighted as a standout social robot with emotional intelligence
- •Episode scope: consumer/social robots + autonomous trucks + future of robotics
1:00 – 3:16
Science fiction robots and why nonverbal emotion works (WALL‑E, R2‑D2, Terminator)
Boris names WALL‑E and R2‑D2 as favorites for conveying intent and emotion without language, then contrasts that warmth with the darker Terminator vision. The discussion quickly turns to how “character” and constraints shape believable robots.
- •Nonverbal cues can create strong emotional attachment
- •Humanoid form raises expectations and engineering difficulty
- •Terminator illustrates a spectrum of capability and generations in robotics
3:16 – 7:06
Design philosophy: avoid humanoids, embrace constraints, and amplify expressiveness
Boris explains why Anki intentionally avoided human form: it reduces cost and expectation mismatch and lets designers lean into robotic strengths. Lex connects this to animation—minimal features (like “two dots and a line”) can communicate a wide emotional range when designed well.
- •Humanoid form increases degrees of freedom, cost, and expectation pressure
- •Constraints can force deeper emotional clarity and better design outcomes
- •Animation principles transfer directly to physical robot interaction
7:06 – 15:08
Founding Anki at CMU: consumer robotics as ‘video games brought to life’
Boris describes starting Anki in graduate school at Carnegie Mellon with a strong bias toward applied robotics. The team saw a market opening as smartphone-driven component costs fell, enabling inexpensive hardware paired with sophisticated software.
- •Applied focus: build real products, not only research prototypes
- •Toy/entertainment chosen as a forgiving proving ground for HRI
- •Smartphone supply chain drove down costs of cameras, motors, compute
- •Core idea: physical experiences powered by game-engine-like software
15:08 – 20:44
Cozmo’s ‘three-legged stool’: hardware, AI, and character (and why mistakes help)
The conversation dives into how Cozmo was engineered to feel alive with only a few actuators and a low-res screen. Boris argues most robotics efforts underinvest in character, and that well-framed mistakes can be endearing and increase empathy.
- •Cozmo’s simplicity: ~4 DoF (treads, head, lift) + sound + screen
- •“Character” is often the missing pillar in human-robot interaction
- •Mistakes are acceptable—and even lovable—if the robot ‘reacts’ properly
- •Consumer cost constraints forced software to compensate for noisy hardware
20:44 – 26:14
Building emotional intelligence: eye contact, pets, and games as context for feelings
Boris details the animation/behavior system that generated diverse, parameterized emotional responses. Games were designed less as standalone entertainment and more as structured contexts that elicit believable emotions and bonding.
- •Pixar-inspired pipeline; collaboration with experienced character animators
- •Parameterized emotional library + behavioral engine + randomness for spontaneity
- •Eye contact as a powerful driver of engagement (large effect on playtime)
- •Studying pets (especially dogs) to shape attachment and interaction cues
- •Games create tension/winning/losing moments that make emotions feel ‘earned’
26:14 – 38:59
AI companions and the ‘Her’ question: intimacy without general intelligence
Lex and Boris explore how close we are to meaningful robot companionship and whether deep relationships require general AI. Boris argues that carefully constrained domains and expectations can enable compelling companionship sooner than humanlike agents.
- •Pet-like companions likely arrive sooner than humanlike ‘Her’ assistants
- •Uncanny valley risk rises as you approach human form/voice
- •Constraints and focused roles can deliver “friendship” without full AGI
- •Speech output is especially hard; speech understanding is more feasible
38:59 – 50:04
Anki’s shutdown: the brutal physics of hardware бизнеса and seasonal cash flow
Boris recounts the emotional and operational collapse of Anki despite strong product reception and meaningful revenue. He explains how inventory, marketing, retailer payment timing, and Q4 seasonality created extreme cash swings, culminating in a failed financing round.
- •Commercial success still didn’t guarantee survivability in hardware
- •Severe seasonality (majority of volume in Q4) amplified cash-flow risk
- •High up-front costs: inventory, marketing, payroll before retailer payments
- •Hardware market valuation shifts made fundraising harder
- •Team culture remained strong; shutdown was deeply personal and painful
50:04 – 1:05:00
After Anki: Cozmo’s legacy and how teams moved on (including to Waymo)
Lex and Boris reflect on Cozmo’s continued life under new stewardship and the difficulty of manufacturing/evolving such a complex product. Boris notes that many Anki engineers transitioned together, with a significant group joining Waymo.
- •Manufacturing and quality control are deceptively hard to replicate
- •Maintaining and evolving Cozmo requires deep cross-disciplinary expertise
- •Anki’s impact persisted via fan communities and research interest
- •Talent migration: many robotics/AI staff found a strong fit at Waymo
1:05:00 – 1:10:28
Waymo overview: Waymo Driver, Waymo One, and Waymo Via’s mission
Boris explains Waymo as a “driver” platform that can operate across vehicle types and domains. He outlines how Waymo One (people) and Waymo Via (goods) share a core stack while still requiring domain-specific engineering, especially for trucks.
- •Waymo Driver: reusable autonomy system across platforms
- •Waymo One: rider service; Via: freight/trucking and future delivery
- •Core tech leverage across products, with specialized teams for unique problems
- •Driverless Phoenix service as proof point for real-world deployment
1:10:28 – 1:33:17
Autonomous long-haul trucking strategy: transfer hubs, Texas routes, and network effects
Boris describes the initial commercialization approach: hub-to-hub freeway routes with transfer hubs at metro edges, deferring complex surface streets. He highlights Texas as a strong starting region and explains why each new route increases network value.
- •Start with L4 on Class 8 trucks focused on freeway segments
- •Transfer hubs enable early deployment while avoiding hardest urban last-mile
- •Dallas hub as a purpose-built autonomous trucking facility
- •Texas advantages: volume, regulation, weather, connectivity (Dallas–Houston–Austin)
- •Network effects: adding route segments increases whole-network utility
1:33:17 – 1:39:01
Truck-specific engineering: dynamics, safety boundaries, and sensor placement
The discussion turns to what makes trucks uniquely hard: heavy braking profiles, trailer articulation, rollover/jackknife risks, and load-dependent dynamics. Boris explains key sensor design choices, especially moving main pods to the sides to avoid trailer occlusion and improve redundancy.
- •Truck dynamics: longer stopping distances, roll risk, jackknifing constraints
- •Load changes shift physical safety limits; requires careful modeling
- •Testing to define high-dimensional safety boundaries (including extreme tests)
- •Sensor pods on sides (mirror location) reduce occlusion from tall trailer
- •Redundancy is essential: sensors will fail at scale; design assumes failures
1:39:01 – 1:46:06
Sensor fusion philosophy: LiDAR’s reliability, camera range, radar robustness
Boris lays out the complementary roles of LiDAR, cameras, and radar and why no single modality is sufficient. He also explains the trend toward earlier sensor fusion in ML systems to better exploit complementary signals, especially in weather and degraded conditions.
- •LiDAR: dense, consistent 3D geometry; weaker at long range and in some weather
- •Cameras: very long range and semantic richness; sensitive to lighting/glare and localization error at distance
- •Radar: strong in weather, provides velocity; useful long-range complement
- •Shift from ‘sensor-by-sensor’ to fused ML perception as the dominant paradigm
- •Early fusion often outperforms late fusion by preserving complementary cues
1:46:06 – 1:51:00
Machine learning at Waymo: from heuristic stacks to data-driven autonomy
Boris describes ML as increasingly central across the autonomy stack, first transforming perception and then expanding into behavior and planning. The key scaling goal is to improve with more data while minimizing the linear growth of human labor and hand-tuning.
- •ML role has grown dramatically since DARPA-era autonomy systems
- •Perception was first major ML success; behavior/planning increasingly data-driven
- •Need systems that improve with added data and avoid brittle heuristics
- •Key challenge: long-tail safety and evaluation, not just impressive demos
1:51:00 – 2:03:58
Simulation and evaluation: why proving safety is harder than driving
Waymo runs simulation at massive scale (orders of magnitude beyond real miles) to test, regress, and validate safety. Boris argues the core difficulty is evaluation—measuring readiness for rare, high-severity events and ensuring changes don’t create new failures.
- •~1000 simulated miles per real mile to accelerate learning and validation
- •Simulation used for regression testing, scenario replay, and stress testing
- •Rare events dominate readiness: issues that appear once in 100k–500k+ miles
- •Evaluation must prevent regressions while software iterates continuously
- •Borrowed safety practices from aerospace but with a far more variable environment
2:03:58 – 2:23:42
Waymo vs Tesla: L4 focus and sensor richness vs fleet-scale data advantages
Lex asks Boris to compare Waymo and Tesla as leading “robot builders” pursuing autonomy via different strategies. Boris frames Waymo’s advantage as L4-first design (hardware, safety framework, deployment learnings) and Tesla’s advantage as unprecedented real-world data collection and iteration at fleet scale.
- •Waymo’s edge: L4-first architecture, sensor suite, safety framework, driverless ops experience
- •Tesla’s edge: massive fleet data, real-world distribution, rapid learning loops
- •Core open question: timeline—how fast vision-only + ML can close long-tail gaps
- •Perception risk compounds downstream planning/safety challenges
- •Both approaches likely converge in capability, but via different paths and risks
2:23:42 – 3:14:13
Societal impacts and automation: cities, logistics efficiency, and trucking jobs
They zoom out to broader societal consequences: reshaped cities (parking, commuting), supply chain agility, environmental and safety gains, and the realities of job disruption. For trucking specifically, Boris argues autonomy may relieve the harshest long-haul work while humans remain vital in complex local routes and better logistics could improve driver quality of life.
- •Urban redesign: less parking, different commuting patterns, reduced traffic from ‘parking search’
- •Supply chain ripple effects: ports/warehouses/trucking coordination and resilience
- •Automation benefits: safety, efficiency, environmental improvements
- •Job impact framing: long-haul is hardest/least desirable; short-haul remains human-heavy longer
- •Logistics optimization can reduce unpaid waiting time and improve driver compensation

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

iOS

Android

Claude

Chrome

Boris Sofman’s robotics journey: Anki, Cozmo, and Waymo trucking

Science fiction robots and why nonverbal emotion works (WALL‑E, R2‑D2, Terminator)

Design philosophy: avoid humanoids, embrace constraints, and amplify expressiveness

Founding Anki at CMU: consumer robotics as ‘video games brought to life’

Cozmo’s ‘three-legged stool’: hardware, AI, and character (and why mistakes help)

Building emotional intelligence: eye contact, pets, and games as context for feelings

AI companions and the ‘Her’ question: intimacy without general intelligence

Anki’s shutdown: the brutal physics of hardware бизнеса and seasonal cash flow

After Anki: Cozmo’s legacy and how teams moved on (including to Waymo)

Waymo overview: Waymo Driver, Waymo One, and Waymo Via’s mission

Autonomous long-haul trucking strategy: transfer hubs, Texas routes, and network effects

Truck-specific engineering: dynamics, safety boundaries, and sensor placement

Sensor fusion philosophy: LiDAR’s reliability, camera range, radar robustness

Machine learning at Waymo: from heuristic stacks to data-driven autonomy

Simulation and evaluation: why proving safety is harder than driving

Waymo vs Tesla: L4 focus and sensor richness vs fleet-scale data advantages

Societal impacts and automation: cities, logistics efficiency, and trucking jobs

Get more out of YouTube videos.