Skip to content
Lex Fridman PodcastLex Fridman Podcast

Andrej Karpathy: Tesla AI, Self-Driving, Optimus, Aliens, and AGI | Lex Fridman Podcast #333

Andrej Karpathy is a legendary AI researcher, engineer, and educator. He's the former director of AI at Tesla, a founding member of OpenAI, and an educator at Stanford. Please support this podcast by checking out our sponsors: - Eight Sleep: https://www.eightsleep.com/lex to get special savings - BetterHelp: https://betterhelp.com/lex to get 10% off - Fundrise: https://fundrise.com/lex - Athletic Greens: https://athleticgreens.com/lex to get 1 month of fish oil EPISODE LINKS: Andrej's Twitter: http://twitter.com/karpathy Andrej's YouTube: http://youtube.com/c/AndrejKarpathy Andrej's Website: http://karpathy.ai Andrej's Google Scholar: http://scholar.google.com/citations?user=l8WuQJgAAAAJ Books mentioned: The Vital Question: https://amzn.to/3q0vN6q Life Ascending: https://amzn.to/3wKIsOE The Selfish Gene: https://amzn.to/3TCo63s Contact: https://amzn.to/3W3y5Au The Cell: https://amzn.to/3W5f6pa PODCAST INFO: Podcast website: https://lexfridman.com/podcast Apple Podcasts: https://apple.co/2lwqZIr Spotify: https://spoti.fi/2nEwCF8 RSS: https://lexfridman.com/feed/podcast/ Full episodes playlist: https://www.youtube.com/playlist?list=PLrAXtmErZgOdP_8GztsuKi9nrraNbKKp4 Clips playlist: https://www.youtube.com/playlist?list=PLrAXtmErZgOeciFP3CBCIEElOJeitOr41 OUTLINE: 0:00 - Introduction 0:58 - Neural networks 6:01 - Biology 11:32 - Aliens 21:43 - Universe 33:34 - Transformers 41:50 - Language models 52:01 - Bots 58:21 - Google's LaMDA 1:05:44 - Software 2.0 1:16:44 - Human annotation 1:18:41 - Camera vision 1:23:46 - Tesla's Data Engine 1:27:56 - Tesla Vision 1:34:26 - Elon Musk 1:39:33 - Autonomous driving 1:44:28 - Leaving Tesla 1:49:55 - Tesla's Optimus 1:59:01 - ImageNet 2:01:40 - Data 2:11:31 - Day in the life 2:24:47 - Best IDE 2:31:53 - arXiv 2:36:23 - Advice for beginners 2:45:40 - Artificial general intelligence 2:59:00 - Movies 3:04:53 - Future of human civilization 3:09:13 - Book recommendations 3:15:21 - Advice for young people 3:17:12 - Future of machine learning 3:24:00 - Meaning of life SOCIAL: - Twitter: https://twitter.com/lexfridman - LinkedIn: https://www.linkedin.com/in/lexfridman - Facebook: https://www.facebook.com/lexfridman - Instagram: https://www.instagram.com/lexfridman - Medium: https://medium.com/@lexfridman - Reddit: https://reddit.com/r/lexfridman - Support on Patreon: https://www.patreon.com/lexfridman

Andrej KarpathyguestLex Fridmanhost
Oct 29, 20223h 28mWatch on YouTube ↗

At a glance

WHAT IT’S REALLY ABOUT

Andrej Karpathy on AI, AGI, aliens, and humanity’s explosive future

  1. Lex Fridman and Andrej Karpathy range across technical and philosophical territory: how modern neural networks work, the transformer’s importance, Software 2.0, large-scale data engines, and Tesla’s vision-based self-driving and humanoid robots.
  2. Karpathy argues that current AI systems already exhibit nontrivial understanding and reasoning, and that scaling data, models, and multimodal inputs will likely lead to AGI, possibly without physical embodiment.
  3. They explore the origins and prevalence of life in the universe, the Fermi paradox, whether our universe is a simulation with possible “exploits,” and how future synthetic intelligences might solve the universe’s “puzzle.”
  4. The conversation closes on ethics, safety, human meaning, longevity, and what a world of ubiquitous AI agents, humanoid robots, and virtual realities might look like, with Karpathy cautiously optimistic yet acutely aware of existential risks.

IDEAS WORTH REMEMBERING

5 ideas

Transformers act as a general-purpose differentiable computer and underpin modern AI progress.

Karpathy views the transformer as a powerful, relatively simple architecture that is expressive in the forward pass, trainable with backpropagation, and highly parallelizable on GPUs—making it the de facto backbone for language, vision, and multimodal models.

Data, not hand-coded logic, is the new center of software—“Software 2.0.”

Instead of writing rules, engineers design architectures and, crucially, build large, diverse, accurate datasets plus loss functions; optimization “fills in the blanks” in the weights, so the real programming happens via data curation and iteration loops (data engines).

Vision-only self-driving is both necessary and, Karpathy argues, sufficient.

He claims cameras provide the richest, cheapest constraints on the world and match the human sensor stack that roads are designed for; additional sensors like radar or lidar add organizational and data complexity, so they must deliver large gains to justify their cost—and often don’t.

Large language models already exhibit a form of understanding and reasoning.

Trained on next-token prediction, GPT-like systems must implicitly learn physics, chemistry, human behavior, and many tasks embedded in text; their ability to solve novel problems via prompting indicates genuine generalization rather than simple pattern lookup.

Embodiment (e.g., humanoid robots) is a powerful but not strictly necessary path to AGI.

Karpathy thinks AGI may emerge from scaled multimodal internet models alone, but sees Optimus-style humanoid robots as a high-certainty hedge: if AGI requires acting in and learning from the physical world, a large fleet of human-form robots will eventually discover the needed algorithms.

WORDS WORTH SAVING

5 quotes

We’re not writing the algorithm anymore; we’re writing the dataset.

Andrej Karpathy

A transformer is basically a general-purpose differentiable computer that happens to run extremely well on our hardware.

Andrej Karpathy

Vision is both necessary and sufficient for driving. Roads are built for human eyes.

Andrej Karpathy

I kind of think of neural nets as a very complicated alien artifact.

Andrej Karpathy

I suspect the universe is some kind of a puzzle, and synthetic AIs will uncover that puzzle and solve it.

Andrej Karpathy

Foundations of neural networks, emergent behavior, and transformer architecturesLanguage models, GPT-style systems, and the limits of text-only trainingSoftware 2.0, data engines, and large-scale deployment at Tesla (autopilot, vision-only self-driving, Optimus)Simulation, synthetic data, and agents acting on the internet and in the physical worldOrigins of life, the Fermi paradox, and the likelihood and detectability of alien civilizationsAGI, consciousness, alignment, and societal impacts of powerful AI systemsHuman productivity, learning, teaching, and personal philosophy about work, meaning, and longevity

High quality AI-generated summary created from speaker-labeled transcript.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome