Skip to content
Y CombinatorY Combinator

Andrej Karpathy: Software Is Changing (Again)

Andrej Karpathy's keynote on June 17, 2025 at AI Startup School in San Francisco. Slides provided by Andrej: https://drive.google.com/file/d/1a0h1mkwfmV2PlekxDN8isMrDA5evc4wW/view?usp=sharing Chapters: 00:00 - Intro 01:25 - Software evolution: From 1.0 to 3.0 04:40 - Programming in English: Rise of Software 3.0 06:10 - LLMs as utilities, fabs, and operating systems 11:04 - The new LLM OS and historical computing analogies 14:39 - Psychology of LLMs: People spirits and cognitive quirks 18:22 - Designing LLM apps with partial autonomy 23:40 - The importance of human-AI collaboration loops 26:00 - Lessons from Tesla Autopilot & autonomy sliders 27:52 - The Iron Man analogy: Augmentation vs. agents 29:06 - Vibe Coding: Everyone is now a programmer 33:39 - Building for agents: Future-ready digital infrastructure 38:14 - Summary: We’re in the 1960s of LLMs — time to build Drawing on his work at Stanford, OpenAI, and Tesla, Andrej sees a shift underway. Software is changing, again. We’ve entered the era of “Software 3.0,” where natural language becomes the new programming interface and models do the rest. He explores what this shift means for developers, users, and the design of software itself— that we're not just using new tools, but building a new kind of computer. More content from Andrej: https://www.youtube.com/@AndrejKarpathy Thoughts (From Andrej Karpathy!) 0:49 - Imo fair to say that software is changing quite fundamentally again. LLMs are a new kind of computer, and you program them *in English*. Hence I think they are well deserving of a major version upgrade in terms of software. 6:06 - LLMs have properties of utilities, of fabs, and of operating systems → New LLM OS, fabbed by labs, and distributed like utilities (for now). Many historical analogies apply - imo we are computing circa ~1960s. 14:39 - LLM psychology: LLMs = "people spirits", stochastic simulations of people, where the simulator is an autoregressive Transformer. Since they are trained on human data, they have a kind of emergent psychology, and are simultaneously superhuman in some ways, but also fallible in many others. Given this, how do we productively work with them hand in hand? Switching gears to opportunities... 18:16 - LLMs are "people spirits" → can build partially autonomous products. 29:05 - LLMs are programmed in English → make software highly accessible! (yes, vibe coding) 33:36 - LLMs are new primary consumer/manipulator of digital information (adding to GUIs/humans and APIs/programs) → Build for agents! Some of the links: - Software 2.0 blog post from 2017 https://karpathy.medium.com/software-2-0-a64152b37c35 - How LLMs flip the script on technology diffusion https://karpathy.bearblog.dev/power-to-the-people/ - Vibe coding MenuGen (retrospective) https://karpathy.bearblog.dev/vibe-coding-menugen/ Apply to Y Combinator: https://ycombinator.com/apply Work at a startup: https://workatastartup.com

Jun 19, 202539mWatch on YouTube ↗

CHAPTERS

  1. Why software is changing again: a rare moment to enter the industry

    Karpathy frames the current moment as a fundamental shift in how software is built—one of the biggest in decades—and argues it creates massive opportunities to write and rewrite software. He sets up the talk around understanding LLMs as a new kind of computer and learning how to build with them responsibly.

  2. Software 1.0 → 2.0 → 3.0: code, weights, and prompts

    He lays out a clean taxonomy: Software 1.0 is hand-written code, Software 2.0 is neural network weights trained via data/optimization, and Software 3.0 is LLMs that are programmable via prompts. The key novelty is that the “programming language” for 3.0 is natural language (English).

  3. Tesla Autopilot lesson: new paradigms ‘eat’ the existing stack

    Using Autopilot as an example, he describes how capabilities migrated from C++ (1.0) into neural networks (2.0), deleting large amounts of traditional code. He argues a similar phenomenon is happening again with LLM-based software, which will consume and reshape existing stacks.

  4. LLMs as utilities: metered intelligence and ‘brownouts’

    Karpathy explains why LLMs resemble utilities: huge capex to build, opex to serve, metered pricing, and expectations around reliability and latency. He notes that when top models go down it creates something like an “intelligence brownout,” revealing growing dependence.

  5. LLMs as fabs and operating systems: the ecosystem taking shape

    He extends the analogy: LLMs also resemble fabs due to the high capex and fast-moving, centralized R&D, though software’s malleability weakens defensibility. The most fitting analogy, he argues, is the operating system: closed providers versus an open ecosystem (e.g., Llama-like ‘Linux’), plus an emerging tool/multimodal stack around the model core.

  6. Historical computing parallels: we’re in the LLM ‘1960s’ (cloud + time-sharing)

    LLM compute is still expensive, so the dominant model is centralized cloud inference with many users sharing capacity—akin to early time-sharing systems. He suggests the personal computing revolution for LLMs hasn’t happened yet, though hints appear (e.g., local inference on memory-bound hardware).

  7. A reversal in tech diffusion: consumers first, institutions later

    Unlike many transformative technologies that start with government/corporate adoption and later reach consumers, LLMs diffused directly to billions of people quickly. This shapes which applications appear first and why consumer workflows can lead enterprise transformation this time.

  8. LLM ‘psychology’: people-spirits with superpowers and sharp failure modes

    He characterizes LLMs as stochastic simulations of people—trained on human text and exhibiting emergent human-like traits. They can be superhuman in recall and breadth, yet unreliable: hallucinations, jagged intelligence, weak self-knowledge, and security vulnerabilities demand careful product design.

  9. Designing partial-autonomy apps: why dedicated products beat raw chat

    He argues that most valuable experiences won’t be ‘talk to the OS directly’ (chat) but app-specific interfaces that manage context, orchestrate multiple model calls, and present outputs in auditable ways. Cursor (coding) and Perplexity (search/research) illustrate a pattern: a classic UI plus LLM layers that enable larger “chunks” of work.

  10. The autonomy slider: calibrating how much control you give the model

    A central product concept is an autonomy slider: users choose small assists or large, agentic actions depending on risk and task complexity. He highlights Cursor’s gradations (completion → edit selection → edit file → agent across repo) and Perplexity’s modes (quick search → research → deep research).

  11. Human–AI collaboration loops: generation vs verification (and keeping AI on a leash)

    Karpathy emphasizes that today humans typically verify while AIs generate, so productivity depends on making verification fast and manageable. He advocates GUIs/visualization for rapid auditing and warns against producing huge diffs or uncontrolled changes that overwhelm human review.

  12. Autonomy reality check from self-driving: agents are a decade-long project

    Drawing on autonomy work at Tesla and the history of Waymo demos, he cautions against hype like “this is the year of agents.” Even when demos look perfect, real-world reliability and edge cases can take many years, and human-in-the-loop operation often persists longer than expected.

  13. Iron Man framing: build suits (augmentation) before fully autonomous robots

    He uses Iron Man to clarify product direction: aim for augmentation and controllable autonomy rather than flashy fully autonomous agents. The best near-term products behave like “suits” that keep humans empowered while steadily pushing the autonomy slider rightward.

  14. Vibe coding: English as the new on-ramp to programming

    Because Software 3.0 is programmed in natural language, many more people can build software without years of training. He discusses “vibe coding” as a meme that captures a real shift—rapid prototyping and personal software creation—while noting that shipping a real product still involves painful non-coding steps.

  15. Build for agents: make digital infrastructure legible and actionable to LLMs

    He argues LLMs are becoming a primary consumer/manipulator of digital information alongside humans (GUIs) and programs (APIs). The next frontier is “agent-friendly” infrastructure: markdown docs, machine-actionable instructions (replace ‘click’ with cURL), domain guidance files, MCP-style protocols, and tools that reshape existing sources into LLM-ingestable formats.

  16. Closing synthesis: the 1960s of LLMs—time to rebuild the stack

    Karpathy recaps the central thesis: LLMs are utility-like, fab-built, OS-like computers with human-ish psychology and sharp edges, and we’re early in the cycle—comparable to the 1960s of computing. The call to action is to build partial-autonomy products, accelerate human–AI loops, and upgrade infrastructure so agents can operate safely and effectively.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.