Skip to content
a16za16z

Marc Andreessen & Amjad Masad on “Good Enough” AI, AGI, and the End of Coding

Amjad Masad, founder and CEO of Replit, joins a16z’s Marc Andreessen and Erik Torenberg to discuss the new world of AI agents, the future of programming, and how software itself is beginning to build software. They trace the history of computing to the rise of AI agents that can now plan, reason, and code for hours without breaking, and explore how Replit is making it possible for anyone to create complex applications in natural language. Amjad explains how RL unlocked reasoning for modern models, why verification loops changed everything, whether LLMs are hitting diminishing returns, and if “good enough” AI might actually block progress toward true general intelligence. 00:00 Intro 00:37 Programming in Plain English 03:00 The Vision Behind Replit 05:15 From Machine Code to English Code 07:00 Building Apps with AI Agents 09:30 When the Agent Becomes the Programmer 11:00 Long-Horizon Reasoning and Coherence 13:45 Reinforcement Learning and Problem Solving 17:30 The Verification Loop and Multi-Agent Systems 21:15 Watching AI Work Like a Human Programmer 23:45 From Stochastic Parrots to Real Reasoning 26:00 Why Coding Is Advancing Faster Than Other Fields 30:15 Verifiable Domains: Math, Code, and Physics 33:45 The AGI Debate: Are We on Track? 37:45 Transfer Learning and the Limits of Human Intelligence 41:15 Functional AGI and Automating Labor 45:20 GPT-5, Diminishing Returns, and Lost “Humanity” 53:10 Creativity, Reasoning, and Finding Truth in AI 57:30 The Origins of Replit and Early Coding Days 01:03:00 Hacking His University and Getting Caught 01:08:00 The Redemption and Lessons Learned for the AI Age Stay Updated: If you enjoyed this episode, be sure to like, subscribe, and share with your friends! Resources: Follow Amjad on X: https://x.com/amasad Follow Marc on X: https://x.com/pmarca Find a16z on X: https://x.com/a16z Find a16z on LinkedIn: https://www.linkedin.com/company/a16z Listen to the a16z Podcast on Spotify: https://open.spotify.com/show/5bC65RDvs3oxnLyqqvkUYX Listen to the a16z Podcast on Apple Podcasts: https://podcasts.apple.com/us/podcast/a16z-podcast/id842818711 Follow our host: https://x.com/eriktorenberg Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.

Marc AndreessenhostAmjad MasadguestErik Torenberghost
Oct 23, 20251h 11mWatch on YouTube ↗

CHAPTERS

  1. AI feels like magic—yet expectations keep rising

    Marc and Amjad open by noting the strange emotional whiplash in AI: astonishing breakthroughs paired with constant disappointment that progress isn’t faster or broader. They set the theme of the conversation—“good enough” AI can be transformative while still falling short of deeper intelligence goals.

  2. Replit’s plain-English programming experience (idea → app → publish)

    Amjad walks through how a novice or experienced user can start in Replit by describing an app in natural language. The agent proposes a plan, builds the software, tests it, and can publish it to production with minimal setup from the user.

  3. From accidental complexity to ‘English is the programming language’

    They frame Replit’s mission as removing ‘accidental complexity’ (tools, package managers, setup) so builders focus on intent. Amjad argues that syntax itself became the final bottleneck, pushing the platform toward natural language as the primary interface.

  4. When the agent becomes the real programmer

    Marc highlights a key shift: the agent is no longer a helper, it’s effectively the main ‘user’ of the development tools. Amjad gives an operational example—latency issues changed once Replit realized the ‘programmer’ was the U.S.-hosted model, not the human in Asia.

  5. Long-horizon coherence: how long can agents work before they ‘derail’?

    They dig into the core technical limitation of early agents: loss of coherence over time, compounding errors, and bizarre failure modes. Amjad explains why context management and memory compression are critical to keeping agents on track for longer tasks.

  6. Why reinforcement learning changed the game for reasoning

    Amjad argues the major foundation-model breakthrough enabling longer reasoning chains is reinforcement learning (RL), especially via code execution and verifiable tasks. RL trains models on successful multi-step ‘trajectories’ that reach a correct solution, reinforcing problem-solving behavior rather than next-token prediction alone.

  7. Measuring progress: benchmarks vs Replit’s real-world success metric

    They discuss how to quantify long-horizon capability, referencing external benchmarks and Replit’s internal A/B tests. Amjad claims agent runtime improved dramatically across Replit’s releases, using ‘publish’ as the strongest signal of user value and task completion.

  8. The verification loop and multi-agent scaffolding (relay race for reliability)

    Amjad describes the non-model innovation that made long runs practical: adding a verifier in the loop. Replit uses multi-agent workflows where one agent builds, another tests (e.g., via browser automation), summarizes progress, and triggers a new trajectory when bugs appear—allowing iterative reliability over long time horizons.

  9. Watching AI code ‘like a human’: speed, tool use, and reflective pauses

    They compare agent behavior to a hyper-productive human programmer: fast but not instantaneous, with pauses to reason, reflect, and search. The agent uses tools like web search when encountering unfamiliar compatibility issues, making it feel like observing real engineering work.

  10. From ‘stochastic parrots’ to verifiable reasoning—and why code advances fastest

    They revisit early criticisms of LLMs as ‘stochastic parrots’ that mimic language without true reasoning. The conversation argues verifiable domains (code, math, some physics) improve fastest because correctness can be checked automatically, enabling scalable RL and synthetic data generation—unlike ‘squishy’ fields such as law or healthcare.

  11. AGI on track—or trapped in a ‘good enough’ local maximum?

    Amjad raises concerns that advances in one domain don’t reliably transfer to others, challenging the idea that scaling alone yields general intelligence. Marc counters with human limitations on transfer learning and notes shifting definitions of AI (once solved, it stops being called AI), while both acknowledge the risk of optimizing a locally useful but non-general peak.

  12. Functional AGI: automating labor without ‘true’ general intelligence

    Amjad proposes a pragmatic path: even without a breakthrough in general intelligence, models can become ‘functionally’ general by being trained across many economically important tasks and sectors. This could automate large portions of work through targeted data collection, domain-by-domain tooling, and applied RL setups.

  13. GPT-5, diminishing returns, and the ‘loss of humanity’ vs gains in rigor

    Amjad argues GPT-5 improved mainly in verifiable domains but felt less human and emotionally resonant than earlier models, triggering user backlash. Marc responds that for his use (deep explanations and synthesis), top-tier models produce highly coherent long-form output, raising questions about what counts as ‘new knowledge’ vs synthesis.

  14. Amjad Masad’s origin story: early computers, first software business, and Replit’s beginnings

    Amjad recounts getting his first exposure to computers in Jordan, building early software with Visual Basic, and monetizing a system for internet/LAN cafes. He explains how frustration with local dev setup and belief in the web as the platform led to the earliest versions of Replit, including compiling languages into the browser.

  15. Hacking the university, getting caught, and lessons for the AI age

    In a dramatic story, Amjad describes hacking his university database to fix attendance-related failures, triggering system anomalies that led to discovery. He avoided prosecution, helped secure the systems, and ultimately graduated—ending with reflections on nonconformity, responsibility, and using powerful tools wisely as the AI era reshapes traditional paths.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome