Skip to content
ClaudeClaude

How Lovable vibecodes production software at scale

Vibecoding a beautiful, working prototype is easier than ever. Running a platform that non-developers use to ship production software is a different engineering problem. Today, software built on Lovable serves 600M+ monthly sessions. In this session, Fabian Hedin, Cofounder and CTO of Lovable, walks through the systems Lovable built to run Claude reliably at consumer scale — the fleet-learning layer that catches coding mistakes, the eval loop that gates every model release, and how to keep Lovable itself improving.

Fabian Hedinhost
May 20, 202631mWatch on YouTube ↗

CHAPTERS

  1. Lovable’s chat-to-preview product: who it’s for and what it builds

    Fabian opens by showing Lovable’s simple UI: a chat on the left and a live preview on the right. He explains the platform’s scope—everything from landing pages to internal tools to full web apps—and highlights how broadly it’s used, from kids to Fortune 500 teams.

  2. Origin story: from “gpt-engineer” to Lovable’s mission for the 99%

    He traces Lovable’s roots to the “gpt-engineer” GitHub repo, which showcased end-to-end code generation early on. Lovable’s differentiation is aiming beyond developer productivity toward enabling non-coders (“the 99%”) to build real software, which initially wasn’t feasible until models improved.

  3. Scale and usage today: products built, traffic, and surprising user segments

    Fabian shares Lovable’s current scale: tens of millions of products and massive downstream traffic to sites users build. He notes that despite targeting non-technical users, many users self-identify as engineers, reflecting a shift toward higher-level, spec-driven building even for pros.

  4. Two guiding principles: production-grade ambition + non-technical accessibility

    He frames Lovable’s core tension: pushing toward production-grade, frontier complexity while keeping the experience usable for non-technical people. This combination makes reliability and “unsticking” users essential because users may not know how to debug or inspect code.

  5. Why ‘the last 10%’ still dominates in AI building (and the ‘stuck’ problem)

    Fabian explains that AI accelerates the first version but finishing and debugging remains disproportionately hard—mirroring classic software wisdom. He maps progress as green → yellow → red friction, noting that getting stuck is especially damaging for non-technical builders.

  6. Defining and detecting “stuck”: the is_stuck metric and categories

    He introduces Lovable’s internal “is_stuck” classifier—triggered by repeated requests, complaints, or abandonment. He then categorizes stuck states into three buckets: prompt-fixable issues, platform/tooling gaps that should be easy, and deeper product limitations requiring larger investments.

  7. Self-healing tactic #1: “Lovable Overflow” as a Stack Overflow–style corpus

    Fabian presents Lovable Overflow: a structured collection of recurring issues and their solutions. When a user reports a problem (e.g., laggy scrolling), Lovable searches this corpus and injects relevant, adapted context into the main agent so it can jump closer to the right fix without repetitive back-and-forth.

  8. Keeping retrieved knowledge trustworthy: success tracking, staleness, and pruning

    He emphasizes that a knowledge corpus can hurt if it becomes outdated. Lovable tracks success ratios per knowledge item, prunes stale entries, and continuously refills with new learnings—treating knowledge as having a half-life that must be actively managed.

  9. Self-healing tactic #2: “venting” tool—agents report friction directly to the team

    For problems caused by platform/tooling constraints (bucket 2), Lovable gives the agent a “vent” feedback tool. When the agent is materially slowed by unclear docs, broken tools, or repeated failures, it sends a report to Slack; another agent dedupes, investigates, and may open a PR for engineers to review.

  10. Venting in action: rapid bug fixes and broader ecosystem feedback

    Fabian shares concrete examples: a file-copy tool bug triggered by special characters led to a PR merged into production within minutes. He also shows an example where the agent complains about Framer Motion TypeScript typing friction, raising the idea that agents could eventually contribute fixes upstream to open source.

  11. Venting as incident detection: spikes reveal production outages early

    Unexpectedly, vent volume spikes became an operational signal. During outages (inference down, sandbox/network failures), the agent-generated vents sometimes alerted the team before traditional monitoring/paging, and also provided rich debugging context about what the agent experienced.

  12. Meta-failure and self-fix: vent spam leads to a dedupe safeguard PR

    In a meta example, the agent overused venting in parallel conversations, spamming Slack. It then proposed and implemented its own mitigation (a dedupe safeguard) via a PR, which engineers reviewed and merged—demonstrating the self-healing loop even for the self-healing system.

  13. Key lessons and measured impact: model-specific tuning, knowledge half-life, and metrics

    Fabian concludes with learnings: failure modes are model-specific, so retrieval prompts and stored knowledge must be retuned and aggressively pruned as models change. He shares outcomes—reduced stuck rate and improved publish rate from Lovable Overflow, plus a steady stream of production fixes from vent-driven PRs.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.