No Priors Ep. 79 | With Magic.dev CEO and Co-Founder Eric Steinberger

Today on No Priors, Sarah Guo and Elad Gil are joined by Eric Steinberger, the co-founder and CEO of Magic.dev. His team is developing a software engineer co-pilot that will act more like a colleague than a tool. They discussed what makes Magic stand out from the crowd of AI co-pilots, the evaluation bar for a truly great AI assistant, and their predictions on what a post-AGI world could look like if the transition is managed with care. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @EricSteinb Show Notes: 0:00 Introduction 0:45 Eric’s journey to founding Magic.dev 4:01 Long context windows for more accurate outcomes 10:53 Building a path toward AGI 15:18 Defining what is enough compute for AGI 17:34 Achieving Magic’s final UX 20:03 What makes a good AI assistant 22:09 Hiring at Magic 27:10 Impact of AGI 32:44 Eric’s north star for Magic 36:09 How Magic will interact in other tools

Sarah GuohostEric SteinbergerguestElad Gilhost

Aug 30, 202437mWatch on YouTube ↗

CHAPTERS

0:00 – 0:43
Magic.dev’s mission: an AI engineer “colleague,” not a tool
Sarah opens by framing Magic as a software-engineer copilot designed to behave more like a teammate than a traditional autocomplete tool. Eric’s eclectic background is introduced as context for why he’s building an AI model + system rather than a narrow product feature.
- •Magic aims for a colleague-like software engineering experience
- •Eric’s prior work spans Meta, games, and climate nonprofit efforts
- •Positioning: model + product system, not just a UI layer
0:43 – 2:49
From teenage ‘midlife crisis’ to reinforcement learning research roots
Eric recounts how an early obsession with “doing something important” led him toward AI, then to learning to code and focusing on reinforcement learning. He describes self-directed mentorship, early research collaborations, and a long-term fixation on algorithms that learn efficiently.
- •Early motivation: AI as the path to ‘do everything’
- •Learns to code, gravitates to reinforcement learning as a foundation
- •Cold outreach/mentorship and early research collaborations
- •Focus on sample efficiency and improved RL algorithm structures
2:49 – 3:58
Why Magic started with code: bootstrapping a system that can build itself
Sarah asks why code, and Eric explains the “work backwards from AGI” logic: a system that can write and validate code can help build the next system. Narrowing to code reduces product complexity (though not compute needs) and creates a pragmatic wedge toward general intelligence.
- •AGI framing: build a system that can build the system
- •Code + experimentation as the minimal sufficient capability set
- •Narrow domain simplifies product surface area
- •Compute is still the hard part; less savings than expected
3:58 – 6:13
Long context as the core primitive: ‘bring compute to the data’
The conversation turns to Magic’s architectural bets—especially ultra-long context windows. Eric argues that in-context learning is the key “magic” of transformers, enabling models to incorporate rapidly changing, personal/team data without constant fine-tuning.
- •Models need long histories of actions and collaboration context
- •In-context learning viewed as an ‘online optimizer’
- •Principle: ‘bring compute to the data’ rather than forcing data into training
- •Five million tokens announced early; longer horizons require more memory
6:13 – 7:13
Why long context can beat retrieval: Bitter Lesson and learning the heuristic
Elad presses on whether retrieval might be more efficient, and Eric argues that retrieval is a heuristic that selects a subset, while long-context models can learn that heuristic (and potentially outperform it). He notes the caveat that context quality still matters, but frames long context as the more general solution.
- •Retrieval chooses a subset; long context can see everything
- •If retrieval were optimal, a sufficiently capable model could learn it
- •Bitter Lesson framing: prefer scalable learning over hand-built heuristics
- •Practical caveat: you must validate long-context quality vs. short+retrieval
7:13 – 10:44
Inference-time compute and test-time search: trading training vs. thinking
Elad asks about test-time search, and Eric lays out performance as a function of training compute and inference compute. He argues for user- or workload-adjustable inference spend, but notes the algorithmic and product complexity of doing this well in general-domain settings (unlike games/RL benchmarks).
- •Performance depends on both training compute and inference compute
- •Optimal allocation depends on total budget and expected inference volume
- •Market reality: spiky spend distribution from cents to million-dollar jobs
- •Test-time compute control is valuable but non-trivial outside game-like domains
10:44 – 15:17
A pragmatic roadmap to AGI: recursive self-improvement with a safety boundary
Sarah asks how AGI goals shape roadmap decisions; Eric emphasizes iterative bootstrapping where the model helps solve both capability and safety/alignment problems at each stage. He argues compute is one of the few controllable “knobs,” and that automation may be necessary to prioritize safety effectively.
- •Risk framing: many near-term issues are solvable; ‘evolutionary’ risk is different
- •Strategy: iterative recursion—use the model to improve itself and its safety
- •Safety as a boundary condition, not a side project
- •Automation may be required to meaningfully prioritize safety work
15:17 – 17:33
‘Enough compute’ for AGI (and for coding): competing with big labs’ focus
Elad revisits compute requirements; Eric admits he underestimated how much compute would be required and how intensely others would pursue coding. He also introduces the idea that for many products there’s a practical ‘good enough’ threshold, beyond which improvements matter only for rarer frontier tasks.
- •Direct competition in code means heavy compute gets deployed in the domain
- •Eric acknowledges earlier underestimation of required compute
- •Plans include building/announcing an extremely large compute cluster
- •Concept of ‘good enough’ models for most users vs. frontier breakthroughs
17:33 – 20:03
The ‘final UX’ for Magic: escaping the uncanny valley and shipping the right step-change
Asked about productization, Eric describes internal UX iterations that felt like an uncanny valley—promising but not yet usable for real work. He argues it’s counterproductive to launch a mediocre assistant if you’re close to a step-function “final UX,” though deadlines may force an interim release.
- •Internal iterations: signs of life but not yet daily-driver quality
- •Avoid shipping mere completions; they’ll be outpaced quickly
- •Goal: design for a ‘final UX’ to prevent constant product resets
- •Reality: many engineering fires; optimism despite delays
20:03 – 21:59
What makes a great AI assistant: trust, reliability, and reduced review burden
Elad asks what separates mediocre from amazing; Eric answers: trust. He describes a bar where the assistant’s output needs minimal review—similar to how teams trust code after multiple strong human reviewers—unlocking a step-function shift from occasional use to near-total workflow dependence.
- •Core differentiator: trust (not just raw capability)
- •Automation bar: move from heavy review to light review to no review
- •Step-function adoption: users go from 0% to ~90% usage, not 5%
- •Belief: once the model clears some use cases, the leap to most is small
21:59 – 26:14
Hiring at Magic: finding overlooked talent, mission-driven culture, and quiet ambition
Sarah asks about building the team; Eric explains early recruiting was hard without marquee credentials, so Magic sought high-upside, under-recognized engineers and researchers. He emphasizes loyalty, deep mission alignment, and a culture of humility and intense productivity over “poached IP.”
- •Early-stage recruiting challenge vs. famous founders/labs
- •Strategy: identify ‘hidden gem’ talent with strong intrinsic motivation
- •Example: deeply knowledgeable engineer who innovated training sharding
- •Culture: mission-first, high output, humble ambition; no IP-poaching ethos
26:14 – 31:44
Impact of AGI: abundance, power trade-offs, and meaning in a post-work society
Sarah asks about AGI’s implications; Eric argues the future is complex and often oversimplified into single-issue debates (e.g., open source vs. centralization), while both sides can hold true simultaneously. He predicts broad automation of computer-based work and major societal challenges around identity, competition, and meaning.
- •Many debates are ‘both true’—risk trade-offs resist simplification
- •Need for capitalism/competition plus ‘right’ guardrails
- •Likely automation of most computer work (and possibly more)
- •Post-work meaning problem: competitive/achievement identities get disrupted
31:44 – 35:00
Eric’s north star: make the transition go well so the future isn’t ‘terrible’
Elad asks what grand question Magic should answer; Eric reframes his north star as ensuring the world is in a good place in 30 years. He describes AGI futures as bimodal—amazing if managed well, catastrophic if not—so the priority is responsible deployment and governance, not picking a single math problem to solve.
- •North star: a good world in 30 years, not a specific theorem
- •Belief: many knowledge problems will be answered as a side effect
- •AGI outcomes feel bimodal; ‘no mediocre AGI future’ in his view
- •Responsibility framing: leaders must focus on preventing failure modes
35:00 – 37:46
How Magic fits into developer tools: AI as the ‘employee’ above the toolchain
Elad asks how Magic will interact with IDEs and other tools; Eric says his view changed multiple times, but he now expects the AI to operate like an employee using the same toolset. He predicts tools will increasingly be optimized for AI agents (like SEO for crawlers), with the model/agent becoming the primary layer above commodity integrations.
- •Product stance: follow what the market wants; view evolving
- •AI should use the same tools humans use—like an employee
- •Expect AI-specific optimizations in tools (analogous to SEO for crawlers)
- •Agent/model becomes the main layer; many wrapper products get subsumed