Skip to content
The Twenty Minute VCThe Twenty Minute VC

David Luan: Why Nvidia Will Enter the Model Space & Models Will Enter the Chip Space | E1169

David Luan is the CEO and Co-Founder at Adept, a company building AI agents for knowledge workers. To date, David has raised over $400M for the company from Greylock, Andrej Karpathy, Scott Belsky, Nvidia, ServiceNow and WorkDay. Previously, he was VP of Engineering at OpenAI, overseeing research on language, supercomputing, RL, safety, and policy and where his teams shipped GPT, CLIP, and DALL-E. He led Google’s giant model efforts as a co-lead of Google Brain. ----------------------------------------------- Timestamps: (00:00) Intro (01:03) Lessons from Google Brain & Their Influence on Building Adept (05:06) Why It Took 6 Years for ChatGPT to Emerge After Transformers (06:49) Takeaways from OpenAI (09:57) The Key Bottleneck in AI Model Performance (16:06) Understanding Minimum Viable Capability Levels & Model Scale (20:17) The Future of the Foundational Model Layer (33:26) Adept’s Focus for Vertical Integration for AI Agents (35:53) The Distinction Between RPA & Agents (40:24) The Co-pilot Approach: Incumbent Strategy or Innovation Catalyst (42:46) Enterprise AI Adoption Budgets: Experimental vs. Core (46:53) AI Services Providers vs. Actual Providers (49:32) Open vs. Closed AI Systems for Crucial Decision Making (54:18) Quick-Fire Round ----------------------------------------------- In Today’s Episode with David Luan We Discuss: 1. The Biggest Lessons from OpenAI and Google Brain: What did OpenAI realise that no one else did that allowed them to steal the show with ChatGPT? Why did it take 6 years post the introduction of transformers for ChatGPT to be released? What are 1-2 of David’s biggest lessons from his time leading teams at OpenAI and Google Brain? 2. Foundation Models: The Hard Truths: Why does David strongly disagree that the performance of foundation models is at a stage of diminishing returns? Why does David believe there will only be 5-7 foundation model providers? What will separate those who win vs those who do not? Does David believe we are seeing the commoditization of foundation models? How and when will we solve core problems of both reasoning and memory for foundation models? 3. Bunding vs Unbundling: Why Chips Are Coming for Models: Why does David believe that Jensen and Nvidia have to move into the model layer to sustain their competitive advantage? Why does David believe that the largest model providers have to make their own chips to make their business model sustainable? What does David believe is the future of the chip and infrastructure layer? 4. The Application Layer: Why Everyone Will Have an Agent: What is the difference between traditional RPA vs agents? Why is agents a 1,000x larger business than RPA? In a world where everyone has an agent, what does the future of work look like? Why does David disagree with the notion of “selling the work” and not the tool? What is the business model for the next generation of application layer AI companies? ----------------------------------------------- Subscribe on Spotify: https://open.spotify.com/show/3j2KMcZTtgTNBKwtZBMHvl?si=85bc9196860e4466 Subscribe on Apple Podcasts: https://podcasts.apple.com/us/podcast/the-twenty-minute-vc-20vc-venture-capital-startup/id958230465 Follow Harry Stebbings on Twitter: https://twitter.com/HarryStebbings Follow David Luan on Twitter: https://twitter.com/jluan Follow 20VC on Instagram: https://www.instagram.com/20vchq Follow 20VC on TikTok: https://www.tiktok.com/@20vc_tok Visit our Website: https://www.20vc.com Subscribe to our Newsletter: https://www.thetwentyminutevc.com/contact ----------------------------------------------- #20vc #harrystebbings #davidluan #adeptai #venturecapital #ai #openai #nvidia #deepmind #chatgpt #apple

David LuanguestHarry Stebbingshost
Jun 24, 202458mWatch on YouTube ↗

CHAPTERS

  1. 0:00 – 0:57

    From “papers” to “Apollo projects”: the next phase after Transformers

    David opens with a thesis that OpenAI (and DeepMind) recognized early: the post-Transformer era would be driven less by publishing and more by attacking large, concrete scientific/engineering problems. He also previews his view that compute demand will keep rising because new training paradigms will soak up more resources.

    • Shift from academic paper progress to focused “solve the big problem” programs
    • Compute won’t hit diminishing returns in practice because new approaches will consume it
    • Tier-1 cloud providers have existential incentives to win the model layer
  2. 0:57 – 3:40

    Google Brain as the “Bell Labs” era: what bottom-up research looks like

    Harry asks about David’s Google Brain takeaways and how they shaped Adept. David describes Brain’s 2012–2018 dominance, talent density, and the mechanics of curiosity-driven, bottom-up research that produced foundational breakthroughs.

    • 2012–2018 as a golden period of rapid deep learning breakthroughs
    • Brain’s concentration of researchers behind Transformers, diffusion, optimization methods
    • Bottom-up research: minimal near-term accountability, curiosity-driven exploration
    • How this research culture differs from product-driven engineering
  3. 3:40 – 5:22

    Why Transformers changed everything—and why that shifted the playbook

    David frames Transformers as the first broadly general architecture that replaced a zoo of task-specific model families. Once the “universal model” emerged, the frontier shifted from inventing new basic architectures to scaling and applying Transformers to bigger problems.

    • Transformers were invented at Google, not OpenAI
    • Pre-Transformer world: CNNs for vision, RNNs for text, RL/tree search for games
    • Transformers as a universal substrate across tasks
    • After 2017, progress becomes less about low-level modeling novelty and more about scaling/application
  4. 5:22 – 6:49

    Why ChatGPT arrived years after Transformers: capability thresholds + packaging

    Harry probes the gap between the 2017 breakthrough and mass consumer adoption. David argues it took both (1) models crossing a “minimum viable smartness” threshold and (2) consumer-friendly product packaging—where GPT-3 existed earlier but lacked a viral, consumer interface.

    • Incremental improvements 2017→2022 created a “boiling frog” effect
    • GPT-2 as an early “aha” for general text generation
    • Minimum viable smartness: capability must cross a compelling-experience threshold
    • Packaging matters: GPT-3 API was developer-only; ChatGPT made it consumer-playable
  5. 6:49 – 8:30

    OpenAI lessons: mission-driven teams that tackle specific hard problems

    David contrasts Google Brain’s federated research with OpenAI’s problem-centric execution. He describes OpenAI’s “choose a major unsolved problem and solve it” approach and how that informs Adept’s culture and structure.

    • OpenAI’s cultural shift: fewer isolated papers, more integrated “big team” initiatives
    • Examples: robot hand control, game-playing, scaling GPT to general reasoning/chat
    • “Apollo project” framing vs organic self-organization
    • How this structure influences how Adept builds and prioritizes
  6. 8:30 – 14:53

    Compute, data, and the next scaling frontier: self-improvement loops & synthetic environments

    Harry challenges the idea of diminishing returns from compute and asks whether data is the real bottleneck. David argues scaling curves remain predictable at the right lens (e.g., doubling compute), and that a second wave—models generating/collecting their own training signal via tools, simulations, and RL loops—will drive further gains and consume massive compute.

    • Diminishing returns per GPU vs predictable returns per compute doubling (log-like scaling)
    • Analogy to Moore’s Law: progress continues via scale-up + scale-out systems
    • Next frontier: models learning by interacting with environments (tool use, reflection, retries)
    • Example: math reasoning via theorem provers/Jupyter + iterative self-critique
    • Why this shift is happening now: base-model training costs reaching multi-billion-dollar levels
  7. 14:53 – 18:18

    Agents vs chatbots: reliability, hallucinations, and emergent capability thresholds

    The conversation turns from model improvement to what agents require to be useful in the real world. David explains why hallucinations can be helpful for creative chat but unacceptable for agents, then introduces “minimum viable capabilities” where new skills appear suddenly once scale crosses a threshold.

    • Agents and chatbots are “different species” with different requirements
    • Hallucinations: feature for creativity vs bug for operational workflows
    • Minimum viable capability: sudden emergent skills at certain scales (GPT-2 arithmetic example)
    • Why agents are hard: key work knowledge isn’t neatly captured in datasets
  8. 18:18 – 23:21

    Reasoning and memory: what must be solved at the model layer vs the product layer

    Harry asks about breakthroughs in reasoning and why memory remains challenging. David argues reasoning likely requires training regimes with environments, trials, and feedback (not just internet-scale next-token prediction), while long-term memory is often better handled by end applications as part of a broader system rather than as an intrinsic LLM feature.

    • Reasoning definition: composing known thoughts to produce new ones
    • Pure scaling alone won’t solve reasoning; needs environment interaction + feedback loops
    • Reasoning improvements must happen at model-provider level (model must change)
    • Memory split: short-term context window progress vs long-term preference/task memory
    • LLMs aren’t products; products are full systems that incorporate LLMs (e.g., travel preference memory)
  9. 23:21 – 31:46

    Who wins the foundation-model layer: clouds, NVIDIA, Apple, and vertical integration pressure

    David predicts a small set of long-term frontier providers due to cost and incentives. He explains why cloud providers “must” own strong model offerings, why NVIDIA faces pressure from in-house chips, and why chips and models will converge through vertical integration; he also discusses Apple’s edge advantage and a future where model providers become hot-swappable behind the interface.

    • Steady state: ~5–7 max-scale LLM providers due to economics
    • Model layer becomes a core computing primitive; controlling it controls downstream compute value
    • NVIDIA vs in-house silicon: TPU as proof that challengers can succeed with strong will
    • Vertical integration: chipmakers move up to models; model providers move down to chips
    • Apple’s leverage: running capable models at the edge for privacy/latency/cost
    • Interface ownership enables commoditized, swappable “big brain” providers behind Apple
  10. 31:46 – 35:53

    What happens to independent model companies—and why Adept isn’t “just” a model seller

    Harry asks about the end state for foundational model startups that may not afford ongoing frontier training. David predicts tier-1 clouds will sustain their own efforts, while independents must either attach to clouds or build huge economic flywheels; he positions Adept as an enterprise agent company building a vertically integrated slice from UX to model adaptations.

    • Clouds will fund/ensure their own frontier efforts because it’s existential
    • Independent model sellers face a narrow window before commoditization
    • Survival path: massive cash-flow flywheel (often via strong distribution/product)
    • Adept’s stance: not selling foundation models to developers; selling enterprise agents
    • Why agents need vertical integration: reliability, UX leverage, and model-layer tailoring
    • Enterprise workflows as edge cases (e.g., every Salesforce instance configured differently)
  11. 35:53 – 42:46

    RPA vs agents: why the shift is disruptive, and how pricing and orgs may change

    Harry pushes on whether agents are simply “RPA 2.0.” David distinguishes RPA as deterministic automation for repetitive, uniform tasks, versus agents as planning, re-evaluating systems akin to “full self-driving,” then explains why incumbents struggle due to business-model disruption and why copilots/teammates reshape pricing and team structure.

    • RPA analogy: robots following a painted line for consistent, high-volume tasks
    • Agents analogy: full self-driving—planning, adapting, re-evaluating at every step
    • Why incumbents may be disadvantaged: current sales/implementation model is slow and services-heavy
    • Adept’s approach: end users teach capabilities via SOPs, demonstrations, corrections; aim for self-serve over time
    • Pricing debate: “price per work” fits commoditized repetition; knowledge work value looks more like teammate augmentation
    • Org impact: “collapsing the talent stack”—humans become broader generalists supervising AI specialists
  12. 42:46 – 48:20

    Enterprise adoption reality: experimental budgets, services growth, and productization waves

    Harry asks whether enterprises are moving from experimentation to core budgets, and whether AI services firms will outgrow model providers. David argues most enterprise spend remains experimental (and he avoids low-quality experimental revenue), adoption will be long and uneven, and today’s services-led implementations will be productized into repeatable software companies that capture major value.

    • Enterprise AI adoption is mostly experimental; core-budget penetration is still early
    • Enterprises lag even on “mature” tech like cloud due to on-prem/mainframe inertia
    • Pragmatic revenue quality: avoid deals funded only by experimentation budgets
    • Services fill the “gulf” between base models and enterprise needs early on
    • Over time, successful services patterns get productized into scalable companies
  13. 48:20 – 54:18

    Regulation, open vs closed, and why human-computer interaction is the missing ingredient

    David warns more about regulatory capture than over-regulation, arguing incumbents may “pull up the ladder” by shaping rules. He defends open source as a way for the broader field to keep up, then claims the critical frontier is better human-computer interaction—richer than prompting—so humans can supervise, correct, and align increasingly capable systems.

    • Primary regulatory risk: capture that entrenches a few dominant players
    • Consequence: harder to build on open source; higher barriers for new AI entrants
    • Open vs closed: misuse risks are real, but open typically lags closed due to resources
    • AGI discussions can be “infinity reasoning”; focus on near-term path dependence
    • HCI as key: chat/prompting is too thin vs human collaboration on shared canvases/tools
    • Need better supervision/alignment interfaces to shape training data and system design
  14. 54:18 – 58:17

    Quick-fire: speciation of products, misconceptions about automation, and the agent endgame

    In the closing rapid-fire, David reiterates that agents and chatbots will diverge into distinct products and challenges the narrative that AI simply replaces human work step-by-step. He paints a five-year vision where agents become a higher-level interface to computing—and flags walled-garden lock-in as a major failure mode.

    • Changed mind: agents vs chatbots will split into separate product categories
    • Misconception: AI will neatly automate and replace each human capability
    • Vision: agents as a “GUI→agents” leap, enabling higher-level goal-based interaction
    • Failure mode: incumbents’ walled gardens prevent cross-domain agents from working
    • Market size: agents address far more work than traditional RPA

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.