Skip to content
No PriorsNo Priors

No Priors Ep. 68 | With Zapier Co-Founder and Head of AI Mike Knoop

The first step in achieving AGI is nailing down a concise definition and Mike Knoop, the co-founder and Head of AI at Zapier, believes François Chollet got it right when he defined general intelligence as a system that can efficiently acquire new skills. This week on No Priors, Mike joins Elad to discuss Arc Prize which is a multi-million dollar non-profit public challenge that is looking for someone to beat the Abstraction and Reasoning Corpus (ARC) evaluation. In this episode, they also get into why Mike thinks LLMs will not get us to AGI, how Zapier is incorporating AI into their products and the power of agents, and why it’s dangerous to regulate AGI before discovering its full potential. Sign up for new podcasts every week. Email feedback to show@no-priors.com Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @mikeknoop Show Notes: 0:00 Introduction 1:10 Redefining AGI 2:16 Introducing ARC Prize 3:08 Definition of AGI 5:14 LLMs and AGI 8:20 Promising techniques to developing AGI 11:0 Sentience and intelligence 13:51 Prize model vs investing 16:28 Zapier AI innovations 19:08 Economic value of agents 21:48 Open source to achieve AGI 24:20 Regulating AI and AGI

Elad GilhostMike Knoopguest
Jun 11, 202426mWatch on YouTube ↗

CHAPTERS

  1. 0:00 – 2:00

    Why Mike Knoop is launching ARC Prize to accelerate AGI progress

    Elad introduces Mike Knoop (Zapier co-founder, Head of AI) and tees up the ARC Prize initiative. Mike explains his long-running interest in AGI, why he stepped back from exec duties to focus on AI, and why he believes AGI progress has stalled.

    • Mike’s background at Zapier and focus on application-layer AI
    • Motivation: measuring “how close are we to AGI?” after chain-of-thought era
    • Claim: perceived AGI progress is overstated; real progress has stalled
    • Concern: frontier labs publish fewer technical details, slowing community learning
  2. 2:00 – 3:08

    ARC Prize and the ARC-AGI evaluation: what it is and why it matters

    Mike introduces the ARC Prize as a nonprofit, public challenge to beat François Chollet’s ARC-AGI evaluation and open-source the solution. He describes ARC as a rare evaluation that resists brute-force scaling and notes slow progress over multiple years.

    • ARC Prize: $1M+ challenge, open-source requirement
    • ARC-AGI eval described as a strong, principled AGI test
    • Historical results: ~20% to ~34% over four years—limited gains
    • ARC puzzles resemble compact “IQ-test-like” tasks that resist LLM scaling
  3. 3:08 – 5:12

    Competing AGI definitions: economic usefulness vs. fast skill acquisition

    Elad asks what the prevailing AGI definition is and what’s wrong with it. Mike contrasts the popular “economically useful work” definition with Chollet’s definition: general intelligence as the ability to efficiently learn new skills from limited experience.

    • Industry consensus: AGI = system that can do most economically useful human work
    • Mike: that definition tracks usefulness, not general intelligence
    • Chollet’s definition: efficiently acquire new skills and solve novel problems
    • Examples: narrow AIs can excel at single domains but can’t transfer skills
  4. 5:12 – 7:40

    Why LLMs generalize yet still fall short of ‘true’ general intelligence

    Elad probes why broad, general-purpose LLMs don’t count as sufficient AGI progress. Mike frames LLMs as powerful pattern/memorization systems that struggle with genuinely novel discovery where the answer and reasoning chains aren’t present in training data.

    • LLMs: high-dimensional pattern learners; “memorization” is not full general intelligence
    • Discovery tasks (new physics, therapeutics) require inventing beyond training distributions
    • Agents work when reasoning chains are short/common enough to be abstracted from data
    • Empirical claim: LLM-based attempts have not succeeded on ARC-AGI
  5. 7:40 – 8:32

    Beyond next-token prediction: what ‘more’ is needed for AGI

    Mike argues that simply scaling next-token-prediction language models won’t reach AGI. He suggests transformers may still contribute—especially as a perception stack—but that additional mechanisms/architectures are required.

    • Scaling alone (more params/data) won’t solve AGI per Mike’s view
    • Transformers’ key contribution: robust multimodal perception representations
    • Next-token prediction as a sole objective is insufficient
    • Need additional components beyond standard LM training regimes
  6. 8:32 – 10:50

    Promising research directions: program synthesis and architecture search

    Elad asks what ideas are missing and what should be explored. Mike highlights program synthesis as a promising approach on ARC, and revisiting neural architecture search now that compute is cheaper and more available.

    • Program synthesis: search through program space to map inputs to outputs
    • Orthogonal to standard LM approaches; has driven some ARC gains
    • Neural architecture search: computers discovering architectures rather than humans
    • Revisiting NAS with modern compute and fewer human priors (‘bitter lesson’)
  7. 10:50 – 13:48

    AGI vs sentience: focusing on capabilities and an empirical lens

    Elad asks whether Mike’s AGI concept includes sentience or consciousness. Mike avoids philosophical claims and emphasizes capability-based goals: systems that can learn new skills and help humans solve open-ended problems, evaluated empirically.

    • Distinction between intelligence and sentience raised (risk implications)
    • Mike’s focus: tools that invent/discover alongside humans
    • Avoids claims about consciousness; prioritizes observable capabilities
    • Stresses empiricism over speculation about future systems
  8. 13:48 – 16:22

    Why a prize model (not startups) and why outsiders may win

    Elad asks why ARC Prize is structured as a prize rather than investment. Mike argues outsiders are critical, the problem is compact enough to be tackled by small teams, and prizes can redirect talent away from purely commercial LM opportunities.

    • Many ARC competitors are small, globally distributed teams
    • Bet: breakthrough likely from someone not steeped in scale/LM orthodoxy
    • ARC’s compact tasks make solutions feasible without massive models
    • Prize creates status + financial incentive to pursue noncommercial research
  9. 16:22 – 17:46

    Zapier’s AI journey: early internal experimentation and tool use

    Elad pivots to Zapier’s AI product journey and how they chose what to build. Mike describes a six-month period of intensive prototyping (chain/tree-of-thought, internal ChatGPT-like systems) leading to a key insight: models needed tools, which matched Zapier’s integration layer.

    • In 2022, Mike and Zapier’s CTO shifted to hands-on IC work to explore capabilities
    • Built internal prototypes including chain/tree-of-thought and a ChatGPT-like experience
    • Identified “frozen in time” limitation pre-tool-use era
    • Hooked LMs to Zapier’s 6,000+ integrations; became a ChatGPT plugin launch partner
  10. 17:46 – 19:08

    Adoption signals and new products: 50M AI tasks and Zapier Central bots

    Elad asks about usage metrics and adoption. Mike shares scale (50M+ AI tasks) and explains how users mainly embed AI steps inside workflows; he also introduces Zapier Central, where bots are programmed in natural language via inference rather than rigid mapping.

    • 50M+ AI tasks executed on Zapier over ~18 months
    • Dominant use cases: content generation, extraction, summarization within workflows
    • Zapier Central: AI bots that reduce manual configuration and are easier to use
    • Natural-language programming via inference (not just NL-to-structured translation)
  11. 19:08 – 21:46

    Economic value of agents: reliability, risk, and ‘clamping’ behavior

    Elad asks how close we are to an agentic future. Mike argues it’s already here in paid usage, and frames progress as expanding “concentric rings” of higher-stakes use cases as reliability improves—enabled by product controls that constrain agent behavior.

    • Agentic automation is already monetized in practice
    • Adoption starts with low-risk personal/team workflows
    • Same bot can be acceptable for startups but too risky for large enterprises
    • Key product challenge: adding clamps/constraints to raise reliability and reduce downside
  12. 21:46 – 23:38

    Open source as the engine for breakthroughs—and why publishing is closing down

    Elad asks about open source in AI amid regulatory shifts. Mike argues open research is essential because AGI needs fundamental breakthroughs, and closed frontier research slows idea generation; ARC Prize aims to counterbalance this by requiring open sourcing.

    • AGI progress needs new ideas; open source accelerates idea diffusion
    • Open sharing helps would-be researchers participate without gatekeeping by big labs
    • Commercialization has reduced frontier publication (tight-to-chest behavior)
    • ARC Prize: open-source solutions and open progress to rebuild shared learning
  13. 23:38 – 26:00

    Regulating AI and AGI: use existing frameworks, avoid speculative prohibitions

    Elad draws parallels to internet protocols and 1990s cryptography regulation. Mike argues today’s AI harms can be handled by existing regulatory agencies, and that prescriptive AGI legislation based on theoretical predictions is dangerous; policy should follow empirical evidence of capabilities.

    • Analogy to crypto regulation: fears vs eventual net-positive outcomes
    • For current narrow AI, existing regulatory frameworks may suffice
    • Against prescriptive AGI laws before evidence of actual capabilities
    • Values personal freedom; prefers capability-informed decisions on release/control

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.