Skip to content
a16za16z

The Future of Software Development - Vibe Coding, Prompt Engineering & AI Assistants

Is AI the Fourth Pillar of Infrastructure? Infrastructure doesn’t go away — it layers. And today, AI is emerging as a new foundational layer alongside compute, storage, and networking. Erik Torenberg interviews a16z’s Martin Casado, Jennifer Li, and Matt Bornstein breaking down how infrastructure is evolving in the age of AI — from models and agents to developer tools and shifting user behavior. We dive into what infra actually means today, how it differs from enterprise, and why software itself is being disrupted. Plus, we explore the rise of technical users as buyers, what makes infra companies defensible, and how past waves — from the cloud to COVID to AI — are reshaping how we build and invest. Timestamps: 00:00 Introduction to Infrastructure 00:48 Defining Infrastructure and Its Components 02:27 The Fourth Layer: AI Models 06:34 The Evolution of Infrastructure 17:46 Developer Tools and the AI Wave 21:27 Data Engine Systems 22:11 Defensibility in AI Infrastructure 25:28 Expansion and Contraction Phases 27:09 Challenges in AI and Infrastructure 28:32 AI Models and Generalization 30:59 Thoughts on Andrej Karpathy's Talk on AI 34:09 AI and Human Expectations 36:18 The Role of Developers in the AI Era 40:31 Synthetic Data 43:17 AI Agents 45:16 Vertical Integration vs. Horizontal Specialization Resources: Find Martin on X: https://x.com/martin_casado Find Jennifer on X: https://x.com/JenniferHli Find Matt on X: https://x.com/BornsteinMatt Stay Updated: Let us know what you think: https://ratethispodcast.com/a16z Find a16z on Twitter: https://twitter.com/a16z Find a16z on LinkedIn: https://www.linkedin.com/company/a16z Subscribe on your favorite podcast app: https://a16z.simplecast.com/ Follow our host: https://x.com/eriktorenberg Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.

Jennifer LiguestMartin CasadoguestMatt BornsteinguestErik Torenberghost
Jul 21, 202547mWatch on YouTube ↗

CHAPTERS

  1. 0:00 – 0:48

    Why infrastructure matters: the stack keeps getting layered

    The conversation opens with the idea that infrastructure doesn’t disappear—it accumulates in layers as new primitives arrive. The panel frames the current moment as unusually important because AI is changing how software itself is built and understood.

    • Infrastructure persists; new capabilities layer on top of old ones
    • AI is disrupting software ("software eating itself")
    • A new infra layer changes how people program and what the stack looks like
    • Developer attention and distribution become central in the AI era
  2. 0:48 – 2:27

    What counts as “infrastructure”: technical buyers, not vertical end-users

    They define infrastructure as “the stuff you use to build the stuff,” distinguished primarily by who buys and uses it. Infra is sold to technical users (developers, admins, data scientists), unlike vertical SaaS sold to industry-specific operators.

    • Infra is categorized by technical buyer/user (developer, IT, admin, data scientist)
    • Core components: compute, networking, storage—plus the tooling around development/operations
    • Enterprise apps can be vertical; infra is typically horizontal and cross-industry
    • Go-to-market and diligence differ sharply between technical-buyer infra and vertical SaaS
  3. 2:27 – 6:34

    AI models as the “fourth pillar” of infrastructure

    They explore whether models belong alongside compute/networking/storage as a foundational infrastructure layer. Models both depend on the classic pillars and impose new requirements (chips, data centers, latency), while also acting as an intelligence primitive used by many applications.

    • Models rely on and reshape compute, storage, networking requirements
    • Models may be analogous to databases/compute, but are fundamentally new
    • Latency, data production, and training/inference workloads drive new infra needs
    • Models become a pervasive, reusable capability across the software ecosystem
  4. 6:34 – 17:46

    How AI changes the programming model: “abdicating logic” to systems

    Martin argues the key novelty is that software is delegating parts of application logic to models, not just resources. This forces a rethink of what “programming” means when outputs are probabilistic and models sometimes “don’t listen.”

    • Past abstraction: resources (compute/storage); new abstraction: logic/answers
    • Non-determinism and reliability concerns reshape engineering practices
    • Analogies (database/network) help, but AI requires a blank-sheet approach
    • This shift is foundational to computer science and software design
  5. 17:46 – 21:27

    Supercycle dynamics: TAM expansion, new behaviors, and startup white space

    They compare AI to prior platform shifts (internet, microchip): lowering marginal costs expands markets and creates new user behaviors incumbents struggle to serve. Those new behaviors open room for challengers and new categories of companies.

    • Lower marginal costs expand TAM and attract new users
    • New behaviors create white space for startups versus incumbents’ old motions
    • AI’s scale and novelty mirror the internet-era behavioral shifts
    • Tools enable rapid prototyping and creation from “good ideas” regardless of background
  6. 21:27 – 22:11

    From low-code to natural language: developer tools and the post-COVID acceleration

    Jennifer reframes the low-code promise as arriving via natural language and AI assistants. They also highlight COVID as an accelerant for bottom-up adoption and product-led dev tooling—setting the stage for today’s AI dev tools wave.

    • Low-code’s next form is natural language as a programming interface
    • AI tools act as “thought partners” for prototyping and building quickly
    • COVID accelerated PLG/bottom-up adoption of developer tools
    • Developer tool ecosystems flourished as more people tried tools at home
  7. 22:11 – 25:28

    Infra’s evolution at a16z: pre-cloud → cloud → AI, plus the COVID blip

    They trace major infra inflection points since a16z’s early days: pre-cloud on-prem software, the cloud transition (new deployment and business metrics), and the current AI transformation. COVID was a distinct, force-majeure shift that changed sales and adoption dynamics.

    • Pre-cloud: on-prem deployments, perpetual licenses, different economics
    • Cloud shift: recurring revenue, NDR/retention, gross margins, new operations
    • AI is described as the most dramatic shift in decades
    • COVID changed enterprise selling and accelerated bottom-up tool adoption
  8. 25:28 – 27:09

    Today’s infra map: dev tools, core infra, and modern data systems

    The panel outlines key infra categories they track and invest in: developer tools, compute/network/storage, and data systems. They note the data landscape’s two branches—backend data engineering and analyst-oriented platforms—and why these remain strategically important even amid AI hype.

    • Developer tools: from GitHub to newer AI-native tools like Cursor (mentioned)
    • Core infra remains foundational: compute, networking, storage
    • Data systems split: backend big-data engines vs analyst/tabular ecosystems
    • Examples referenced: Databricks, Fivetran, dbt, Hex, Tabular (acquired)
  9. 27:09 – 28:32

    AI companies blur infra vs apps: why early cycles are hard to classify

    In early supercycles, the “new tech becomes the app,” making it difficult to separate infrastructure from applications. They use examples like OpenAI and ElevenLabs to show how model providers often ship both an API platform and an end-user product.

    • Early cycles: infra/app boundaries blur as the market and TAM form
    • Netscape as historical precedent: core tech + product bundled early on
    • OpenAI and ElevenLabs cited as both infra (platform/API) and app (consumer product)
    • “Wrappers” framing fades; building good AI products is now genuinely hard
  10. 28:32 – 30:59

    Defensibility in AI infra: beyond “no moat” and “commoditization”

    They revisit earlier skepticism that no layer had defensibility and contrast it with today’s reality: companies across the stack are succeeding simultaneously. The panel argues infra defensibility often comes from hard-to-replicate expertise, integration switching costs, and how stacks consolidate over time.

    • Early view: little defensibility across chips/cloud/models/apps; reality is nuanced
    • Infra moats: deep domain expertise, long build cycles (e.g., databases like DuckDB)
    • Switching costs are high due to embedded integration and logic, even with APIs
    • “Commoditization” is often sloppy; layers tend to retain value and margin
  11. 30:59 – 34:09

    Expansion vs contraction: why zero-sum thinking fails during the boom

    Martin describes infra markets as expanding and later contracting into oligopolies/monopolies, without eliminating value in each layer. In expansion phases, aggressive building and investing are rewarded; consolidation later tends to preserve margins rather than destroy them.

    • Expansion phase: market grows; multiple layers can win simultaneously
    • Contraction phase: consolidation leads to oligopoly/monopoly dynamics
    • Margins often persist (e.g., clouds) via scale and tacit price discipline
    • True layer displacement usually requires difficult vertical moves down the stack
  12. 34:09 – 36:18

    Generalization, RL trade-offs, and composing multi-model systems

    They debate whether improving frontier models automatically improves every downstream business. Martin questions how well RL-tuned models generalize, while Jennifer argues real-world systems will compose multiple specialized and general models in pipelines rather than rely on a single model for everything.

    • OpenAI’s “model improvements help you” advice depends on generalization behavior
    • RL fine-tuning may introduce trade-offs across capabilities (code vs other tasks)
    • Complex applications often require chaining multiple model calls and steps
    • Future likely includes both frontier general models and task-specialized models
  13. 36:18 – 40:31

    From prompt engineering to context engineering: the new infra opportunity

    Reacting to Karpathy-adjacent framing, they argue the core challenge is delivering the right context to models—often requiring classic CS tools (indexes, prioritization) alongside models. This becomes a major new infra frontier: data pipelines, observability, guarantees, and tooling to systematize AI development.

    • “Context engineering” replaces simplistic “prompt engineering” framing
    • Better performance requires curating inputs via indexing, retrieval, prioritization
    • Infrastructure needs: data pipelines, discovery, observability, and guarantees
    • They expect new formalisms and tools for AI software construction within ~5 years
  14. 40:31 – 43:17

    Humans, expectations, and the developer role: more software, more developers

    They caution against anthropomorphizing AI (utopia vs doom) and argue professionals remain essential for specification and formalism. Rather than shrinking engineering, they expect more developers and more software—because creation, product decisions, and requirements discovery remain the hard part.

    • AI triggers human projection; they advocate a pragmatic middle path
    • Formal specification still matters; professional disciplines evolve formal systems
    • Likely outcome: more developers empowered by productivity tools, not fewer
    • Value lies in product decisions/workflows/requirements, not just writing code
  15. 43:17 – 45:16

    Synthetic data and agents: what’s real today vs decade-long horizons

    They surface ongoing debates: whether synthetic data can meaningfully improve models without new information, and what agents are truly good for now. Coding agents are working well because code offers tight feedback/error correction; general-purpose web agents lag as errors compound in loops.

    • Synthetic data debate: limited gains without new information; skepticism on “self-improving” loops
    • Agent definition: LLMs in a loop; errors can compound without correction
    • Coding agents succeed due to compile/lint/test feedback and structured environments
    • They see agents as a long arc—near-term value is task-bounded, well-specified work
  16. 45:16 – 47:29

    Vertical integration vs horizontal specialization: both will coexist in AI

    They argue history shows both vertical and horizontal strategies can win, and AI already exhibits both. OpenAI is positioned as more vertically integrated via ChatGPT, while others emphasize horizontal API distribution; business strategy hinges on where value capture is clearest for a given market and persona.

    • Historical patterns: Apple-style vertical integration vs Microsoft/Intel horizontal specialization
    • AI already shows both: vertically integrated products and horizontal model/API platforms
    • Open models can function as “horizontal” layers because users can’t recreate training
    • Vertical focus requires sharp persona/use-case understanding to capture value

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.