The Twenty Minute VCThe Twenty Minute VC

The Early Days of Anthropic & How 21 of 22 VCs Rejected It | The Four Bottlenecks in AI | Anj Midha

Harry Stebbings and Anjney Midha on anj Midha on AI bottlenecks, compute grids, security, and investing.

Anjney MidhaguestHarry Stebbingshost
Apr 14, 20261h 15mWatch on YouTube ↗
Are scaling laws dead? (domain saturation vs new frontiers)The four bottlenecks: context, compute, capital, cultureVertical integration for scientific data and feedback loopsSovereign data, CLOUD Act, and Europe’s AI infrastructureMistral investment thesis: independent full-stack sovereigntyAnthropic’s early fundraising rejections and Amazon partnershipGPU wastage bubble and non-fungible computeAMP Grid: pooling capacity like an electricity gridCompute standardization and misaligned incentivesChina’s systems co-design and adversarial distillationCoordinated inference defense (“Iron Dome”)Venture returning to incubation/co-founding model
AI-generated summary based on the episode transcript.

In this episode of The Twenty Minute VC, featuring Anjney Midha and Harry Stebbings, The Early Days of Anthropic & How 21 of 22 VCs Rejected It | The Four Bottlenecks in AI | Anj Midha explores anj Midha on AI bottlenecks, compute grids, security, and investing Midha argues scaling laws are not dead; returns look saturated only in heavily explored domains like coding, while underexplored domains like materials science still show outsized gains from more compute and better feedback loops.

At a glance

WHAT IT’S REALLY ABOUT

Anj Midha on AI bottlenecks, compute grids, security, and investing

  1. Midha argues scaling laws are not dead; returns look saturated only in heavily explored domains like coding, while underexplored domains like materials science still show outsized gains from more compute and better feedback loops.
  2. He frames AI progress around four core bottlenecks—context/feedback, compute, capital, and culture—claiming culture is the most underrated driver because it attracts the talent that produces algorithmic breakthroughs.
  3. He contends “AI for science” underperformed because the necessary physics/chemistry data is scarce online and locked in labs, requiring vertically integrated data-creation loops (e.g., robots + experiments) to generate proprietary training signals.
  4. Midha describes a “GPU wastage bubble,” not an AI bubble: massive stranded compute exists because compute is not fungible across chip types/generations and lacks standards, motivating his “AMP Grid” concept as a coordinating layer akin to an electricity grid.
  5. On geopolitics, he warns China is advancing via full-stack systems co-design plus adversarial distillation, and calls for a coordinated “Iron Dome” for frontier inference to detect and respond to distillation and insider threats across Western labs.

IDEAS WORTH REMEMBERING

5 ideas

Scaling still works; “diminishing returns” is domain-dependent.

Midha says coding evals look saturated because they’re heavily optimized, but domains like materials and superconductors remain wide open—so more compute plus tight experiment-to-training loops can produce rapid gains.

The biggest capability unlock is often a better context/feedback loop, not a new model architecture.

He treats algorithmic innovation as downstream of having the right team and culture; the real edge comes from deploying models in a domain, collecting high-quality feedback, and feeding it back into training (including real-world verification).

AI-for-science fails without new data creation, not better prompting.

He claims frontier science reasoning is bottlenecked by missing physics/chemistry datasets on the open internet, pushing teams toward lab integration (robots, synthesis, measurement) to generate proprietary ground truth.

Sovereign data constraints are creating openings against hyperscalers.

Using the CLOUD Act example, he argues some European defense/logistics/industrial workloads cannot run on US-managed clouds, driving demand for local compute + local models—central to his Mistral thesis.

We’re in an infrastructure utilization crisis, not a capabilities bubble.

Midha says “stranded” clusters exist because FLOPS aren’t interchangeable across GPU generations and configurations; without standards, the market overbuys in the wrong places while innovators still can’t access what they need.

WORDS WORTH SAVING

5 quotes

AI alignment, don't get me wrong, is hard, but not the hardest problem. Human alignment is really the problem right now.

Anjney Midha

There's no saturation in superconductor discovery at all.

Anjney Midha

We are definitely in a GPU wastage bubble, not an AI bubble.

Anjney Midha

Compute is not fungible today.

Anjney Midha

If we don't secure frontier model inference… behind a coordinated Iron Dome, I don't think we have a sustainable shot at staying at the frontier over the next decade.

Anjney Midha

QUESTIONS ANSWERED IN THIS EPISODE

5 questions

On the four bottlenecks: how would you rank context vs compute vs capital vs culture for (a) a frontier lab, (b) an applied enterprise AI company, and (c) an AI-for-science startup?

Midha argues scaling laws are not dead; returns look saturated only in heavily explored domains like coding, while underexplored domains like materials science still show outsized gains from more compute and better feedback loops.

What concrete technical standards would make compute meaningfully “fungible”—is it workload portability, memory/network baselines, compiler targets, scheduling APIs, or something else?

He frames AI progress around four core bottlenecks—context/feedback, compute, capital, and culture—claiming culture is the most underrated driver because it attracts the talent that produces algorithmic breakthroughs.

Your ‘AMP Grid’ analogy implies an independent system operator—what governance model prevents capture by hyperscalers or the largest labs?

He contends “AI for science” underperformed because the necessary physics/chemistry data is scarce online and locked in labs, requiring vertically integrated data-creation loops (e.g., robots + experiments) to generate proprietary training signals.

In the ‘AI for science sucked’ section, what specific benchmarks or tasks failed most, and what did you observe as the biggest missing primitives (units, error bars, lab protocols, simulators, etc.)?

Midha describes a “GPU wastage bubble,” not an AI bubble: massive stranded compute exists because compute is not fungible across chip types/generations and lacks standards, motivating his “AMP Grid” concept as a coordinating layer akin to an electricity grid.

For Europe’s sovereignty thesis: what’s the minimum viable ‘sovereign stack’ (land/power/shell, cloud layer, model layer, security) that actually satisfies regulators and defense customers?

On geopolitics, he warns China is advancing via full-stack systems co-design plus adversarial distillation, and calls for a coordinated “Iron Dome” for frontier inference to detect and respond to distillation and insider threats across Western labs.

Chapter Breakdown

Why scaling laws still work—diminishing returns depend on the domain

Anj argues scaling laws are not “dead,” but some benchmarks (like coding evals) appear saturated, making gains look expensive. In less-explored domains like materials discovery, he claims additional compute still yields outsized improvements when paired with tight experimentation loops.

The four bottlenecks holding AI back: context/feedback, compute, capital, and culture

Anj lays out a framework for what limits frontier progress today, emphasizing that algorithms are less of a bottleneck than the systems around them. He highlights culture as the meta-bottleneck that enables top talent and flexible research direction.

Why “AI for science” underperformed: missing data and weak real-world feedback loops

He describes benchmarking frontier models on physics/chemistry and finding them surprisingly weak relative to the hype. The core issue is lack of high-quality scientific data on the public internet and difficulty accessing lab/manufacturing datasets, motivating vertically integrated data generation.

Vertically integrated model companies and the “Claudification” question

Harry probes how to predict which vertical AI companies get commoditized by foundation models. Anj reframes moats: unique context/feedback access makes progress visible and can support superior economics, but it’s not automatically a permanent moat.

Sovereign data, the CLOUD Act, and why Europe wants local AI infrastructure

Anj explains how legal and geopolitical constraints make certain workloads impossible to run on US-managed clouds. He uses the CLOUD Act to motivate “sovereign” infrastructure and local providers that can serve sensitive enterprise and government workloads.

The investment thesis behind Mistral: full-stack European independence

He describes Mistral as a bet on European sovereignty across the AI stack—power, facilities, compute, and locally trained models—alongside open deployment. The goal is independence at scale rather than relying on US hyperscalers and labs.

The brutal early days of Anthropic: 21 of 22 VCs said no

Anj recounts helping founders translate a scaling-law research hypothesis into a business plan, then facing broad investor skepticism. He describes how many VCs didn’t understand GPT-3 or compute-driven scaling economics, while strategics like Amazon immediately saw the alignment.

Public Benefit Corporations (PBCs): resolving mission vs profit tensions

Anj defends PBC governance as a mechanism to make long-term decisions that aren’t maximally profit-seeking in the short run. He positions AMP as mission-aligned infrastructure + standards advocacy, including offering compute at cost to support frontier progress.

The AMP Grid: building an electricity-grid analog for compute

Anj describes AMP as an “independent system operator” coordinating compute capacity rather than owning datacenters like a cloud provider. The thesis is that pooling and dispatching capacity improves utilization, reduces overprovisioning, and accelerates frontier output—similar to early electricity markets.

Back-to-the-future venture: co-founding and incubation like the early Valley

He argues frontier companies require deep operational partnership, not just check-writing. Drawing on Intel/Genentech/Apple examples, he suggests value accrues to investors who help build companies hands-on—especially where CapEx, infrastructure, and scientific execution dominate.

GPU wastage bubble, not an AI bubble: stranded compute and poor utilization

Anj claims the core problem is infrastructure inefficiency rather than capability hype. Large pockets of compute sit idle due to fragmentation, mismatched needs, and lack of coordination—creating the appearance of a bubble even as capability demand remains real.

Why compute isn’t fungible: chip heterogeneity and missing standards

He explains that unlike electricity, compute capacity can’t be easily swapped across chip types or clusters, even within Nvidia generations. This prevents workloads from moving efficiently, creates stranded assets, and amplifies boom/bust dynamics typical of pre-standardization infrastructure eras.

China’s systems co-design advantage and distillation as a catch-up engine

Anj argues China is competing via full-stack optimization rather than leading-edge chips alone—co-designing chips, infrastructure, and training to boost efficiency. He highlights adversarial distillation from Western endpoints as a mechanism to accelerate iteration and close the gap.

Coordinating defense: an “Iron Dome” for inference and frontier security

He warns that distillation and insider threats exploit fragmented defenses across Western labs. His proposal is a shared proxy/coordination layer across inference providers so attacks seen by one actor can trigger rapid collective response.

Perfect competition is for losers: aiming for “optimal competition” in AI markets

Updating Thiel’s “competition is for losers,” Anj argues the real enemy is perfect competition that commoditizes everyone and wastes scarce resources. He advocates an “optimal competition” structure with a small number of strong players per layer to maintain innovation without monopoly stagnation.

Quick-fire: LP advice, building to learn, legacy, and personal reflections

In the closing segment, Anj emphasizes that LPs and investors must do the work—read, build, and understand bottlenecks—rather than outsource judgment. The conversation shifts to leadership qualities, health and time, independence as motivation, and what he wants to be remembered for.

EVERY SPOKEN WORD

Install uListen for AI-powered chat & search across the full episode — Get Full Transcript

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome