Aravind Srinivas:Will Foundation Models Commoditise & Diminishing Returns in Model Performance|E1161

Aravind Srinivas is the Co-Founder and CEO @ Perplexity where he is on a mission to build the world’s most knowledge-centric company. Recent reports have suggested the company is raising $250M at a $3BN valuation and the company’s cap table currently includes all stars such as Jeff Bezos, Elad Gil, Nat Friedman, Tobi Lutke, Yann LeCun, Naval Ravikant, Paul Buchheit and Andrej Karpathy. Prior to founding Perplexity, Aravind was a Research Scientist @ OpenAI and before that a research intern at both Google and Deepmind. ----------------------------------------------- Timestamps: (00:00) Intro (00:46) AI Passion Journey (05:35) Addressing Diminishing Returns in AI Model Performance (08:16) The Future of AI: Specialized Models & Data Curation (11:28) Advancing AI Reasoning Quality (18:21) The Challenge of AI Memory (20:37) The Future of Foundation Models in AI (27:39) AI Models & Cloud Provider Acquisitions (31:31) Navigating Capital Competition in the AI Industry (40:30) Timing the Expansion into Enterprise Division (47:47) Fundraising Process (51:03) Predicting Perplexity's Dominant Monetization Engine (54:35) Quick-Fire ----------------------------------------------- In Today’s Episode with Aravind Srinivas: 1. Are We Reaching a Stage of Diminishing Returns with Models: Have we reached a stage where more compute will not result in a proportional improvement in model performance? What are the most exciting new modalities we will see breakthroughs in? Is voice the radical new modality that everyone thinks it is and OpenAI demoed? What will it take and when will we have true reasoning in models? 2. Are Foundation Models Commoditising: What is the end state for the foundation model layer? Will we see the specialisation of models? Will different models be used for different things? Is there room for a new foundation model to be born? Is it VC backable? Why does Aravind believe that two players will win this layer? What happens to the rest? What is needed to win in this layer? 3. How to Survive in a World of Incumbents: Funding the Machine: How can any of the current players compete in a world where Microsoft has $300M in free cash flow per day? How much money does one need to build a foundation model today? Are the barriers lowering? Why does Aravind argue that talent is not simply a game of cash? 4. From Burning Money to Printing It: What does Aravind believe are the four core monetisation methods for Perplexity? Why does Aravind think that advertising will be their largest? Why does Aravind think that consumer subscription is not a very good business for them? Is Aravind concerned about having to build an enterprise go-to-market? What will it take to have a super successful API printing money machine business? ----------------------------------------------- Subscribe on Spotify: https://open.spotify.com/show/3j2KMcZTtgTNBKwtZBMHvl?si=85bc9196860e4466 Subscribe on Apple Podcasts: https://podcasts.apple.com/us/podcast/the-twenty-minute-vc-20vc-venture-capital-startup/id958230465 Follow Harry Stebbings on Twitter: https://twitter.com/HarryStebbings Follow Aravind Srinivas on Twitter: https://twitter.com/AravSrinivas Follow 20VC on Instagram: https://www.instagram.com/20vchq Follow 20VC on TikTok: https://www.tiktok.com/@20vc_tok Visit our Website: https://www.20vc.com Subscribe to our Newsletter: https://www.thetwentyminutevc.com/contact ----------------------------------------------- #20vc #harrystebbings #aravindsrinivas #preplexity #ai #founder #ceo #venturecapital #startup #openai #chatgpt #google #whatsapp #deepmind

Aravind SrinivasguestHarry Stebbingshost

Jun 5, 20241h 3mWatch on YouTube ↗

CHAPTERS

0:00 – 0:43
From output-only LLMs to iterative, feedback-driven reasoning
Aravind opens with a vision shift: models will move from one-shot answers to iterative reasoning loops that test, get feedback, and refine. He argues that as foundation models commoditize, application-layer companies capturing users and UX will benefit most.
- •Next-gen models will iterate: propose, reason, get feedback, revise
- •This shift marks a “real reasoning era” rather than pure next-token prediction
- •Commoditization at the model layer pushes value to product/application layers
- •Owning user relationships becomes the enduring advantage
0:43 – 3:30
Accidental start: contests, scikit-learn, and early confidence in ML
Aravind describes stumbling into machine learning via a contest he joined for prize money, despite not knowing what ML was. Brute-force experimentation led him to win, giving him conviction that ML could be his craft.
- •Entered ML through a prediction contest with no prior ML knowledge
- •Used scikit-learn and iterative trial-and-error to find a winning approach
- •Winning against experienced participants built confidence
- •Heuristic for strengths: what’s easy for you but hard for others
3:30 – 5:35
Reinforcement learning roots and DeepMind-inspired projects
He traces deeper immersion into AI through reinforcement learning, guided by a professor connected to Sutton/Barto lineage. A project extending Atari learning work (transfer across games) cemented his interest and research habits.
- •RL framework made “AI as agent + environment + rewards” click
- •Mentorship and exposure to DeepMind work accelerated learning
- •Early project: transfer learning across multiple Atari games
- •Hands-on implementation and GPU hacking culture shaped his approach
5:35 – 8:16
Diminishing returns: scaling still works—if data curation is excellent
Asked whether model improvements are slowing, Aravind argues the answer is nuanced: brute-force scale can still help, but only for teams that get many details right. The differentiator is less “more compute” and more disciplined data curation and training choices.
- •Scaling alone doesn’t guarantee a better model anymore
- •Data quality/mix (languages, code, math, reasoning traces) is crucial
- •Chinchilla-style rules are guidelines, not guarantees
- •MoE and efficiency techniques matter; only a few labs execute well
8:16 – 11:28
Why “verticalized/specialized foundation models” often disappoint
Aravind pushes back on the idea that many domain-specific foundation models will win, citing BloombergGPT’s underperformance versus frontier general models. He argues the core ‘magic’ is emergent general capability from broad data, and many domains don’t have enough tokens to justify a dedicated foundation model.
- •Domain-specific FM example: BloombergGPT beaten by GPT-4 on finance tasks
- •LLMs’ power comes from general-purpose emergent capabilities
- •Many domains lack sufficient token volume; code is a notable exception
- •We still don’t fully understand why reasoning emerges or how to best train it
11:28 – 15:25
How good is LLM reasoning today—and what would real breakthroughs change?
He frames reasoning as a spectrum: current models may beat many high schoolers but are far from elite human reasoners. If models reached “advisor to Demis Hassabis” levels, pricing and business models would shift from cheap subscriptions to high-value, pay-per-outcome economics.
- •Reasoning benchmarked against humans: decent but not IMO/IOI level
- •True step-change would be superlative, high-stakes reasoning ability
- •Such capability would ‘break’ $20/month pricing assumptions
- •Value would be tied to massive ROI per session/output for users
15:25 – 18:21
Bootstrapped reasoning and self-improvement loops (STAR/Q*)—and why capital matters
Aravind explains the likely path to better reasoning: models that generate explanations, evaluate correctness, and retrain on improved rationales—iterating toward better outputs. He notes the constraint is cost: running these loops requires heavy inference compute, favoring well-capitalized companies and potentially concentrating power.
- •Self-training on rationales/explanations can improve reasoning quality
- •Future models will iterate: answer → reason → check → revise until convergence
- •Inference-heavy research makes this hard for academia to compete
- •A first-mover algorithmic breakthrough could create winner-take-most dynamics
18:21 – 20:37
The memory problem: long context vs “infinite life memory”
He distinguishes practical long-context memory (larger windows and retrieval) from the harder problem of truly persistent, unlimited personal memory. Today’s hurdles include instruction-following degradation and confusion/hallucination with too much context; infinite context is not solved algorithmically.
- •Two meanings of memory: practical long context vs infinite personal memory
- •Context windows are expanding rapidly (hundreds of K to millions of tokens)
- •Key challenge: maintain instruction-following quality with huge context
- •Infinite memory/persistent life-long recall remains unsolved
20:37 – 25:45
Commoditization of foundation models: tiers, frontier scarcity, and who wins
Aravind argues “GPT-3.75 level” models are already commoditized, while GPT-4-class models still have limited true alternatives. He emphasizes that for most products, the model isn’t the product; Perplexity focuses on post-training and orchestration rather than competing in base-model training, which he calls an ROI trap.
- •Mid-tier models are commoditized; frontier models still scarce
- •Whether GPT-5-like leaps happen will determine future commoditization
- •Training base models is a costly treadmill with uncertain business ROI
- •Perplexity positions as post-training/orchestration + UX, not base pretraining
25:45 – 27:39
Will frontier model labs consolidate to one winner? OpenAI vs Anthropic framing
Harry presses on whether the frontier ends with one dominant lab. Aravind says it hinges on who cracks bootstrapped reasoning first and then aggressively scales it; he names OpenAI and Anthropic as the most likely, contrasting OpenAI’s capital/speed with Anthropic’s algorithmic efficiency.
- •Winner-take-most depends on cracking and scaling bootstrapped reasoning
- •OpenAI advantages: capital, speed, execution momentum
- •Anthropic advantages: algorithmic/post-training strength with less capital
- •xAI has talent and funding but is behind on timeline
27:39 – 34:36
Why big cloud providers may not acquire OpenAI/Anthropic: tacit knowledge as the asset
He disagrees that clouds will simply acquire top labs, arguing the real value is the ‘machine that builds the machine’: the cohesive team, tacit know-how, and ability to produce the next breakthrough. With less publishing and more retention, that knowledge is harder to replicate or purchase piecemeal.
- •Primary asset is the team + tacit training/algorithmic knowledge, not one model snapshot
- •Top labs retain leverage because clouds depend on their ongoing output
- •Less academic publishing concentrates know-how inside a few companies
- •Acquisition becomes plausible only if breakthroughs stall and leverage evaporates
34:36 – 40:30
Perplexity’s business strategy: why subscriptions alone aren’t enough; ads as the long-term engine
Aravind explains why a $20/month subscription business may not deliver sufficient margins without massive scale, and why ads are the historically strongest margin engine. He outlines a principle: use ads without corrupting answer quality, diversify revenue, and avoid Google’s long-run misalignment between users and shareholders.
- •Subscriptions can work at huge scale, but margins and retention are challenging
- •Ads (done well) can be extremely profitable and relevant to users
- •Key constraint: keep answers/citations independent from ad influence
- •Diversified monetization (subs, ads, APIs, enterprise) to maintain alignment
40:30 – 47:48
Enterprise expansion: security/compliance first, then rethinking internal search and knowledge workflows
He describes the trigger for building Enterprise Pro: enterprises will use AI search but fear data leakage, so they need governance and compliance. Longer-term, he wants a unified platform that blends internal/external data, multiple models, ranking, and knowledge-base building—not just connectors to tools.
- •Enterprise adoption requires governance, security, and compliance features
- •AI-native search changes perceived risk vs traditional search
- •Vision: unified UI for internal+external data and multiple model backends
- •Opportunity to rethink enterprise ranking/search and build durable workflow value
47:48 – 54:34
Fundraising realities, compute spend, and the “application layer wins” thesis
Aravind says fundraising is far from effortless despite AI hype; investors probe dependency risk on model providers and defensibility. He notes most spend goes to compute (serving/post-training/API usage), and argues commoditized model costs help application companies reinvest in features and user growth—like an Amazon-style flywheel.
- •Fundraising is difficult; common pushback centers on platform dependency and defensibility
- •Perplexity spend is compute-heavy, but avoids base-model training commitments
- •Big clusters require multi-year commitments; avoiding that preserves flexibility
- •Model commoditization lowers input costs and benefits user-facing product companies
54:34 – 1:03:12
Quick-fire: AI misconceptions, product intent, agents/browsers/OS, and Perplexity’s 10-year aim
In the closing quick-fire, Aravind argues AI is under-hyped when embedded into familiar workflows, not just chat. He critiques mismatched integrations (e.g., search inside WhatsApp), shares an agent-driven vision for browsers and AI-native OS, and ends with Perplexity’s long-term goal: being the indispensable assistant for accurate facts and knowledge.
- •Big misconception: short-term takes; AI impact is underappreciated in real workflows
- •Product lesson: features must match existing user intent in an app
- •Future: agentic browsers that execute tasks; AI-native OS concepts (e.g., ‘Her’ style)
- •2034 goal: trusted, cannot-live-without assistant for facts/knowledge

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

iOS

Android

Claude

Chrome

From output-only LLMs to iterative, feedback-driven reasoning

Accidental start: contests, scikit-learn, and early confidence in ML

Reinforcement learning roots and DeepMind-inspired projects

Diminishing returns: scaling still works—if data curation is excellent

Why “verticalized/specialized foundation models” often disappoint

How good is LLM reasoning today—and what would real breakthroughs change?

Bootstrapped reasoning and self-improvement loops (STAR/Q*)—and why capital matters

The memory problem: long context vs “infinite life memory”

Commoditization of foundation models: tiers, frontier scarcity, and who wins

Will frontier model labs consolidate to one winner? OpenAI vs Anthropic framing

Why big cloud providers may not acquire OpenAI/Anthropic: tacit knowledge as the asset

Perplexity’s business strategy: why subscriptions alone aren’t enough; ads as the long-term engine

Enterprise expansion: security/compliance first, then rethinking internal search and knowledge workflows

Fundraising realities, compute spend, and the “application layer wins” thesis

Quick-fire: AI misconceptions, product intent, agents/browsers/OS, and Perplexity’s 10-year aim

Get more out of YouTube videos.