YC Root AccessThis Startup Is Trying To Solve The AI Memory Problem
CHAPTERS
Mem0’s mission: a memory layer for stateless LLM agents
The founders explain Mem0 as an infrastructure layer that gives AI agents long-term memory, addressing the core limitation that LLMs are inherently stateless. Without memory, agents effectively “start from scratch” each interaction, limiting personalization and improvement over time.
- •Mem0 is building a dedicated memory layer for AI agents and AI apps
- •LLMs are stateless, so agents don’t naturally retain user preferences or prior context
- •Goal: make any agent/app able to remember like a human would
- •Memory is positioned as foundational infrastructure for the agent ecosystem
Open-source traction and ecosystem distribution
Mem0 describes rapid adoption through open source and integrations with major agentic frameworks. They share key growth metrics and position themselves as a widely used default choice for memory.
- •14M+ Python package downloads and ~41k GitHub stars (as stated)
- •Used by thousands of companies
- •Acts as the memory provider for multiple agentic frameworks (e.g., AWS Agents SDK/Strands, CrewAI, Flowise)
- •Open-source approach accelerates adoption and standardization
Why memory improves agents: personalization that compounds over time
They illustrate how memory turns agents into systems that learn a user’s preferences and behave more consistently across sessions. This enables applications to improve with usage rather than staying static.
- •Agents can store and reuse user preferences (e.g., travel/lodging constraints)
- •Improvement over time: the app “knows you” better with continued interaction
- •Memory enables more coherent multi-session experiences
- •Applicable across consumer and enterprise agent scenarios
Cutting token cost and latency vs. dumping everything into context windows
Mem0 argues the naive approach—pushing all history into the prompt—is expensive and slow. Their system selects the most relevant information to include, reducing tokens and speeding retrieval.
- •Naive memory = copy/paste full history into the context window
- •More tokens increases inference cost and latency
- •Mem0 optimizes for the “right” and most accurate info, not maximum info
- •Positioned as an infra efficiency win for developers
Founder story: repeated startup attempts, Tesla background, and teaming up
The founders share their backgrounds: long-term collaboration since undergrad, one founder’s multiple prior startup attempts, and the other’s experience leading AI platform work at Tesla Autopilot. Their relationship and complementary skills set the stage for Mem0.
- •Founders met in undergrad and collaborated since ~2012
- •Deshraj led an AI platform team at Tesla Autopilot
- •Taranjeet attempted multiple startups before Mem0 (7th attempt)
- •They decided to build together after relocating and aligning on a project
YC application, pivot from Embed Chain, and the Sadhguru AI spark
They recount entering YC with a different framing (Embed Chain/RAG) and pivoting toward “LLM statelessness” as the deeper problem. A viral consumer side project (Sadhguru AI) revealed the need for persistent memory, prompting a fast launch after YC feedback.
- •Joined YC Summer ’24 while shifting focus from RAG to memory
- •Sadhguru AI went viral but users complained it didn’t remember their journey
- •Realization: RAG helps knowledge context, but doesn’t solve persistent personal state
- •They launched Mem0 quickly (within ~36 hours after a prompt to ship)
Developer workflow: two primitives—write memory and search memory
Mem0’s interface is described as simple building blocks: adding memories and retrieving them. The system tries to extract what matters from user-level inputs and returns key context when a new conversation or task begins.
- •Core API primitives: add memory (write) and search memory (read)
- •Developers send user-relevant data (e.g., chat responses)
- •Mem0 extracts meaningful information and builds a “state” representing evolution over time
- •On retrieval, Mem0 returns the most important information for the current interaction
Under the hood: hybrid memory datastore (KV + semantic + graph)
They explain a hybrid architecture that classifies incoming unstructured data into multiple storage representations. Retrieval queries pull from all three sources to balance accuracy and low latency in real time.
- •Unstructured inputs are classified into key-value pairs, semantic chunks, or graph memory
- •Graph memory captures relationships among facts for richer recall
- •Retrieval aggregates across the three sources for relevance and coverage
- •Design goal: real-time, low-latency retrieval with high accuracy
Customization and “expectation problems”: rules in plain language
Mem0 emphasizes that what counts as a “memory” varies by user and application. During onboarding and configuration, developers can specify—using natural language—what to store or ignore, and Mem0 converts this into operational rules in the pipeline.
- •Memory requirements differ by user and task; expectations vary
- •Onboarding aims to understand what the developer is building
- •Developers can specify capture/ignore rules in plain language
- •Rules are interpreted (via LLM) and applied to update memories reliably
Use cases across industries—and a shift toward agent self-memory
They outline broad applicability: anywhere an LLM app should improve with time, memory helps. They also note a new pattern: developers increasingly store memories about agents (their actions/decisions), not only about end users.
- •Common use cases: context management for coding agents, personal companions
- •Verticals: education (learning trajectory), healthcare (patient history), finance (trade history)
- •General principle: memory should be a default primitive for LLM apps
- •Emerging pattern: “memories about the agent” (agent behavior/history)
Staleness and decay: hard expiration, exponential weighting, and domain-specific retention
They discuss how memories can become outdated and how different applications need different forgetting strategies. Mem0 supports multiple decay mechanisms and configurable controls, including cases where certain preferences should never decay.
- •Some developers want hard decay (e.g., delete after 6 months)
- •Others want exponential decay prioritizing recency while retaining some history
- •Domain-specific logic: some preferences (e.g., travel) may remain relevant indefinitely
- •Configuration can be expressed in plain language plus additional tuning knobs
Competing with model-native memory: neutrality and portability across LLMs/frameworks
They position model-provider memory (e.g., OpenAI’s) as market education rather than a direct replacement. Mem0’s differentiation is decoupling memory from any single model vendor so developers can use multiple LLMs and retain control of read/write memory.
- •Provider memory features validate the need for memory in AI products
- •Developers often use multiple LLMs; memory shouldn’t be tied to one vendor
- •Memory is both read and write—locking it to a provider reduces flexibility
- •Mem0 aims to be neutral across frameworks and model providers
Fundraising, hiring, and roadmap: “make it work, neutral, portable”
They explain their $24M seed+Series A raise, how insiders doubled down based on traction, and what they’ll do next: hire and build the best memory product. Their longer-term vision is portable memory that works across a future of many agentic interfaces.
- •$24M seed+Series A: seed led by Kindred; Series A led by Basis Set; participation from Peak XV, YC, angels
- •Plan: expand a ~10-person team (India + SF) across applied AI, full-stack, forward-deployed, research, etc.
- •Engineering challenge: low-latency infra + reliability at scale for diverse expectations
- •Vision: progress from “works” to “neutral” to “portable” memory across many AI apps/agents
Founder lessons: focus (DFS vs BFS) and conviction over the long haul
They close with advice from their journeys—staying focused and going deep, while also acknowledging how a side project can unexpectedly create breakthroughs. They emphasize persistence and belief as key founder traits.
- •Deshraj: prioritize focus—“DFS over BFS,” go deep, talk to customers
- •Side projects can be “blessing in disguise” but aren’t always recommended as a strategy
- •Taranjeet: long-term persistence; success may take years and many attempts
- •Core takeaway: conviction and sustained execution matter