Skip to content
Best Place To BuildBest Place To Build

Pratyush Kumar, Co-founder, Sarvam AI | "Sarvam means everybody- AI should be for everyone."| Ep. 24

In this conversation with Pratyush Kumar, co-founder and CEO of Sarvam AI, we dive deep into India's AI revolution. As the leader of one of India's hottest AI startups focused on solving Indian language challenges, Pratyush shares insights on building sovereign AI capabilities, the multi-layered approach to AI development, and the vision of making technology accessible to every Indian. Recently selected by the Government of India to build the country's sovereign language model under the India AI mission, Sarvam AI represents the intersection of technological innovation and national strategy. This episode offers a compelling look at how homegrown AI is positioning India at the forefront of the global AI landscape, with valuable lessons for aspiring technologists and entrepreneurs. 00:00:00 - Introduction and Background 00:03:04 - The Birth of AI4Bharat 00:05:07 - How IIT students helped build foundational components 00:08:00 - Birth of Sarvam AI 00:14:51 - The Four Layers of AI Development 00:21:33 - Real-World Applications of AI in India 00:26:04 - Strategic Autonomy in Technology 00:28:43 - Sovereign AI: India's Approach 00:34:40 - AI as a Utility for Everyone 00:38:54 - Technology as an Equalizer 00:41:40 - The Economics of AI Development 00:40:57 - Cost Structure of AI Business 00:42:58 - The Value Loop & Long-term Vision 00:45:05 - Market Dynamics & Competition 00:46:15 - Managing Fast-Paced Growth & Focus 00:47:47 - Indian AI Ecosystem & Academic Integration 00:51:28 - Talent Pipeline & Educational Infrastructure 00:53:22 - National AI Landscape & Government Engagement 00:55:07 - Work-Life Balance & Personal Fulfillment in AI 00:57:20 - AI Integration in Daily Work & Workflows 00:59:06 - Human-AI Relationship & Philosophical Implications 01:03:12 - Sarvam's Roadmap & Closing Thoughts

Pratyush Kumarguest
May 23, 20251h 4mWatch on YouTube ↗

CHAPTERS

  1. Why Indian-language AI matters: diversity, culture, and strategic tech

    Pratyush frames the core problem Sarvam/AI4Bharat set out to solve: building AI that truly works across India’s linguistic and cultural diversity. He argues that for strategic technologies like AI, the country should retain the capability to build key systems itself.

    • India’s many scripts and language variations make Indian-language AI a uniquely hard challenge
    • Culture is embedded in language; AI must reflect both linguistic and cultural context
    • Strategic technologies require domestic capability, not just adoption
    • Building locally changes a country’s stance and autonomy
  2. Pratyush’s path into AI: systems engineering to deep learning scale

    He traces his journey from electrical engineering and systems/HPC research into AI during the early deep-learning inflection point. The discussion highlights the shift from algorithm-centric progress to compute-and-data-driven scaling and why that pulled him into foundational work.

    • Education: IIT Bombay (EE) → ETH Zurich PhD (systems/HPC) → IBM Research
    • Deep learning’s early era (post-AlexNet) and the move toward compute-driven AI
    • Systems reliability, performance, and scale as crucial AI enablers
    • Joining IIT Madras faculty and deciding to focus on AI research for India
  3. AI4Bharat origins: teaching at scale, community experiments, and a pivot to language

    AI4Bharat begins with hands-on deep learning courses that unexpectedly scale to tens of thousands of learners. After a decentralized ‘hacker motion’ doesn’t work well, the team focuses the effort on a few lab-led problems—eventually converging on Indian-language AI as the highest-leverage direction.

    • Hands-on deep learning courses; ~50,000 learners and real career impact
    • Initial decentralized volunteering model didn’t sustain execution
    • Shift to lab-driven problems with students + volunteers
    • Strategic pivot: Indian-language AI becomes the central mission
  4. Building foundational components at IIT: data pipelines to competitive translation

    Pratyush explains how students helped build critical building blocks such as high-quality web scraping for Indian-language data. Within about a year, the lab produced translation systems competitive with big tech, attracting government and philanthropic support and maturing into a large center of excellence.

    • Early ‘unsexy’ foundations: high-quality scrapers and data flywheel setup
    • Rapid progress to competitive English↔Indic translation systems
    • Support: Government of India (KASHINI) + Nandan Nilekani philanthropy
    • AI4Bharat scales to ~200+ people; leading open-source Indic language lab
    • Ongoing collaboration between Sarvam and AI4Bharat
  5. From lab to company: why Sarvam AI was created

    The conversation shifts to why a venture-backed company was needed: foundational models and production-grade LLMs require far more compute, capital, and engineering than translation systems. Sarvam is positioned as a contrarian bet on India’s market potential and long-term AI cycle, inspired by DPI scale stories like Aadhaar/UPI.

    • LLMs require far more compute/capital than earlier NLP systems
    • Motivation: build ‘from scratch’ in India, for India—beyond short-term apps
    • Co-founder Vivek Raghavan’s background: Aadhaar, DPI, and applied AI systems
    • Contrarian view: India isn’t “small market”; it’s early in a long AI adoption cycle
    • Public-private platforms (UPI-like) as a model for scaling AI
  6. What foundation models are—and why nations and companies race to build them

    Pratyush defines foundation models as general-purpose systems that can be adapted to many tasks rather than single-skill models. He connects their importance to broad economic value creation and to AI’s emerging status as strategic national infrastructure.

    • Foundation models: general-purpose capabilities across language/audio/vision
    • Applications sit ‘on top’ (e.g., search, code assistants, legal summarization)
    • Rapid cycles: months instead of decades compared to prior industrial revolutions
    • Commercial imperative plus strategic/national-security imperative
    • OpenAI/Google/Meta/China as examples of global-scale builders
  7. Sarvam’s four-layer ‘full-stack’ view: inference, models, orchestration, applications

    He lays out Sarvam’s stack and why the company spans layers instead of specializing in one. The orchestration layer is emphasized as the glue that makes real systems reliable—especially for voice and reasoning-heavy workflows—while domain experts drive application design.

    • Inference layer: efficient serving, GPU utilization, compilation/optimization, cost control
    • Model layer: LLMs and other modality models (audio/vision)
    • Orchestration layer: combining models + tools + software for reliable, scalable systems
    • Applications layer: domain-led solutions (e.g., insurance, government services)
    • Rationale: different skills needed at each layer; full-stack helps deliver outcomes
  8. Data for India: scarcity, ‘culture tokens,’ and code-mixed realities

    Pratyush details why Indic data is harder: many languages have limited digitized text and cultural content often exists in undigitized sources. He also highlights code-mixing and Romanized typing as essential real-world phenomena models must handle to be useful for Indian users.

    • Three core ingredients: data, compute, talent (plus non-obvious know-how)
    • Indic languages: uneven data availability and incomplete digitization
    • ‘Culture tokens’: content embedded in old/undigitized books and artifacts
    • Modern usage: Romanized Indic typing and evolving language norms
    • Code-mixing: English words within Indic sentences as a first-class requirement
  9. Real deployments in India: Aadhaar basement stacks, insurance calls, courts, and NITI Aayog

    Concrete case studies illustrate what ‘full-stack’ means in practice: air-gapped sovereign deployments, large-scale voice outreach, and complex data-to-policy reasoning systems. The examples stress reliability, latency, and application-specific design as key to real value creation.

    • Air-gapped, on-prem deployments for UIDAI/Aadhaar-style constraints
    • Voice bots assisting citizens when biometric authentication fails
    • Insurance renewal outreach at crore-scale in local languages
    • Judgment accessibility: working with courts; legal language isn’t plain translation
    • NITI Aayog project: querying thousands of tables + PDFs via natural language; auto-code, self-correction, report generation
  10. Strategic autonomy and sovereign AI: capability to build, deploy, and scale

    Pratyush frames sovereign AI as the ability to build strategic technology domestically without ‘decoupling’ from the world. He links the concept to national resilience, large-scale deployment capacity (compute/power/app ecosystem), and staying close to state-of-the-art through a long AI cycle.

    • Strategic autonomy: collaborate globally, but retain ability to build from scratch
    • Sovereign AI includes talent + resources + deployment capability at national scale
    • AI as a general-purpose technology akin to electricity in development impact
    • Goal: start now to stay competitive through 2030 and beyond
    • Global momentum: Japan, South Korea, France, UK and others launching programs
  11. AI as a utility: open standards, per-capita access, and technology as an equalizer

    The discussion explores how AI could mirror India’s digital public infrastructure pattern: utility-like access with open standards that enables private innovation. Pratyush suggests ‘per-capita AI consumption’ as a future proxy for competitiveness and argues that AI can flatten access gaps between urban and rural users.

    • AI should be cheap, efficient, and widely available like UPI-style utilities
    • Open standards vs open source: the right model is still evolving for AI
    • Vision: citizens get access to ‘lakhs of tokens per day’ via useful applications
    • Per-capita AI usage as a proxy for national productivity/advancement
    • Equalized experience: rural and urban users should get comparable service quality
  12. The economics of building AI: cost drivers, why funding matters, and the local value loop

    Pratyush breaks down where money goes: data preparation, GPU-heavy training, expensive talent, and productization across the stack. He introduces the ‘value loop’—deploy, learn from usage, improve quickly—and argues that keeping this loop within India is key for long-term economic and strategic returns.

    • Major costs: data prep, model training, GPUs, and top-tier talent
    • ‘GPU factories’ idea: GPUs also process/produce training datasets
    • Beyond models: software engineering + orchestration + applications add cost
    • Costs are falling with efficiency improvements, but still prohibitive at scale
    • Value loop: deployments create feedback → better models in months; must remain local
  13. Competition speed, focus, and building amid constant change

    The conversation turns to the emotional and operational reality of the AI race: rapid shifts, noisy news cycles, and strong capital flows. Pratyush emphasizes anchoring on medium-term clarity and democratization goals to avoid being whipsawed by short-term hype.

    • AI progress is ‘too fast for comfort’ and can feel overwhelming
    • Need strong medium-term vision to stay steady amid noise
    • Maintain focus despite abundant capital and constant new announcements
    • Sarvam’s north star: democratizing AI for India over decades
    • Plan for compounding gains leading up to 2030
  14. Ecosystem and talent pipeline: academia-startup integration and a ‘builders vs sellers’ mindset

    Pratyush describes the early signs of an ecosystem effect around IIT Madras but says deeper integration is needed among academia, VCs, and operators. He argues for hands-on building, real feedback loops, and structures that help students (beyond IITM) execute in a world changing every six months.

    • Ecosystem exists but needs stronger institutional bridges between campus and startups/VCs
    • Encourage students’ self-belief and exposure to real building environments
    • ‘Cut the crap’: prioritize building with fast feedback over abstractions
    • Open doors beyond IITM; leverage research park and broader participation
    • National deep-tech moment: more government and strategic sector engagement
  15. AI in daily workflows, hallucinations, and the human–AI relationship

    They discuss how AI is reshaping everyday work and the risks of over-reliance, including stacked hallucinations in research workflows. The conversation broadens into philosophy: what remains uniquely human, how engineered systems shape experience, and the need to deliberately steer toward a positive, symbiotic future.

    • AI now sits ‘in the palm’—directly used across teams and weekly workflows
    • Reliance risks: hallucinations and cascading errors across toolchains
    • Concern: future gap between human and machine capability
    • Reframing humanity beyond output; prioritizing lived experience and values
    • Engineered systems matter—society must deliberately design for positive outcomes
  16. Sarvam’s roadmap: sovereign LLM build, scaling products, closing the state-of-the-art gap

    Pratyush closes with Sarvam’s near-term execution plan and multi-year ambition. The company aims to ship India’s sovereign model, expand product surface area, and steadily reduce the gap to global state-of-the-art while keeping democratization at the center.

    • Near term: build the sovereign model within the year
    • Current products already scaled (millions of calls; analytics for organizations)
    • Sovereign model expands ‘surface area’ for new applications and partners
    • 2–5 year goal: close gap to best models of the day as the frontier shifts
    • Long-term intent: strategic capability + broad access for India’s population scale

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.