Skip to content
No PriorsNo Priors

No Priors Ep. 5 | With Huggingface’s Clem Delangue

After starting as a talking emoji companion, Hugging Face is now an organizing force for the open source AI research ecosystem. Its models are used by companies such as Apple, Salesforce and Microsoft, and it's working to become the GitHub for ML. This week on the podcast, Sarah Guo and Elad Gil talk to Clem Delangue, co-founder and CEO of Hugging Face. Clem shares how they shifted away from their original product, why every employee at Hugging Face is responsible for community-building, the modalities he's most interested in, and what role open source has in the AI race. 00:00 - Introduction 01:53 - how Clem first became interested in ML, being shouted at by eBay sellers, and the foretelling of the end of barcode scanning 03:34 - early iterations of Hugging Face, trying to make a less boring AI tamagotchi, and switching directions towards open source tools 05:36 - advice for founders considering a change in direction, 30%+ experimentation 07:39 - 1st users, MLTwitter, approach to community 10:47 - enterprise ML maturity, days to production 12:54 - open source vs. proprietary models 15:56 - main model tasks, architectures and sizes 19:12 - decentralized infrastructure, data opt out 24:16 - Hugging Face’s business model, GitHub 28:09 - What Clem is excited about in AI

Elad GilhostClem DelangueguestSarah Guohost
May 19, 202337mWatch on YouTube ↗

CHAPTERS

  1. 0:00 – 2:45

    From eBay power-seller to computer vision: the trade-show moment that changed everything

    Clem recounts growing up in France, running a prolific electronics business on eBay, and landing an eBay internship. A chance encounter at a trade show—plus a prediction about object recognition replacing barcode scanning—pulled him into ML and a computer-vision startup.

    • Ran a major French eBay seller operation and worked closely with eBay teams
    • Trade shows were dominated by PayPal support complaints directed at the shared booth
    • Met an engineer who predicted object recognition would replace barcode scanning
    • Did follow-up research, validated the person’s credibility, and decided to switch paths
    • Joined a startup doing ML for computer vision ~15 years ago
  2. 2:45 – 3:59

    Hugging Face v1: building a fun “AI Tamagotchi” instead of boring assistants

    Clem explains that Hugging Face began as a consumer chatbot product—a playful AI companion inspired by sci‑fi. The team spent nearly three years on this direction and achieved meaningful engagement at scale.

    • Founders wanted an AI project that was scientifically challenging and fun
    • Motivation: Alexa/Siri felt limited to productivity and “weather” use cases
    • Built an AI friend / talking-emoji Tamagotchi-style chatbot product
    • Operated this product for almost three years
    • Reached billions of messages exchanged between users and bots
  3. 3:59 – 5:37

    The pivot to open source: Transformers, BERT traction, and following unexpected pull

    The company’s shift toward open-source tooling emerged gradually as transformer models took off. As usage of their open-source components exploded—from a few users to hundreds of companies—the team reallocated effort and raised a Series A around the new momentum.

    • Open source was present early as partial releases alongside the chatbot product
    • Transformers/BERT era triggered a major spike in adoption of their open-source work
    • Progressive internal shift: from one cofounder maintaining it to the whole team
    • Signal-based decision-making: attention and excitement clearly concentrated on tooling
    • Series A funding reinforced the new direction as the primary company bet
  4. 5:37 – 8:12

    Founder guidance on changing direction: balancing exploration vs. exploitation

    Clem offers a framework for pivots and ongoing innovation: maintain a deliberate ratio of experimentation to execution both before and after product-market fit. Hugging Face institutionalized exploration, giving experiments time and resources to become major product lines.

    • Common failure modes: too much thrash pre-PMF, too little experimentation post-PMF
    • Aim for a healthy exploration/exploitation split to avoid local optima
    • Hugging Face targets ~30–40% of effort for exploration/long-term bets
    • Experiments need runway and ownership to mature into real products
    • Example: Spaces started as one person’s experiment and scaled to 50k+ demos
  5. 8:12 – 12:42

    Getting the first users: ML Twitter, network effects, and builder-led community

    Early growth came largely from Twitter, where the ML community amplified open-source releases. Hugging Face then benefited from marketplace-style network effects between researchers publishing models and industry adopting them, reinforced by a culture where every employee engages the community directly.

    • Initial distribution was driven by ML Twitter and organic sharing/retweets
    • Researchers shared models for visibility, pulling in industry users
    • Industry demand then encouraged more researchers to publish on the Hub
    • No dedicated community/PR hires early; community engagement was everyone’s job
    • Even the main Hugging Face Twitter account is shared across the whole team
  6. 12:42 – 14:53

    Bridging research to enterprise: why ML’s “time to production” is unusually fast

    Sarah asks about the gap between cutting-edge research and enterprise readiness. Clem argues ML’s research-to-production loop is dramatically faster than traditional sciences, and that open source is a key reason the virtuous cycle works.

    • Compared with traditional sciences, ML transfer to production is exceptionally rapid
    • Time horizons are shrinking from years to months/weeks/days
    • Open source accelerates iteration: science→production→science feedback loops
    • Concern: proprietary models could slow the ecosystem’s overall progress
    • Open collaboration is framed as core to the speed of advancement
  7. 14:53 – 18:33

    Open source vs proprietary models: coexistence, task-level leadership, and specialization

    Clem rejects a single-winner narrative, arguing open and proprietary approaches will continue to trade leads across tasks. He points to the diversity of models and builders on Hugging Face as evidence that specialization, cost, and accuracy drive a multi-model world.

    • Open and proprietary will likely coexist, as in search and databases
    • Leadership varies by task (e.g., proprietary may lead in some text generation; OSS in others)
    • Ecosystem is dynamic—advantages can flip quickly as techniques and releases evolve
    • Hugging Face scale: 250k+ models uploaded by ~15k companies suggests no single “best” model
    • Specialized models are often cheaper, faster, and more accurate for specific use cases
  8. 18:33 – 19:55

    What’s on the Hub: dominant modalities, emerging domains, and model size spectrum

    Clem characterizes the distribution of models by task and domain, highlighting NLP, vision/text-to-image, and audio as top categories. He also notes growing activity in time series and in biology/chemistry, spanning model sizes from millions to 180B parameters.

    • Top three task buckets: NLP; computer vision/text-to-image; audio (ASR/TTS, etc.)
    • Emerging growth in time series (e.g., ETA prediction) and financial/fraud applications
    • Increasing presence of biology and chemistry models/datasets/demos
    • Model sizes range broadly: from a few million parameters to ~180B parameters
    • The platform is used for both classic tasks and frontier generative workflows
  9. 19:55 – 21:10

    Why small models still matter: latency, cost, and real-time enterprise needs

    Not all use cases benefit from the largest models. Clem explains how production constraints like latency and cost create strong demand for smaller, faster, more targeted models, while some applications still justify larger general-purpose systems.

    • Model choice is use-case dependent rather than “bigger is always better”
    • Real-time products (example: Bloomberg Terminal) prioritize low latency
    • Smaller models can be faster and cheaper while meeting accuracy requirements
    • Larger models fit more general needs when cost/latency constraints are looser
    • Hugging Face observes broad, distributed usage across model sizes
  10. 21:10 – 22:35

    Infrastructure focus: making ML deployment sustainable and cost-aligned

    Clem says he’s especially excited about infrastructure—an often overlooked lever for making ML healthy and scalable. He expects the ecosystem to mature by aligning infrastructure costs with real user value and use cases.

    • Infrastructure is a key bottleneck: cost, speed, and operational sustainability
    • The field has often ignored true run costs while chasing capability gains
    • Expectation of clearer cost/performance alignment as the ecosystem matures
    • “Cloud money laundering” critique: disconnect between spend and actual value
    • Better infra economics will enable broader, longer-term progress
  11. 22:35 – 24:48

    Decentralized training and data consent: online learning and opt-in/opt-out datasets

    Clem highlights two desired ecosystem shifts: more decentralized/continuous learning infrastructure and stronger consent mechanisms for training data. He cites BigCode opt-out for code and rising opt-in datasets—especially relevant amid text-to-image artist debates.

    • Desire for decentralization in infra and more frequent/continuous training cycles
    • Critique of “train once, refresh in 6–12 months” as archaic and causes staleness
    • Online learning could keep models current (and reduce knowledge lag)
    • BigCode introduces developer opt-out from training datasets for code models
    • Momentum toward opt-in datasets; heightened importance for artist-consent in image models
  12. 24:48 – 27:05

    BLOOM and BigScience: open collaboration as an answer to power concentration and bias

    Clem explains BLOOM as the product of BigScience, a large-scale open collaboration to train an LLM transparently. He argues building in the open helps counter AI power concentration and allows broader participation to identify and mitigate bias.

    • BLOOM came from BigScience: ~1,000 researchers across ~200 organizations
    • End-to-end transparency: public runs, discussions, and decision process
    • Compute support included external infrastructure (e.g., French compute provider)
    • Open source/open science framed as solutions to concentration of power and bias
    • Diversity of contributors helps surface harms affecting underrepresented groups
  13. 27:05 – 28:34

    Adapting to new paradigms: RLHF and diffusion-era tooling for the community

    Elad asks about RLHF as a post-pretraining step and how it affects strategy. Clem describes Hugging Face’s approach: support new waves early by hosting models and building open-source libraries so companies can integrate emerging techniques.

    • RLHF is treated as an important addition to standard ML pipelines
    • Hugging Face began hosting RLHF-related models before the broader hype cycle
    • They’re developing an open-source library to help companies adopt RLHF workflows
    • Pattern: the company continually adapts to new model generations (e.g., diffusers)
    • Flexibility is positioned as essential for ML startups due to rapid paradigm shifts
  14. 28:34 – 34:04

    Monetization and the GitHub analogy: freemium, enterprise needs, and becoming a compute gateway

    Clem outlines Hugging Face’s evolving business model: a freemium platform with paid tiers driven by enterprise security/compliance and infrastructure offerings. In discussing GitHub, he notes that compute/infrastructure monetization may be a key opportunity for platforms that sit at the start of developer workflows.

    • Current shape: ~15k companies use the platform; ~3k pay (freemium dynamics)
    • Paid differentiation increasingly centers on security and compliance for large enterprises
    • Infrastructure monetization: GPU upgrades for Spaces, inference endpoints, optimization help
    • GitHub comparison: a platform can influence downstream infrastructure choices by being the starting point
    • Strategic focus: invest earlier in becoming a ‘gate for compute’ via cloud partnerships
  15. 34:04 – 37:25

    Looking ahead: bio/chem breakthroughs and the rise of full-stack ML-native companies

    Clem’s excitement centers on biology and chemistry as high-impact domains for “software 2.0” approaches. He also highlights a new generation of ML-native companies that build the models and the product together, enabling them to challenge incumbents.

    • Bio and chem seen as major next frontiers for ML impact
    • ML as a new paradigm for building technology (“software 2.0” framing)
    • Evidence of momentum: BioGPT, protein work, and growing Hub activity in these areas
    • Clem avoids specific predictions, citing the field’s unpredictability
    • Excitement about ‘full-stack ML’ companies (e.g., Runway, Grammarly, Stability) that build ML natively

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.