Skip to content
No PriorsNo Priors

No Priors Ep. 6 | With Daphne Koller from Insitro

Life-saving therapeutics continue to grow more costly to discover. At the same time, recent advances in using machine learning for the life sciences and medicine are extraordinary. Are we on the verge of a paradigm shift in biotech? This week on the podcast, a pioneer in AI, Daphne Koller, joins Sarah Guo and Elad Gil on the podcast to help us explore that question. Daphne is the CEO and founder of Insitro — a company that applies machine learning to pharma discovery and development, specifically by leveraging “induced pluripotent stem cells.” We explain Insitro’s approach, why they’re focused on generating their own data, why you can’t cure schizophrenia in mice, and how to design a culture that supports both research and engineering. Daphne was previously a computer science professor at Stanford, and co-founder and co-CEO of edutech company Coursera. 00:00 - Introduction 01:49 - How Daphne combined her biology and tech interests and ran a bifurcated lab at Stanford 04:34 - Why Daphne resigned an endowed chair at Stanford to build Coursera 14:14 - How insitro approaches target identification problems and training data 18:33 - What are pluripotent stem cells and how insitro identifies individual neurons 24:08 - How insitro operates as an engine for drug discovery and partners to create the drugs themselves 26:48 - Role of regulations, clinical trials and disease progression in drug delivery 33:19 - Building a team and workplace culture that can bridge both bio and computer sciences 39:50 - What Daphne is paying attention to in the so-called golden age of machine learning 43:12 - Advice for leading a startup in edtech and healthtech

Sarah GuohostDaphne KollerguestElad Gilhost
May 18, 202346mWatch on YouTube ↗

At a glance

WHAT IT’S REALLY ABOUT

Daphne Koller on Reinventing Drug Discovery With Machine Learning and Biology

  1. Daphne Koller traces her path from core machine learning and probabilistic graphical models to co-founding Coursera and ultimately founding insitro, an ML-first drug discovery company.
  2. She explains why traditional drug development is so slow, expensive, and failure-prone, and argues that the biggest leverage point is choosing the right biological target and patient population, not just better molecules or trial design.
  3. Insitro’s strategy combines large-scale human data (“experiments of nature”) with high-content cellular experiments (e.g., CRISPR-edited iPSC-derived cells) and modern ML to build more predictive, human-relevant disease models.
  4. Koller also discusses building a hybrid tech–bio culture, the importance of biomarkers and genetics, and broader opportunities at the AI–biology interface in areas like agriculture, materials, and education.

IDEAS WORTH REMEMBERING

5 ideas

The greatest leverage in drug discovery is choosing the right target and patients.

Most drug programs fail not because of bad molecules or trial design, but because they modulate the wrong biological target in the wrong indication or population. Fixing this dramatically reduces wasted spend across the pipeline.

Combine deep learning’s pattern recognition with structured, causal reasoning.

Koller argues the field is swinging back from purely deep learning toward a synthesis with probabilistic/causal modeling and interpretability, especially in high-stakes domains like medicine where you must explain and reason about decisions.

Use “experiments of nature” plus controlled cellular experiments to compensate for missing training data.

Because we rarely have direct data mapping interventions to clinical outcomes, insitro leverages human genetics (genotype–phenotype maps) and lab-based perturbation of cells, then uses ML to connect these layers and predict human-relevant effects.

High-content, unbiased data (imaging, omics, MRIs) is a key enabler for ML in biology.

Insitro intentionally chooses therapeutic areas like neuroscience, metabolism, and oncology where rich human data (e.g., MRIs, biopsies, omics) and tractable in vitro models (e.g., neurons from iPSCs) exist to feed modern ML models.

Biomarkers and human genetics roughly double the odds of clinical success.

Drugs backed by human genetic evidence and robust biomarkers are about twice as likely to succeed in trials. ML-driven analysis of human data naturally yields such biomarkers and stratification signals when you explicitly look for them.

WORDS WORTH SAVING

5 quotes

It’s not like X‑ray crystallography. It’s like computers. You’re going to use machine learning everywhere across the drug discovery process.

Daphne Koller

If you really want to bring down that $2.5 billion number, what you have to do is bring down this completely mind‑blowing statistic of 95% of drug programs fail.

Daphne Koller

Each of us is an experiment of nature, where nature has modulated our genetics… and we can look at that mapping from genotype to phenotype as a surrogate of what a therapeutic intervention would do.

Daphne Koller

There are so many tech people who come into life sciences and they’re like, ‘We are machine learning, we’re going to solve everything,’ and they don’t respect the challenges of the other discipline.

Daphne Koller

We only have one life to live, and ultimately you want to be able to look back and say, ‘I’ve done something that’s really worthwhile and important.’

Daphne Koller

Daphne Koller’s career arc across academia, Coursera, Calico, and founding insitroEvolution and renewed relevance of probabilistic graphical models alongside deep learningStructural problems in drug discovery: costs, timelines, and 95% program failure rateInsitro’s ML-first approach to target identification using human and cellular dataUse of iPSCs, CRISPR, and high-content phenotyping to model disease in vitroBiomarkers, patient stratification, and interactions with regulators in clinical developmentBuilding culture and processes that bridge computer science and experimental biology

High quality AI-generated summary created from speaker-labeled transcript.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome