No Priors

No Priors Ep. 6 | With Daphne Koller from Insitro

Sarah Guo and Daphne Koller on daphne Koller on Reinventing Drug Discovery With Machine Learning and Biology.

Sarah GuohostDaphne KollerguestElad Gilhost
May 19, 202346m
Daphne Koller’s career arc across academia, Coursera, Calico, and founding insitroEvolution and renewed relevance of probabilistic graphical models alongside deep learningStructural problems in drug discovery: costs, timelines, and 95% program failure rateInsitro’s ML-first approach to target identification using human and cellular dataUse of iPSCs, CRISPR, and high-content phenotyping to model disease in vitroBiomarkers, patient stratification, and interactions with regulators in clinical developmentBuilding culture and processes that bridge computer science and experimental biology

In this episode of No Priors, featuring Sarah Guo and Daphne Koller, No Priors Ep. 6 | With Daphne Koller from Insitro explores daphne Koller on Reinventing Drug Discovery With Machine Learning and Biology Daphne Koller traces her path from core machine learning and probabilistic graphical models to co-founding Coursera and ultimately founding insitro, an ML-first drug discovery company.

At a glance

WHAT IT’S REALLY ABOUT

Daphne Koller on Reinventing Drug Discovery With Machine Learning and Biology

  1. Daphne Koller traces her path from core machine learning and probabilistic graphical models to co-founding Coursera and ultimately founding insitro, an ML-first drug discovery company.
  2. She explains why traditional drug development is so slow, expensive, and failure-prone, and argues that the biggest leverage point is choosing the right biological target and patient population, not just better molecules or trial design.
  3. Insitro’s strategy combines large-scale human data (“experiments of nature”) with high-content cellular experiments (e.g., CRISPR-edited iPSC-derived cells) and modern ML to build more predictive, human-relevant disease models.
  4. Koller also discusses building a hybrid tech–bio culture, the importance of biomarkers and genetics, and broader opportunities at the AI–biology interface in areas like agriculture, materials, and education.

IDEAS WORTH REMEMBERING

7 ideas

The greatest leverage in drug discovery is choosing the right target and patients.

Most drug programs fail not because of bad molecules or trial design, but because they modulate the wrong biological target in the wrong indication or population. Fixing this dramatically reduces wasted spend across the pipeline.

Combine deep learning’s pattern recognition with structured, causal reasoning.

Koller argues the field is swinging back from purely deep learning toward a synthesis with probabilistic/causal modeling and interpretability, especially in high-stakes domains like medicine where you must explain and reason about decisions.

Use “experiments of nature” plus controlled cellular experiments to compensate for missing training data.

Because we rarely have direct data mapping interventions to clinical outcomes, insitro leverages human genetics (genotype–phenotype maps) and lab-based perturbation of cells, then uses ML to connect these layers and predict human-relevant effects.

High-content, unbiased data (imaging, omics, MRIs) is a key enabler for ML in biology.

Insitro intentionally chooses therapeutic areas like neuroscience, metabolism, and oncology where rich human data (e.g., MRIs, biopsies, omics) and tractable in vitro models (e.g., neurons from iPSCs) exist to feed modern ML models.

Biomarkers and human genetics roughly double the odds of clinical success.

Drugs backed by human genetic evidence and robust biomarkers are about twice as likely to succeed in trials. ML-driven analysis of human data naturally yields such biomarkers and stratification signals when you explicitly look for them.

Bridging tech and bio requires explicit cultural design, not just hiring both skill sets.

Insitro codified behavioral norms—engaging openly, constructively, and with respect—to avoid “ML will solve everything” arrogance, encourage naïve questions, and reconcile engineering’s desire for clean abstractions with biology’s messy reality.

AI–biology is a broad platform opportunity far beyond drug discovery.

Koller highlights parallel opportunities in agtech, environmental tech, energy, biomaterials, food tech, and education, arguing that the simultaneous tool revolutions in AI and experimental biology make this one of the richest frontiers for impact.

WORDS WORTH SAVING

5 quotes

It’s not like X‑ray crystallography. It’s like computers. You’re going to use machine learning everywhere across the drug discovery process.

Daphne Koller

If you really want to bring down that $2.5 billion number, what you have to do is bring down this completely mind‑blowing statistic of 95% of drug programs fail.

Daphne Koller

Each of us is an experiment of nature, where nature has modulated our genetics… and we can look at that mapping from genotype to phenotype as a surrogate of what a therapeutic intervention would do.

Daphne Koller

There are so many tech people who come into life sciences and they’re like, ‘We are machine learning, we’re going to solve everything,’ and they don’t respect the challenges of the other discipline.

Daphne Koller

We only have one life to live, and ultimately you want to be able to look back and say, ‘I’ve done something that’s really worthwhile and important.’

Daphne Koller

QUESTIONS ANSWERED IN THIS EPISODE

5 questions

How can probabilistic graphical models and causal inference be most effectively integrated with deep learning in real-world biomedical applications?

Daphne Koller traces her path from core machine learning and probabilistic graphical models to co-founding Coursera and ultimately founding insitro, an ML-first drug discovery company.

What technical approaches does insitro use to ensure that in vitro cellular models truly predict human clinical outcomes, rather than overfitting to lab artifacts?

She explains why traditional drug development is so slow, expensive, and failure-prone, and argues that the biggest leverage point is choosing the right biological target and patient population, not just better molecules or trial design.

How should regulators evolve their frameworks to better incorporate ML-derived biomarkers and patient stratification without compromising safety?

Insitro’s strategy combines large-scale human data (“experiments of nature”) with high-content cellular experiments (e.g., CRISPR-edited iPSC-derived cells) and modern ML to build more predictive, human-relevant disease models.

What specific cultural or organizational failures most often doom AI–bio collaborations, and how can new startups preempt them from day one?

Koller also discusses building a hybrid tech–bio culture, the importance of biomarkers and genetics, and broader opportunities at the AI–biology interface in areas like agriculture, materials, and education.

Beyond drug discovery, which AI–biology application (e.g., agtech, materials, food) does Koller believe is most underexplored yet poised for outsized impact?

EVERY SPOKEN WORD

Install uListen for AI-powered chat & search across the full episode — Get Full Transcript

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome