No PriorsBiohub: The Future of Biology is Open-Source with Mark Zuckerberg, Priscilla Chan, and Alex Rives
At a glance
WHAT IT’S REALLY ABOUT
Biohub’s open-source AI and biology platform to accelerate medicine
- Biohub’s core thesis is that the fastest path to medical breakthroughs is building shared, open tools and datasets that the entire scientific community can use, rather than trying to “cure diseases” as a single centralized organization.
- They argue biology is data-constrained in a fundamentally different way than internet-trained AI, so Biohub pairs frontier AI with “frontier biology” (new instruments, assays, imaging, and engineered systems) to create the missing data needed for robust models.
- Their modeling roadmap is explicitly hierarchical—proteins to cells to systems—aiming ultimately at a “virtual cell” that can generalize to new interventions and connect genotype→molecular mechanism→phenotype.
- A key research direction is mechanistic interpretability for protein language models, using model representations to uncover latent biological structure/function relationships and generate new hypotheses, not just predictions.
- ESMFold2 is highlighted as an open-source protein world model enabling fast structure prediction and emergent protein/antibody design, validated with wet-lab assays and cryo-EM, with implications for reducing off-target effects and accelerating preclinical iteration.
IDEAS WORTH REMEMBERING
5 ideasBiohub optimizes for ecosystem speed, not ownership of cures.
They repeatedly stress the goal is to accelerate the whole field by releasing tools/models openly so many labs can iterate in parallel, including on niche and rare diseases that market incentives often neglect.
Biology’s bottleneck is often *creating* the right data, not just training bigger models.
Unlike LLMs with abundant internet text, many key biological phenomena aren’t observable with existing methods, so Biohub invests in new assays, imaging, sensors, and cellular engineering to generate novel datasets.
A “virtual biology” stack requires hierarchical models with connective tissue between layers.
They argue you can’t reliably model systems-level behavior (e.g., immune dynamics) without solid molecular and cellular foundations, and they design experiments (spatial transcriptomics, developmental imaging, cell–cell signaling sensors) to bridge layers.
Mechanistic interpretability is positioned as a route to *new biological knowledge*.
Because protein language models learn from billions of sequences (including poorly characterized proteins), interpreting representation spaces could link unknown proteins to known function/structure “grammar,” generating testable hypotheses beyond black-box prediction.
ESMFold2 demonstrates an “open discovery engine” pattern: general model → emergent capabilities.
Rives emphasizes they didn’t build an antibody-specific model; a general protein world model yields protein/antibody interaction competence and enables digital exploration of designs followed by small-scale lab confirmation (e.g., 96-well validation, nanomolar binders).
WORDS WORTH SAVING
5 quotesWe'll have a bigger impact by getting this in more scientists' hands quicker by doing it as open-source projects instead.
— Mark Zuckerberg
The theory isn't that we're gonna cure the diseases. We're not. It's that we wanna help accelerate the pace of progress for the whole scientific field.
— Mark Zuckerberg
We folded over 1.1 billion proteins and predicted their structures, and we didn't design a model for antibodies. We didn't design a model to be able to bind one particular target. We just designed a model that could understand proteins.
— Alex Rives
So my goal is to be able to treat the individual as an individual, understand the mechanisms, and be able to intervene.
— Priscilla Chan
Our vision is not that there's gonna be like some central super intelligence that solves all of science. I think, like, people are really important and I think will be more important in the future, and giving people more tools to be more productive is gonna be like a critical part of any kind of positive future.
— Mark Zuckerberg
High quality AI-generated summary created from speaker-labeled transcript.