Dwarkesh PodcastDavid Reich – Bronze Age shock, the Neanderthal puzzle, & the sudden spread of farming
CHAPTERS
Ancient DNA finally gets big enough to measure selection over time
Reich explains why ancient DNA revolutionized human migration history long before it could deliver on the original promise: tracking biological change. The key bottleneck was sample size—single genomes are rich for ancestry, but poor for estimating allele-frequency trajectories and selection.
- •Ancient DNA excelled at reconstructing migrations, mixtures, and sex-biased events
- •Why biology/phenotype evolution was hard: single individuals provide only 1–2 allele samples per locus
- •Large time-series sample sizes are needed to distinguish drift from selection
- •The field’s recent scale-up enables frequency-change studies over millennia
What allele-frequency change tells you (and why migration is a confounder)
They discuss why frequency shifts across time are informative about adaptation to changing environments (diet, altitude, pathogens, domestication). Reich emphasizes that most frequency change comes from migration/admixture rather than selection, so selection signals must be detected as locus-specific deviations from genome-wide shifts.
- •Natural selection should show systematic frequency changes beyond chance
- •Environmental transitions (farming, animals, climate) create new pressures
- •Most frequency change is drift + population turnover, not adaptation
- •Migration events create huge genome-wide swings that obscure selection
- •Best selection detection occurs in relatively stable periods between migrations
A surprising result: selection is widespread but usually subtle
Reich summarizes the headline finding: the genome is ‘vibrating’ with selection even if selection explains only a small fraction of total allele-frequency movement. They estimate thousands of candidate selected positions, with varying confidence thresholds, revealing selection is far from quiescent in the last 10k years.
- •~98% of frequency change is drift/structure; selection is ~2% but pervasive
- •Hundreds of sites show very strong evidence; thousands more are plausible
- •Using probability thresholds: ~479 independent high-confidence sites; ~7,200 at 50% confidence (≈3,600 real)
- •Many weaker effects likely exist below current detection power
What traits are most targeted: immunity and metabolism dominate; behavior is harder
By intersecting selection signals with GWAS traits, the strongest enrichment appears for immune-related variants and metabolic traits (obesity/diabetes-related). Behavioral and psychiatric traits show less enrichment in the ‘top hits’ not because they aren’t selected, but because they are extremely polygenic with tiny per-variant effects.
- •Strong 4–5× enrichment for immune traits among selected loci
- •Notable enrichment also for metabolic traits (BMI, diabetes risk)
- •Little enrichment among strongest hits for behavioral/psychiatric traits
- •Explanation: immune traits often have fewer, larger-effect loci; behavior is highly polygenic
- •They argue there is still selection on behavioral traits, but power is limited
The Bronze Age inflection: accelerated selection in the last ~5,000 years
A major theme is that selection intensifies in the Bronze Age and afterward, more than during the initial adoption of farming. Reich frames this as a ‘shock’ from rising population density, new pathogens, animal proximity, and intensified social/technological changes, creating evolutionary mismatch and rapid adaptation.
- •Selection on immune and metabolic traits accelerates after ~5,000 years ago
- •High-density living + animal contact changes disease ecology dramatically
- •Farming begins earlier, but the strongest genomic response appears later
- •Idea of evolutionary mismatch: hunter-gatherer genomes in farmer/urban environments
- •Challenges the ‘big transition is farming’ cartoon—genome says Bronze Age mattered more
Concrete examples of shifting selection pressures (TB, FADS, ABO, iron, pigmentation)
They walk through specific loci showing striking time dynamics, including reversals where an allele rises then falls. Examples include TB risk (TIC2), dietary fat metabolism (FADS1/2), ABO blood groups, hemochromatosis-related variants, and depigmentation timing peaking around 4,000–2,000 years ago.
- •TIC2 allele: rises to ~9–10% then declines sharply in last ~3,000 years (possible TB-related reversal)
- •FADS1/2: diet-related selection connected to plant vs meat fatty-acid processing
- •ABO system: B rises at expense of A despite ancient balancing history across primates
- •Hemochromatosis-associated variants show reversal around Bronze/Iron Age
- •Strongest depigmentation selection occurs ~4,000–2,000 years ago, then weakens
Polygenic selection on cognition/education signals—and why interpretation is tricky
Reich claims strong polygenic selection on predictors of cognitive performance and years-of-schooling, peaking in the Bronze Age and largely absent in the last 2,000 years. They stress the measured predictors may proxy broader traits (executive function, planning, fertility timing) rather than ‘IQ’ per se, and validate robustness using cross-population GWAS comparisons.
- •Polygenic score shifts ~1 SD over 10,000 years within ancestry-stable strata; strongest in Bronze Age windows
- •Selection signal largely disappears in the most recent ~2,000 years
- •Migration causes huge apparent jumps in polygenic scores; their method aims to isolate consistent directional change
- •Modern GWAS predictors correlate with many traits (fertility timing, BMI, walking pace, wealth)
- •Cross-check: variants affecting schooling in China correlate with European ancient trajectories, arguing against GWAS artifact
Why evolution didn’t ‘max out’ intelligence: tradeoffs, changing optima, and fertility dynamics
They explore why seemingly universally useful traits might not monotonically increase. Reich speculates that selection may act on multidimensional tradeoffs (e.g., quality vs quantity of offspring investment), and that some psychiatric risk alleles could be linked to advantageous subclinical traits in certain cultural contexts.
- •Modern societies uniquely valorize measured ‘intelligence’—past values differed
- •Complex trait correlations suggest selection might target broader life-history strategies
- •Example: Iceland shows recent selection against education-linked polygenic scores over ~100 years (likely via fertility patterns)
- •Possible toggles: many children/low investment vs fewer children/high investment
- •Speculation: psychiatric-spectrum traits might confer benefits (creativity, vision-oriented roles) in some contexts
Selection on body fat and metabolism: the ‘thrifty genes’ debate and timescales
They discuss evidence for selection against obesity/BMI-associated variants over the last 10,000 years in Europe/Middle East. Reich connects this to the thrifty genes hypothesis and argues the relevant stability may be short-term food access (boom-bust hunting) rather than multi-year famine dynamics common in agricultural societies.
- •Polygenic trend: reduced genetic risk for obesity, fat mass, waist-hip ratio, type 2 diabetes
- •Thrifty genes hypothesis: agriculture changes food availability and fat-storage benefits
- •Europeans appear more genetically protected from T2D than some recently agricultural populations
- •Hunter-gatherer boom-bust eating may favor short-horizon fat storage
- •Agricultural famines occur on different timescales than fat reserves can buffer
Time vs population size: why bigger Bronze Age populations aren’t the main explanation
Dwarkesh asks whether Bronze Age population growth made selection more effective by generating more mutations and overcoming drift. Reich argues strong selection (≈0.5–1%+) works even in small populations; for weak selection that depends on huge population size, the timescales would be far too long to matter here—time is the binding constraint, not N.
- •Mutation supply isn’t limiting once populations reach ~10^6 scale; far earlier than Bronze Age
- •Selection with ~1% coefficients operates effectively even at N~1,000–10,000
- •Selection weak enough to be N-limited would take 10^4–10^5 generations to matter
- •Bronze Age changes likely reflect environmental/cultural shifts, not just larger N
- •Key principle: for these effects, time dominates over population size
Why no farming before the Holocene: climate stability as the missing ingredient
Despite genetic ‘readiness,’ agriculture appears only after ~12,000 years ago and then emerges independently in multiple regions. Reich highlights a puzzling claim from climate science: the Holocene brought unusual long-run climate stability compared with the prior two million years, potentially enabling sustained cultivation and domestication.
- •Genetic/cognitive toolkit likely existed tens to hundreds of thousands of years earlier
- •Agriculture arises only in the Holocene and independently in many regions
- •Proposed key factor: reduced climatic volatility over years/decades/centuries
- •This remains an ‘outstanding mystery’ given diverse environments of independent origins
- •Rejects idea that early farming existed but vanished without trace—archaeology would show it
The Neanderthal puzzle: genomes say one thing, archaeology says another
Reich describes a persistent tension: genome-wide, Denisovans and Neanderthals are sisters, yet Neanderthals share many cultural and genetic features with modern humans. He focuses on anomalies like Neanderthal mitochondrial DNA and Y chromosomes clustering with modern humans, plus shared Middle Paleolithic/Levallois technology absent in East Asia.
- •Genome-wide tree: Neanderthal–Denisovan split from modern humans ~700–800kya; N–D split ~500–600kya
- •But Neanderthals share Levallois/Middle Paleolithic traditions with modern humans
- •Neanderthal mtDNA and Y chromosome look ‘modern-human-like’ (from ancient gene flow)
- •Evidence of multiple admixture events complicates simple branching models
- •Question: are Neanderthals ‘more modern’ culturally/biologically than the standard model implies?
A new speculative model: an early modern-human expansion reshaping Neanderthals (and us)
In a whiteboard-style explanation, Reich proposes an alternative framing: a Middle Stone Age/Levallois innovation spread via an expansion that mixed heavily with local Eurasian archaics, leaving small genome-wide ancestry but potentially replacing uniparental lineages and transmitting culture. He draws an analogy to epicycles—suggesting current models may be over-patched—and notes parallel evidence for deep African substructure and admixture into modern humans.
- •Hypothesis: Levallois revolution originates in a population that expands into Europe and Africa ~200–300kya
- •Expansion could be genetically small in final Neanderthal genome (≈5%) yet culturally decisive, like tracer-dye migrations
- •Uniparental lineage replacement (mtDNA/Y) could occur via selection or social dynamics; explains anomalies more parsimoniously (speculative)
- •Modern humans may also reflect ancient African admixture from deeply diverged lineages (~1.5 My separation) converging ~200–300kya
- •Critique of status quo: accumulating ad hoc fixes vs searching for simpler generative histories
Methodological breakthrough: predicting genotypes from relatedness, then testing for selection
Reich returns to the technical core: they model expected allele states using a relatedness matrix across ~22k individuals (ancient + modern), capturing drift and ancestry shifts, then test whether adding a consistent directional-selection term improves prediction. Large data scale plus this statistical framing yields far more power than earlier ‘ancient vs modern frequency difference’ scans.
- •Dataset: ~16k ancient individuals across ~18k years; ~22k total including modern
- •Key idea: predict each locus from genome-wide relatedness (controls for migrations/structure)
- •Then test whether a constant selection coefficient explains residual directional change better
- •Constant-selection assumption is simplistic but useful as a detection test
- •Earlier methods hit an apparent ceiling (dozens of hits); this approach yields hundreds-to-thousands
Validating signals via GWAS enrichment; guarding against background selection
To calibrate which selection statistics are likely real, they use an external validation: enrichment for GWAS-associated loci rises with selection score and plateaus, implying high scores are mostly true positives. They test alternative explanations like background selection (purifying selection near genes) by stratifying genomic regions to show the enrichment persists.
- •External check: GWAS trait-associated loci become 4–5× enriched at high selection scores
- •Selection statistic behaves like a (roughly) Gaussian ‘SDs from zero selection’ measure
- •Plateau above ~5 suggests near-all signals at that level are genuine
- •Concern: background selection could create spurious enrichment near genes
- •Controls: stratify by background-selection intensity and allele frequency; pattern remains
How ancient DNA scaled: cheaper sequencing + capture enrichment + industrialized pipelines
Reich explains the practical innovations enabling the new sample sizes: dramatic sequencing cost declines, plus in-solution capture that enriches human DNA from microbe-dominated remains. Roboticized pipelines and targeted panels made it economical to generate thousands of samples per year, transforming what questions the field can answer.
- •Ancient samples often contain <1–10% human DNA; most is microbial colonization
- •In-solution capture ‘washes’ libraries over synthetic baits targeting informative sites
- •Target panels include variable loci and biologically interesting GWAS-linked regions
- •Industrialization/robotics: labs now produce thousands of genome-scale ancient datasets yearly
- •Scale shift: from ~10 human genomes (2010) to >20,000 reported ancient sequences today