Lex Fridman Podcast

Dmitry Korkin: Evolution of Proteins, Viruses, Life, and AI | Lex Fridman Podcast #153

Lex Fridman and Dmitry Korkin on decoding Proteins, Viruses, and AI: From Spike to AlphaFold.

Lex FridmanhostDmitry Korkinguest
Jan 11, 20212h 12m
Modular structure and evolution of proteins (domains, linkers, alternative splicing)Structural biology of SARS‑CoV‑2: spike, M, N, and E proteins and viral assemblyViral evolution, mutation, host jumps, and implications for vaccines and treatmentsAI and protein folding: CASP, AlphaFold/AlphaFold2, and limits of current approachesMachine learning for protein/virus design and biosecurity concernsOrigin and prevalence of life in the universe; rare Earth vs. ubiquitous lifeHistory of AI in biology (Joshua Lederberg, Dendral, expert systems) and broader AI milestones

In this episode of Lex Fridman Podcast, featuring Lex Fridman and Dmitry Korkin, Dmitry Korkin: Evolution of Proteins, Viruses, Life, and AI | Lex Fridman Podcast #153 explores decoding Proteins, Viruses, and AI: From Spike to AlphaFold Lex Fridman and bioinformatician Dmitry Korkin explore the modular nature and evolution of proteins, emphasizing domains, linkers, and alternative splicing as key building blocks of biological complexity.

At a glance

WHAT IT’S REALLY ABOUT

Decoding Proteins, Viruses, and AI: From Spike to AlphaFold

  1. Lex Fridman and bioinformatician Dmitry Korkin explore the modular nature and evolution of proteins, emphasizing domains, linkers, and alternative splicing as key building blocks of biological complexity.
  2. They dive deep into the structure and mechanics of SARS‑CoV‑2, focusing on the spike protein, the membrane (M) protein lattice, viral evolution, and how structural understanding can inform vaccines and antiviral strategies.
  3. The conversation then bridges to AI: protein structure prediction, the significance and limits of DeepMind’s AlphaFold, and how machine learning might be used in protein and virus design—alongside the ethical and existential risks.
  4. They close with reflections on the origin and rarity of life, alien biology, historical figures in AI and bioinformatics, the future of AI in science, and personal insights on family, academia, and Russian literature and poetry.

IDEAS WORTH REMEMBERING

7 ideas

Protein domains, not whole proteins, are the core functional and evolutionary units.

Korkin emphasizes that most proteins are composed of multiple domains—modular structural and functional units that get reused, shuffled, and recombined across evolution, making domains a more meaningful 'building block' than entire proteins.

SARS‑CoV‑2’s structure reveals multiple potential therapeutic attack points beyond the spike.

While the spike trimer and its receptor-binding domains mediate entry via ACE2, the more evolutionarily stable membrane (M) protein forms a lattice that organizes the viral envelope and may be a promising, less mutation-prone target for small‑molecule drugs.

Understanding viral evolution is essential for anticipating dangerous mutations and host jumps.

Mutations enable viruses to adapt, cross species, and potentially evade vaccines or treatments; tracking sequence changes across geography and hosts, and modeling their functional impact, may let us forecast which strains or mutations are likely to become problematic.

AlphaFold2 is a transformative tool but has not ‘solved’ protein folding in full.

It achieves near‑experimental accuracy for many single‑domain or compact proteins in CASP benchmarks, yet multi‑domain, highly flexible proteins and multi‑protein complexes remain unsolved, and the fundamental physical mechanism of folding is still not understood.

Domain-specific knowledge remains crucial in modern AI, echoing the spirit of expert systems.

Korkin notes that successful systems like AlphaFold embed detailed biological priors (evolutionary relationships, structural constraints), showing that raw deep learning alone is not enough; structured domain knowledge still drives major gains.

Machine learning can both help and potentially harm in virology and bioengineering.

The same models that predict pathogenicity or structural effects of mutations to aid pandemic preparedness could, in principle, be misused to suggest more dangerous variants—highlighting the need for regulation, transparency, and careful governance.

Scientific and data infrastructure have radically improved our pandemic response.

Compared to SARS, the structural characterization and sequencing of SARS‑CoV‑2 have happened in months instead of years, enabling rapid vaccine design and detailed evolutionary tracking, and illustrating how global scientific collaboration can accelerate under pressure.

WORDS WORTH SAVING

5 quotes

Proteins are no longer considered as a sequence of letters. There are hierarchical complexities in the way these proteins are organized.

Dmitry Korkin

If you’re able to destroy the outer shell, you are essentially destroying the viral particle itself.

Dmitry Korkin

We are very far away from understanding how these multi‑domain proteins are folded.

Dmitry Korkin

AlphaFold is a turning event where you have a machine learning system that is truly better than the more conventional biophysics‑based methods.

Dmitry Korkin

Biology gives you a brain. Life turns it into a mind.

Jeffrey Eugenides, quoted by Lex Fridman

QUESTIONS ANSWERED IN THIS EPISODE

5 questions

How might integrating AlphaFold‑like models with experimental data (e.g., cryo‑EM, NMR) accelerate our understanding of large, flexible, multi‑domain proteins and complexes?

Lex Fridman and bioinformatician Dmitry Korkin explore the modular nature and evolution of proteins, emphasizing domains, linkers, and alternative splicing as key building blocks of biological complexity.

What governance frameworks could balance open scientific progress in AI‑assisted bioengineering with safeguards against misuse for designing more dangerous pathogens?

They dive deep into the structure and mechanics of SARS‑CoV‑2, focusing on the spike protein, the membrane (M) protein lattice, viral evolution, and how structural understanding can inform vaccines and antiviral strategies.

To what extent can modularity in proteins (domains, linkers, alternative splicing) inspire new architectures for adaptive, evolving software agents or AI systems?

The conversation then bridges to AI: protein structure prediction, the significance and limits of DeepMind’s AlphaFold, and how machine learning might be used in protein and virus design—alongside the ethical and existential risks.

If we discovered non‑Earth life based on a different biochemistry, how would that reshape current assumptions in molecular biology and AI models trained on Earth‑centric data?

They close with reflections on the origin and rarity of life, alien biology, historical figures in AI and bioinformatics, the future of AI in science, and personal insights on family, academia, and Russian literature and poetry.

Where is the tipping point at which AI‑driven tools become not just aids to biologists, but primary drivers of hypothesis generation and experimental design in life sciences?

EVERY SPOKEN WORD

Install uListen for AI-powered chat & search across the full episode — Get Full Transcript

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome