Dmitry Korkin: Evolution of Proteins, Viruses, Life, and AI | Lex Fridman Podcast #153

Dmitry Korkin: Evolution of Proteins, Viruses, Life, and AI | Lex Fridman Podcast #153

Lex Fridman PodcastJan 11, 20212h 12m

Lex Fridman (host), Dmitry Korkin (guest), Narrator

Modular structure and evolution of proteins (domains, linkers, alternative splicing)Structural biology of SARS‑CoV‑2: spike, M, N, and E proteins and viral assemblyViral evolution, mutation, host jumps, and implications for vaccines and treatmentsAI and protein folding: CASP, AlphaFold/AlphaFold2, and limits of current approachesMachine learning for protein/virus design and biosecurity concernsOrigin and prevalence of life in the universe; rare Earth vs. ubiquitous lifeHistory of AI in biology (Joshua Lederberg, Dendral, expert systems) and broader AI milestones

In this episode of Lex Fridman Podcast, featuring Lex Fridman and Dmitry Korkin, Dmitry Korkin: Evolution of Proteins, Viruses, Life, and AI | Lex Fridman Podcast #153 explores decoding Proteins, Viruses, and AI: From Spike to AlphaFold Lex Fridman and bioinformatician Dmitry Korkin explore the modular nature and evolution of proteins, emphasizing domains, linkers, and alternative splicing as key building blocks of biological complexity.

Decoding Proteins, Viruses, and AI: From Spike to AlphaFold

Lex Fridman and bioinformatician Dmitry Korkin explore the modular nature and evolution of proteins, emphasizing domains, linkers, and alternative splicing as key building blocks of biological complexity.

They dive deep into the structure and mechanics of SARS‑CoV‑2, focusing on the spike protein, the membrane (M) protein lattice, viral evolution, and how structural understanding can inform vaccines and antiviral strategies.

The conversation then bridges to AI: protein structure prediction, the significance and limits of DeepMind’s AlphaFold, and how machine learning might be used in protein and virus design—alongside the ethical and existential risks.

They close with reflections on the origin and rarity of life, alien biology, historical figures in AI and bioinformatics, the future of AI in science, and personal insights on family, academia, and Russian literature and poetry.

Key Takeaways

Protein domains, not whole proteins, are the core functional and evolutionary units.

Korkin emphasizes that most proteins are composed of multiple domains—modular structural and functional units that get reused, shuffled, and recombined across evolution, making domains a more meaningful 'building block' than entire proteins.

Get the full analysis with uListen AI

SARS‑CoV‑2’s structure reveals multiple potential therapeutic attack points beyond the spike.

While the spike trimer and its receptor-binding domains mediate entry via ACE2, the more evolutionarily stable membrane (M) protein forms a lattice that organizes the viral envelope and may be a promising, less mutation-prone target for small‑molecule drugs.

Get the full analysis with uListen AI

Understanding viral evolution is essential for anticipating dangerous mutations and host jumps.

Mutations enable viruses to adapt, cross species, and potentially evade vaccines or treatments; tracking sequence changes across geography and hosts, and modeling their functional impact, may let us forecast which strains or mutations are likely to become problematic.

Get the full analysis with uListen AI

AlphaFold2 is a transformative tool but has not ‘solved’ protein folding in full.

It achieves near‑experimental accuracy for many single‑domain or compact proteins in CASP benchmarks, yet multi‑domain, highly flexible proteins and multi‑protein complexes remain unsolved, and the fundamental physical mechanism of folding is still not understood.

Get the full analysis with uListen AI

Domain-specific knowledge remains crucial in modern AI, echoing the spirit of expert systems.

Korkin notes that successful systems like AlphaFold embed detailed biological priors (evolutionary relationships, structural constraints), showing that raw deep learning alone is not enough; structured domain knowledge still drives major gains.

Get the full analysis with uListen AI

Machine learning can both help and potentially harm in virology and bioengineering.

The same models that predict pathogenicity or structural effects of mutations to aid pandemic preparedness could, in principle, be misused to suggest more dangerous variants—highlighting the need for regulation, transparency, and careful governance.

Get the full analysis with uListen AI

Scientific and data infrastructure have radically improved our pandemic response.

Compared to SARS, the structural characterization and sequencing of SARS‑CoV‑2 have happened in months instead of years, enabling rapid vaccine design and detailed evolutionary tracking, and illustrating how global scientific collaboration can accelerate under pressure.

Get the full analysis with uListen AI

Notable Quotes

Proteins are no longer considered as a sequence of letters. There are hierarchical complexities in the way these proteins are organized.

Dmitry Korkin

If you’re able to destroy the outer shell, you are essentially destroying the viral particle itself.

Dmitry Korkin

We are very far away from understanding how these multi‑domain proteins are folded.

Dmitry Korkin

AlphaFold is a turning event where you have a machine learning system that is truly better than the more conventional biophysics‑based methods.

Dmitry Korkin

Biology gives you a brain. Life turns it into a mind.

Jeffrey Eugenides, quoted by Lex Fridman

Questions Answered in This Episode

How might integrating AlphaFold‑like models with experimental data (e.g., cryo‑EM, NMR) accelerate our understanding of large, flexible, multi‑domain proteins and complexes?

Lex Fridman and bioinformatician Dmitry Korkin explore the modular nature and evolution of proteins, emphasizing domains, linkers, and alternative splicing as key building blocks of biological complexity.

Get the full analysis with uListen AI

What governance frameworks could balance open scientific progress in AI‑assisted bioengineering with safeguards against misuse for designing more dangerous pathogens?

They dive deep into the structure and mechanics of SARS‑CoV‑2, focusing on the spike protein, the membrane (M) protein lattice, viral evolution, and how structural understanding can inform vaccines and antiviral strategies.

Get the full analysis with uListen AI

To what extent can modularity in proteins (domains, linkers, alternative splicing) inspire new architectures for adaptive, evolving software agents or AI systems?

The conversation then bridges to AI: protein structure prediction, the significance and limits of DeepMind’s AlphaFold, and how machine learning might be used in protein and virus design—alongside the ethical and existential risks.

Get the full analysis with uListen AI

If we discovered non‑Earth life based on a different biochemistry, how would that reshape current assumptions in molecular biology and AI models trained on Earth‑centric data?

They close with reflections on the origin and rarity of life, alien biology, historical figures in AI and bioinformatics, the future of AI in science, and personal insights on family, academia, and Russian literature and poetry.

Get the full analysis with uListen AI

Where is the tipping point at which AI‑driven tools become not just aids to biologists, but primary drivers of hypothesis generation and experimental design in life sciences?

Get the full analysis with uListen AI

Transcript Preview

Lex Fridman

The following is a conversation with Dmitry Korkin, his second time on the podcast. He's a professor of bioinformatics and computational biology at WPI, where he specializes in bioinformatics of complex disease, computational genomics, systems biology, and biomedical data analytics. He loves biology, he loves computing, plus he is Russian and recites a poem in Russian at the end of the podcast. What else could you possibly ask for in this world? Quick mention of our sponsors: Brave Browser, NetSuite business management software, Magic Spoon low carb cereal, and Eight Sleep self-cooling mattress. So the choice is browsing privacy, business success, healthy diet, or comfortable sleep. Choose wisely, my friends, and if you wish, click the sponsor links below to get a discount and to support this podcast. As a side note, let me say that to me, the scientists that did the best apolitical, impactful, brilliant work of 2020 are the biologists who study viruses without an agenda, without much sleep, to be honest, just a pure passion for scientific discovery and exploration of the mysteries within viruses. Viruses are both terrifying and beautiful. Terrifying because they can threaten the fabric of human civilization, both biological and psychological. Beautiful because they give us insights into the nature of life on Earth, and perhaps even extraterrestrial life of the not-so-intelligent variety that might meet us one day as we explore the habitable planets and moons in our universe. If you enjoy this thing, subscribe on YouTube, review it on Apple Podcasts, follow on Spotify, support on Patreon, or connect with me on Twitter @LexFridman. And now, here's my conversation with Dmitry Korkin. It's often said that proteins and, uh, the amino acid residues that make them up are the building blocks of life. Do you think of proteins in this way, as the, uh, basic building blocks of life?

Dmitry Korkin

Yes and no. So the proteins indeed is the, the basic unit, biological unit, that carries out, uh, important function of the cell. However, through studying the proteins, and comparing the proteins across different species, across dis- different kingdoms, you realize that, uh, proteins are actually a more, a much more complicated, uh, so they have, um, so-called modular complexity. And so, uh, what I mean by that is, um, an average protein consists of, um, of several structural units. So we call them, uh, protein domains. And so you can imagine a protein as a string of beads, where each bead is a protein domain. And, uh, you know, in the past 20 years, scientists have been studying, uh, the nature of the protein domains, 'cause, uh, we realized that it's, it's, it's the unit. Because if you look at the functions, right? So, so, uh, many proteins have more than one function, and those, uh, protein functions, uh, are often carried out by those protein domains. So, um, we also see that, uh, in the evolution, those proteins' domains get shuffled. So, so they act actually as, as a unit. Also from the structural perspective, right? So, you know, y- uh, some people think of, uh, a protein as a sort of a globular, um, molecule, but as a matter of fact, is, is, uh, the globular part of this protein is a protein domain. So we, we often have this, uh, you know, again, the, the, the, uh, collection of these protein domains, um, align, uh, on a string as beads.

Install uListen to search the full transcript and get AI-powered insights

Get Full Transcript

Get more from every podcast

AI summaries, searchable transcripts, and fact-checking. Free forever.

Add to Chrome