The Neuroscience of Speech, Language & Music | Dr. Erich Jarvis

My guest this episode is Dr. Erich Jarvis, PhD—Professor and Head of the Laboratory of Neurogenetics of Language at Rockefeller University and Investigator with the Howard Hughes Medical Institute (HHMI). Dr. Jarvis' research spans the molecular and genetic mechanisms of vocal communication, comparative genomics of speech and language across species and the relationship between speech, language and movement. We discuss the unique ability of humans (and certain animal species) to learn and communicate using complex language, including verbal speech production and the ability to interpret both written and spoken language. We also discuss the connections between language, singing and dance, and why song may have evolved before language. Dr. Jarvis also explains some of the underlying biological and genetic components of stuttering/speech disorders, non-verbal communication, why it's easiest to learn a language as a child and how individuals can learn multiple languages at any age. This episode ought to be of interest to everyone interested in the origins of human speech, language, music and culture and how newer technology, such as social media and texting, changes our brains. Thank you to our sponsors AG1 (Athletic Greens): https://athleticgreens.com/huberman InsideTracker: https://insidetracker.com/huberman ROKA: https://roka.com/huberman LMNT: https://drinklmnt.com/huberman Supplements from Momentous https://www.livemomentous.com/huberman Social & Website Instagram: https://www.instagram.com/hubermanlab Twitter: https://twitter.com/hubermanlab Facebook: https://www.facebook.com/hubermanlab TikTok: https://www.tiktok.com/@hubermanlab LinkedIn: https://www.linkedin.com/in/andrew-hu... Website: https://hubermanlab.com Newsletter: https://hubermanlab.com/neural-network Dr. Erich Jarvis Dr. Erick Jarvis' Lab: https://www.jarvislab.net Rockefeller University: https://www.rockefeller.edu/our-scientists/heads-of-laboratories/1159-erich-d-jarvis Google Scholar: https://scholar.google.com/citations?user=cI-fi9MAAAAJ Twitter: https://twitter.com/erichjarvis Instagram: https://www.instagram.com/erich_d_jarvis LinkedIn: https://www.linkedin.com/in/erich-jarvis-ba73624 Other Resources: Earth Biogenome Project: https://www.earthbiogenome.org GenomeArk: https://vgp.github.io/genomeark Timestamps 00:00:00 Dr. Erich Jarvis & Vocal Communication 00:03:43 Momentous Supplements 00:04:36 InsideTracker, ROKA, LMNT 00:08:01 Speech vs. Language, Is There a Difference? 00:10:55 Animal Communication, Hand Gestures & Language 00:15:25 Vocalization & Innate Language, Evolution of Modern Language 00:21:10 Humans & Songbirds, Critical Periods, Genetics, Speech Disorders 00:27:11 Innate Predisposition to Learn Language, Cultural Hybridization 00:31:34 Genes for Speech & Language 00:35:49 Learning New or Multiple Languages, Critical Periods, Phonemes 00:41:39 AG1 (Athletic Greens) 00:42:52 Semantic vs. Effective Communication, Emotion, Singing 00:47:32 Singing, Link Between Dancing & Vocal Learning 00:52:55 Motor Theory of Vocal Learning, Dance 00:55:03 Music & Dance, Emotional Bonding, Genetic Predispositions 01:04:11 Facial Expressions & Language, Innate Expressions 01:09:35 Reading & Writing 01:15:13 Writing by Hand vs. Typing, Thoughts & Writing 01:20:58 Stutter, Neurogenetics, Overcome Stutter, Conversations 01:26:58 Modern Language Evolution: Texting, Social Media & the Future 01:36:26 Movement: The Link to Cognitive Growth 01:40:21 Comparative Genomics, Earth Biogenome Project, Genome Ark, Conservation 01:48:24 Evolution of Skin & Fur Color 01:51:22 Dr. Erich Jarvis, Zero-Cost Support, YouTube Feedback, Spotify & Apple Reviews, Momentous Supplements, AG1 (Athletic Greens), Instagram, Twitter, Neural Network Newsletter, Huberman Lab Clips The Huberman Lab Podcast is for general informational purposes only and does not constitute the practice of medicine, nursing or other professional health care services, including the giving of medical advice, and no doctor/patient relationship is formed. The use of information on this podcast or materials linked from this podcast is at the user's own risk. The content of this podcast is not intended to be a substitute for professional medical advice, diagnosis, or treatment. Users should not disregard or delay in obtaining medical advice for any medical condition they may have and should seek the assistance of their health care professionals for any such conditions.

Andrew HubermanhostErich Jarvisguest

Aug 29, 20221h 54mWatch on YouTube ↗

CHAPTERS

0:00 – 10:00
Intro, Guest Background, and Episode Framing
Huberman introduces Erich Jarvis, outlining his work on vocal learning, speech disorders, genetics, and the links between language, music, and movement across species. He also briefly explains the podcast’s sponsorships and mission before beginning the main conversation.
10:00 – 18:00
Defining Speech vs. Language and the Myth of a Language Module
Jarvis challenges the traditional separation of speech and language and the concept of a distinct 'language module'. He posits that language computations are embedded directly in speech production and auditory perception circuits, with vocal learning as the key specialization.
18:00 – 29:00
Animal Communication, Gestures, and Evolutionary Parallels
The discussion broadens to animal communication systems, gestural language, and the evolutionary relationship between hand/body movements and speech. Jarvis explains why some species excel at gestures more than vocal imitation and how humans unconsciously gesture while speaking.
29:00 – 38:00
Innate vs. Learned Vocalizations and the Evolutionary Origins of Speech
Jarvis distinguishes innate vocalizations from learned vocal communication, highlighting vocal learning as the rare trait underlying spoken language. He then uses genomic data from Neanderthals and other hominins to argue that spoken language is likely at least hundreds of thousands of years old.
38:00 – 42:00
Songbirds, Brain Homologies, and Genetic Convergence
Huberman and Jarvis explore the songbird model of vocal learning, including critical periods, deafening effects, and specialized song nuclei. Jarvis describes strong behavioral and genetic convergence between human speech circuits and bird song circuits despite deep evolutionary divergence.
42:00 – 54:00
Hummingbirds, Multimodal Signaling, and Motor Demands of Speech
Hummingbirds are presented as extreme examples of small-brained but highly specialized vocal learners that integrate wing-produced sounds with vocal song. Jarvis then pivots to what speech-related genes actually do in the brain, revealing unexpected roles in connectivity, protection, and plasticity.
54:00 – 1:06:00
Critical Periods, Bilingualism, and What Makes Language Easier for Kids
The conversation turns to why children more easily acquire languages and accents, how critical periods shape phoneme repertoires, and whether early bilingualism eases later language learning. Jarvis emphasizes universal critical period dynamics across brain systems with especially marked effects in speech.
1:06:00 – 1:16:00
Emotion, Semantics, Music, and Lateralization of Communication
Jarvis distinguishes semantic (meaning-based) and affective (emotion-based) communication, noting that both often use the same speech and auditory circuits but with differing emotional inputs and lateralization. He discusses how singing and music likely predate and scaffolded semantic speech.
1:16:00 – 1:35:00
Jarvis’s Path from Dance to Neuroscience and the Motor Theory of Vocal Learning
Jarvis recounts his personal history as a serious dancer and musician in a musically gifted family, and how his interest in dance led him to study the brain. He then integrates this with modern findings that vocal-learning circuits are embedded in motor circuits, forming the basis of his motor theory of vocal learning origin and its implications for dance.
1:35:00 – 1:47:00
Motor Control, Genetics of Talent, and the Role of Movement in Cognition
The discussion explores inter-individual differences in dance and singing ability, possible genetic contributions, and how motor activity supports cognition. Jarvis highlights that continuous movement (e.g., dance, walking, running) can help maintain brain health and thinking capacity into old age.
1:47:00 – 1:57:00
Facial Expressions, Gestures, and Multimodal Communication
Huberman and Jarvis examine how facial expressions, hand gestures, and speech interact in communication. Jarvis explains shared and distinct motor pathways for face and voice, the innate and learned components of expressions, and why gestures are so hard to suppress when we speak.
1:57:00 – 2:10:00
From Thought to Speech to Reading and Writing
Jarvis lays out a concrete neural model of how we go from visual symbols to internal speech to comprehension and then to writing. He explains why reading silently still activates laryngeal muscles and why writing speed must align with internal speech for fluent composition.
2:10:00 – 2:21:00
Texting, Shorthand, and the Evolution of Modern Language Use
The conversation shifts to texting, shorthand (OMG, WTF), and social media. Jarvis frames these not as degradations of language but as new forms of rapid written communication with different cognitive demands and social risks, especially given the very short delay between thought and public distribution.
2:21:00 – 2:30:00
Stuttering, Sensory–Motor Integration, and Therapeutic Approaches
Jarvis describes how his lab unintentionally modeled stuttering in songbirds via basal ganglia damage and recovery, and how these findings parallel human neurogenic stuttering. He emphasizes the role of basal ganglia and sensory–motor integration and touches on behavioral therapies.
2:30:00 – 2:38:00
Brain–Computer Interfaces and Reading Out Internal Speech
Huberman raises emerging work translating brain activity directly into text or synthesized speech in paralyzed patients. Jarvis notes that these approaches leverage the same speech circuits used for internal speech and raises ethical and practical questions about future mind-to-text systems.
2:38:00 – 2:48:00
Practical Implications: Improving Speech, Language, and Brain Health
Huberman asks about practical tools for enhancing speech and language. Jarvis emphasizes the importance of continued movement and vocal use—dance, walking, running, singing, or speaking—to maintain healthy motor and cognitive circuits, suggesting that these activities likely support overall brain function.
2:48:00 – 3:06:00
Global Genome Projects, Vocal Learning, and Conservation
The final major segment focuses on Jarvis’s leadership in large-scale genome projects. He explains why complete, error-free genomes across species are crucial for understanding trait evolution (including vocal learning) and outlines their use in conservation and potential species resurrection.
3:06:00
Closing Reflections and Future Directions
Huberman closes by reflecting on the surprising continuity between human language and animal song and movement. He thanks Jarvis for his work and hints at rapidly unfolding advances in genomics, speech neuroscience, and conservation that will likely merit future discussion.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome

Intro, Guest Background, and Episode Framing

Defining Speech vs. Language and the Myth of a Language Module

Animal Communication, Gestures, and Evolutionary Parallels

Innate vs. Learned Vocalizations and the Evolutionary Origins of Speech

Songbirds, Brain Homologies, and Genetic Convergence

Hummingbirds, Multimodal Signaling, and Motor Demands of Speech

Critical Periods, Bilingualism, and What Makes Language Easier for Kids

Emotion, Semantics, Music, and Lateralization of Communication

Jarvis’s Path from Dance to Neuroscience and the Motor Theory of Vocal Learning

Motor Control, Genetics of Talent, and the Role of Movement in Cognition

Facial Expressions, Gestures, and Multimodal Communication

From Thought to Speech to Reading and Writing

Texting, Shorthand, and the Evolution of Modern Language Use

Stuttering, Sensory–Motor Integration, and Therapeutic Approaches

Brain–Computer Interfaces and Reading Out Internal Speech

Practical Implications: Improving Speech, Language, and Brain Health

Global Genome Projects, Vocal Learning, and Conservation

Closing Reflections and Future Directions

Get more out of YouTube videos.