Skip to content
a16za16z

Google DeepMind Lead Researchers on Genie 3 & the Future of World-Building

Genie 3 can generate fully interactive, persistent worlds from just text, in real time. In this episode, Google DeepMind’s Jack Parker-Holder (Research Scientist) and Shlomi Fruchter (Research Director) join Anjney Midha, Marco Mascorro, and Justine Moore of a16z, with host Erik Torenberg, to discuss how they built it, the breakthrough “special memory” feature, and the future of AI-powered gaming, robotics, and world models. They share: - How Genie 3 generates interactive environments in real time - Why its “special memory” feature is such a breakthrough - The evolution of generative models and emergent behaviors - Instruction following, text adherence, and model comparisons - Potential applications in gaming, robotics, simulation, and more - What’s next: Genie 4, Genie 5, and the future of world models This conversation offers a first-hand look at one of the most advanced world models ever created. Timecodes: 0:00 Introduction 0:29 The Evolution of Generative Models 1:10 Real-Time Interactivity & User Experience 4:35 Applications and Use Cases 8:15 The Importance of Special Memory 13:12 Emergent Behaviors & Model Capabilities 19:45 Instruction Following & Text Adherence 20:48 Comparing Genie 3 and Other Models 21:56 The Future of World Models & Modalities 32:23 Robotics, Simulation, and Real-World Impact 37:58 Looking Ahead: Genie 4, 5, and Future World Models 40:41 Are We Living in a Simulation? Resources: Find Shlomi on X: https://x.com/shlomifruchter Find Jack on X: https://x.com/jparkerholder Find Anjney on X: https://x.com/anjneymidha Find Justine on X: https://x.com/venturetwins Find Marco on X: https://x.com/Mascobot Stay Updated: Let us know what you think: https://ratethispodcast.com/a16z Find a16z on Twitter: https://twitter.com/a16z Find a16z on LinkedIn: https://www.linkedin.com/company/a16z Subscribe on your favorite podcast app: https://a16z.simplecast.com/ Follow our host: https://x.com/eriktorenberg Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details, please see a16z.com/disclosures.

Shlomi FruchterguestJack Parker-HolderguestErik TorenberghostMarco MascorrohostJustine MoorehostAnjney Midhahost
Aug 16, 202542mWatch on YouTube ↗

Episode Details

EPISODE INFO

Released
August 16, 2025
Duration
42m
Channel
a16z
Watch on YouTube
▶ Open ↗

EPISODE DESCRIPTION

Genie 3 can generate fully interactive, persistent worlds from just text, in real time. In this episode, Google DeepMind’s Jack Parker-Holder (Research Scientist) and Shlomi Fruchter (Research Director) join Anjney Midha, Marco Mascorro, and Justine Moore of a16z, with host Erik Torenberg, to discuss how they built it, the breakthrough “special memory” feature, and the future of AI-powered gaming, robotics, and world models. They share:

  • How Genie 3 generates interactive environments in real time
  • Why its “special memory” feature is such a breakthrough
  • The evolution of generative models and emergent behaviors
  • Instruction following, text adherence, and model comparisons
  • Potential applications in gaming, robotics, simulation, and more
  • What’s next: Genie 4, Genie 5, and the future of world models

This conversation offers a first-hand look at one of the most advanced world models ever created. Timecodes: 0:00 Introduction 0:29 The Evolution of Generative Models 1:10 Real-Time Interactivity & User Experience 4:35 Applications and Use Cases 8:15 The Importance of Special Memory 13:12 Emergent Behaviors & Model Capabilities 19:45 Instruction Following & Text Adherence 20:48 Comparing Genie 3 and Other Models 21:56 The Future of World Models & Modalities 32:23 Robotics, Simulation, and Real-World Impact 37:58 Looking Ahead: Genie 4, 5, and Future World Models 40:41 Are We Living in a Simulation? Resources: Find Shlomi on X: https://x.com/shlomifruchter Find Jack on X: https://x.com/jparkerholder Find Anjney on X: https://x.com/anjneymidha Find Justine on X: https://x.com/venturetwins Find Marco on X: https://x.com/Mascobot Stay Updated: Let us know what you think: https://ratethispodcast.com/a16z Find a16z on Twitter: https://twitter.com/a16z Find a16z on LinkedIn: https://www.linkedin.com/company/a16z Subscribe on your favorite podcast app: https://a16z.simplecast.com/ Follow our host: https://x.com/eriktorenberg Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details, please see a16z.com/disclosures.

SPEAKERS

  • Shlomi Fruchter

    guest

    Google DeepMind researcher and lead on Veo 3 work, discussing Genie 3 and real-time world models.

  • Jack Parker-Holder

    guest

    Google DeepMind researcher working on Genie 3, focusing on interactive environment/world models and RL motivations.

  • Erik Torenberg

    host

    a16z podcast host and interviewer moderating the panel conversation.

  • Marco Mascorro

    host

    a16z panelist/interviewer asking technical and forward-looking questions about Genie 3 and robotics.

  • Justine Moore

    host

    a16z panelist/interviewer engaging guests on use cases like filmmaking, games, and model capabilities.

  • Anjney Midha

    host

    a16z panelist and primary interviewer directing questions to Jack and Shlomi about Genie 3 vs Veo and agents/robotics.

EPISODE SUMMARY

In this episode of a16z, featuring Shlomi Fruchter and Jack Parker-Holder, Google DeepMind Lead Researchers on Genie 3 & the Future of World-Building explores deepMind’s Genie 3 enables real-time interactive worlds with lasting memory Genie 3 combines multiple internal research threads (Genie 2-style world generation, GameNGen-style real-time simulation, and Veo-era text adherence) into a single interactive, real-time “world model.”

RELATED EPISODES

The Golden Age Thesis | Marc Andreessen on MTS

The Golden Age Thesis | Marc Andreessen on MTS

The Investor Behind Costco, Starbucks, and Blackstone | Tony James on The a16z Show

The Investor Behind Costco, Starbucks, and Blackstone | Tony James on The a16z Show

Digital Freedom, AI Regulation, and the Fight for the Western Internet | The a16z Show

Digital Freedom, AI Regulation, and the Fight for the Western Internet | The a16z Show

Crypto Experts Explain Stablecoins & the Future Financial System w/ Ali Yahya & Arianna Simpson

Crypto Experts Explain Stablecoins & the Future Financial System w/ Ali Yahya & Arianna Simpson

Why Scale Will Not Solve AGI | Vishal Misra - The a16z Show

Why Scale Will Not Solve AGI | Vishal Misra - The a16z Show

Emil Michael: The Department of War Is Moving Faster Than Silicon Valley on AI | The a16z Show

Emil Michael: The Department of War Is Moving Faster Than Silicon Valley on AI | The a16z Show

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome