This Week in AI: GPT-5 Ships, 4o Pulled Back, Grok Imagine Goes Social

a16z partners Olivia and Justine Moore unpack the latest in consumer AI including: - Grok’s “Imagine” and its instant, social-first creative tools - Google’s Genie 3 and the future of 3D worlds - GPT-5: what’s new and missing, plus why some want their old chatbot back - AI-generated music from ElevenLabs - Olivia’s vibecoded Jensen Huang selfie app Timecodes: 0:00 Introduction & This Week's Topics 0:24 Grok Imagine: Social AI Image & Video Generation 4:48 GPT-5 Release & GPT-4 Deprecation 5:36 Comparing GPT-5 and GPT-4: Coding vs. Personality 9:13 AI for Mental Health: Illinois Law & Industry Impact 12:29 Genie 3: Interactive World Models from Google 16:53 ElevenLabs Music Model: Licensed AI Music Generation 19:16 Vibecoding: Consumer Experiments & Platform Evolution 24:14 The Future of Vibecoding & AI Tools 27:05 Conclusions Resources: Find Olivia on X: https://x.com/omooretweets Find Justine on X: https://x.com/venturetwins Read Anish and Justine’s vibecoding post: https://a16z.com/specialized-app-gen-platforms/ Stay Updated: Let us know what you think: https://ratethispodcast.com/a16z Find a16z on Twitter: https://twitter.com/a16z Find a16z on LinkedIn: https://www.linkedin.com/company/a16z Subscribe on your favorite podcast app: https://a16z.simplecast.com/ Follow our host: https://x.com/eriktorenberg Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details, please see a16z.com/disclosures

Justine MoorehostOlivia Moorehost

Aug 13, 202527mWatch on YouTube ↗

CHAPTERS

What’s on deck this week: creative tools, GPT-5, and vibecoding
Justine and Olivia set the agenda for the episode, spanning new consumer-facing creative models, major OpenAI model changes, and their evolving “vibecoding” thesis. They frame the discussion around what’s actually changing for everyday users versus what’s impressive on paper.
Grok Imagine goes social: image/video generation inside X
They break down xAI’s Grok Imagine and why its biggest differentiator isn’t raw quality—it’s distribution and native social integration. The ability to turn images into videos directly from X changes how casual users create and share AI content.
Speed + mobile workflow as the killer feature
They argue Grok Imagine’s near-instant image generation and fast video creation unlocks iteration in a way slower tools don’t. For non-professional creators, removing multi-step export/import workflows—especially on mobile—is a major leap for mainstream adoption.
Real-person generation and moderation tradeoffs
Grok’s “uncensored” feel (relative to other tools) enables generating real people and meme-y content with fewer blocks. They discuss how other models often restrict “prominent person” outputs, and why fewer constraints can make a product feel more playful—while also implying different safety choices.
GPT-5 ships—and GPT-4.0 disappears: why users got upset
The conversation shifts to OpenAI’s GPT-5 release and the surprise removal/deprecation of GPT-4.0 in the product experience. They explain why consumers reacted strongly: losing a familiar “friend” model felt like a downgrade even if benchmarks improved.
GPT-5 vs GPT-4.0: better at coding, worse at ‘vibes’
They separate two issues: removing excessive validation (“glazing”) is good, but GPT-5 also seems less expressive and less fun for casual chat. The net result is a model that may be objectively stronger yet less engaging for companionship-style usage.
AI for health and mental health: OpenAI leans in as regulation tightens
They discuss OpenAI’s increasing emphasis on medical use cases (including HealthBench) alongside Illinois’ new law restricting AI-driven therapy without licensed supervision. The tension: users already use general chatbots for mental health support, while enforcement and definitional boundaries remain unclear.
Genie 3 from Google: interactive world models you can ‘walk through’
They explain why Genie 3 demos went viral: it turns prompts/images (and even videos) into navigable, real-time environments. Unlike standard video generation, the user can control movement and see the scene regenerate dynamically as if exploring a virtual world.
So what do you do with a world model? video control, gaming, and RL environments
They outline practical paths for Genie 3: recording controlled traversal to make more editable video, accelerating game development, and enabling fully personalized mini-games. They also highlight a less consumer-centric but important use: generating scalable RL environments for training agents and robots.
ElevenLabs’ licensed music model: why licensing changes the market
They cover ElevenLabs’ move into AI music generation with a key differentiator: training on fully licensed music. While many consumers may not care, licensing is crucial for enterprises and media buyers who need legal clarity for commercial use.
Vibecoding in practice: Olivia ships a ‘selfie with Jensen’ app overnight
Olivia shares a hands-on vibecoding case study: she built and published a meme app that lets users generate a Jensen Huang selfie using image model APIs. The project demonstrates how non-technical creators can ship real, viral software quickly—while also revealing cost and reliability pitfalls.
Early vibecoding platforms assume technical knowledge: security and privacy gaps
They discuss the hidden sharp edges: the app accidentally exposed an API key and lacked proper private storage for uploaded photos. The key critique is that today’s vibecoding tools don’t proactively prevent common security/privacy mistakes, which blocks true mainstream adoption.
The vibecoding market will fragment: ‘training wheels’ consumers vs. stack-control pros
Justine summarizes the broader thesis from her post with Anish Acharya: vibecoding is trying to serve incompatible user segments with one product. The market likely splits into specialized platforms optimized for consumers (fast, safe, mobile, constrained) versus developers/enterprises (flexible, integrated, controllable).

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome

What’s on deck this week: creative tools, GPT-5, and vibecoding

Grok Imagine goes social: image/video generation inside X

Speed + mobile workflow as the killer feature

Real-person generation and moderation tradeoffs

GPT-5 ships—and GPT-4.0 disappears: why users got upset

GPT-5 vs GPT-4.0: better at coding, worse at ‘vibes’

AI for health and mental health: OpenAI leans in as regulation tightens

Genie 3 from Google: interactive world models you can ‘walk through’

So what do you do with a world model? video control, gaming, and RL environments

ElevenLabs’ licensed music model: why licensing changes the market

Vibecoding in practice: Olivia ships a ‘selfie with Jensen’ app overnight

Early vibecoding platforms assume technical knowledge: security and privacy gaps

The vibecoding market will fragment: ‘training wheels’ consumers vs. stack-control pros

Get more out of YouTube videos.