CHAPTERS
Consumer AI recap kickoff: what’s on deck this week
Justine and Olivia Moore launch their first “This Week in Consumer AI,” outlining the biggest consumer-facing AI developments they’ll cover. They preview a fast-moving week across AI video, voice, Apple’s AI posture, monetization data, and a creative demo.
Veo 3 breakout: the ‘ChatGPT moment’ for AI video
They discuss why Google DeepMind’s Veo 3 suddenly pushed AI video into mainstream social feeds. The key leap is native audio generation alongside video, enabling “one prompt → full talking clip” content that looks like vlogs, podcasts, and street interviews.
Why the viral formats look the way they do: 8‑second limit and ‘faceless’ characters
The hosts explain constraints that shape today’s Veo 3 content: generations are limited to ~8 seconds and audio only works reliably from text-to-video. Creators work around consistency issues by using masked/known characters (stormtroopers, yeti, capybara) where small identity drift is less noticeable.
How to access Veo 3 (and what it costs)
They clarify early confusion about availability and pricing. Veo 3 debuted behind Google’s expensive AI Ultra plan via Flow, but is now accessible through APIs and third-party tools—still with meaningful per-generation costs that change how creators prompt and iterate.
What’s next for AI video: creators, coherence, and model economics
They speculate on where AI video goes from here: more creator formats and storytelling, plus a push from model providers toward longer clips. The tradeoffs will be coherence and pricing—driving demand for smaller, cheaper models that preserve quality.
ChatGPT Advanced Voice Mode catches up: more human, more expressive
OpenAI’s Advanced Voice Mode receives a subtle but meaningful upgrade: speech feels more natural via inflections, fillers, and conversational timing. They frame it as OpenAI re-entering a category where competitors (Sesame, Gemini, Grok, open-source) had recently felt more lifelike.
Why OpenAI may have moved slowly on voice
They discuss possible reasons for the delayed voice upgrades. One factor is product prioritization across many frontiers (reasoning, image, video), and another is caution after public controversy around highly human-sounding assistants (the ‘Her’ discussion).
Apple Intelligence at WWDC: useful features, but Siri still underwhelms
They react to Apple’s AI announcements with a focus on what’s missing: a truly capable AI Siri. Apple appears to rely heavily on ChatGPT for “real AI” tasks, while shipping safer, incremental features like Genmoji improvements, transcription, and real-time translation.
ElevenLabs Eleven V3: controllable emotion, interruptions, and SFX via text tags
They cover ElevenLabs’ new Eleven V3 voice model, emphasizing a workflow shift: expressive delivery can be prompted in text rather than recorded and transferred. Tags enable emotions, accents, whispers, interruptions, and sound effects—unlocking more natural multi-character scenes.
AI storytelling convergence: Veo 3 + Eleven V3 raises the creative ceiling
They connect the dots between breakthroughs in AI video and AI voice: creators can now generate end-to-end scenes with dialogue and performance. The result is a ‘world of possibilities’ for storytelling—but also an overwhelming pace of new tools to test.
a16z data: consumer AI startups are monetizing faster than ever
Olivia shares findings from a dataset of gen-AI-era companies a16z met over ~22–24 months. Consumer AI businesses are reaching surprisingly high revenue run-rates quickly, largely driven by direct-to-consumer subscriptions and willingness to pay for powerful AI-native capabilities.
Why the monetization shift works: inference costs, higher ARPU, and new value props
They explain structural reasons consumer AI charges more: AI has real marginal costs (inference), pushing companies toward subscriptions. At the same time, AI products replace expensive human services (tutoring, coaching, language learning, creative production), making $20–$30/month feel like a bargain.
Retention and expansion: ‘AI tourism’ vs paid durability—and consumer upsell mechanics
They separate curiosity-driven usage from real subscription behavior. Free-user churn is high (‘AI tourism’), but paid retention looks comparable to pre-AI consumer norms, and revenue expansion is emerging through credits/overages—bringing enterprise-like expansion (and even ‘whale’ dynamics) to consumer apps.
Demo: building a froyo brand with AI (ChatGPT → Ideogram → Krea/Flux Context)
Justine walks through an end-to-end workflow to create a modern frozen yogurt brand (“Melt”) and generate consistent product/store imagery. The centerpiece is Flux Context (via Krea), which enables Photoshop-like edits with natural language while preserving logo/product consistency better than many general image models.
Get more out of YouTube videos.
High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.
Add to Chrome