Head of Gemini: You're Using 5% of What Gemini Can Actually Do | Josh Woodward

📌 Increase your visibility in AI search with HubSpot AEO: see how your brand shows up in ChatGPT, Gemini, and Perplexity. FREE for 28 days, then $50/month: https://clickhubspot.com/a400ee

Josh Woodward, the VP behind Google Labs, the Gemini app, and AI Studio, gave me the clearest picture yet of where AI is actually heading at Google.

0:00 - Intro
1:01 - Why This Google I/O Is Different From Every Other One
2:21 - Feature That Was Built Two Weekends Before I/O
3:22 - Why Switch to Gemini When You Already Use Another AI
6:53 - The Shift to Voice-First Is Already Happening
8:02 - From Doing to Directing: Everyone Becomes a Manager
12:10 - Is Google Actually Losing the AI Race?
15:18 - Gemini's Personality
16:27 - How Josh Builds Personal Context Into AI
19:58 - Why Everyone Needs to Organize Their Personal Context Now
20:55 - What AGI Actually Means
22:35 - Why Human Taste Becomes More Valuable, Not Less
23:42 - Advice for Someone Starting Their Career Right Now
24:29 - How Google Labs Ships Products in Two Weekends
26:13 - The Metric Josh Uses Instead of Retention Dashboards
29:39 - The Next Shift Nobody Has Internalized Yet
31:12 - What His Five-Year-Old's Future Looks Like

Links:
📩 Follow my Newsletter: https://siliconvalleygirl.beehiiv.com/subscribe?utm_source=youtube&utm_medium=video&utm_campaign=futureproof-sub&utm_content=Josh Woodward
🔗 My Instagram: https://www.instagram.com/siliconvalleygirl/
📌 My Companies & Products: https://Marinamogilko.co

#googleio #podcast #gemini #geminispark

Josh Woodward (guest), Marina Mogilko (host)

May 21, 2026 · 33m

CHAPTERS

0:00 – 2:01
Gemini’s “agentic era”: why this Google I/O feels like a turning point
Josh frames this I/O as meaningfully different: Google is shifting from chat-style assistants toward agents that can take actions across tools. He also highlights multimodal model advances ("Omni") and hints at science-focused products as part of the broader momentum.
- I/O significance: transition into an “agentic” product era
- Gemini Omni as a shift in how inputs/outputs are handled across modalities
- Google Labs has more “cooking,” beyond what was announced
- Science-oriented products mentioned late in the keynote as a future direction
2:01 – 3:03
The two-weekend voice workflow demo: search your files, synthesize, and draft instantly
Marina calls out the standout demo: speaking naturally while Gemini pulls relevant info from Drive/Gmail and drafts a structured email (including tables). Josh reveals the feature was hacked together two weekends before I/O, underscoring the pace of iteration.
- Voice interaction goes beyond transcription to retrieval + composition
- Select local/Drive files; model understands PDFs and images
- Model can correct/normalize details (e.g., fixing a date)
- Rolling out soon; feature built rapidly right before I/O
3:03 – 5:22
Why switch to Gemini/Spark: ecosystem-native agents, cloud parallelism, and generative media
Josh argues Gemini Spark’s advantage is deep integration with Google apps without extra connectors, plus the ability to run many background tasks in parallel using Google Cloud. He also points to Google’s unique breadth in generative media (images/video/music), expanding what agents can produce.
- Native integration with Gmail, Docs, Sheets, Slides, Calendar (no connectors)
- Background virtual machines enable many parallel tasks
- Generative suite: images, video, music alongside docs/slides
- Roadmap mentions MCP connections and agentic payments via Google Pay/Wallet
5:22 – 6:54
“Digital chores” and killer use cases: reminders, calendar cleanup, and interest tracking
To make agents feel real, Josh emphasizes everyday pain relief: remembering deadlines, managing family logistics, and reclaiming time. He shares practical prompts like asking Gemini which meetings to cancel and using it to follow personal interests in a customized voice.
- Agents as help for “digital chores” (deadlines, tasks, overlooked emails)
- Calendar optimization: free time for family/hobbies
- Prompt idea: “What are the three meetings I should cancel this week?”
- Personalized content streams (e.g., sports updates written in a fan voice)
6:54 – 8:02
Voice-first apps are arriving: usage tipping points, dialects, and tool-calling by speech
Marina and Josh discuss the move toward voice as a primary interface, with Josh noting some regions have already tipped to voice-dominant usage. He attributes the shift to speed and naturalness, plus models now cleaning up rambling speech and executing tool calls while speaking.
- Internal usage shows voice becoming dominant in some countries
- Voice is faster, more natural, and now supports “rambling → cleaned output”
- Voice tied to tool-calling and generation (images/actions)
- Dialects/ways of speaking as an emerging UX feature; rollout on a near-term timeline
8:02 – 9:23
From coding focus to knowledge-work orchestration: NotebookLM as the preview
Josh explains how lessons from coding assistants are being applied to knowledge work, where assembling context is the key unlock. NotebookLM exemplifies the shift: provide sources and generate deliverables (podcasts, slides, mind maps), moving users toward describing outcomes rather than procedures.
- Strategic focus expanding from software engineers to knowledge workers
- NotebookLM: assemble sources → generate multiple formats quickly
- Future interaction: “grab these things, make X/Y/Z” with minimal friction
- Work style shift: specifying outcomes rather than step-by-step execution
9:23 – 11:39
“Doing to directing”: everyone becomes a manager of agents
They crystallize the cultural change: as agents handle execution, humans increasingly direct and review. Josh predicts organizations may need to teach “manager” skills to everyone, because individuals will coordinate multiple agents like a team.
- Core shift: doing work → directing agents toward deliverables
- “Everyone becomes a manager” framing
- Agent management as a new baseline professional skill
- Implications for training, leadership, and coordination practices
11:39 – 13:54
Is Google losing the AI race? Competition, leverage, and the ‘Personal Intelligence’ bet
Marina raises the perception that Google has been behind in AI mindshare; Josh responds that the market is dynamic and competition is motivating. He argues Google’s differentiator is combining long-standing assets—especially opt-in, unified personal data connections—into new products.
- AI landscape is fast-moving; competition “sharpens” teams
- Google advantage: combining existing products/infrastructure in new ways
- Personal Intelligence: one-button opt-in to connect your Google context
- Reimagining and recombining products as a strategy to catch/lead
13:54 – 15:22
Adoption and trust: making AI personal through small ‘wow’ moments
Josh describes why mainstream users lag early adopters: AI triggers excitement for some, fear or confusion for others. Google’s approach is to focus on tangible, bite-sized value—“look what Gemini can do for you”—where repeated small wins accumulate into habit and switching behavior.
- Early adopters model-hop; mainstream users need practical clarity
- Emphasis on personal utility vs abstract “AI” messaging
- Adoption grows via a sequence of small, shareable ‘wow’ use cases
- Switching happens when enough daily tasks become easier inside one tool
15:22 – 16:27
Gemini’s personality: factual, precise, concise—and steerable when needed
They discuss “assistant personality” as a product choice: Josh wants Gemini to feel trustworthy, accurate, and to-the-point, with warmth but not excessive friendliness. He also emphasizes steerability—users can ask for harsher critique or different interaction styles—while keeping it a tool, not a companion.
- Target traits: helpful, factual/accurate, precise, concise
- Tone: warm and friendly, but not overly personable
- Steerability: users can request harsh critique or different modes
- Positioning: a practical tool rather than a “friend” product fantasy
16:27 – 18:34
Building personal context: notebooks, ‘Personal Intelligence,’ and using AI as a mirror
Marina shares her “Personal Constitution” file approach; Josh echoes that the big leap is persistent context over one-off prompts. He explains how he uses connected Google context plus curated notebooks (best writing, reading notes) and reflective prompts to identify what to stop doing and how to improve patterns.
- Shift from one-off prompting to building reusable personal context
- Spark + Personal Intelligence: opt-in access to Gmail/Calendar/Drive history
- Notebook strategy: best writing, newsletters, and expert notes synced for retrieval
- Reflective prompts: “What should I stop doing?” and “What patterns should I change?”
18:34 – 20:41
The “right context” problem: source selection, UI paradigms, and retrieval improving over time
Marina worries about too much data (years of email) and token waste; Josh agrees the key is selecting the right sources. He notes NotebookLM already supports source scoping, and says models are improving at retrieval/synthesis, while product UI patterns for context control are still evolving.
- Models benefit from context, but relevance selection is hard
- NotebookLM provides source-level controls; broader UX still unsettled
- Concerns: overload from long email histories and token costs
- Expectation: retrieval quality improves with newer models (e.g., Flash)
20:41 – 24:29
AGI, workflows, and the rising value of human taste and collaboration
Josh downplays rigid AGI definitions, focusing on the experience: time saved, mental relief, and fun insights that connect dots. On work, he argues AI will keep improving drafts, but human judgment/taste and the joy of creating with others remain central—even if smaller teams can do far more.
- AGI framed as an experience (relief, speed, insight), not a precise milestone
- AI shifts workflows but doesn’t eliminate the need for judgment and taste
- Teams can “simulate meetings” and anticipate feedback pre-emptively
- Power tools enable small groups (and possibly solo operators) to create outsized impact
24:29 – 29:40
Shipping in ‘two weekends’: Google Labs culture, user obsession, and the “eyes light up” metric
Marina probes how Google can ship so fast; Josh credits small, empowered teams with minimal bureaucracy and a willingness to try multiple iterations. Instead of fixating on dashboards early, he prioritizes direct user observation—watching real people and looking for visceral delight as the signal to continue.
- Labs model: small teams (5–6) build quickly with reduced reviews
- Iteration: often 3–5 attempts before a hit or a kill decision
- Early metric: user delight (“do their eyes light up?”)
- Advice: get out of conference rooms; test in coffee shops/student unions; avoid building what no one wants
29:40 – 33:33
Next shifts: voice dominance and ultra-fast models—and what today’s kids will assume is normal
Josh predicts two underappreciated accelerants: the broad move to voice and the coming jump in model speed (streaming tokens at extreme rates), which will compress multi-step agent workflows. They close on the future for their young kids: human nature stays similar, but interaction becomes effortless—talking to systems that respond in milliseconds and personalize learning through interests like sports.
- Voice is near a major inflection point as the primary interface
- Model speed as the next unlock: faster streaming changes usage patterns
- Speed compresses multi-step agent workflows and accelerates science discovery
- Kids will normalize conversational “magic” UIs; education becomes interest-driven and instantly explainable
