Head of Gemini: You're Using 5% of What Gemini Can Actually Do | Josh Woodward
At a glance
WHAT IT’S REALLY ABOUT
Gemini’s agent era: voice-first assistants, personal context, massive speed gains
- Google I/O marked a shift toward an “agentic era,” where Gemini Spark and related tools run tasks in the background across Gmail, Drive, Calendar, and more.
- Gemini’s differentiators are tight Google ecosystem integration, scalable parallel task execution via cloud VMs, and a broad generative media stack (docs, slides, images, video, music).
- Voice-first interaction is already tipping in some countries, and new features like Docs Live and voice-to-workflow demos show voice moving beyond transcription into tool-calling and synthesis.
- Woodward argues the real user shift is from “doing” to “directing,” meaning people will increasingly manage agents and need new intuition, soft skills, and judgment rather than only execution skills.
- In Google Labs, rapid iteration with small teams and real-world user testing (the “eyes light up” metric) is emphasized over early dashboard optimization, because most ideas take multiple attempts before success.
IDEAS WORTH REMEMBERING
5 ideasGemini’s pitch is “no connectors” for Google users.
Woodward frames Gemini Spark’s advantage as native access to Gmail, Drive, Docs, Sheets, Slides, and Calendar via an opt-in “Personal Intelligence” concept, reducing setup friction versus external tools.
Agents will scale from a few tasks to hundreds running in parallel.
Gemini can spin up cloud virtual machines to execute many background tasks simultaneously, positioning agent workflows as an orchestration problem rather than a single-chat interaction.
Voice is becoming a primary interface, not a novelty.
He notes usage has tipped toward voice in some countries, and upcoming features combine voice input with tool calling, retrieval from files, and automatic cleanup of rambling speech into polished outputs.
A killer demo is “talk to your files, then ship a deliverable.”
The two-week-old I/O demo showed selecting multiple files (Drive/desktop), speaking instructions, and having Gemini synthesize PDFs/images into a structured email or document—correcting errors like dates along the way.
The work shift is from execution to management of outcomes.
Woodward describes moving from “doing” to “directing,” implying broad “manager training for everyone” as individuals coordinate multiple agents toward a final deliverable.
WORDS WORTH SAVING
5 quotesAnd so we on the team talk about is you're moving from doing to directing, and that's, like, a big shift.
— Josh Woodward
Now we're imagining we may need that for everybody... because you may be managing these different sort of, like, agents and others, so yeah.
— Josh Woodward
I always kind of tell the teams, and I try to go with them on a lot of these, you see it in people's eyes. When they use the thing, do their eyes light up, or are they, like, recoil?
— Josh Woodward
The other one I am not sure people have fully internalized, even at Google, is how fast these models are gonna get.
— Josh Woodward
So speed's a feature.
— Josh Woodward
High quality AI-generated summary created from speaker-labeled transcript.