Skip to content
How I AIHow I AI

I cloned myself with Gemini Omni in 15 minutes (and it's terrifyingly good)

In this experimental episode, I document my real-time attempt to create an AI avatar of myself using Google Flow and the new Gemini Omni video generation model. I walk through the entire process—from scanning my face with my phone to generating a complete one-minute hype video for the podcast, all in about 15 minutes. *What you’ll learn:* 1. How to create an AI avatar using Google Flow in under five minutes 2. Why video AI tools unlock creative possibilities for people with zero video production skills 3. The step-by-step process of generating a full storyboard using AI as your creative producer 4. How to use character consistency features to generate multiple video scenes with the same avatar 5. The uncanny-valley moments you’ll encounter when your AI clone doesn’t quite nail emotions or physics 6. How to stitch together AI-generated scenes into a complete video using built-in editing tools *Brought to you by:* Merge—Connective infrastructure for production AI: https://www.merge.dev/howiai Jira Product Discovery—Prioritize with insights, build with confidence: https://atlassian.com/howiai *In this episode, we cover:* (00:00) Getting started with Google Flow and Gemini Omni (01:38) The avatar creation process: scanning and photo capture (02:55) Using Flow to brainstorm a hype video storyboard (06:59) Generating the first video scene with the avatar (08:41) Troubleshooting: accidentally generating images instead of videos (09:32) Generating all seven scenes for the complete video (11:37) Reviewing the avatar videos (13:13) Stitching the videos together in the browser-based editor (14:32) The complete How I AI hype video (15:32) What worked and what didn’t (19:04) Final thoughts *Blog & detailed workflow walkthroughs from this episode:* How I Built an AI Avatar and Hype Video in 15 Minutes with Google Flow: https://www.chatprd.ai/how-i-ai/ai-avatar-video-in-15-minutes-with-google-omni-flow ↳ How to Create a Promotional Video with an AI Creative Director: https://www.chatprd.ai/how-i-ai/workflows/how-to-create-a-promotional-video-with-an-ai-creative-director ↳ How to Create a Personalized AI Avatar with Google Flow: https://www.chatprd.ai/how-i-ai/workflows/how-to-create-a-personalized-ai-avatar-with-google-flow *Tools referenced:* • Google Flow: https://labs.google/fx/tools/flow • Gemini Omni: https://gemini.google/overview/video-generation/ • Veo 3: https://deepmind.google/technologies/veo/ *Where to find Claire Vo:* ChatPRD: https://www.chatprd.ai/ Website: https://clairevo.com/ LinkedIn: https://www.linkedin.com/in/clairevo/ X: https://x.com/clairevo _Production and marketing by https://penname.co/._ _For inquiries about sponsoring the podcast, email jordan@penname.co._

Claire Vohost
Jun 3, 202620mWatch on YouTube ↗

CHAPTERS

  1. Goal and setup: cloning Claire with Google Flow + Gemini Omni

    Claire sets the challenge: create an AI video avatar of herself and turn it into a ~1-minute hype video for the How I AI podcast in about 15 minutes. She opens Google Flow and explains the promise of Gemini Omni for consistent-character video generation, noting it didn’t work well on launch day but might now.

  2. Avatar capture on phone: QR scan, photos, head turns

    She starts the avatar creation flow by scanning a QR code and using her phone camera to capture required angles. The process includes taking multiple photos and turning her head left/right to complete the scan.

  3. Prompting Flow as a “creative suite”: brainstorming the hype video concept

    With an avatar available, Claire asks Flow to help storyboard a hype video for How I AI. She provides creative direction (dark green home office, books/posters, authentic but high-tech “hacker vibe”) and frames this as AI unlocking new creative capability for non–video creators.

  4. Storyboard generated: seven-scene structure and visual beats

    Flow proposes a seven-scene storyboard: keyboard close-up, office wide shot, chair spin reveal, HUD/montage moments, lifestyle beat, mic/podcast moment, and title/CTA. Claire reacts to the cheesiness but approves the structure and moves to generate scenes with her avatar reference.

  5. First scene generation attempt: environment details leak into the avatar

    Claire tries to generate the first scene using the storyboard prompt while @-mentioning her avatar. She notices the model pulls in background elements from her capture photos (posters/books), which is surprising and helps continuity—though it also reveals how tightly the avatar ties to the training context.

  6. Troubleshooting mistake: generated images instead of videos

    She realizes she accidentally ran image generation rather than video generation due to a UI toggle. After correcting the setting, she re-submits the prompt to generate actual video outputs, noting that video takes longer and typically produces two variants (V1/V2).

  7. Batching seven scenes: waiting, variants, and jump-scare realism

    Claire queues all seven scenes for generation, describing the experience as both exciting and uncanny. She reviews early outputs—like unexpected blue nail polish and a startling chair-spin clip—highlighting how close the avatar can feel while still producing surprising details.

  8. Scene review: the chair-spin intro line and character consistency quirks

    She plays two versions of the chair-spin scene where the avatar says, “I’m Claire, and this is How I AI.” The background includes a surprising NVIDIA reference; one version spins twice, and she chooses the one that flatters her more while noting hair and styling differences across clips.

  9. Stitching in Flow: browser timeline editor to assemble the hype video

    Claire demonstrates Flow’s built-in web editor timeline and stitches together her preferred takes in the suggested order. The assembly takes about five minutes, emphasizing an end-to-end pipeline from capture → storyboard → generation → edit within a single tool.

  10. Premiere: the finished “How I AI” hype video

    She plays the completed one-minute hype video, featuring the hook “We were told AI would replace us,” an intro, a montage of AI-themed visuals, and a subscription call-to-action. The result feels surprisingly polished for the time invested, despite occasional glitches and awkward phrasing.

  11. Postmortem: what worked, what didn’t, and why it’s still impressive

    Claire breaks down strengths—speed, low effort, and a ~50% “there” result that’s shareable—and weaknesses like uncanny facial moments, inconsistent backgrounds, and early-2000s “AI aesthetics.” She notes the avatar can look highly accurate in some frames and less so when expressing emotion (e.g., laughing).

  12. Closing thoughts: experimentation, future improvements, and viewer prompt

    She concludes that with tighter prompting, more consistent background constraints, and additional reference images, the results could become convincingly realistic. Claire invites viewers to try creating their own avatars and share examples, framing the episode as a successful (and strange) How I AI experiment.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.