Skip to content
How I AIHow I AI

Using Veo 3 to create AI-generated music videos, like a Tiny Desk concert with Notorious B.I.G.

Anish Acharya is an entrepreneur and general partner at Andreessen Horowitz, focusing on consumer investing and AI-native products. In this episode, he demonstrates how AI can be used for creative and personal projects beyond typical work applications. He walks through creating an AI-generated Tiny Desk Concert for Notorious B.I.G. and Kurt Cobain, building a book cataloging app using video analysis, and using browser automation for personal finance insights. Anish shares how these technologies allow anyone to bring creative ideas to life with minimal technical expertise, transforming what would have been impossible projects just a few years ago into accessible weekend activities. *What you’ll learn:* 1. A step-by-step workflow for creating AI-generated music videos featuring artists like Kurt Cobain and Notorious B.I.G. 2. How to extract vocals from existing tracks to create unique audio combinations for your AI-generated videos 3. A simple method for cataloging your book or record collection using video analysis and Gemini Flash 4. How to use Comet to analyze personal finances and get investment recommendations without manual data analysis 5. Ways AI is transforming childhood learning and play by enabling interactive storytelling and creative exploration *25k giveaway:*  To celebrate 25,000 YouTube followers, we’re doing a giveaway. Win a free year of my favorite AI products, including v0, Replit, Lovable, Bolt, Cursor, and, of course, ChatPRD, by leaving a rating and review on your favorite podcast app and subscribing to the podcast on YouTube. To enter: https://www.howiaipod.com/giveaway. *Brought to you by:* Notion—The best AI tools for work: https://www.notion.com/howiai Lenny’s List on Maven—Hands-on AI education curated by Lenny and Claire: https://maven.com/lenny *Where to find Anish Acharya:* • Andreessen Horowitz: https://a16z.com/author/anish-acharya/ • LinkedIn: https://www.linkedin.com/in/anishacharya/ • X: https://x.com/illscience *Where to find Claire Vo:* ChatPRD: https://www.chatprd.ai/ Website: https://clairevo.com/ LinkedIn: https://www.linkedin.com/in/clairevo/ X: https://x.com/clairevo *In this episode, we cover:* (00:00) Introduction to Anish Acharya (03:05) How AI transforms creative constraints in music and video (06:00) Creating an AI-generated Notorious B.I.G. Tiny Desk Concert (07:36) Using GPT-4o to generate still images (09:27) Using Hedra to animate still frame images (10:40) Adding custom audio to video (11:30) Using Adobe Audition to clip and sync audio (15:42) How to use Demucs to extract vocals from any song (16:36) Using Hedra to generate a Tiny Desk Concert featuring Kurt Cobain (19:40) Creating a ’90s-style Nirvana music video with Veo 3 (27:40) Building a book collection cataloging tool with Gemini Flash (35:35) Using the Comet browser for personal finance analysis (37:20) How AI is transforming childhood learning and play (41:23) Tips for getting better results from AI tools *Tools referenced:* • GPT-4o: https://openai.com/index/hello-gpt-4o/ • Hedra: https://www.hedra.com/ • Adobe Audition: https://www.adobe.com/products/audition.html • Demucs: https://github.com/facebookresearch/demucs • Perplexity: https://www.perplexity.ai/ • Veo 3: https://deepmind.google/models/veo/ • Kapwing: https://www.kapwing.com/ • Cursor: https://cursor.com/ • Google AI Studio: https://makersuite.google.com/ • Gemini Flash: https://ai.google.dev/gemini-api • Comet: https://www.perplexity.ai/comet *Other references:* • Anish’s Notorious B.I.G. AI-generated Tiny Desk Concert: https://x.com/illscience/status/1935721063876550939 • NPR Tiny Desk Concerts: https://www.npr.org/series/tiny-desk-concerts/ • Notorious B.I.G.: https://en.wikipedia.org/wiki/The_Notorious_B.I.G. • Kurt Cobain: https://www.kurtcobain.com/ • Robinhood: https://robinhood.com _Production and marketing by https://penname.co/._ _For inquiries about sponsoring the podcast, email jordan@penname.co._

Anish AcharyaguestClaire Vohost
Aug 17, 202543mWatch on YouTube ↗

At a glance

WHAT IT’S REALLY ABOUT

Building AI music videos, cataloging books, and automating personal workflows

  1. Anish Acharya demonstrates how today’s AI tools make once-impossible creative projects—like generating a Tiny Desk-style performance for a deceased artist—fast and accessible.
  2. He walks through a simple pipeline: generate a still image with GPT‑4o, pull and edit audio from YouTube, optionally separate stems (vocals vs. instrumentation), and lip-sync/animate with tools like Hedra (or alternatives like Sync Labs).
  3. He then shows how Veo 3 (via Google Flow) can generate short cinematic clips for a full music-video montage, with GPT‑4o assisting prompt iteration to lock in a specific aesthetic (e.g., 1990s Seattle grunge).
  4. In a second workflow, Anish highlights Gemini Flash’s underused multimodal video understanding by building a small app in Google AI Studio that catalogs books (or records) from a quick “flip-through” video, and closes with consumer AI unlocks like Comet browser automation and AI in parenting/education.

IDEAS WORTH REMEMBERING

5 ideas

A compelling AI music video can be built from a few modular steps.

Anish’s workflow breaks into reusable parts—still image generation, audio acquisition/editing, optional vocal/instrument separation, then video animation + lip-sync—making the process approachable and repeatable.

Use GPT‑4o as a “prompt co-writer” to converge on a precise aesthetic.

He starts with off-target generations, then asks GPT‑4o for keywords and phrasing to hit “1990s Seattle grunge” and progressively refines until the visuals become camcorder-like and grimy.

Constraints (short clips, limited durations) can increase creativity.

Both hosts note current tool limits (e.g., ~7–15 second clips) and Anish argues constraints resemble early hip-hop sampling limits that led to more inventive recombination.

Minimal prompts often work better than over-specification.

Anish repeatedly uses very short prompts (e.g., “Man singing on Tiny Desk”), arguing that leaving room for the model can yield more natural and surprising outputs than tightly constrained instructions.

Audio manipulation is a major unlock for remix culture workflows.

He emphasizes stem separation (e.g., Demucs) and layering: using live cover-band instrumentation while overlaying extracted original vocals to approximate a live “Tiny Desk” feel.

WORDS WORTH SAVING

5 quotes

It’s like the most creative satisfaction I’ve had in maybe my whole life.

Anish Acharya

AI is just the next manifestation of sampling.

Anish Acharya

We forget that this would be witchcraft three years ago.

Anish Acharya

Something like this makes me almost want to cry… it always felt so inaccessible to get these amazing ideas… into a thing.

Claire Vo

Abandon the branch and start over—because you didn’t actually do any work.

Anish Acharya

Tiny Desk-style AI performances and ethical framingPrompt iteration for aesthetic control (grunge/camcorder look)Audio sourcing, editing, and stem separation (Demucs)Image-to-video + lip sync tools (Hedra, Sync Labs)Veo 3 / Google Flow for short high-physics video clipsGoogle AI Studio app builder + Gemini Flash video ingestionConsumer AI: Comet RPA, parenting use cases, AI-native behavior

High quality AI-generated summary created from speaker-labeled transcript.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome