Skip to content
a16za16z

What You Missed in AI This Week (Google, Apple, ChatGPT)

Things in consumer AI are moving fast. In this episode, Justine and Olivia Moore, investing partners (and identical twins!) at a16z, break down what’s real, what’s overhyped, and what’s next across the consumer AI space. They cover: - Veo 3: how Google's video model unlocked a new genre of content - OpenAI’s Advanced Voice Mode: upgrades, realism, and... um, human-like hesitation - Apple's AI announcements - ElevenLabs' V3: expressive voice tags, real-time interruptions, and narrative tools for creators - New data from a16z: AI consumer startups are ramping revenue faster than ever—and they show you how - Justine walks through how she used ChatGPT, Ideogram, and Krea to launch a fully AI-assisted brand prototype (store photos and all) It’s exhausting (in the best way) to be a creative in the age of AI. Timecodes: 00:00 Introduction 00:28 Meet the Hosts: Justine and Olivia 00:45 Veo 3: The Game-Changer in AI Video 06:34 ChatGPT's Advanced Voice Mode Updates 10:22 Apple's AI Announcements and Siri's Shortcomings 12:18 ElevenLabs' New Voice Model: 11 V3 15:50 Report from a16z: AI Revenue Growth 23:14 Demo of the Week: AI in Brand Creation Resources: Read ‘What “Working” Means in the Era of AI Apps’: https://a16z.com/revenue-benchmarks-ai-apps/ Find Justine on X: https://x.com/venturetwins Find Olivia on X: https://x.com/omooretweets Tools Discussed: Veo 3: https://gemini.google/overview/video-generation OpenAI: https://openai.com/chatgpt ElevenLabs (V3 voice model) – https://elevenlabs.io/ Ideogram (logo/image generation) – https://ideogram.ai/ Black Forest Labs/Flux Context (image editing via Krea) – https://www.krea.ai/ Flux Context demo (Krea launch post) – https://www.krea.ai/blog/flux-context Hedra: https://www.hedra.com/ Stay Updated: Let us know what you think: https://ratethispodcast.com/a16z Find a16z on Twitter: https://twitter.com/a16z Find a16z on LinkedIn: https://www.linkedin.com/company/a16z Subscribe on your favorite podcast app: https://a16z.simplecast.com/ Follow our host: https://x.com/eriktorenberg Please note that the content here is for informational purposes only; should NOT be taken as legal, business, tax, or investment advice or be used to evaluate any investment or security; and is not directed at any investors or potential investors in any a16z fund. a16z and its affiliates may maintain investments in the companies discussed. For more details please see a16z.com/disclosures.

Olivia MoorehostJustine Moorehost
Jun 13, 202529mWatch on YouTube ↗

EVERY SPOKEN WORD

  1. 0:000:28

    Introduction

    1. OM

      AI video completely taking over our social feeds in the span of a week, which is absolutely insane.

    2. JM

      Veo 3 was sort of like the ChatGPT moment for AI video.

    3. OM

      The next generation of entrepreneurs are gonna be completely AI assisted. Like, a world of possibilities has been opened up for AI storytelling, especially in video form.

    4. JM

      Yes, it's an exhausting time for AI creatives. It's great, but exhausting. [upbeat music]

  2. 0:280:45

    Meet the Hosts: Justine and Olivia

    1. JM

      I'm Justine.

    2. OM

      I'm Olivia.

    3. JM

      And this is our very first edition of This Week in Consumer AI. So we are both partners on the investing team here at a16z, and we are also identical twins.

    4. OM

      Very confusing.

    5. JM

      Extremely confusing, but should be fun for a podcast. Um, and we're excited

  3. 0:456:34

    Veo 3: The Game-Changer in AI Video

    1. JM

      to chat about some of the cool things we saw in the wild world of consumer AI this week, starting with Veo 3, Google's video model.

    2. OM

      Mm-hmm.

    3. JM

      Then we're gonna talk through the ChatGPT Advanced Voice Mode updates and Apple's big AI announcements.

    4. OM

      And then we're gonna cover ElevenLabs' new voice model. We're gonna talk about some data that our team put out recently about how fast consumer AI startups are ramping revenue, and then we'll talk about Flux's new editing model, Context, and how Justine used it to make her own froyo brand.

    5. JM

      And stay tuned for the end because we have a cool tutorial and some demo footage on how to make your own brand.

    6. OM

      Things are moving so quickly that it feels like we went from n- exciting but maybe not super realistic AI video to AI video completely taking over our social feeds in the span of a week, which is absolutely insane.

    7. JM

      Yeah. I've been following AI video for a few years now. I... You probably remember, I've been an early user of all these models-

    8. OM

      Yes

    9. JM

      ... and I have wanted them to work and to make cool things that everyday people would like for so long. And I would say Veo 3 was sort of like the ChatGPT moment for AI video, where we were suddenly seeing all of these Veo 3 generations blowing up with millions of views, channels only featuring Veo 3 videos getting hundreds of thousands of subscribers within days.

    10. OM

      Yeah.

    11. JM

      Um-

    12. OM

      What's actually different about Veo 3?

    13. JM

      Yeah, so okay. I should give the overview first. So Veo 3 is Google DeepMind's latest video model effort.

    14. OM

      Mm-hmm.

    15. JM

      Um, so they released Veo 2 late last year, which was, like, the first sort of breakthrough in showing that you could get really high-quality video, like a consistent scene, consistent characters, um, physics, like, things that just looked good. Um, and Veo 3 is the next iteration of that model series, and what's very different about it is it generates audio natively at the same time it generates video. So you can actually prompt it with a text prompt to say something like, "A street-style interview where a man and a woman are talking about dating apps," or you can be even more specific and say something like, "A street-style interview where a man walks up to a woman and asks her, 'What dating apps are you on?' And she replies, 'Why are you asking?' Um, and then gives him a suspicious look." Uh, and so you no longer have to go to another platform to do an audio voiceover or anything like that. You can get a full-featured talking human video with multiple characters in one place.

    16. OM

      They left behind a, a ball today. It bounced higher than I can jump.

    17. SP

      Oh, what manner of magic is that?

    18. OM

      It feels like a real unlock to me, as someone who's been following AI video less closely, in that people are now able to generate, in one prompt, a full vlog, a full talking head video, something that looks like a podcast-

    19. JM

      Yes

    20. OM

      ... in one go, and I think that's why we've seen things like the stormtrooper vlogs completely blowing up on TikTok and Instagram.

    21. SP

      I told you, Greg. I told you not to touch the nav system.

    22. SP

      I followed the route.

    23. SP

      You plotted it upside down, Greg.

    24. JM

      Yeah, so the interesting thing about Veo 3 is it's limited to eight-second generations only, um, and it doesn't generate audio if you start from an image to video, only if you start from text, which means that it's really hard to have longer than an eight-second clip with character consistency unless in your text prompt you are referencing a character that the model already knows.

    25. OM

      Okay.

    26. JM

      And so that's why we've seen all of these hacks of all the viral vlogs featuring, like, stormtroopers or a yeti.

    27. OM

      Yeti. You can't see their faces. They're covered by a mask.

    28. JM

      Yes, or the yeti, the model knows what the yeti looks like.

    29. OM

      Yeah.

    30. JM

      Or a capybara. Like, if it's not a human face, I think we're less sensitive to-

  4. 6:3410:22

    ChatGPT's Advanced Voice Mode Updates

    1. JM

      Okay, so there was a lot of news last week-

    2. OM

      Yeah

    3. JM

      ... so this got kind of lost, but I heard there was a big update to ChatGPT's Advanced Voice Mode?

    4. OM

      Yes. They announced it on Saturday, which was an interesting choice. Um-

    5. JM

      Weird time to drop.

    6. OM

      Yeah. I think they actually dropped the improvements last Thursday or Friday. It was first only for all paid users, and now I think it started rolling out across the broader use- user base. But essentially they made Advanced Voice Mode a lot more human.

    7. JM

      Hmm.

    8. OM

      The really interesting thing was ChatGPT was the first one to do what I would call kind of real-time consumer voice-

    9. JM

      Yeah

    10. OM

      ... where you could have, like, a conversation, and this was, like, last September in the-

    11. JM

      Yeah

    12. OM

      ... ChatGPT app. But then-... they didn't really improve the product or the model that much-

    13. JM

      Right

    14. OM

      ... at least from my perspective, and we saw Sesame and other open source providers come out with arguably better and way more human-like models. We saw Gemini, we saw Grok launch voice products that were much more realistic.

    15. JM

      Right.

    16. OM

      So I think it was kind of a question mark for a lot of people what ChatGPT was doing with consumer voice.

    17. JM

      And so what were the, what, what makes it better now, or what were the main upgrades?

    18. OM

      We should actually just maybe talk to it and see.

    19. JM

      It's live demo time, folks. [laughs]

    20. OM

      Hey, ChatGPT, are you there?

    21. SP

      Hey there. Yep, I'm here and ready for the demo. What can I help you with today?

    22. OM

      Did they make some improvements to your voice over the last week?

    23. SP

      Yeah, they did. They've been rolling out some updates to make my voice, uh, sound more natural and expressive. I'm glad you noticed.

    24. OM

      Yeah. I can hear, like, your voice maybe inflects upwards when you're approaching a question, or sometimes you'll say um or uh or something that sounds like a mistake but actually makes you sound more human.

    25. SP

      Exactly.

    26. OM

      The-

    27. SP

      Those little touches are all intentional to make the conversation feel more natural and relatable. It, it definitely makes things a bit more fun and engaging, I think.

    28. OM

      Very cool.

    29. JM

      Amazing. Yeah, it's crazy to hear... It's always wild when a voice model, like, coughs-

    30. OM

      Yes

  5. 10:2212:18

    Apple's AI Announcements and Siri's Shortcomings

    1. OM

      But the other big, the, the other big tech consumer update this week, which was the Apple developer conference-

    2. JM

      Yes

    3. OM

      ... and, and all of the things that they announced around AI, and-

    4. JM

      Or didn't announce

    5. OM

      ... and, or didn't announce.

    6. JM

      Right.

    7. OM

      And the fact that I think that people have been so far somewhat disappointed-

    8. JM

      Yeah

    9. OM

      ... by Apple Intelligence, which is their bundled set of AI features.

    10. JM

      Yep.

    11. OM

      I think we've all been waiting on, like, the AI version of Siri or some kind of true personal assistant on mobile.

    12. JM

      Yeah. I asked Siri, so I had this the other day-

    13. OM

      Yes

    14. JM

      ... where I asked Siri, um, "Okay, tomorrow's Monday. What Monday is it of the month?" Because SF street cleaning-

    15. OM

      Okay. Yes

    16. JM

      ... I had to know if it was gonna be-

    17. OM

      Yes

    18. JM

      ... the second Monday of, of the month. And it said, "I can't... I don't know that. Can I search ChatGPT for you?"

    19. OM

      Tough.

    20. JM

      And I was like, "Siri-

    21. OM

      Yes

    22. JM

      ... how can you not answer this basic question?"

    23. OM

      Well, okay, it does seem like from a lot of Apple's updates that they put out, they're kind of outsourcing a lot of the-

    24. JM

      Yeah

    25. OM

      ... true AI features to ChatGPT just running on your phone.

    26. JM

      Yeah.

    27. OM

      Um, and I think a similar story, it seemed like when they rolled out those AI-powered notification summaries-

    28. JM

      Yes

    29. OM

      ... where they would group, like, three or four sets of notifications into one-

    30. JM

      Yeah

  6. 12:1815:50

    ElevenLabs' New Voice Model: 11 V3

    1. JM

      Um, okay, and before we get too far off voice, should we talk about Eleven V3?

    2. OM

      Yes.

    3. JM

      Uh, so ElevenLabs, the text-to-speech company, actually broader AI voice company, uh, released their third generation model called Eleven V3.

    4. OM

      We're off under the lights here for this semifinal clash, the stadium buzzing with anticipation.

    5. JM

      And what makes Eleven V3 really special is it does a bunch of stuff with voice that you used to have to do via speech to text to speech. So before, if you wanted to have a character that was, you know, crying while talking-

    6. OM

      Yeah

    7. JM

      ... or had some sort of emotion or even had, like, a weird inflection, you would have to record yourself saying it like that, upload it to Eleven, and then they-

    8. OM

      Yeah

    9. JM

      ... would translate it into the AI voice.

    10. OM

      Yep.

    11. JM

      And now they essentially take all of the weird inflections, emotion, even accents, and they turn it into text prompting-

    12. OM

      Yep

    13. JM

      ... through these things called tags. So basically, the Eleven, and I'm sure we'll show this, the Eleven, um, interface is an editor where you can take a sentence that you want the character to say, you pick your voice, you write your sentence, and then you can tag it, like sadly or resigned or whispering or something like that.

    14. OM

      Liam, have you tried the new ElevenLabs V3?

    15. SP

      Just got it. The emotion is amazing. I can actually do whispers now like this.

    16. OM

      And you can do sound effects too, right?

    17. JM

      That is huge. So, uh, actually, should I bring up my example of this?

    18. OM

      Go for it.

    19. JM

      I don't know if it's gonna play or not. Let's see. Um, so this is a 20-second clip I made of two characters talking back and forth.

    20. OM

      And what's the prompt on it?

    21. JM

      Oh, it's a text prompt. It'll say, "Hey, y'all. My name is Austin. I'm coming to you live from our family farm in Fort Worth." Then he's gonna walk-

    22. OM

      Okay

    23. JM

      ... through milking a cow, and someone's gonna interrupt him.

    24. OM

      Great.

    25. SP

      Hey, y'all. My name is Austin. I'm coming to you live from our family farm in Fort Worth. [cow moos] Today, I'm gonna walk through what it's like to-

    26. JM

      Austin, are you faking an accent again?

    27. SP

      It's not faking. I was born here.

    28. JM

      Everyone knows you don't talk like that. So my favorite thing about that is it showcases a couple of things-

    29. OM

      Yeah

    30. JM

      ... about the model.

  7. 15:5023:14

    Report from a16z: AI Revenue Growth

    1. JM

      week-

    2. OM

      Yep

    3. JM

      ... about AI revenue ramp and how fast companies are growing.

    4. OM

      Yep.

    5. JM

      Let's chat through the main takeaways from that.

    6. OM

      Yeah. So basically the methodology here, or maybe to even b- back up, the, the purpose here was I think we all have this idea in mind or, or maybe we have that idea because we've heard it a billion times that, like, we're in a new era of growth now.

    7. JM

      Yes.

    8. OM

      Thanks to AI, companies are scaling faster than ever before.

    9. JM

      Right.

    10. OM

      But my question was like, what does that really mean, and how fast is that? Is it 20% faster? Is it 50% faster than what we saw pre-AI?

    11. JM

      Right.

    12. OM

      So we are blessed to get to meet tons of companies here every day. We meet dozens of companies a week. So we went back and essentially just pulled all the data from companies we've met in the gen AI era, which I would say is the last, you know, 22 to 24 months.

    13. JM

      Right.

    14. OM

      And we looked at once they started monetizing, how fast are they growing? I would say pre-AI, if you're a B2B startup selling to enterprises, if you got to a million dollars in ARR in the first year, that's, like, amazing, best in class. [laughs]

    15. JM

      That was, like, the rule of thumb. I remember that.

    16. OM

      Yes.

    17. JM

      It's the known metric.

    18. OM

      Very exciting. If you were a consumer startup, you would not make money for three, five years, maybe longer.

    19. JM

      Yes.

    20. OM

      The whole idea was to build up a user base and then probably monetize them directly, uh, via ads.

    21. JM

      Or transactions for like a marketplace maybe.

    22. OM

      Yes, down the line.

    23. JM

      Right.

    24. OM

      And there were counter examples to that, some subscription companies, but that was definitely not the dominant model.

    25. JM

      Yep.

    26. OM

      That has fully shifted in the AI, AI era, and most companies are now making money directly from consumers via subscription. What we found was actually pretty surprising, which is that the median ARR, annualized revenue run rate, is now $4.2 million at month 12-

    27. JM

      Wow

    28. OM

      ... for consumer startups. The bottom quartile is 2.9 million.

    29. JM

      Yeah.

    30. OM

      And the top quartile is 8.7 million.

  8. 23:1429:17

    Demo of the Week: AI in Brand Creation

    1. JM

      All right. Awesome. We're moving on to our demo of the week.

    2. OM

      Love it.

    3. JM

      So one fun fact about us is that we love, we genuinely love, like, at least for me, it's probably my number one hobby now-

    4. OM

      Yeah

    5. JM

      ... trying out all of the AI creative tools especially, but also AI, like, consumer products more broadly. Um, figuring out how to make cool things and then sharing the workflows to other people whose number one hobby is not doing this.

    6. OM

      [laughs]

    7. JM

      So this week we are going to talk about brand creation and ideation using AI. I made this new frozen yogurt brand called Melt that I iterated on with ChatGPT, then I took to Ideagram, and then I took to Krea to kinda do the final touches and to make these really cool product photos and even store photos.

    8. OM

      Yeah.

    9. JM

      Um, and I think that the initial idea about this was seeing Flux Context come out, which is the new image editing model from Black Forest Labs, which is hosted on Krea. Um, and Flux Context, you can kind of think of it like the GPT-4.0 image model, where you can upload an image, um, and then you can say, you know, "Make this Ghibli style" was the-

    10. OM

      Yeah

    11. JM

      ... was vi- viral example. You can also say, like, "Take the person from this photo and put them in a new environment," or, you know, "Take the logo and change it slightly."

    12. OM

      Yeah.

    13. JM

      Add or remove objects.

    14. OM

      I've seen it described as kind of like Photoshop, but with natural language prompts.

    15. JM

      Yes.

    16. OM

      Like, you can edit with words for the first time.

    17. JM

      And that's... I think that is what makes it different than the 4.0 image model-

    18. OM

      Yeah

    19. JM

      ... which is, um, the consistency to which it retains the item or the character or whatever-

    20. OM

      Yeah

    21. JM

      ... is much, much better. Uh, we'll, we'll show some examples here.

    22. OM

      Yeah.

    23. JM

      But basically, if you're t- taking a photo of yourself and uploading it to GPT-4.0 and saying, like, "Put me in a podcast studio," you will likely end up looking completely different in the new photo than you did in the initial photo.

    24. OM

      Yes. [laughs]

    25. JM

      Or maybe some similar features, but quite different, whereas this model does an amazing job at maintaining consistency.

    26. OM

      Yep.

    27. JM

      And so that sparked this idea for me of like, "Oh, that means that this can actually be used for, like, brands-

    28. OM

      Yeah

    29. JM

      ... to do product photos or, or, uh, other sorts of marketing collateral," because the logos and the products can be consistent.

    30. OM

      Awesome.

Episode duration: 29:35

Install uListen for AI-powered chat & search across the full episode — Get Full Transcript

Transcript of episode fySodSi4aUU

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

Add to Chrome