a16zHow OpenAI Builds for 800 Million Weekly Users: Model Specialization and Fine-Tuning
EVERY SPOKEN WORD
75 min read · 14,670 words- 0:00 – 8:36
Introduction
- SWSherwin Wu
We want ChatGPT as a first-party app. First-party app's a really great way to get 800 million WAU's or whatever now. Tenth of the globe, right? [laughs] Yeah, yeah, a tenth of the globe uses it weekly. Every week. Every week. Yeah. Even within OpenAI, the, the thinking was that there would be, like, one model that rules them all. It's, like, definitely completely changed. It's, like, becoming increasingly clear that there will be room for a bunch of specialized models. There will likely be a proliferation of other types of model. Companies just have giant treasure troves of data that they are sitting on. The big unlock that has happened recently is with the reinforcement fine-tuning. With that set up, we're now letting you actually run RL, which allows you to leverage your data way more. Sherwin, thanks very much for joining. So we're being joined by Sherwin Wu. Um, it'd be great actually if you provided the long form of your background as we get into this, just for those that, that may not know you. I mean, I, I've viewed Sherwin as one of the, the top AI thought leaders- [laughs] ... so I'm really looking forward to this. Yeah, yeah. Thanks for having me. I'm, I'm, I'm really excited to be, be on the podcast. Um, yeah, so a little bit more of my, my background. So, uh, maybe I can start from present day and go backwards. So I, I currently lead, um, the engineering team for the O- for OpenAI's, uh, developer platform. So the biggest product in there, of course, is the, the API. Um- Is there more for the developer platform than the API? Um- I just kind of assumed they were synonymous. Well, so I, I also think about, uh, other things that we put into our platform side. So, like, technically our government work, uh, is, is also like offering- Oh, interesting ... and deploying the- Oh, I didn't know that ... into different areas. That's cool. Yeah, like I've talked about on some of our government deployments- Oh, like, so if you have, like, a local deployment, like- Yeah, yeah. So we actually do have a local deployment- I didn't know that ... uh, at Los Alamos National Labs. It's super cool. I went to visit it. It's, like- No kidding ... very different than what I'm used to. Um, but yeah, in a, like, you know, classified, uh, supercomputer- That makes sense, yeah ... uh, with, with our, with our model running there. Oh, that's cool. Um, so there's that. Um, but, like, mostly the, the API, um, 'cause- Did you go to Los Alamos? We did. Yeah, I did go to Los Alamos. [laughs] It's great. They, uh, showed us around. They showed us some of the historic sites. Real history. Um, yeah, yeah. I used to work at Livermore, man, so I've got, like, an affinity- Oh, yeah, yeah, yeah ... my first job out of college, so. Right, right, right. Yeah. Maybe you'll sell to them next. Yeah, yeah, yeah. Well, we, we hope to. We hope to. Um, but, uh, yeah, so I work on the developer platform. I've been working on it for around, uh, three years, uh, now. So I joined in 2022. Uh, was basically hired to work on the, the API product, which at the time was the only product that- Yeah ... that OpenAI had. Yeah. Um, and I've basically just worked on it the, uh, the entire time. Um, I've always been super interested in the developer side and kind of, like, the startup story of this technology- Right ... and so it's been really, really cool to kind of see, see this evolve. And so, um, that's my time at OpenAI. Before OpenAI, um, I was at, uh, Opendoor, uh, for around six years. I was working on the pricing side. Yeah. My, my general background before- [laughs] That, like, is such a dissonant, like, you know- Yeah, yeah ... pricing at Opendoor to, like, running API at a- It, it's such a different, uh- Yeah ... it's been fascinating actually for me to see the differences between the companies. Yeah. Like, they're run so differently. Um, they both have "open" in the name, so there's, you know, some overlap- [laughs] ... but, like, that's basic- that's pretty much it. Um, but yeah, I was there for around six years, uh, working on the pricing team. So our team basically would run the ML models, um- This is actually pricing the assets on Opendoor- Yeah, yeah ... like the inventory. Exactly. So Open... So yeah, Opendoor would buy and sell homes, and- Yeah ... the, their main product was buying homes directly from people selling them- Yeah ... for, with all cash offers. Yeah, yeah, yeah. And so my team was responsible for how much we would pay for them. Wow. Uh, and so it was a really fun, like, ML challenge. Yeah. Uh, it had a huge operational element to it as well- Yeah ... 'cause not everything was automated, obviously. Yeah. But it was a really fascinating technical challenge and, um, yeah I was working on that for- Is there any sense of that on the API side, like, like GPU capacity buying or is it just totally unrelated? Oh, on the, on the API side? Um, there is, is a small bit of, like, how we price the, the models but, uh, I don't think we do anything as sophisticated- Yeah ... as, uh, uh, as Opendoor. Opendoor was- Yeah ... just, like, such a hard problem. Yeah, of course. Of course. It's, like, such a, like, expensive asset. It's like y- like, the, the holding costs are very expensive. You're, like, holding onto it for, like, months at a time. There's, like, a variability in, like, the holding time. A massive lo- long tail of, like, potential things that could go wrong. Lo- long tail, yes. Okay, sorry. And, like, you know, you have to sell, you know, like, try to think about it from a portfolio perspective- Yeah ... and, like, if one of them just, like, you're holding onto it for two years, it blows- Everything ... you know, everything, like, goes negative. Yeah, yeah, yeah. So it's, uh, it's a very, very different- Six years? ... uh, different challenge. Yeah. Yeah, six years there. Wow. Uh, lots of ups and downs. Uh, saw a lot of the booms, saw a lot of the struggles, and then- Wow ... we IPO'd, uh- Yeah, yeah ... before I, be- before I left. Yeah. Um, but yeah, just in general it was, it was a very f- great experience. I think, uh, for me, it was, uh, it was also just, like, had such a very, like, business operations- Yeah ... and, like- Yes ... like, a very, like, by the book type of culture, whereas OpenAI is, like, very different. Well, it's so interesting. Um- I was just thinking about it now, is like even for a company like that, like, you don't think about it as a tech company, but if there is a deep technology problem, it actually is the pricing, right? Yes. It's actually an ML problem. It's not- Yeah, that's what attracted me to the company ... it's not, like, the website, it's not the, [laughs] you know- Yeah, yeah, yeah ... it's not the platform. It's, like, it's not the API. Yeah. It's literally that. Yep, yep, yep. Yeah. And that's what attracted me to it. For sure. I think that was, was interesting. Um, uh, it's also a way, like, lower margin business than, uh, OpenAI, uh, 'cause you're, like, making a tiny spread on these homes. Yeah. Uh- Right ... uh, they would talk about, like, basis points, like, eating bits for breakfast and all that, so. Yeah, yeah. Um, anyways, I was at Open, uh, Opendoor for around six years. Um, and then before that, uh, was my first job out of college, which was at Quora, um, with, uh, Adam D'Angelo and the group there. No way. Yeah, so I was working on the newsfeed. Oh, yeah. Uh, so worked on newsfeed ranking for a bit, worked on the product side. Yeah. Um, but was, uh, uh, that was actually my first exposure to, like, actual ML in industry, and, uh, learned a lot from, uh, uh, from, from the engineers at Quora. We basically hired a lot of the, the early, like, feed, uh, engineers from Facebook. Was, was Charlie still there when you were there? Charlie was not there when I, when I was there. He had- Okay, so you left right after he left. Yeah, yeah, yeah. That was a really legendary team, you know? So- I, I was super- It's still- Yeah ... it's still known to be kind of this super iconic founding team. Yeah, yeah. The, the, the early founding team was really solid. I still think that even while I was there, I was, I, I still, like, am amazed at the quality of the talent that we had. Yes. Phenomenal. I think this was, like, when the company was, like, 50 to 100 people. Yeah. But, like, yeah, like, a bunch of the Perplexity team was there. Yeah. Dennis, Dennis was on the feed team with me. Yeah. Yeah. John, uh, uh, uh, Johnny Ho- Yeah ... uh, Jerry Ma. Yeah, that's right. Yeah. Uh- This is crazy ... and then, and then Alexander The Scale, now at MSL. Yes, Alexander Wang, yes. You know, like, uh- Yeah, yeah. It was crazy ... was there. He was, he was, he was there between, uh, high school and college. [laughs] Um, it was a, an incredible team. I, I don't think I... I think I kinda took it for granted when I was there. Yeah. Uh, it was a good group. But, um-Uh, yeah
- MCMartin Casado
How, and how did you, how did you get to Quora? What'd you study in undergrad?
- SWSherwin Wu
Yeah, so before that I was at MIT for undergrad. I studied computer science.
- MCMartin Casado
Okay.
- SWSherwin Wu
Um, uh, did like one of those, like, computer science and the master's degree-
- MCMartin Casado
Okay, yeah
- SWSherwin Wu
... kinda like crammed it in.
- MCMartin Casado
Yeah, yeah, yeah.
- SWSherwin Wu
Um, uh, I ended up at Quora because I got an, uh, what we call an externship there. So, like, uh, at MIT, uh, you actually get January off, so there's like the fall semester-
- MCMartin Casado
Oh, wow
- SWSherwin Wu
... and then January's off.
- MCMartin Casado
That's cool.
- SWSherwin Wu
And then you have, uh, the spring semester.
- MCMartin Casado
Yeah.
- SWSherwin Wu
And so it's kind of this, it's called independent activities period. So some people just, like, take classes, some people just do nothing.
- MCMartin Casado
Okay.
- SWSherwin Wu
But it's, uh, some people will do, like, month-long internships.
- MCMartin Casado
Okay.
- SWSherwin Wu
And some crazy companies will, like, offer a month-long internship to a college student.
- MCMartin Casado
Oh, that's cool. Yeah, yeah.
- SWSherwin Wu
And it really is just kinda like a way to get, uh, people interested-
- MCMartin Casado
Did you come out here from Boston?
- SWSherwin Wu
Yeah. Yeah, it was-
- MCMartin Casado
How did that work?
- SWSherwin Wu
Yeah, it was crazy. So, um, so, so, so, uh, you had to apply. I remember, uh, yeah, this is like, I think 2013 s- uh, January or something. Um, you had to apply, and I remember the Quora internship was the one that just paid the most. They paid, I think it was like 8,000, $9,000.
- MCMartin Casado
Nice. [laughs]
- SWSherwin Wu
And I was like, "Wow, this is like all for a month," and you're, you're just kinda ramping up like half the time.
- MCMartin Casado
I can eat for a year. [laughs]
- SWSherwin Wu
Yeah, yeah. As a college student I was like-
- MCMartin Casado
Yeah, yeah
- 8:36 – 12:18
Horizontal vs vertical OpenAI
- SWSherwin Wu
Yeah.
- MCMartin Casado
So one place I wanted to start, um, is, uh, is something that I find very unique about, um, OpenAI, uh, is it's both a pretty horizontal company, like it's got an API. Like I would say we've got this massive portfolio of companies, right? And I would say a good fraction of them use the, the API. And then it's also a vertical company in that you've got full-on apps, right?
- SWSherwin Wu
Yep.
- MCMartin Casado
Um, you know, like everybody uses ChatGPT, for example. And so, you know, you're responsible for like the API and kind of the dev tool side. So maybe just to begin with, like is there an internal tension between the two? Like, is that a discussion? Like, you know, like, like the API may, whatever, it may help a competitor to like the vertical version, or is it not... that things are just growing-
- SWSherwin Wu
Yeah
- MCMartin Casado
... so fast it's not an issue. I would just love to... how you think about that. By the way, it's very unusual for companies to have both of that.
- SWSherwin Wu
Yeah, yeah.
- MCMartin Casado
These two things this early. It's very unusual.
- SWSherwin Wu
Yeah, yeah. I completely agree. Uh, I think there is, there's, there's some amount of tension. I, uh, I would say it's... I think one thing that really helps here is, uh, Sam and Greg, just from a, like, founder, you know, perspective-
- MCMartin Casado
Yeah
- SWSherwin Wu
... have since day one just been very principled in, in, in, in the way in which we approach this. They've always, you know, have kinda told us, "You know, we want ChatGPT as a first party app."
- MCMartin Casado
Yeah.
- SWSherwin Wu
"Uh, we also want, want the API." And then, and the nice thing is, I think they're able to do this because, uh, at the end of the day, it kinda comes back to the, the mission of OpenAI, which is, uh, to create AGI and then to distribute the benefits as broadly as possible.
- MCMartin Casado
Yeah.
- SWSherwin Wu
And so if you interpret this, you want it in as many surfaces as, as you want, you... and the first party app's a really great way to get, you know, uh, what is it, like 800 million, uh, WAU's or whatever now. And-
- MCMartin Casado
Yeah
- SWSherwin Wu
... um, uh, but the-
- MCMartin Casado
800 million WAU's?
- SWSherwin Wu
Uh, yeah. It's, yeah. It's pretty... It's, it's actually mind-boggling to think about. Um-
- MCMartin Casado
I have to say, I don't think... Many people listening to this don't understand how big that is.
- SWSherwin Wu
Yeah.
- MCMartin Casado
But that is, I mean-
- SWSherwin Wu
It's, it's, it's crazy, yeah. Yeah
- MCMartin Casado
... that's gotta be like actually historic for the, for the, the time it's taken to get to 800 million, but like-
- SWSherwin Wu
It's historic. It's, um-
- MCMartin Casado
It's also just-
- SWSherwin Wu
... also just like, yeah, the amount of time and just like how much we've had to scale up, just like-
- MCMartin Casado
A, a 10th of the globe, right? [laughs]
- SWSherwin Wu
Yeah, yeah. 10% of the globe uses it week-
- MCMartin Casado
Every week. Every week.
- 12:18 – 15:11
Why you can’t “disintermediate” the model
- MCMartin Casado
... which is the, the, the problem historically with, you know, offering, um, a, a core service as an API is you can get disintermediated, right? And so I can build on top of it, but then-You know, the user doesn't know, like whatever, I build on top of the cloud-
- SWSherwin Wu
Mm-hmm
- MCMartin Casado
... but I disintermediate from the cloud, and then I can switch to another cloud or whatever.
- SWSherwin Wu
Yeah.
- MCMartin Casado
And it occurs to me that that's kind of hard to do with these models because the models are so hard to abstract away. Like, they're just, they're just unruly, right?
- SWSherwin Wu
Yeah.
- MCMartin Casado
If you try to like have traditional software drive them, they just don't kinda manage very well.
- SWSherwin Wu
Mm-hmm.
- MCMartin Casado
So part of me thinks that it's almost like this like anti-disintermediation technology that you kind of have to expose it to the, to the user directly.
- SWSherwin Wu
Yep.
- MCMartin Casado
Does that make sense?
- SWSherwin Wu
Yeah.
- MCMartin Casado
And so I'm wondering if like, so even if I think ChatGPT is really just trying to expose the model to the user, the API's kinda just trying to expose the model to the user. So I think there's almost this argument that's like if the real value is in the models, it doesn't really matter how you get it to them, 'cause it's-
- SWSherwin Wu
Yep
- MCMartin Casado
... gonna be very tough for someone to, to abstract it away in, in, in, in the classic sense of computer science, of like they don't know-
- SWSherwin Wu
Yeah
- MCMartin Casado
... that they're using the model. Like you always know you're using GPT-5.
- SWSherwin Wu
Yeah. And, and the interesting thing is I think like the entire industry kind of has slowly changed their mind around this too. I think like in the beginning we-
- MCMartin Casado
That's true, yeah
- SWSherwin Wu
... kind of thought like, oh, these are all gonna be interchangeable.
- MCMartin Casado
It's just like software.
- SWSherwin Wu
Yeah, yeah. Exactly.
- MCMartin Casado
And like API stuff.
- SWSherwin Wu
So it's a piece of infrastructure that you can just swap out, yeah. But I think we're learning this on the product side with like, you know, the GPT-5 launch and like 4.0 and like how, how so many people liked 03 and 4.0 and, and all of that. We're also seeing this-
- MCMartin Casado
I felt that. I felt that when it changed.
- SWSherwin Wu
[laughs]
- MCMartin Casado
I'm like, I'm like, "You're not as nice to me." [laughs]
- SWSherwin Wu
Yeah.
- MCMartin Casado
I'm like, "I like the validation."
- SWSherwin Wu
Yeah.
- 15:11 – 17:30
People build relationships with models
- MCMartin Casado
do you think that is because of a relationship between the user and the model, or do you think it's more of a technical thing, which is like my evals work for like OpenAI and it's, you know, and like the correctness maintains or-
- SWSherwin Wu
Yeah. Yeah
- MCMartin Casado
... or?
- SWSherwin Wu
I think it's both. Um, so I think there's, there's definitely an end user piece here, which is what we've heard from, from some of our customers, like they just get familiar with, with the model itself. Uh, but I also think there's a technical piece, which is like the... Also, as a developer, especially with startups, you're like really going deep with these models and like really like iterating on it, trying to get, get it really good within your particular harness. You're iterating on your harness itself. You're giving it different tools here and there. Uh, and so you really do end up like building a product around the model, and so there is a technical piece where, you know, as you kind of keep building with a particular, uh, uh, product like GPT-5, um, you're actually like building more around it so that your product works uniquely-
- MCMartin Casado
Yeah
- SWSherwin Wu
... well with that, with that model.
- MCMartin Casado
So, so I, I use, I use Cursor and, um, a lot, just for like a lot of stuff, like, like writing blogs and like-
- SWSherwin Wu
Yeah
- MCMartin Casado
... you know, you know, we're investors, and I use it for sometimes for coding. And it's remarkable how many models I use in Cursor. So like literally my go-to model is GPT-5.
- SWSherwin Wu
Yeah.
- MCMartin Casado
I love GPT-5. I think it's a phenomenal like, you know... And then like I use like max mode with GPT-5 for planning, and then, but, you know, like I mean, I like the tab complete model that's in Cursor and like-
- SWSherwin Wu
Yep
- MCMartin Casado
... you know, the, the new model they just dropped is for like some basic, you know, some stuff is, is good.
- SWSherwin Wu
Yeah, the Composer one.
- MCMartin Casado
Like yeah, the Composer one's good.
- SWSherwin Wu
Yep.
- MCMartin Casado
Um, and so like, you know...
- SWSherwin Wu
And I think that like kinda reflects this too, 'cause it's like-
- MCMartin Casado
Yeah
- SWSherwin Wu
... it's a particular model for each particular use case.
- MCMartin Casado
Yes. Yeah, yeah, yeah.
- SWSherwin Wu
Like I've talked to a bunch of people who've used the new Composer model, and-
- MCMartin Casado
Yeah
- SWSherwin Wu
... it's just really good for like fast-
- MCMartin Casado
It's super fast
- SWSherwin Wu
... like first pass.
- MCMartin Casado
Exactly. That's right. Yeah.
- SWSherwin Wu
Like keep you in flow kinda thing.
- MCMartin Casado
Yeah.
- SWSherwin Wu
And then you kind of like bubble out to another model if you want, like, you know, deeper thinking or something like that.
- 17:30 – 20:10
Not one AGI model, but many
- MCMartin Casado
the first one is what does that mean to A- for AGI? [laughs]
- SWSherwin Wu
[laughs]
- MCMartin Casado
And the second one is what does that mean for OpenAI? Like does that mean that like you end up with a model portfolio? Do you select a subset? Do you think this all gets superseded by some God model in the future? Like how does that play out? 'Cause it's, it's, it's against what most people thought.
- SWSherwin Wu
Yeah.
- MCMartin Casado
Most people thought this is all going towards one large model that does everything.
- SWSherwin Wu
Yeah. I think the, the crazy thing about all this is just like how everyone's thinking has just changed over time.
- MCMartin Casado
Totally.
- SWSherwin Wu
Like the, I, I distinctly remember this like... And, and, and the crazy thing, it's not that long ago. It's just like three, like two or three years ago.
- MCMartin Casado
Yeah.
- SWSherwin Wu
I remember like even with an OpenAI, the, the thinking was that there would be like one model that rules them all.
- MCMartin Casado
Yep.
- SWSherwin Wu
And it's like, why would you... I mean, like this kinda goes to the fine-tuning API product. It's like, why would you even have a fine-tuning product? Why would you even want to like iterate on it?
- MCMartin Casado
Yep. Yep.
- SWSherwin Wu
Um, there's gonna be just one model that just subsumes everything.
- MCMartin Casado
Yeah, yeah, yeah.
- SWSherwin Wu
And that was also kind of the v- that, that is also like the most simplistic like view of what the, what the AGI will look like.
- MCMartin Casado
Yeah.
- SWSherwin Wu
Um, and, and yeah, it's like definitely completely changed since then. I think one, uh... And, and, and but then the other thing to keep in mind is like it might continue to change, like even from where we are today.
- MCMartin Casado
Yeah.
- SWSherwin Wu
But it's like becoming increasingly clear, I think that, um, uh-There, there will be room for a bunch of specialized models. There will likely be a proliferation of other types of models. I mean, you see us do this with like the Codex model-
- MCMartin Casado
Yeah, totally
- SWSherwin Wu
... uh, uh, itself. We have like, you know, we have like GPT-4.1 and like 4.0 and like 5 and, and, and, and all of this. Um, and so I, I definitely think there's, there's room for all, for, for all this. I, I, I don't think that's bad for what it's worth. Like-
- MCMartin Casado
Yeah
- SWSherwin Wu
... um, if anything, I think, you know, as we've tried to move towards AGI, things have just been very unexpected, and I think the market just evolves and the product portfolio evolves because of that. Um, so I, I don't think it's a bad thing at all. Um, what I do think it means-
- MCMartin Casado
I think you could, you could easily argue it's very good for OpenAI and very good for like the model companies to like-
- SWSherwin Wu
Yeah, because-
- MCMartin Casado
... not have like, you know, winner take all consolidated dynamics, right? I mean, you just have a healthier ecosystem, a lot more solutions you can provide, a lot-
- SWSherwin Wu
Yeah
- MCMartin Casado
... you know.
- SWSherwin Wu
Yeah. A- and as, as the ecosystem grows, it generally is helpful.
- 20:10 – 24:44
Fine-tuning, RFT, and customer data choices
- MCMartin Casado
so I, I've, I've noticed that you are moving towards kinda more sophisticated use of things like, you know-
- SWSherwin Wu
Yep
- MCMartin Casado
... like fine-tuning. Um, which, you know, in a way you could read that as a bit of a capitulation that like, you know, there is product specific data and there's pr- product specific use cases that a general model won't do, to your point, right?
- SWSherwin Wu
Mm-hmm.
- MCMartin Casado
So like as opposed to proliferation model, you do that. Um, it seems like a lot of that data is actually very, very valuable, right?
- SWSherwin Wu
Yep.
- MCMartin Casado
And so, you know, to what extent is there, like, interest in almost a tit for tat where you can like expose, you know, the ability to get product data into fine-tuning, and then you also benefit from that data because-
- SWSherwin Wu
Oh, yeah
- MCMartin Casado
... the, uh, the, the vendors provide it to you, um, versus like this is 100%, you know, like they keep their own data and-
- SWSherwin Wu
Yeah
- MCMartin Casado
... there's kind of no interest in that. 'Cause it feels to me like the next level of scaling, this is kind of where-
- SWSherwin Wu
Yeah
- MCMartin Casado
... we're at. And so I'm just kinda curious how.
- SWSherwin Wu
Yeah. So I mean, uh, maybe even like taking a step back, the, the main reason why we even invested in a fine-tuning API in the very beginning is, uh, one, there's been huge demand from people to be able to customize the models a bit more. It kinda goes into like prompt engineering and also like I think the industry's changed their mind on that as well.
- MCMartin Casado
Yeah.
- SWSherwin Wu
Like it, it's evolved. But the second thing is exactly what you said, which is the companies just have giant treasure troves of data that they are sitting on-
- MCMartin Casado
Mm
- SWSherwin Wu
... that they would like to utilize in some fashion in this AI wave. And you can, you know, the simple thing is to put it in like, you know, some like vector, like do rag with it or something.
- MCMartin Casado
Yeah.
- SWSherwin Wu
But there's also, you know, if they have a more technical team, they do wanna see how they can use it-
- MCMartin Casado
Yeah
- SWSherwin Wu
... to customize the models. And, uh, and so that is actually the main reason why we've invested in, in this. The, the interesting thing was, uh, way back, kinda back in like '22, '23, our fine-tuning offering was, I'd, I'd say like too limited, so that it was very difficult for people to, to tap into and use this data.
- MCMartin Casado
Right.
- SWSherwin Wu
So it was just like an SF, like a-
- MCMartin Casado
Yeah
- SWSherwin Wu
... supervised fine-tuning-
- MCMartin Casado
Yep
- SWSherwin Wu
... API. And like, we're like, "Oh, you can kind of use it," but in practice, it really is only useful for like, like, uh, it's, it's honestly just like instruction following plus plus.
- MCMartin Casado
Yeah, yeah, yeah.
- SWSherwin Wu
You like kind of change the tone, and you're just like really like instructing it.
- 24:44 – 28:06
Prompt engineering isn’t the point anymore
- SWSherwin Wu
Yeah.
- MCMartin Casado
Okay. You, you said that, uh, views on prompt engineering have changed.
- SWSherwin Wu
Yeah. [laughs]
- MCMartin Casado
I wasn't, actually I wasn't aware of that. [laughs] All the other things I wasn't aware of, this one I wasn't. How is the prompt engineering-
- SWSherwin Wu
Yeah. I mean, I think the prevailing view, this is back in 2022. I remember I was talking to so many people, and they're basically... I mean, this is similar to like the single model AGI view as well, which is like-Like prompt engineering is just not gonna be a thing, and you're just not gonna have to think about what you're putting in the, in the context window in the future.
- MCMartin Casado
Oh.
- SWSherwin Wu
Like the model will just be good enough-
- MCMartin Casado
Oh, yeah, yeah, yeah
- SWSherwin Wu
... and it will just like know.
- MCMartin Casado
Yeah, yeah, yeah.
- SWSherwin Wu
It'll know what, what, what you need to do.
- MCMartin Casado
Okay. Yeah, that's definitely-
- SWSherwin Wu
And so like there's like-
- MCMartin Casado
That's definitely not a thing. [laughs]
- SWSherwin Wu
Yeah, but like that, that... Like I, I don't know, maybe people forget it, but like that was like a very common-
- MCMartin Casado
No, it was a great point
- SWSherwin Wu
... use back then-
- MCMartin Casado
It was a great point, yeah
- SWSherwin Wu
... 'cause like the scaling laws or whatever-
- MCMartin Casado
Yeah
- SWSherwin Wu
... something about the scaling laws, and like you'll just mind meld with the model, and like-
- MCMartin Casado
Yeah
- SWSherwin Wu
... you just like, like prompting and like instruction following will just, will be so good that you won't really need to do it.
- MCMartin Casado
Yeah, yeah, yeah.
- SWSherwin Wu
And if anything, like yeah, it's like clearly been wrong and-
- MCMartin Casado
Yeah, yeah, yeah.
- SWSherwin Wu
Um, uh, but it is interesting because it's, I think it's a slightly different world that we're in now where the models have gotten really, really good at instruction following relative to the-
- MCMartin Casado
Yeah
- SWSherwin Wu
... you know, like GPT-3.5 or something.
- MCMartin Casado
Yeah.
- 28:06 – 31:55
What an “agent” really is
- SWSherwin Wu
but my, my general take on agents is it's, it's a, it's an, it's an AI that will take actions on your behalf that can work over long time horizons.
- MCMartin Casado
Okay.
- SWSherwin Wu
And I think that's the, that's the mo- pretty general-
- MCMartin Casado
Pretty utilitarian-
- SWSherwin Wu
Yeah, yeah
- MCMartin Casado
... definition.
- SWSherwin Wu
But like if you think about it that way, yeah, I, I mean, maybe this is what you mean by modality, but it is just a like way of like using AI, and it is a... I guess it could be viewed as a modality, but we don't view it as like a separate thing, separate from a-
- MCMartin Casado
Well, let me, let me, let me just-
- SWSherwin Wu
... AI, uh, API, and ChatGPT
- MCMartin Casado
... let me just, let me just try and kind of, uh, you know, give you a sense of where this question's coming from. Like I know how to build a product. Like, and we know how to do go to market for products. We know how to do like, you know, we know the implications of turning them into platforms. Like it's just we've been doing this for a very long time, right?
- SWSherwin Wu
Mm-hmm.
- MCMartin Casado
We know how to do the same thing for APIs, right? We know how to do billing. We know like the tension of like people build on top of it and all of that stuff. And like what I've been trying to... And this is just maybe a, a personal inquiry. It's just not clear for me for an agent if you ha- if, if it, if it sits in one of those two camps. Is it more like the product camp? [laughs] Is it more like the... Or is, or-
- SWSherwin Wu
I see, I see
- MCMartin Casado
... 'cause it's kind of both. Like, I could like literally give you Codex.
- SWSherwin Wu
Yeah, yeah.
- MCMartin Casado
And like as a user, and then you just talk to it, or I could like build in a way, kind of embed it in like my app.
- SWSherwin Wu
Yeah.
- MCMartin Casado
And so like... But then that means something to you as far as like, you know, how do you price it, and what does it mean for ecosystem? Like-
- SWSherwin Wu
Yeah
- MCMartin Casado
... like for example, like would you be fine if I started a company and just like built it around Codex? Is that a thing?
- SWSherwin Wu
Starting a company and building it around Codex?
- MCMartin Casado
Around Codex, yeah.
- SWSherwin Wu
I actually think that'd be great. Like it's a-
- MCMartin Casado
Yeah.
- SWSherwin Wu
We, we like released like the Codex SDK, and we like want people-
- MCMartin Casado
Yeah. No, I saw that
- SWSherwin Wu
... to be able to build in it-
- MCMartin Casado
Yeah
- SWSherwin Wu
... and hack on it. Yeah. Actually, I think this might be what you're getting at, which is, um, uh, and this is like a kind of a unique thing about OpenAI and, and kind of reflects on how, how it's run, which is at the end, like at the end of the day, OpenAI is like a, an AGI company. It's like an intelligence company.
- MCMartin Casado
Yeah, yeah. For sure.
- 31:55 – 36:46
How OpenAI thinks about pricing
- MCMartin Casado
mean, how, how, how have you evolved your thinking and how do you price these, you know, access to intelligence where, you know, you don't know how many people are gonna use it. It's almost certainly usage-based billing-
- SWSherwin Wu
Yeah
- MCMartin Casado
... not something else. Like, can you-
- SWSherwin Wu
Yeah
- MCMartin Casado
... can you talk just a bit about like philosophy around pricing on these things? Is it different for product versus API or like?
- SWSherwin Wu
Yeah. I think, um, the, the, the, the honest truth here is like it's evolved over time as well. And, and like I actually think the simplest... Like, the reason why we've done, uh, usage-based pricing on the API honestly is because it's then, like, it, it's closest to how it's actually being used.
- MCMartin Casado
Yeah.
- SWSherwin Wu
And so that's kinda how we, how we started. I actually think usage-based pricing on the API has h- has, has, like, surprisingly held strong. And, like, I actually think this might be something that we'll keep doing for, for quite a long time. Mostly because, um-
- MCMartin Casado
The cogs are so high, I don't know how you don't do usage-based.
- SWSherwin Wu
Yeah, yeah, yeah, yeah.
- MCMartin Casado
I just don't know h- how that-
- SWSherwin Wu
Yeah
- MCMartin Casado
... evolves.
- SWSherwin Wu
And then, and then, and then there's also the strategy of, like, how we price it. And, and, and internally o- one thing we do is, is, uh, we always make sure that we actually price our, our, our, our usage-based pricing from a, like, cost-plus perspective.
- MCMartin Casado
Right.
- SWSherwin Wu
Like, we're n- we're actually just, like-
- MCMartin Casado
Yeah
- SWSherwin Wu
... trying to make sure that we're being responsible-
- MCMartin Casado
Yeah, yeah
- SWSherwin Wu
... from a-
- MCMartin Casado
Totally makes sense
- SWSherwin Wu
... from a margin perspective. Uh-
- MCMartin Casado
Yeah, yeah, yeah. By the way, this is a huge shift in the industry in general, just because, like, I remember the shift from on-prem to, uh, to recurring.
- SWSherwin Wu
Yeah.
- MCMartin Casado
That was a big, big deal. Like, that created Zuora. Like, it like created whole companies.
- SWSherwin Wu
Yep, yep, yep.
- MCMartin Casado
There's like whole books on it and all, like-
- SWSherwin Wu
Yeah, yeah, yeah
- MCMartin Casado
... a bunch of consultants on how you do this. It changed, like-
- SWSherwin Wu
Yeah
- 36:46 – 42:57
Why open-weights don’t kill the API
- SWSherwin Wu
stay with that for a while.
- MCMartin Casado
So how do you think about open source? I mean, uh, you know, I think you're the only big lab that's releasing open source. Is that-
- SWSherwin Wu
No, Google has, uh, some of theirs.
- MCMartin Casado
Okay.
- SWSherwin Wu
Um, yeah, mostly smaller models on their side.
- MCMartin Casado
Yeah, yeah, yeah.
- SWSherwin Wu
Yep.
- MCMartin Casado
That's right.
- SWSherwin Wu
Yep, yep.
- MCMartin Casado
Yeah. So how do you think about open source vis-a-vis, you know, competition, cannibalization, you know? Like, what's the, uh, what's the strategical... What's the complexity?
- SWSherwin Wu
Yeah, yeah. Um, so, uh, I personally love open source. Like, I, I think, I think it's great that there's a, there's a-
- MCMartin Casado
All of us grew up with it, right? [laughs]
- SWSherwin Wu
Yeah, all of us grew up with it. Like, the internet wouldn't exist without it. Like, you know, so much of the world, world, world is built, built on top of it.
- MCMartin Casado
Cloud wouldn't exist without of it, basically.
- SWSherwin Wu
Yeah, yeah.
- MCMartin Casado
Nothing would exist without of it-
- SWSherwin Wu
Yeah
- MCMartin Casado
... except for maybe Windows.
- SWSherwin Wu
And so it, it was interesting 'cause, like, I felt like over the last... There was before we launched the open source model, I know Sam feels this way as well.
- MCMartin Casado
Yeah.
- SWSherwin Wu
It's like there's this, like, weird, like, you know, uh, mindset where because OpenAI hadn't launched anything, it just seemed like it was super, like, anti... Like, OpenAI was, like, super anti-open source.
- MCMartin Casado
Yeah, yeah.
- SWSherwin Wu
But I'd actually been having conversations with Sam ever since I joined about open sourcing a model.
- MCMartin Casado
Mm-hmm.
- SWSherwin Wu
We were trying to think about, like, well, how can we sequence it?
- MCMartin Casado
Yeah.
- SWSherwin Wu
What compute is always the hard thing. It's like, do we have the compute to kinda like train, train this thing?
- MCMartin Casado
Yeah.
- SWSherwin Wu
So we've always wanted to, to-
- MCMartin Casado
Yeah
- 42:57 – 45:47
Different stacks for text, images, video
- MCMartin Casado
Like-
- SWSherwin Wu
Yeah
- MCMartin Casado
... how, how do you operationalize?
- SWSherwin Wu
Yeah. I think, uh, I, I think you're totally right. It's an anti-pattern. It's pretty tough to pull off. Um, uh, I think honestly, like props to Mark on our research team for like, you know, s- structuring things in a way where we're, we're able to do it. Um, from, from my perspective, I think the biggest thing is I think our, like, image, uh, like our, our, um, I think called like the world simulation team or like the team that builds Sora and all that under Aditya-
- MCMartin Casado
Yeah
- SWSherwin Wu
... uh, is just extremely solid. Like they are pro- it's like the highest concentration of, like, talent that I've seen-
- MCMartin Casado
Wait, but no-
- SWSherwin Wu
... in a while
- MCMartin Casado
... so but, but is it the same, like, is it the sa- is it like, are they like totally separate infrastructure? Do they use the same infrastructure?
- SWSherwin Wu
Yeah, yeah, yeah. So it's, it's, it's actually like pretty separate. So and I think that's part of the-
- MCMartin Casado
Oh, okay. Yeah
- SWSherwin Wu
... the reason why we're able to-
- MCMartin Casado
That makes sense
- SWSherwin Wu
... to kind of do this.
- MCMartin Casado
Yeah, yeah, yeah.
- SWSherwin Wu
Well, it's like one is like the team needs to be extremely strong, which, which, which they are.
- MCMartin Casado
Yeah. That's fair.
- SWSherwin Wu
And then two is, um, they're, they're, they're, they're run very separately. They're kind of like thinking about their own, uh, par- particular roadmap. They think about productization very separately as well, right? Which is how like the Sora app kind of, uh-
- MCMartin Casado
Yeah, yeah, yeah
- SWSherwin Wu
... came, came, came out of that as well. Um-And then, uh, yeah, even like the inference stacks are, are, are slightly different-- are, are, are kind of like different. Um, they, they, they own a lot more around their-
- MCMartin Casado
Right
- SWSherwin Wu
... inference stack and they optimize their inference stack pretty, pretty, uh, separately. And so, um-
- MCMartin Casado
Maybe you could-
- SWSherwin Wu
I think that, that contributes to, to helping us run things in parallel-
- MCMartin Casado
I see
- SWSherwin Wu
... but, uh, uh, it's, it's pretty hard to pull off, for sure.
- MCMartin Casado
May-may-maybe you can educate, uh, this on me. Like, so I think about APIs as mostly text-based for an opening. Do you guys do actual... Do you do actual-
- SWSherwin Wu
Yeah
- MCMartin Casado
... pixel-based stuff?
- SWSherwin Wu
Yeah, yeah. We do. We, we have a bunch. Uh, so DALL-E, DALL-E-
- 45:47 – 53:21
How the agent builder actually works
- SWSherwin Wu
Yep.
- MCMartin Casado
It seems to me that like actually how you build agents and expose them has evolved too.
- SWSherwin Wu
Yeah.
- MCMartin Casado
So maybe you can talk a bit about that.
- SWSherwin Wu
Yeah. Yeah. I think, um... So at, at Dev Day this year when we launched our agent builder, I got a bunch of questions around this because-
- MCMartin Casado
Yeah, that's what I was gonna figure about exactly. Yeah, yeah, yeah
- SWSherwin Wu
... the agent builder is like, yeah, is like the bunch of different nodes-
- MCMartin Casado
It's different. Totally
- SWSherwin Wu
... and it's like a deterministic thing.
- MCMartin Casado
Yeah.
- SWSherwin Wu
And everyone's like, "Oh, is this really like the future, future of agents?" And, um, we o-obviously put a lot of thought into this when we were thinking about building that product, but the way I think about it is-
- MCMartin Casado
Do you think, do you think they came from a point of being constrained, by the way? They're like, "Oh, this is too constraining," and like...
- SWSherwin Wu
Yeah. I think people are like, "It, it's too constraining. It's not like AGI forward."
- MCMartin Casado
No, that's right. That, yeah
- SWSherwin Wu
... you know, like at the end of the... Again, at the, at the end of the day, the AGI will be able to do everything.
- MCMartin Casado
[laughs]
- SWSherwin Wu
And so like why not-
- MCMartin Casado
So why, why, why-
- SWSherwin Wu
Why have nodes in this like node builder thing?
- MCMartin Casado
... make nodes do everything? Just tell it what to do. [laughs]
- SWSherwin Wu
Um, yeah. And so I think there's like two things at play here. One of them is like there is a, like practicality component.
- MCMartin Casado
Yeah.
- SWSherwin Wu
And then the other thing is I think there are actually like different types of work that exist out there that could be automated into agents.
- MCMartin Casado
Okay.
- SWSherwin Wu
And so on the practicality side is, yeah, like the models today just like maybe in some future world, instruction following would be so good that you just like ask it to do this four-step process and it like always does the four-step process exactly.
- MCMartin Casado
Yeah.
- SWSherwin Wu
Um, we're still not there yet, and in the meantime, you know, this entire industry being born and a lot of, you know, people still wanna use these models, so like what, what can you build for them? So there's a practicality component of it.
- MCMartin Casado
When, when, when did you launch that?
- SWSherwin Wu
Uh, Dev Day. So [laughs] it feels like forever ago. Uh, earlier this month, uh, October, uh-
- MCMartin Casado
[laughs] It feels so funny
Episode duration: 53:22
Install uListen for AI-powered chat & search across the full episode — Get Full Transcript
Transcript of episode 3x0jhpEj_6o
Get more out of YouTube videos.
High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.
Add to Chrome