$6.6B AI CEO: How to Make Your First $10,000 with AI
At a glance
WHAT IT’S REALLY ABOUT
ElevenLabs CEO on voice agents, monetization, and deepfake safeguards
- Mati argues voice is becoming a primary interface to AI because it carries emotion, context, and usability that text cannot, and businesses are rapidly adopting voice agents for support, sales, and product navigation.
- He explains how companies can deploy low-latency voice agents by connecting speech, LLM reasoning, and business workflows (calendar booking, knowledge bases, handoffs), often for hundreds of dollars per month, with telephony handled through integrations such as Twilio.
- On the creator side, ElevenLabs’ voice marketplace lets users authenticate, clone their voice from ~30 minutes of recording, and earn royalties when others use it. About $5M has been paid to the community so far, with earnings skewed toward distinctive voices and accents.
- They also discuss risks: impersonation will happen, so the future needs an “assume AI by default” mindset plus a three-layer verification model (device authenticity, authenticated/watermarked AI, and default distrust when unverified). They also cover job shifts, in which people who use AI replace those who don’t.
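The three-layer verification idea can be read as a small decision procedure. The sketch below is only an illustration of that reading: the `Trust` enum, the `classify` function, and both boolean inputs are hypothetical names, and the order in which the layers are checked is an assumption, not something stated in the interview.

```python
from enum import Enum


class Trust(Enum):
    """Hypothetical trust levels mirroring the three layers in the summary."""
    AUTHENTICATED_AI = "authenticated or watermarked AI"
    VERIFIED_DEVICE = "authentic device"
    ASSUME_AI = "unverified: assume AI"


def classify(ai_watermark_valid: bool, device_attested: bool) -> Trust:
    """Classify a piece of audio under an 'assume AI by default' policy.

    Assumption: a valid AI watermark is checked first, then device
    authenticity. Anything that passes neither check is distrusted.
    """
    if ai_watermark_valid:
        return Trust.AUTHENTICATED_AI
    if device_attested:
        return Trust.VERIFIED_DEVICE
    return Trust.ASSUME_AI


if __name__ == "__main__":
    # Audio with no verifiable signal falls through to the default.
    print(classify(ai_watermark_valid=False, device_attested=False).value)
```

The design point is the final `return`: trust is never granted by omission, so unverified audio is always treated as AI.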
IDEAS WORTH REMEMBERING
5 ideas
Voice will be a dominant interface for AI interactions.
Mati emphasizes voice transmits more information than text—emotion, inflection, imperfections—making it both a richer input signal and a more natural, “pleasurable” output for users.
Voice agents already convert, especially in self-serve tiers.
ElevenLabs uses its own agents to accelerate inbound leads; the agent can convert directly for business-tier self-serve plans, while enterprise still requires KYC and human-assisted processes.
Deployment is less about coding and more about business logic.
The platform abstracts the hard orchestration (speech + LLM + TTS latency), but success depends on mapping your knowledge base, defining workflows (“if X then trigger Y”), and integrating systems like calendars, checkout links, or CRM.
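To make the “if X then trigger Y” idea concrete, here is a minimal sketch of workflow rules expressed as data. It is not ElevenLabs’ API: the `Rule` class, the keyword triggers, and the action strings are all hypothetical, and a real agent would rely on the LLM rather than keyword matching to detect intent.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Rule:
    """One hypothetical 'if X then trigger Y' workflow rule."""
    name: str
    matches: Callable[[str], bool]  # the "if X" condition
    action: str                     # the "trigger Y" step, as a label


# Hypothetical rules covering the integrations named in the summary.
RULES: List[Rule] = [
    Rule("booking", lambda u: "book" in u or "schedule" in u,
         "book_calendar_slot"),
    Rule("purchase", lambda u: "buy" in u or "upgrade" in u,
         "send_checkout_link"),
]

# Assumption: anything no rule covers is handed to a human.
FALLBACK_ACTION = "handoff_to_human"


def route(utterance: str) -> str:
    """Return the action label of the first matching rule, else the fallback."""
    text = utterance.lower()
    for rule in RULES:
        if rule.matches(text):
            return rule.action
    return FALLBACK_ACTION


if __name__ == "__main__":
    for line in ["Can I book a demo?", "I want to upgrade",
                 "My invoice is wrong"]:
        print(line, "->", route(line))
```

Keeping the rules in a list rather than in nested conditionals reflects the point above: the work is in writing down the business logic, not in the code around it.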
A practical SMB setup can be cost-effective to start.
For smaller businesses, Mati suggests initial costs in the “hundreds of dollars per month” range, with telephony added through integrations such as Twilio that can use existing phone numbers.
The voice marketplace can generate real royalties, but outcomes are power-law distributed.
Creators can authenticate and share a cloned voice on ElevenLabs’ marketplace and earn when it’s used. With nearly 10,000 shared voices and about $5M paid out (approaching ~$10M total by his estimate), most creators may earn modest amounts (e.g., hundreds of dollars per month), while distinctive voices can become breakout hits.
WORDS WORTH SAVING
5 quotes
Voice will be the... one of the key interfaces to the technology around us.
— Mati Staniszewski
You don't have to be the coder. You just need to...
— Mati Staniszewski
I think it's going to happen.
— Mati Staniszewski
By default, it's AI, and you assume it's AI.
— Mati Staniszewski
All the people that will be replaced will be replaced by people that use AI.
— Mati Staniszewski
High quality AI-generated summary created from speaker-labeled transcript.