Silicon Valley Girl

$6.6B AI CEO: How to Make Your First $10,000 with AI

Get the best domain for your business with https://get.online/marina4 or use coupon code: Marina to get your perfect domain for just 99 cents for the first year.

In this interview, Mati Staniszewski, CEO & co-founder of @ElevenLabs, unpacks the future of voice AI: from real sales/support voice agents that speak any language in your voice, to a voice marketplace already paying creators millions, to the safeguards we'll need as agents start calling… other agents. I dig into practical playbooks, real costs, the tools to use, and where the next $10k-a-month opportunities are hiding for solo operators and SMBs.

Chapters:
00:00 In this video
01:23 Role of voice in AI
02:28 Can AI Voice agent generate and convert leads
05:22 Get the best domain for your business
07:14 How to set up voice agents in your business
12:00 How to make money by selling your voice
15:18 How to clone your voice
17:37 The future of Voice AI
21:37 Deepfakes & the 3-layer safeguard model
25:10 Main fears of an AI founder
27:35 Jobs at risk and how to adapt
31:23 Top 3 AI tools
33:50 The $10k/mo voice-agent opportunity
36:49 Advice for everyone who's starting out
41:53 Will we still learn languages?

Guest: Mati Staniszewski · Host: Marina Mogilko
Oct 4, 2025 · 43m

At a glance

WHAT IT’S REALLY ABOUT

ElevenLabs CEO on voice agents, monetization, and deepfake safeguards

  1. Mati argues voice is becoming a primary interface to AI because it carries emotion, context, and usability that text cannot, and businesses are rapidly adopting voice agents for support, sales, and product navigation.
  2. He explains how companies can deploy low-latency voice agents by connecting speech, LLM reasoning, and business workflows (calendar booking, knowledge bases, handoffs), often for hundreds of dollars per month plus telephony integrations like Twilio.
  3. On the creator side, ElevenLabs’ voice marketplace lets users authenticate, clone their voice with ~30 minutes of recording, and earn royalties when others use it—about $5M paid to the community so far, with earnings skewed toward distinctive voices/accents.
  4. Mati and Marina also discuss risks: impersonation will happen, so the future needs an “assume AI by default” mindset plus a three-layer verification model (device authenticity, authenticated/watermarked AI, and default distrust when unverified), alongside job shifts in which people who use AI replace those who don’t.
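
The three-layer verification model in point 4 can be read as a simple decision order. The sketch below is only an illustration of that reading, not anything shown in the interview: the `CallSignals` fields, the `classify` function, and the label strings are all hypothetical, and checking device authenticity before the AI watermark is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CallSignals:
    """Hypothetical signals attached to an incoming call or audio clip."""
    device_attested: bool      # layer 1: the capturing device vouches the audio is real
    ai_watermark_valid: bool   # layer 2: the audio carries a valid AI watermark/authentication


def classify(signals: CallSignals) -> str:
    """Apply the three layers in order; anything unverified is treated as AI."""
    if signals.device_attested:
        return "verified human capture"
    if signals.ai_watermark_valid:
        return "authenticated AI"
    # Layer 3: default distrust when nothing can be verified.
    return "unverified: assume AI"


if __name__ == "__main__":
    print(classify(CallSignals(device_attested=False, ai_watermark_valid=False)))
```

The important design choice is the last branch: the absence of proof defaults to "assume AI" rather than "assume human".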

IDEAS WORTH REMEMBERING

5 ideas

Voice will be a dominant interface for AI interactions.

Mati emphasizes voice transmits more information than text—emotion, inflection, imperfections—making it both a richer input signal and a more natural, “pleasurable” output for users.

Voice agents already convert, especially in self-serve tiers.

ElevenLabs uses its own agents to accelerate inbound leads; the agent can convert directly for business-tier self-serve plans, while enterprise still requires KYC and human-assisted processes.

Deployment is less about coding and more about business logic.

The platform abstracts the hard orchestration (speech + LLM + TTS latency), but success depends on mapping your knowledge base, defining workflows (“if X then trigger Y”), and integrating systems like calendars, checkout links, or CRM.
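
To make the “if X then trigger Y” idea concrete, here is a minimal sketch of workflow rules as plain data. It does not use the ElevenLabs API; the `Rule` class, the keyword lists, and the action names (`send_calendar_link`, `send_checkout_link`, `handoff_to_human`, `answer_from_knowledge_base`) are hypothetical placeholders for whatever a real deployment would connect to.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rule:
    """One 'if X then trigger Y' rule: keywords to look for, action to trigger."""
    keywords: tuple
    action: str


# Hypothetical business logic for a small business; order matters, first match wins.
RULES = [
    Rule(("book", "schedule", "appointment"), "send_calendar_link"),
    Rule(("buy", "price", "upgrade"), "send_checkout_link"),
    Rule(("human", "representative"), "handoff_to_human"),
]

# When no rule matches, fall back to answering from the knowledge base.
DEFAULT_ACTION = "answer_from_knowledge_base"


def route(utterance: str) -> str:
    """Return the action name for a caller's transcribed utterance."""
    text = utterance.lower()
    for rule in RULES:
        if any(word in text for word in rule.keywords):
            return rule.action
    return DEFAULT_ACTION


if __name__ == "__main__":
    for line in ("Can I book a demo for Tuesday?", "What are your opening hours?"):
        print(line, "->", route(line))
```

Keyword matching is a deliberate simplification; in practice the LLM would likely decide which workflow applies, but the owner still has to write down the rules and the systems they trigger.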

A practical SMB setup can be cost-effective to start.

For smaller businesses, Mati suggests initial costs in the “hundreds of dollars per month” range, with telephony brought via integrations such as Twilio and existing phone numbers.

The voice marketplace can generate real royalties, but outcomes are power-law distributed.

Creators can authenticate and share a cloned voice on ElevenLabs’ marketplace and earn when it’s used. With nearly 10,000 shared voices and ~$5M paid out (approaching ~$10M total by his estimate), most creators may earn modest amounts (e.g., hundreds of dollars per month), while unique voices can become breakout hits.

WORDS WORTH SAVING

5 quotes

Voice will be the... one of the key interfaces to the technology around us.

Mati Staniszewski

You don't have to be the coder. You just need to...

Mati Staniszewski

I think it's going to happen.

Mati Staniszewski

By default, it's AI, and you assume it's AI.

Mati Staniszewski

All the people that will be replaced will be replaced by people that use AI.

Mati Staniszewski

Voice as the next AI interface
Voice agents for customer support and sales conversion
How to deploy agents: knowledge base, workflows, integrations
Voice marketplace royalties and passive income mechanics
Voice cloning quality nuances (context/conditioning, mixing)
Deepfakes, authentication, watermarking, and trust models
Jobs at risk, adapting via AI + domain expertise
Top recommended AI tools (Claude, Black Forest Labs, Lovable/v0/Replit)
$10k/month opportunity: SMB voice-agent deployment services
Languages in an era of real-time translation

High quality AI-generated summary created from speaker-labeled transcript.
