Aakash GuptaHe Uses 7 Claude Code Agents to Build Apps with 0 Employees
CHAPTERS
Why Gabor built a “startup OS” inside Claude Code
Aakash frames the episode as a shift from a PM operating system to a full startup operating system run by AI agents. Gabor previews how he uses agents to go from idea to an App Store-ready build quickly—and why the skill gap for PMs will widen fast.
Inside the 21-agent org chart: roles, responsibilities, and why system analyst is central
Gabor walks through his full agent lineup (CTO, designers, performance, test architect, UX flow architect, product council, maintainability, brand, etc.). He explains the guiding idea: model agents like a real software team to improve spec quality and outcomes.
Under the hood: what an agent definition file looks like
Gabor opens a system analyst agent markdown file and explains what it’s instructed to do. The focus is on ambiguity resolution, dependency tracking, and structuring requirements into executable work.
Starting from the consumer Claude app: dictation-driven ideation and role prompting
To make the workflow accessible, Gabor starts in the Claude consumer app using voice dictation. He first asks Claude to define what a good vs bad system analyst does, then primes it to behave like a good one before eliciting requirements.
Connecting Confluence/Jira via Atlassian MCP: why documentation prevents “vibe-code debt”
Gabor sets up an empty Confluence space and Jira board and explains why durable documentation matters for maintainability. He compares one-shot prompting to telling one person to build a house without plans—versus structured specs and iteration.
Anti-spaghetti strategy: maintainability agent, permissions hygiene, and tool boundaries
They discuss what breaks when people skip scaffolding and how to mitigate it. Gabor introduces his maintainability agent and emphasizes permission awareness in Claude Code—especially requests outside the project directory.
Full product spec by voice: Rule Ask (IIHF rules chat app) requirements and guardrails
Gabor dictates a detailed product/technical spec: Flutter + Firebase, Claude API with RAG over IIHF rulebooks, fallback web search, token/cost controls, and strict secret handling. The system analyst then begins clarifying key decisions about UX and limits.
Getting Claude Code ready: project agents setup, troubleshooting, and MCP installs
Gabor distinguishes between “role prompting” in chat and real Claude Code agents. He troubleshoots an agent setup issue by switching directories, authenticates, installs/validates MCPs (Figma, Chrome DevTools), and explains why /BTW-style mid-run instructions matter.
Design pipeline: inspiration → Figma Make style guide → Figma screens built by agents
Gabor shows how he generates a brand/style guide in Figma Make using inspiration images (with “don’t copy” constraints). Then Claude Code uses that style guide and the spec to automatically create real Figma screens for onboarding, chat, settings, etc.
From static screens to clickable prototype: UX flow automation in Figma
Instead of manually wiring prototype links, Gabor uses the UX flow architect + system analyst to add prototype arrows. The result is a clickable end-to-end flow that can guide development and reduce ambiguity.
Ticket factory: Jira epics/stories, sprint tagging, and parallel front-end/back-end planning
The system analyst generates initial backend setup tickets, then the whole agent team expands into full front-end and back-end ticket sets. Gabor enforces a critical rule: front-end tickets must include Figma links/screenshots or agents will produce generic “AI-looking” UI.
Tooling opinions: Claude Code vs Lovable/Bolt/Codex + hot takes on Dispatch/Cowork
Gabor explains why he prefers Claude Code for this workflow and how MCPs are the biggest unlock. He shares mixed experiences with earlier tools (e.g., Lovable reliability loops) and describes Dispatch/Cowork as promising but currently fragile.
Build execution: running sprints, fixing retrieval quality, and shipping a working simulator app
After sprint execution, Gabor notes a retrieval-quality issue (topic identification) and prompts improvements. They run the app in Apple Simulator and demonstrate the core experience, including an “observer mode” that reveals RAG steps, hits, tokens, and latency.
Deployment to TestFlight and App Store realities + career/portfolio takeaways
They upload the build to TestFlight, discuss what remains for App Store submission (screenshots, support/privacy URLs), and warn that Apple review times can be slow for first submissions. The conversation widens into Gabor’s career story (Deliveroo during COVID → Google) and advice: learn by building, not collecting certificates, and use builds as proof in a PM portfolio.
Get more out of YouTube videos.
High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.
Add to Chrome