Skip to content
How I AIHow I AI

I let Codex run for 6 hours. Here’s what happened.

In this 30-minute episode, I walk through my favorite feature in Codex: the /goal command. I show how Goals transform AI from a turn-based assistant that needs constant ‘what’s next?’ prompting into an autonomous agent that can work for hours on complex, multi-step tasks. I share three real examples: eliminating thousands of Sentry errors, cleaning 3,900 emails down to 68, and organizing hundreds of Linear tasks. *What you’ll learn:* 1. What Goals are and how they differ from standard prompts 2. How I used /goal to eliminate hundreds of error logs in my codebase over a five-hour autonomous run 3. The non-technical use cases that make Goals incredibly powerful: cleaning up 3,900 emails in under four hours and organizing hundreds of project management tasks in Linear 4. How to write effective /goal prompts with measurable outcomes, verification methods, and constraints 5. When not to use Goals and what makes a strong versus weak Goal 6. Why Goals represent a fundamental shift in how we work with AI, from babysitting the model to managing it *Brought to you by:* Mercury—Radically different banking loved by over 300K entrepreneurs: https://mercury.com/ *In this episode, we cover:* (00:00) Introduction (01:50) What is /goal and when should you use it? (02:45) The difference between prompts and Goal-based loops (04:06) Claire’s first five-hour 45-minute autonomous coding task (05:05) How to manage a Goal lifecycle: view, pause, resume, and clear (06:06) How to write strong goals: outcomes vs. outputs (07:34) The six components of effective Goals (08:57) Example: Reducing P95 checkout latency with /goal (09:36) Demo: Using /goal to eliminate Sentry errors in ChatPRD (13:18) Demo: Burning down Vercel API errors (17:28) Non-technical use case: Cleaning 3,900 emails with /goal (21:24) Demo: Using /goal to clean up Linear project tasks (24:41) When not to use /goal (26:10) Why /goal changes everything *Tools referenced:* • Codex: https://openai.com/codex/ • Sentry: https://sentry.io/ • Vercel: https://vercel.com/ • Linear: https://linear.app/ *Other reference:* • OpenAI blog post “Using Goals in Codex”: https://developers.openai.com/cookbook/examples/codex/using_goals_in_codex *Where to find Claire Vo:* ChatPRD: https://www.chatprd.ai/ Website: https://clairevo.com/ LinkedIn: https://www.linkedin.com/in/clairevo/ X: https://x.com/clairevo _Production and marketing by https://penname.co/._ _For inquiries about sponsoring the podcast, email jordan@penname.co._

Claire Vohost
May 27, 202630mWatch on YouTube ↗

CHAPTERS

  1. Why Codex “Goals” enables overnight-style autonomy

    Claire Vo frames the episode around Codex’s /goal feature and why it’s the missing piece behind long-running, autonomous coding sessions people share online. She previews both technical and non-technical use cases and sets expectations for when this approach is worth using.

  2. When to use /goal vs. a normal prompt (prompting vs. looping)

    Using OpenAI’s diagram, Claire contrasts turn-based prompting with a goal-based loop. The core difference is that /goal keeps iterating—work, verify, decide next step—until evidence shows the goal is met.

  3. Claire’s first 5h45m autonomous coding run—and why it mattered

    Claire explains that prior agent tools didn’t reliably self-manage long, multi-hour tasks for her, but /goal did. She describes this as a step-change in practical autonomy even for “non-operating-system” style work.

  4. Managing the Goal lifecycle: view, pause, resume, clear

    She outlines the operational controls for safely running autonomous loops. You can start a goal, inspect it, pause if it’s drifting, resume later, or clear it entirely.

  5. Writing strong goals: define outcomes and proof, not tasks

    Claire shifts to the craft of goal-writing, emphasizing outcomes over outputs. A strong goal specifies what success looks like and how it will be validated, similar to well-written OKRs or success criteria.

  6. The six components of an effective /goal

    She breaks down OpenAI’s recommended structure for robust goals. The six components ensure the agent can iterate safely, validate progress, and stop responsibly when blocked.

  7. Example goal: reducing P95 checkout latency with guardrails

    Claire walks through a concrete template: reduce P95 checkout latency below a threshold while keeping correctness green. The example illustrates how verification + constraints prevent ‘cheating’ solutions (like deleting the page).

  8. Case study: eliminating Sentry ‘invalid edit operation’ errors in ChatPRD

    Claire shows how she used /goal to systematically burn down a large class of recurring Sentry errors caused by a complex diff-based document editor. The agent categorized root causes, implemented fixes, replayed historical events, and iterated until errors dropped to zero.

  9. Live demo setup: burning down Vercel API errors with a measurable success state

    Claire starts a live /goal targeting Vercel log errors on a chat endpoint. She writes a success criterion (no user-facing errors; downgrade non-critical ones to warnings) and instructs Codex to classify, fix, validate against two weeks of logs, and open PRs.

  10. Non-technical killer use case: cleaning 3,900 emails with Gmail access

    Claire demonstrates using /goal with a Gmail plugin to triage a massive inbox: categorizing, labeling, unsubscribing, and flagging items needing judgment. The run took ~3h52m and heavy token usage, but reduced thousands of emails down to a small review set.

  11. Non-technical demo: cleaning up a chaotic Linear backlog (podcast tasks)

    She applies /goal to project hygiene in Linear, focusing on closing stale tasks from past episodes and keeping only forward-looking work. Codex identifies the relevant team, infers rules for what to cancel, and performs bulk updates to restore a usable backlog.

  12. When not to use /goal—and why it changes the way you work

    Claire closes with cautions and a broader thesis: Goals are overkill for tiny edits and fail when the finish line is vague. Used correctly (durable objective + evidence-based finish line + multi-step path), /goal shifts you into ‘manager mode,’ enabling error-zero/tech-debt burn downs and more human-like delegation.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.