CHAPTERS
- 0:00 – 1:55
Why this episode matters: a rare look at how OpenAI PMs actually work
Aakash frames the episode around OpenAI’s explosive growth and the lack of public examples showing real OpenAI PM workflows. He introduces Abhi Muchhal (International Growth PM at OpenAI) and previews live Codex-based building and automation demos.
- 1:55 – 3:48
Codex as an “agent”: what it unlocks for PM leverage
Abhi explains the evolution from chatbot → collaborator → agent, and why Codex changes his day-to-day PM work. He highlights two main value streams: automating repetitive PM tasks and enabling PMs to build prototypes/features to 70–80% without waiting on engineering bandwidth.
- 3:48 – 5:34
Live demo: an international growth dashboard that replaces “seven dashboards and a headache”
Abhi demos a web app he built with Codex that consolidates multiple internal analytics sources into one international growth cockpit. The dashboard supports per-country views, headline metrics, strengths/risks, and deeper competitive benchmarking, updated daily via automation.
- 5:34 – 10:04
Generalizable takeaway: synthesis + TL;DR beats raw dashboards
Aakash challenges the “why not just use Databricks?” question, and Abhi clarifies the core principle. Codex adds value by aggregating cross-tool data and generating interpretation—reducing cognitive load and turning metrics into prioritized takeaways.
- 10:04 – 11:32
How to build with Codex: defining inputs/outputs, previews, and Playwright testing
Abhi walks through the build flow inside Codex: specify desired output behavior and list data inputs/connectors, then let Codex scaffold the app. He shows how Codex runs local previews and uses Playwright/browser screenshots to self-diagnose UI issues and iterate quickly.
- 11:32 – 14:52
From PRDs to prototypes: a new PM workflow (with a companion FAQ doc)
Abhi describes replacing long PRDs with prototypes as the primary artifact for alignment and feedback. He still maintains a lightweight companion document to cover hypotheses, success metrics, and stakeholder concerns, but the prototype becomes the “main show.”
- 14:52 – 21:23
Shipping mechanics: working locally vs. real repo, and how PMs get to 80%
Abhi explains when he builds locally (internal tools) versus working against the actual ChatGPT codebase. A key tactic is asking engineers for the closest existing reference in the repo, pointing Codex to it, and iterating to a real PR engineers can refine.
- 21:23 – 28:37
Codex use cases and honest limitations: repetitive automation vs. net-new building
Abhi buckets Codex use into (1) automating repetitive PM work and (2) unlocking new capabilities like dashboards and prototypes. He also shares failure modes: ambiguity in data definitions, and imperfect signal-to-noise in summaries, requiring human review.
- 28:37 – 30:06
OpenAI’s Slack-heavy operating system: three pre-day-start automations
Abhi shows how Codex fits into OpenAI’s communication workflow by automating Slack triage, dashboard refresh, and weekly updates. These automations reduce missed messages across time zones and create reliable, repeatable reporting cadences.
- 30:06 – 33:05
Harness + skills: why “the Codex harness” is the real differentiator
Abhi argues the biggest unlock isn’t just the model—it’s the harness: connectors, skills, and repeatable workflows. He describes a team-built “experiment review” skill that ingests Statsig, tracks progress, drafts hypotheses/postmortems, and proposes recommendations.
- 33:05 – 37:00
Personal agent demos: Codex “computer use” with WhatsApp + Calendar actions
Abhi demonstrates Codex computer use to triage WhatsApp messages and identify actionable items. He then pushes it further: Codex reads a message, checks Google Calendar availability, drafts a reply in WhatsApp, and leaves final sending to the user for control.
- 37:00 – 43:42
Building a 1040 tax-filing app with Codex—and how to think about safety
Abhi shares a personal project: a web app that ingests tax documents and outputs a complete 1040 form. He cross-checked it with his accountant, found a missed income source, and discusses safe usage via data controls and permissioned action levels.
- 43:42 – 47:18
International growth at OpenAI: serving the majority of humanity
Abhi explains the mission-driven rationale for international growth: most users live outside the US and have different contexts and workflows. He outlines his scope across three layers—model improvements, product use-case surfacing, and top-of-funnel storytelling/partnerships.
- 47:18 – 59:26
What drove ChatGPT to ~900M weekly actives: search, multimodality, and ImageGen
Abhi frames growth as expanding beyond knowledge workers and students, especially in markets where knowledge workers are a small minority. Feature breakthroughs like Search (fresh info) and ImageGen (multimodal, low-text friction) broaden relevance globally.
- 59:26 – 1:05:27
ImageGen 2 deep dive: biggest quality jump, multilingual text, and pro editing workflows
Abhi showcases ImageGen 2’s improvements: realism, multi-image storytelling, better multilingual character rendering, and finer edit control. He shares practical tips like using “thinking” mode, region-based edits, and ratio formatting, while noting steerability remains an area to improve.
- 1:05:27 – 1:07:06
Breaking into OpenAI as a PM: core skills + living AI + speaking evals
Abhi outlines what matters for PM candidates: core PM fundamentals still apply, but you must actively use AI tools and understand frontier dynamics. He emphasizes evals as the “currency of progress” and shares his personal path—international experience plus a builder mindset from side projects.
