An exclusive inside look at GPT-5

In this episode, I share my hands-on experience with OpenAI’s GPT-5, the company’s new frontier model. As one of the first users outside of OpenAI to test the model, I put GPT-5 head-to-head with GPT-4.1 across real-world product use cases, from writing PRDs to generating code to assisting with visual design work. This is my unfiltered look at what GPT-5 can (and can’t) do, and how it changes the game for builders.

*What you’ll learn:*
1. How GPT-5 differs from previous models with its engineering-focused approach to problem-solving and tendency to prioritize technical details over business context
2. A comparative analysis of how GPT-5 and GPT-4.1 generate different types of product requirement documents and prototypes for the same prompt
3. Why GPT-5 excels at technical writing, functional requirements, and code generation while potentially skipping important business discovery questions
4. The model’s impressive spatial awareness capabilities when generating images for interior design and other visual tasks
5. Practical considerations for choosing the right model based on your specific use case and audience
6. How GPT-5’s extensive tool-calling behavior and bullet-point communication style reflect its engineering-oriented design

*Brought to you by ChatPRD, an AI copilot for PMs and their teams:* https://www.chatprd.ai/howiai

*25k giveaway:* To celebrate 25,000 YouTube followers, we’re doing a giveaway. Win a free year of my favorite AI products, including v0, Replit, Lovable, Bolt, Cursor, and, of course, ChatPRD, by leaving a rating and review on your favorite podcast app and subscribing to the podcast on YouTube. To enter: https://www.howiaipod.com/giveaway

*Where to find Claire Vo:*
ChatPRD: https://www.chatprd.ai/
Website: https://clairevo.com/
LinkedIn: https://www.linkedin.com/in/clairevo/
X: https://x.com/clairevo

*In this episode, we cover:*
(00:00) Introduction to GPT-5
(04:34) Testing GPT-5 in ChatPRD for document generation
(07:10) Comparing GPT-5 and GPT-4.1 on business vs. technical orientation
(11:22) Side-by-side comparison of PRDs generated by both models
(15:23) Where GPT-5 excels: Technical considerations and documentation quality
(17:35) Comparing prototypes generated from different model outputs
(19:57) Testing homepage critique capabilities between models
(23:14) OpenAI’s strengths in API design and developer support
(25:37) GPT-5’s performance as a coding assistant
(27:26) Examining GPT-5 in ChatGPT’s interface
(28:50) Testing GPT-5’s front-end design capabilities
(31:17) Personal use case: bathroom remodel planning
(33:45) Comparing GPT-5 vs. GPT-4 for interior design visualization
(38:10) Summary of key findings and recommendations

*Tools referenced:*
• OpenAI: https://openai.com/
• ChatGPT: https://chat.openai.com/
• Claude: https://claude.ai/
• Gemini: https://gemini.google.com/
• Cursor: https://cursor.sh/
• v0: https://v0.dev/
• Lovable: https://lovable.dev/
• Bolt: https://bolt.com/
• LaunchDarkly AI Configs: https://launchdarkly.com/docs/home/ai-configs

*Other reference:*
• Benjamin Moore paints: https://www.benjaminmoore.com/

_Production and marketing by https://penname.co/._
_For inquiries about sponsoring the podcast, email jordan@penname.co._

Claire Vo (host)

Aug 6, 2025 · 40 min

WHAT IT’S REALLY ABOUT

GPT-5 review: engineer-first model excels at code and specs

Claire Vo shares an early-access, workflow-driven evaluation of OpenAI’s GPT-5, arguing it feels “built by engineers for engineers” with standout strength in coding, technical writing, and functional requirements detail.
In side-by-side tests within ChatPRD, GPT-5 tends to jump quickly to implementation (“what/how”) versus GPT-4.1’s more business/discovery framing (“who/why”), which can be a mismatch for stakeholder-facing artifacts.
She finds GPT-5’s verbosity and specificity can produce stronger downstream prototyping outcomes (more components/ideas), even if the raw PRD can feel overly dense for alignment.
Beyond developer use, she highlights improvements in ChatGPT Canvas and front-end design taste, plus notably stronger spatial awareness in image generation (tested via a “bathroom remodel” benchmark), while flagging tradeoffs such as heavy tool-calling and a bullet-heavy writing style.

IDEAS WORTH REMEMBERING

5 ideas

GPT-5 is optimized for execution, not discovery.

Across PRD brainstorming and feature ideation, GPT-5 rapidly converges on concrete features and implementation details, while GPT-4.1 spends more time on business goals, personas, and metric framing—better for stakeholder alignment.

For functional requirements and tech specs, GPT-5 clearly outclasses GPT-4.1.

Vo highlights GPT-5’s unusually detailed, engineer-friendly requirements (edge cases, warnings, prioritized tables) and technical considerations, making it well-suited for engineering handoff and spec writing.

GPT-5’s “developer artifacts” leak into non-dev documents.

Even when asked for a prose PRD, GPT-5 adds code-like elements (e.g., code-block comments) and defaults to markdown bullets, signaling strong developer training but requiring style constraints for business docs.

Verbosity is a tradeoff: better build fidelity, worse readability for stakeholders.

More detail can help engineers and coding agents implement accurately, but can dilute the core narrative for executives or cross-functional partners who need concise alignment and decision-ready summaries.

More detailed PRDs can yield richer prototypes—even if uglier by default.

In the v0 prototype comparison, GPT-4.1 produced a cleaner, more colorful design, but GPT-5 generated a more component-dense prototype (upgrade widgets, locked states, trial flows), offering more ideation material to pick from.

WORDS WORTH SAVING

5 quotes

From my very first interaction, I felt like this was a engineer built by engineers for engineers.

— Claire Vo

GPT-5… loves a bullet point list.

— Claire Vo

Tell me what to build, tell me exactly how the features work… give me something to code.

— Claire Vo

Girlfriend loves to call a tool.

— Claire Vo

My benchmark is: Can it reasonably help with my bathroom remodel?

— Claire Vo

• Engineer-first tone and behavior
• ChatPRD PRD side-by-side: GPT-5 vs GPT-4.1
• Business discovery vs implementation focus (who/why vs what/how)
• Functional requirements and technical considerations quality
• Prototype generation differences (v0)
• Tool-calling intensity and token/performance concerns
• Canvas prototyping and image-generation spatial awareness

High quality AI-generated summary created from speaker-labeled transcript.
