At a glance
WHAT IT’S REALLY ABOUT
GPT-5 review: engineer-first model excels at code and specs
- Claire Vo shares an early-access, workflow-driven evaluation of OpenAI’s GPT-5, arguing it feels “built by engineers for engineers” with standout strength in coding, technical writing, and functional requirements detail.
- In side-by-side tests within ChatPRD, GPT-5 tends to jump quickly to implementation (“what/how”) versus GPT-4.1’s more business/discovery framing (“who/why”), which can be a mismatch for stakeholder-facing artifacts.
- She finds GPT-5’s verbosity and specificity can produce stronger downstream prototyping outcomes (more components/ideas), even if the raw PRD can feel overly dense for alignment.
- Beyond developer use, she highlights improvements in ChatGPT Canvas/front-end taste and notably stronger image-generation spatial awareness (tested via a “bathroom remodel” benchmark), while flagging tradeoffs like heavy tool-calling and bullet-pointy style.
IDEAS WORTH REMEMBERING
5 ideasGPT-5 is optimized for execution, not discovery.
Across PRD brainstorming and feature ideation, GPT-5 rapidly converges on concrete features and implementation details, while GPT-4.1 spends more time on business goals, personas, and metric framing—better for stakeholder alignment.
For functional requirements and tech specs, GPT-5 clearly outclasses GPT-4.1.
Vo highlights GPT-5’s unusually detailed, engineer-friendly requirements (edge cases, warnings, prioritized tables) and technical considerations, making it well-suited for engineering handoff and spec writing.
GPT-5’s “developer artifacts” leak into non-dev documents.
Even when asked for a prose PRD, GPT-5 adds code-like elements (e.g., code-block comments) and defaults to markdown bullets, signaling strong developer training but requiring style constraints for business docs.
Verbosity is a tradeoff: better build fidelity, worse readability for stakeholders.
More detail can help engineers and coding agents implement accurately, but can dilute the core narrative for executives or cross-functional partners who need concise alignment and decision-ready summaries.
More detailed PRDs can yield richer prototypes—even if uglier by default.
In the v0 prototype comparison, GPT-4.1 produced a cleaner, more colorful design, but GPT-5 generated a more component-dense prototype (upgrade widgets, locked states, trial flows), offering more ideation material to pick from.
WORDS WORTH SAVING
5 quotesFrom my very first interaction, I felt like this was a engineer built by engineers for engineers.
— Claire Vo
GPT-5… loves a bullet point list.
— Claire Vo
Tell me what to build, tell me exactly how the features work… give me something to code.
— Claire Vo
Girlfriend loves to call a tool.
— Claire Vo
My benchmark is: Can it reasonably help with my bathroom remodel?
— Claire Vo
High quality AI-generated summary created from speaker-labeled transcript.
Get more out of YouTube videos.
High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.
Add to Chrome