How I AI

Gemini 3 vs. Claude Opus 4.5 vs. GPT-5.1 Codex: Which AI model is the best designer?

I put three cutting-edge AI models to the test in a head-to-head design competition. Using the exact same prompt, I challenged Google’s Gemini 3, Anthropic’s Opus 4.5, and OpenAI’s Codex 5.1 to redesign my blog page, evaluating them on visual design quality, user experience improvements, and SEO optimization capabilities. One model produced a beautiful, polished, production-ready redesign. One was fine. And one completely whiffed. If you’re trying to figure out where each model fits in your workflow (design, planning, back-end, or something else), this episode will save you a lot of trial and error.

*What you’ll learn:*

  1. How each AI model approaches the same design challenge differently
  2. Why planning capabilities dramatically impact design quality
  3. The specific visual and functional improvements each model made
  4. Which model excels at front-end design versus back-end functionality
  5. How to strategically choose the right AI model for different parts of your workflow
  6. The importance of model-switching based on specific use cases

*Blog design:* https://www.chatprd.ai/blog

*Brought to you by:*

  • Lovable (build apps by simply chatting with AI): https://lovable.dev/

*Where to find Claire Vo:*

  • ChatPRD: https://www.chatprd.ai/
  • Website: https://clairevo.com/
  • LinkedIn: https://www.linkedin.com/in/clairevo/
  • X: https://x.com/clairevo

*In this episode, we cover:*

  • (00:00) Introduction to the AI design challenge
  • (01:25) The question: Which model is the better designer?
  • (03:08) The prompt used for all three models
  • (04:10) Gemini 3 Pro’s approach and results
  • (06:00) Opus 4.5’s approach and results
  • (10:54) Codex 5.1’s approach and disappointing results
  • (14:51) Comparing the three designs side by side
  • (16:03) Analyzing the change logs and SEO improvements from each model
  • (22:43) Final verdict
  • (23:00) Conclusion and next steps

*Tools referenced:*

  • Gemini 3 Pro: https://deepmind.google/models/gemini/pro/
  • Anthropic Opus 4.5: https://www.anthropic.com/news/claude-opus-4-5
  • OpenAI Codex 5.1: https://platform.openai.com/docs/models/gpt-5.1-codex
  • Cursor: https://cursor.com/

Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email jordan@penname.co.

Claire Vo (host)
Dec 3, 2025 · 25m

Episode Details

EPISODE INFO

Released: December 3, 2025
Duration: 25m
Channel: How I AI


SPEAKERS

  • Claire Vo (host)

EPISODE SUMMARY

In this episode of How I AI, three coding models redesign a blog page, and Opus wins decisively. Host Claire Vo runs a “one-shot” redesign challenge on an underwhelming ChatPRD blog page, using the same codebase and prompt in Cursor across three new coding models: Gemini 3 Pro, Opus 4.5, and GPT-5.1 Codex.

RELATED EPISODES

  • Claude Code Just Got WAY More Powerful
  • Quests, token leaderboards, and a skills marketplace: the elite AI adoption playbook | John Kim
  • The internal AI tool that's transforming how Stripe designs products | Owen Williams
  • A complete beginner's guide to coding with AI: From PRD to generating your very first lines of code
  • How Microsoft's AI VP automates everything with Warp | Marco Casalaina
  • How to turn meeting notes into prototypes that your sales team can immediately demo to customers
