Skip to content
OpenAIOpenAI

Shaping model behavior in GPT-5.1— the OpenAI Podcast Ep. 11

What does it mean for an AI model to have "personality"? Researcher Christina Kim and product manager Laurentia Romaniuk talk about how OpenAI set out to build a model that delivers on both IQ and EQ, while giving people more flexibility in how ChatGPT responds. They break down what goes into model behavior and why it's an important, but still imperfect blend of art and science. Chapters - 00:00:43 — GPT-5.1 goals and the shift to reasoning models - 00:02:18 — Differences between GPT-5 and GPT-5.1 - 00:04:55 — Unpacking the model switcher - 00:07:24 — Understanding user feedback - 00:08:27 — Measuring progress on emotional intelligence - 00:10:02 — What is model personality? - 00:14:25 — Model steerability, bias, and uncertainty - 00:21:59 — Advantages of memory in ChatGPT - 00:25:27 — Looking ahead and advice for getting the most out of models

Andrew MaynehostLaurentia RomaniukguestChristina Kimguest
Dec 2, 202528mWatch on YouTube ↗

CHAPTERS

  1. 0:00 – 1:01

    What GPT-5.1 set out to fix: feedback-driven goals and reasoning by default

    Andrew opens with Christina Kim and Laurentia Romaniuk outlining the intent behind GPT-5.1: respond to GPT-5 launch feedback while raising baseline capability. A central shift is making reasoning available across chat models so the system can “decide to think” when needed.

    • GPT-5.1 planning anchored on community feedback from GPT-5
    • All chat models moving to reasoning models as a baseline capability
    • Model dynamically allocates “thinking” based on prompt difficulty
    • Reasoning enables refinement, tool use, and improved instruction following
  2. 1:01 – 2:23

    System 1 vs System 2: when the model thinks—and why it improves everything

    Christina describes how GPT-5.1 can switch between fast responses and deeper deliberation, similar to Kahneman’s System 1/System 2 framing. Even for everyday prompts, improved intelligence and selective thinking boosts broad performance.

    • “Decide to think” behavior varies by prompt complexity
    • Reasoning time improves answer quality and reduces impulsive mistakes
    • Better intelligence lifts non-obvious tasks like instruction following
    • Evals show across-the-board gains when reasoning is default-capable
  3. 2:23 – 4:55

    Productizing the differences: warmth, intuition, and the real causes of ‘cold’ behavior

    Laurentia explains why users perceived GPT-5 as less warm: not only model outputs, but surrounding system factors. Fixes in 5.1 include better context carryover, smoother experiences, and stronger adherence to custom instructions.

    • User feedback: weaker intuition, less warmth in GPT-5
    • Coldness can be caused by context not persisting across turns
    • Auto-switching between chat vs reasoning can create jarring tone shifts
    • 5.1 improves custom instructions consistency and persistence
    • New style/trait controls give users more direct behavioral steering
  4. 4:55 – 5:56

    Demystifying the auto-switcher: navigating multiple models inside one ChatGPT

    The conversation unpacks why there isn’t a single ‘one model’ experience anymore. Laurentia frames the switcher as a UX and quality system that routes users to the best capability profile based on prompt needs and evaluation targets.

    • Different models have different strengths; choice can confuse users
    • UI and model switcher aim to route users automatically
    • Routing is informed by evals (accuracy, detail) and context signals
    • Goal: forecast which model response style/capability is most helpful
  5. 5:56 – 7:15

    A system of models, not a single set of weights: research and product implications

    Christina expands on GPT-5.1 as an ecosystem: multiple reasoning tiers, the auto-switcher itself as a model, plus tool-backed models. This sets up a future where long-thinking ‘deep research’ runs in the background and is invoked as needed.

    • Fast doesn’t have to mean “dumb”—baseline models can still reason
    • Future: long-horizon thinking (minutes) as background tools
    • ChatGPT increasingly becomes an orchestrated system of models/tools
    • Users may assume one set of weights, but experience is assembled
  6. 7:15 – 8:28

    How OpenAI uses user feedback: conversation links, experiments, and signal balancing

    Laurentia describes how feedback is triaged and debugged using shared conversation links and metadata. For auto-switching, teams balance factuality, latency, and user satisfaction signals to decide when switching helps versus harms.

    • Conversation links provide critical context for diagnosing issues
    • Feedback can reveal users were in experiments with edge cases
    • Switcher quality judged by signals: helpfulness, factuality, latency
    • Balancing speed vs quality is both art and science
  7. 8:28 – 10:05

    Measuring ‘EQ’ in models: user signals research, memory, and listening behaviors

    Andrew asks how to quantify emotional intelligence improvements. Christina and Laurentia connect EQ to intent understanding, better reward modeling from user signals, and system features like memory and context carryover that enable listening and continuity.

    • EQ measurement is open-ended; requires new user-signal methodologies
    • Training reward models using prod/user signals to capture intent
    • Smarter models better infer context and respond appropriately
    • “Listening” includes remembering prior messages and logging memory well
    • Style features can contribute to perceived EQ and rapport
  8. 10:05 – 11:41

    What ‘personality’ really means: style/tone vs the full app ‘harness’

    Laurentia reframes personality as an overloaded term: there’s a narrow feature (response style and traits), and a broader experiential personality shaped by UI, latency, context window, and rate limits. The challenge is mapping user complaints to the true component causing the feeling.

    • Personality feature ≈ response style/tone (concise, verbose, emojis)
    • Most users mean the entire product experience, not just wording
    • ‘Harness’ includes UI feel, fonts, latency, context handling, rate limits
    • Model changes and product constraints both alter perceived personality
  9. 11:41 – 13:41

    Post-training is an art: RL tradeoffs, quirks, and preserving steerability

    Christina describes reinforcement learning as a multi-objective balancing act: improving capabilities without losing warmth or helpfulness. Laurentia ties this to OpenAI’s Model Spec goal of maximizing freedom while minimizing harm, warning against overcorrecting quirks in ways that reduce user control.

    • RL reward configurations involve subtle choices and tradeoffs
    • Hard to target subjective goals like warmth without regressions
    • Model Spec principle: maximize user freedom while minimizing harm
    • Removing quirks globally (e.g., banning punctuation) can break steerability
    • Goal: keep behaviors adjustable rather than permanently trained out
  10. 13:41 – 15:43

    From blanket refusals to safe completions: evolving safety without judgment

    They reflect on early ChatGPT being overly refusal-prone and easy to bypass, and how safety has matured. Laurentia highlights “safe completions,” where the model still helps in earnest while avoiding harmful actions, improving tone and usefulness around boundaries.

    • Early systems defaulted to refusals due to misuse concerns
    • Refusals used to sound judgmental and unhelpful
    • Safe completions aim to resolve requests safely rather than just refuse
    • Safety tech increasingly handles nuanced boundary cases
  11. 15:43 – 17:16

    Nuance in harmful-content policies: creative writing, legal work, and context matters

    Andrew and Laurentia discuss real-world friction when models over-sanitize sensitive content, including a lawyer’s casework example. Laurentia argues for library-like access principles paired with strong contextual rules, distinguishing legitimate professional needs from harmful intent.

    • Creative writing and crime/motive discussions should be supported safely
    • Example: legal document weakened when assault details were scrubbed
    • Library analogy: broad access with careful contextualization
    • Key challenge: distinguish legitimate use (legal, research) from abuse (harassment)
    • Ongoing work to improve nuance; more progress still needed
  12. 17:16 – 17:58

    Handling bias and subjective domains: uncertainty, open-endedness, and anchored truth

    Laurentia explains efforts to improve responses in subjective areas: expressing uncertainty appropriately and engaging ideas in earnest while staying grounded in objective truth when it exists. Users should see models become more open-ended and self-directable in ambiguous domains.

    • Focus area: model behavior in subjective/uncertain topics
    • Better uncertainty expression and non-dogmatic exploration
    • Engage user ideas earnestly while staying anchored to objective truths
    • Expected user-visible shifts toward more open-ended responses
  13. 17:58 – 19:37

    A ‘sleeper’ upgrade: wider expressive range and creativity through steerable writing

    The team calls out creativity and expressive range as underappreciated improvements in 5.1. Post-training challenges grow when there’s no single ground truth, but the payoff is a model that can flex between elevated and simple writing when pushed.

    • 5.1 expands expressive range—subtle by default, strong when steered
    • Users can push tone: elevated prose, simplicity, different voices
    • Creativity work is hard because quality is context-dependent and subjective
    • Product opportunity: better ways to surface these capabilities to users
  14. 19:37 – 22:48

    Where model behavior goes next: personalization at scale, inferred expertise, and user control

    They argue one default personality can’t serve 800M+ weekly users, so customization must expand. The discussion covers priming vs inferred personalization, the role of memory, and the importance of transparency and control over what the system infers.

    • Customization is necessary at global scale; steerability will increase
    • Anecdote: expert priming unlocks frontier-level scientific usefulness
    • Future: less explicit prompting as models infer expertise via memory/context
    • PM principle: users should know what’s inferred and be able to change it
    • Memory controls (on/off/delete) keep users in charge
  15. 22:48 – 26:27

    Memory and proactive experiences: from cold starts to personalized daily utility

    Christina defines memory as persistent user facts/preferences used across chats. Andrew shares how memory enables proactive features like Pulse, making the experience feel more personal and less like restarting from scratch each session.

    • Memory stores user info from conversations for later reference
    • Reduces repetition; enables responses grounded in personal context
    • Proactive features can leverage memory to surface relevant updates
    • Experience difference is noticeable when memory is disabled
  16. 26:27 – 28:40

    Getting the best results: pressure-test, iterate, ask for better prompts—and pick your style

    They close with practical advice: test the model on domains you know well, keep trying as updates ship frequently, and ask the model to improve your prompts. A light ending reveals their own style settings and how personalization can backfire in professional contexts.

    • Use hard questions in areas you know to evaluate progress
    • Model behavior changes fast—retry use cases over time
    • Ask the model to help you write better prompts
    • Christina uses default; Laurentia cycles styles (e.g., ‘nerd’, ‘Albertan’)
    • Customization is powerful but needs context switching for professionalism

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.