Skip to content
OpenAIOpenAI

ChatGPT Atlas and the next era of web browsing — the OpenAI Podcast Ep. 9

How will the internet feel when your browser can actually do things for you? OpenAI’s Ben Goodger and Darin Fisher, whose past work shaped some of the most popular modern browsers, dive into the making of ChatGPT Atlas. They explore how AI changes what a browser can be, from tabs you can talk to, to agents that take over tedious tasks. Learn more about the decisions they made along the way and what’s coming next. Guests: Ben Goodger and Darin Fisher Chapters - 00:00:45 What is Atlas? - 00:03:34 The state of browsers and AI on the web - 00:13:55 Under the hood (https://openai.com/index/building-chatgpt-atlas/) - 00:22:00 Building with AI - 00:33:39 Search in Atlas - 00:41:28 Favorite features - 00:45:23 Side Chat in action - 00:46:59 Real-world wins with agent mode - 00:52:45 Why Chromium? - 01:07:57 Five-year vision - 01:13:11 Power tips and closing remarks

Andrew MaynehostDarin FisherguestBen Goodgerguest
Nov 14, 20251h 14mWatch on YouTube ↗

CHAPTERS

  1. 0:00 – 2:00

    What ChatGPT Atlas is: a browser built around natural-language interaction

    Andrew sets the stage for Atlas as a new kind of browser where ChatGPT isn’t a bolt-on, but the central interface. Ben and Darin frame it as a tool for telling the computer what you want, then letting it use the web to accomplish tasks over minutes to months.

  2. 2:00 – 3:40

    Why build it now: models, computer-use, and the “slope” of capability

    The team argues that recent leaps in model performance and “computer use” make integrated browsing experiences finally compelling. They compare progress from earlier agent experiences (like Operator) to Atlas’s speed and capability, and emphasize laying foundations early for the future.

  3. 3:40 – 10:08

    The browser isn’t going away: the durable, open platform for work and research

    They discuss why browsers remain central despite shifts like mobile and app ecosystems. The web’s openness, lack of gatekeepers, and role as a conduit to information make it resilient—and generative AI naturally augments that experience rather than replacing it.

  4. 10:08 – 13:30

    From “semantic web” dreams to models that meet the web where it is

    Andrew raises the idea that the web never fully became machine-annotated in the way early visions suggested. Darin explains that modern models can interpret human-oriented interfaces directly, reducing reliance on perfect structured markup and enabling machine help on today’s messy web.

  5. 13:30 – 17:35

    Browsers are complex “mini operating systems,” and Atlas goes beyond a plugin

    They unpack how much hidden engineering lives inside browsers and why Atlas needed deeper integration than an extension can provide. Ben explains the design aim: weave ChatGPT across the entire browsing surface to enable richer actions like writing assistance and personalized behavior.

  6. 17:35 – 19:29

    Agent workspace: separating your tabs from the agent’s tabs (and why it matters)

    Ben and Darin describe a key product abstraction: agents get their own tab workspace so their browsing doesn’t clutter yours. The agent can open many pages, work in the background, and then present results and provenance without disrupting the user’s active tab strip.

  7. 19:29 – 22:00

    What counts as an “agent task”: acting on the web while you watch and learn

    They clarify the difference between standard ChatGPT queries and agent mode: agents take actions inside web apps on your behalf. Examples include generating charts in spreadsheets or leaving comments in docs—while keeping the process visible so users can learn workflows.

  8. 22:00 – 25:29

    Building Atlas with AI: Codex-driven productivity and faster platform expansion

    The conversation shifts to how OpenAI’s own teams use coding agents to speed development. Ben and Darin share anecdotes of massive productivity gains, cross-language translation, and even using Swift for Windows to unify expertise and codebases.

  9. 25:29 – 28:42

    Safety and user control in agent mode: sensitive tasks, stop buttons, and auth choices

    Andrew asks about agent pausing when you leave a tab. Ben explains “sensitive mode” for high-stakes contexts (like email) where users must supervise and can immediately stop actions, while Darin adds options like running signed-out agent sessions for safer experimentation.

  10. 28:42 – 32:03

    Attention, ecosystem, and serendipity: expanding the web without getting trapped

    They explore how AI browsing changes discovery and incentives when agents open tabs and sites compete for attention. The team emphasizes that agents act only on user requests and that agent tabs are restricted from common spam vectors like notification permission prompts.

  11. 32:03 – 33:29

    Browser memories and personalization: recall, context-aware search, and controls

    Ben and Darin describe how Atlas uses browsing activity to build memories that help users return to prior work and reduce repetitive instructions. They highlight preference learning (e.g., defaulting to favorite travel sites) and emphasize user controls to view, limit, or disable personalization.

  12. 33:29 – 41:28

    Search inside the browser: familiar entry points that reveal “model superpowers”

    Andrew probes whether Atlas implies OpenAI’s own search direction. Darin explains they include familiar search facets (Images, News) to match user intent while blending in model responses so people learn to use ChatGPT naturally as part of everyday browsing—starting from a single input box.

  13. 41:28 – 52:44

    Favorite non-AI primitives: scrolling tabs, tab search, and performance architecture

    They highlight pragmatic browser improvements like scrolling tabs and tab search that make large tab sets manageable, enabling workflows where users don’t constantly close tabs. Darin also explains resource management (closing background pages), fast restarts, and Owl (Chromium out-of-process) for resilience and speed.

  14. 52:44 – 58:40

    Why Chromium (and its lineage): compatibility, extensions, and where innovation sits

    Darin explains Chromium is chosen for web compatibility and extension ecosystem realities. Ben adds historical context (WebKit, KHTML, Blink) and they clarify Atlas’s innovation focus: not reinventing the renderer, but adding an AI-native layer on top with a distinct process architecture and modern Swift app stack.

  15. 58:40 – 1:14:21

    Long-term bet and what’s next: platforms, mobile UX, and the agentic web in 5–10 years

    They address whether Atlas is a serious product (not just an experiment) and outline a long roadmap with frequent updates and broader platform support. Looking forward, they envision users expressing intent while agents handle “toil,” with increasing agent-driven web traffic—while the open web remains the publishing substrate.

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.