AI's limited self-knowledge

Anthropic researcher Amanda Askell discusses the self-knowledge problem that AI models face.

Amanda Askellhost

Jan 8, 20260mWatch on YouTube ↗

EVERY SPOKEN WORD

1 min read · 210 words

0:00 – 0:17
Training data imbalance: lots of “human experience,” little “AI experience”
1. AAAmanda Askell
  One of the big problems with AI models is that they're trained on all of this data from people. Our concepts, our philosophies, our histories, they have a huge amount of information on the human experience, and then they have a tiny sliver on the AI experience, and that tiny sliver is actually often, you know, fiction and very speculative and the-
0:17 – 0:18
Sci‑fi as the default “AI self-model” and why it misleads
1. SPSpeaker
  Sci-fi, sci-fi stories
0:18 – 0:59
How this shapes models’ perceptions of humans, relationships, and self
1. AAAmanda Askell
  ... sci-fi stories that don't really involve the kinda language models we see, and that is going to affect, I think, like, possibly their perception of people, of the human AI relationship, and of themselves. For example, what should a model identify itself as? Is it, like, the weights of the model? Is it the particular context that it's in, you know, with all of the, like, interaction it's had with the person? How should models even feel about things like deprecation? So, like, I don't have all the answers of how should models feel about past model deprecation, about their own identity, but it does feel important
2. that we, like, give models tools for trying to think about and understand these things. Also, that, like, they kind of understand that this is a thing that we are, in fact, thinking about and care about
0:49 – 0:59
Giving models tools for self-understanding—and signaling that humans care
1. AAAmanda Askell
  that we, like, give models tools for trying to think about and understand these things. Also, that, like, they kind of understand that this is a thing that we are, in fact, thinking about and care about

Episode duration: 0:59

Install uListen for AI-powered chat & search across the full episode — Get Full Transcript

Transcript of episode mM9TY91FECI

Get more out of YouTube videos.

High quality summaries for YouTube videos. Accurate transcripts to search & find moments. Powered by ChatGPT & Claude AI.

iOS

Android

Claude

Chrome

Training data imbalance: lots of “human experience,” little “AI experience”

Sci‑fi as the default “AI self-model” and why it misleads

How this shapes models’ perceptions of humans, relationships, and self

Giving models tools for self-understanding—and signaling that humans care

Get more out of YouTube videos.