
What I'd like to know is why they write it all in the third person. One might expect a system prompt to use the word "you" a lot, but Anthropic doesn't do that, and there must be a reason.


My best guess is that this is a reflection of how these things actually work.

When you "chat" with an LLM you are actually still participating in a "next token" prediction sequence.

The trick to getting it to behave like a chat is to arrange that sequence as a screenplay:

  User: five facts about squirrels

  Assistant: (provide five facts)

  User: two more

  Assistant:
When you think about the problem like that, it makes sense that the LLM is instructed in terms of how that assistant should behave, kind of like screen directions.
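Rough Python sketch of that framing (the prompt format and the system prompt text here are made up, not Anthropic's actual serialization): the whole conversation, including a third-person description of "Claude", gets flattened into one string that the model simply continues token by token.

  # Illustrative only: build a screenplay-style prompt from a chat history.
  def build_prompt(system_prompt, turns):
      lines = [system_prompt, ""]
      for role, text in turns:
          lines.append(f"{role}: {text}")
          lines.append("")
      lines.append("Assistant:")  # the model predicts what comes after this
      return "\n".join(lines)

  prompt = build_prompt(
      "Claude is a helpful assistant made by Anthropic.",  # third person
      [("User", "five facts about squirrels"),
       ("Assistant", "1. ... 2. ... 3. ... 4. ... 5. ..."),
       ("User", "two more")],
  )
  print(prompt)

Nothing in that string addresses the model as "you"; the instructions describe a character named Claude, and the model's job is just to write Claude's next line.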


I bet it’s stronger than that: they anchor a lot of the alignment training to the unique(ish) token "Claude".


But if that's true, why choose a real name and not a made-up one? Maybe they only realized they needed to do that later? ChatGPT is a far more distinctive name than Claude.


Maybe to avoid confusion: "you" is relative to the point of view, while "Claude" is an absolute reference to the model.



