Any large language model produces embedding representations at every layer, and these are trivial to extract. In that sense, large language models are already embedding models.
This leaderboard doesn't compare these custom-tailored embedding models against the obvious baseline: mean pooling over the hidden states of any off-the-shelf LLM, which is easy to set up with sentence-transformers.
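A minimal sketch of what that pooling amounts to, with illustrative shapes and random values standing in for real hidden states (in practice they would come from the LLM's final layer, and sentence-transformers wires this up via its `Pooling` module):

```python
import numpy as np

# Toy mean pooling: average the per-token hidden states from an LLM's
# final layer into one fixed-size sentence embedding, skipping padding
# tokens via the attention mask. Shapes here are hypothetical.
hidden_states = np.random.rand(2, 5, 8)      # (batch, tokens, hidden)
attention_mask = np.array([[1, 1, 1, 0, 0],  # 1 = real token, 0 = pad
                           [1, 1, 1, 1, 1]])

mask = attention_mask[:, :, None]            # broadcast over hidden dim
summed = (hidden_states * mask).sum(axis=1)  # sum states of real tokens
counts = mask.sum(axis=1)                    # real-token count per row
embeddings = summed / counts                 # (batch, hidden)
print(embeddings.shape)  # (2, 8)
```

With sentence-transformers itself, the same thing is roughly `SentenceTransformer(modules=[models.Transformer(name), models.Pooling(dim, pooling_mode="mean")])` for a model name of your choosing.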
Hasn’t Claude had this for many months (before they bumped it to 100k)?
Edit: ah, do you mean it’s new for OSS models?