
A better way would be to ask the LLM to generate keywords (or queries), then use old-school techniques to find a set of documents, and finally filter those using another LLM.
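A minimal sketch of that pipeline, assuming a hypothetical `llm` helper standing in for whatever model API you use, and the real `rank_bm25` package for the old-school retrieval step:

    # pip install rank-bm25
    from rank_bm25 import BM25Okapi

    corpus = [
        "Embeddings map text to dense vectors for similarity search.",
        "BM25 is a classic term-frequency ranking function.",
        "LLMs can rewrite a user question into search keywords.",
    ]

    def llm(prompt: str) -> str:
        """Hypothetical LLM call; swap in your provider's API."""
        raise NotImplementedError

    def search(question: str, top_n: int = 2) -> list[str]:
        # 1. Ask the LLM to turn the question into search keywords.
        keywords = llm(f"Generate search keywords for: {question}").split()

        # 2. Old-school retrieval: rank the corpus with BM25.
        bm25 = BM25Okapi([doc.lower().split() for doc in corpus])
        candidates = bm25.get_top_n([k.lower() for k in keywords], corpus, n=top_n)

        # 3. Filter the candidates with a second LLM pass.
        return [
            doc for doc in candidates
            if llm(f"Does this document answer '{question}'? "
                   f"Answer yes or no.\n\n{doc}").strip().lower().startswith("yes")
        ]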


How is that better than embeddings? You’re using embeddings to get a finite list of keywords, throwing out the extra benefits of embeddings (support for every human language, for instance), using a conventional index, and then going back to embedding space for the final LLM?

That whole thing can be simplified to: compute and store embeddings for docs, compute embeddings for query, find most similar docs.
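A sketch of that simplified pipeline, assuming the `sentence-transformers` package with its `all-MiniLM-L6-v2` model as one example encoder (any embedding model would do):

    # pip install sentence-transformers
    import numpy as np
    from sentence_transformers import SentenceTransformer

    model = SentenceTransformer("all-MiniLM-L6-v2")  # example model choice

    docs = [
        "Embeddings map text to dense vectors for similarity search.",
        "BM25 is a classic term-frequency ranking function.",
        "LLMs can rewrite a user question into search keywords.",
    ]

    # Compute and store embeddings for docs (normalized so dot product = cosine).
    doc_vecs = model.encode(docs, normalize_embeddings=True)

    def most_similar(query: str, k: int = 2) -> list[str]:
        # Compute embeddings for the query, then find the most similar docs.
        q_vec = model.encode([query], normalize_embeddings=True)[0]
        scores = doc_vecs @ q_vec  # cosine similarity via normalized dot product
        return [docs[i] for i in np.argsort(scores)[::-1][:k]]

    print(most_similar("how do I turn questions into keywords?"))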


Yes, you can do the "old school search" part with embeddings.


Ah, I had interpreted “old school search” to mean classic text indexing and Boolean-style search. I’d argue that if it’s using embeddings and cosine similarity, it’s not old school. But that’s just semantics.
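For reference, the cosine similarity mentioned above is just the dot product normalized by the vector magnitudes; a minimal version:

    import numpy as np

    def cosine_similarity(a: np.ndarray, b: np.ndarray) -> float:
        # cos(theta) = (a . b) / (|a| * |b|)
        return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))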




