But how well does it actually handle that context window? E.g. a lot of models support 200K context, but the LLM can only really work with ~80K or so of it before it starts to get confused.
It works REALLY well. I have used it to dump in a lot of reference code and then help me write new modules, etc. I have gone up to around 200k tokens, I think, with no problems in recall.
There is the needle-in-the-haystack measure, which is, as you probably guessed, hiding a small fact in a massive set of tokens and asking the model to recall it.
Recent Gemini models actually do extraordinarily well.
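If anyone wants to poke at this themselves, a minimal sketch of that kind of test looks something like the following. The model name, filler text, needle string, and the ~4 chars/token estimate are all just placeholder assumptions to show the shape of it, not any official benchmark harness:

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment


def build_haystack(needle: str, target_tokens: int, depth: float) -> str:
    """Bury `needle` at a relative `depth` (0.0 = start, 1.0 = end) inside
    filler text roughly `target_tokens` long, assuming ~4 chars per token."""
    filler_sentence = "The quick brown fox jumps over the lazy dog."
    n_sentences = (target_tokens * 4) // len(filler_sentence)
    sentences = [filler_sentence] * n_sentences
    sentences.insert(int(n_sentences * depth), needle)
    return " ".join(sentences)


def recalled(target_tokens: int, depth: float) -> bool:
    """Return True if the model pulls the needle back out of the haystack."""
    needle = "The secret launch code is 7481-ALPHA."
    prompt = (
        build_haystack(needle, target_tokens, depth)
        + "\n\nWhat is the secret launch code? Reply with the code only."
    )
    resp = client.chat.completions.create(
        model="gpt-4o",  # placeholder; swap in whatever model you're testing
        messages=[{"role": "user", "content": prompt}],
    )
    return "7481-ALPHA" in (resp.choices[0].message.content or "")


if __name__ == "__main__":
    # Sweep context sizes and needle positions; the score is just the
    # fraction of these cells where the model still recalls the needle.
    for tokens in (20_000, 40_000, 80_000, 120_000):
        for depth in (0.1, 0.5, 0.9):
            print(tokens, depth, "recalled" if recalled(tokens, depth) else "missed")
```

The single-fact version is easy for recent models; the interesting failures show up when you hide several needles and ask for all of them at once, which matches the multi-question hallucinations people report below.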
It works okay out to roughly 20-40k tokens. Once the window gets larger than that, it degrades significantly. You can do needle-in-the-haystack out to that distance, but asking it for multiple things from the document leads to hallucinations for me.
Ironically, GPT-4o works better for me at longer contexts (<128k) than Gemini 2.0 Flash. And going out to 1M is just hopeless, even though you technically can.