The best benchmark is the community vibe in the weeks following a release. Claud...

diggan · 2025-09-12T13:09:06 1757682546

> The best benchmark is the community vibe in the weeks following a release.

True, just be careful what community you use as a vibe-check. Most of the mainstream/big ones around AI and LLMs basically have influence campaigns run against them, are made of giant hive-minds that all think alike and you need to carefully asses if anything you're reading is true or not, and votes tend to make it even worse.

theblazehen · 2025-09-12T15:37:20 1757691440

I generally check LM Arena as well as which models have had the most weekly tokens on openrouter

wubrr · 2025-09-11T22:00:55 1757628055

the vibes are just a collection anecdotes

ryoshu · 2025-09-12T00:00:22 1757635222

"qual"