My one big problem with OpenRouter is that, as far as I can tell, they don't provide any indication of how many companies are using each model.
For all I know there are a couple of enormous whales on there who, should they decide to switch from one model to another, will instantly impact those overall ratings.
I'd love to have a bit more transparency about volume so I can tell if that's what is happening or not.
Right, that chart shows app usage based on the user-agent header, but it doesn't tell you whether a single individual user of an app is skewing the results.
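For context, that attribution comes from headers the client sends with each request; if I remember OpenRouter's docs right, apps tag themselves with HTTP-Referer and X-Title (header names from memory, so treat them as an assumption). A minimal sketch of how an app tags itself, using the OpenAI SDK pointed at OpenRouter (the model slug and URLs are illustrative):

    # Sketch: tagging requests so they count toward an app's leaderboard
    # entry on OpenRouter. Header names are from memory of the docs.
    from openai import OpenAI

    client = OpenAI(
        base_url="https://openrouter.ai/api/v1",
        api_key="sk-or-...",  # your OpenRouter key
        default_headers={
            "HTTP-Referer": "https://example-app.dev",  # attribution URL (assumed header)
            "X-Title": "Example App",                   # display name (assumed header)
        },
    )

    # Every request from this client is attributed to "Example App" as a
    # whole; there's no per-user dimension in the public stats.
    resp = client.chat.completions.create(
        model="google/gemini-2.5-flash",  # illustrative slug
        messages=[{"role": "user", "content": "hello"}],
    )

The point being: the stats roll up at the app level, so one whale inside an app is invisible.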
I was skewing the Gemini stats with my Aider usage; it was basically the only model I was using with OpenRouter, until I recently started running qwen3-next locally.
2.5 is probably the best balance of cost and capability for tools like Aider.
API usage of Flash 2.0 is free, at least until you hit a very generous limit; it's not simply a trial period. You don't even need to register payment details to get an API key. That might be part of why it's so popular. AFAIK only some Mistral offerings have a similar free tier?
Yeah, that's my use case: when you want to test some program/script that uses an LLM in the middle, and you just want to make sure everything non-LLM-related is working. It's free! Just try again and again until it "compiles", then switch to 2.5.
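Roughly what that workflow looks like in code (the endpoint and model slugs are as I recall them from Google's OpenAI-compat docs, and the env var is my own invention, so double-check before copying):

    # Sketch of the "develop on the free model, ship on 2.5" workflow.
    import os
    from openai import OpenAI

    DEV = os.environ.get("LLM_DEV_MODE") == "1"   # hypothetical dev switch
    MODEL = "gemini-2.0-flash" if DEV else "gemini-2.5-flash"  # assumed slugs

    client = OpenAI(
        base_url="https://generativelanguage.googleapis.com/v1beta/openai/",
        api_key=os.environ["GEMINI_API_KEY"],  # assumed env var name
    )

    def ask(prompt: str) -> str:
        resp = client.chat.completions.create(
            model=MODEL,
            messages=[{"role": "user", "content": prompt}],
        )
        return resp.choices[0].message.content

    # While iterating on the non-LLM plumbing, run with LLM_DEV_MODE=1 and
    # burn free 2.0 Flash calls; flip it off once everything "compiles".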
2.0 Flash is significantly cheaper than 2.5 Flash, and it is (or was) better than 2.5 Flash-Lite before this latest update. It's a great workhorse model for basic text parsing, summarization, image understanding, etc. Though it looks like 2.5 Flash-Lite will make it redundant.
Yep, Kilo (and more recently Cline/Roo) push these free-trial-of-the-week models really hard, partly as an incentive to register an account with their cloud offering. I began using Cline and Roo before "cloud" features were even a thing and still haven't bothered to register, but I do play with the free Kilo models when I see them, since I'm already signed in (they got me with some kind of register-and-spend-$5-to-get-$X-model-credits deal) and hey, it's free (I really don't care about my random personal projects being used for training).
If xAI in particular is in the mood to light cash on fire promoting their new model, you'll see it everywhere during the promo period, so I'm not surprised that heavily boosts xAI's stats. The mystery-codename models of the week are a bit easier to miss.
It's pretty good and fast af. At backend stuff it's roughly GPT-5-mini in capability: it writes OK code and works well with agentic extensions like Roo/Kilo. My colleagues say it handles frontend creation so-so, but it's so fast that you can "roll" a couple of tries and pick the one you want.
Yeah, the speed and price are why I use it. I find that any LLM is garbage at writing code unless it gets constant high-entropy feedback (e.g. an MCP tool reporting lint errors, a test, etc.), and the quality of the final code depends a lot more on how well the LLM was guided than on the quality of the model.
A bad model with good automated tooling and prompts will beat a good model without them, and if your goal is to build good tooling and prompts you need a tighter iteration loop.
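A toy version of the loop I mean, with a linter as the high-entropy feedback source (ask_llm is whatever completion function you're using; the prompt format is schematic):

    # Toy feedback loop: generate code, lint it, feed concrete diagnostics
    # back until clean. Needs ruff installed; everything else is stdlib.
    import pathlib
    import subprocess

    def lint(path: str) -> str:
        # ruff exits nonzero and prints diagnostics when it finds issues
        r = subprocess.run(["ruff", "check", path], capture_output=True, text=True)
        return "" if r.returncode == 0 else r.stdout

    def iterate(task: str, path: str, ask_llm, max_rounds: int = 5) -> bool:
        prompt = task
        for _ in range(max_rounds):
            pathlib.Path(path).write_text(ask_llm(prompt))
            errors = lint(path)
            if not errors:
                return True  # linter is happy; hand off to tests next
            # High-entropy feedback: concrete diagnostics, not "try harder"
            prompt = f"{task}\n\nYour last attempt failed lint:\n{errors}\nFix it."
        return False

With a cheap, fast model you can afford to run this loop many times, which is the whole point: the loop, not the model, does most of the quality control.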
This is so far off from my experience. Grok 4 Fast is straight trash; it doesn't come anywhere close to producing decent code for what I tried. Meanwhile Sonnet is miles better, and Opus, while I guess technically only slightly better than Sonnet, is in practice so much better that I find it hard to use Sonnet at all.
I mean, I can kinda roll through a lot of iterations with this model without worrying about hitting any usage limits.
Y'know, with all these latest models the lines are kinda blurry; the definition of "good" is getting foggy.
So it might as well be free, since the definition of money is crystal clear.
I also used it for a while to test something really, really niche, building a Telegram bot in Cloudflare Workers, and grok-4-fast was actually kinda decent at that for the most part. So that's nice.
From OpenRouter last week (tokens processed):
* xAI: Grok Code Fast 1: 1.15T
* Anthropic: Claude Sonnet 4: 586B
* Google: Gemini 2.5 Flash: 325B
* Sonoma Sky Alpha: 227B
* Google: Gemini 2.0 Flash: 187B
* DeepSeek: DeepSeek V3.1 (free): 180B
* xAI: Grok 4 Fast (free): 158B
* OpenAI: GPT-4.1 Mini: 157B
* DeepSeek: DeepSeek V3 0324: 142B