Medium Is the New Large

sauwan · 2025-05-07T20:12:30 1746648750

It's not cheaper than DeepSeek V3.1, though, and DeepSeek outperforms it on nearly everything. And it has only 1-3x the throughput based on the OpenRouter metrics (near-equivalent throughput if you use an FP8 quant). Wish I could be a little more excited about this one.

amai · 2025-05-07T20:23:13 1746649393

Mistral seems to be the only company that doesn't fake benchmarks. That of course makes it not exciting. But it doesn't have to be exciting to be useful.

YetAnotherNick · 2025-05-07T22:06:52 1746655612

What are you basing this on?

adt · 2025-05-07T20:22:17 1746649337

https://lifearchitect.ai/models-table/

Onawa · 2025-05-07T21:38:03 1746653883

Love the information on this guy's website, but dislike the layout and the use of Google Sheets on pages. It makes the site difficult to navigate and grok.

boramalper · 2025-05-07T14:20:29 1746627629

I guess this one (Mistral Medium 3) won't be open?

deanc · 2025-05-07T20:14:42 1746648882

They are dropping some hints about a larger model and something relating to open models in the final paragraph:

> With the launches of Mistral Small in March and Mistral Medium today, it’s no secret that we’re working on something ‘large’ over the next few weeks. With even our medium-sized model being resoundingly better than flagship open source models such as Llama 4 Maverick, we’re excited to ‘open’ up what’s to come :)

kergonath · 2025-05-07T18:11:25 1746641485

Correct: https://docs.mistral.ai/getting-started/models/models_overvi...

moralestapia · 2025-05-07T19:33:59 1746646439

Hmm, bearish on Mistral now. I thought they had a plan to monetize that was much more sophisticated than "freemium".

There's no reason for someone to use this over Anthropic, OpenAI, etc ... they don't even outperform them.

Kuinox · 2025-05-07T20:43:38 1746650618

Businesses can self-deploy Mistral models on their own infra.

You can't do that with the providers you listed.

moralestapia · 2025-05-07T20:52:19 1746651139

Indeed, however in practice almost nobody does it.

They even hint at it in their PR:

"Mistral La Plateforme and Amazon Sagemaker, and soon on IBM WatsonX, NVIDIA NIM, Azure AI Foundry, and Google Cloud Vertex"

My point being, why would you pay to use a Mistral model hosted on Azure, instead of using any other company's model hosted on Azure?

My answer to that, yesterday, would have been "because the model is free and unrestricted, I only pay for hardware"; that premise is gone today.

Kuinox · 2025-05-08T09:38:38 1746697118

> Indeed, however in practice almost nobody does it.

They released it at the same time as the article we are discussing...

https://mistral.ai/news/le-chat-enterprise

Palmik · 2025-05-08T13:30:18 1746711018

The big dogs also offer on-prem if you are important enough. Here's an article for Gemini: https://cloud.google.com/blog/products/ai-machine-learning/r...

jokethrowaway · 2025-05-08T13:00:02 1746709202

EU Legal / Privacy teams are generally fine with OpenAI models running on Azure, in my experience (I assume for GDPR reasons). That's "my infra" enough.

And you get decent quality models without the hassle of maintaining the infra.

Jackson__ · 2025-05-07T19:52:57 1746647577

Ah, but here's the thing!

If you carefully read their performance chart, they beat every open source model they bothered to list.

So if we were in a world in which they released this model, and only the listed open source models existed, they would be le SOTA!

ed · 2025-05-07T22:05:37 1746655537

DeepSeek wins most benchmarks according to that chart, so not quite SOTA, and there's no mention of model size, so it's hard to compare efficiency.

qeternity · 2025-05-07T22:42:35 1746657755

> There's no reason for someone to use this over Anthropic, OpenAI, etc

Uh except that you can host this yourself? You just have to license the model. What is the issue?

moralestapia · 2025-05-08T01:03:50 1746666230

> You just have to license the model.

That's the issue.

bn-l · 2025-05-07T18:05:27 1746641127

The Medium model is crazy fast. Reminds me of Maverick on Groq (except better, according to their own testing).

kristianp · 2025-05-07T20:36:26 1746650186

The only hint at its size is that it requires "self-hosted environments of four GPUs and above".