It's not cheaper than Deepseek V3.1, though, and Deepseek outperforms on nearly everything. And only between 1- 3x the throughput based on the openrouter metrics (near equivalent throughput if you use an FP8 quant). Wish I could be a little more excited about this one.
Love the information on this guy's website, dislike the layout of information and use of Google Sheets on pages. Makes it difficult to navigate and grok.
They are dropping some hints about a larger model and something relating to open models in the final paragraph:
> With the launches of Mistral Small in March and Mistral Medium today, it’s no secret that we’re working on something ‘large’ over the next few weeks. With even our medium-sized model being resoundingly better than flagship open source models such as Llama 4 Maverick, we’re excited to ‘open’ up what’s to come :)
EU Legal / Privacy teams are generally fine with OpenAI models running on Azure from my experience (I assume for GDPR reasons). That's "my infra" enough.
And you get decent quality models without the hassle of maintaining the infra.