As much as I love their work, I can't be the only one who really struggles to see a path to profitability for Mistral, right? How do you make money selling API access to a model which anyone else can spin up an API for (license is Apache 2.0) on AWS or GCP or similar? Do they have some sort of magic inference optimization that allows them to be cheaper per-token than other hosting providers? Why would I use their API instead of anybody else's?
Asking these questions as a genuine fan of this company—I really want to believe they can succeed and not go the way of StabilityAI.
If VC funding for AI dries up but the French continue investing in Mistral, that would blunt much of the damage of an AI winter that could sink OpenAI and Anthropic.
Even if Google had, it would have been of little value, since running Google, even back then, required a lot of computers and a lot of humans.
which is rather like Mistral - running large models is expensive, and hosting lets you amortise that across lots of users who individually use the model very little.
Google ostensibly started on beige boxes, though. They used whatever computers they could get cheaply and quickly; even older hardware sufficed. There was a niche global group of people who could make stuff like that work as a much larger compute system (Beowulf clusters, etc.). I don't know that it took "a lot of humans" to bootstrap.
That hasn't been true for their largest model since the 2407 release of Mistral Large 2 (https://mistral.ai/news/mistral-large-2407/), it is however under a non-commercial license.
> How do you make money selling API access to a model which anyone else can spin up an API for (license is Apache 2.0) on AWS or GCP
Uhhh.. easily. Don’t host it on AWS or GCP, where everyone is paying a 10x markup for proprietary infrastructure? Don’t hire thousands of unnecessary employees? Don’t bank on outrageous valuations? There are lots of ways to compete with big tech.
I guess I was just under the impression that cloud inference is such a competitive market that it'd be nigh-on-impossible to compete with the major players.
Except Dropbox always kept their users tied to their platform, which allowed them to gradually enshittify their offering, starting with removal of directly addressable content in "Public" folder, and continuing through various changes and side products that all had very little to do with "a folder that syncs". Mistral can't successfully enshittify if they can't keep users captive.
It's somewhat common to open source the core yet still monetize a version (browser vendors, SaaS, games). People will still pay for convenience, reliability, or for the best product.
> Why would I use their API?
To pay 15 cents per million tokens instead of $5.
> license is Apache 2.0
There are browser vendors that use Chromium despite "competing" with Chrome - even though it's the same kind of web browser product, there are some benefits if they allow the other options to exist. The same can be said of open source games and frenemy situations like Uber vs. Lyft in the early days - it doesn't necessarily hurt to have others playing your game, especially if you have a common enemy (Firefox, other games, cabs, respectively).
I mostly run Mistral offline in a terminal (via the ollama CLI), but in a case where I need a text-to-text LLM for an app, and users pay for access to the LLM-powered features, why not use Mistral's API? Then I could have a super cheap app set up on Vercel or whatever and do everything through an API key. The app would "be AI" and yet run on a calculator, for cents.
The main thing that comes to mind regarding "just spin it up on AWS" is the considerable backend needs (GPUs) and the cost to train and run LLMs. In the same way you ask "why use the LLM vendor's cloud option when I could use AWS's cloud option?", you could ask the inverse (or just host it yourself for "free" after the initial setup, if it's cost you're after).
If you need geo-located instances or have other specific requirements, use IaaS, but otherwise I think IaaS like AWS and GCP are a nightmare to manage: the awful IAM experience, all the vendor-specific jargon, navigating the hell that is Amazon.com. For something like an LLM, "just spin it up on AWS" is just funny when you really consider what you're getting yourself into.
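To make the "do everything through an API key" idea concrete, here's a minimal sketch of what a call to Mistral's hosted chat completions endpoint looks like. It only constructs the request rather than sending it; the model name ("mistral-small-latest") and exact request fields are assumptions based on the OpenAI-style API shape, so check the current docs before relying on them.

```python
import json

# Mistral exposes an OpenAI-style chat completions endpoint.
# Endpoint path and body fields are assumptions; verify against current docs.
API_URL = "https://api.mistral.ai/v1/chat/completions"


def build_chat_request(prompt, api_key="YOUR_API_KEY", model="mistral-small-latest"):
    """Build (url, headers, body) for a single-turn chat request.

    Send it with any HTTP client, e.g.:
        requests.post(url, headers=headers, data=body)
    """
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    body = json.dumps({
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
    })
    return API_URL, headers, body


url, headers, body = build_chat_request("Summarize this support ticket in one line.")
print(url)
```

The point of the sketch: the app server holds the API key and forwards prompts, so it needs no GPU at all - it really can run "on a calculator for cents".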
My intention was less to imply "I'll just spin it up" and more to imply "Some competitor of Mistral's will spin it up". I agree that from my perspective as a casual user, Mistral's API is quite convenient. What I don't understand is why they aren't driven to zero-margin instantaneously by an onslaught of clones of their business model.
It seems okay at image descriptions, I suppose. It's still a 12B model, though, and doesn't always get OCR anywhere near correct. I tried it on Le Chat, and I'm waiting for it to land in Ollama.
Anyone from Mistral here? The link to the docs is broken, and I really would like to know more about what the specifications are for calling this via API. Foremost, what's the maximum image size you can use via the API? Thank you!