Hacker News
a_e_k | 3 months ago | on: Qwen3-Omni: Native Omni AI model for text, image a...
That's at BF16, so it should fit fairly well on 24GB GPUs after quantization to Q4, I'd think. (Much like the other 30B-A3B models in the family.)
I'm pretty happy about that - I was worried it'd be another 200B+.
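(A rough back-of-envelope check of that sizing, as a Python sketch. The ~30B parameter count and the ~4.5 bits/weight average for a Q4-class quant are assumptions for illustration, not exact figures for any specific GGUF file:)

```python
# Rough VRAM estimate for quantized model weights.
# Assumptions (not exact): ~30e9 parameters for the 30B-A3B family,
# and an average of ~4.5 bits/weight for a Q4-class quant. Real
# quant formats mix bit widths, so treat these as ballpark numbers.

def weights_gb(params: float, bits_per_weight: float) -> float:
    """Approximate size of the model weights in GB."""
    return params * bits_per_weight / 8 / 1e9

bf16 = weights_gb(30e9, 16)   # ~60 GB: far too big for a 24 GB card
q4   = weights_gb(30e9, 4.5)  # ~17 GB: fits in 24 GB, leaving room
                              # for the KV cache and runtime overhead

print(f"BF16: {bf16:.1f} GB, Q4: {q4:.1f} GB")
```

The gap between the ~17 GB of weights and the 24 GB card is what the context/KV cache eats into, which is why the later comments about context settings matter.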
numpad0 | 3 months ago
So, like, a single 32GB card is all you need for quite a while? Scrolling through the web makes me feel like I'm out unless I have a minimum of 128GB of VRAM.
zenmac | 3 months ago
Are there any that would run on a 16GB Apple M1?
bigyabai | 3 months ago
Not quite. The smallest Qwen3 A3B quants are ~12 GB and use more like ~14 GB depending on your context settings. You'll thrash the SSD pretty hard swapping it on a 16 GB machine.