> they added another trillion tokens and shrank the model from 18 GB to 9 GB through quantization, reducing its bit width from Mamba2’s 16-bit floating-point precision to 8-bits.
This sounds like what they call "Bamba-9B" is actually an 18B model quantised to 8 bits.
I thought generally we were naming models "nB" by their number of params and treating quantisation as a separate concern. Are there any other models that instead treat the name as an indicative memory requirement?
Is this an attempt to hide that it fares poorly vs other ~18B parameter models?
EDIT: no, I just misunderstood
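For anyone else who trips over the same thing, the arithmetic that resolves it: checkpoint size is roughly parameter count × bytes per parameter, so a 9B-parameter model is ~18 GB at 16-bit precision and ~9 GB at 8-bit. The name stayed consistent with the parameter count; only the storage shrank. A minimal sketch (ignoring embedding-table and optimizer overhead):

```python
def model_size_gb(num_params_billion: float, bits_per_param: int) -> float:
    """Approximate checkpoint size in GB: params * bytes per param."""
    bytes_per_param = bits_per_param / 8
    return num_params_billion * bytes_per_param

# A 9B-parameter model:
print(model_size_gb(9, 16))  # 16-bit (bf16/fp16) -> 18.0 GB
print(model_size_gb(9, 8))   # 8-bit quantised    -> 9.0 GB
```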