Hacker News

Nvidia and AMD make $$$ on datacenter GPUs, so it makes sense that they don't want to undercut their own high end. Intel has nothing there, so it can happily push for commoditization of AI hardware, much like Meta did when it released LLaMA into the wild.


Is Nvidia or AMD offering 128GB cards in any configuration?


They aren't "cards," but the MI300X has 192GB and the MI325X has 256GB.


You can run an AMD APU with 128GB of shared RAM.


It's too slow and not very compatible. Most BIOSes also don't allow sharing that much memory with the GPU (the cap is usually around 16GB).


Isn't that setting just a historical thing, and an integrated GPU is able to access any system memory that is mapped by the IOMMU? I assume this is how it works for people using the NVIDIA Jetson AGX Orin 64GB Developer Kit to do inference. I don't know why it would be different for AMD APUs.
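That's roughly how it works on Linux, at least: the amdgpu driver's GTT (GPU-accessible system memory) is separate from the BIOS UMA carve-out, and its size can usually be raised via a module parameter. A rough sketch, assuming a recent kernel and that card0 is the APU; the maximum your platform accepts may vary:

```shell
# Check how much system RAM the driver currently exposes as GTT (bytes)
cat /sys/class/drm/card0/device/mem_info_gtt_total

# Raise it with the amdgpu.gttsize module parameter (value in MiB),
# e.g. let the iGPU address ~96GiB of system RAM:
echo "options amdgpu gttsize=98304" | sudo tee /etc/modprobe.d/amdgpu.conf
# then regenerate the initramfs and reboot
```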


I remember somebody complaining about it on Reddit, unable to overcome some BIOS limitation on an AMD G-series processor. Even on an M3 Max, one had to issue a special command to let the GPU access more memory.


The command on the M3 Max is a sysctl command that adjusts an operating-system-enforced limit. That is different from the aperture setting in the BIOS. The limitation on AMD is more interesting and more relevant.
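For reference, this is the command in question (assuming Apple Silicon on a recent macOS, where the default GPU wired-memory limit is roughly 70-75% of unified memory):

```shell
# Allow the GPU to wire up to ~120GB of a 128GB machine's unified memory.
# Value is in MB; the setting resets on reboot.
sudo sysctl iogpu.wired_limit_mb=122880
```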


You can do that with Nvidia too, but it takes you from 6 tok/s to 6 s/token or worse (not even exaggerating).
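For a sense of scale, that's a 36x slowdown (illustrative arithmetic only):

```python
fast = 6.0        # 6 tokens per second
slow = 1.0 / 6.0  # 6 seconds per token, i.e. ~0.167 tokens per second

slowdown = fast / slow
print(slowdown)   # 36.0
```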



