
1 chassis, 8 GPUs.

We want to be able to break that chassis up into individual GPUs and allocate 1 GPU to 1 "machine". I previously PXE booted 20,000 individual diskless PlayStation 5 blades, and I'm not sure how PXE would solve this.

The only alternative right now is to do what RunPod (and AMD's AAC) are doing and use Docker containers. But that has the Docker-in-Docker limitation, so people end up having to repackage everything. You also can't easily run different ROCm versions, since that comes from the host, and if you have 8 people on a single chassis... it becomes a nightmare to manage.
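To make the container approach concrete, here is a rough sketch of handing one GPU to each container by passing through `/dev/kfd` plus a single render node. The helper name `docker_run_args`, the image name, and the assumption that render nodes are numbered consecutively from `renderD128` are illustrative only; check the actual device nodes on the host before relying on any of this. It does nothing about the Docker-in-Docker or host ROCm version problems.

```python
import shlex


def docker_run_args(gpu_index, image, first_render_node=128):
    """Build a `docker run` argv that exposes a single AMD GPU.

    Assumption: GPU N maps to /dev/dri/renderD(128 + N). This ordering is
    not guaranteed and should be verified on the real host.
    """
    render_node = f"/dev/dri/renderD{first_render_node + gpu_index}"
    return [
        "docker", "run", "--rm", "-it",
        "--device=/dev/kfd",          # ROCm compute interface, shared by all GPUs
        f"--device={render_node}",    # only this GPU's render node
        "--group-add", "video",
        image,
    ]


# Print one command per GPU in an 8-GPU chassis.
# "rocm/pytorch:latest" is a placeholder image name.
for gpu in range(8):
    print(shlex.join(docker_run_args(gpu, "rocm/pytorch:latest")))
```

Each tenant still shares the host kernel and driver, which is exactly why this is less isolated than giving each GPU its own machine.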

We're just patiently waiting for AMD to fix the problem.




Got it, that's clear. I thought you had 1 GPU per chassis in some cases.


No such thing with the MI300X. They come 8 at a time on an OAM/UBB.



