Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Intel does make cards aimed at this space too:

https://www.intel.com/content/www/us/en/products/details/pro...

Coincidentally, it has 128GB of RAM. However, it is not a GPU, is designed to do training too and uses expensive HBM.

Modern GPUs can do more than inference/training and the original poster asked about a GPU with 128GB of RAM, not a card that can only do inferencing as you described. Interestingly, Qualcomm made its own card targeted at only inferencing with 128GB of RAM without using HBM:

https://www.qualcomm.com/news/onq/2023/11/introducing-qualco...

They do not sell it through PC parts channels so I do not know the price, but it is exactly what you described and it has been built. Presumably, a GPU with the same memory configuration would be of interest to the original poster.




Back in January, someone on Reddit claimed the list price was $16k.


It's competing against Nvidia H100s, which cost $25k. It's cheap, at least by the norms of the space.




Consider applying for YC's Fall 2025 batch! Applications are open till Aug 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: