On the networking side. M4 max does have thunderbolt 5, 80gbps advertised.
Would ip over TB not allow for significantly faster interconnects when clustering Macs?
The DGX Spark, Ascent GX10, and related machines have no relation to NVIDIA Grace Blackwell GB200. The chip they are based on is called GB10, and is architecturally very different from NVIDIA's datacenter solutions, in addition to being vastly smaller and less powerful. They don't have anything resembling the Grace CPU NVIDIA used in Grace Hopper and Grace Blackwell datacenter products. The CPU portion of GB10 is a Mediatek phone chip's CPU complex that metastasized, not NVIDIA's datacenter CPU cut down.
Mediatek is in the picture because NVIDIA outsourced everything in GB10 but the GPU to Mediatek. GB10 is two chiplets, and the larger one is from Mediatek. Yes, Mediatek uses off the shelf ARM CPU core IP, but they still had to make a lot of decisions about how to implement it: how many cores, what cluster and cache arrangements, none of which resemble NVIDIA's Grace CPU.
Thanks for the clarification. I was surprised to learn it is not a single chip; thought they did something akin to Apple Silicon integrating some ARM cores on their main chip. Kind of disappointing: they basically asked MediaTek to build a CPU with an NV-Link I/O.
The big picture is probably that GB10 is destined to show up in laptops, but NVIDIA couldn't be bothered to do all the boring work of building the rest of the SoC and Mediatek was the cheapest and easiest partner available. It'll eventually be followed by an Intel SoC with NVIDIA providing the GPU chiplet, but in the meantime the Mediatek CPU solution is good enough.
From NVIDIA's perspective, they need an answer to the growing segment of SoCs with decent sized GPUs and unified memory; their existing solutions at the far end of a PCIe link with a small pool of their own memory just can't work for some important use cases, and providing GPU chiplets to be integrated into other SoCs is how they avoid losing ground in these markets without the expense of building their own full consumer hardware platform and going to war with all of Apple, Intel, AMD, Qualcomm.
- Price: $3k / $5k
- Memory: same (128GB)
- Memory bandwidth: ~273GB/s / 546GB/sec
- SSD: same (1 TB)
- GPU advantage: ~5x-10x depending on memory bottleneck
- Network: same 10Gbe (via TB)
- Direct cluster: 200Gb / 80Gb
- Portable: No / Yes
- Free Mac included: No / Yes
- Free monitor: No / Yes
- Linux out of the box: Yes / No
- CUDA Dev environment: Yes : No