
There was no leap in research. Everything had to do with availability of compute.

Neural nets are quite old, and everyone knew they were universal function approximators. The reason models never took off was that training even a modestly sized model was very expensive. There was no realistically available hardware for it short of supercomputer clusters, which were all CPUs and thus wildly inefficient. But any researcher back then would have told you that you could figure anything out with neural nets.

Sometime in 2006, Nvidia realized that a lot of graphics compute was just generic parallel compute and released CUDA. People started using graphics cards for general computation. Then someone figured out you could actually train deep neural nets at decent speed.

Transformers weren't even that big of a leap. The paper makes it sound like some sort of novel architecture, but in essence, instead of feeding input × weights to the next layer, you compute input × matrix1, input × matrix2, and input × matrix3, and multiply the results together. And as you might guess, training it takes more hardware, because now you have to train three matrices rather than just one.
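
A minimal sketch of what those three matrices do in a single attention head, assuming NumPy; the names W_q, W_k, W_v and the toy shapes are just illustrative, and details like causal masking and multiple heads are left out:

    import numpy as np

    def attention_head(x, W_q, W_k, W_v):
        # x: (seq_len, d_model); W_q, W_k, W_v: (d_model, d_head)
        Q = x @ W_q   # input x matrix1
        K = x @ W_k   # input x matrix2
        V = x @ W_v   # input x matrix3
        # multiply the results together: scaled dot-product attention
        scores = Q @ K.T / np.sqrt(Q.shape[-1])
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)  # softmax over keys
        return weights @ V

    # toy usage
    rng = np.random.default_rng(0)
    x = rng.normal(size=(4, 8))                      # 4 tokens, d_model = 8
    W_q, W_k, W_v = (rng.normal(size=(8, 8)) for _ in range(3))
    out = attention_head(x, W_q, W_k, W_v)           # shape (4, 8)

The point being: it is still just matrix multiplies, only three learned projections per layer instead of one.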

If we ever get something like an ASIC for ML, at a certain point we will be able to iterate on the architectures themselves. The optimal LLM may be a combination of CNN, RNN, and Transformer blocks, all intertwined.



> ever get something like an ASIC for ML

Is this what you're referring to?

[0] https://linearmicrosystems.com/using-asic-chips-for-artifici...


Sort of. Those are just inference chips. You wouldn't be able to iterate on the architecture with them.

In terms of the math, every single transformer can be expressed as a sequence of deep layers, so you could have an ASIC laid out in such a way that the architecture of the model depends on where you put the zeros.
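
A toy illustration of that idea, assuming NumPy; this is just a sketch of "zeros pick the wiring", not how any actual chip is laid out, and the mask patterns are invented for illustration:

    import numpy as np

    def masked_layer(x, W, mask):
        # The hardware computes the same dense product every time;
        # the zeros in `mask` decide which connections actually exist.
        return x @ (W * mask)

    rng = np.random.default_rng(0)
    d = 8
    W = rng.normal(size=(d, d))   # one generic, fully wired layer
    x = rng.normal(size=(d,))

    dense_mask = np.ones((d, d))                              # plain fully connected layer
    block_mask = np.kron(np.eye(2), np.ones((d // 2, d // 2)))  # two independent halves, like parallel branches/heads
    banded_mask = np.tril(np.triu(np.ones((d, d)), -1), 1)      # narrow band, like a local convolution-style stencil

    for name, m in [("dense", dense_mask), ("block", block_mask), ("banded", banded_mask)]:
        print(name, masked_layer(x, W, m).round(2))

Same silicon, same dense multiply; only the zero pattern changes what the layer effectively is.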



