>99% of the code in this PR [for llama.cpp] is written by DeekSeek-R1 Yes, but: ...

janwas · 2025-01-29T07:06:39 1738134399

Interesting that both de-novo and porting seems to have worked.

I do not understand why GGML is written this way, though. So much duplication, one variant per instruction set. Our Gemma.cpp only requires a single backend written using Highway's portable intrinsics, and last I checked for decode on SKX+Zen4, is also faster.