DeepSeek published a bunch of benchmarks when they released the models: https://...

punkpeye · 2025-01-28T17:15:53

This is very useful. Thank you.

So basically there is not much reason to go beyond DeepSeek-R1-Distill-Qwen-32B, at least for coding tasks.

punkpeye · 2025-01-28T19:01:21

Just had a chance to play around with the 32B model.

I am using it with the Cline VS Code extension to write code.

It works impressively well for a model this size.

Thanks again for sharing those benchmarks!