I'd like to see detailed benchmarks run by other unaffiliated organizations.
So basically, there is not much reason to go beyond DeepSeek-R1-Distill-Qwen-32B, at least for coding tasks.
https://glama.ai/models/deepseek-r1-distill-qwen-32b
I am using it with Cline VSCode extension to write code.
It works impressively well for a model this size.
Thanks again for sharing those benchmarks!