Sure that can happen but it hasn’t been my experience. I just spent a whole day ...

cheshire_cat · 2026-05-02T11:55:09 1777722909

Sounds promising, thanks for your report.

I didn't mean to say that they're not cheaper to run; Artificial Analysis also shows that they're cheaper. My main point was that it's important to look at token efficiency as well, not only cost per token, to get the full picture.
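To make that concrete, here is a rough sketch with made-up numbers. The prices and token counts are hypothetical, not measurements of any real model; the only point is that cost per task is tokens used times price per token, so a model that is cheaper per token can still cost more per task if it burns more tokens.

```python
def task_cost(tokens_used: int, price_per_million: float) -> float:
    """Dollar cost of one task: tokens consumed times the per-token price."""
    return tokens_used * price_per_million / 1_000_000


# Hypothetical figures, for illustration only.
cheap_but_verbose = task_cost(tokens_used=400_000, price_per_million=1.0)
pricey_but_terse = task_cost(tokens_used=60_000, price_per_million=5.0)

print(f"cheap per token:  ${cheap_but_verbose:.2f} per task")
print(f"pricey per token: ${pricey_but_terse:.2f} per task")
```

With these invented inputs the model that is 5x more expensive per token ends up cheaper per task, which is why both numbers matter.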

cassianoleal · 2026-05-02T14:57:16 1777733836

I agree! I don't find Claude models to be particularly efficient anyway, though. Maybe when running through Claude Code? I don't know; I tried it a while back, but it didn't suit me and I kept hitting bugs, so I dropped it in favour of a tool that does something closer to what I want rather than what the provider wants!

pedrosorio · 2026-05-02T14:34:40 1777732480

What harness do you use?

cassianoleal · 2026-05-02T14:54:50 1777733690

Mostly OpenCode but I've been experimenting with Pi a bit lately.

I use Agent Hive [0] for more complex tasks. It sends off subagents with models and parameters I can configure per agent (e.g. a low-temperature coder, a higher-temperature agent with some top_k / top_p for research and architecture, etc.).

[0] https://github.com/rretsiem/opencode-hive
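
For anyone wondering what per-agent parameters look like in practice, here is a minimal sketch of the idea. This is not Agent Hive's or OpenCode's actual config format; the `AgentProfile` class, the model names, and the specific values are all hypothetical and only illustrate giving each role its own sampling settings.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class AgentProfile:
    """Hypothetical per-agent settings; not a real Agent Hive schema."""
    model: str
    temperature: float
    top_p: Optional[float] = None
    top_k: Optional[int] = None

    def sampling_params(self) -> dict:
        """Return only the sampling parameters that were explicitly set."""
        params = {
            "temperature": self.temperature,
            "top_p": self.top_p,
            "top_k": self.top_k,
        }
        return {name: value for name, value in params.items() if value is not None}


# Placeholder model names and values, chosen only for illustration.
PROFILES = {
    "coder": AgentProfile(model="example-coder-model", temperature=0.1),
    "researcher": AgentProfile(
        model="example-research-model", temperature=0.8, top_p=0.95, top_k=40
    ),
}

for role, profile in PROFILES.items():
    print(role, profile.model, profile.sampling_params())
```

The low temperature keeps the coder close to deterministic, while the research profile samples more widely, with top_p / top_k limiting how far into the tail it can go.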