Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Yeah, that’s why I like the smaller models too. Big context windows and intelligent enough most of the time. They don’t follow instructions as well as the larger models ime. But then on the flip side the reasoning models struggle to deviate. I gave deepseek an existential crisis by accident the other day lol.

Agreed on personalities. Phi, I think because of the curated training data comes across as very dry.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: