Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It’s all a dice game with these things, you have to watch them closely or they start running you (with bad outcomes). Disclaimers aside:

Sonnet is better in the small, by a lot. It’s sharply up from idk, three months ago or something when it was still an attractive nuisance. It still tops out at “Best SO Answer”, but it hits that like 90%+. If it involves more than copy paste, sorry folks, it’s still just really fucking good copy paste.

But for sheer “doesn’t stutter every interaction at the worst moment”? You’ve got to hand it to the ops people: 4o can give you second best in industrial quantity on demand. I’m finding that if AI is good enough, then OpenAI is good enough.



>If it involves more than copy paste, sorry folks, it’s still just really fucking good copy paste.

Are you sure you're using Claude 3.5 Sonnet? In my experience it's absolutely capable of writing entire small applications based off a detailed spec I give it, which don't exist on GitHub or Stack Overflow. It makes some mistakes, especially for underspecified things, but generally it can fix them with further prompting.


I’m quite sure what model revision their API quotes, though serious users rapidly discover that like any distributed system, it has a rhythm to it.

And I’m not sure we disagree.

Vercel demo but Pets is copy paste.


We have entered the era of generic fashionable CRUD framework demo Too Cheap To Hawk.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: