Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It would be a more interesting blog post if you showed that the app they built is actually production ready.

In my experience, letting LLMs work autonomously just produces code slop.



Not being controversial, but I genuinely believe it's user error when it's pure slop. There are too many people delivering real value with it for that not to be the case. I think there is an art to how you prompt, what problems you give it, how you break things down, etc. Also knowing what good looks like and being able to steer it in the right direction. The 20 agents I had running at various times were still taking input from me and I was interjecting when I saw them going off track (which happened!).

Model matters too. Lots of what I did was with Opus. Significantly better than Sonnet.




Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: