Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

They are not, and that's the whole point of doing this research. If we can build good benchmark, models developers would have nice goal.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: