
They are already above average human level on many tasks, like math benchmarks.


They really aren't better than humans at math or logic; they are good at the benchmarks because they are hyper-optimized for the benchmarks lol. But if you ask LLMs simple logical questions, they still get them wrong all the time.


Yes, there are certain tasks they're great at, just as AI has been superhuman in some tasks for decades.


But now they are good or even great at way more tasks than before because they can understand and use natural languages like English.


Yeah, and they're still underdelivering on their hype, and the improvements have vastly slowed down.


So are calculators …


If you ignore the part where their proofs are meandering drivel, sure.


Even if you don't ignore this part, they (e.g. o1-preview) are still better at proofs than the average human. Substantially better, even.



