
To be fair, such tests are designed with the human mind in, well, mind, and assume that various hard-to-quantify variables – ones that the tester is actually interested in – correlate with test performance. But LLMs are alien minds with very different correlations. It’s clear, of course, that ChatGPT’s language skills vastly exceed those of an average 2-year-old, and indeed surpass the skills of a considerable fraction of the general adult population, but the generality of its intelligence is probably not above that of a human toddler.

