Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
AI Index Report 2025 (stanford.edu)
9 points by T-A 4 months ago | hide | past | favorite | 1 comment


I wonder how long before we start to acknowledge that AI labs are heavily gaming benchmarks and they are mostly useless as a way of judging model performance.

The latest one to be caught was Meta, but they've all been doing it for a while now.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: