Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Our goal is to benchmark on real world data. Which is often more complex than plain text. If we have to make the benchmark data easier for the model to perform better, it's not an honest assessment of the reality.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: