> https://arxiv.org/abs/2210.17323 I've read the paper and to be honest I'm not ... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		rnosov on March 20, 2023 \| parent \| context \| favorite \| on: The genie escapes: Stanford copies the ChatGPT AI ... > https://arxiv.org/abs/2210.17323 I've read the paper and to be honest I'm not sure what to make of it. Their headline benchmark is perplexity on WikiText2 which would not be particularly relevant to most users. If you look at the tables in the appendix A.4 with some more relevant benchmarks you'll sometimes find that straight RTN 4 bit quantisation beats both GPTQ and even full 16 bit original! No explanation of it is given in the paper.

sebzim4500 on March 20, 2023 [–]

Some of those benchmarks have a pretty small sample size IIRC, might just be coincidence that the noise introduced by RTN just happens to slightly improve them.

GPTQ beats RTN on almost every benchmark at almost every size, though.

Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact