Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I get about the same speed (35 tok/s) out of 13B Llama2 on a 4070 Ti, FWIW.


Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: