
It seems way worse than other small models; it even responds with complete non sequiturs. I think my favorite small model is still DeepSeek distilled with Llama 8B.


The key here is multimodal.



