
The Neural Engine is its own block, and it is not used for local LLMs on Macs. It is optimized for power efficiency when running small models, so it's not a good fit for LARGE language models.

This change strictly adds matmul acceleration to each GPU core, and the GPU is what is actually being used for LLMs.


