
There are multiple books about this topic now. What are your takes on the alternatives? Why did you choose this one? Appreciate your thoughts!


Many regard it as "the best" book on the topic. Like Giles Thomas, I found that the book focuses on the details and on how to write the lower-level code, without providing the big picture.

I am personally not very interested in that, as these details are likely to change rather quickly, while the principles of LLMs and transformers will probably remain relevant for many years.

I have been looking for, but have failed to find, a good resource that approaches it the way 3blue1brown [1] explains it and then goes deeper from there.

The blog series from Giles seems to take the book and add the background to the details.

[1] https://m.youtube.com/watch?v=wjZofJX0v4M



