Hacker News
anvuong
11 months ago
on:
DeepSeek's multi-head latent attention and other K...
Neither. Think of it as something like Redis or memcached. It's external to the program, and the program will run just fine without it. But it avoids a lot of duplicate work.
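To make the analogy concrete, here is a rough toy sketch in Python, not how any real inference engine does it. Everything in it (`project`, `attend`, `generate`, the arithmetic) is made up for illustration; the only point is that the cache is optional, the output is identical either way, and the cached run repeats far less work.

```python
from typing import Dict, List, Optional, Tuple

calls = 0  # counts how often the "expensive" projection runs


def project(token: int) -> Tuple[int, int]:
    """Hypothetical stand-in for computing one token's key and value."""
    global calls
    calls += 1
    return (token * 3 + 1, token * 7 + 2)


def attend(tokens: List[int], cache: Optional[Dict[int, Tuple[int, int]]]) -> int:
    """Toy 'attention' over every token so far; the cache is optional."""
    total = 0
    for pos, tok in enumerate(tokens):
        if cache is not None and pos in cache:
            k, v = cache[pos]          # reuse earlier work
        else:
            k, v = project(tok)        # recompute from scratch
            if cache is not None:
                cache[pos] = (k, v)
        total += k * v
    return total % 50                  # toy "next token"


def generate(prompt: List[int], steps: int, use_cache: bool) -> Tuple[List[int], int]:
    """Generate `steps` tokens; return the sequence and the projection count."""
    global calls
    calls = 0
    cache: Optional[Dict[int, Tuple[int, int]]] = {} if use_cache else None
    tokens = list(prompt)
    for _ in range(steps):
        tokens.append(attend(tokens, cache))
    return tokens, calls


if __name__ == "__main__":
    without, n_without = generate([1, 2, 3], 4, use_cache=False)
    with_cache, n_with = generate([1, 2, 3], 4, use_cache=True)
    print(without == with_cache, n_without, n_with)
```

With a 3-token prompt and 4 generated tokens, the uncached run calls `project` 3 + 4 + 5 + 6 = 18 times, while the cached run calls it once per position, 6 times in total. Same answer, less duplicate work.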