Hacker News
Tostino on May 23, 2023 | on: RWKV: Reinventing RNNs for the Transformer Era
Are there any ways to train it to maintain attention on the original prompt no matter the distance from it, and selectively pay attention to its own output where relevant?
pico_creator on May 23, 2023
Instruction training. This is a WIP