Are there any ways to train it to maintain attention on the original prompt no m...

		Tostino on May 23, 2023 \| parent \| context \| favorite \| on: RWKV: Reinventing RNNs for the Transformer Era Are there any ways to train it to maintain attention on the original prompt no matter the distance from it, and selectively pay attention to its own output where relevant?

Instruction training. This is a WIP