By probe, I mean observe the internal activations. There are methods that can su...

		astrange on Feb 22, 2024 \| parent \| context \| favorite \| on: Unexpected responses from ChatGPT: Incident Report By probe, I mean observe the internal activations. There are methods that can suggest if it's hallucinating or not, and ones that can delete individual pieces of knowledge from the model.