> That's one of the reasons OpenAI are signing big dollar deals with media comp...

lukan · on May 29, 2024

"Since a bunch of media outlets have adopted GPT-powered writing tools, future tokens are presumably going to be far less valuable"

As long as a human verified the output, I think it is fine. Training on unverified data is bad.

swiftcoder · on May 30, 2024

You are still looping your own model's output into the training set regardless. Human verification may keep outright errors from creeping in, but it won't stop the model from biasing its own training set.