I guess you might be able to get around this if you train only on legal content and can interpolate into content that would be illegal if a real recording.
However, I'm not sure whether there are any other applications for this specific interpolation scenario that would lead to it being developed, as the effort required to make it work is likely much higher.
Having the model produce realistic interpolations through areas of the latent space that had no associated training data is surely something that people will be trying to make happen.
However, I'm not sure whether there are any other applications for this specific interpolation scenario that would lead to it being developed, as the effort required to make it work is likely much higher.