Hacker News

From a copyright perspective the only question that matters is this: Do we treat AI models like (Xerox) copiers or do we treat them like artists?

If we treat it like a copier, it's the end user who's responsible when they tell it to produce something that infringes someone else's copyrighted work. No different than if someone walked up to a machine and copied an entire book.

Furthermore, if the end user never distributes the result of a prompt, the question is moot anyway: copyright only matters when something gets distributed. No distribution == no violation of copyright.

If we treat AI like an artist, it is the owner/creator of the AI model that's responsible when it produces something that violates another's copyright. Since it is literally impossible to maintain a database of all copyrighted works in existence (in order to check whether something infringes), this option is untenable. It's not possible to implement unless we go back to requiring that all copyrights be registered (and provide that database to anyone who asks, thus distributing all those copyrighted works, which would defeat the purpose).

I very strongly believe that the courts will ultimately settle on treating AI like a copier. It's a tool/machine and should be treated as such by copyright law.






We treat them as models. We allowed them to be fitted on copyrighted data, arguing the research is an inherent public good. But now that these companies are directly competing with that material's copyright holders, it makes sense to reevaluate that assumption.

A good first step would be to mandate AI labs share their weights and methodology before commercial release or lose that privilege. This would spare universities and non-profits, while requiring commercial labs to contribute something back, be it in licensing fees or usable research.


A good argument. However, comparing an AI model to a Xerox machine is reductive and not a sound metaphor...

It cannot be treated as just a Xerox machine, but it can be treated as a Xerox machine that contains all the copyrighted works it was trained on (saved in the form of weights/biases), from which a user can inventively request combinations. In that case the AI model itself is a distribution of copyrighted works. Encrypting or transforming a copyrighted work and transmitting it is still a violation of copyright (afaik; ianal).

This is all to say, copyright, as it stands, needs heavy reform. I'm rather copyleft, because all of this is vestigial nonsense from an age when 1800s printing presses set the rules, and our thinking hasn't caught up yet.


What would happen if you made a lossy image compression format derived from tons of scraped, copyrighted images?

There's no generative ability, but any time you compress/decompress an image, the codec uses weights and biases learned from copyrighted works.
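A minimal sketch of what such a codec could look like, assuming a PCA-style scheme: a basis is "trained" on a corpus (simulated here with random arrays standing in for scraped images), and every later compress/decompress call runs through those learned weights. All names here are illustrative, not any real format.

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-in for a scraped corpus: 1000 flattened 8x8 grayscale patches.
# (In the hypothetical, these would be copyrighted images.)
corpus = rng.random((1000, 64))
mean = corpus.mean(axis=0)

# "Training": learn a compact basis from the corpus via SVD (PCA).
# These vectors are the codec's weights, derived from every image it saw.
_, _, Vt = np.linalg.svd(corpus - mean, full_matrices=False)
basis = Vt[:16]  # keep 16 of 64 components: lossy by construction

def compress(img):
    # 64 pixel values -> 16 coefficients in the learned basis
    return (img - mean) @ basis.T

def decompress(code):
    # approximate reconstruction from the 16 coefficients
    return code @ basis + mean

# A new image (not in the corpus) still passes through the learned weights.
img = rng.random(64)
code = compress(img)
approx = decompress(code)
```

The point of the hypothetical: `basis` is a function of every image in the corpus, yet no individual image can be read back out of it, which is exactly the ambiguity being asked about.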

Is that a violation?


Nothing, and no:

Only distributing it to others is when copyright is an issue. Private translations/transformations are unenforceable. You can mark up a book you've bought as much as you want.

copyright makes no sense;


Treat them as search engines.

This is wrong on so many counts, you should not be giving legal judgments in comments. As one example, “no distribution == no violation of copyright” is incorrect.


