Hacker Newsnew | past | comments | ask | show | jobs | submit | more Pulcinella's commentslogin

Given how video game patents are in the news lately (with Nintendo suing the PalWorld devs for patent infringement rather than copyright infringement), I wonder how this would have turned out if Capcom managed to patent how SF2 works and sued for that instead.

Personally I think patenting game mechanics is absurd (far worse than software patents, even). It would be like an authoring patenting the three-act structure. More legal clarity around this would be nice. Maybe Judge Alsup could become a combo lab monster[0] as well to help decide the case!

[0]https://glossary.infil.net/?t=Lab


Even "plagiarism" is putting way to positive a spin on it. "Rampant copyright infringement" is more accurate.

I'm sure we all have our own feelings about IP law, but remember what happens to regular people who try stuff like this. I don't think the RIAA, Disney, or Nintendo (or the government) are going to be pleased to hear "it's not piracy! It's a transformative experience protected by fair use!"


Millenials might be the generation that both got threatened with jail for music copyright infringement violations as youth AND gets to have their job threatened by automated mass corporate copyright infringement in adulthood!


We wanted Aaron Swartz and we got Sam Altman.


Truly the darkest timeline.


"Haha, yeah, those scrappy Millennials - who knows where their breaking point is but I'm sure there's a fintech app for making that bet"


Learning from copyrighted works to create new ones has never been protected by copyright[1], and has never needed separate licensing rights. Until 2022, no one even suggested it, to a rounding error. If anything, people would have been horrified at the idea of being dinged because your novel clearly drew inspiration from another work.

That narrative only got picked up because people needed a reason to demonize evil corps that they already hated for unrelated reasons.

[1] Yes, if you create "new" works from your learning that are basically copies, that has always been infringement. I'm talking about the general case.


> Learning from copyrighted works to create new ones has never been protected by copyright

The term "learning" (I presume from "machine learning") shoulders a lot of weight. If we describe the situation more precisely, it involves commercially exploiting literature and other text media to produce a statistical corpus of texts, which is then commercially exploited. It's okay if that is licensed, but none of the AI companies bothered to license said original texts. Some (allegedly) just downloaded torrents of books, which is clear as day piracy. It has little to do with "learning" as used in common English — a person naturally retaining some knowledge of what they've consumed. Plain English "learning" doesn't describe the whole of what's happening with LLMs at all! It's a borrowed term, so let's not pretend it isn't.

What's happening is closer to buying some music cassettes, ripping parts of songs off them into various mixtapes, and selling them. The fact that the new cassettes "learned" the contents of the old ones, or that the songs are now jumbled up, doesn't change that the mixtape maker never had a license to copy the bits of music for commercial exploitation in the first place. After the infringement is done, the rest is smoke and mirrors...


>The term "learning" (I presume from "machine learning") shoulders a lot of weight. If we describe the situation more precisely, it involves commercially exploiting literature and other text media to produce a statistical corpus of texts, which is then commercially exploited.

It's "commercially exploiting literature" in the same sense that an author would if they read a bunch of novels and then wrote their own based on what the learned from the pre-existing text. The whole point in dispute is whether that turns into infringement when an AI does it.

By labeling only one of them as "commercially exploiting literature" but not the other, you're failing to distinguish them in any meaningful way, and basically arguing from name-calling.

>It has little to do with "learning" as used in common English — a person naturally retaining some knowledge of what they've consumed. Plain English "learning" doesn't describe the whole of what's happening with LLMs at all! It's a borrowed term, so let's not pretend it isn't.

That's fair, that you can't just call them both "learning" and call it a day. But then the burden's on you to show how machine learning breaks from the time-honored tradition of license-free learning/"updating what you write based on having viewed other works". What's different? What is it about machine learning that makes it infringement in a way that it isn't when humans update their weights from having seen copyrighted works?

>What's happening is closer to buying some music cassettes, ripping parts of songs off them into various mixtapes, and selling them. The fact that the new cassettes "learned" the contents of the old ones, or that the songs are now jumbled up, doesn't change that the mixtape maker never had a license to copy the bits of music for commercial exploitation in the first place.

Okay, but (as above) to make that case, you'd need to identify where "acceptable" learning/"updating what you write based on having viewed other works" crosses over into the infringing mixtape example, and I have yet to see anyone try beyond "they're evil corps, it must be bad somehow".


> narrative only got picked up because people needed a reason to demonize evil corps

Either they aren't evil in which case they're being demonized, or they're already evil in which case demonization is redundant.

Keeping aside the motives of people, what is clear is that scale effects of AI cannot be ignored. An AI "learning" millions of pieces of content in a short span is not the same as humans spending time, effort and energy to replicate someone's style. You can argue that its 'neural nets' in both cases, but the massive scale is what separates the two.

A village is not a large family, a city is not a large village, ... and all that.


>Either they aren't evil in which case they're being demonized, or they're already evil in which case demonization is redundant.

If you were trying to be charitable rather than clever, you would have read "evil corps" as "corps that the critic regards as evil".

>Keeping aside the motives of people, what is clear is that scale effects of AI cannot be ignored.

Okay, so just give some kind of standard -- any clear, articulable standard -- for how and why the scale matters. It's a cop-out to just rest your case on a hand-wavy "it changes at scale".


> Learning

Human learning doesn't involve making a copy (or any other use of an exclusive rights) as defined in copyright law (the human brain not being a fixed medium), AI training does, because digital storage is.

AI training may fall into the Fair Use exception in the US, but it absolutely does not fall through the same gap that makes human learning not even eequire fair use analysis since it doesn't meet the definitions ser out for a violation in the first place.


>Human learning doesn't involve making a copy (or any other use of an exclusive rights) as defined in copyright law (the human brain not being a fixed medium), AI training does, because digital storage is.

That's just false -- AI models themselves only store parameter weights, which represent a high-level, aggregated understanding across all data that was learned on, i.e. what human brains do. This is clear from all the examples where you have to painstakingly trick them into producing exact text.

And even if they did store something that's "effectively a copy", that's no more copyright infringement than when Google caches a site in order to handle search queries. It's not copyright infringement until they start redistributing [non-fair-use] content from the sites.


Sorry, that was expressed less precisely than I intended.

Human learning doesn't involve fixing anything, whether or not it would be a copy, into a fixed medium, and therefore even the question of whether or not it would be a copy if it were in a fixed medium, much less the question of whether or not such a copy would be exempted due to fair use, does not arise.

AI training does involve fixing something into a fixed medium, which makes it disanalogous to human learning. This raises the question of whether the thing fixes (parameter weights) is or is not legally a lossy mechanical copy of the training corpus, which further potentially raises the question of whether the incorporation of individual copyright protected works into the training corpus combined with its use in training would (before considering exceptions like Fair Use) violate the copyright on the works involved and whether, if it would, the use nevertheless satisfies the requirements of one or more of those exceptions (Fair Use being the one usually argued for.)

> And even if they did store something that's "effectively a copy", that's no more copyright infringement than when Google caches a site in order to handle search queries.

Among the reasons that Google removed outside access to its cached copies is that the evolution of the web meant that providing them increasingly potentially had negative revenue on the original content providers, which harms the Fair Use case for caching, since effect on the market of the original work is a fair use factor. (Google's cache was ruled as fair use based on the factual situation -- including implied consent, which AI model trainers have a much weaker arguent for -- in 2006, but changes in the facts relevant to fair use can change the outcome.)

But AI training is not so analogous to Google's cache (much less the situation of Google's cache in 2006) that one can simply leap with no analysis from one being fair use to the other in the first place. That's applying wishful thinking, not Fair Use analysis.


>Human learning doesn't involve fixing anything,... AI training does involve fixing something into a fixed medium, which makes it disanalogous to human learning.

No, it breaks one part of the analogy, which has never been considered relevant: how long the learning persists. Yes, computers can store weights of a AI model much longer than any human could. But storing this "impact of having viewed a copyrighted work" has never been a factor in considering something infringing, regardless of how long it's stored. Courts don't consider it infringement if you simply use what you have learned from reading previous novels (the updates to your brain's neural weights) in producing new content.

Your argument is saying that the "fixedness of storing model weights" (aggregated high level understanding) can cause it to be infringement. That's without precedent. It would imply that if you're really good at "fixing" (remembering) the style of an author you read 50 years ago, it somehow crosses over into infringement because you "fixed that understanding into a medium" (your brain). That's not how it works at all.

>Among the reasons that Google removed outside access to its cached copies

I wasn't referring to the cached part of a site that Google serves to users, but the undistributed cache that they hold merely to know which sites to point you to, so you're not addressing the analogy. Here Google does store an exact copy (of at least some portions) and even then it's not considered copyright infringement until they start redistributing that content (or at least, too much of it).

My point was that acceptance of this practice even further bolsters the case that AI models aren't infringing, because, even if they did store exact copies, that generally not considered infringement until they start serving close-enough copies of the original copyrighted content.


Good thing my MP3 files only store a psycho-acoustic model of that Metallica album!

I mean sure, if you go to painstaking lengths, you can trick your computer into making some noise that seems vaguely similar to the copyrighted work it was trained on, but I trust the consumer to make their own fair use evaluation.


Its hard not to demonize large corps that often enjoy the governments legal largess when there are many examples of individuals with ruined lives for the same behavior.


"It's not piracy! I'm learning from all this media I didn't pay for!"


I had this argument presented to me and I wasn't sure what to do with it.

> Humans are allowed to "absorb" art around them into their brains and generate derivative art. People may copy Miyazaki's style... why shouldn't an AI farm be allowed to?

Let's put aside for a moment that AI may have "consumed" some art without a license (e.g., "google books" - did google purchase every book?).


except lawyers keep saying "fanart is actually technically illegal" and resinging/changing lyrics in songs isn't enough to be protected by "fair use" stuff

if anything, I'd campaign for "we should limit copyright because it already doesn't work for Ai"


I wasn't intending to include fan art.

Copying Miyazaki's style ... or copying Monet's style... those aren't fan art.


Why not? I’m a fan of X and make art in the style of X using the tools at my disposal (AI). Why is that not fanart?


They are. I meant the subset of fan art of the infringing type where you make fan art of, say, Mickey Mouse and sell stickers.


The same legal rule applies to both for determining whether something is a derivative work.

No one is stopping you from using similar proportions or colors as Miyazaki to draw a character. You are also allowed to draw your own interpretation of an electric mouse-like monster.

Copyright infringement occurs if that character looks exactly like say Totoro or Pikachu. That is not “in the style of”, that is copying.

A problem with LLMs is that since their corpus is so large, it is difficult to identify when any given output is crossing that line because a single observer’s knowledge of the works influencing the output is limited. You might feed it a picture of your grandfather and it returns an almost exact copy of a grandfather character from a Miyazaki film you haven’t seen. If you don’t share the output with others, it might never be noticed that the infringement occurred.

The given argument conflates the slightest influence with direct copying. It is a reductive take that, personally, I’ve found emblematic of pro-LLM arguments.


Thanks for helping pick apart the argument presented to me.

I don't like the idea that photos I've published on, say, flickr have been pulled into these. Especially stuff I've published with creative commons non-commercial use.


> People may copy Miyazaki's style... why shouldn't an AI farm be allowed to?

People may take a penny from the tray at the 7-11, so why can't an AI farm take pennies from all the trays? Or take them from a much bigger tray and do it a couple of million times?


There is no de jure legal requirement that the RIAA, Disney, Nintendo, or the government be "pleased to hear" about new technology.

And, while copyright prohibits some sorts of reproduction of copyrighted materials, it doesn't give rightsholders veto power over all downstream uses of legal copies.


[flagged]


Scale matters. I might miss the context though. Happy to be corrected


In comparison to other places in the world (especially Asia), there is somewhat lower stigma regarding ADHD in the U.S.

I would add the UK to the stigma list as well. The UK government and NHS are pretty rabidly ADHD-denialist and make it incredibly difficult to get a diagnosis.


> I would add the UK to the stigma list as well. The UK government and NHS are pretty rabidly ADHD-denialist and make it incredibly difficult to get a diagnosis.

Don't you also have to wait years on a wait list for an evaluation? I feel like I remember hearing that a few years ago.


NHS waiting list is a few years but they will pay for you to use a private clinic, my waiting list is 5 months.


I was surprised when I moved to Europe that at least several (many?) EU countries don’t even authorize any ADHD drugs to be prescribed. So sure, maybe the doctor knows you have it, but even if they say you have it… so what, what can they do about it?

As someone with some horrific environmental allergies, I see it as similar. The US has a ton more allergy medications available (both RX or OTC) than any of the other countries I’ve lived in or visited, so of course there are more people who have a diagnosis. In Finland they just stared at me wide-eyed and shook their heads when I asked if they had Sudafed. “Oh no, very strong…”


They literally pay for you to use a private clinic to ensure everyone can be assessed quickly. ADHD is legally protected and recognised. Lumping it in with most of Asia is ridiculous.


Where in the NHS is that? Maybe just an English thing. I had to go private on my own dime and thankfully my GP took over prescribing, which is uncommon and becoming increasingly so. They also put me on the NHS waiting list, back when you could still get on it without being severe, and it was only last week I had my appointment almost 4 years after being put on the list.


It's called Right to Choose and it may be England only. The NHS waiting list here is also over 3 years!


Does the EU not have protections against bills of attainder?


Not really, no. The lack of such protection is one of the many flaws in the foundational EU treaties and charters.


Six sentences can hardly be considered a "rant."


Hard to determine comment accuracy. Industry will manufacture false claims in order to stop environmentalism because they believe in material wealth over all else.


Also true. That is the information space we currently inhabit.

Easy to forget the faked species called the snail darter, invented to be endangered to try to stop a dam being built.


You have rights with a public school system you don’t have with private school system. And not just “it’s a government run school so they have to follow the constitution.” You have explicit political rights. For example, you have the right to vote in school board elections and bonds. You don’t get to vote for the board of EduCorp LLC.


I do not wish for taxpayer dollars to fund private school (religious or otherwise) or homeschooling. One of the things I love most about this country is that America is a civic nation. This country is built on shared civic ideals, values, and vision. This country is not built on a specific racial, ethnic, or religious identity like many, many old-world Eurasian nations.


> This country is built on shared civic ideals, values, and vision.

Sadly I think that died many decades ago and now we're just an economic zone


"This country is not built on a specific racial, ethnic, or religious identity like many, many old-world Eurasian nations."

I know there's an ideal of what America should be, but don't muddle the reality. America was built on whiteness (whose definition shifted over time) and class identity.


It's true, but that's not a reason to dump the ideals. Quite the contrary, that's a reason to embrace the ideals even more.

A professor once taught me that the law is not meant to represent the people, but to frame the ideal we want to be.


Really? I think taxpayer dollars should be focused on an outcome (creating a cohesive society of high performance critical thinkers able to solve problems independently and in groups would be my outcome measure). With that being said, I see quite a lot of problems with the public system (suboptimal teacher provision partially driven by union pushing for things that help their union members at the expense of students, weird administrator to infrastructure spending ratios, weird far left ideologies being pushed on kids that I think are far outside school responsibility into parent responsibility, etc) and vouchers seem like a reasonable way to address this by taking funding out of the system to places that can perform better.

To me, it seems more like a failure of voucher design than anything. The solution is probably to make the voucher worth up to 80 or 90% of what it costs to teach the kid in the public system (because if it doesn't cost less why are we doing it?) and force any school receiving a voucher to not charge more than a small amount more than the voucher, make the same restrictions on religion present in public enforced in private, etc.

Declining public enrollment is a problem for crappy teachers and their union more than anything, it's not something to be concerned about. Fire the unneeded teachers and administrators to save costs and it should sort itself out medium term.


Harvard was given public money in 1636. It was a religious school and trained clergy.


Britain, had no separation of church and state.

People who objected had no right to express their displeasure.

And the United States was a twinkle in the eye of men who wouldn't be born for like 70 years. In other words, non-existent.

Nowadays, we have the right to ask for something back if we give universities money man.


That's weird, I could've sworn America was a capitalist oligarchy working to drive down education to produce a cheaper and less educated labour force.


> One of the things I love most about this country is that America is a civic nation

What are you a communist? /s

Jokes aside, with the recent election, the country has been sold to capitalists and oligarchs.

I genuinely wouldn't be surprised if the school system in some states is fed to vultures to suck money from.

You may think it is impossible in America but remember anything has been possible in America, including enslavement and redlining. Selling schools to vultures is not at all outside the realm of possibilities here.


> Jokes aside, with the recent election, the country has been sold to capitalists and oligarchs.

My question is how long will it take for it to be reversed.

I have a deep respect for the ideals upon which the US was founded, but it's getting increasingly hard to respect the ground truth of what the country has become.


Government entities use taxpayer money to fund legal action (and pretty much everything else they do) all the time. I am not a lawyer but I believe it is called “administrative law” and is completely normal. To think so otherwise is completely ahistorical and incorrect.


"Government entities use taxpayer money..."

Individual people - government employees - decided to do this.

They decided to use taxpayer funds (that were specifically allocated for educating students) to finance legal action aimed at preventing parents from choosing which schools would receive taxpayer funding to educate their children.

In other words, government employees spent public education money on litigation designed to keep money flowing into their own pockets, rather than allowing those funds to follow students to whatever educational environment their families determined would best serve them.


Of course it’s individual people. The political entity they work for obviously has no will of its own. It is terribly unsurprising to me that people tasked with a job by the government would do what they can to best carry out that job.

Are you scandalized when a city and the county it is in sue each other and the transportation department over who has responsibilities for what sections of road and how much money who should get rather than letting the citizens who drive on those roads decide? Where do you draw the line for what money the government decides how to spend and what money individuals should decide how to spend?


"It is terribly unsurprising to me that people tasked with a job by the government would do what they can to best carry out that job."

Right, but the government tasked school district employees with educating kids who enroll in government-run schools. The legislature explicitly created a voucher scheme, and it's unclear why school district employees would think it was their job to prevent this from happening.


Declining enrollment and pulled funding can destroy the ability of a district to carry out the mission tasked to it by the government. This is not just hypothetical. The rural, conservative school district I used to work for (and may, many like them as pointed out in the article) and the families that rely on them do not like school vouchers, even if they otherwise support everything else that politician does. If too many students unenroll and utilize school vouchers, the whole district could collapse due to lack of funding and screw over all the remaining students.

This is compounded by the fact that vouchers often pull out more funding that the government otherwise would provide to the district for that student. This is mentioned in the article as well. e.g. the state pays the district $5k a year per student, but pulls $8k a year for a student that uses a voucher.

Also, in a lot of these rural areas there are no other good options anyway besides the school district. It’s either the district, a scammy online charter school, a scammy fly-by-night charter school, or homeschooling.


"This is compounded by the fact that vouchers often pull out more funding that the government otherwise would provide to the district for that student."

Can you please help me understand the "This is compounded" part here?


In other words, a private school gets more money from a student’s voucher than a district gets from having that same student enrolled. The voucher doesn’t just move the same amount of money around. A voucher pulls more money from education funding than then enrolling a student in a public school does.


There's an argument to be made that maintaining the integrity of the school district is, in fact, serving the education of the students.


Parents probably care more about the education of their children, than government employees do. I think those parents can be trusted to know what is "serving the education of the students".


And plenty of parents don't.


What are you gonna do about the ones that don't care?


Have a well-funded school system that exists no matter what to give their children at least something.


Yeah Windows does the same thing. It's something indie game devs run into. If you release on Steam, Steam launches your game. Otherwise you have to pay a certificate authority to sign your game or Windows Smart Screen will do the same "this code is sketchy. Are you really really really sure you want to run it?" thing MacOS does for non-notarized apps.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: