AI Loophole #1; Your GitHub README.md

Elias Griffin@lemmy.world · edit-2 12 days ago

Blaster M@lemmy.world · 12 days ago

So… if you don’t want the world to see your work, why are you hosting it publicly?

AlexanderESmith@social.alexanderesmith.com · 12 days ago

“The world seeing [their] work” is not equal to “Some random company selling access to their regurgitated content, used without permission after explicitly attempting to block it”.

LLMs and image generators that weren’t trained on content wholly owned by the group creating the model are theft.

Not saying LLMs and image generators are innately thievery. It’s like the whole “illegal mp3” argument. mp3s are just files with compressed audio. If they contain copyrighted work, and were obtained illegitimately, THEN they’re thievery. Same with content generators.

Victoria Antoinette @lemmy.world · 11 days ago

stealing removes something. copying makes more of it. it’s not theft

AlexanderESmith@social.alexanderesmith.com · 11 days ago

The MPAA and music industry would beg to differ. As would the US courts, as well as any court in a country we share copyright agreements with.

Consider that if a movie uses a scene from another movie without permission, or a music producer uses a melody without permission, or either of them uses too much of an existing song without permission, everyone sues everyone else, and they win.

Consider also that if a large corporation uses an individual’s content without permission, we have documented cases of the individual suing, and winning (or settling).

Some other facts to consider:

An mp3 file is not inherently illegal. Nor is a torrent file/tracker/download.
If the mp3 file contains audio you don’t own the rights to, it is illegal, same for the torrent you used to download/distribute it. In the eyes of the law, it’s theft.
A trained LLM or image generation model is not inherently theft, if you only use open-source or licensed/owned content to train it.
(at odds in our conversation) What of a model that was trained with content the trainer didn’t own?

In the mp3 example, it’s largely an individual stealing from a large company. On the Internet, this is frequently cheered as the user “sticking it to the man” (unless, of course, you’re an indie creator who can’t support yourself because everyone’s downloading your content for free). Discussions regarding the morality of this have been had - and will be had - for a long time, but its legality is a settled matter: It’s not legal.

In the case of “AI” models, it’s large companies stealing from a huge number of individuals who have no support or established recourse.

You’re suggesting that it’s fine because, essentially, the creators haven’t lost anything. This makes it extremely clear to me that you’ve never attempted to support yourself as a creator (and I suspect you haven’t created anything of meaning in the public domain either).

I guess what it comes down to is this: if creators can be stolen from without consequence, what incentive does anyone have to create anything? Are you going to work your 40-60 hours a week, then come home and work another 20-40 hours to create something for no personal benefit other than the act of creation? Truly, some people will. Most won’t.

Victoria Antoinette @lemmy.world · 11 days ago

this doesn’t address what I said at all.

AlexanderESmith@social.alexanderesmith.com · 11 days ago

The first sentence directly addresses your comment “it’s not theft” with “the law says it is”.

The rest of the post attempts to explain why it is so and some of the moral or ethical discussions surrounding some examples.

Eiim@lemmy.blahaj.zone · 11 days ago

Copyright violations ≠ conversion. Those are two completely different sets of laws. If you’re going to argue that legal definitions back you up, at least make sure you know what they are?

Victoria Antoinette @lemmy.world · 11 days ago

the law does not say it is theft.

Victoria Antoinette @lemmy.world · 11 days ago

people made art, music, and stories long before copyright

Hawk@lemmy.dbzer0.com · 11 days ago

If I copy McDonald’s site one-to-one for my own restaurant and just change the name, I can expect to be sued.

And yet, their site is available publicly?