The FSF doesn't usually sue for copyright infringement, but when we do, we settle for freedom
"Yeah we can't prosecute this person for stealing your car, because you haven't considered how they're going to get to work"
This is the reason why AI companies won't let anyone inspect which content was in the training set. It turns out the suspicions from many copyright holders (including the FSF) was true (of course).
Anthropic and others will never admit it, hence why they wanted to settle and not risk going to trial. AI boosters obviously will continue to gaslight copyright holders to believe nonsense like: "It only scraped the links, so AI didn't directly train on your content!", or "AI can't see like humans, it only see numbers, binary or digits" or "AI didn't reproduce exactly 100% of the content just like humans do when tracing from memory!".
They will not share the data-set used to train Claude, even if it was trained on AGPLv3 code.
> "Therefore, we urge Anthropic and other LLM developers that train models using huge datasets downloaded from the Internet to provide these LLMs to their users in freedom"
> We are a small organization with limited resources and we have to pick our battles, but if the FSF were to participate in a lawsuit such as Bartz v. Anthropic and find our copyright and license violated, we would certainly request user freedom as compensation.
Sounds more like “we can’t and won’t sue, but this is the kind of compensation that we think would be appropriate”
They don't have the rights to distribute the training data.
mjg59•2h ago
(Edit: In the event of it being changed to match the actual article title, the current subject line for this thread is " FSF Threatens Anthropic over Infringed Copyright: Share Your LLMs Freel")
lelanthran•1h ago
Not a nothing burger, but not totally insignificant either.
mjg59•1h ago
jfoster•1h ago
mjg59•28m ago
eschaton•1h ago
mjg59•1h ago
boramalper•1h ago
sunnyps•1h ago
Bombthecat•1h ago
agile-gift0262•1h ago
Bombthecat•39m ago
tsimionescu•27m ago
teiferer•1h ago
FSF licenses contain attribution and copyleft clauses. It's "do whatever you want with it provided that you X, Y and Z". Just taking the first part without the second part is a breach of the license.
It's like renting a car without paying and then claiming "well you said I can drive around with it for the rest of the day, so where is the harm?" while conveniently ignoring the payment clause.
You maybe confusing this with a "public domain" license.
jcul•1h ago
"Sam Williams and Richard Stallman's Free as in freedom: Richard Stallman's crusade for free software"
"GNU Free Documentation License (GNU FDL). This is a free license allowing use of the work for any purpose without payment."
I'm not familiar with this license or how it compares to their software licenses, but it sounds closer to a public domain license.
kennywinker•53m ago
> 4. MODIFICATIONS
> You may copy and distribute a Modified Version of the Document under the conditions of sections 2 and 3 above, provided that you release the Modified Version under precisely this License, with the Modified Version filling the role of the Document, thus licensing distribution and modification of the Modified Version to whoever possesses a copy of it. In addition, you must do these things in the Modified Version:
Etc etc.
In short, it is a copyleft license. You must also license derivative works under this license.
Just fyi, the gnu fdl is (unsurprisingly) available for free online - so if you want to know what it says, you can read it!
mjg59•29m ago
onion2k•22m ago
If I took a book and cut it up into individual words (or partial words even), and then used some of the words with words from every other book to write a new book, it'd be hard to argue that I'm really "distributing the first book", even if the subject of my book is the same as the first one.
This really just highlights how the law is a long way behind what's achievable with modern computing power.
karel-3d•42m ago
wikipedia used to be under FDL and they lobbied FSF to allow an escape hatch to Commons for a few months, because FDL was so annoying.
Dylan16807•1h ago
mjg59•20m ago
I used to be on the FSF board of directors. I have provided legal testimony regarding copyleft licenses. I am excruciatingly aware of the difference between a copyleft license and the public domain.