If a savant with perfect recall remembers a text and rearranges it to create a marginally new text, he'd be sued for breach of copyright.
Only large corporations get away with it.
> If an AI reads 100 scientific papers and churns out a new one, it is plagiarism.
That is a specific claim that is being directly addressed and pretty clearly qualifies as "good faith".
???
Did you not literally comment the following?
>A new research paper is obviously materially different from "rearranging that text to create a marginally new text".
What did you mean by that, if that's not your claim?
If you draw a Venn Diagram of plagiarism and copyright violations, there's a big intersection. For example: if I take your paper, scratch off your name, make some minor tweaks, and submit it; I'm guilty of both plagiarism and copyright violation.
"To steal ideas from one person is plagiarism; to steal from many is research."
And honestly there is truth to it. Some people (at work, in real life, wherever) might come off as very intelligent, but the moment they say "oh I just read that relevant fact on reddit/twitter/news site 5 minutes ago" you realize they are just like you, repeating relevant information they consumed recently.
Any suits would be based on the degree the marginally new copy was fair use. You wouldn't be able to sue the savant for reading and remembering the text.
Using AI to create marginally new copies of copyrighted work is ALREADY a violation. We don't need a dramatic expansion of copyright law that says that just giving the savant the book to read is a copyright violation.
Plagiarism and copyright are two entirely different things. Plagiarism is about citations and intellectual integrity. Copyright is about protecting economic interests, has nothing to do with intellectual integrity, and isn't resolved by citing the original work. In fact, most of the contexts where you would be accused of plagiarism (reporting, criticism, education, research) are exactly the contexts that make fair use arguments much easier.
More like a speed-reader who retains a schema-level grasp of what they’ve read.
AIs don't have perfect recall.
What about loosely memorizing the gist of a copyrighted text? Is that a breach or fair use? What if a machine does something similar?
This falls under a rather murky area of the law that is not well defined.
https://www.copyright.gov/ai/Copyright-and-Artificial-Intell...
The average copyright holder would like you to think that the law only allows use of their works in ways that they specifically permit, i.e. that which is not explicitly permitted is forbidden.
But the law is largely the reverse; it only denies use of copyright works in certain ways. That which is not specifically forbidden is permitted.
And for the future, here's one heuristic: if there is a profound violation of the law anywhere that (relatively speaking) is ignored or severely downplayed, it is likely that interested parties have arrived at an understanding. Or in other words, a conspiracy.
[1] There are tons of legal arguments on both sides, but for me it is enough to ask: if this is not illegal and is totally fair use (maybe even because, oh no look at what China's doing, etc.), why did they have to resort to & foster piracy in order to obtain this?
European here, but why do you think this is so clear cut? There are other jurisdictions where training on copyrighted data has already been allowed by law/caselaw (Germany and Japan). Why do you need a conspiracy in the US?
AFAICT US copyright law deals with direct reproductions of a copyrighted piece of content (and also carves out some leeway for direct reproduction, like fair use). I think we can all agree by now that LLMs don't fully reproduce "letter perfect" content, right? What then is the "spirit" of the law that you think was broken here? Isn't this the definition of "transformative work"?
Of note is also the other big case involving books: the one where Google was allowed to process mountains of books. They were sued and allowed to continue. How is scanning & indexing tons of books different from scanning & "training" an LLM?
Contrast that with AI companies:
They don't necessarily want to assert fair use, the results aren't necessarily publicly accessible, the work used isn't cited, users aren't directed to typical sales channels, and many common usages do meaningfully reduce the market for the original content (e.g. AI summaries for paywalled pages).
It's not obvious to me as a non-lawyer that these situations are analogous, even if there's some superficial similarity.
To begin with, this very case of Perlmutter getting fired after her office's report is interesting enough, but let's keep it aside. [0]
First, plenty of lobbying has been afoot, pushing DC to allow training on this data to continue. No intention to stop or change course. [1]
Next, when regulatory attempts were in fact made to act against this open theft, those proposed rules were conveniently watered down by Google, Microsoft, Meta, OpenAI and the US government lobbying against the copyright & other provisions. [2]
If you still think, "so what? maybe by strict legal interpretation it's still fair use" -- then explain why OpenAI is selectively signing deals with the likes of Conde Nast if they truly believe this to be the case. [3]
Lastly, when did you last see any US entity or person face no punitive action whatsoever despite illegally downloading (and uploading) millions of books & journal articles; do you remember Aaron Swartz? [4]
You might not agree with my assessment of 'conspiracy', but are you denying there is even an alignment of incentives contrary to the spirit of the law?
[0] https://www.reuters.com/legal/government/trump-fires-head-us...
[1] https://techcrunch.com/2025/03/13/openai-calls-for-u-s-gover...
[2] https://www.euronews.com/next/2025/04/30/big-tech-watered-do...
[3] https://www.reuters.com/technology/openai-signs-deal-with-co...
[4] https://cybernews.com/tech/meta-leeched-82-terabytes-of-pira...
https://drewdevault.com/2020/08/24/Alice-in-Wonderland.html
https://drewdevault.com/2021/12/23/Sustainable-creativity-po...
If an artist produces a work they should have the rights to that work. If I self-publish a novel and then Penguin decides that novel is really good and they want to publish it, without copyright they'd just do that, totally swamping me with their clout and punishing me for ever putting the work out. That's a bad thing.
That would indeed be nice, but as the article says, that's usually not the case. The rights holder and the author are almost never the same entity in commercial artistic endeavors. I know I'm not the rights holder for my erroneously-considered-art work (software).
> If I self-publish a novel and then Penguin decides that novel is really good and they want to publish it, without copyright they'd just do that, totally swamping me with their clout and punishing me for ever putting the work out. That's a bad thing.
Why? You created influential art and its influence was spread. Is that not the point of (good) art?
There are definitely problems with the corporatization of ownership of these things, I won't disagree.
> Why? You created influential art and its influence was spread. Is that not the point of (good) art?
Why do we expect artists to be selfless? Do you think Stephen King is still writing only because he loves the art? You don't simply make software because you love it, right? Should people not be able to make money off their effort?
I can't speak for Stephen but I absolutely do. I program for fun all the time.
> Should people not be able to make money off their effort?
Is anyone arguing otherwise?
Maybe selling books? Maybe other jobs? The same way that they made money for thousands of years before copyright, really. Books and other arts did exist before copyright!
> and why would someone pay them, if their work is free to be copied at will?
I don't think it's really a matter of if people will pay them. If their art is good, of course people will pay them. People feel good about paying for an original piece of art.
The question is really more about if people will be able to get obscenely rich over being the original creator of some piece of art, to which the answer is it would indeed be less likely.
We didn't have modern novelists a thousand years ago. We didn't have mass production until ~500 years ago, and copyright came in in the 1700s. We didn't have mass-produced pulp fiction like we do today until the 20th century. There is little copyright-less historical precedent to refer to here; even if we carve out the few hundred years between the printing press and copyright, it's not as though everyone was mass consuming novels, the literacy rate was abysmal. I wonder what artist yearns for the 1650s.
> If their art is good, of course people will pay them.
You say this as if it were a fact, but that's not axiomatic. Once the first copy is in the wild it's fair game for anyone to copy it as they will. Who is paying them? Should the artists return to the days of needing a wealthy patron? Is patreon the answer to all of our problems?
> Maybe selling books?
But how? To who? A publishing house isn't going to pick them up, knowing that any other publishing house can start selling the same book the minute it proves popular, and if you're self-publishing and starting to make good numbers then the publishing houses can eat you alive.
> The question is really more about if people will be able to get obscenely rich over being the original creator of some piece of art, to which the answer is it would indeed be less likely.
No, the question is if ordinary people could make a living off their novels without copyright. It's very hard today, but not impossible. Without copyright it wouldn't be.
In our current society, that means they need some sort of means to make money from their work. Copyright, at least in theory, exists to incentivize the creation of art by protecting an artist's ability to monetize it.
If you abolish copyright today, under our current economic framework, what will happen is that people create less art because it goes from a (semi-)viable career to just being completely worthless to pursue. It's simply not a feasible option unless you fundamentally restructure society (which is a different argument entirely.)
The second one is the "just solve capitalism and we can abolish copyright entirely" argument which is... a total non-starter. Yes, in an idealized utopia, we don't need capitalism or copyright and people can do things just because they want to and society provides for the artist just because humans all value art just that much. It's a fun utopic ideal, but there's many steps between the current state of the world and "you can abolish the idea of copyright", and we aren't even close to that state yet.
The thing that'd set these companies apart is the services + quality of their work.
There are two reasons why it's a problem. The first reason is that any such abstraction is leaky, and those leaks are ripe for abuse. For example, in case of copyright on information, we made it behave like physical property for the consumers, but not for the producers (who still only need to expend resources to create a single work from scratch, and then duplicate it for free while still selling each copy for $$$). This means that selling information is much more lucrative than selling physical things, which is a big reason why our economy is so distorted towards the former now - just look at what the most profitable corporations on the market do.
The second reason is that it artificially entrenches capitalism by enmeshing large parts of the economy into those mechanics, even if they aren't naturally a good fit. This then gets used as an argument to prop up the whole arrangement - "we can't change this, it would break too much!".
I feel like you're shoving all information under the same label. The most profitable corporations are trading in information that isn't subject to copyright, and it's facts - how you drive, what you eat, where you live. It's newly generated ideas. Maybe it is in how the data is sorted, but they aren't copyrighting that either.
If we're going to overthrow artificial entrenchments of capitalism, I feel like there's better places to start than a lot of copyright. Does it need changes? Absolutely, there's certainly exploitation, but I still don't see "get rid of copyright entirely" as being a good approach. Weirdly, it's one of the places that people are arguing for that. Sometimes the criminal justice system convicts the wrong person, and there should be reform. It's also often criticized as a measure of control for capitalistic oligarchs. Should step one be getting rid of the legal system entirely?
The current illegality of the piracy website prevents them from offering a service as nice as Steam. It has to be a sketchy torrent hub that changes URLs every few months. If it was as easy as changing the url to freesteampowered.com or installing an extension inside the steam launcher, the whole "piracy is a service issue" argument loses all relevance. The industry would become unsustainable without DRM (which would be technically legal to crack, but also more incentivized to make harder to crack).
People would just delete the malware (DRM) out of the source code that is no longer restricted by copyright.
If your argument is that copyright is good because it discourages DRM, I think you have a very evidently weak argument.
Steam is the classic example of how this is effective. You compete with pirates by offering what they can't: a reliable, convenient service. DRM becomes more of a hindrance than a benefit in this situation.
Allowing pirates to offer reliable convenient pirate websites that are "so easy a normie can do it" would be a disaster for all the creative industries. You would need to radically change the rest of society to prevent a total collapse of people making money off art.
And that's not even touching the spurious lawsuits about musical similarity. That's what musicians call a genre...
It makes some sense for a very short term literal right to reproduction of a singular work, but any time the concept of derivative works comes into play, it's just a bizarrely dystopian suppression of art, under the supposition that art is commercial activity rather than an innate part of humanity.
I mean, owning an idea is kinda gross, I agree. I also personally think that owning land is kinda gross. But we live in a capitalist society right now. If we allow AI companies to train LLMs on copyrighted works without paying for that access, we are choosing to reward these companies instead of the humans who created the data upon which these companies are utterly reliant for said LLMs. Sam Altman, Elon Musk, and all the other tech CEOs will benefit in place of all of the artists I love and admire.
That, to me, sucks.
This is addressed in the second article I linked.
There is fair use, but fair use is an affirmative defense to infringing copyright. By claiming fair use you are simultaneously admitting infringement. The idea that you have to defend your own private expression of ideas based on other ideas is still wrong in my view.
This is exactly wrong. You can copy all of Harry Potter into your journal as many times as you want legally (creating copies) so long as you do not distribute it.
"copyright law assigns a set of exclusive rights to authors: to make and sell copies of their works, to create derivative works, and to perform or display their works publicly"
"The owner of a copyright has the exclusive right to do and authorize others to do the following: To reproduce the work in copies or phonorecords;To prepare derivative works based upon the work;"
"Commonly, this involves someone creating or distributing"
https://www.copyright.gov/what-is-copyright/
"U.S. copyright law provides copyright owners with the following exclusive rights: Reproduce the work in copies or phonorecords. Prepare derivative works based upon the work."
https://internationaloffice.berkeley.edu/students/intellectu...
"Copyright infringement occurs when a work is reproduced, distributed, displayed, performed or altered without the creator’s permission."
There are endless legitimate sources for this. Copyright protects many things, not just distribution. It very clearly disallows the unauthorized creation and reproduction of copies of copyrighted works.
> If we allow AI companies to train LLMs on copyrighted works without paying for that access, we are choosing to reward these companies instead of the humans who created the data upon which these companies are utterly reliant for said LLMs.
It's interesting how much parallel there is here to the idea that company owners reap the rewards of their employees' labor while doing no additional work themselves. The fruits of labor should go to the individuals who labor, I 100% agree.
Consider how many books exist on how to care for trees. Each one of them has similar ideas, but the way those ideas are expressed differ. Copyright protects the content of the book; it doesn’t protect the ideas of how to care for trees.
I understand what you're saying, but the way you're framing it isn't what I really have a problem with. I still don't agree with the idea that I can't make my own physical copies of Harry Potter books, identical word for word. I think people can choose to buy the physical books from the original publisher because they want to support them or like the idea that it's the "true" physical copy. And I'm going to push back on that a million times less than the concept of things like Moana comic books. But still, it's infringing copyright for me to make Moana comic books in my own home, in private, never showing them to anyone. And that's ridiculous.
Moana and Moana 2 are both animated movies that have already been made. They're not just figments of one's imagination.
> If I made a Moana comic book, with an entirely original storyline and original art and it was all drawn in my own style and not using 3D assets similar to their movies, that is violating copyright
It might be, or it might not. Copyright protects the creation of derivative works (17 USC 101, 17 USC 103, 17 USC 106), but it's the copyright holder's burden to persuade the court that the allegedly infringing work with the character Moana in it is derivative of their protected work.
Ask yourself the question: what is the value of Moana to you in this hypothetical? What if you used a different name for the character and the character had a different backstory and personality?
> I still don't agree with the idea that I can't make my own physical copies of Harry Potter books
You might think differently if you had sunk thousands of hours into creating a new novel and creative work was your primary form of income.
> But still, it's infringing copyright for me to make Moana comic books in my own home, in private, and never showing them to anyone.
It seems unlikely that Disney would go after you for that. Kids do it all the time.
It’s very unlikely that she would (or even could) have devoted herself to writing fiction in her free time as a passion project without hope of monetary reward and without any way to live from her writing for the ten years it took to finish the Potter series.
And even if she had somehow managed, you’d never hear about it, because without publishers to act as gatekeepers it’d have been lost in the mountains of fanfic and whatever other slop amateur writers upload to the internet.
> its popularity is indicative of its quality, even if it doesn't match the standards of a literature PhD for "good writing"
This is a false dichotomy. Literature PhDs are not the only people out there who enjoy high-quality literature more than light entertainment, and anyway, you seem to be admitting that there's a type of fiction that doesn't exist unpaid, so isn't this just proving my point correct?
All that said, even if I accept for the sake of argument that the existence of popular free genre fiction would be enough to prove your point (because, in fairness to you, we were originally talking about Harry Potter, which is as genre as it gets)... I went looking, and there are at most a few sporadic examples. A few minutes of research suggest that some books by Cory Doctorow are among the most popular ones. Also, The Martian by Andy Weir used to be freely available, but isn't anymore as far as I can find.
Sorry, but Cory Doctorow and (formerly) Andy Weir represent a pretty small body of work compared to the entire canon of paid novels, so I'm going to have to call BS on your claim unless you provide some examples of your own.
Assuming you agree with the idea of inheritance, which is another topic, then it is unfair to deny inheritance of intellectual property. For example, if your father has built a house, it will be yours when he dies; it won't become a public house. So why would a book your father wrote just before he died become public domain the moment he dies? It is unfair to those who are doing intellectual work, especially older people.
If you want short copyright, it would make more sense to make it 20 years, human or corporate, like patents.
Comparing intellectual property to real or physical property makes no sense. Intellectual property is different because it is non-exclusive. If you are living in your father's house, no one else can be living there. If I am reading your father's book, that has nothing to do with whether anyone else can read the book.
If you consider it right to get value from the work of your family, and you consider intellectual work (such as writing a book) to be valuable, then as an inheritor, you should get value from it. And since the way we give value to intellectual work is through copyright, inheritors should inherit copyright.
If you think that copyright should not exceed lifetime, then the logical consequences would be one of:
- inheritance should be abolished
- intellectual work is less valuable than other forms of work
- intellectual property / copyright is not how intellectual work should be rewarded
There are arguments for abolishing inheritance, it is after all one of the greatest sources of inequality. Essentially, it means 100% inheritance tax in addition to all the work going into the public domain. Problematic in practice.
For the value of intellectual work, well, hard to argue against it on Hacker News without being a massive hypocrite.
And there are alternatives to copyright (i.e. artificial scarcity) for compensating intellectual work like there are alternatives to capitalism. Unfortunately, it often turns out poorly in practice. One suggestion is to have some kind of tax that is fairly distributed between authors in exchange for having their work in the public domain. Problem is: define "fairly".
Note that I am not saying that copyright should last long; you can make copyright 20 years, human or corporate, inheritable. Simple, gets works into the public domain sooner, fairer to older authors, already works for patents. Why insist on "lifetime"?
Copyright is about control. If you know a song and you sing it to yourself, somebody overhears it and starts humming it, they have not deprived you of the ability to still know and sing that song. You can make economic arguments, of deprived profit and financial incentives, and that's fine; I'm not arguing against copyright here (I am not a fan of copyright, it's just not my point at the moment), I'm just saying that inheritance does not naturally apply to copyright, because data and ideas are not scarce, finite goods. They are goods that feasibly everybody in the world can inherit rapidly without lessening the amount that any individual person gets.
If real goods could be freely and easily copied the way data can, we might be having some very interesting debates about the logic and morality of inheriting your parents' house and depriving other people of having a copy.
If we enter a world where anyone can create a new Mario game and there are thousands of them released on the public web, it would be impossible for the rights holders to do anything, and it would be a bad PR move to go after individuals doing it for fun.
Bad PR? The entire copyright enforcement industry has had bad PR pretty much since easy copying enabled grassroots piracy - i.e. since before computers even. It never stopped them. What are you going to do about it? Vote? But all the mainstream parties are onboard with the copyright lobby.
The fact that copyright law is easy to violate and hard to enforce doesn't stop Nintendo from burning millions of dollars on legal fees to engage in life-ruining enforcement actions against randos making fangames.
"Democratization" with respect to copyright law would be changing the law to put Mario in the public domain, either by:
- Reducing term lengths to make Mario literally public domain. It's unclear whether or not such an act would survive the Takings Clause of the US Constitution. Perhaps you could get around that by just saying you can't enforce copyrights older than 20 years even though they nominally exist. Which brings us to...
- Adding legal exceptions to copyright to protect fans making fan games. Unlikely, since in the US we have common law, which means our exceptions have to be legislated from the judicial bench, and judges are extremely leery of 'fair use' arguments that basically say 'it is very inconvenient for me to get permission to use the thing'.
- Creating some kind of social copyright system that "just handles" royalty payments. This is probably the most literal interpretation of 'democratize'. I know of few extant systems for this, though - like, technically ASCAP is this, but NOBODY would ever hold up ASCAP as an example of how to do licensing right. Furthermore without legal backing, Nintendo can just hold out and retain traditional "my way or the highway" licensing rights.
- Outright abolishing copyright and telling artists to fend for themselves. This is the kind of solution that would herald either a total system collapse or extreme authoritarianism. It's like the local furniture guy selling sofas at 99% off because the Mafia is liquidating his gambling debts. Sure, I like free shit, but I also know that furniture guy is getting a pair of cement shoes tonight.
None of these are what AI companies talk about. Adding an exception just for AI training isn't democratizing IP, because you can't democratize AI training. AI is hideously memory-hungry and the accelerators you need to make it work are also expensive. I'm not even factoring in the power budget. They want to replace IP with something worse. The world they want is one where there are three to five foundation models, all owned and controlled by huge tech megacorps, and anyone who doesn't agree with them gets cut off.
* https://www.copyright.gov/ai/Copyright-and-Artificial-Intell...
Yes please.
Delete it for everyone, not just these ridiculous autocrats. It's only helping them in the first place!
1. Criticizes a highly useful technology
2. Matches a potentially-outdated, strict interpretation of copyright law
My opinion: I think using copyrighted data to train models for sure seems classically illegal. Despite that, humans can read a book, get inspiration, and write a new book without being litigated against. When I look at the litany of derivative fantasy novels, it's obvious they're not all fully independent works.
Since AI is and will continue to be so useful and transformative, I think we just need to acknowledge that our laws did not accommodate this use-case, and then we should change them.
Humans get litigated against for this all the time. There is such a thing as, charitably, being too inspired.
https://en.wikipedia.org/wiki/List_of_songs_subject_to_plagi...
Plus, all art is derivative in some sense, it's almost always just a matter of degree.
Yes, that's why we judge on a case by case basis. The line is blurry.
I think when you're storing copies of such assets in your database that you're well past the line, though.
The hold US companies have on the world will be dead too.
I also suspect that media piracy will be labelled as the only reason we need copyright, an existing agency will be bolstered to address this concern and then twisted into a censorship bureau.
AI is fine as long as the work it generates is substantially new and transformative. If it breaks and starts spitting out other peoples work verbatim (or nearly verbatim) there is a problem.
Yes, I'm aware that machines aren't people and can't be "inspired", but if the functional results are the same the law should be the same. Vaguely defined ideas like your soul or "inspiration" aren't real. The output is real, measurable, and quantifiable and that's how it should be judged.
I believe cover song licensing is available mechanically; you don't need permission, you just need to follow the procedures including sending the licensing fees to a rights clearing house. Music has a lot of mechanical licenses and clearing houses, as opposed to other categories of works.
Those procedures are how you ask for permission. As you say, it usually involves a fee but doesn't have to.
Compulsory licenses are interesting, aren't they? It just feels wrong. If Metallica doesn't want me to butcher their songs, why should they be forced to allow it?
As a consumer, it would be amazing if there were compulsory licenses for film and TV; then we wouldn't have to subscribe to 70 different services to get to the things we want to see. And there would likely be services that spring up to redistribute media where the rightsholders aren't able to or don't care to; it might be pulled from VHS that fans recorded off of TV in the old days, but at least it'd be something.
Society doesn't need to measure my mind, they need to measure the output of it. If I behave like a conscious being, I am a conscious being. Alternatively you might phrase it such that "Anything that claims to be conscious must be assumed to be conscious."
It's the only answer to the p-zombie problem that makes sense. None of this is new, philosophers have been debating it for ages. See: https://en.wikipedia.org/wiki/Philosophical_zombie
However, for copyright purposes we can make it even simpler. If the work is new, it's not covered by the original copyright. If it is substantially the same, it is. Forget the arguments about the ghost in the machine and the philosophical mumbo-jumbo. It's the output that matters.
Your radical behaviourism seems an advantage to you when you want to delete one disfavoured part of copyright law, but I assure you, it isn't in your interest. It doesn't universalise well at all. You do not want to be defined by how you happen to verbalise anything, unmoored from your intention, goals, and so on.
The law, and society, imparts much to you that is never measured and much that is unmeasurable. What can be measured is, at least, extremely ambiguous with respect to those mental states which are being attributed. Because we do not attribute mental states by what people say -- this plays very little role (consider what a mess this would make of watching movies). And none of course in the large number of animals which share relevant mental states.
Nothing of relevance is measured by an LLM's output. It is highly unambiguous: the LLM has no mental states, and thus is irrelevant to the law, morality, society and everything else.
It's an obscene sort of self-injury to assume that whatever kind of radical behaviourism is necessary to hype the LLM is the right sort. Hype for LLMs does not lead to a credible theory of minds.
I don't mean to say that they literally have to speak the words by using their meat to make the air vibrate. Just that, presuming it has some physical means, it be capable (and willing) to express it in some way.
> It's an obscene sort of self-injury to assume that whatever kind of radical behaviourism is necessary to hype the LLM is the right sort.
I appreciate why you might feel that way. However, I feel it's far worse to pretend we have some undetectable magic within us that allows us to perceive the "realness" of other people's consciousness by other than physical means.
Fundamentally, you seem to be arguing that something with outputs identical to a human is not human (or even human like), and should not be viewed within the same framework. Do you see how dangerous an idea that is? It is only a short hop from "Humans are different than robots, because of subjective magic" to "Humans are different than <insert race you don't like>, because of subjective magic."
Why is that? Seems all logic gets thrown out the window when invoking AI around here. References are given. If the user publishes the output without attribution, NOW you have a problem. People are being so rabid and unreasonable here. Totally bat shit.
I didn't mean to imply that the AI can't quote Shakespeare in context, just that it shouldn't try to pass off Shakespeare as its own or plagiarize huge swathes of the source text.
> People are being so rabid and unreasonable here.
People here are more reasonable than average. Wait until mainstream society starts to really feel the impact of all this.
That doesn't make piracy legal, even though I get a lot of use out of it.
Also, a person isn't a computer so the "but I can read a book and get inspired" argument is complete nonsense.
What we do know though is that LLMs, similar to humans, do not directly copy information into their "storage". LLMs, like humans, are pretty lossy with their recall.
Compare this to something like a search indexed database, where the recall of information given to it is perfect.
Another way would be to train an internal model directly on published works, use that model to generate a sanitized corpus of rewritten/reformatted data about the works still under copyright, then use the sanitized corpus to train a final model. For example, the sanitized corpus might describe the Harry Potter books in minute detail but not contain a single sentence taken from the originals. Models trained that way wouldn't be able to reproduce excerpts from Harry Potter books even if the models were distributed as open weights.
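To make that concrete, here's a minimal sketch of the two-stage pipeline in Python. Everything in it is a placeholder (train_model, paraphrase, the toy corpus are made up for illustration); it only shows the data flow, not any real training code:

    # Hypothetical two-stage pipeline: the released model never sees the originals.
    def train_model(corpus):
        # Stand-in for an actual training run; just records what it was given.
        return {"training_data": list(corpus)}

    def paraphrase(model, work):
        # Stand-in for prompting the internal model for a detailed description
        # of the work that copies no sentence verbatim.
        return "detailed, non-verbatim description of: " + work[:20]

    originals = ["full text of copyrighted book A", "full text of copyrighted book B"]

    # Stage 1: an internal model is trained directly on the original works.
    internal_model = train_model(originals)

    # Stage 2: that model generates a sanitized corpus with no original sentences.
    sanitized = [paraphrase(internal_model, w) for w in originals]

    # Stage 3: only the sanitized corpus trains the model that actually ships,
    # so even open weights couldn't regurgitate excerpts of the originals.
    released_model = train_model(sanitized)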
I understand people who create IP of any sort being upset that software might be able to recreate their IP, or stuff adjacent to it, without permission. It could be upsetting. But I don't understand how people jump to "Copyright Violation" for the act of reading, or even downloading in bulk. Copyright controls, and has always controlled, the creation and distribution of a work. Even the nature of the copyright notice embeds the assumption that the work will be read.
Reading and summarizing have only ever been controlled in western countries via State's secrets type acts, or alternately, non-disclosure agreements between parties. It's just way, way past reality to claim that we have existing laws to cover AI training ingesting information. Not only do we not, such rules would seem insane if you substitute the word human for "AI" in most of these conversations.
"People should not be allowed to read the book I distributed online if I don't want them to."
"People should not be allowed to write Harry Potter fanfic in my writing style."
"People should not be allowed to get formal art training that involves going to museums and painting copies of famous paintings."
We just will not get to a sensible societal place if the dialogue around these issues has such a low bar for understanding the mechanics and the societal tradeoffs we've made so far, and isn't able to discuss where we might want to go and what would be best.
The article specifically talks about the creation and distribution of a work. Creation and distribution of a work alone is not a copyright violation. However, if genAI takes in input from something you don't own and outputs something, that output could be considered a copyright violation.
Let's make this clear; genAI is not a copyright issue by itself. However, gen AI becomes an issue when you are using as your source stuff you don't have the copyright or license to. So context here is important. If you see people jumping to copyright violation, it's not out of reading alone.
> "People should not be allowed to read the book I distributed online if I don't want them to."
This is already done. It's been done for decades. See any case where content is locked behind an account. Only select people can view the content. The license to use the site limits who or what can use things.
So it's odd you would use "insane" to describe this.
> "People should not be allowed to write Harry Potter fanfic in my writing style."
Yeah, fan fiction is generally not legal. However, there are some cases where fair use covers it. Most cases of fan fiction are allowed because the author allows it. But no, generally, fan fiction is illegal. This is well known in the fan fiction community. Obviously, if you don't distribute it, that's fine. But we aren't talking about non-distribution cases here.
> "People should not be allowed to get formal art training that involves going to museums and painting copies of famous paintings."
Same with fan fiction. If you replicate a copyrighted piece of art, if you distribute it, that's illegal. If you simply do it for practice, that's fine. But no, if you go around replicating a painting and distribute it, that's illegal.
Of course, technically speaking, none of this is what gen AI models are doing.
> We just will not get to a sensible societal place if the dialogue around these issues has such a low bar for understanding the mechanics
I agree. Personifying gen AI is useless. We should stick to the technical aspects of what it's doing, rather than trying to pretend it's doing human things when it's 100% not doing that in any capacity. I mean, that's fine for the layman, but anyone with any ounce of technical skill knows that's not true.
Which is a clear failure of the copyright system. Millions of people are expanding our cultural artifacts with their own additions, but all of it is illegal, because they haven't waited another 100 years.
People are interested in these pieces of culture, but they're not going to remain interested in them forever. At least not interested enough to make their own contributions.
Absolute horse shit. I can start a 1-900 answer line and use any reference I want to answer your question.
I agree, what followed was.
> I can start a 1-900 answer line and use any reference I want to answer your question
Yeah, that's not what we are talking about. If you think it was, you should probably do some more research on the topic.
My proposal is that it's a Luddite kneejerk reaction to things people don't understand and don't like. They sense and fear change. For instance, here you say it's an issue when AI uses something as a source that you don't have copyright to. Allow me to update your sentence: "Every paper every scientist or academic wrote that references any copyrighted work becomes an issue." What you said just isn't true. Copyright refers to the right to copy a work.
Distribution: Sure. License your content however you want. That said, in the US a license prohibiting you from READING something just wouldn't be possible. You can limit distribution, copying, etc. This is how journalists can write about sneak previews or leaked information or misfiled court documents released when they should be under seal. The leaking (i.e. the distribution) might violate a contract or a license, but the reading thereof is really not something US law or common law thinks it has a right to control, except in the case of the state classifying secrets. As well, here we have people saying "my song in 1983 that I put out on the radio, I don't want AI listening to that song." Did your license in 1983 prohibit computers from processing your song? Does that mean digital radio can't send it out? Essentially that ship has sailed, full stop, without new legislation.
On my last points, I think you're missing my point. Fan fiction is legal if you're not trying to profit from it. It is almost impossible to perfectly copy a painting, although some people are pretty good at it. I think it's perfectly legal to paint a super close copy of, say, Starry Night, and sell it as "Starry Night by Jason Lotito." In any event, the discourse right now claims it's wrong for AI to look at and learn from paintings and photographs.
Your proposal is moving goal posts.
> Allow me to update your sentence: "Every paper every scientist or academic wrote that references any copyrighted work becomes an issue".
No, I never said that. Fair Use exists.
> Fan fiction is legal if you're not trying to profit from it.
No, it's not.[1] You can make arguments that it should be, but, no.
[1] https://jipel.law.nyu.edu/is-fanfiction-legal/
> I think you're missing my point
I think you got called out, and you are now trying to reframe your original comment so it comes across as having accounted for the things you were called out on.
You think you know what you are talking about, but you don't. And you rely on that misplaced confidence to argue the way you do.
Of course, if you start your thought by dismissing anybody who doesn't share your position as not sane, it's easy to see how you could fail to capture any of that.
[1] https://arstechnica.com/tech-policy/2025/05/judge-on-metas-a...
You're still not gonna be allowed to commercially publish "Hairy Plotter and the Philosophizer's Rock".
The law is supposed to be impartial. So if the answer is different, then it's not really a law problem we're talking about.
[0] https://en.wikipedia.org/wiki/Dowling_v._United_States_(1985...
I agree. If you can pay the judge, the congress or the president, it is definitely not stealing. It is (the best) democracy (money can buy). /s
If we can agree that the taking away of your time is theft (wage theft, to be precise), then we who rely on intellect in our careers should be able to agree that the taking of our ideas is also theft.
>moved to the Ninth Circuit Court of Appeals, where he argued that the goods he was distributing were not "stolen, converted or taken by fraud", according to the language of 18 U.S.C. 2314 - the interstate transportation statute under which he was convicted. The court disagreed, affirming the original decision and upholding the conviction. Dowling then took the case to the Supreme Court, which sided with his argument and reversed the convictions.
This just tells me that the definition is highly contentious. Having the Supreme Court reverse a federal appellate ruling already shows misalignment.
If we end up saying it is not illegal, then I demand, that it will not be illegal for everyone. No double standards please. Let us all launder copyrighted material this way, labeling it "AI".
Processing IP without a license AND offering it as a model for money doesn't seem like an unknown use-case to me.
Corporations are not humans. (It's ridiculous that they have some legal protections in the US like humans, but that's a different issue). AI is also not human. AI is also not a chipmunk.
Why the comparison?
Huh? If you agree that "learning from copyrighted works to make new ones" has traditionally not been considered infringement, then can you elaborate on why you think it fundamentally changes when you do it with bots? That would, if anything, seem to be a reversal of classic copyright jurisprudence. Up until 2022, pretty much everyone agreed that "learning from copyrighted works to make new ones" is exactly how it's supposed to work, and would be horrified at the idea of having to separately license that.
Sure, some fundamental dynamic might change when you do it with bots, but you need to make that case in an enforceable, operationalized way.
Go back to the roots of copyright and the answers should be obvious. According to the US constitution, copyright exists "To promote the Progress of Science and useful Arts" and according to the EU, "Copyright ensures that authors, composers, artists, film makers and other creators receive recognition, payment and protection for their works. It rewards creativity and stimulates investment in the creative sector."
If I publish a book and tech companies are allowed to copy it, use it for "training", and later regurgitate the knowledge contained within to their customers then those people have no reason to buy my book. It is a market substitute even though it might not be considered such under our current copyright law. If that is allowed to happen then investment will stop and these books simply won't get written anymore.
Humans are also very useful and transformative.
As a private person I no longer feel incentivised to create new content online because I think that all I create will eventually be stolen from me...
the internet demands it.
The people demand free Megaupload for everybody. Why? Because we can (we seem to NOT want to, but that should be a politically solvable problem).
In the meantime, I will continue to dislike copyright regardless of the parties involved.
Either force AI companies to compensate the artists they're being "inspired" by, or let people torrent a copywashed Toy Story 5.
IP maximalism is requiring DRM tech in every computer and media-capable device that won't play anything without checking into a central server and also making it illegal to reverse or break that DRM. IP maximalism is extending the current bonkers time interval of copyright (over 100 years) to forever. If AI concerns manage to get this down to a reasonable, modern timeframe it'll be awesome.
Record companies in the 90s tied the noose around their own necks, which is just as well because they're very useless now except for supporting geriatric bands. They should have started selling mp3s for 99 cents in 1997 and maybe they would have made a couple of dollars before their slide into irrelevance.
The specific thing people don't want, which a few weirdos keep pushing, is AI-generated stuff passed off as new creative material. It's fine for fun and games, but no one wants a streaming service of AI-generated music, even if you can't tell it's AI generated. And the minute you think you have that cracked - that an AI can create music/art as good as a human and that humans can't tell, the humans will start making bad music/art in rebellion, and it'll be the cool new thing, and the armies of 10Kw GPUs will be wasting their energy on stuff an 1Mhz 8-bit machine could do in the 80s.
Maybe the government should set up a fund to pay all the copyright holders whose works were used to train the AI models. And if it's a pain to track down the rights holders, I'll play a tiny violin.
I find the shift of some right wing politicians and companies from "TPB and megaupload are criminals and its owners must be extradited from foreign countries!" to "Information wants to be free!" much more illuminating.
Instead of the understanding that copyrights and patents are temporary state-granted monopolies meant to benefit society they are instead framed as real perpetual property rights. This framing fuels support for draconian laws and obscures the real purpose of these laws: to promote innovation and knowledge sharing and not to create eternal corporate fiefdoms.
The general public has been lectured for decades about how piracy is morally wrong, but as soon as startups and corporations are in it for profit, everybody looks away?
As for the zeitgeist, I'm not sure anything has materially changed. Recently, creators have been very upset over Silicon Valley AI companies ingesting their output. Is this really reflective of "general internet sentiment"? Would those same people have supported abolition of copyright in the past? I doubt it.
https://chatgptiseatingtheworld.com/2025/05/12/opinion-why-t...
Pre-publication reports aren't unusual. https://www.federalregister.gov/public-inspection/current
https://www.federalregister.gov/reader-aids/using-federalreg...
> The Federal Register Act requires that the Office of the Federal Register (we) file documents for public inspection at our office in Washington, DC at least one business day before publication in the Federal Register.
You can find lots of people talking about training, and you can find lots (way more) of people talking about AI training being a violation of copyright, but you can't find anyone talking about both.
Edit: Let me just clarify that I am talking about training, not inference (output).
While AIs don't reproduce things verbatim like pirates, I can see how they really undermine the market, especially for non-fiction books. If people can get the facts without buying the original book, there's much less incentive for the original author to do the hard research and writing.
It's less clear whether taking vast amounts of copyrighted material and using it to generate other things rises to the level of copyright violation or not. It's the kind of thing that people would have prevented if it had occurred to them, by writing terms of use that explicitly forbid it. (Which probably means that the Web becomes a much smaller place.)
Your comment seems to suggest that writers and artists have absolutely no conceivable stake in products derived from their work, and that it's purely a misunderstanding on their part. But I'm both a computer scientist and an artist and I don't see how you could reach that conclusion. If my work is not relevant then leave it out.
Is that a problem with the tool, or the person using it? A photocopier can copy an entire book verbatim. Should that be illegal? Or is it the problem that the "training" process can produce a model that has the ability to reproduce copyrighted work? If so, what implication does that hold for human learning? Many people can recite an entire song's lyrics from scratch, and reproducing an entire song's lyrics verbatim is probably enough to be considered copyright infringement. Does that mean the process of a human listening to music counts as copyright infringement?
If I were to take an image, and compress it or encrypt it, and then show you the data file, you would not be able to see the original copyrighted material anywhere in the data.
But if you had the right computer program, you could use it to regenerate the original image flawlessly.
I think most people would easily agree that distributing the encrypted file without permission is still a distribution of a copyrighted work and against the law.
What if you used _lossy_ compression, and can merely reproduce a poor quality JPEG of the original image? I think still copyright infringement, right?
Would it matter if you distributed it with an executable that only rendered the image non-deterministically? Maybe one out of 10 times? Or if the command to reproduce it was undocumented?
Okay, so now we have AI. We can ignore the algorithm entirely and how it works, because it's not relevant. There is a large amount of data that it operates on, the weights of the model and so on. You _can_, with the correct prompts, sometimes generate a copy of a copyrighted work, to some degree of fidelity or another.
I do not think it is meaningfully different from the simpler example, just with a lot of extra steps.
I think, legally, it's pretty clear that it is illegally distributing copyrighted material without permission. I think calling it an "ai" just needlessly anthropomorphizes everything. It's a computer program that distributes copyrighted work without permission. It doesn't matter if it's the primary purpose or not.
I think probably there needs to be some kind of new law to fix this situation, but under the current law as it exists, it seems to me to be clearly illegal.
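For what it's worth, the lossy-compression analogy above is easy to demo (a rough sketch using Pillow; the file names are just placeholders):

    # The stored bytes look nothing like the original bitmap, yet the right
    # program (a JPEG decoder) regenerates a recognizable, degraded copy.
    from PIL import Image

    original = Image.open("original.png").convert("RGB")  # placeholder input file
    original.save("lossy.jpg", quality=10)                # heavy lossy compression

    with open("lossy.jpg", "rb") as f:
        print(f.read(16))   # opaque-looking bytes, no visible trace of the image

    Image.open("lossy.jpg").show()  # decoding reproduces a poor-quality copy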
Suppose we accept all of the above. What does that hold for human learning?
I'm not talking about learning. I'm talking about the complete reproduction of a copyrighted work. It doesn't matter how it happens.
In that case I don't think there's anything controversial here? Nobody thinks that if you ask AI to reproduce something verbatim, that you should get a pass because it's AI. All the controversy in this thread seems to be around the training process and whether that breaks copyright laws.
LLMs seek to be a for-profit replacement for a variety of paid sources. They say "hey, you can get the same thing as Service X for less money with us!"
That's a problem, regardless of how you go about it. It's probably fine if I watch a movie with my friends, who cares. But distributing it over the internet for free is a different issue.
>LLMs seek to be a for-profit replacement for a variety of paid sources. They say "hey, you can get the same thing as Service X for less money with us!"
What's an LLM supposed to be a substitute for? Are people using them to generate entire books or news articles, rather than buying a book or an issue of the new york times? Same goes for movies. No one is substituting marvel movies with sora video.
Yes.
> No one is substituting marvel movies with sora video.
Yeah because sora kind of sucks. It's great technology, but turns out text is just a little bit easier to generate than 3D videos.
Once sora gets good, you bet your ass they will.
Like Napster et al, their data sets make copies of hundreds of GB of copyrighted works without authors' permission. Ex: The Pile, Common Crawl, RefinedWeb, GitHub pages. Many copyrighted works on the Internet also have strict terms of use. Some have copyright licenses that say personal use only or non-commercial use.
So, like many prior cases, just posting what isn't yours on Hugging Face is already infringement. Copying it from HF to your training cluster is also infringement. It's already illegal until we get laws like Singapore's that allow training on copyrighted works. Even those have a weakness in the access requirement, which might require following terms of use or licenses in the sources.
Only safe routes are public domain, permissive code, and explicit licenses from copyright holders (or those with sub-license permissions).
So, what do you think about the argument that making copies of copyrighted works violates copyright law? That these data sets are themselves copyright violations?
AI is capable of reproducing copyrighted works (motte), therefore training on copyrighted works is illegal (bailey).
Humans are capable of reproducing copyrighted works illegally, but we allow them to train on copyrighted material legally.
Perhaps measures should be taken to prevent illegal reproduction, but if that's impossible, or too onerous, there should be utilitarian considerations.
Then the crux becomes a debate over utility, which often becomes a religious debate.
Those extra steps are meaningfully different. In your description, a casual observer could compare the two JPEGs and recognize the inferior copy. However, AI has become so advanced that such detection is becoming impossible. It is clearly voodoo.
You can try and argue that a compression algorithm is some kind of copy of the training data, but that’s an untested legal theory.
Most artists can readily violate copyright; that doesn't mean we block them from seeing copyrighted works.
This can only be referring to training; the models themselves are a rounding error in size compared to their training sets.
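As a rough back-of-the-envelope illustration (approximate, illustrative numbers): a 70-billion-parameter model stored as 16-bit weights is about 140 GB, while a training set of roughly 15 trillion tokens is on the order of tens of terabytes of raw text, so the weights amount to well under 1% of what they were trained on.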
However, when an LLM does the same, people now want it to be illegal. It seems pretty straightforward to apply existing copyright law to LLMs in the same way we apply it to humans. If the actual text they generate is substantially similar enough to source material that it would constitute a copyright violation had a human produced it, then it should be illegal. Otherwise it should not.
edit: and in fact it's not even whether an LLM reproduces text, it's whether someone subsequently publishes that text. The person publishing that text should be the one taking the legal hit.
The AI companies will likely be arguing that they don’t need a license, so any terms of use in the license are irrelevant.
You can probably find a good number of expert programmer + patent lawyers. And presumably some of those osmose enough copyright knowledge from their coworkers to give a knowledgeable answer.
At the end of the day though, the intersection of both doesn't matter. The lawyers win, so what really matters is who has the pulse on how the Fed Circuit will rule on this
Also in this specific case from the article, it's irrelevant?
"When a model is deployed for purposes such as analysis or research… the outputs are unlikely to substitute for expressive works used in training. But making commercial use of vast troves of copyrighted works to produce expressive content that competes with them in existing markets, especially where this is accomplished through illegal access, goes beyond established fair use boundaries."
There are college kids with bigger "copyright collections" than that...
Disk size is irrelevant. If you lossy-compress a copyrighted bitmap image down to a small JPEG and then sell the JPEG, it's still copyright infringement.
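To make the analogy concrete, here's a minimal sketch (assuming Pillow is installed and a hypothetical local file "original.bmp" exists) showing how aggressive lossy compression shrinks a file to a tiny fraction of its size while the output stays recognizably the same image:

    # Minimal sketch: lossy compression shrinks the file dramatically,
    # yet the result is still recognizably the same copyrighted image.
    # "original.bmp" is a hypothetical local file used for illustration.
    import os
    from PIL import Image

    img = Image.open("original.bmp")
    img.convert("RGB").save("copy.jpg", "JPEG", quality=30)  # aggressive lossy compression

    print(f"bitmap: {os.path.getsize('original.bmp'):,} bytes")
    print(f"jpeg:   {os.path.getsize('copy.jpg'):,} bytes")
    # The JPEG may be only a percent or two of the bitmap's size;
    # disk size alone says nothing about whether the work was copied.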
Some try to make the argument of "but that's what humans do and it's allowed", but that's not a real argument, as it has not been proven, nor is it easy to prove, that machine learning equates to human reasoning. In the absence of evidence, the law assumes NO.
Also Big Tech: We added 300,000,000 users' worth of GTM because we trained on the 10 specific anime movies of Studio Ghibli and are selling their style.
(Raises $10 billion based on estimated worth of the resulting models.)
"We can't share the GPT4 prettaining data or weights because they're trade secrets that generate over a billion in revenue for us."
I'll believe they're worth nothing when (a) nobody is buying AI models or (b) AI companies stop using the copyrighted works to train models they sell. So far, it looks like they're lying about the worth of the training data.
If it's illegal to know the entire contents of a book, then it's arbitrary to what degree you're allowed to codify that knowledge into symbols.
If judges are permitted to rule here it is not about reproduction of commercial goods but about control of humanity's collective understanding.
That is not to say that we shouldn't do the right thing regardless, but I do think there is a feeling of "who is going to rule the world in the future?" that underlies governmental decision-making on how much to regulate AI.
I'm not sure at all what China will do. I find it likely that they'll forbid AI at least for minors so that they do not become less intelligent.
Military applications are another matter that are not really related to these copyright issues.
Yeah..
you tax it if the "chatgpt" is foreign.
what if they route through third countries?
This isn't some new phenomenon. We do indeed seize assets from buyers if the seller stole them.
Even (especially?) the military is a dumpster fire but it's at least very good at doing what it exists to do.
I mean, name 2 things anyone owns that aren't dumpster fires?
A long time ago, industrial engineers used to say, "Even Toyota has recalls."
Something being a dumpster fire is so common nowadays that you really need a better reason to argue in support of a given entity's ownership. (Or even non-ownership for that matter.)
That said, there are plenty of successful government actions across the world, where Europe or Japan probably have a good advantage with solid public services. Think streets, healthcare, energy infrastructure, water infrastructure, rail, ...
The difference here is that we have people like yourself: those who have zero faith in our government and as such act as double agents or saboteurs. When people such as yourself gain power in the legislature, they "starve the beast", meaning they purposefully deconstruct sections of our government so that they have justification for their ideological belief that our government doesn't work.
You guys work backwards. The foregone conclusion is that government programs never work, and then you develop convoluted strategies to prove that.
1. The National Weather Service. Crown jewel and very effective at predicting the weather and forecasting life threatening events.
2. IRS, generally very good at collecting revenue.
3. National Interagency Fire Center / US Forest Service tactical fire suppression
4. NTSB/US Chemicals Safety Board - Both highly regarded.
5. Medicare - Basically clung to with talons by seniors, revealed preference is that they love it.
6. DOE National Labs
7. NIH (spicy pick)
8. Highway System
There are valid critiques of all of these but I don’t think any of them could be universally categorized as a complete dumpster fire.
2) state parks are pretty rad.
Even saying the military is a dumpster fire isn't accurate. The military has led trillions of dollars' worth of extraction for the wealthy and elite across the globe.
In no sane world can you call the ability to protect GLOBAL shipping lanes a failure. That one service alone has probably paid for itself thousands of times.
We aren't even talking about things like public education (high school education used to be privatized and something only the elites enjoyed 100 years ago; yes, public high school education isn't even 100 years old) or libraries or public parks.
---
I really don't understand this "gobermint iz bad" meme you see in tech circles.
I get so much more out of my taxes compared to equivalent corporate bills that it's laughable.
Government is made up of people, and the last 50 years have mostly been the government giving money and establishing programs for the small cohorts that have been hoarding all the wealth. Somehow this is never framed as a problem with government, however.
I also never understand the arguments from these types, because if you think the government is bad then you should want it to be better. Better mostly means having more money to redistribute and more personnel to run programs, but it's never about these things. It's always attacking the government to make it worse at the expense of the people.
Library of Congress
National Park Service
U.S. Geological Survey (USGS)
NASA
Smithsonian Institution
Centers for Disease Control and Prevention (CDC)
Social Security Administration (SSA)
Federal Aviation Administration (FAA) air traffic control
U.S. Postal Service (USPS)
Nothing. You don't even need the LLC. I don't think anyone got prosecuted for only downloading. All prosecutions were for distribution. Note that if you're torrenting, even if you stop the moment it's finished (and thus never goes to "seeding"), you're still uploading, and would count as distribution for the purposes of copyright law.
>Meta allegedly tried to conceal the seeding by not using Facebook servers while downloading the dataset to "avoid" the "risk" of anyone "tracing back the seeder/downloader" from Facebook servers
Sounds like they used a VPN, set the upload speed to 1kb/s and stopped after the download is done. If the average Joe copied that setup there's 0% chance he'd get sued, so I don't really see a double standard here. If anything, Meta might get additional scrutiny because they're big enough of a target that rights holders will go through the effort of suing them.
Citation needed. RIAA used to just watch torrents and sent cease and desists to everyone who connected, whether for a minute or for months. It was very much a dragnet, and I highly doubt there was any nuance of "but Your Honor, I only seeded 1MB back so it's all good".
Or C) large corporations (and the wealthy) do whatever they want while you still get extortion letters because your kid torrented a movie.
They really do get to have their cake and eat it too, and I don't see any end to it.
"have their cake and eat it too" allegations only work if you're talking about the same entity. The copyright maximalist corporations (ie. publishers) aren't the same as the permissive ones (ie. AI companies). Making such characterizations make as much sense as saying "citizens don't get to eat their cake and eat it too", when referring to the fact that citizens are anti-AI, but freely pirate movies.
Can you link to the exact comments he made? My impression was that he was upset at the fact that they broke the T&C of OpenAI, and DeepSeek's claim of being much cheaper to train than OpenAI didn't factor in the fact that it required OpenAI's model to bootstrap the training process. Neither of those directly contradicts the claim that training is copyright infringement.
Musicians remain subject to abuse by the recording industry; they're making pennies on each dollar you spend on buying CDs^W^W streaming services. I used to say, don't buy that; go to a concert, buy beer, buy merch, support directly. Nowadays live shows are being swallowed whole through exclusivity deals (both for artists and venues). I used to say, support your favourite artist on Bandcamp, Patreon, etc. But most of these new middlemen are ready for their turn to squeeze.
And now on top of all that, these artists' work is being swallowed whole by yet another machine, disregarding what was left of their rights.
What else do you do? Go busking?
In the end this all comes down to needing the people to care enough.
As did Disney, apparently.
>what use is regulation if you can just buy it?
I don't like it either, but it still comes down to the same issues. We vote in people who can be bought and don't make a scandal out of it when it happens. The first step to fixing that corruption is to make congress afraid of being ousted if discovered. With today's communication structure, that's easier than ever.
But if the people don't care, we see the obvious victor.
For national security reasons I'm perfectly fine with giving LLMs unfettered access to various academic publications, scientific and technical information, that sort of thing. I'm a little more on the fence about proprietary code, but I have a hard time believing there isn't enough code out there already for LLMs to ingest.
Otherwise though, what is an LLM with unfettered access to copyrighted material better at than one that merely has unfettered access to scientific/technical information plus licensed copyrighted material? I would suppose that besides maybe being a more creative writer, the former is far more capable of reproducing copyrighted works.
In effect, the LLM with unfettered access is a more capable plagiarism machine than the other, not necessarily more intelligent, and otherwise doesn't really add much value. What do we have to gain from condoning it?
I think the argument I'm making is a little easier to see in the case of image and video models. The model that has unfettered access to copyrighted material is more capable, sure, but more capable of what? Capable of making images? Capable of reproducing Mario and Luigi in an infinite number of funny scenarios? What do we have to gain from that? What reason do we have for not banning such models outright? Not like we're really missing out on any critical security or economic advantages here.
If I'm learning about kinematics maybe it would be more effective to have comparisons to Superman flying faster than a speeding bullet and no amount of dry textbooks and academic papers will make up for the lack of such a comparison.
This is especially relevant when we're talking about science-fiction which has served as the inspiration for many of the leading edge technologies that we use including stuff like LLMs and AI.
A reasonable compromise then is that you can train an AI on Wikipedia, more-or-less. An AI trained this way will have a robust understanding of Superman, enough that it can communicate through metaphor, but it won't have the training data necessary to create a ton of infringing content about Superman (well, it won't be able to create good infringing content anyway. It'll probably have access to a lot of plot summaries but nothing that would help it make a particularly interesting Superman comic or video).
To me it seems like encyclopedias use copyrighted pop culture in a way that constitutes fair use, and so training on them seems fine as long as they consent to it.
Oh really? They didn't have any problem coming after people who installed copyrighted Windows. BSA. But now Microsoft turns a blind eye because it suits them.
It's annoying to see the current pushback against China focusing so much on inconsequential matters with so much nonsense mixed in, because I do think we do need to push back against China on some things.
Interesting, but everyone is mining copyrighted works to train AI models.
https://www.theguardian.com/technology/2012/sep/11/minnesota... [1]
>The RIAA accused her of downloading and distributing more than 1,700 music files on file-sharing site KaZaA
Emphasis mine. I think most people would agree that whatever AI companies are doing with training AI models is different than sending verbatim copies to random people on the internet.
I think most artists who had their works "trained by AI" without compensation would disagree with you.
Yes. An artist's style can and sometimes is their IP.
The US Supreme Court disagrees, the right of publicity and intellectual property law are explicitly linked.
> The broadcast of a performer’s entire act may undercut the economic value of that performance in a manner analogous to the infringement of a copyright or patent. — Justice White
Again, show me an example where an artist's style was used for copyright infringement in court. Can you produce even one example?
All right of publicity laws are intellectual property laws but not all intellectual property laws are right of publicity laws.
All copyright laws are intellectual property laws but not all intellectual property laws are copyright laws.
Right of publicity laws are intellectual property laws because the right of publicity is intellectual property. I don't know how else to articulate this over the internet; maybe it's time to consult an AI?
This article is literally about the copyright office finding AI companies violating copyright law by training their models on copyrighted material. I'm not even sure what you're arguing about anymore.
My opinion on the matter at hand is this: Artists who complain about GenAI use the hypothetical that you mentioned, where if you can accurately recreate a copyrighted work through specific model usage, then any distribution of the model is a copyright violation. That's why, according to the argument, fair use does not apply.
The real problem with that is that there's a mismatch between the fair use analysis and the actual use at issue. The complaining artists want the fair use inquiry to focus on the damage to the potential market for works in their particular style. That's where the harm is, according to them. However, what they use to even get to that stage is the copyright infringement allegation that I described earlier: that the models contain their works in a fixed manner from which they can be derived without permission.
Not to mention that this position puts the malicious use of the models for outright copyright infringement at the output level above the entire class of new works that can be created with them. It's effectively saying "because these models can technically be used in an infringing way, they infringe our copyright, and any creative potential these models could enable is insignificant in comparison to that simple fact. Of course, that's not the actual real problem, which is that they output completely new works that compete with our originals, even when they aren't derivatives of, nor substantially similar to, any individual copyrighted work".
Here's a very good article outlining my position in a more articulate way: https://andymasley.substack.com/p/a-defense-of-ai-art
[1] used purely as an example
Ok, how about training AI on leaked Windows source code ?
She was training RI (real intelligence). Is she now relevant? Or does she have to be rich and pay some senators to be relevant?
Source: https://futurism.com/the-byte/facebook-trained-ai-pirated-bo...
I honestly can't see how this directly addresses fair use; it's an odd, sweeping statement. It implies that inventing something that borrows a little from many different copyrighted items is somehow not fair use. If it were one-for-one, yes, but it's not; it's basically saying creativity is not fair use. And if it's not saying that, and instead refers to competition in the existing market, then it's making a statement about the public good, not fair use. That's basically a matter for legislators and what the purpose of copyright is.
They acknowledge the issue is before courts:
> These issues are the subject of intense debate. Dozens of lawsuits are pending in the United States, focusing on the application of copyright’s fair use doctrine. Legislators around the world have proposed or enacted laws regarding the use of copyrighted works in AI training, whether to remove barriers or impose restrictions
Why did they write the finding? I assume it's because it's their responsibility:
> Pursuant to the Register of Copyrights’ statutory responsibility to “[c]onduct studies” and “[a]dvise Congress on national and international issues relating to copyright,”...
All excerpts are from https://www.copyright.gov/ai/Copyright-and-Artificial-Intell...
Sure, the courts may find it's out of their jurisdiction, but they should act as they see fit and let the courts settle that later.
Why couldn't a copyright office advise Congress/the Senate to enact a law that forbids copyrighted material from being used in AI training? This is literally the politicians' job.