2. Is it possible to estimate how much of copyrighted material has been used?
You've reached the end!
2. Is it possible to estimate how much of copyrighted material has been used?
There's no easy answer there, hence New York Times v. OpenAI.
I think sticking a straw in Zlib or AA or LibGen or whatever it is, and drinking until it makes gurgling slurping noises as it hoovers up the dregs at the bottom of the barrel, is far, far removed from “fair use”.
muzani•23h ago
2. This is harder as a lot of them don't disclose training sets.