> Now here the date is more flexible, let's say 2022. But if you're collecting data before 2022 you're fairly confident that it has minimal, if any, contamination from generative AI. Everything before the date is 'safe, fine, clean,' everything after that is 'dirty.'
Though what it seems to actually mean is that it's a problem for (future) generative AI (the "genAI collapse"). To which I say:
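For what it's worth, the cutoff idea in the quote amounts to a one-line filter over collection dates. A minimal sketch, assuming each record carries a collection date (the field names and record shape here are hypothetical, not from the article):

```python
from datetime import date

# Sketch of the date-cutoff filter described in the quote: treat anything
# collected before 2022 as "clean" (pre-generative-AI) and anything after
# as potentially contaminated. Field names are made up for illustration.
CUTOFF = date(2022, 1, 1)

def is_clean(doc: dict) -> bool:
    """Return True if the document was collected before the cutoff date."""
    return doc["collected_on"] < CUTOFF

corpus = [
    {"text": "archived forum post", "collected_on": date(2019, 6, 1)},
    {"text": "recent blog article", "collected_on": date(2023, 3, 15)},
]

clean_subset = [doc for doc in corpus if is_clean(doc)]
print(len(clean_subset))  # prints 1
```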
The most damning part for me is mentioning the Apple paper and the rebuttal of the Apple paper. To my knowledge, that paper had nothing to do with training on generated data; it was about reasoning models. But because it uses the words "model collapse", the author of this article apparently decided to include it, which just shows they don't know what they're talking about (unless I'm completely misunderstanding the Apple paper).
Humanity now lives in a world where any text has most likely been influenced by AI, even if it’s by multiple degrees of separation.