frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

The launch of ChatGPT polluted the world forever

https://www.theregister.com/2025/06/15/ai_model_collapse_pollution/
27•rntn•9h ago

Comments

Den_VR•9h ago
Someday maybe we’ll have a term similar to “low-background steel” for information and web content.
etherlord•9h ago
https://blog.jgc.org/2025/06/low-background-steel-content-wi...
ChrisArchitect•3h ago
Large discussion earlier this week: https://news.ycombinator.com/item?id=44239481
willis936•9h ago
The root of it is deterioration in trust. Even before LLMs hit the scene there was suspicion of narrative manipulation by social media sites. ChatGPT only changed how popular this take is, but not its measure.
cheschire•9h ago
Why did you paraphrase the article’s subtitle?
Den_VR•5h ago
I’m not sure if admitting I didn’t even open the article helps or harms my case.
Eddy_Viscosity2•9h ago
This is a great analogy.
happa•9h ago
LLMs don't really need more training data than they already have. They just need to start using it more efficiently.
myflash13•9h ago
Exactly. Smart humans work with far less training data and do better.
ghusto•9h ago
The article keeps making it sound as if it's a problem for humans. e.g.:

> Now here the date is more flexible, let's say 2022. But if you're collecting data before 2022 you're fairly confident that it has minimal, if any, contamination from generative AI. Everything before the date is 'safe, fine, clean,' everything after that is 'dirty.'"

Though what it seems to actually mean is that it's a problem for (future) generative AI (the "genAI collapse"). To which I say;

joshstrange•9h ago
This seems like a very badly written article that rambles on in random directions. It proposes incredibly dumb ideas to anyone with half a brain like water marking AI output.

The most damning part for me is mentioning the Apple paper and the refute of the Apple paper, to my knowledge that paper had nothing to do with training on generated data. It was talking about reasoning models, but because they use the word “model collapse”, apparently, the author of this article decided to include it in, which just shows how they don’t know what they’re talking about (unless I’m completely misunderstanding the Apple paper).

m4r1k•9h ago
This! And I’d add, it’s the Register–it has always had a very low bar.
famahar•9h ago
lowbackgroundsteel.ai sounds really promising. I don't really care for it as a clean AI training source, but I'm interested in a curated internet where I know it's not diluted with generative content. I'm not sure what that would look like when it comes to social media. This AI era has made me return to reading physical books as a hobby and engaging with offline/non-anonymous online communities more. Confidence in authenticity is one of the most important things for me these days.
iJohnDoe•5h ago
I have no expertise in LLMs. I do think the article poses an interesting question. How do you get the models recent information without ingesting information that has been generated by AI. I’m sure it’s possible, but not without a certain level of uncertainty.

Humanity now lives in a world where any text has most likely been influenced by AI, even if it’s by multiple degrees of separation.

Modifying an HDMI dummy plug's EDID using a Raspberry Pi

https://www.downtowndougbrown.com/2025/06/modifying-an-hdmi-dummy-plugs-edid-using-a-raspberry-pi/
146•zdw•4h ago•28 comments

Telephone Exchanges in the UK

https://telephone-exchanges.org.uk/
26•petecooper•1h ago•2 comments

Why it's nearly impossible to buy an original Bob Ross painting

https://thehustle.co/why-its-nearly-impossible-to-buy-an-original-bob-ross-painting
11•rmason•26m ago•2 comments

First 2D, non-silicon computer developed

https://www.psu.edu/news/research/story/worlds-first-2d-non-silicon-computer-developed
27•giuliomagnifico•3d ago•3 comments

Show HN: Seastar – Build and dependency manager for C/C++ with Cargo's features

https://github.com/AI314159/Seastar
17•AI314159•1h ago•6 comments

How to modify Starlink Mini to run without the built-in WiFi router

https://olegkutkov.me/2025/06/15/how-to-modify-starlink-mini-to-run-without-the-built-in-wifi-router/
208•LorenDB•8h ago•53 comments

Datalog in miniKanren

https://deosjr.github.io/dynamicland/datalog.html
54•deosjr•4h ago•3 comments

Datalog in Rust

https://github.com/frankmcsherry/blog/blob/master/posts/2025-06-03.md
208•brson•9h ago•22 comments

Simplest C++ Callback, from SumatraPDF

https://blog.kowalczyk.info/a-stsj/simplest-c-callback-from-sumatrapdf.html
41•jandeboevrie•3h ago•29 comments

Childhood leukemia: how a deadly cancer became treatable

https://ourworldindata.org/childhood-leukemia-treatment-history
101•surprisetalk•7h ago•23 comments

IPOChatter: Track Prospective Tech IPOs

https://ipochatter.com
5•civilaircraft•34m ago•0 comments

Canyon.mid

https://canyonmid.com/
184•LorenDB•7h ago•104 comments

An Introduction to the Hieroglyphic Language of Early 1900s Train-Hoppers

https://www.openculture.com/2018/08/hobo-code-introduction-hieroglyphic-language-early-1900s-train-hoppers.html
3•squircle•52m ago•0 comments

Show HN: Pipo360 – Generate production-ready back end APIs in 60 seconds with AI

https://pipo360.xyz
4•the_plug•1h ago•0 comments

Why SSL was renamed to TLS in late 90s (2014)

https://tim.dierks.org/2014/05/security-standards-and-name-changes-in.html
61•Bogdanp•6h ago•9 comments

1k year old 3 sisters crop farm found in Northern Michigan

https://www.smithsonianmag.com/smart-news/massive-field-where-native-american-farmers-grew-corn-beans-and-squash-1000-years-ago-discovered-in-michigan-180986758/
126•CoopaTroopa•3d ago•53 comments

Cure Dolly's Japanese Grammar Lessons

https://kellenok.github.io/cure-script/
9•agnishom•1d ago•0 comments

The experience continues until you stop experiencing it

https://strangemachine.tv/safespace/popov/
47•durakot•4h ago•11 comments

SQLite Date and Time Functions (2007)

https://www2.sqlite.org/cvstrac/wiki?p=DateAndTimeFunctions
33•1vuio0pswjnm7•1d ago•13 comments

Foundations of Computer Vision

https://visionbook.mit.edu
97•tzury•10h ago•4 comments

GNOME and Red Hat Linux eleven years ago (2009)

https://linuxgazette.net/165/laycock.html
90•marcodiego•4h ago•48 comments

The Skyscraper That Could Have Toppled over in the Wind (1995)

https://www.newyorker.com/magazine/1995/05/29/the-fifty-nine-story-crisis-citicorp-center
22•georgecmu•5h ago•14 comments

The Art of Lisp and Writing (2003)

https://www.dreamsongs.com/ArtOfLisp.html
146•Bogdanp•13h ago•57 comments

Text-to-LoRA: Hypernetwork that generates task-specific LLM adapters (LoRAs)

https://github.com/SakanaAI/text-to-lora
84•dvrp•3d ago•3 comments

Biofuels Policy, a Mainstay of American Agriculture, a Failure for the Climate

https://insideclimatenews.org/news/13062025/agriculture-ethanol-biofuel-policy-climate-failure/
54•rntn•4h ago•31 comments

Social anxiety disorder-associated gut microbiota increases social fear

https://www.pnas.org/doi/abs/10.1073/pnas.2308706120
126•thunderbong•4h ago•82 comments

Writing Toy Software Is a Joy

https://www.jsbarretto.com/blog/software-is-joy/
59•todsacerdoti•1h ago•11 comments

Studio Ghibli marks 40 years, but future looks uncertain

https://www.japantimes.co.jp/culture/2025/06/06/film/ghibli-anniversary-40/
45•gslin•3h ago•25 comments

Tell HN: I just made a first ever dollar on my SaaS

7•yu3zhou4•45m ago•2 comments

Show HN: Tikt.com – Remove the "OK" from TikTok URL's to Download as MP3 or MP4

https://tikt.com/
67•nadermx•2h ago•24 comments