And, trickier but just as important: is there any work on extrapolating the pretrained model AFTER it's RLHF'd? For example, what kinds of biases existed in gpt-4o before it was debiased?
Do biases go away completely, or do they just get suppressed deep down in the model's "mind"?
I find that odd. Would anyone be surprised to learn that Google indexes adult websites and ranks them in its search algorithm? If not, what is the difference for an LLM?
zaptrem•54m ago
Afaik embedding and norm params are excluded from weight decay as standard practice. Is this no longer true?
E.g., they exclude them in minGPT: https://github.com/karpathy/minGPT/blob/37baab71b9abea1b76ab...
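For context, the usual pattern (as in minGPT) is to split parameters into two AdamW groups, applying weight decay only to weight matrices and zero decay to embeddings, norms, and biases. A minimal sketch of that grouping, assuming a toy model (the module sizes and hyperparameters here are illustrative, not taken from minGPT):

```python
import torch
import torch.nn as nn

# Toy model containing the three parameter kinds at issue:
# a Linear weight (decayed), plus a LayerNorm and an Embedding (excluded).
model = nn.Sequential(
    nn.Embedding(100, 32),
    nn.LayerNorm(32),
    nn.Linear(32, 32),
)

decay, no_decay = [], []
for module in model.modules():
    for name, param in module.named_parameters(recurse=False):
        if isinstance(module, (nn.LayerNorm, nn.Embedding)) or name.endswith("bias"):
            no_decay.append(param)  # norms, embeddings, biases: no weight decay
        else:
            decay.append(param)     # e.g. Linear weights: decayed

# Two param groups with different weight_decay settings (values illustrative).
optimizer = torch.optim.AdamW(
    [
        {"params": decay, "weight_decay": 0.1},
        {"params": no_decay, "weight_decay": 0.0},
    ],
    lr=3e-4,
)
```

In this toy model only the Linear weight lands in the decayed group; the embedding weight, the LayerNorm weight and bias, and the Linear bias all get zero decay.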
3abiton•22m ago