frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Different Language Models Learn Similar Number Representations

https://arxiv.org/abs/2604.20817
36•Anon84•1h ago

Comments

gn_central•1h ago
Curious if this similarity comes more from the training data or the model architecture itself. Did they look into that?
OtherShrezzing•1h ago
They describe that both are important, and researched in the paper, within the opening paragraph.
ACCount37•51m ago
The "platonic representation hypothesis" crowd can't stop winning.

Potentially useful for things like innate mathematical operation primitives. A major part of what makes it hard to imbue LLMs with better circuits is that we don't know how to connect them to the model internally, in a way that the model can learn to leverage.

Having an "in" on broadly compatible representations might make things like this easier to pull off.

LeCompteSftware•43m ago
"using periodic features with dominant periods at T=2, 5, 10" seems inconsistent with "platonic representation" and more consistent with "specific patterns noticed in commonly-used human symbolic representations of numbers."

Edit: to be clear I think these patterns are real and meaningful, but only loosely connected to a platonic representation of the number concept.

causal•41m ago
You seem to be going off the title which is plainly incorrect and not what the paper says. The paper demonstrates HOW different models can learn similar representations due to "data, architecture, optimizer, and tokenizer".

"How Different Language Models Learn Similar Number Representations" (actual title) is distinctly different from "Different Language Models Learn Similar Number Representations" - the latter implying some immutable law of the universe.

FrustratedMonky•41m ago
Same with images maybe?

Saw similar study comparing brain scans of person looking at image, to neural network capturing an image. And were very 'similar'. Similar enough to make you go 'hmmmm, those look a lot a like, could a Neural Net have a subjective experience?'

dboreham•48m ago
It's going to turn out that emergent states that are the same or similar in different learning systems fed roughly the same training data will be very common. Also predict it will explain much of what people today call "instinct" in animals (and the related behaviors in humans).
panagathon•15m ago
Oh yeah, that's clever
matja•45m ago
The eigenvalue distribution looks somewhat similar to Benford's Law - isn't that expected for a human-curated corpus?
causal•42m ago
Title is editorialized and needs to be fixed; the paper does not say what this title implies, nor is that the title of the paper.
jdonaldson•41m ago
(Pardon the self promotion) Libraries like turnstyle are taking advantage of shared representation across models. Neurosymbolic programming : https://github.com/jdonaldson/turnstyle

Machine learning& gut microbiome pathway analysis in Alzheimer's risk prediction

https://alz-journals.onlinelibrary.wiley.com/doi/10.1002/dad2.70340
1•bookofjoe•1m ago•0 comments

SDL Now Supports DOS

https://github.com/libsdl-org/SDL/pull/15377
1•Jayschwa•1m ago•0 comments

Show HN: MR Links – Inline link references for Marginal Revolution blog posts

https://github.com/donchuru/mr-links
1•nanfinitum•2m ago•0 comments

Show HN: Turn speech into text anywhere via hotkey (runs on Intel NPU, no cloud)

https://github.com/anubhavgupta/whisper-npu
1•anubhav200•3m ago•0 comments

I Turned the Game Boy Color into a Watch

https://www.youtube.com/watch?v=gTUg_NePXy8
1•mehackernewsacc•3m ago•0 comments

Cloud Functions in Firebase now supports Dart as an experimental feature

https://twitter.com/Firebase/status/2047405653879070917
1•nostromoWOWWOW•3m ago•0 comments

Diatec, known for its mechanical keyboard brand FILCO, has ceased operations

https://gigazine.net/gsc_news/en/20260424-filco-diatec/
1•gslin•4m ago•0 comments

Firefox Has Integrated Brave's Adblock Engine

https://itsfoss.com/news/firefox-ships-brave-adblock-engine/
2•eaf7e281•5m ago•0 comments

ReactGhost: Four locations of an unguarded property lookup in React Flight

https://reactghost.com/
1•cybrdude•7m ago•0 comments

Beware Software Brain

https://anderegg.ca/2026/04/23/beware-software-brain
1•Brajeshwar•8m ago•0 comments

Show HN: #1 On This Day

https://onthisday-theta.vercel.app
2•starzmustdie•9m ago•1 comments

Phantom: Web Automation Without a Browser

https://saadnaveed.com/writing/phantom-web-automation-without-a-browser/
1•saadn92•13m ago•0 comments

We are our own worst enemies

https://www.ufried.com/blog/worst_enemies/
1•cdrnsf•14m ago•0 comments

TurboBird – Firebird Database Tool

https://github.com/mdadali/TurboBird
1•mariuz•15m ago•0 comments

Google Plans to Invest Up to $40B in Anthropic

https://www.bloomberg.com/news/articles/2026-04-24/google-plans-to-invest-up-to-40-billion-in-ant...
7•elffjs•17m ago•0 comments

Show HN: Kadō, an open source habit tracker app for iOS

https://github.com/scastiel/kado
3•scastiel•19m ago•0 comments

Visual-base is a second brain from your eyes

https://github.com/oilbeater/visual-base
1•recrush•20m ago•0 comments

I Cancelled Claude: Token Issues, Declining Quality, and Poor Support

https://nickyreinert.de/en/2026/2026-04-24-claude-critics/
4•y42•22m ago•0 comments

Is the Novelty Budget Dead?

https://simonshine.dk/articles/is-the-novelty-budget-dead/
1•sshine•24m ago•0 comments

PGO Build TPC-C Analysis MariaDB v11.8.6 TideSQL

https://tidesdb.com/articles/pgo-build-tpc-c-analysis-mariadb-v11-8-6-tidesql/
1•alexpadula•26m ago•0 comments

LLMs – What Experienced Practitioners See

https://dr-knz.net/llms-in-practice.html
1•knz42•27m ago•1 comments

Ask HN: How does Google crawls x.com website?

1•iaziz786•28m ago•1 comments

Games for Change

https://www.gamesforchange.org/
1•csmillie•30m ago•0 comments

What Anthropic's Mythos Means for the Future of Cybersecurity

https://spectrum.ieee.org/ai-cybersecurity-mythos
2•Brajeshwar•32m ago•1 comments

Refuse to let your doctor record you

https://buttondown.com/maiht3k/archive/why-you-should-refuse-to-let-your-doctor-record/
39•speckx•33m ago•31 comments

Space Reactor 1

https://en.wikipedia.org/wiki/Space_Reactor%E2%80%911_Freedom
1•hansmayer•35m ago•0 comments

Intel stock hits new all-time highs for first time since 2000

https://cryptobriefing.com/intel-stock-all-time-high-since-2000/
3•mgh2•35m ago•2 comments

Content credentials – hardware signing of photo and video cameras

https://contentcredentials.org/
1•sveme•35m ago•0 comments

Why I'm Done Making Desktop Applications

https://www.kalzumeus.com/2009/09/05/desktop-aps-versus-web-apps/
24•claxo•37m ago•14 comments

Six things I'll remember when I think about Tim Cook's version of Apple

https://arstechnica.com/gadgets/2026/04/six-things-ill-remember-when-i-think-about-tim-cooks-vers...
1•01-_-•41m ago•0 comments