Ask HN: Will AI models over time converge into the same system?

7•ThinkBeat•6mo ago
I probably am not using the correct terms here so sorry about that.

If all general LLMs are eventually exposed to the same data and many of the same use cases, will their responses converge over time?

Even if they have different architectures? Or are the architectures companies currently use for their big LLMs close enough to each other?

Comments

allears•6mo ago
Not an expert, but I believe it's just the opposite. Even with the same LLM and the same training data, responses diverge. And that can be a problem.
drooby•6mo ago
I'd think yes.

Intelligence is a model of reality and the future. They'll converge into the same system as a reflection of the laws of physics and human psychology.

And then when they are used as weapons they'll perhaps try to diverge and it will become an arms race to create models of the adversaries models.

_

Another way to look at it is our own history. Intelligent apes all "converged" into our one Homo sapiens.

Buttons840•6mo ago
I wonder how much of the AI depends on its initial weights? If in coming decades we understand better how neural networks work, it would be funny to look back and realize that Google beat OpenAI because they got lucky with their initial weights or something.
joules77•6mo ago
At a basic level it generates a probability distribution of what the next token should be.

There are a zillion questions you can ask where the probability distribution comes out flat, meaning multiple tokens have the same probability. Then it has to pick one at random, and you can get large variation.
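
A minimal sketch of that effect in Python, using only the standard library. The token list and logits are made up for illustration and do not come from any real model; `softmax` and `sample_token` are hypothetical helper names.

```python
import math
import random

def softmax(logits, temperature=1.0):
    """Turn raw scores into a probability distribution."""
    scaled = [x / temperature for x in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def sample_token(tokens, logits, rng, temperature=1.0):
    """Draw one token according to the softmax probabilities."""
    probs = softmax(logits, temperature)
    return rng.choices(tokens, weights=probs, k=1)[0]

# Hypothetical vocabulary and scores, for illustration only.
tokens = ["red", "green", "blue", "yellow"]
flat_logits = [1.0, 1.0, 1.0, 1.0]     # every token equally likely
peaked_logits = [10.0, 1.0, 1.0, 1.0]  # one token dominates

rng = random.Random(0)
flat_picks = {sample_token(tokens, flat_logits, rng) for _ in range(200)}
peaked_picks = [sample_token(tokens, peaked_logits, rng) for _ in range(200)]

print("distinct tokens from the flat distribution:", sorted(flat_picks))
print("share of 'red' from the peaked distribution:",
      peaked_picks.count("red") / len(peaked_picks))
```

With the flat scores, repeated sampling wanders across the whole vocabulary; with the peaked scores it almost always returns the same token. That is the variation described above, before temperature or any other inference setting enters the picture.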

l33tbro•6mo ago
I'd guess no. While they have similar training data, there is plenty of novelty and unique data entering each model due to how each user is using it. This is why ideas like model collapse are fun in theory, but don't really play out due to the irregular ways LLMs are used in the real world.

I could be wrong, but I have not heard a convincing argument for what you propose.

ijk•6mo ago
In aggregate? Signs point to yes. For the general purpose SFT base models. We see some evidence even with RNNs vs Transformers. You're essentially finding a function that models language. Use the same optimization function, get a similar result.

However, the RL and especially the RLHF does a lot to reshape the responses, and that's potentially a lot more varied. For the training that wasn't just cribbed from ChatGPT, anyway.

Lastly, it's unlikely that you'll get the _exact same_ responses; there are too many variables at inference time alone. And as for training, we can fingerprint models by their vocabulary to a certain extent. So in practical terms there are probably always going to be some differences.

This assumes our current training approaches don't change too drastically, of course.

UltraSane•6mo ago
This is called the Platonic Representation Hypothesis:

https://arxiv.org/abs/2405.07987

We argue that representations in AI models, particularly deep networks, are converging. First, we survey many examples of convergence in the literature: over time and across multiple domains, the ways by which different neural networks represent data are becoming more aligned. Next, we demonstrate convergence across data modalities: as vision models and language models get larger, they measure distance between datapoints in a more and more alike way. We hypothesize that this convergence is driving toward a shared statistical model of reality, akin to Plato's concept of an ideal reality. We term such a representation the platonic representation and discuss several possible selective pressures toward it. Finally, we discuss the implications of these trends, their limitations, and counterexamples to our analysis.
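
A rough sketch of what "measuring distance between datapoints in a more and more alike way" could mean, in Python. This is not the alignment metric from the paper; it is a simpler stand-in that correlates the pairwise distances two representations assign to the same items. The points and the names `model_a`, `model_b`, `model_c` are invented for illustration.

```python
import math
from itertools import combinations

def pairwise_distances(points):
    """Euclidean distance for every pair of points, in a fixed order."""
    return [math.dist(a, b) for a, b in combinations(points, 2)]

def pearson(xs, ys):
    """Pearson correlation between two equal-length lists."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = math.sqrt(sum((x - mx) ** 2 for x in xs))
    sy = math.sqrt(sum((y - my) ** 2 for y in ys))
    return cov / (sx * sy)

def rotate(point, angle):
    """Rotate a 2D point around the origin."""
    x, y = point
    c, s = math.cos(angle), math.sin(angle)
    return (c * x - s * y, s * x + c * y)

# Hypothetical 2D embeddings of the same five items.
model_a = [(0.0, 0.0), (1.0, 0.0), (0.0, 2.0), (3.0, 1.0), (2.0, 2.5)]
# Same geometry in a different coordinate system: the raw numbers differ.
model_b = [rotate(p, 1.0) for p in model_a]
# Same coordinates assigned to different items: the geometry no longer matches.
model_c = [model_a[4], model_a[0], model_a[3], model_a[1], model_a[2]]

alignment_ab = pearson(pairwise_distances(model_a), pairwise_distances(model_b))
alignment_ac = pearson(pairwise_distances(model_a), pairwise_distances(model_c))

print("alignment a vs b:", alignment_ab)
print("alignment a vs c:", alignment_ac)
```

The point of the toy: `model_a` and `model_b` have different coordinates for every item yet agree on which items are near each other, so they count as aligned. Convergence in this sense is about the distance structure, not about the weights or raw activations being equal.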

moomoo11•6mo ago
There are like maybe <100 people who actually contribute actively to LLMs.

Just treat it like a commodity (like cloud infrastructure) and build cool shit using it.

If the provider can roll that feature into their offerings then you’re not actually adding any value to the world.

mikewarot•6mo ago
I'm fairly certain that wouldn't happen, unless you were to overfit the models until the error dropped to zero, which would likely take almost infinite time. If you did get to that point, you'd have managed to achieve lossless compression of the training data into the weights of the model.

Given that AI models are randomly initialized with noise, and the goal of training is to avoid overfit, there will always be variance between the weights of models, even if trained from the same data, due to those initial conditions, and chaos theory.

And all of the above is for the same model architecture. I expect you could do some principal component analysis and come up with a transform to work between models, again if they were overfit to zero error. (After all, that would be a compression engine instead of an AI at that point.)

Upon reflection, it seems to me that free Stanford AI course I took a decade ago actually stuck. 8)
