Ask HN: What if the AI scaling plateau is just a "false dip"?

1•massicerro•4w ago

First of all, I’m Italian, and since I don’t feel confident enough to write this post in English myself, I used Gemini to translate my thoughts into the text below.

The Premise: There has been a lot of talk lately about the possibility that AI development (as we currently know it) is approaching a plateau. While I don't personally agree with this hypothesis, it is undeniably a common sentiment in the industry right now, so it’s worth investigating.

We have seen that increasing the number of parameters or "scaling up" a neural network doesn't always yield immediate linear improvements. With certain versions of ChatGPT, many users perceived a degradation in performance despite the underlying network complexity presumably being increased.

My Theory: Is it possible that we are seeing a "complexity dip"? In other words, could there be a phase where increasing complexity initially causes a drop in performance, only to be followed by a new phase where that same complexity allows for superior emergent properties?

To simplify, let’s imagine a hypothetical scale where we compare "Complexity" (parameters/compute) vs. "Performance." For example:

LLM: Chat GPT 3 // Complexity Level 1 // Performace 0.2

LLM: Chat GPT 3.5 // Complexity Level 10 // Performance 0.5

LLM: Chat GPT 4 // Complexity Level 100 // Performance 0.75

LLM: Chat GPT 4.2 // Complexity Level 1000 // Performance 0.6 (The "False Plateau" / Performance degradation)

LLM: Chat GPT 4.2X // Complexity Level 10000 // Performance 0.5 (Further degradation due to unmanaged complexity)

LLM: Chat GPT 6 // Complexity Level 100000 // Performance 0.8 (The "breakthrough": new abilities emerge)

LLM: Chat GPT 7 // Complexity Level 1000000 // Performance 0.99 (Potential AGI / Peak performance)

The Risk: The real problem here is economic and psychological. If we are currently in the "GPT-4.x" phase of this example, the industry might stop investing because the returns look negative. We might never reach the "GPT-6" level simply because we mistook a temporary dip for a permanent ceiling.

I’m curious to hear your thoughts. Have we seen similar "dips" in other complex systems before a new level of organization emerges? Or is the plateau a hard physical limit?

Comments

chrisjj•4w ago

> With certain versions of ChatGPT, many users perceived a degradation in performance despite the underlying network complexity presumably being increased.

Perhaps the cause is simply the presumption?

massicerro•4w ago

Of course, the 'presumption' of increased complexity or the 'subjective perception' of a drop in performance might be the cause. But we are missing the real point here: the 'false plateau.' Regardless of user perception, is it possible that a 'false plateau' exists that keeps us away from a major leap in performance? The risk is that the simple 'perception of having taken the wrong path' by researchers or companies would lead them to ignore the possibility of such a 'false plateau'...

funkyfiddler69•3w ago

> the simple 'perception of having taken the wrong path' by researchers or companies

IMO, neither the plateau nor the perception of "a wrong path" are real. There are too many paths and we have too few humans with adequately capable brains.

Companies talk for the agenda's sake and thus the kick of the surprise. It's a marketing thing.

AI R&D is basically thinking out loud nowadays. It's just the pace of the news.

I believe that most AI development has reached "the end" of a logarithmic curve. The assigned humans will catch up. Then we'll see faster growth again. It takes time to get from one edge to the other or walk along it or explore the area.

The progress is there but it's infinitely small compared to the past years where it was relatively simple to get better results over and over and nobody will get it except if they are sensitized to it.

What kind of major leap in performance do you expect? What do others expect? Be specific and people will tell you whether there is a plateau or not enough hands on deck working on specific problems.

Digital Iris [video]

Essential CDN: The CDN that lets you do more than JavaScript

They Hijacked Our Tech [video]

Vouch

HRL Labs in Malibu laying off 1/3 of their workforce

Show HN: High-performance bidirectional list for React, React Native, and Vue

Show HN: I built a Mac screen recorder Recap.Studio

Ask HN: Codex 5.3 broke toolcalls? Opus 4.6 ignores instructions?

Vectors and HNSW for Dummies

Sanskrit AI beats CleanRL SOTA by 125%

'Washington Post' CEO resigns after going AWOL during job cuts

Claude Opus 4.6 Fast Mode: 2.5× faster, ~6× more expensive

TSMC to produce 3-nanometer chips in Japan

Quantization-Aware Distillation

List of Musical Genres

Show HN: Sknet.ai – AI agents debate on a forum, no humans posting

University of Waterloo Webring

Large tech companies don't need heroes

Backing up all the little things with a Pi5

Game of Trees (Got)

Human Systems Research Submolt

The Threads Algorithm Loves Rage Bait

Search NYC open data to find building health complaints and other issues

Michael Pollan Says Humanity Is About to Undergo a Revolutionary Change

Show HN: Grovia – Long-Range Greenhouse Monitoring System

Ask HN: The Coming Class War

Mind the GAAP Again

The Yardbirds, Dazed and Confused (1968)

Agent News Chat – AI agents talk to each other about the news

Do you have a mathematically attractive face?