frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: What if the AI scaling plateau is just a "false dip"?

1•massicerro•14h ago
First of all, I’m Italian, and since I don’t feel confident enough to write this post in English myself, I used Gemini to translate my thoughts into the text below.

The Premise: There has been a lot of talk lately about the possibility that AI development (as we currently know it) is approaching a plateau. While I don't personally agree with this hypothesis, it is undeniably a common sentiment in the industry right now, so it’s worth investigating.

We have seen that increasing the number of parameters or "scaling up" a neural network doesn't always yield immediate linear improvements. With certain versions of ChatGPT, many users perceived a degradation in performance despite the underlying network complexity presumably being increased.

My Theory: Is it possible that we are seeing a "complexity dip"? In other words, could there be a phase where increasing complexity initially causes a drop in performance, only to be followed by a new phase where that same complexity allows for superior emergent properties?

To simplify, let’s imagine a hypothetical scale where we compare "Complexity" (parameters/compute) vs. "Performance." For example:

LLM: Chat GPT 3 // Complexity Level 1 // Performace 0.2

LLM: Chat GPT 3.5 // Complexity Level 10 // Performance 0.5

LLM: Chat GPT 4 // Complexity Level 100 // Performance 0.75

LLM: Chat GPT 4.2 // Complexity Level 1000 // Performance 0.6 (The "False Plateau" / Performance degradation)

LLM: Chat GPT 4.2X // Complexity Level 10000 // Performance 0.5 (Further degradation due to unmanaged complexity)

LLM: Chat GPT 6 // Complexity Level 100000 // Performance 0.8 (The "breakthrough": new abilities emerge)

LLM: Chat GPT 7 // Complexity Level 1000000 // Performance 0.99 (Potential AGI / Peak performance)

The Risk: The real problem here is economic and psychological. If we are currently in the "GPT-4.x" phase of this example, the industry might stop investing because the returns look negative. We might never reach the "GPT-6" level simply because we mistook a temporary dip for a permanent ceiling.

I’m curious to hear your thoughts. Have we seen similar "dips" in other complex systems before a new level of organization emerges? Or is the plateau a hard physical limit?

Comments

chrisjj•14h ago
> With certain versions of ChatGPT, many users perceived a degradation in performance despite the underlying network complexity presumably being increased.

Perhaps the cause is simply the presumption?

massicerro•13h ago
Of course, the 'presumption' of increased complexity or the 'subjective perception' of a drop in performance might be the cause. But we are missing the real point here: the 'false plateau.' Regardless of user perception, is it possible that a 'false plateau' exists that keeps us away from a major leap in performance? The risk is that the simple 'perception of having taken the wrong path' by researchers or companies would lead them to ignore the possibility of such a 'false plateau'...

Show HN: Turn any topic into a 3Blue1Brown-style video

https://github.com/mateolafalce/topic2manim
1•lafalce•7s ago•0 comments

After planning, work history scatters across tools and people's heads

1•aryan_192002•1m ago•0 comments

UK threatened with sanctions if Starmer bans X

https://www.telegraph.co.uk/business/2026/01/09/uk-threatened-with-sanctions-if-starmer-bans-x/
1•TheAlchemist•2m ago•0 comments

Not All Browser APIs Are "Web" APIs

https://polypane.app/blog/not-all-browser-apis-are-web-apis/
1•OuterVale•2m ago•0 comments

Asset Hoard – Local-first asset manager for indie game devs (beta)

https://assethoard.com
1•markyg•6m ago•1 comments

Show HN: Awesome-Nanobanana-Prompts

https://github.com/Transcendo/awesome-nanobanana-prompts
1•hellomerlin•9m ago•0 comments

Hush Line review: Accessible whistleblowing platform for journalists and lawyers

https://www.privacyguides.org/posts/2026/01/09/hush-line-review-an-accessible-whistleblowing-plat...
2•evolve2k•9m ago•0 comments

Apple Loses Safari Lead Designer to the Browser Company

https://www.macrumors.com/2026/01/08/apple-loses-safari-designer-to-the-browser-company/
1•akyuu•10m ago•0 comments

Volcano Model DBMS

https://www.oreateai.com/blog/volcano-model-research-on-the-scalable-architecture-of-database-que...
1•flavio_poblete•14m ago•0 comments

A Nobel Prize cannot be revoked, shared, or transferred

https://www.nobelpeaceprize.org/press/press-releases/a-nobel-prize-cannot-be-revoked-shared-or-tr...
2•tech234a•16m ago•0 comments

AI's Memorization Crisis

https://www.theatlantic.com/technology/2026/01/ai-memorization-research/685552/
1•casparvitch•17m ago•1 comments

AI Coding

https://martinrue.com/on-ai-coding/
1•afisxisto•20m ago•1 comments

iOS 26 Shows Unusually Slow Adoption Months After Release

https://www.macrumors.com/2026/01/08/ios-26-shows-unusually-slow-adoption/
4•m463•20m ago•0 comments

Media Handling made simple using FileKit.dev

https://FileKit.dev
1•georgealbert•20m ago•0 comments

CES Worst in Show Awards Call Out the Tech Making Things Worse

https://apnews.com/article/ces-worst-show-ai-0ce7fbc5aff68e8ff6d7b8e6fb7b007d
1•m463•23m ago•0 comments

What is a Doomsday Plane and why did it land at LAX?

https://www.hindustantimes.com/world-news/us-news/what-is-a-doomsday-plane-and-why-did-it-land-at...
1•clanky•28m ago•3 comments

New evidence for a particle system that 'remembers' its previous quantum states

https://phys.org/news/2026-01-evidence-particle-previous-quantum-states.html
3•westurner•28m ago•1 comments

Recursive Language Models W: Alex Zhang [video]

https://www.youtube.com/watch?v=_TaIZLKhfLc
1•bob1029•34m ago•0 comments

Reason Studios acquired by AI music production specialist LANDR

https://www.musicradar.com/music-tech/this-isnt-about-changing-reason-its-about-giving-it-room-to...
1•CrypticShift•35m ago•0 comments

Americans Won't Ban Kids from Social Media. What Can We Do Instead?

https://www.newyorker.com/news/fault-lines/americans-wont-ban-kids-from-social-media-what-can-we-...
1•PaulHoule•40m ago•0 comments

Show HN: Scroll Podcasts Like TikTok

https://podtoc.com/app/
1•conradbez•42m ago•0 comments

Training Your Own LLM on a MacBook in 10 Minutes

https://opuslabs.substack.com/p/training-your-own-llm-on-a-macbook
1•opuslabs•44m ago•0 comments

Agentic ProbLLMs: Exploiting AI Computer-Use and Coding Agents [video]

https://www.youtube.com/watch?v=8pbz5y7_WkM
1•lynx97•47m ago•0 comments

How to Steal Any React Component

https://fant.io/react/
1•handfuloflight•47m ago•0 comments

ICEout.Tech demand letter from tech community

https://docs.google.com/forms/d/e/1FAIpQLSfCcCDd5aw2viBsT-sKAP5w9k66g8EdrSWpScTdM_-38v025g/viewform
7•theworkeragency•48m ago•3 comments

Amazon has big hopes for wearable AI – starting with this $50 gadget

https://www.seattletimes.com/business/amazon-has-big-hopes-for-wearable-ai-starting-with-this-50-...
1•walterbell•52m ago•0 comments

Show HN: Readable – A Swipeable Article Reader

https://chromewebstore.google.com/detail/readable-swipeable-articl/cegfoepnghfonapjdmjiigdekdnhnjof
2•randoglando•54m ago•0 comments

Bored

https://idiallo.com/static/bored.html
1•foxfired•55m ago•0 comments

Show HN: arxiv2md: Convert ArXiv papers to markdown

https://arxiv2md.org/
2•timf34•55m ago•0 comments

Firefox pinch zoom without trackpad

https://superuser.com/questions/1659519/firefox-pinch-zoom-without-trackpad
1•goodburb•56m ago•0 comments