frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

We don't need continual learning for AGI. What top labs are currently doing

3•kok14•1h ago
Many people think that we won't reach AGI or even ASI if LLM's don't have something called "continual learning". Basically, continual learning is the ability for an AI to learn on the job, update its neural weights in real-time, and get smarter without forgetting everything else (catastrophic forgetting). This is what we do everyday, without much effort.

What's interesting now, is if you look at what the top labs are doing, they’ve stopped trying to solve the underlying math of real-time weight updates. Instead, they’re simply brute-forcing it. It is exactly why, in the past ~ 3 months or so, there has been a step-function increase in how good the models have gotten.

Long story short, the gist of it is, if you combine:

very long context windows

reliable summarization

structured external documentation,

you can approximate a lot of what people mean by continual learning.

How it works is, the model does a task and absorbs a massive amount of situational detail. Then, before it “hands off” to the next instance of itself, it writes two things: short “memories” (always carried forward in the prompt/context) and long-form documentation (stored externally, retrieved only when needed). The next run starts with these notes, so it doesn't need to start from scratch.

Through this clever reinforcement learning (RL) loop, they train this behaviour directly, without any exotic new theory.

They treat memory-writing as an RL objective: after a run, have the model write memories/docs, then spin up new instances on the same, similar, and dissimilar tasks while feeding those memories back in. How this is done, is by scoring performance across the sequence, and applying an explicit penalty for memory length so you don’t get infinite “notes” that eventually blow the context window.

Over many iterations, you reward models that (a) write high-signal memories, (b) retrieve the right docs at the right time, and (c) edit/compress stale notes instead of mindlessly accumulating them.

This is pretty crazy. Because when you combine the current release cadence of frontier labs where each new model is trained and shipped after major post-training / scaling improvements, even if your deployed instance never updates its weights in real-time, it can still “get smarter” when the next version ships AND it can inherit all the accumulated memories/docs from its predecessor.

This is a new force multiplier, another scaling paradigm, and likely what the top labs are doing right now (source: TBA).

Ignoring any black swan level event (unknown, unknowns), you get a plausible 2026 trajectory:

We’re going to see more and more improvements, in an accelerated timeline. The top labs ARE, in effect, using continual learning (a really good approximation of it), and they are directly training this approximation, so it rapidly gets better and better.

Don't believe me? Look at what both OpenAi(https://openai.com/index/introducing-openai-frontier/) and Anthropic(https://resources.anthropic.com/2026-agentic-coding-trends-report) have mentioned as their core things they are focusing on. It's exactly why governments & corporations are bullish on this; there is no wall....

Styx Document Language

https://styx.bearcove.eu/
1•todsacerdoti•3m ago•0 comments

A Call for Meaningful Work at a Slower Pace

https://jenteottenburghs.wordpress.com/2025/11/18/a-call-for-meaningful-work-at-a-slower-pace/
1•carschno•3m ago•0 comments

Show HN: Anaya – CLI that scans codebases for DPDP compliance violations

https://github.com/sandip-pathe/anaya-scan
1•sandippathe•3m ago•0 comments

Show HN: Parsewise – Cursor for Business Documents

https://www.parsewise.ai/platform
1•maxhofer•3m ago•0 comments

Show HN: Chartle – Describe a chart in plain English and it creates it

https://www.chartle.app/
1•moorst•4m ago•1 comments

Show HN: A framework for building nexuses of agents

https://github.com/NetMindAI-Open/NexusAgent
1•Demi369•4m ago•0 comments

Fresh claim of making elusive 'hexagonal' diamond is the strongest yet

https://www.nature.com/articles/d41586-026-00711-9
1•Brajeshwar•8m ago•0 comments

AI can write genomes – how long until it creates synthetic life?

https://www.nature.com/articles/d41586-026-00681-y
1•Brajeshwar•9m ago•0 comments

Ex-Google PM Builds God's Eye to Monitor Iran in 4D [Text]

https://www.spatialintelligence.ai/p/the-intelligence-monopoly-is-over
2•fragmede•9m ago•1 comments

Show HN: Mnemora – Serverless memory DB for AI agents (no LLM in your CRUD path)

https://github.com/mnemora-db/mnemora
2•isaacgbc•12m ago•1 comments

Show HN: Slay the Spire 2 Wiki (database and card maker tool)

https://slaythespire2.gg
1•WanderZil•12m ago•1 comments

Top K is a deceptively hard problem in relational databases

https://www.paradedb.com/blog/optimizing-top-k
1•birdculture•13m ago•0 comments

Frak – a simple code deployment utility

https://github.com/frakjs/frak
1•strube•17m ago•0 comments

Ex-Google PM Builds God's Eye to Monitor Iran in 4D [video]

https://www.youtube.com/watch?v=0p8o7AeHDzg
2•KellyCriterion•18m ago•1 comments

Write Small Rust Scripts

https://llogiq.github.io/2026/03/05/auto.html
1•ingve•19m ago•0 comments

Bourdieu's theory of taste: a grumbling abrégé

https://dynomight.net/bourdieu/
1•sebg•20m ago•0 comments

Polsia – vibe coded companies with live revenue and digital marketing

https://polsia.com
1•DaedalusII•22m ago•0 comments

What's an API?

https://read.technically.dev/p/whats-an-api
1•sebg•24m ago•0 comments

Show HN: AI Governance Architecture – DB-Governed, LLM-Agnostic, EU AI Act

https://github.com/war851/AI-Governance-Architecture
1•war851•26m ago•1 comments

Paloha – Agence de communication Montpellier

https://paloha.fr/fr
1•PatrickNav•28m ago•1 comments

Are companies preventing sensitive data from being sent to external LLM APIs

1•jayakrishna96•29m ago•0 comments

Fly.io deleted my apps and DBs in an unrelated organization without warning

1•pakejro•30m ago•0 comments

Emacs internals: Deconstructing Lisp_Object in C (Part 2)

https://thecloudlet.github.io/blog/project/emacs-02/
2•thecloudlet•30m ago•0 comments

Deno Sandbox: run AI generated code with real isolation and complete control

https://deno.com/deploy/sandbox
1•AbuAssar•33m ago•0 comments

How to Convert OST File to PST File in Outlook?

https://blog.perfectdatasolutions.com/ost-to-pst-converter-software-2/
1•tieanderson•34m ago•1 comments

Stop Writing Instrumentation Code

https://encore.dev/blog/stop-writing-instrumentation-code
3•andout_•36m ago•0 comments

OpenClaw Agent

https://openclawagent.net
3•czmilo•37m ago•1 comments

ClickMem: Agent memory built on chDB(ClickHouse embedded)

https://github.com/auxten/clickmem
1•tosh•37m ago•0 comments

Looking for suggestions: project orchestration solutions

1•paulistaunb•41m ago•0 comments

Simple and Inexpensive Website Monitoring

https://updown.io/
1•jauco•42m ago•0 comments