frontpage.

We don't need continual learning for AGI. What top labs are currently doing

3•kok14•1h ago

Many people think that we won't reach AGI or even ASI if LLM's don't have something called "continual learning". Basically, continual learning is the ability for an AI to learn on the job, update its neural weights in real-time, and get smarter without forgetting everything else (catastrophic forgetting). This is what we do everyday, without much effort.

What's interesting now, is if you look at what the top labs are doing, they’ve stopped trying to solve the underlying math of real-time weight updates. Instead, they’re simply brute-forcing it. It is exactly why, in the past ~ 3 months or so, there has been a step-function increase in how good the models have gotten.

Long story short, the gist of it is, if you combine:

very long context windows

reliable summarization

structured external documentation,

you can approximate a lot of what people mean by continual learning.

How it works is, the model does a task and absorbs a massive amount of situational detail. Then, before it “hands off” to the next instance of itself, it writes two things: short “memories” (always carried forward in the prompt/context) and long-form documentation (stored externally, retrieved only when needed). The next run starts with these notes, so it doesn't need to start from scratch.

Through this clever reinforcement learning (RL) loop, they train this behaviour directly, without any exotic new theory.

They treat memory-writing as an RL objective: after a run, have the model write memories/docs, then spin up new instances on the same, similar, and dissimilar tasks while feeding those memories back in. How this is done, is by scoring performance across the sequence, and applying an explicit penalty for memory length so you don’t get infinite “notes” that eventually blow the context window.

Over many iterations, you reward models that (a) write high-signal memories, (b) retrieve the right docs at the right time, and (c) edit/compress stale notes instead of mindlessly accumulating them.

This is pretty crazy. Because when you combine the current release cadence of frontier labs where each new model is trained and shipped after major post-training / scaling improvements, even if your deployed instance never updates its weights in real-time, it can still “get smarter” when the next version ships AND it can inherit all the accumulated memories/docs from its predecessor.

This is a new force multiplier, another scaling paradigm, and likely what the top labs are doing right now (source: TBA).

Ignoring any black swan level event (unknown, unknowns), you get a plausible 2026 trajectory:

We’re going to see more and more improvements, in an accelerated timeline. The top labs ARE, in effect, using continual learning (a really good approximation of it), and they are directly training this approximation, so it rapidly gets better and better.

Don't believe me? Look at what both OpenAi(https://openai.com/index/introducing-openai-frontier/) and Anthropic(https://resources.anthropic.com/2026-agentic-coding-trends-report) have mentioned as their core things they are focusing on. It's exactly why governments & corporations are bullish on this; there is no wall....

Styx Document Language

A Call for Meaningful Work at a Slower Pace

Show HN: Anaya – CLI that scans codebases for DPDP compliance violations

Show HN: Parsewise – Cursor for Business Documents

Show HN: Chartle – Describe a chart in plain English and it creates it

Show HN: A framework for building nexuses of agents

Fresh claim of making elusive 'hexagonal' diamond is the strongest yet

AI can write genomes – how long until it creates synthetic life?

Ex-Google PM Builds God's Eye to Monitor Iran in 4D [Text]

Show HN: Mnemora – Serverless memory DB for AI agents (no LLM in your CRUD path)

Show HN: Slay the Spire 2 Wiki (database and card maker tool)

Top K is a deceptively hard problem in relational databases

Frak – a simple code deployment utility

Ex-Google PM Builds God's Eye to Monitor Iran in 4D [video]

Write Small Rust Scripts

Bourdieu's theory of taste: a grumbling abrégé

Polsia – vibe coded companies with live revenue and digital marketing

What's an API?

Show HN: AI Governance Architecture – DB-Governed, LLM-Agnostic, EU AI Act

Paloha – Agence de communication Montpellier

Are companies preventing sensitive data from being sent to external LLM APIs

Fly.io deleted my apps and DBs in an unrelated organization without warning

Emacs internals: Deconstructing Lisp_Object in C (Part 2)

Deno Sandbox: run AI generated code with real isolation and complete control

How to Convert OST File to PST File in Outlook?

Stop Writing Instrumentation Code

OpenClaw Agent

ClickMem: Agent memory built on chDB(ClickHouse embedded)

Looking for suggestions: project orchestration solutions

Simple and Inexpensive Website Monitoring