The Continual Learning Problem

https://jessylin.com/2025/10/20/continual-learning/
59•Bogdanp•1w ago

Comments

optimalsolver•4h ago
Rather than handcrafting solutions like it’s 1993, why not make robustness against forgetting part of the training objective?

Let the search algorithm figure it out.

vessenes•3h ago
The reason you're getting slightly downvoted, I think, is that you need to answer this question first: which of the 15T tokens are you going to evaluate for forgetting? And please explain how doing that is different from doing another full-epoch-style pass over the weights.

Some of the appeal here is that this architecture (handcrafted) allows ongoing gradient descent learning as you go on a much smaller set of weights.

intalentive•1h ago
Funny you say that: this write-up reminded me of Stephen Grossberg's Adaptive Resonance Theory. The same basic ideas come up when addressing the stability-plasticity dilemma.

That said, the authors are saving this for future work. Fine-tuning is cheaper, easier, faster to validate.

>Switching to a new architecture at pretraining time has a high cost, but there are reasons we might want this (besides the better scaling behavior). The main benefit is that the model can learn to organize its memory from scratch, and once we’ve already “allocated” this high-capacity memory pool, there’s a clearer path to learning on multiple tasks and corpora over time.

This means you could "fine-tune" the model on your custom corpus at ingestion time, without having to actually train via backprop. Your corpus would be compressed into model-readable memory that updates model behavior. Then different memory units could be swapped in and out, like programs on a floppy disk. I can see this concept being especially useful for robotics.

yorwba•55m ago
The memory is model-readable but not model-writable, so you still need to train via backprop to get the memory to store useful data.

esafak•4h ago
Great writeup. Are there any libraries that implement some of the methods described?

skeptrune•3h ago
I appreciate that people are going beyond RAG and few-shot prompting.

The Mack Super Pumper was a locomotive engined fire fighter (2018)

https://bangshift.com/bangshiftxl/mack-super-pumper-system-locomotive-engine-powered-pumper-extin...
44•mstngl•1h ago•20 comments

Ask HN: Who is hiring? (November 2025)

253•whoishiring•6h ago•270 comments

Learning to read Arthur Whitney's C to become smart (2024)

https://needleful.net/blog/2024/01/arthur_whitney.html
184•gudzpoz•5h ago•71 comments

AI's Dial-Up Era

https://www.wreflection.com/p/ai-dial-up-era
14•nowflux•1h ago•4 comments

The MP3.com Rescue Barge Barge

https://blog.somnolescent.net/2025/09/mp3-com-rescue-barge-barge/
59•CharlesW•1w ago•11 comments

Gallery of wonderful drawings our little thermal printer received

https://guestbook.goodenough.us
53•busymom0•3h ago•18 comments

Ask HN: Who wants to be hired? (November 2025)

97•whoishiring•6h ago•201 comments

Tiny electric motor can produce more than 1,000 horsepower

https://supercarblondie.com/electric-motor-yasa-more-powerful-tesla-mercedes/
490•chris_overseas•12h ago•453 comments

S1130 – IBM 1130 Emulator in C#

https://github.com/semuhphor/S1130/tree/feature/web-frontend
15•rbanffy•1w ago•0 comments

</> Htmx – The Fetch()ening

https://htmx.org/essays/the-fetchening/
156•leephillips•2h ago•30 comments

Agent-O-rama: build LLM agents in Java or Clojure

https://blog.redplanetlabs.com/2025/11/03/introducing-agent-o-rama-build-trace-evaluate-and-monit...
13•yayitswei•3h ago•0 comments

The Case Against PGVector

https://alex-jacobs.com/posts/the-case-against-pgvector/
229•tacoooooooo•9h ago•92 comments

A visualization of the RGB space covered by named colors

https://codepen.io/meodai/full/zdgXJj/
190•BlankCanvas•5d ago•42 comments

WebAssembly (WASM) arch support for the Linux kernel

https://github.com/joelseverin/linux-wasm
199•marcodiego•2d ago•45 comments

Harder, Better, Faster, Stronger Version of Uber H3 in Rust

https://grim7reaper.github.io/blog/2023/01/09/the-hydronium-project/
71•ashergill•1w ago•25 comments

VimGraph

https://resources.wolframcloud.com/FunctionRepository/resources/VimGraph/
133•gdelfino01•8h ago•22 comments

Skyfall-GS – Synthesizing Immersive 3D Urban Scenes from Satellite Imagery

https://skyfall-gs.jayinnn.dev/
88•ChrisArchitect•8h ago•25 comments

FreakWAN: A floor-routing WAN implementing a chat over bare-LoRa (no LoRaWAN)

https://github.com/antirez/freakwan
13•teleforce•2h ago•4 comments

Open-sourced game logic, art and Spine animations – SuperWEIRD Game Kit

https://ludenio.itch.io/superweird-game-kit
5•gamescodedogs•4d ago•2 comments

First recording of a dying human brain shows waves similar to memory flashbacks

https://louisville.edu/medicine/news/first-ever-recording-of-a-dying-human-brain-shows-waves-simi...
170•thunderbong•15h ago•156 comments

Show HN: Tamagotchi P1 for FPGAs

https://github.com/agg23/fpga-tamagotchi
38•agg23•6d ago•3 comments

Robert Hooke's "Cyberpunk" Letter to Gottfried Leibniz

https://mynamelowercase.com/blog/robert-hookes-cyberpunk-letter-to-gottfried-leibniz/
61•Gormisdomai•6h ago•14 comments

An Illustrated Introduction to Linear Algebra, Chapter 2: The Dot Product

https://www.ducktyped.org/p/linear-algebra-chapter-2-the-dot
91•egonschiele•8h ago•47 comments

Measuring characteristics of TCP connections at Internet scale

https://blog.cloudflare.com/measuring-network-connections-at-scale/
47•fleahunter•5d ago•0 comments

The Case That A.I. Is Thinking

https://www.newyorker.com/magazine/2025/11/10/the-case-that-ai-is-thinking
97•ascertain•4h ago•256 comments

Show HN: a Rust ray tracer that runs on any GPU – even in the browser

https://github.com/tchauffi/rust-rasterizer
76•tchauffi•8h ago•21 comments

Leverage Points: Places to Intervene in a System (1999)

https://donellameadows.org/archives/leverage-points-places-to-intervene-in-a-system/
8•Jtsummers•2h ago•1 comment

State of Terminal Emulators in 2025: The Errant Champions

https://www.jeffquast.com/post/state-of-terminal-emulation-2025/
113•SG-•7h ago•80 comments

No Socials November

https://bjhess.com/posts/no-socials-november
94•speckx•5h ago•139 comments