mynti•6d ago
Super interesting blogpost. I just wonder how this is actually different from LORA, since LORA also adds some parameters and freezes the rest of the model. This seems like a sparse, memory-efficient LORA with a couple of extra steps, since it uses attention again to make the sparsity work, all while being a lot more effective than LORA (a performance drop of only 11% compared to 71%).
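For reference, the usual LoRA setup is just a frozen base weight plus a small trainable low-rank update; a minimal sketch in PyTorch (layer sizes, rank, and init are illustrative, not from the post):

    import torch
    import torch.nn as nn

    class LoRALinear(nn.Module):
        """Frozen base linear layer plus a trainable low-rank update: W x + (B A) x."""
        def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
            super().__init__()
            self.base = base
            self.base.weight.requires_grad_(False)      # freeze original weights
            if self.base.bias is not None:
                self.base.bias.requires_grad_(False)
            self.A = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
            self.B = nn.Parameter(torch.zeros(base.out_features, rank))
            self.scale = alpha / rank

        def forward(self, x):
            # Frozen path plus low-rank trainable path; only A and B get gradients.
            return self.base(x) + (x @ self.A.T @ self.B.T) * self.scale

    layer = LoRALinear(nn.Linear(768, 768), rank=8)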
sva_•5h ago
> LORA
I think you meant LoRA (not to be confused with LoRa)
alyxya•8h ago
I think the solution to continual learning is as simple as using context distillation. We know that models are good at in-context learning, so we just want an efficient way to distill context into the weights. I suspect context rot may come from how the softmax in attention gets diluted with a longer context, so this wouldn't be an issue with context distillation.
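Context distillation in this sense can be sketched as training the bare model to match its own context-conditioned predictions. A rough outline in PyTorch, assuming an HF-style causal LM whose forward returns .logits (in practice you'd cache the teacher outputs or use a frozen copy of the model):

    import torch
    import torch.nn.functional as F

    def context_distillation_loss(model, ctx_ids, query_ids):
        """Distill a context prompt into the weights: match the model's
        no-context predictions on the query to its own predictions when
        the context is prepended."""
        with torch.no_grad():
            # Teacher: same model, conditioned on context + query.
            teacher_logits = model(torch.cat([ctx_ids, query_ids], dim=1)).logits
            teacher_logits = teacher_logits[:, -query_ids.size(1):, :]
        # Student: the model being updated, sees only the query.
        student_logits = model(query_ids).logits
        return F.kl_div(
            F.log_softmax(student_logits, dim=-1),
            F.softmax(teacher_logits, dim=-1),
            reduction="batchmean",
        )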
killerstorm•8h ago
Perhaps it can work through multiple stages: ICL -> prompt/context optimization (*) -> prefix tuning / KV distillation -> context distillation.
*: it is possible to measure how much a given part of a prompt helps with a task, e.g. by measuring the change in entropy
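One crude version of that measurement (a sketch, again assuming an HF-style causal LM; the proxy here is average predictive entropy on the task tokens with vs. without the prompt segment):

    import torch
    import torch.nn.functional as F

    @torch.no_grad()
    def entropy_drop(model, segment_ids, task_ids):
        """How much does prepending `segment_ids` reduce the model's
        average predictive entropy over the task tokens?"""
        def mean_entropy(input_ids, n_last):
            logits = model(input_ids).logits[:, -n_last:, :]
            probs = F.softmax(logits, dim=-1)
            ent = -(probs * F.log_softmax(logits, dim=-1)).sum(-1)  # per-token entropy
            return ent.mean().item()

        n = task_ids.size(1)
        without = mean_entropy(task_ids, n)
        with_seg = mean_entropy(torch.cat([segment_ids, task_ids], dim=1), n)
        return without - with_seg   # positive => the segment makes the model more confident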