frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

A Bitter Lesson for Memory

https://personal-website-3bed.onrender.com/blog-viewer.html?slug=A%20Bitter%20Lesson%20for%20Memory
4•wenhan_zhou•1h ago

Comments

wenhan_zhou•1h ago
If understanding emerges from pre-training, then perhaps memory is what emerges from post-training.
wgd•1h ago
I've always been amazed at how terrible most frontier LLMs are at compaction given how embarrassingly easy it is to come up with half a dozen different RL training evals which would teach models to generate useful context summaries. Heck, you could bolt it onto any existing RL eval by just forcing a compaction every three turns.