A few years ago everyone was using RAG: embeddings and databases layered on top of models. Now models with access to local markdown and memory files (like OpenClaw), using grep and simple UNIX tools, seem to readily outperform those database setups.
Is this an inherent issue in scaling LLMs? Does Obsidian work that much better for most people? Is anyone finding anything that actually outperforms markdown?
At this point the main bottleneck in my adoption seems to be memory and persistent long term context, not quality or reliability of the models.
I'm curious if there are any technical or scaling metrics we could use to forecast where this will end up going.
kageroumado•57m ago
The full context then looks something like: [intro prompt] + [old exchanges, level-1 summaries] + [larger system prompt] + [more recent exchanges, level-0 summaries] + [temporal context] + [recent messages with tool results stripped] + [recent messages including tool results]
Tool results are progressively stripped because they are generally only useful for a few turns. This lets us keep everything we've ever done in the context, and the model can easily look up more information by expanding each node. It's a single perpetual session that never compacts during active work.
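For what it's worth, here's a minimal sketch of what that assembly could look like. All names (`build_context`, `TOOL_RESULT_WINDOW`, the message dict shape) are hypothetical illustrations, not from any real tool:

```python
TOOL_RESULT_WINDOW = 3  # keep raw tool output only for the last few turns (assumed cutoff)

def render_message(msg, keep_tools):
    """Render one exchange, optionally dropping its tool results."""
    parts = [f"{msg['role']}: {msg['text']}"]
    if keep_tools and msg.get("tool_results"):
        parts.extend(f"[tool] {r}" for r in msg["tool_results"])
    return "\n".join(parts)

def build_context(intro, lvl1_summaries, system_prompt,
                  lvl0_summaries, temporal, recent_messages):
    """Assemble the full prompt in the layered order described above."""
    n = len(recent_messages)
    rendered = [
        # only the newest TOOL_RESULT_WINDOW turns keep their tool results
        render_message(m, keep_tools=(n - i <= TOOL_RESULT_WINDOW))
        for i, m in enumerate(recent_messages)
    ]
    sections = (
        [intro]
        + lvl1_summaries      # level-1 summaries of old exchanges
        + [system_prompt]
        + lvl0_summaries      # level-0 summaries of more recent exchanges
        + [temporal]
        + rendered            # raw recent turns, tool results on the newest only
    )
    return "\n\n".join(sections)
```

The nice property is that nothing is ever deleted, only demoted to a cheaper representation, so total context size grows roughly with the number of summaries rather than the number of raw turns.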
I find it outperforms every other solution I've tried for my use case (a personal assistant).