frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: A file-based agent memory framework that works like skill

https://github.com/NevaMind-AI/memU
11•Nicole9•1d ago
Hi HN,

We’ve been building [memU](https://github.com/NevaMind-AI/memU), an open-source memory framework for AI agents that supports both classic RAG and LLM-based direct file reading.

RAG has become the default in LLM systems, but many of its failures don’t come from the model — they come from the retrieval assumptions. Embedding-based retrieval is fundamentally an approximation over semantic similarity. It works well for fuzzy recall, but it often breaks when relevance ≠ correctness, which is common in real systems.

From a retrieval perspective, RAG struggles with: - Time- and version-sensitive facts (embeddings don’t encode validity or order) - Structured, canonical knowledge like configs, policies, or agent state - Multi-step reasoning, where incomplete or slightly wrong context compounds errors

In practice, RAG often returns plausible but incorrect context — especially harmful for agents that act over long horizons.

memU takes a different approach.

Instead of trying to make embedding search smarter, we ask: what should not be retrieved via embeddings at all?

Retrieval in memU starts at a Memory Category Layer: - memory is organized into semantically stable categories - each category is stored as a readable Markdown file - these files act as long-term, canonical memory

When a query arrives, the LLM reads the relevant memory files directly, using semantic understanding rather than vector similarity. Only when this layer is insufficient does memU fall back to item-level retrieval, optionally using embeddings for speed.

This design treats the LLM as what it’s increasingly good at: reading, reasoning, and maintaining structured knowledge, not just ranking vectors. Using Markdown files is deliberate — similar to ideas like `skills.md` — making memory explicit, inspectable, and stable over time.

Compared to existing approaches: - [mem0](https://github.com/mem0ai/mem0) is fast and simple with classic RAG, but can struggle with temporal accuracy and precise state changes.

- [Zep](https://github.com/getzep/graphiti) uses graphs, which handle structure well but add complexity and maintenance overhead.

- [memU](https://github.com/NevaMind-AI/memU) uses non-embedding retrieval to address RAG’s structural limits in accuracy, stability, and long-term consistency — without replacing RAG entirely.

For long-running agents, retrieval needs to provide reliable premises for reasoning, not just relevant text. In those settings, direct LLM reading over structured memory often aligns better with how models actually reason.

Comments

mikasisiki•1d ago
Feels like file-system-style storage is pretty similar, conceptually, to Claude’s current Skills design.
snasan•1d ago
There are quite a few frameworks focused on agent memory now, and I’m not sure if yours is better than Mem0.
Junnn•1d ago
I’m working on a sales assistant agent with long-term memory. What database does memU support by default? I’m using pg.
quinncom•1d ago
It appears that this is a tool useful for people who are building AI agents. Rather than for people who are using AI agents such as Claude Code. MCP is not mentioned in the README.

Notebook Lawyer

https://avc.xyz/notebook-lawyer
1•sethbannon•2m ago•0 comments

Nestlé infant formula recall spans globe

https://efoodalert.com/2026/01/07/nestle-infant-formula-recall-spans-globe-updated-january-7-2026/
1•speckx•3m ago•0 comments

Dell admits consumers don't care about AI PCs

https://www.theverge.com/news/857723/dell-consumers-ai-pcs-comments
1•thisislife2•4m ago•0 comments

Show HN: Basic AI agent that auto-generates B2B sales follow-ups

https://github.com/sneurgaonkar/sales-followup-agent
1•sneurgaonkar•6m ago•0 comments

Zed: Dev Containers

https://zed.dev/docs/dev-containers
1•tosh•7m ago•0 comments

The Inevitable Rise of the Art TV

https://www.wired.com/story/art-frame-tv-trends/
1•m463•8m ago•0 comments

Some programming languages worth learning

https://codecrafters.io/blog/new-programming-languages
1•vitaelabitur•10m ago•0 comments

Filmmaker Béla Tarr Has Died

https://en.wikipedia.org/wiki/B%C3%A9la_Tarr
1•keiferski•10m ago•0 comments

Bela Tarr, RIP

https://www.nytimes.com/2026/01/06/movies/bela-tarr-dead.html
1•paulpauper•11m ago•0 comments

Australia's social media ban could affect art institutions

https://www.theartnewspaper.com/2026/01/05/how-australias-social-media-ban-could-affect-art-insti...
2•paulpauper•11m ago•0 comments

Virus Total Analysis

https://www.virustotal.com/gui/file/1f8c98a24f1dc2e22a18ce4218972ce83b7da4d54142d2ca0caeb05225dbc...
1•KaoruAK•11m ago•0 comments

Why are knots so useful the studying numbers?

https://old.maa.org/press/periodicals/convergence/unreasonable-effectiveness-of-knot-theory
1•morpheos137•12m ago•1 comments

Reflections on Vibe Researching

https://joshuagans.substack.com/p/reflections-on-vibe-researching
1•paulpauper•13m ago•0 comments

Project Ava: The Next Evolution of AI Companions

https://www.razer.com/newsroom/product-news/project-ava/
1•dfajgljsldkjag•13m ago•0 comments

ICE agents fatally shoot woman in Minneapolis

https://www.cnbc.com/2026/01/07/ice-dhs-minneapolis-shooting.html
10•erhuve•14m ago•0 comments

The AI Will Vote the Shares

https://www.bloomberg.com/opinion/newsletters/2026-01-07/the-ai-will-vote-the-shares
1•feross•16m ago•0 comments

Your Brain on ChatGPT [pdf]

https://www.researchgate.net/publication/392560878_Your_Brain_on_ChatGPT_Accumulation_of_Cognitiv...
2•herbertl•16m ago•0 comments

Show HN: A to Z – A word game I built from a childhood road trip memory

https://a26z.fun/
1•jackhulbert•17m ago•0 comments

Web dependencies are broken. Can we fix them?

https://lea.verou.me/blog/2026/web-deps/
1•speckx•18m ago•0 comments

Introducing ChatGPT Health

https://openai.com/index/introducing-chatgpt-health/
6•saikatsg•18m ago•4 comments

United States Invasion of Grenada of 1983

https://en.wikipedia.org/wiki/United_States_invasion_of_Grenada
1•thinkingemote•19m ago•0 comments

Cisco MCP Scanner Behavioural Code Scanning for Threats

https://blogs.cisco.com/ai/ciscos-mcp-scanner-introduces-behavioral-code-threat-analysis
2•hsanthan•21m ago•1 comments

50k people were dropped from one AI training project during the holidays

2•KyleW9•25m ago•3 comments

The importance of Agent Harness in 2026

https://www.philschmid.de/agent-harness-2026
1•twapi•26m ago•0 comments

Russia Once Offered U.S. Control of Venezuela for Free Rein in Ukraine

https://www.nytimes.com/2026/01/06/world/americas/russia-us-venezuela-ukraine.html
10•croes•27m ago•0 comments

Merry Christmas Day Have a MongoDB Security Incident

https://doublepulsar.com/merry-christmas-day-have-a-mongodb-security-incident-9537f54289eb
2•begueradj•28m ago•0 comments

A practical guide to converting YAML to JSON safely (with Kubernetes examples)

https://coderaviverma.github.io/yaml-to-json-guide/
5•jsonviewertool•28m ago•7 comments

Tailwind creator: we had six months left

https://twitter.com/adamwathan/status/2008909129591443925
3•brunojppb•29m ago•1 comments

Show HN: Vid2ascii – real-time video to ASCII with WebGPU

https://wspr-zeta.vercel.app/
1•luthiraabeykoon•30m ago•0 comments

Let AI speak in its mother tongue

https://manidoraisamy.com/ai-mother-tongue.html
1•QueensGambit•30m ago•1 comments