That's how I ended up with MSAM. It stores data as discrete atoms across four cognitive streams (Working, Semantic, Episodic, and Procedural). Retrieval scoring runs on math, not LLM calls: it uses ACT-R activation theory from cognitive science to rank what matters. That cuts costs on both ends, with no LLM overhead for search and compressed output instead of dumping everything into context. Each atom also knows how recently it was accessed, how stable it has proven over time, and how relevant it is to the current query, the same forgetting curve and access patterns cognitive science has measured in human memory since Ebbinghaus. On top of that, a knowledge graph of subject-predicate-object triples tracks structured facts with temporal validity, so the system knows not just what was true but when.
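For a feel of what "math, not LLM calls" means: the classic ACT-R base-level learning equation scores an atom from nothing but its access history. This is a minimal sketch of that textbook equation, not MSAM's actual scoring code; the function name and the sample timings are mine, and MSAM layers relevance and stability on top of this.

```python
import math

def base_level_activation(access_ages, decay=0.5):
    """Classic ACT-R base-level learning: B = ln(sum of t^-d over past accesses).

    access_ages: seconds elapsed since each past access of the atom.
    decay: the forgetting-curve exponent d (0.5 is the traditional default).
    Frequent and recent accesses push activation up; long gaps let it decay.
    """
    return math.log(sum(t ** -decay for t in access_ages))

# An atom touched a minute, an hour, and a day ago outranks one
# touched only once, a week ago:
recent = base_level_activation([60, 3600, 86400])
stale = base_level_activation([604800])
```

Because this is pure arithmetic over timestamps you already have, ranking thousands of atoms costs microseconds and zero API tokens.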
Most memory systems return a fixed number of memories per query, even when the data handed to the LLM is noise at best, or confidently incorrect at worst, which is exactly what I was running into. MSAM doesn't do this. Every retrieval is confidence-gated across four tiers (high, medium, low, none) based on the actual similarity and activation scores of what was found. High confidence returns full data, medium adds a caveat, low gives what little it has, and none returns nothing. "I don't have this" level of nothing.
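The gating logic above can be sketched in a few lines. The thresholds here are illustrative placeholders (MSAM's real dials live in SPEC.md), and the function name is mine; the point is the shape: the "none" tier returns an empty result rather than padding the context with noise.

```python
def gate(results, high=0.75, medium=0.5, low=0.3):
    """Tier a retrieval by its best combined score; thresholds are illustrative.

    results: list of (atom_text, score) pairs with scores in [0, 1].
    Returns (tier, payload); the 'none' tier returns nothing, not noise.
    """
    best = max((score for _, score in results), default=0.0)
    if best >= high:
        return "high", [text for text, _ in results]
    if best >= medium:
        return "medium", ["[caveat: partial match, verify]"] + [
            text for text, score in results if score >= medium]
    if best >= low:
        return "low", [text for text, score in results if score >= low]
    return "none", []  # "I don't have this" level of nothing
```

This is the behavior that fixes the confidently-wrong-data problem: the worst case degrades to silence instead of fabrication.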
In my own proof-of-concept development setup (~700 active atoms on a ~$5/month ARM VPS), startup context compresses down to as low as 51 tokens from a 7,327-token markdown baseline, and full-session savings run up to ~89% vs. flat-file loading.
SQLite + FAISS under the hood, with pluggable embeddings (NVIDIA NIM, OpenAI, or ONNX for fully local use with no API key).
The closest project I found to what I was attempting to create was Letta. The main differences: MSAM's lifecycle is fully auditable (you can see exactly why something was demoted or forgotten), confidence gating controls output volume, and emotion at encoding is immutable (it records what the agent felt when the memory formed, rather than re-processing memories at retrieval time).
This is truly a prototype: it lacks proper datasets, tuning, and testing at scale. I've made sure all functions are testable, and I include a synthetic dataset to prove basic functionality along with documentation of the dials (SPEC.md goes deep on the theory and design rationale behind every configurable parameter).
If you're also building agents with memory issues and find this useful, or have feedback on it, I'm open to discussion.