It's a Rust core with a Python CLI. One SQLite file stores everything -- text, 384-dim vector embeddings, JSON metadata, access tracking. No API keys, no cloud, no external vector DB.
What makes it different from Mem0/Engram/agent-recall:
- Hybrid search: FTS5 full-text + cosine vector search, fused with Reciprocal Rank Fusion. Text queries auto-vectorize -- no manual --vector flag needed.
- Auto-dedup: cosine similarity > 0.92 between same-type memories triggers an update instead of a new insert. Your agent can store aggressively without worrying about duplicates.
- Decay scoring: logarithmic access boost + exponential time decay (~69 day half-life). Frequently-used memories surface first; stale ones fade.
- Built-in embeddings: fastembed AllMiniLM-L6-V2 ships with the binary. No OpenAI calls.
- One-step setup: `memori setup` injects a behavioral snippet into ~/.claude/CLAUDE.md that teaches the agent when to store, search, and self-maintain its own memory.
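To make the "fused with Reciprocal Rank Fusion" step concrete, here is a minimal RRF sketch. The doc IDs and the helper name `rrf_fuse` are hypothetical; `k = 60` is the conventional RRF constant, and the actual fusion in memori may tune it differently.

```python
# Reciprocal Rank Fusion: merge an FTS5 ranking and a vector ranking.
# Each document scores 1/(k + rank) per list it appears in; sums decide the order.
def rrf_fuse(rankings: list[list[str]], k: int = 60) -> list[str]:
    scores: dict[str, float] = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

fts_hits = ["m3", "m1", "m7"]   # full-text (FTS5) order
vec_hits = ["m1", "m7", "m9"]   # cosine-similarity order
print(rrf_fuse([fts_hits, vec_hits]))  # → ['m1', 'm7', 'm3', 'm9']
```

A memory ranked decently in both lists (`m1`) beats one ranked first in only one list (`m3`), which is the point of fusing text and vector results instead of picking a winner.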
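The decay scoring above can be sketched as a log access boost multiplied by exponential time decay; the ~69-day half-life corresponds to a decay rate of roughly 0.01/day, since ln(2)/0.01 ≈ 69.3. The exact combination (product vs. sum, base constants) is my assumption, not the project's verified formula.

```python
import math

HALF_LIFE_DAYS = 69.3                       # ~69-day half-life from the post
DECAY = math.log(2) / HALF_LIFE_DAYS        # ≈ 0.01 per day

def decay_score(access_count: int, age_days: float) -> float:
    # Assumed combination: logarithmic access boost * exponential time decay.
    return math.log1p(access_count) * math.exp(-DECAY * age_days)

# A memory accessed 50 times but 140 days stale loses to a fresh,
# lightly-used one: frequency helps, but staleness eventually wins.
print(decay_score(5, 1) > decay_score(50, 140))  # → True
```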
Performance (Apple M4 Pro):
- UUID get: 43µs
- FTS5 text search: 65µs (1K memories) to 7.5ms (500K)
- Hybrid search: 1.1ms (1K) to 913ms (500K)
- Storage: 4.3 KB/memory, 8,100 writes/sec
- Insert + auto-embed: 18ms end-to-end

The vector search is brute-force (adequate up to ~100K memories), deliberately isolated in one function so it can be swapped for an HNSW index when someone needs it.
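Both the brute-force search and the 0.92 dedup threshold reduce to the same cosine primitive, so isolating it keeps the swap surface small. A minimal sketch (function names and the in-memory row format are my assumptions; memori reads the vectors out of SQLite):

```python
import heapq
import math

def cosine(a: list[float], b: list[float]) -> float:
    # Cosine similarity: dot product over the product of norms.
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb)

def brute_force_search(query: list[float],
                       rows: list[tuple[str, list[float]]],
                       k: int = 5) -> list[tuple[float, str]]:
    # One linear scan, one function: replace this body with an
    # HNSW lookup and nothing else in the pipeline changes.
    return heapq.nlargest(k, ((cosine(query, vec), mid) for mid, vec in rows))

def is_duplicate(new_vec: list[float], existing_vec: list[float],
                 threshold: float = 0.92) -> bool:
    # Dedup rule from the post: same-type memory above 0.92 cosine
    # similarity triggers an update instead of a new insert.
    return cosine(new_vec, existing_vec) > threshold
```

With 384-dim vectors and ~100K rows this scan is a few tens of millions of multiply-adds per query, which is why brute force stays adequate at that scale.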
After setup, Claude Code autonomously:
- Recalls relevant debugging lessons before investigating bugs
- Stores architecture insights that save the next session 10+ minutes of reading
- Remembers your tool preferences and workflow choices
- Cleans up stale memories and backfills embeddings

~195 tests (Rust integration + Python API + CLI subprocess), all real SQLite, no mocking.
MIT licensed.
GitHub: https://github.com/archit15singh/memori
Blog post on the design principles: https://archit15singh.github.io/posts/2026-02-28-designing-cli-tools-for-ai-agents/