I built Mnemora because every AI agent memory solution I evaluated (Mem0, Zep, Letta) routes data through an LLM on every read and write. At scale, that means 200-500ms latency per operation, token costs on your memory layer, and a runtime dependency you don't control.
Mnemora takes the opposite approach: direct database CRUD. State reads hit DynamoDB in under 10 ms. Semantic search uses pgvector with Bedrock Titan embeddings; the LLM only runs at write time, to generate the embedding vector. All reads are pure database queries.
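To make the read-path claim concrete, here's a minimal sketch of what a direct CRUD read looks like. A dict stands in for the DynamoDB table so it runs without AWS credentials; the key layout is illustrative, not Mnemora's actual schema:

```python
class WorkingMemory:
    """Key-value agent state. In production this would be a DynamoDB
    table keyed on (agent_id, key); a dict stands in here so the
    sketch is runnable without AWS."""

    def __init__(self):
        self._table = {}

    def put(self, agent_id: str, key: str, value: str) -> None:
        # Write path: one keyed PutItem-style write. No model call,
        # no tokens spent on the memory layer.
        self._table[(agent_id, key)] = value

    def get(self, agent_id: str, key: str):
        # Read path: one keyed lookup. Latency is the database
        # round-trip, not a 200-500ms LLM call.
        return self._table.get((agent_id, key))


mem = WorkingMemory()
mem.put("my-agent", "output_format", "bullet points")
print(mem.get("my-agent", "output_format"))  # bullet points
```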
Four memory types, one API:

1. Working memory: key-value state in DynamoDB (sub-10ms reads)
2. Semantic memory: vector-searchable facts in Aurora pgvector
3. Episodic memory: time-stamped event logs in S3 + DynamoDB
4. Procedural memory: rules and tool definitions (coming in v0.2)
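For the semantic type, the snippet below sketches the embed-once-at-write-time shape. The function names mirror the SDK's but this is an illustrative reimplementation, not Mnemora's code: a toy hash-based embedder stands in for Bedrock Titan, and an in-memory list stands in for the pgvector table.

```python
import hashlib
import math

def toy_embed(text: str, dim: int = 16) -> list[float]:
    """Deterministic stand-in for a Bedrock Titan embedding call --
    the only point a model runs on the write path."""
    vec = [0.0] * dim
    for token in text.lower().split():
        h = int(hashlib.sha256(token.encode()).hexdigest(), 16)
        vec[h % dim] += 1.0
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]

_facts: list[tuple[str, str, list[float]]] = []  # (agent_id, text, embedding)

def store_memory(agent_id: str, text: str) -> None:
    # Embedding is computed here, at write time; after that it's a
    # plain INSERT into the facts table.
    _facts.append((agent_id, text, toy_embed(text)))

def search_memory(query: str, agent_id: str, k: int = 3):
    # Equivalent of: SELECT ... ORDER BY embedding <=> :query LIMIT :k
    q = toy_embed(query)
    scored = [
        (sum(a * b for a, b in zip(q, emb)), text)
        for aid, text, emb in _facts
        if aid == agent_id
    ]
    return sorted(scored, reverse=True)[:k]

store_memory("my-agent", "User prefers bullet points over prose")
print(search_memory("bullet points", agent_id="my-agent"))
```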
Architecture: fully serverless on AWS — Aurora Serverless v2, DynamoDB on-demand, Lambda, S3. Idles at ~$1/month, scales per-request. Multi-tenant by default: each API key maps to an isolated namespace at the database layer.
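A common way to get that isolation at the key level (a sketch of the pattern, not necessarily Mnemora's exact scheme) is to derive a namespace from the API key and prefix every partition key with it:

```python
import hashlib

def namespace_for(api_key: str) -> str:
    """Derive a stable tenant namespace from an API key. Illustrative:
    a real system would typically look the key up, not hash it."""
    return hashlib.sha256(api_key.encode()).hexdigest()[:12]

def partition_key(api_key: str, agent_id: str) -> str:
    # Every item lives under the tenant's namespace, so one tenant's
    # queries can never range over another tenant's items.
    return f"{namespace_for(api_key)}#{agent_id}"

print(partition_key("mnm_abc123", "my-agent"))
```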
What I'd love feedback on:

1. Is the "no LLM in CRUD path" differentiator clear and compelling?
2. Would you use this over Mem0/Zep for production agents? What's missing?
3. What memory patterns are you solving that don't fit these 4 types?
Happy to answer architecture questions.
SDK:

```shell
pip install mnemora
```

```python
from mnemora import MnemoraSync

client = MnemoraSync(api_key="mnm_...")
client.store_memory("my-agent", "User prefers bullet points over prose")
results = client.search_memory("output format preferences", agent_id="my-agent")
# [0.54] User prefers bullet points over prose
```

Drop-in LangGraph CheckpointSaver, plus LangChain and CrewAI integrations.
Links:

- 5-min quickstart: https://mnemora.dev/docs/quickstart
- GitHub: https://github.com/mnemora-db/mnemora
- PyPI: https://pypi.org/project/mnemora/
- Architecture deep-dive: https://mnemora.dev/blog/serverless-memory-architecture-for-...