frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Unified multimodal memory framework, without embeddings

https://github.com/NevaMind-AI/memU
7•k_kiki•1d ago
Hi HN,

We’ve been building memU(https://github.com/NevaMind-AI/memU), an open-source, general-purpose memory framework for AI agents. It supports dual-mode retrieval: classic RAG and LLM-based direct file reading.

Most multimodal memory systems either embed everything into vectors or treat non-text data as attachments. These work, but at scale it becomes hard to explain why certain context was retrieved and what evidence it relies on.

memU takes a different approach: since models reason in language, multimodal memory should converge into structured, queryable text, while remaining fully traceable to original data.

---

## Three-Layer Architecture

- Resource Layer Stores raw multimodal data as ground truth. All higher-level memory remains traceable to this layer.

- Memory Item Layer Extracts atomic facts from raw data and stores them as natural-language statements. Embeddings are optional and used only for acceleration.

- Memory Category Layer Aggregates items into readable, theme-based memory files (e.g. user preferences, work logs). Frequently accessed topics stay active; low-usage content is demoted to balance speed and coverage.

---

## Memorization Bottom-up and asynchronous. Data flows from resources → items → category files without manual schemas. When capacity is reached, recently relevant memories replace the least used ones.

## Retrieval Top-down. memU searches category files first, then items, and only falls back to raw data if needed. At the item layer, it combines BM25 + embeddings to balance exact matching and semantic recall, avoiding embedding-only imprecision.

Dual-mode retrieval lets applications choose between: - low-latency embedding search, or - LLM-based direct reading of memory files.

## Evolution Memory structure adapts automatically based on real usage: - Frequently accessed memories remain at the Category layer - Memories retrieved from raw data are promoted upward and linked - Organization evolves from usage patterns, not predefined rules

Goal: keep relevant memories retrievable at the Category layer and minimize latency over time.

---

## A Unified Multimodal Memory Pipeline memU is a text-centered multimodal memory system. Multimodal inputs are progressively converted into interpretable text memory, while staying traceable to original data. This provides stable, high-level context for reasoning, with detailed evidence available when needed—inside a memory structure that evolves through real-world use.

Comments

Junnn•1d ago
From an engineering perspective, what I find compelling here is not “no embeddings”, but the decision to treat memory as a first-class, inspectable system rather than a retrieval trick.

Most agent memory stacks today collapse everything into embeddings and hope similarity search is enough. That works for recall, but breaks down quickly when you need traceability, temporal reasoning, or explanation of why something was remembered.

The layered design here (raw resources → extracted memory items → categorized memory files) feels much closer to how we design real systems: separation of concerns, clear abstraction boundaries, and the ability to reason about state changes over time.

Storing memories in human-readable form also makes debugging and evolution practical. You can audit what the agent “knows”, adjust policies, or let the LLM reason directly over memory instead of treating it as a black box vector store.

Embeddings still make sense as an optimization layer, but making them optional rather than foundational is an important architectural choice if agents are meant to run long-term and stay coherent.

This feels less like a retrieval hack and more like actual infrastructure.

Bohann•1d ago
Great to see a framework tackling the architecture of memory rather than just retrieval. The concept of separating 'Resource Layer' from 'Memory Item Layer' makes a lot of sense for avoiding context pollution in long-running agents.

Practically speaking, how significant is the improvement in retrieval accuracy compared to a standard RAG setup (e.g., vanilla vector search) for nuanced queries? I'd love to understand the 'lift' I could expect before migrating my current stack.

CHERIoT: Complete Memory Safety for Embedded Devices

https://dl.acm.org/doi/10.1145/3613424.3614266
1•blakepelton•20s ago•1 comments

Time in C++: Additional clocks in C++20

https://www.sandordargo.com/blog/2026/01/07/clocks-part-6-cpp20
1•ibobev•28s ago•0 comments

Warner Bros. sticks with Netflix merger, calls Paramount's $108B bid "illusory"

https://arstechnica.com/tech-policy/2026/01/warner-bros-sticks-with-netflix-merger-calls-paramoun...
1•voxadam•32s ago•0 comments

What I Got Wrong About "Hard Work" in My 20s

https://lemire.me/blog/2026/01/08/what-i-got-wrong-about-hard-work-in-my-20s/
1•ibobev•1m ago•0 comments

Today's White House Press Briefing had a 98% chance of running over 65 minutes

https://twitter.com/PredMTrader/status/2009018474916663346
1•doener•1m ago•0 comments

SwiftVector: Deterministic Control for Stochastic Agent Systems

https://www.agentincommand.ai/
1•macweeney•2m ago•0 comments

Show HN: Tachyon – A C++ JSON parser matching simdjson's throughput via AVX-512

https://github.com/wilkolbrzym-coder/Tachyon.JSON
1•wilkolbrzym•3m ago•1 comments

Multi REPL Sessions in Calva for Clojure [video]

https://www.youtube.com/watch?v=36dTtyfa_OY
1•TheWiggles•3m ago•0 comments

Maine company in the spotlight after Maduro apparently wore one of their hoodies

https://www.boston.com/news/business/2026/01/06/maine-company-maduro-venezuela-hoodie/
3•mhb•5m ago•0 comments

Researchers develop efficient, reusable compound for capturing CO2 from air

https://phys.org/news/2025-12-efficient-reusable-compound-capturing-carbon.html
2•PaulHoule•5m ago•0 comments

Extracting books from production language models

https://arxiv.org/abs/2601.02671
2•articsputnik•7m ago•0 comments

Dialogue Between a Developer and a Kid

https://riggraz.dev/dialogue-developer.html
2•todsacerdoti•8m ago•0 comments

Show HN: Agent Workflow Language – Open-Source Scala DSL for LLM Orchestration

https://awfl.us
2•Dezmo•8m ago•1 comments

ChatGPT Health Is a Marketplace, Guess Who Is the Product?

https://consciousdigital.org/chatgpt-health-is-a-marketplace-guess-who-is-the-product/
2•yoaviram•9m ago•0 comments

Show HN: Nucleus - AI prompt framework

https://github.com/michaelwhitford/nucleus
1•dulakian•10m ago•0 comments

Ask HN: Why has "using" gone out of fashion?

2•bf9d413906•11m ago•1 comments

JPMorgan Chase Reaches Deal to Take over Apple Credit Card

https://www.wsj.com/finance/banking/jpmorgan-chase-reaches-a-deal-to-take-over-the-apple-credit-c...
1•throw0101d•12m ago•0 comments

The Tyranny of the Complainers

https://marginalrevolution.com/marginalrevolution/2026/01/the-tyranny-of-the-complainers.html
1•mhb•13m ago•1 comments

Show HN: GhostMark – A bookmark manager that uses GitHub Gists as its database

https://github.com/abishekvenkat/ghostmark
1•abishekvenkat•14m ago•0 comments

How far can decentralized training over the internet scale?

https://epoch.ai/gradient-updates/how-far-can-decentralized-training-over-the-internet-scale
1•gmays•15m ago•0 comments

Interactive map: the EU's banned pesticide trade

https://unearthed.greenpeace.org/2025/12/16/interactive-map-eu-banned-pesticide-trade/
2•stareatgoats•16m ago•0 comments

Ask HN: Feeling irrelevant in back end. How to pivot to automotive software?

1•culopatin•17m ago•1 comments

Show HN: MirrorMate – A self-hosted AI smart mirror with local LLMs

https://github.com/orangekame3/mirrormate
1•orangekame3•19m ago•0 comments

NASA considers whether to bring sick crew member back to Earth

https://www.bbc.co.uk/news/articles/cx2l9perw68o
2•fredoralive•19m ago•0 comments

Show HN: Input password by twist your MacBook's lid

https://github.com/huanglizhuo/Flappy.Lid
2•huang4fun•20m ago•0 comments

The C-word: What security people don't get about compliance

https://bfswa.substack.com/p/the-c-word
1•zdw•20m ago•0 comments

Splitting a Linux Kernel Package

https://blog.brixit.nl/splitting-a-linux-kernel-package/
1•speckx•20m ago•0 comments

Neovim as a Window Manager

https://www.reddit.com/r/neovim/s/CotIhGMeIb
1•fork-bomber•21m ago•0 comments

Against Bravery Debates (2013)

https://slatestarcodex.com/2013/05/18/against-bravery-debates/
1•cassepipe•22m ago•0 comments

'Autofocus' glasses can change their lenses in real time

https://www.cnn.com/science/autofocus-glasses-ixi-change-lenses-spc
2•Brajeshwar•22m ago•0 comments