I'm an independent researcher, and I've just published a whitepaper on an LLM memory architecture I designed: DREAM (Dynamic Retention Episodic Architecture for Memory).
The core problem I'm tackling is the tension between persistent memory and cost in large-scale AI systems. Storing nothing forces users to re-explain context. Storing everything creates a privacy, latency, and cost nightmare.
DREAM is a plug-in architectural pattern that sits around the LLM (no model changes needed) and unifies existing tech (RAG, NoSQL).
The core innovation is the Adaptive Retention Mechanism (ARM).
Instead of a static 30-day TTL, ARM dynamically extends an episode's TTL based on user engagement. For example, an episode's TTL doubles each time the user revisits it (e.g., 7 days -> 14 -> 28 -> 56).
This creates a "self-pruning" memory layer where storage cost scales directly with actual user relevance, not just raw traffic.
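To make ARM concrete, here's a minimal Python sketch of the doubling rule. The class shape, method names, and the 365-day cap are illustrative assumptions on my part; in a real deployment the TTL would be enforced at the storage layer (e.g., Cassandra's per-row TTL) rather than in application memory.

```python
import time

BASE_TTL_DAYS = 7    # initial retention window
MAX_TTL_DAYS = 365   # illustrative cap so TTLs can't grow unbounded

class Episode:
    """One stored memory episode with an adaptive TTL."""

    def __init__(self, episode_id: str):
        self.episode_id = episode_id
        self.ttl_days = BASE_TTL_DAYS
        self._refresh_expiry()

    def on_revisit(self) -> None:
        # ARM rule: each revisit doubles the TTL (7 -> 14 -> 28 -> 56 ...)
        self.ttl_days = min(self.ttl_days * 2, MAX_TTL_DAYS)
        self._refresh_expiry()

    def is_expired(self) -> bool:
        return time.time() >= self.expires_at

    def _refresh_expiry(self) -> None:
        self.expires_at = time.time() + self.ttl_days * 86400
```

Episodes that are never revisited simply age out at the base TTL, which is what produces the self-pruning behavior.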
The architecture also includes:
Episodic Units (EUs): Storing compressed summaries + embeddings, not raw logs (see the sketch after this list).
User-Centric Opt-In: Explicit user approval per episode for privacy.
Aligned Sharding: A design for sharding orchestrators and storage (partitioned by user_id) to ensure horizontal scalability and cache locality (see the sketch after this list).
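For reference, here's the rough shape of an Episodic Unit as a plain data record. The field names are illustrative, not a fixed schema from the paper:

```python
from dataclasses import dataclass

@dataclass
class EpisodicUnit:
    user_id: str            # shard/partition key (see Aligned Sharding)
    episode_id: str
    summary: str            # compressed summary, never the raw conversation log
    embedding: list[float]  # dense vector; indexed in FAISS for retrieval
    ttl_days: int = 7       # managed by ARM (doubles on revisit)
    opted_in: bool = False  # explicit per-episode user approval
```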
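And a sketch of the aligned shard routing. The hash-mod scheme below is just one concrete choice (Cassandra's own partitioner could play the same role); the key point is that the orchestrator and the storage partition derive the shard from the same user_id, so a user's episodes stay co-located:

```python
import hashlib

NUM_SHARDS = 64  # illustrative shard count

def shard_for(user_id: str) -> int:
    """Map a user_id to a shard; used by both the orchestrator and storage."""
    digest = hashlib.sha256(user_id.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % NUM_SHARDS
```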
I designed DREAM to be a practical blueprint implementable with today's infrastructure (Cassandra, FAISS, Kubernetes, etc.).
I don't have the resources to test this at scale, so I'm publishing the architecture to share the idea. I would be genuinely grateful for any technical feedback, criticisms, or thoughts on the design.
Whitepaper (PDF): https://zenodo.org/records/17619917
GitHub (Code Examples/Arch): https://github.com/MatheusPereiraSilva/dream-architecture