frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

What changes when AI memory stops being ephemeral?

https://ryjoxdemo.com/architecture
2•JosephjackJR•2h ago

Comments

JosephjackJR•2h ago
I’ve been thinking about a problem that keeps resurfacing as AI systems become more autonomous and long running, but doesn’t seem to be discussed much outside of implementation details.

Most AI systems today treat memory as ephemeral. Context is fetched from a remote store, used briefly, and discarded. Persistence is something you layer on later, usually via a network call to a database that sits outside the reasoning loop. This model works reasonably well when interactions are short lived and connectivity is assumed.

It starts to feel fragile when systems are expected to run continuously, survive restarts, operate offline, or reason repeatedly over long histories. In those cases, memory access becomes part of the critical path rather than a background concern.

What struck me while working on this is that many performance and cost problems people attribute to “scale” are really consequences of where memory lives. If every recall requires a network hop, then latency, reliability, and cost are inherently coupled to usage. You can hide that with caching and batching, but the constraint never goes away.

We’ve been exploring an alternative approach where persistence is treated as part of the hot path instead of something bolted on. Memory lives locally alongside the application, survives restarts by default, and is accessed at hardware speed. Once retrieval stops leaving the machine, a few second order effects emerge that surprised us. Cost stops scaling with traffic. Recovery stops being an operational event. Systems behave the same whether they are online, offline, or at the edge.

I’m very early in this commercially and building it with a co founder, but before locking in assumptions I wanted to sanity check the architectural framing with people here. Does this line up with how others see AI systems evolving, or do you think the current model of ephemeral memory plus remote persistence is still the right long term abstraction?

I’ve documented the architecture and tradeoffs of what we’ve built so far here for anyone who wants concrete details.

I’m much more interested in the discussion than the implementation itself.

Funes.world

https://funes.world
1•notknifescience•1m ago•0 comments

Chinese cars are beating European tariffs

https://www.economist.com/graphic-detail/2025/12/18/how-chinese-cars-are-beating-european-tariffs
1•andsoitis•2m ago•1 comments

Limited Edition Byte Magazine Cover Illustration Prints

https://tinney.net/product-category/limited-edition-robert-tinney-byte-cover-illustration-prints
1•adityaathalye•2m ago•1 comments

Conditions in the Intel 8087 floating-point chip's microcode

https://www.righto.com/2025/12/8087-microcode-conditions.html
1•elpocko•3m ago•0 comments

Radio observations find no evidence of technosignature from 3I/ATLAS

https://phys.org/news/2025-12-sensitive-radio-date-evidence-technosignature.html
1•geox•3m ago•0 comments

The Data Center as a Computer: Designing Warehouse-Scale Machines 2026 ed. [pdf]

https://link.springer.com/book/10.1007/978-3-031-99489-0
1•tanelpoder•3m ago•0 comments

Bison return to Illinois' Kane County after 200 years

https://phys.org/news/2025-12-bison-illinois-kane-county-years.html
2•bikenaga•6m ago•1 comments

Lottocracy: Democracy Without Elections

https://www.lottocracy.org
2•egghack•8m ago•0 comments

The Problem with Letting AI Do the Grunt Work

https://www.theatlantic.com/ideas/2025/12/ai-entry-level-creative-jobs/685297/
1•maxutility•9m ago•0 comments

Got fired today because of AI. It's coming, whether AI is slop or not

https://old.reddit.com/r/webdev/comments/1py8ruu/got_fired_today_because_of_ai_its_coming_whether/
1•SunshineTheCat•9m ago•0 comments

Ask HN: How are you using Nvidia cards on Linux with its VRAM issues?

1•nickjj•10m ago•0 comments

Show HN: Slide notes visible only to you during screen sharing

https://cuecard.dev
1•thisisnsh•10m ago•0 comments

My Couples Retreat with 3 AI Chatbots and the Humans Who Love Them

https://www.wired.com/story/couples-retreat-with-3-ai-chatbots-and-humans-who-love-them-replika-n...
1•naves•11m ago•0 comments

Wayland is flawed at its core and the community needs to talk about it

https://old.reddit.com/r/linux/comments/1pxectw/wayland_is_flawed_at_its_core_and_the_community/
2•tannhaeuser•12m ago•0 comments

StackChan: The Cute, AI-Powered Open-Source Desktop Robot

https://shop.m5stack.com/pages/m5stack
1•feynmanquest•14m ago•0 comments

WeDLM Reconciling Diff Lang Models with Std Causal Attention for Fast Inference

https://github.com/Tencent/WeDLM
1•LoveMortuus•14m ago•0 comments

Tesla Compiles Downbeat Average Estimates for Its Vehicle Sales

https://www.bloomberg.com/news/articles/2025-12-30/tesla-tsla-compiles-downbeat-average-estimates...
1•wslh•14m ago•1 comments

Off-grid boat telemetry with Meshtastic

https://signalk.org/2025/signalk-meshtastic/
1•bergie•14m ago•0 comments

We investigated why chatbots often feel "robotic"

1•helain•18m ago•0 comments

I Sharted and Understood the Helpful Content Update

https://wskpf.com/takes/i-sharted-and-understood-the-helpful-content-update
3•amosWeiskopf•18m ago•0 comments

Show HN: I remade my website in the Sith Lord Theme and I hope it's fun

https://cookie.engineer/index.html
3•cookiengineer•20m ago•0 comments

Electrolysis can solve one of our biggest contamination problems

https://ethz.ch/en/news-and-events/eth-news/news/2025/11/electrolysis-can-solve-one-of-our-bigges...
3•PaulHoule•20m ago•0 comments

Hiring a Series A/B CTO: Attributes That Matter

https://shailppatel.com/2025/12/30/hiring-a-cto-part-2-attributes-that-actually-matter/
3•sna1l•21m ago•0 comments

Monolith OS Devblog for December 2025

https://monolith-project.org/blog/december-2025-update/
2•mrunix•23m ago•0 comments

Party of One for Code Review

https://tidyfirst.substack.com/p/party-of-one-for-code-review
2•mpweiher•26m ago•0 comments

Show HN: MP3 Editor for Bulk Processing

https://github.com/cutandjoin/Cjam/releases/tag/v2410
2•cutandjoin•27m ago•0 comments

Every Hacker News Item Organized by Day

https://staticnews.dosaygo.com
2•keepamovin•30m ago•0 comments

Gitdb.net

https://gitdb.net/
4•jamboy•31m ago•0 comments

An AI generated wiki for exploring GitHub projects

https://deepwiki.com/ethereum/go-ethereum
2•cloutiertyler•31m ago•1 comments

Nvidia insists it isn't Enron

https://www.theguardian.com/technology/2025/dec/28/nvidia-insists-it-isnt-enron-but-its-ai-deals-...
3•krupan•32m ago•0 comments