frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

2026 will be the year of on-device agents

1•mycelial_ali•2h ago
I have been building a local AI memory layer for a while, and the same problem shows up every time you try to make an assistant feel stateful.

The agent is impressive in the moment, then it forgets. Or it remembers the wrong thing and hardens it into a permanent belief. A one off comment becomes identity. A stray sentence becomes a durable trait. That is not a model quality issue. It is a state management issue.

Most people talk about memory as “more context.” Bigger windows, more retrieval, more prompt stuffing. That is fine for chatbots. Agents are different. Agents plan, execute, update beliefs, and come back tomorrow. Once you cross that line, memory stops being a feature and becomes infrastructure.

The mental model I keep coming back to is an operating system.

1.What gets stored 2.What gets compressed 3.What gets promoted from “maybe” to “true” 4.What decays 5.What gets deleted 6.What should never become durable memory in the first place

If you look at what most memory stacks do today, the pipeline is basically the same everywhere.

Capture the interaction. Summarize or extract. Embed. Store vectors and metadata. Retrieve. Inject into the prompt. Write back new memories.

That loop is not inherently wrong. The bigger issue is where the loop runs. In a lot of real deployments, the most sensitive parts happen outside the user’s environment. Raw interactions get shipped out early, before you have minimized or redacted anything, and before you have decided what should become durable.

When memory goes cloud first, the security model gets messy in a very specific way. Memory tends to multiply across systems. One interaction becomes raw snippets, summaries, embeddings, metadata, and retrieval traces. Even if each artifact feels harmless alone, the combined system can reconstruct a person’s history with uncomfortable fidelity.

Then there is the trust boundary problem. If retrieved memories are treated as trusted context, retrieval becomes a place where prompt injection and poisoning can persist. A bad instruction that gets written into memory does not just affect one response. It can keep resurfacing later as “truth” unless you have governance that looks like validation, quarantine, deletion, and audit.

Centralized memory also becomes a high value target. It is not just user data, it is organized intent and preference, indexed for search. That is exactly what attackers want.

And even if you ignore security, cloud introduces latency coupling. If your agent reads and writes memory constantly, you are paying a network tax on the most frequent operations in the system.

This is why I think the edge is not a constraint. It is the point. If memory is identity, identity should not default to leaving the device.

There is also a hardware angle that matters as agents become more persistent. CXL is interesting here because it enables memory pooling. Instead of each machine being an island, memory can be disaggregated and allocated as a shared resource. That does not magically create infinite context, but it does push the stack toward treating agent state as a real managed substrate, not just tokens.

My bet for 2026 is simple. The winning agent architectures will separate cognition from maintenance. Use smaller local models for the repetitive memory work like summarization, extraction, tagging, redundancy checks, and promotion decisions. Reserve larger models for the rare moments that need heavy reasoning. Keep durable state on disk so it survives restarts, can be inspected, and can actually be deleted.

Curious what others are seeing. For people building agents, what is the biggest blocker to running memory locally today: model quality, tooling, deployment, evaluation, or something else?

Comments

4d4m•1h ago
Very excited about getting local models. There are too many concerns and bad actor possibilities with centralized models.

A message-driven DAG execution engine

https://github.com/rodmena-limited/stabilize
2•rodmena•1m ago•0 comments

Red Teaming with RL: Exploiting Tinker API for Harmful RL on 235B Model

https://huggingface.co/blog/georgefen/red-teaming-with-rl
1•gmays•1m ago•0 comments

RICO Lawsuit Accuses Drake of Fake Streams

https://www.digitalmusicnews.com/2026/01/02/drake-stake-lawsuit/
1•geox•11m ago•0 comments

bcherny's Claude Code Setup

https://twitter.com/i/status/2007179832300581177
2•doppp•17m ago•1 comments

Open-source CLI tool for generating licenses for your repositories

https://github.com/anth0nycodes/license-generator
1•anth0nycodes•21m ago•1 comments

High-end AirPods Pro 3 adding cameras for Apple Intelligence features and more

https://9to5mac.com/2026/01/02/another-airpods-pro-3-model-is-coming-with-one-rumored-upgrade/
1•wj•21m ago•0 comments

Becoming a Centenarian

https://www.newyorker.com/magazine/2025/12/22/becoming-a-centenarian
2•mrjaeger•26m ago•0 comments

The Genius Whose Simple Invention Saved Us from Shame at the Gas Station

https://www.wsj.com/business/autos/ford-gas-arrow-inventor-jim-moylan-6b2ef066
1•CaliforniaKarl•27m ago•1 comments

Adventure 751 (1980)

https://bluerenga.blog/2026/01/01/adventure-751-1980/
1•quuxplusone•30m ago•0 comments

Show HN: Open-source CSPM, DSPM, CIEM, and vulnerability management

https://github.com/clay-good/mantissa-stance
1•hireclay•31m ago•0 comments

Terry Tao: "LLMs are simpler than you think"

https://www.youtube.com/watch?v=ukpCHo5v-Gc
3•Ianjit•33m ago•2 comments

Show HN: Office2PDF - Official SDKs for Node.js (Python/Go/Java Coming)

https://github.com/politehq/office2pdf-sdks
1•alexpham14•40m ago•0 comments

Why your brain needs everyday rituals

https://bigthink.com/smart-skills/why-your-brain-needs-everyday-rituals/
1•thunderbong•40m ago•0 comments

Logistic Regression, the Sigmoid, and Log Loss

https://mateolafalce.github.io/2026/Logistic%20Regression%2C%20the%20Sigmoid%2C%20and%20Log%20Los...
1•lafalce•42m ago•0 comments

ChromeOS Flex resurrects a >12 year old laptop

https://konaraddi.com/writing/2026/2026-01-01-chromeos-flex/
1•konaraddi•48m ago•0 comments

Marathon OS: A gesture-based mobile shell and Linux system inspired by BB10

https://marathonos.xyz/
1•PaulHoule•1h ago•0 comments

Show HN: TCP chat server written in C# and .NET 9, used in the terminal

https://github.com/Sieep-Coding/simple-chat-csharp
2•sieep•1h ago•0 comments

The Kimwolf Botnet Is Stalking Your Local Network

https://krebsonsecurity.com/2026/01/the-kimwolf-botnet-is-stalking-your-local-network/
5•SamValYlieRcHE2•1h ago•0 comments

Panda Diplomacy

https://en.wikipedia.org/wiki/Panda_diplomacy
3•sieep•1h ago•0 comments

Schwarzman, OpenAI's Brockman Boost $102M Trump War Chest

https://finance.yahoo.com/news/schwarzman-openai-brockman-boost-102-151056084.html
8•dougb5•1h ago•1 comments

RubyEvents.org 2025 Wrapped – a look back at the Ruby community's year

https://www.rubyevents.org/wrapped
2•marcoroth•1h ago•0 comments

"I taught an octopus piano" [video]

https://www.youtube.com/shorts/rXM6_AiisB4
1•trelane•1h ago•1 comments

Tech Startups Are Handing Out Free Nicotine Pouches to Boost Productivity

https://www.wsj.com/tech/tech-startups-are-handing-out-free-nicotine-pouches-to-boost-productivit...
3•mattas•1h ago•2 comments

Ask HN: Transition Out of SWE and Regret It

2•t4367•1h ago•1 comments

2026 Delights and Not-So-Delightfuls

https://jakesimonds.leaflet.pub/3mbi4mcd3uk2v
1•jakesimonds•1h ago•0 comments

Einstein Probe detects an X-ray flare from nearby star

https://phys.org/news/2025-12-einstein-probe-ray-flare-nearby.html
5•wglb•1h ago•1 comments

Could OpenAI make a move on Pinterest?

https://seekingalpha.com/news/4536354-could-openai-make-a-move-on-pinterest
1•randycupertino•1h ago•1 comments

Dotnet Source Build Fails in 2026 Due To Date Overflow

https://github.com/dotnet/dotnet/issues/4037
3•csmantle•1h ago•0 comments

Year end sees record borrowing from Fed's standing repo operation

https://www.reuters.com/business/finance/banks-tap-record-liquidity-new-york-feds-standing-repo-f...
4•JumpCrisscross•1h ago•0 comments

A Basic Just-In-Time Compiler (2015)

https://nullprogram.com/blog/2015/03/19/
22•ibobev•1h ago•0 comments