frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: We cut >60% of tokens from agentic tasks by removing repeated context

https://parcle.ai/
1•longtermop•13h ago
Every agentic system I see has the same hidden tax: the model keeps rereading the same context.

Tickets, Slack threads, docs, customer history, database notes, runbooks, logs, prior decisions. You can cache static prefixes, route to cheaper models, or set team budgets, but none of those fixes the underlying behavior: agents start most tasks trying to re-explore everything.

We built Parcle as a shared memory layer for AI agents. It ingests operational context, indexes what happened, and lets agents retrieve a small, relevant memory set for the next step instead of pasting everything back into the prompt - or worse, letting the agent go explore on it's own and burning tokens.

We started tracking our tokens consumed on tasks with and without our memory layer just with indexing of local files. In our deployments/evals, the biggest reduction we’ve seen is up to 70% lower token spend on agentic tasks, with roughly 2x faster task completion. The median was ~30% less tokens spent. The biggest savings often come from data and context-heavy workflows; when the agent needs to retrieve data and context from multiple locations and sources. The best cases so far are support, ops, research, sales, and finance workflows where the agent otherwise reloads the same account/workflow/history context again and again.

Why I think this matters now:

Pylon’s AI cost post made us ask the question:

How much are companies paying because their agents keep looking for the same context? Is this a hidden tax that memory could solve?

We built Parcle to make agents remember. The surprise was that memory does not just make agents more useful. It also cuts down on tokens consumed. Less tokens spent figuring where things are, and more time spent doing actually productive work.

- Anthropic says agents use about 4x more tokens than chat. We think this is an understatement, - OpenAI and Anthropic both have prompt caching because repeated prompt context is expensive, but caching mostly helps when the reusable content is stable enough to hit the cache. But this doesn't resolve the fact that prompt caching is forfeited after 5min-15mins of inactivity. - “Lost in the Middle” and Chroma’s “context rot” work both point at the same issue: more context is not the same thing as usable memory. - The context-engineering crowd seems to be converging on this: the hard part is deciding what the model should see at each step.

Parcle is our attempt at making that operational: memory outside the model, selected into context only when useful.

I’d love feedback from people running real agents in production:

1. Where are your tokens actually going: repeated input context, tool traces, retries, output, evals, or something else? 2. Have prompt caching and model routing been enough? 3. What would you need to trust an external memory layer inside an agent loop?

A PostgreSQL Database for Every Agent: In-Database RAG, Graph, and Multitenancy

https://www.yugabyte.com/blog/a-postgresql-database-for-every-agent/
1•bug2man•2m ago•0 comments

Death by 1k Compromises: How to Tap into Founder Mode

https://tim.blog/2026/06/18/mark-pincus/
1•Michelangelo11•2m ago•0 comments

Where do migrants live, and where were they born?

https://ourworldindata.org/where-do-migrants-live-and-where-were-they-born
1•TMWNN•4m ago•1 comments

Making budget models punch above their weight with a smart Rust harness

https://yogthos.net/posts/2026-06-08-dirge-code.html
1•theanonymousone•6m ago•0 comments

Show HN: A small, crazy fast hybrid search engine written in Rust

https://github.com/deepbluedynamics/lume
1•kordlessagain•6m ago•1 comments

Why AI Is Incorrigibly Didactic

https://www.theatlantic.com/ideas/2026/06/ai-writing-style-literature/687536/
1•Michelangelo11•7m ago•0 comments

Third time's the charm for a row of faint galaxies without dark matter

https://news.yale.edu/2026/06/16/third-times-charm-row-faint-galaxies-without-dark-matter
1•gmays•7m ago•0 comments

Wiki Spy

https://neal.fun/wiki-spy/
3•irs•7m ago•0 comments

Pnpm temp paths broke lifecycle sockets

https://github.com/pnpm/pnpm/pull/12327
1•scarabsystems•8m ago•0 comments

Developers React to AI-Scented Blog Posts

https://writethatblog.substack.com/p/dev-reaction-to-ai-blog-posts
1•birdculture•8m ago•0 comments

Show HN: SDK for embedding zot coding agent in your Node.js applications

https://github.com/patriceckhart/zot-sdk-javascript
4•patriceckhart•10m ago•0 comments

Commodore Unveils Linux Powered Flip Phone

https://hackaday.com/2026/06/18/commodore-unveils-linux-powered-flip-phone/
1•kristianpaul•10m ago•0 comments

Apple's A12 and A13 Chips Facing New Unpatchable Exploit

https://www.macrumors.com/2026/06/18/a12-and-a13-chips-facing-exploit/
1•latexr•10m ago•0 comments

Is it time for a new Embedded Linux build system?

https://yoebuild.org/blog/time-for-a-new-build-system/
1•cbrake•11m ago•0 comments

Europe Scales Down AI Ambitions with Smaller Data Center Tender

https://www.bloomberg.com/news/articles/2026-06-18/europe-scales-down-ai-ambitions-with-smaller-d...
1•alecco•11m ago•1 comments

Show HN: Foglamp – Open-source o11y for AI SDK

https://www.foglamp.dev/
1•gustavofior•12m ago•0 comments

The founder of Craigslist has given away half a billion dollars

https://www.independent.co.uk/us/money/craigslist-multimillionaire-craig-newmark-b2980681.html
2•Tomte•12m ago•0 comments

Project Fetch: Phase Two

https://www.anthropic.com/research/project-fetch-phase-two
2•meetpateltech•12m ago•0 comments

Tech CEOs are breaking the law

https://kiesow.net/tech-ceos-are-breaking-the-law/
2•speckx•13m ago•0 comments

Show HN: Kelora – query and transform logs from the command line

https://github.com/dloss/kelora
1•dloss•13m ago•0 comments

Hotshot – tool to make screenshot –> Claude/copilot/goose faster

https://github.com/kubestellar/hotshot
1•andan02•13m ago•1 comments

ClojureWasm is a Clojure runtime written from scratch in Zig and Clojure, no JVM

https://github.com/clojurewasm/ClojureWasm
1•yogthos•13m ago•0 comments

Brexit tore apart European science – now the research rifts are healing

https://www.nature.com/articles/d41586-026-01841-w
1•Brajeshwar•14m ago•0 comments

Leaked Names Expose Billionaire Peter Thiel's 'Dialog' Society

https://bsky.app/profile/crimew.gay/post/3moejlixgvc2z
1•Sam6late•14m ago•1 comments

Show HN: Tamper-evident audit logs for LangChain/Crew AI agents

https://github.com/Providex-AI/rootsign
1•oabolade•15m ago•0 comments

Infrastructure Is the Source of Truth

https://engineeringforteams.substack.com/p/infrastructure-is-the-source-of-truth
1•nickstinemates•15m ago•0 comments

AI data centre would be 'one of Scotland's top polluters' if plans greenlit

https://www.thenational.scot/news/26205897.larbert-ai-data-centre-become-top-10-polluter/
4•pera•18m ago•0 comments

Google just lost one of its biggest AI names to OpenAI

https://www.businessinsider.com/google-veteran-founded-characterai-is-jumping-to-openai-talent-wa...
3•ksec•19m ago•0 comments

Openfoot Manager

https://openfootmanager.com/
3•skibz•20m ago•0 comments

Show HN: electron-expose – Make Electron IPC boring with TypeScript decorators

https://github.com/johnyrokita/electron-expose
2•jskull•23m ago•0 comments