Show HN: USST – A protocol to reduce LLM context redundancy by 98.5%

https://gist.github.com/maverick069/06d6f6e89947d621b4905765245a220a
2•mgopanna•58m ago
I’ve been working on a primitive called User-Segmented Session Tokens (USST).

The Problem: Currently, if a teacher (or lead dev) wants 50 students (or junior devs) to use an LLM with a specific, deep context (e.g., a 50-page curriculum or a complex repo), all 50 users have to re-upload and re-tokenize that context. It’s redundant, expensive, and forces everyone to have a high-tier subscription.

The Solution: USST allows a "Sponsor" (authenticated, paid account) to run a Deep Research session once and mint a signed Context Token. Downstream users (anonymous/free tier) pass this token in their prompt. The provider loads the pre-computed KV cache/context state without re-processing the original tokens.
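The mint-and-redeem flow described above can be sketched in a few lines. This is a minimal illustration, not the USST spec: the token layout, the HMAC-SHA256 signature, the `PROVIDER_KEY` constant, and the function names `mint_context_token` and `redeem_context_token` are all assumptions made for the example. The payload carries only a digest of the context, standing in for a handle to the provider's cached state.

```python
import base64
import hashlib
import hmac
import json
import time
from typing import Optional

# Hypothetical provider-side signing key (assumption for this sketch only).
PROVIDER_KEY = b"demo-secret-not-for-production"


def _b64(data: bytes) -> str:
    return base64.urlsafe_b64encode(data).decode("ascii")


def mint_context_token(sponsor_id: str, context: str,
                       ttl_seconds: int = 3600,
                       now: Optional[float] = None) -> str:
    """Sponsor side: bind a context digest to a sponsor and an expiry, then sign."""
    issued = time.time() if now is None else now
    payload = {
        "sponsor": sponsor_id,
        # Digest stands in for a handle to the pre-computed context state.
        "ctx": hashlib.sha256(context.encode("utf-8")).hexdigest(),
        "exp": int(issued) + ttl_seconds,
    }
    body = json.dumps(payload, sort_keys=True, separators=(",", ":")).encode("utf-8")
    sig = hmac.new(PROVIDER_KEY, body, hashlib.sha256).digest()
    return f"{_b64(body)}.{_b64(sig)}"


def redeem_context_token(token: str, now: Optional[float] = None) -> dict:
    """Provider side: check signature and expiry, then return the payload."""
    try:
        body_b64, sig_b64 = token.split(".")
        body = base64.urlsafe_b64decode(body_b64)
        sig = base64.urlsafe_b64decode(sig_b64)
    except ValueError as exc:  # also covers binascii.Error
        raise ValueError("malformed token") from exc
    expected = hmac.new(PROVIDER_KEY, body, hashlib.sha256).digest()
    if not hmac.compare_digest(sig, expected):
        raise ValueError("bad signature")
    payload = json.loads(body)
    current = time.time() if now is None else now
    if current >= payload["exp"]:
        raise ValueError("token expired")
    return payload


if __name__ == "__main__":
    token = mint_context_token("teacher-1", "50-page curriculum text")
    print(redeem_context_token(token)["sponsor"])
```

The expiry field is one simple lever against token abuse. A per-token usage quota or revocation list would be a natural extension, but neither is shown here.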

    Decouples payment from utility: Sponsor pays the heavy compute; Users pay the inference.

    Privacy: Users don't need the Sponsor's credentials, just the token.

    Efficiency: Removes the "Linear Bleed" of context re-computation.

I wrote up the full architecture and the "why" here: https://medium.com/@madhusudan.gopanna/the-8-6-billion-oppor...

The Protocol Spec / Repo is the main link above.

Would love feedback on the abuse vectors and how this fits with current provider caching (like Anthropic’s prompt caching).

Comments

mgopanna•50m ago
I wanted to share the economic model that drove me to build this. I call it the "Redundancy Tax."

When you look at the hidden costs of "Per-Seat" architecture in an education setting, the numbers get large very quickly. I broke down the cost of redundant context re-processing:

The Baseline:

    Target: ~20M connected classrooms (secondary/tertiary globally).

    Volume: 1,000 high-value interactions per classroom per year (a conservative estimate for active AI tutoring).

    The Waste: Re-processing a 35k context window for every single student query instead of reusing the cached state.

The USST Math: By shifting from "Raw Mode" (everyone tokenizes everything) to "USST Mode" (Sponsor tokenizes once, students reuse):

    We see a ~98.5% reduction in incremental token load.

    That saves roughly $0.432 per interaction in compute costs.

    $0.432 × 1,000 interactions × 20M classrooms ≈ $8.6 billion annually.

The Grid Impact: Beyond the money, this is an infrastructure stability issue. A simultaneous classroom start (e.g., 10:05 AM) currently looks like a 1 Megawatt spike on the grid. With shared context tokens, that drops to a 15 Kilowatt blip (just the inference delta).

We don't need 100x more chips to solve this; we just need a protocol that stops treating every user session as a blank slate.
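The figures in this comment can be checked with a few lines of arithmetic. The sketch below uses only the numbers quoted above. The function names are illustrative, and the implied per-token price is derived from the stated inputs rather than taken from any provider's price list. It also assumes the 1,000 interactions are counted per classroom, which is what the multiplication implies.

```python
# Inputs as stated in the comment above.
CONTEXT_TOKENS = 35_000             # context re-processed per query in "Raw Mode"
REDUCTION = 0.985                   # claimed cut in incremental token load
SAVING_PER_INTERACTION_USD = 0.432
INTERACTIONS_PER_YEAR = 1_000       # assumed to be per classroom
CLASSROOMS = 20_000_000
PEAK_RAW_KW = 1_000                 # the quoted 1 MW classroom-start spike


def annual_saving_usd() -> float:
    """Per-interaction saving scaled to every classroom for a year."""
    return SAVING_PER_INTERACTION_USD * INTERACTIONS_PER_YEAR * CLASSROOMS


def residual_tokens() -> float:
    """Tokens still processed per query after the claimed reduction."""
    return CONTEXT_TOKENS * (1 - REDUCTION)


def peak_usst_kw() -> float:
    """Peak load if power scales linearly with token load (an assumption)."""
    return PEAK_RAW_KW * (1 - REDUCTION)


def implied_usd_per_million_tokens() -> float:
    """Token price implied by the stated saving and the tokens avoided."""
    tokens_avoided = CONTEXT_TOKENS * REDUCTION
    return SAVING_PER_INTERACTION_USD / tokens_avoided * 1_000_000


if __name__ == "__main__":
    print(f"annual saving: ${annual_saving_usd() / 1e9:.2f} billion")
    print(f"residual tokens per query: {residual_tokens():.0f}")
    print(f"peak load in USST mode: {peak_usst_kw():.0f} kW")
    print(f"implied price: ${implied_usd_per_million_tokens():.2f} per million tokens")
```

With these inputs the saving comes to about $8.64 billion a year, the residual load is about 525 tokens per query, and the 15 kW figure is the same 1.5% remainder applied to the 1 MW spike. The implied rate of roughly $12.5 per million tokens is the assumption most worth checking against real provider pricing.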
