frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Prompt-caching – auto-injects Anthropic cache breakpoints (90% token savings)

https://prompt-caching.ai/
38•ermis•1h ago

Comments

spiderfarmer•1h ago
Will this work for Cowork as well?
stingraycharles•1h ago
This is not at all an MCP server you want to use with a regular tool, as this is about low level context window management. Tbh it’s really trivial to do this, and I have no idea why OP decided to make an MCP server for this as it’s completely useless for that.

As a matter of fact, i think this is not a problem at all as Anthropic makes it extremely easy to cache stuff; you just set your preferred cache level on the last message, and Anthropic will automatically cache it under the hood. Every distinct message is another “cache” point, eg they first compute the hash of all messages, if not found, compute the hash of all messages - 1, etc.

It’s really a non problem.

ermis•40m ago
No. Claude.ai is a consumer product — you have no access to the API layer underneath it. cache_control is an API-level feature only. This plugin works exclusively when you're making direct Anthropic API calls, either through the SDK in your own code or through MCP-compatible clients like Claude Code, Cursor, Windsurf, etc.
somesnm•1h ago
Hasn't this been largely solved by auto-caching introduced recently by Anthropic, where you pass "cache_control": {"type": "ephemeral"} in your request and it puts breakpoints automatically? https://platform.claude.com/docs/en/build-with-claude/prompt...
stingraycharles•56m ago
Yes, it has, this is a non-problem, and even if it was a problem, an MCP server would most definitely be one of the worst ways to fix it.
philipp-gayret•55m ago
Looking at my own usage with claude code out of the box and nothing special around caching set up. For this month according to ccusage I have in tokens 0.2M input, 0.6M output, 10M cache create, 311M cache read for 322M total tokens. Seems to me that it caches out of the box quite heavily, but if I can trim my usage somehow with these kind of tools I'd love to know.
stingraycharles•38m ago
This is not about caching things for stuff that others built, it’s solely to modify code that you’re writing that will use Anthropic’s API endpoints.
gostsamo•32m ago
It is answered in the FAQ.
mijoharas•56m ago
I don't understand, Claude code already has automatic prompt caching built in.[0] How does this change things?

[0] https://code.claude.com/docs/en/costs

katspaugh•47m ago
> This plugin is built for developers building their own applications with the Anthropic API.

> Important note for Claude Code users: Claude Code already handles prompt caching automatically for its own API calls — system prompts, tool definitions, and conversation history are cached out of the box.

Source: their GitHub

fschuett•31m ago
Slightly off-topic, but I recently tested some tool and it turns out Opus is far cheaper than Sonnet, because it produces way less output tokens and those are what's expensive. It's also much slower than Opus (I did 9 runs to compare Haiku, Sonnet and Opus on the same problem). I also thought "oh, Sonnet is more light-weight and cheaper than Opus", no, that's actually just marketing.
CGamesPlay•17m ago
Claude subscriptions (strangely) have a Sonnet limit which is lower than the general model limit. Using Sonnet counts against both limits, using Opus only the general limit. So the subscriptions are discouraging Sonnet use as well.
adi_pradhan•26m ago
This is applicable only to the API from what i understand. Since claude code already caches quite aggressively (try npx ccusage)

Also the anthropic API did already introduce prompt-caching https://platform.claude.com/docs/en/build-with-claude/prompt...

What is new here?

numlocked•25m ago
As per its own FAQ this plugin is out of date and doesn’t actually do anything incremental re:caching:

> "Hasn't Anthropic's new auto-caching feature solved this?"

> Largely, yes — Anthropic's automatic caching (passing "cache_control": {"type": "ephemeral"} at the top level) handles breakpoint placement automatically now. This plugin predates that feature and originally filled that gap.

orphea•22m ago
I don't understand and I'm curious, why a dead on arrival open source tool needs a separate domain?

  Domain Name: prompt-caching.ai
  Updated Date: 2026-03-12T20:31:44Z
  Creation Date: 2026-03-12T20:27:35Z
  Registry Expiry Date: 2028-03-12T20:27:35Z
derrida•10m ago
Is it perhaps because this is for claude code but there's other tools that use anthropics api like custom agents? (some i prefer to use than claude code - e.g sketch.dev what is now called shelley at exe.dev) perhaps?
Slav_fixflex•9m ago
Interesting – I've been using Claude heavily for building projects without writing code myself. Token costs add up fast, anything that reduces that is welcome. Has anyone tested this in production workflows?

Decisions, extracted knowledge, handoff context: reasoning data infrastructure

https://sderosiaux.substack.com/p/ai-agents-produce-a-new-kind-of-data
1•chtefi•1m ago•0 comments

Show HN: Build Your Own OpenClaw – A step by step guide

https://github.com/czl9707/build-your-own-openclaw
1•zane__chen•2m ago•0 comments

JavaScript Minification Benchmarks

https://github.com/privatenumber/minification-benchmarks
1•javatuts•2m ago•0 comments

Readme, Don't Agents.md Me

https://www.joshbeckman.org/blog/practicing/readme-dont-agentsmd-me
1•bckmn•2m ago•0 comments

Twenty years of Amazon S3 and building what's next

https://aws.amazon.com/blogs/aws/twenty-years-of-amazon-s3-and-building-whats-next/
1•soheilpro•2m ago•0 comments

Cursor Cloud Telegram Connector

https://github.com/tb5z035i/cursor-tg
1•javatuts•2m ago•0 comments

Top 100 Gen AI Consumer Apps – 6th Edition

https://a16z.com/100-gen-ai-apps-6/
1•bookofjoe•2m ago•0 comments

Openreach: Fiber can sniff out leaky water pipes – if anyone bothers fixing them

https://www.theregister.com/2026/03/13/openreach_fiber_water_leaks/
1•Brajeshwar•2m ago•0 comments

Backblaze Now Serving 314T Digits of Pi

https://www.backblaze.com/blog/backblaze-now-serving-314-trillion-digits-of-pi/
1•soheilpro•3m ago•0 comments

The Dirty, Dystopian World of AI Data Centers

https://www.theatlantic.com/magazine/2026/04/ai-data-centers-energy-demands/686064/
3•fortran77•6m ago•1 comments

US issues 30-day sanctions waiver for purchase of Russian oil at sea

https://www.reuters.com/business/energy/us-issues-new-russia-related-general-license-oil-treasury...
2•geox•10m ago•0 comments

Show HN: WSG - turn project activity into client-ready reports

https://daniel043apps.gumroad.com/l/wgs
1•ThunderDanOAE•10m ago•1 comments

Nanny state discovers Linux, demands it check kids' IDs before booting

https://www.theregister.com/2026/03/13/opinion_os_verification/
4•jjgreen•12m ago•1 comments

Production-ready Agent Skills, 17 agents, and a orchestration protocol

https://alirezarezvani.github.io/claude-skills/
2•jungard•12m ago•1 comments

Show HN: Golf – A browser version of the classic card game

https://www.golfingwithcards.com/
1•beagle_byte•12m ago•1 comments

SmolClaw, a Microvm to Run Picoclaw

https://github.com/NetBSDfr/smolBSD/tree/main/service/clawd
1•iMil•13m ago•0 comments

NYC plans new AI-focused school as rules for the tech are delayed

https://gothamist.com/news/nyc-plans-new-ai-focused-school-as-rules-for-the-tech-are-delayed
1•righthand•14m ago•0 comments

Bonus – Clean Room as a Service

https://nickvidal.github.io/bonus/
1•nickvidal•14m ago•0 comments

Show HN: DashClaw – intercept and audit AI agent decisions before they execute

https://github.com/ucsandman/DashClaw
1•ucsandman•14m ago•1 comments

Show HN: StatusDrop – Status page and live widget for your SaaS in 30 seconds

https://statusdrop.dev
1•razvanmac•18m ago•0 comments

How the classic computer game Doom became a tool for science

https://www.nature.com/articles/d41586-026-00813-4
1•sohkamyung•18m ago•0 comments

Show HN: Chat.nvim v1.4.0 – OpenClaw-like AI assistant for Neovim

https://github.com/wsdjeg/chat.nvim/releases/tag/v1.4.0
2•wsdjeg•18m ago•0 comments

In search of Banksy, Reuters found the artist took on a new identity

https://www.reuters.com/investigates/special-report/global-art-banksy/
1•speckx•19m ago•0 comments

E2E encrypted messaging on Instagram will no longer be supported after 8 May

https://help.instagram.com/491565145294150
2•mindracer•19m ago•0 comments

Drain3: A robust streaming log template miner based on the Drain algorithm

https://github.com/logpai/Drain3
2•kvaranasi_•19m ago•1 comments

301M Records Exposed: The HIPAA Breach Epidemic

https://ciphercue.com/blog/hipaa-breach-epidemic-301-million-records
4•adulion•22m ago•0 comments

Tell HN: Apple Developer Agreement AI Updates

3•alexfromapex•22m ago•0 comments

Show HN: I made an open source/API/aiagent first ride sharing app like-Uber grab

https://github.com/sawirricardo/openjek.com
1•sawirricardo•22m ago•0 comments

"Open" Data/AI Platforms for Increasingly Specialized Compute Engines

https://www.hopsworks.ai/post/data-ai-platforms-should-be-open-for-use-by-increasingly-specialize...
2•jamesblonde•23m ago•0 comments

Tendem: Outsource Tasks to Hybrid AI and Human Workflow

https://tendem.ai/
1•MrBuddyCasino•24m ago•0 comments