frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Autocache – Cut Claude API costs 90% (for n8n, Flowise, etc.)

https://github.com/montevive/autocache
1•jmrobles•2h ago

Comments

jmrobles•2h ago
Hi HN! I built Autocache, an intelligent proxy for the Anthropic Claude API that automatically reduces costs by up to 90% and latency by up to 85%.

  **The Impact:**
  If you're spending $100/day on Claude API calls with system prompts and tools, Autocache can reduce that to ~$10/day with zero code changes. For a 1000-token system prompt reused across requests, you pay 1.25× once to cache it, then 0.1× on every
  subsequent request.

  **The Problem:**
  Anthropic's Prompt Caching requires manually placing cache breakpoints in your API requests. For applications like n8n workflows, Flowise chatbots, or any complex integration with system prompts, tools, and conversation history, you either can't
  access the request structure to optimize it, or doing so manually is extremely tedious.

  **How Autocache Works:**
  It's a transparent drop-in proxy. For each request, it:
  1. Analyzes token counts across system prompts, tools, and message content
  2. Calculates ROI scores for potential cache breakpoints (write costs vs. read savings)
  3. Automatically injects cache-control fields at optimal positions
  4. Returns X-Autocache-* headers showing projected savings and break-even points

  **Perfect for:**
  - n8n AI workflows (change base URL in Claude node)
  - Flowise chatbots (configure HTTP endpoint)
  - LangChain/LlamaIndex apps
  - Custom Claude integrations
  - Any app where you can't manually optimize prompts

  **Try it in 30 seconds:**
  ```bash
  docker run -d -p 8080:8080 -e ANTHROPIC_API_KEY=sk-ant-... ghcr.io/montevive/autocache:latest

  Point your app to http://localhost:8080/v1/messages – check response headers for actual savings metrics on your workload.

  GitHub: https://github.com/montevive/autocache

  I've tested this with n8n workflows and seen $200→$25/day cost reductions on production workloads. The ROI algorithm uses conservative estimates, but I'd love feedback on edge cases or strategies I haven't considered.

  Tech: Go, ~29MB Docker image, multi-arch, MIT licensed.

What a Data Center Is

https://andymasley.substack.com/p/what-a-data-center-is
1•andymasley•2m ago•0 comments

If you use Claude Code with Codex or Cursor: ln -s AGENTS.md CLAUDE.md

https://coding-with-ai.dev/posts/sync-claude-code-codex-cursor-memory/
1•codeclimber•5m ago•0 comments

Hosting a static site on an original Raspberry Pi [Alpine Linux "diskless" mode]

https://cablespaghetti.dev/hosting-a-static-site-on-an-original-raspberry-pi.html
1•indigodaddy•8m ago•0 comments

Insurers balk at paying out settlements for claims against AI firms

https://arstechnica.com/ai/2025/10/insurers-balk-at-paying-out-huge-settlements-for-claims-agains...
1•worik•9m ago•0 comments

The great butterfly heist: Collector stole 1000s from Australian museums

https://www.theguardian.com/global/2025/oct/04/great-butterfly-heist-how-collector-stole-thousand...
1•mhb•10m ago•0 comments

Ask HN: Anyone Building an AI Airtable?

1•matt3D•10m ago•0 comments

I have a GPS bike computer

https://utcc.utoronto.ca/~cks/space/blog/tech/WhyIHaveGPSBikeComputer
1•speckx•10m ago•0 comments

Ask HN: Did Twitter's 280-character limit improve discourse?

1•cryptography•13m ago•0 comments

Most of the world has recently set all-time heat records

https://www.theclimatebrink.com/p/most-of-the-world-has-recently-set
3•littlexsparkee•16m ago•2 comments

Julia 1.12 Highlights

https://julialang.org/blog/2025/10/julia-1.12-highlights/
5•pella•16m ago•2 comments

The $40k a year school where AI shapes every lesson, without teachers

https://www.cbsnews.com/news/alpha-school-artificial-intelligence/
2•paulpauper•17m ago•0 comments

Software is Eating Labor [video]

https://www.youtube.com/watch?v=dhyhR4Bzc0I
1•pppoe•17m ago•0 comments

SBI Crypto Reportedly Hit by $21M Hack with Suspected DPRK Links

https://www.coindesk.com/business/2025/10/01/sbi-crypto-reportedly-hit-by-usd21m-hack-with-suspec...
1•paulpauper•17m ago•0 comments

Tuitka is a TUI wrapper that leverages Nuitka to compile Python applications

https://github.com/Nuitka/Tuitka
1•willm•18m ago•0 comments

A Theoretical Framework for Studying the Phenomenon of Gaslighting

https://journals.sagepub.com/doi/10.1177/10888683251342291
1•PaulHoule•21m ago•0 comments

Explainer: Why have metal–organic frameworks won the Nobel Prize in chemistry

https://www.chemistryworld.com/news/explainer-why-have-metal-organic-frameworks-won-the-nobel-pri...
2•rolph•21m ago•0 comments

Ask HN: What's the best live translation app for voice?

1•transitivebs•21m ago•0 comments

I Know What You Did Last Summer (With Val Town)

https://www.raymondcamden.com/2025/10/08/i-know-what-you-did-last-summer-with-val-town
1•stevekrouse•21m ago•0 comments

The Bases of Antenna Towers [video]

https://www.youtube.com/watch?v=3nDdLiXS5wk
2•skibz•21m ago•0 comments

Schools' Embrace of AI Connected to Increased Risks to Students

https://cdt.org/insights/hand-in-hand-schools-embrace-of-ai-connected-to-increased-risks-to-stude...
2•CharlesW•22m ago•0 comments

Reverse Engineering keyboard firmware with Ghidra

https://blog.usedbytes.com/2020/03/reverse-engineering-keyboard-firmware-with-ghidra-part-1/
3•o4c•23m ago•0 comments

How the Fed would respond to AI-pocalypse

https://www.axios.com/2025/10/08/ai-fed-interest-rates
1•c420•24m ago•0 comments

6 months and $485: My journey into building with AI

https://harshdeepgupta.substack.com/p/6-months-and-485-my-journey-into
1•thedeep_mind•24m ago•0 comments

Google just cut off 90% of the internet from AI – no one's talking about it

https://www.reddit.com/r/ArtificialInteligence/s/spZ9qh0Ia1
5•alexgotoi•24m ago•1 comments

Show HN: Spica – OSS Tool to Generate Infinite Length Sora-2 Videos

https://spica.kuber.studio/
1•kuberwastaken•24m ago•0 comments

The Museum of Soviet Arcade Games (2010)

https://web.archive.org/web/20100915164732/http://adangerousbusiness.com/2010/01/05/the-museum-of...
1•breppp•25m ago•1 comments

Show HN: Quant, the AI stock trading analyst

3•mceoin•26m ago•0 comments

AI gets more 'meh' as you get to know it better

https://www.theregister.com/2025/10/08/more_researchers_use_ai_few_confident/
6•rntn•26m ago•1 comments

Show HN: Twoway, a Go package for HPKE encrypted request-response flows

https://github.com/confidentsecurity/twoway
3•1268•27m ago•0 comments

Show HN: Vincent – A delegation framework for wallet automation

https://docs.heyvincent.ai/concepts/introduction/about
3•glitch003•27m ago•0 comments