frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Costile – open-source proxy, blocks AI API requests when budget is hit

https://costile.com/
1•Mkiza•3h ago
I got a surprise bill. Nothing catastrophic, but enough to make me dig into why — an agent had hit a retry loop and kept calling the API for hours. There's no way to set a hard cap on the Anthropic or OpenAI APIs. You can get an email after the fact, but nothing that actually stops requests mid-flight.

So I built a proxy. You swap one environment variable, it routes through Costile instead of calling Anthropic directly, and when you hit your daily or monthly limit it blocks further requests immediately. No SDK changes, no code refactor. Took me about a weekend. Currently supports Anthropic, with OpenAI next.

It's MIT licensed and self-hostable in about 5 minutes. Try the demo at costile.com if you want to poke at it.

I've got anomaly detection on the roadmap, but I'm second-guessing the scope — is surfacing cost spikes enough, or do people actually need to know why the agent went off the rails? The former is straightforward to build, the latter is a much harder problem. Curious where others would draw that line.

GitHub: https://github.com/Mkiza/ai-agent-cost

Comments

_zer0c00l_•3h ago
You can do this with OpenRouter though, can't you? They have a markup which is annoying, but they also have a long list of LLMs
Mkiza•3h ago
True, OpenRouter covers the basics, but to my understanding you're paying a markup on every token (correct me if I'm wrong) — for high-volume setups that gets painful fast. And if you're calling Anthropic directly (common for compliance or reliability reasons), OpenRouter isn't really an option anyways. That said, if you're already on OpenRouter and the markup doesn't bother you, it probably does the job.

GenesisDB Syncra: Merge multiple event stores into one deterministic stream

https://syncra.genesisdb.io
1•patriceckhart•36s ago•0 comments

Ask HN: What standards or protocols exist for AI Agent permissions

1•lyfeninja•42s ago•0 comments

Claude Code Routines

https://code.claude.com/docs/en/routines
1•matthieu_bl•46s ago•0 comments

AI Approach Reveals Ocean Currents in Unprecedented Detail

https://today.ucsd.edu/story/new-ai-approach-reveals-ocean-currents-in-unprecedented-detail
1•gmays•52s ago•0 comments

An Oligarchy of Old People

https://www.theatlantic.com/magazine/2026/05/gerontocracy-wealth-power/686585/
2•sleepyguy•2m ago•1 comments

Mechanical Sympathy

https://vickiboykis.com/2026/04/13/mechanical-sympathy/
2•tosh•2m ago•0 comments

MCP Attack Atlas – 40 AI agent attack patterns catalogued

https://sunglasses.dev/mcp-attack-atlas
2•azrollin•2m ago•0 comments

Intel NPU based Speech To Text in any app at cursor

https://github.com/anubhavgupta/whisper-npu
2•anubhavgupta•3m ago•1 comments

Autonomous AI Agents Become Secure by Design with Nvidia OpenShell

https://blogs.nvidia.com/blog/secure-autonomous-ai-agents-openshell/
2•eigenBasis•4m ago•0 comments

Job Matcher – LLM-scored job listings from 10 boards, self-hosted

https://christopherbeaulieu.net/job-matcher/
2•cmb_dev•5m ago•0 comments

The toxic side of the Moon (2018)

https://www.esa.int/Science_Exploration/Human_and_Robotic_Exploration/The_toxic_side_of_the_Moon
3•tcp_handshaker•7m ago•0 comments

The State of Open Source Licensing in 2026

https://redmonk.com/sogrady/2026/03/25/open-source-licensing-2026/
2•Tomte•8m ago•0 comments

SHOW HN: DuckDB / DuckLake Server (With Arrow Flight SQL) for iOS

2•philbe77•8m ago•0 comments

Valve has just made it easier to run games on Linux with 8 GB cards

https://www.pcgamer.com/software/linux/a-valve-developer-has-just-made-it-easier-to-run-games-on-...
3•evo_9•9m ago•0 comments

Routines in Claude Code

https://claude.com/blog/introducing-routines-in-claude-code
2•meetpateltech•10m ago•0 comments

RAG-Forge – Framework-agnostic RAG toolkit with a maturity model

https://github.com/hallengray/rag-forge
2•hallengray•11m ago•0 comments

The Perimeter Problem

https://www.southwind.ai/blog/the-perimeter-problem
2•Eswo•11m ago•0 comments

Show HN: Java RocksDB Without JNI

https://github.com/dfa1/rocksdbffm
2•dfa11•11m ago•0 comments

I Discover New Blogs

https://kevquirk.com/how-i-discover-new-blogs
4•speckx•12m ago•0 comments

William Cecil's Succession Plan

https://www.historytoday.com/archive/history-matters/william-cecils-succession-plan
2•Petiver•12m ago•0 comments

Can LLMs Perform Synthesis?

https://arxiv.org/abs/2603.20264
3•PaulHoule•14m ago•0 comments

Add Animal Crossing events to your digital calendar

https://sethmlarson.dev/animal-crossing-calendar
2•SethMLarson•14m ago•0 comments

Show HN: Privacy-first retirement planner – no bank linking, AI-powered analysis

https://planwithclarity.app
2•smilano85•14m ago•0 comments

The problem with thinking you're part Neanderthal

https://www.technologyreview.com/2026/04/14/1135169/problem-thinking-part-neanderthal-human-evolu...
2•laurex•15m ago•1 comments

Anthropic Opposes the Extreme AI Liability Bill That OpenAI Backed

https://www.wired.com/story/anthropic-opposes-the-extreme-ai-liability-bill-that-openai-backed/
3•srameshc•15m ago•1 comments

Tell HN: GitHub might have been leaking your webhook secrets. Check your emails.

8•ssiddharth•15m ago•1 comments

Interview: John Calhoun on the Origins of Glider (2010)

https://macscene.net/d/4678-interview-john-calhoun-on-the-origins-of-glider-part-1
2•CharlesW•16m ago•0 comments

Anthropic Plots Lovable Challenger

https://sifted.eu/articles/anthropic-lovable-challenger-leak
3•AnhTho_FR•17m ago•0 comments

Digital Realty Spending €2B to Turn Italy into a Mediterranean Data Hub

https://thecoolingreport.com/intel/digital-realty-italy-2-billion-mediterranean-data-hub.html
2•jackdilusso•17m ago•1 comments

The Cost of Concurrency Coordination [video]

https://www.youtube.com/watch?v=tND-wBBZ8RY
2•tosh•19m ago•0 comments