frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: I built a BYOK AI gateway after a 100x Cloudflare KV cost mistake

https://qzira.com/en/
1•qzira•2h ago
I run a small data recovery business in Japan, and over the past year I've been building with AI coding tools like Claude Code, Cursor, and Cline.

One of my side projects is an overnight content pipeline for my business. It pulls RSS feeds, fetches source articles, generates posts with AI, scores them, and publishes them to WordPress without supervision.

The content is a bit niche: cybersecurity incidents for Japanese manufacturing companies in Aichi Prefecture — Toyota's home region — where older workflows like fax and password-protected ZIPs still haven't fully disappeared.

The physical setup is also a little ridiculous: a SwitchBot turns the PC on at 3am, Windows Task Scheduler starts the Python pipeline, and another scheduled task shuts the machine down when it's done.

Originally this was only meant to solve my own problem. But the more failure modes I found, the more features I kept adding.

What pushed me to build qzira was cost control.

The first lesson was operational: alerts don't help at 3am. What I needed wasn't another notification, but a kill switch outside the application — something that could stop requests before they reached the provider, regardless of what the agent decided to do.

The second lesson was more embarrassing: I had miscalculated Cloudflare KV write costs by 100x. Every request was triggering a KV put. Rewriting that path to batch via cron jobs reduced writes by about 99% and fixed the unit economics.

I also became much more conservative about model choice for production content after running a simple comparison on my own pipeline.

I ran 10 articles through Claude and didn't find any hallucinations.

Then I ran 1 article through gpt-4o-mini, and it immediately inverted the meaning of the source: it wrote "operations were suspended" where the original said "no impact on operations was confirmed."

To be fair, the pipeline was tuned around Claude, so I don't take this as a general statement about model quality. It may simply have been a prompt/model fit issue. But for me, it was enough to become much more conservative about where lower-cost models are allowed to touch production content.

Both problems pointed to the same conclusion: cost and policy enforcement belong at the infrastructure layer, not inside the application.

So I built qzira — a BYOK AI gateway in front of OpenAI, Anthropic, and Google AI. It adds gateway-level budget controls, hard stops, and routing by changing the base_url in tools like Claude Code or Cursor.

Stack: Cloudflare Workers, Hono, D1, KV, and Vectorize.

There's a free tier.

Happy to answer questions about the architecture, the cost mistake, the overnight pipeline, or the slightly absurd physical setup behind it.

Show HN: GZOO Forge – persistent project memory as an MCP server for Claude Code

https://github.com/gzoonet/forge
1•gzoo•45s ago•0 comments

I Asked 15 DevTool Maintainers About Documentation Localization

1•skytin1004•57s ago•1 comments

Google Maps's Moat (2017)

https://www.justinobeirne.com/google-maps-moat
1•dbl000•1m ago•0 comments

How to automatically generate subtitles in any language

https://www.flowsub.ai/
1•bloomder•1m ago•0 comments

Anthropic vs. Dow

https://www.documentcloud.org/documents/27781298-anthropic-v-dow/
2•antimora•1m ago•0 comments

Private credit woes could become data center difficulties

https://www.axios.com/2026/03/09/ai-data-center-private-credit
1•1vuio0pswjnm7•2m ago•0 comments

The Execution Layer

https://twitter.com/RhysSullivan/status/2030903539871154193
1•tosh•3m ago•0 comments

A Mysterious Code Is Being Broadcast on Shortwave Radio. Is It Iran?

https://www.theatlantic.com/national-security/2026/03/asymmetric-warfare-iran-numbers-stations-cy...
1•jbegley•3m ago•0 comments

I want Artificial Competence, not more Artificial Intelligence

https://jdauriemma.com/programming/i-want-artificial-competence-not-more-ai
1•jdauriemma•4m ago•0 comments

Show HN: Built a small CLI for self-improving OpenClaw agent loops

https://github.com/shadmau/autocouncil
1•j0xnvm•4m ago•0 comments

Evolve or Die

https://herbertlui.net/evolve-or-die-2/
1•speckx•4m ago•0 comments

Why I'm not worried about AI causing mass unemployment

https://www.understandingai.org/p/software-didnt-eat-the-world
1•gukov•5m ago•0 comments

Anthropic vs. U.S. Department of War, etc. [pdf]

https://storage.courtlistener.com/recap/gov.uscourts.cand.465515/gov.uscourts.cand.465515.1.0_1.pdf
3•mef•6m ago•2 comments

Show HN: ApexStore – An embedded LSM-Tree storage engine written in Rust

https://github.com/ElioNeto/ApexStore
1•texuguito•7m ago•1 comments

Show HN: Pacto – A runtime contract for cloud-native services

https://github.com/trianalab/pacto
1•edu-diaz•8m ago•0 comments

GitHub Security Lab's open source AI-powered vulnerability scanner

https://github.blog/security/how-to-scan-for-vulnerabilities-with-github-security-labs-open-sourc...
1•tcbrah•8m ago•0 comments

An opinionated take on how to do important research that matters

https://nicholas.carlini.com/writing/2026/how-to-win-a-best-paper-award.html
1•mad•8m ago•0 comments

Show HN: Qiyaas – a word game based on numbers

https://www.qiyaasgame.com/
1•xamed•8m ago•0 comments

In Defense of Death Caps

https://northspore.substack.com/p/in-defense-of-death-caps
1•Quitschquat•10m ago•0 comments

Bitcoin difficulty jumps 15% largest increase since 2021, despite price slump

https://www.coindesk.com/markets/2026/02/20/bitcoin-difficulty-jumps-15-largest-increase-since-20...
1•PaulHoule•11m ago•0 comments

Ghostty 1.3.0

https://ghostty.org/docs/install/release-notes/1-3-0
1•matrixhelix•11m ago•0 comments

Peter Thiel and Jeffrey Epstein Had a Yearslong Relationship

https://jacobin.com/2026/03/thiel-epstein-barak-ai-israel/
11•johnbarron•11m ago•0 comments

Show HN: Pu-erh Lab, a CUDA-accelerated RAW photo editor

https://github.com/zidage/PuerhLab
1•yurunzi•11m ago•1 comments

Geo Platform for AI Search Visibility (ChatGPT, Claude, Gemini, Perplexity)

https://geoark.ai
1•abedaarabi•12m ago•1 comments

Mobile AI: fuzzy logic trust and compatibility

https://www.loxation.com/blog/posts/blog-fuzzy-social-reasoning/
1•jabbr•13m ago•0 comments

Avoiding temptation beats building willpower

https://www.npr.org/2026/03/09/nx-s1-5736553/fast-food-screens-kids-health
2•marojejian•13m ago•1 comments

Ask HN: What words and phrases make HN peeps see red?

2•bookofjoe•13m ago•0 comments

Software Got Weird

https://www.coryzue.com/writing/software-got-weird/
1•carnevalem•14m ago•0 comments

US missile hit military base near Iran school, video analysis shows

https://www.bbc.com/news/articles/cvg548lyjnyo
2•johnbarron•14m ago•0 comments

AI and Software Development

https://allanvital.com/ai-and-software-development/
1•speckx•14m ago•0 comments