Every major LLM provider offers 50-90% discounts on cached tokens, but the mechanics required to actually capture them differ by provider, change regularly, and are genuinely hard to get right.
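To make "different for every provider" concrete, here's an illustrative comparison (my sketch, not Genosis code): Anthropic requires explicit `cache_control` breakpoints on content blocks, while OpenAI applies prefix caching automatically once a prompt passes a minimum length, with no markers in the request at all.

```python
# Illustrative request payloads, not actual SDK calls.
LONG_SYSTEM_PROMPT = "You are a support agent. " * 200  # stand-in for a large, stable prefix

# Anthropic: opt in per block. Cached content must form a byte-identical
# prefix across requests, so block order matters.
anthropic_request = {
    "model": "claude-sonnet-4-20250514",
    "max_tokens": 1024,
    "system": [
        {
            "type": "text",
            "text": LONG_SYSTEM_PROMPT,
            "cache_control": {"type": "ephemeral"},  # explicit cache breakpoint
        }
    ],
    "messages": [{"role": "user", "content": "Where is my order?"}],
}

# OpenAI: no cache markers anywhere. Caching kicks in automatically for
# prompts above a size threshold, keyed on the exact prefix.
openai_request = {
    "model": "gpt-4o",
    "messages": [
        {"role": "system", "content": LONG_SYSTEM_PROMPT},
        {"role": "user", "content": "Where is my order?"},
    ],
}
```

Getting the discount means knowing which of these models each provider uses, where the breakpoints pay off, and keeping prefixes stable across deploys.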
Genosis watches your traffic (content-blind — it only sees hashes, never your data), figures out which blocks are worth caching and in what order, and delivers a manifest that the SDK applies locally. It also catches duplicate requests and serves them from a local cache — saving both input and output tokens.
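The content-blind dedup idea above can be sketched in a few lines (my illustration, not the actual Genosis implementation): only a hash of the request would ever leave the machine, while the response cache lives locally with the caller.

```python
import hashlib
import json

_local_cache: dict[str, str] = {}  # hash -> cached completion, kept on the client


def request_hash(payload: dict) -> str:
    # Canonicalize so key order doesn't change the hash, then digest.
    canonical = json.dumps(payload, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode()).hexdigest()


def complete(payload: dict, call_provider) -> str:
    key = request_hash(payload)  # only this hash would be shared, never the content
    if key in _local_cache:
        return _local_cache[key]  # duplicate request: zero input or output tokens spent
    response = call_provider(payload)
    _local_cache[key] = response
    return response
```

A second identical request never reaches the provider, which is why dedup saves output tokens as well as input tokens.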
It's not a proxy. It's never in your request path. If it goes down, your app works exactly as if it were never there.
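That fail-open property can be sketched like this (my illustration under assumed names like `fetch_manifest` and `apply_manifest`, not the real SDK internals): the optimizer is consulted out of band, and if it's unreachable, requests pass through to the provider untouched.

```python
def apply_manifest(payload: dict, manifest: dict) -> dict:
    # Illustrative: tag the blocks the manifest says are worth caching.
    out = dict(payload)
    out["cached_blocks"] = manifest.get("blocks", [])
    return out


def get_manifest(fetch_manifest):
    try:
        return fetch_manifest()  # e.g. a periodically refreshed call to the optimizer
    except Exception:
        return None  # optimizer down: behave as if it was never installed


def send(payload: dict, call_provider, fetch_manifest) -> str:
    manifest = get_manifest(fetch_manifest)
    if manifest:
        payload = apply_manifest(payload, manifest)
    return call_provider(payload)  # always reaches the provider either way
```

The provider call never depends on the optimizer being up; the only thing lost during an outage is the optimization itself.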
- Open-source SDK: Python (`pip install genosis`) and TypeScript (`npm install @genosis/sdk`)
- Supports Anthropic and OpenAI (Google coming soon)
- Free tier — you only pay if we save you money
I'm the sole founder. Happy to answer questions about how it works, the caching mechanics, or anything else.