Provider dashboards mostly show totals. Opsmeter shows what actually caused the bill by breaking spend down by endpointTag, promptVersion, and optionally userId, alongside latency and success/error rates.
It's proxy-free: Opsmeter doesn't sit in your request path. After each LLM call completes, you send a small telemetry payload to /v1/ingest/llm-request (provider, model, endpointTag, promptVersion, token counts, latency, status). Opsmeter normalizes cost via a provider/model pricing table and surfaces trends and regressions.
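To make the flow concrete, here is a minimal sketch of reporting one call after it finishes. The field names not listed in the post (the exact token-count keys, latency key, auth header, and base URL) are assumptions, so check the docs before copying:

```python
import json
import urllib.request

# Assumed ingest URL; the post only gives the path /v1/ingest/llm-request.
OPSMETER_URL = "https://opsmeter.io/v1/ingest/llm-request"


def build_payload(provider, model, endpoint_tag, prompt_version,
                  prompt_tokens, completion_tokens, latency_ms, status,
                  user_id=None):
    """Assemble the telemetry record described above."""
    payload = {
        "provider": provider,
        "model": model,
        "endpointTag": endpoint_tag,
        "promptVersion": prompt_version,
        "promptTokens": prompt_tokens,          # assumed key name
        "completionTokens": completion_tokens,  # assumed key name
        "latencyMs": latency_ms,                # assumed key name
        "status": status,
    }
    if user_id is not None:  # userId is optional per the post
        payload["userId"] = user_id
    return payload


def send(payload, api_key):
    """Fire-and-forget POST after the LLM call; never in its request path."""
    req = urllib.request.Request(
        OPSMETER_URL,
        data=json.dumps(payload).encode(),
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
        },
        method="POST",
    )
    urllib.request.urlopen(req, timeout=2)


if __name__ == "__main__":
    record = build_payload("openai", "gpt-4o-mini", "checkout-summary", "v3",
                           812, 214, 950, "ok")
    send(record, api_key="YOUR_API_KEY")
```

Because the report happens after the response is already back, a failed or slow ingest call never adds latency to your user-facing request.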
Links:
Home: https://opsmeter.io
Docs: https://opsmeter.io/docs
Pricing: https://opsmeter.io/pricing
If you try it and share anonymized screenshots/feedback, I'm happy to help you interpret the results, e.g.:
- which endpoints drive spend
- which prompt versions increased tokens/cost (deploy regressions)
- which users (optional) are the biggest cost drivers
- suggested budget thresholds (80% warning / 100% exceeded) and alerting setup
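The two-tier threshold idea above is simple enough to sketch; this is a hypothetical illustration of the logic, not Opsmeter's actual alerting API:

```python
def budget_status(spend: float, budget: float) -> str:
    """Classify spend against a budget: 80% -> warning, 100% -> exceeded."""
    ratio = spend / budget
    if ratio >= 1.0:
        return "exceeded"
    if ratio >= 0.8:
        return "warning"
    return "ok"
```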
Feedback is welcome, especially on what you'd want next: the plan is to stay telemetry-first, with an optional gateway mode possibly added later.