I built a tool to benchmark my AI agent's API costs

https://local001.com/tokens
5•sampleSal•1h ago

Comments

sampleSal•1h ago
We're building AI agents on OpenClaw and we're burning $1,100/week on Anthropic API calls.

No idea if our prompting strategy was inefficient or if everyone was paying this much.

Built a quick benchmarking tool: https://local001.com/tokens

Submit your weekly spend + provider + use case → see your percentile + comparisons.
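A minimal sketch of how a percentile like this could be computed, assuming submissions are compared only against others with the same use case. This is not the tool's actual code: `Submission`, `percentileRank`, and `spendPercentile` are hypothetical names, and the mid-rank tie handling is one choice among several.

```typescript
// Hypothetical shape of one submission (field names are assumptions).
interface Submission {
  weeklySpendUsd: number;
  provider: string;
  useCase: string;
}

// Mid-rank percentile: share of values strictly below x, plus half of the ties.
function percentileRank(values: number[], x: number): number {
  if (values.length === 0) return NaN; // no peers, no percentile
  const below = values.filter((v) => v < x).length;
  const equal = values.filter((v) => v === x).length;
  return ((below + 0.5 * equal) / values.length) * 100;
}

// Compare one team's spend against submissions with the same use case.
function spendPercentile(subs: Submission[], mine: Submission): number {
  const peers = subs
    .filter((s) => s.useCase === mine.useCase)
    .map((s) => s.weeklySpendUsd);
  return percentileRank(peers, mine.weeklySpendUsd);
}
```

With few submissions the percentile is coarse, which is why the number only becomes meaningful as the peer group grows.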

The dataset is early — it gets more useful the more people submit. But here's why I built this:

That $1,100/week covers a mix of coding agents and personal assistant tasks, and I have no idea if it's normal or insane. Specifically:

Are we overspending by use case? Our coding agent burns ~$700/week and the assistant tasks burn ~$400. But I don't know what "good" looks like. Is $700/week for an agentic coding workflow competitive? Are teams doing similar work at $200? $2,000? There's zero public data on this.

Are we overspending on Anthropic? We're all-in on Claude right now. For coding tasks, maybe that's the right call. But for assistant/chat workflows — should we be routing half of that to GPT-4o or Gemini and cutting costs 60%? I genuinely don't know, and I haven't seen anyone publish real cost comparisons by task type, not just benchmark scores.
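One way to sanity-check the routing question is a small blended-cost calculation. This is a sketch under stated assumptions: `priceRatio` is a hypothetical parameter (the alternative provider's cost for the same work, as a fraction of the current cost), not a measured number, and it assumes the routed work needs no extra retries.

```typescript
// Weekly cost after routing `routedFraction` of a workload to an alternative
// provider whose cost for the same work is `priceRatio` times the current cost.
function blendedWeeklyCost(
  weeklySpendUsd: number,
  routedFraction: number,
  priceRatio: number
): number {
  const kept = weeklySpendUsd * (1 - routedFraction);
  const routed = weeklySpendUsd * routedFraction * priceRatio;
  return kept + routed;
}

// Fraction of the workload's spend that the routing saves.
function savingsFraction(routedFraction: number, priceRatio: number): number {
  return routedFraction * (1 - priceRatio);
}
```

Savings equal `routedFraction * (1 - priceRatio)`, so routing half of a workload saves at most 50% of that workload's spend even if the alternative were free. A 60% cut would require routing more than half, which is exactly the kind of question real per-task cost data would help answer.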

That's what this tool is for. Submit your weekly spend, provider, and use case → see where you land. If 50 teams submit data, we'll finally have a real answer to "is Anthropic worth the premium for X?"

Open questions:

Should we track tokens/$ instead of just $?

Should we separate o1/reasoning models vs base models?

How do you benchmark "efficiency" vs raw spend?
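For the first two questions, a tokens-per-dollar figure split by model class could look roughly like the sketch below. Everything here is an assumption for illustration: the tool currently collects weekly spend only, and `UsageRecord`, `ModelClass`, and both functions are hypothetical names.

```typescript
type ModelClass = "reasoning" | "base";

// Hypothetical usage record; the current form does not collect token counts.
interface UsageRecord {
  tokens: number;
  costUsd: number;
  modelClass: ModelClass;
}

// Total tokens divided by total cost; NaN when there is no spend to divide by.
function tokensPerDollar(records: UsageRecord[]): number {
  const tokens = records.reduce((sum, r) => sum + r.tokens, 0);
  const cost = records.reduce((sum, r) => sum + r.costUsd, 0);
  return cost > 0 ? tokens / cost : NaN;
}

// Same metric, reported separately for reasoning and base models.
function tokensPerDollarByClass(records: UsageRecord[]): Record<ModelClass, number> {
  return {
    reasoning: tokensPerDollar(records.filter((r) => r.modelClass === "reasoning")),
    base: tokensPerDollar(records.filter((r) => r.modelClass === "base")),
  };
}
```

Tokens per dollar still says nothing about whether the tokens accomplished anything, so the efficiency question would need some measure of completed tasks on top of this.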

Built with Next.js + Cloudflare Workers + D1. Submissions are anonymous (just hashed IPs).

Long-term goal: use this data to negotiate bulk API rates with Anthropic/OpenAI/Google.

How would you improve this?

https://local001.com/tokens
