frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

New research on analyzing and predicting token consumption of coding agents

https://arxiv.org/abs/2604.22750
3•jiaxinpei•1h ago

Comments

jiaxinpei•1h ago
Key findings:

1. Agentic coding tasks consume ~1000× more tokens than chat or reasoning workloads. And input tokens, not output, become the dominant cost driver, because each round re-feeds the entire trajectory back into the model. 2. More tokens ≠ better outcomes. Runs on the same task can vary by up to 30× in token use, and accuracy often peaks at intermediate cost. Beyond that, extra spending tends to reflect redundant exploration and does not bring further performance gain. 3. Models differ substantially in token efficiency. On the same successfully solved tasks, Kimi-K2 and Claude Sonnet-4.5 use roughly twice as many tokens as GPT-5.2. The gap becomes even larger when all the models fail. 4. Human-rated task difficulty weakly predicts actual cost. "Easy" tasks for humans can be surprisingly expensive for agents, and vice versa. The classic "Moravec's Paradox" is also true for coding agents! 5. Agents struggle to predict their own costs. Self-prediction correlations top out around 0.39, and every model we tested systematically underestimates what a task will cost. Result-based pricing still has a long way to go when we cannot even figure out the token cost beforehand.

H4ckf0r0day/obscura: The headless browser for AI agents and web scraping

https://github.com/h4ckf0r0day/obscura
2•rezaprima•2m ago•1 comments

Show HN: Optical Design and Simulation in Matlab

https://www.mathworks.com/help/images/optical-system-design-and-analysis.html
1•ashishuthama•4m ago•0 comments

Could the X-Bat Stealth Fighter Drone Change the Air Combat Game?

https://www.twz.com/air/could-the-x-bat-stealth-fighter-drone-change-the-air-combat-game
1•breve•8m ago•0 comments

I Replaced Kiro with a Free Plugin – Here's What Happened

https://www.getdraft.dev/blog/replaced-kiro-with-free-plugin/
1•mayurpise•9m ago•0 comments

Drones are getting drugs, escape tools and crab legs to inmates

https://www.cnn.com/2026/05/03/us/drone-deliveries-contraband-prison-inmates
1•breve•11m ago•0 comments

Show HN: Cryptographic receipt authority for ISO 20022 financial messages

https://20022validator.com
1•NextGenRails•12m ago•0 comments

Show HN: Stigmem – open-source federated knowledge fabric for AI agents (v1.0)

2•barryjones20•12m ago•0 comments

Agentic Coding Is a Trap

https://larsfaye.com/articles/agentic-coding-is-a-trap
1•ayoisaiah•13m ago•0 comments

Local, deterministic, version-controlled knowledge graph

https://www.getdraft.dev/blog/local-graph-engine/
1•mayurpise•19m ago•0 comments

The PHP License, Simplified

https://ben.ramsey.dev/blog/2026/05/the-php-license-simplified
2•gslin•28m ago•0 comments

Open source intelligence about Palantir

https://palantirwatch.org
1•seb1204•29m ago•0 comments

Using the "Sandwich Method" to Teach Mathematics

https://pikuma.com/blog/sandwich-method-math-education
1•atan2•31m ago•0 comments

Kloak keeps secrets out of your application's memory

https://getkloak.io/blog/kloak-50000-feet-view/
1•spinningfactory•31m ago•0 comments

PyFlue – Python-Native Agent Harness Framework (Python Clone of Flue)

https://super-agentic.ai/pyflue
2•sebst•33m ago•0 comments

Show HN: Zuma Portable

https://drive.google.com/drive/folders/1hDRvlY707VrO_UztEtIt1EoPgKBICL8Q?usp=sharing
1•zeeeeeebo•37m ago•0 comments

Simpson's Paradox

https://en.wikipedia.org/wiki/Simpson%27s_paradox
5•basilikum•40m ago•1 comments

California man uses elaborate drone show to help delivery drivers find his house

https://www.dexerto.com/entertainment/california-man-uses-elaborate-drone-show-to-help-delivery-d...
2•gnabgib•41m ago•0 comments

Exit, Voice, and Loyalty

https://en.wikipedia.org/wiki/Exit,_Voice,_and_Loyalty
2•akyuu•42m ago•0 comments

Why should a Trace-ID be 128 bits? (A Surprisingly Long Answer)

https://newsletter.signoz.io/p/why-should-a-trace-id-be-128-bits
2•birdculture•43m ago•0 comments

HN Signal (Last 24 hours) | Curated top stories from HN in the last 24 hrs

https://www.heydebrief.com/dubkc/hn-best-24
1•baetylus•49m ago•2 comments

DeepClaude – Claude Code agent loop with DeepSeek V4 Pro, 17x cheaper

https://github.com/aattaran/deepclaude
14•alattaran•52m ago•6 comments

Show HN: Triggering anti-cheats with just a browser tab title

https://github.com/elliott-diy/DontTrustTitles
6•Elliott-Diy•52m ago•1 comments

Broadcasting GPS on the Local Network

https://evertpot.com/broadcasting-gps-on-local-network/
1•treve•54m ago•0 comments

Brutal in production, lands like a verdict

1•Non_Von_Neumann•54m ago•0 comments

ReactOS Introduces Unified Live/Install Media, New Storage Driver

https://www.phoronix.com/news/ReactOS-Unified-ISO
2•kykat•56m ago•1 comments

Introduction to Atom

https://validator.w3.org/feed/docs/atom.html
3•susam•56m ago•0 comments

The Google Cloud Knowledge Catalog

https://cloud.google.com/blog/products/data-analytics/introducing-the-google-cloud-knowledge-catalog
2•laxmena•57m ago•0 comments

Next-Token Predictor Is An AI's Job, Not Its Species

https://www.astralcodexten.com/p/next-token-predictor-is-an-ais-job
1•optimalsolver•57m ago•0 comments

Seriously, Anthropic? [video]

https://www.youtube.com/watch?v=J8O9LLpJNrg
4•dp-hackernews•1h ago•0 comments

Sato – AI desktop companion for macOS with multi-provider support

https://www.sato.host/
2•vitalune•1h ago•0 comments