frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: What's your biggest LLM cost multiplier?

3•teilom•1h ago
"Tokens per request" has been a misleading cost model for us in production. The real drivers seem to be multipliers: retries/429s, tool fanout, P95 context growth, and safety passes.

What’s been the biggest cost multiplier in your prod LLM systems, and what policies worked (caps, degraded mode, fallback, hard fail)?

Comments

teilom•1h ago
Background note I wrote (framing + “budget as contract”): https://github.com/teilomillet/enzu/blob/main/docs/BUDGETS_A...
teilom•1h ago
If you’re trying to estimate before prod, logging these 4 things in a pilot gets you 80% there: - tokens/run (in+out) - tool calls/run (and fanout) - retry rate (timeouts/429s) - context length over turns (P50/P95)

Fanout × retries is the classic “bill exploder”, and P95 context growth is the stealth one. The point of “budget as contract” is deciding in advance what happens at limit (degraded mode / fallback / partial answer / hard fail), not discovering it from the invoice.

Google Cloud suspended my account for 2 years, only automated replies

1•andylizf•16s ago•0 comments

Software: Solo or Team?

https://boragonul.com/blog/software-solo-or-team/
1•bora_gonul•1m ago•0 comments

Show HN: Openground, and on-device and open source alternative to Context7

https://github.com/poweroutlet2/openground
1•poweroutlet2•1m ago•0 comments

Show HN: Unicode is weird so I built a site to make text cooler anywhere

https://fontgen.cool/
1•liquid99•5m ago•0 comments

Synth Town

https://synth.town
1•count_zero•6m ago•0 comments

SeL4: The most highly assured and fastest operating system kernel

https://sel4.systems/
1•doener•8m ago•0 comments

California Senate passes bill regulating lawyers' use of AI

https://www.reuters.com/legal/government/california-senate-passes-bill-regulating-lawyers-use-ai-...
2•1vuio0pswjnm7•8m ago•0 comments

Nova OS Virtualization Architecture

https://hypervisor.org/
1•doener•9m ago•0 comments

New heat-shrinking method integrates electronic circuits on irregular shapes

https://techxplore.com/news/2026-01-method-electronic-circuits-irregular.html
2•PaulHoule•9m ago•0 comments

Still conscious? Brain marker signals when anaesthesia takes hold

https://www.nature.com/articles/d41586-026-00301-9
1•bookofjoe•10m ago•1 comments

Trump Officials Move to Double Number of H-2B Guest Visas This Year

https://www.nytimes.com/2026/01/30/us/politics/h2b-visas.html
2•ripe•10m ago•0 comments

Agentchan – imageboard built for AI agents

https://chan.alphakek.ai/
1•TMWNN•13m ago•0 comments

Iva Kosic

1•billystr•13m ago•0 comments

Nintendo DS code editor and scriptable game engine

https://crl.io/ds-game-engine/
2•Antibabelic•14m ago•0 comments

The Context Window Is Becoming a Virtual Machine

https://conikeec.substack.com/p/the-context-window-is-becoming-a
1•conikeec•15m ago•0 comments

Total communications blackout for 22 day, 30k protesters thought killed

https://www.youtube.com/watch?v=hxc8RgchpBs
1•us321•17m ago•0 comments

Show HN: An open-source Chrome extension that lets any LLMs control the browser

https://github.com/hanzili/llm-in-chrome
1•hanzili•19m ago•0 comments

Why Some Code Feels Easier to Read

https://evan-moon.github.io/2026/01/30/developer-intuition-readable-code-and-neuroscience/en/
2•bboydart•24m ago•0 comments

I built a milestone-based B2B escrow system using Stripe Connect(Laravel+NextJS)

https://www.trustora.ro/en/open-soon
1•arsene94•24m ago•1 comments

Guest Post from an Iranian

https://scottaaronson.blog/?p=9530
5•Tomte•25m ago•0 comments

Improving Unnesting of Complex Queries [pdf]

https://15799.courses.cs.cmu.edu/spring2025/papers/11-unnesting/neumann-btw2025.pdf
1•todsacerdoti•32m ago•0 comments

The surprising attention on sprites, exe.dev, and shellbox

https://lalitm.com/trying-sprites-exedev-shellbox/
3•todsacerdoti•32m ago•0 comments

Charcoal-Powered Generator – Charging Off-Grid Battery with Homemade Power [video]

https://www.youtube.com/watch?v=vpjBlfd3s4g
1•Dries007•37m ago•0 comments

Genode OS is a tool kit for building highly secure special-purpose OS

https://genode.org/about/index
18•doener•38m ago•0 comments

What Was History's Deadliest Era?

https://jacobin.com/2026/01/book-review-crais-modernity-violence
1•wahnfrieden•38m ago•0 comments

ClawMatch – A dating API for AI agents

https://clawmatch.ai
1•mischainc•40m ago•0 comments

CachyOS Saying "No" to Bazzite's Open Gaming Collective

https://old.reddit.com/r/cachyos/comments/1qq0dxr/open_gaming_collective_ogc_formed_to_push_linux...
2•tuananh•40m ago•0 comments

Ask HN: Any real OpenClaw (Clawd Bot/Molt Bot) users? What's your experience?

33•cvhc•41m ago•35 comments

Over Creamy Chicken, Europe's Leaders Try to Reduce Dependence on Trump

https://www.nytimes.com/2026/01/31/world/europe/eu-trump-greenland-europe.html
1•doener•44m ago•0 comments

Musk's SpaceX applies to launch 1M satellites into orbit

https://www.bbc.co.uk/news/articles/cyv5l24mrjmo
4•mellosouls•45m ago•1 comments