frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: Poddley.com – Follow people, not podcasts

https://poddley.com/guests/ana-kasparian/episodes
1•onesandofgrain•2m ago•0 comments

Layoffs Surge 118% in January – The Highest Since 2009

https://www.cnbc.com/2026/02/05/layoff-and-hiring-announcements-hit-their-worst-january-levels-si...
2•karakoram•2m ago•0 comments

Papyrus 114: Homer's Iliad

https://p114.homemade.systems/
1•mwenge•2m ago•1 comments

DicePit – Real-time multiplayer Knucklebones in the browser

https://dicepit.pages.dev/
1•r1z4•2m ago•1 comments

Turn-Based Structural Triggers: Prompt-Free Backdoors in Multi-Turn LLMs

https://arxiv.org/abs/2601.14340
2•PaulHoule•4m ago•0 comments

Show HN: AI Agent Tool That Keeps You in the Loop

https://github.com/dshearer/misatay
2•dshearer•5m ago•0 comments

Why Every R Package Wrapping External Tools Needs a Sitrep() Function

https://drmowinckels.io/blog/2026/sitrep-functions/
1•todsacerdoti•6m ago•0 comments

Achieving Ultra-Fast AI Chat Widgets

https://www.cjroth.com/blog/2026-02-06-chat-widgets
1•thoughtfulchris•8m ago•0 comments

Show HN: Runtime Fence – Kill switch for AI agents

https://github.com/RunTimeAdmin/ai-agent-killswitch
1•ccie14019•10m ago•1 comments

Researchers surprised by the brain benefits of cannabis usage in adults over 40

https://nypost.com/2026/02/07/health/cannabis-may-benefit-aging-brains-study-finds/
1•SirLJ•12m ago•0 comments

Peter Thiel warns the Antichrist, apocalypse linked to the 'end of modernity'

https://fortune.com/2026/02/04/peter-thiel-antichrist-greta-thunberg-end-of-modernity-billionaires/
1•randycupertino•13m ago•2 comments

USS Preble Used Helios Laser to Zap Four Drones in Expanding Testing

https://www.twz.com/sea/uss-preble-used-helios-laser-to-zap-four-drones-in-expanding-testing
2•breve•18m ago•0 comments

Show HN: Animated beach scene, made with CSS

https://ahmed-machine.github.io/beach-scene/
1•ahmedoo•19m ago•0 comments

An update on unredacting select Epstein files – DBC12.pdf liberated

https://neosmart.net/blog/efta00400459-has-been-cracked-dbc12-pdf-liberated/
2•ks2048•19m ago•0 comments

Was going to share my work

1•hiddenarchitect•22m ago•0 comments

Pitchfork: A devilishly good process manager for developers

https://pitchfork.jdx.dev/
1•ahamez•22m ago•0 comments

You Are Here

https://brooker.co.za/blog/2026/02/07/you-are-here.html
3•mltvc•26m ago•1 comments

Why social apps need to become proactive, not reactive

https://www.heyflare.app/blog/from-reactive-to-proactive-how-ai-agents-will-reshape-social-apps
1•JoanMDuarte•27m ago•1 comments

How patient are AI scrapers, anyway? – Random Thoughts

https://lars.ingebrigtsen.no/2026/02/07/how-patient-are-ai-scrapers-anyway/
1•samtrack2019•28m ago•0 comments

Vouch: A contributor trust management system

https://github.com/mitchellh/vouch
2•SchwKatze•28m ago•0 comments

I built a terminal monitoring app and custom firmware for a clock with Claude

https://duggan.ie/posts/i-built-a-terminal-monitoring-app-and-custom-firmware-for-a-desktop-clock...
1•duggan•29m ago•0 comments

Tiny C Compiler

https://bellard.org/tcc/
2•guerrilla•30m ago•0 comments

Y Combinator Founder Organizes 'March for Billionaires'

https://mlq.ai/news/ai-startup-founder-organizes-march-for-billionaires-protest-against-californi...
2•hidden80•31m ago•2 comments

Ask HN: Need feedback on the idea I'm working on

1•Yogender78•31m ago•0 comments

OpenClaw Addresses Security Risks

https://thebiggish.com/news/openclaw-s-security-flaws-expose-enterprise-risk-22-of-deployments-un...
2•vedantnair•32m ago•0 comments

Apple finalizes Gemini / Siri deal

https://www.engadget.com/ai/apple-reportedly-plans-to-reveal-its-gemini-powered-siri-in-february-...
1•vedantnair•32m ago•0 comments

Italy Railways Sabotaged

https://www.bbc.co.uk/news/articles/czr4rx04xjpo
12•vedantnair•32m ago•2 comments

Emacs-tramp-RPC: high-performance TRAMP back end using MsgPack-RPC

https://github.com/ArthurHeymans/emacs-tramp-rpc
1•fanf2•34m ago•0 comments

Nintendo Wii Themed Portfolio

https://akiraux.vercel.app/
2•s4074433•38m ago•2 comments

"There must be something like the opposite of suicide "

https://post.substack.com/p/there-must-be-something-like-the
1•rbanffy•40m ago•1 comments
Open in hackernews

My analysis of 439 models proves: You're overpaying for your LLMs

https://whatllm.vercel.app/
7•demian101•6mo ago

Comments

demian101•6mo ago
While everyone's geeking out over Grok4's insane physics sims and Kimi K2's 1T OS bombshell (crushing coding benchmarks for pennies), the real AI drama is in the pricing shadows. After my LLM Selector post blew up here, I kept getting DMs asking "but which provider should I actually use?" So I dove deep into 439 models across 63 providers.

What I found? some interesting insights:

1. huge markup on identical models Take DeepSeek R1 0528 (quality 68 from Artificial analysis bench, beats many flagships):

Completely free on Google Vertex and CentML (decent speeds too, 121 tok/s and 87 tok/s).

But jumps to $0.91 on Deepinfra, $4.25 on Fireworks Fast, and a whopping $5.50 on SambaNova, for the exact same model (ofc with speed differences).

Arbitrage alert: Why pay infinite markup when free tiers deliver the goods for experimentation or bulk runs?

2. Latency goldmines hiding in plain sight Sub millisecond responses aren't just for premium setups:

Nebius Base crushes it with DeepSeek R1 at 0.61ms latency for $1.00/1M (103 tok/s) and Qwen3 235B at 0.56ms for $0.30/1M (50 tok/s).

Groq takes it further with models like Qwen3 32B at 0.14ms for $0.36/1M (627 tok/s).

Arbitrage alert: These blow away slower "enterprise" options costing 10x more, ideal for real-time apps

3. speed demons with massive throughput gaps Hardware optimization creates wild performance swings:

Cerebras with Qwen3 32B at 2,496 tok/s for $0.50/1M and Llama 4 Scout at 2,808 tok/s for $0.70/1M.

Compare to the same models elsewhere: Often stuck at 40-80 tok/s for similar or higher prices.

Arbitrage alert: 50x+ throughput boosts on the same model?

4. Quality overpays that defy logic High-quality doesn't mean high-price anymore:

Qwen3 235B (quality 62) at $0.10/1M on Fireworks (79 tok/s): outperforms Claude 4 Opus (quality 58) which costs $30/1M everywhere (19-65 tok/s).

Grok 3 mini (quality 67) at $0.35/1M on xAI (210 tok/s), edging out pricier closed source rivals.

Arbitrage alert: 300x cheaper for better quality? Open-source gems like these make "premium" models look like rip-offs lol

5. Provider flips on big-name models Even giants like OpenAI show huge variances:

GPT-4.1 mini ($0.70/1M): Azure blasts 217 tok/s vs OpenAI's 73 tok/s.

o3 ($3.50/1M): OpenAI hits 199 tok/s vs Azure's slower 99 tok/s (with double the latency).

Arbitrage alert: Same price, but 3x throughput or half the latency? Picking the right endpoint saves thousands on production workloads.

We're in the Wild West of pricing amid all the hype. Big names coast on reputation, but smaller providers like Nebius and Cerebras optimize like mad.

Open-source crushes closed-source on value: top 20 price-perf plays are ALL open.

What should you do?

Stop assuming expensive = better

Hunt latency and speed arbitrages (they're everywhere)

Test specialised providers for throughput wins

Grab sub-$0.50 open-source beasts (like Qwen3 or Grok mini)

Exploit these gaps now before "normalization" hits

Centralised all the data from Artificial analysis on whatllm.com, and insights are the real gold.

Found crazier arbitrages? Spill in comments!

which hype are you actually buying, and why?

This rabbit hole hit harder than any benchmark!

Happy to geek out more!