frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: LocalGPT – A local-first AI assistant in Rust with persistent memory

https://github.com/localgpt-app/localgpt
152•yi_wang•5h ago•48 comments

Haskell for all: Beyond agentic coding

https://haskellforall.com/2026/02/beyond-agentic-coding
73•RebelPotato•5h ago•18 comments

SectorC: A C Compiler in 512 bytes (2023)

https://xorvoid.com/sectorc.html
267•valyala•13h ago•51 comments

Total surface area required to fuel the world with solar (2009)

https://landartgenerator.org/blagi/archives/127
30•robtherobber•4d ago•28 comments

Software factories and the agentic moment

https://factory.strongdm.ai/
207•mellosouls•15h ago•355 comments

Speed up responses with fast mode

https://code.claude.com/docs/en/fast-mode
170•surprisetalk•12h ago•163 comments

LLMs as the new high level language

https://federicopereiro.com/llm-high/
75•swah•4d ago•130 comments

Brookhaven Lab's RHIC concludes 25-year run with final collisions

https://www.hpcwire.com/off-the-wire/brookhaven-labs-rhic-concludes-25-year-run-with-final-collis...
76•gnufx•11h ago•59 comments

Hoot: Scheme on WebAssembly

https://www.spritely.institute/hoot/
183•AlexeyBrin•18h ago•35 comments

Stories from 25 Years of Software Development

https://susam.net/twenty-five-years-of-computing.html
176•vinhnx•16h ago•17 comments

Why there is no official statement from Substack about the data leak

https://techcrunch.com/2026/02/05/substack-confirms-data-breach-affecting-email-addresses-and-pho...
30•witnessme•2h ago•7 comments

Vocal Guide – belt sing without killing yourself

https://jesperordrup.github.io/vocal-guide/
328•jesperordrup•23h ago•98 comments

The Architecture of Open Source Applications (Volume 1) Berkeley DB

https://aosabook.org/en/v1/bdb.html
8•grep_it•5d ago•0 comments

First Proof

https://arxiv.org/abs/2602.05192
138•samasblack•15h ago•81 comments

Wood Gas Vehicles: Firewood in the Fuel Tank (2010)

https://solar.lowtechmagazine.com/2010/01/wood-gas-vehicles-firewood-in-the-fuel-tank/
35•Rygian•2d ago•9 comments

Show HN: I saw this cool navigation reveal, so I made a simple HTML+CSS version

https://github.com/Momciloo/fun-with-clip-path
86•momciloo•13h ago•17 comments

Vouch

https://twitter.com/mitchellh/status/2020252149117313349
77•chwtutha•3h ago•20 comments

Al Lowe on model trains, funny deaths and working with Disney

https://spillhistorie.no/2026/02/06/interview-with-sierra-veteran-al-lowe/
109•thelok•15h ago•24 comments

Start all of your commands with a comma (2009)

https://rhodesmill.org/brandon/2009/commands-with-comma/
593•theblazehen•3d ago•212 comments

Show HN: A luma dependent chroma compression algorithm (image compression)

https://www.bitsnbites.eu/a-spatial-domain-variable-block-size-luma-dependent-chroma-compression-...
41•mbitsnbites•3d ago•5 comments

FDA intends to take action against non-FDA-approved GLP-1 drugs

https://www.fda.gov/news-events/press-announcements/fda-intends-take-action-against-non-fda-appro...
114•randycupertino•8h ago•241 comments

The AI boom is causing shortages everywhere else

https://www.washingtonpost.com/technology/2026/02/07/ai-spending-economy-shortages/
314•1vuio0pswjnm7•19h ago•502 comments

Learning from context is harder than we thought

https://hy.tencent.com/research/100025?langVersion=en
235•limoce•4d ago•125 comments

OpenCiv3: Open-source, cross-platform reimagining of Civilization III

https://openciv3.org/
907•klaussilveira•1d ago•277 comments

Where did all the starships go?

https://www.datawrapper.de/blog/science-fiction-decline
160•speckx•4d ago•244 comments

Selection rather than prediction

https://voratiq.com/blog/selection-rather-than-prediction/
36•languid-photic•4d ago•17 comments

Show HN: Look Ma, No Linux: Shell, App Installer, Vi, Cc on ESP32-S3 / BreezyBox

https://github.com/valdanylchuk/breezydemo
304•isitcontent•1d ago•39 comments

An Update on Heroku

https://www.heroku.com/blog/an-update-on-heroku/
498•lstoll•1d ago•332 comments

Sheldon Brown's Bicycle Technical Info

https://www.sheldonbrown.com/
447•ostacke•1d ago•114 comments

Monty: A minimal, secure Python interpreter written in Rust for use by AI

https://github.com/pydantic/monty
314•dmpetrov•1d ago•158 comments
Open in hackernews

AI API Prices are 90% Subsidized

https://tinyml.substack.com/p/the-unsustainable-economics-of-llm
27•csoham•7mo ago

Comments

PaulHoule•7mo ago
When the AI hype train left the station I said "we don't understand how these things work at all and they're going to get much cheaper to run" and that turned out to be... true.

Already vendors of legacy models like ChatGPT-4 have to subsidize inference to keep up with new entrants based on a better foundation. It's likely that inference costs can be brought down by another factor of ten or so so of course you have to 90% subsidize these to get where the industry will be in 2-3 years.

revskill•7mo ago
No lol. The quality is mostly bad. Basically u need to prompt in detail like writing a novel for llm to understand. At that price, we want real AI who can really have common sense, not just an autocompletion tool.

Stop adverting LLM as AI, instead sell it as a superior copy & paste engine.

What's worst about LLM, is the more you talk with it, the worse it became to the point of broken.

mrtksn•7mo ago
Subsidized is probably not the correct word here, it's probably more like loss leader in the race of the land grab.

It's like the early days of the internet when everything was amazing and all the people who put money into this thing were "losing" their money.

It's going to be like this until monopolization and moat becomes defensible and then they will enshittify the crap of it and make their money back 10x, 100x etc.

apsec112•7mo ago
This ignores batching - token generation is much more efficient in batch - and I strongly suspect is itself written by AI, given the heavy use of bullets
biophysboy•7mo ago
is it common for adjacent tokens to use the same weights in a memory cache?
twoodfin•7mo ago
The “X—not Y” pattern is also a dead giveaway.
GaggiX•7mo ago
This calculation doesn't account for batches, it makes no sense.
BriggyDwiggs42•7mo ago
On average how much does batching bring costs down?
GaggiX•7mo ago
It balances the computing and memory bandwidth bottleneck so by a lot, with continuous batching you can easily see a x10, x20 or more.
BriggyDwiggs42•7mo ago
Wow! Thanks.
impure•7mo ago
I’ve been playing around with Gemma E4B and have gotten really good results. That’s a model you can run on a phone. So although prices have been going up recently I suspect they will start to fall again soon.
python273•7mo ago
A much better article on token prices: https://www.tensoreconomics.com/p/llm-inference-economics-fr...

There's not much incentive to subsidize prices for OpenRouter providers for example, and the prices are much lower than the $6.37/M estimate from the article.

https://openrouter.ai/meta-llama/llama-3.3-70b-instruct

avg $0.37/M input tokens, $0.73/M output tokens (21 providers)

Llama is not even a good example, as the recent models are more optimized using Mixture Of Experts and KV cache compression.

daft_pink•7mo ago
Also, it ignores the fact that they will optimize it and make it more efficient like Moore’s law, so everyone is basically assuming that the price will come down over time.