AI inference is obviously profitable

https://www.seangoedecke.com/ai-inference-is-obviously-profitable/
10•emirb•1h ago

Comments

blinded•30m ago
Agree fully.

Smaller models will only get better, which extends the useful life of older GPUs.

vannevar•14m ago
Two issues with this. One, it's profitable only if you assume you keep serving the same model forever, which is not realistic in this market. A given model has a shelf life, which these days is measured in months, not years, so trying to separate the cost of training the model from the cost of serving it doesn't make much business sense. Two, for providers that offer inference only via open-weight models, margins quickly get commoditized. The "someday" when frontier model providers can enjoy their current high inference margins without the burden of significant training costs is never going to arrive.

manwithopinions•3m ago
Being profitable on a per-token basis is meaningless. Ed Zitron doesn’t argue that it is impossible to offer profitable inference; he argues that the business as it stands today is deeply unprofitable and only getting worse because it isn’t a high-margin inference business.

Play out the most likely pessimistic scenario: LLMs are marginally useful but frontier models are overkill, so businesses just use dirt-cheap open-weight models on their own hardware and/or rent hardware instead of paying per token. Then what for OpenAI and Anthropic?

OpenAI’s business collapses if customers are happy with an LLM that costs $0.10 per million tokens even if it only costs OpenAI $0.05 in inference per million tokens. The insane bonkers claim from Garry Tan that in 2 years we will be using 90,000x as many tokens as today is… well, obviously not true.

The fixed costs that OpenAI and Anthropic have created need inference demand far beyond what is plausible.
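
To put rough numbers on the fixed-cost point, here is a minimal break-even sketch. The $0.10 price and $0.05 inference cost per million tokens come from the scenario above; the fixed-cost figure and the function name are purely hypothetical placeholders, not real numbers for any company.

```python
def breakeven_million_tokens(fixed_costs_usd, price_per_million, cost_per_million):
    """Millions of tokens that must be sold for gross margin to cover fixed costs."""
    margin = price_per_million - cost_per_million
    if margin <= 0:
        raise ValueError("no positive per-token margin, so fixed costs are never covered")
    return fixed_costs_usd / margin


# HYPOTHETICAL fixed costs (training, staff, datacenter commitments); illustration only.
HYPOTHETICAL_FIXED_COSTS_USD = 10e9

# Price and inference cost per million tokens, taken from the scenario in the comment.
millions = breakeven_million_tokens(HYPOTHETICAL_FIXED_COSTS_USD, 0.10, 0.05)
print(f"Break-even volume: {millions * 1e6:.2e} tokens")
```

With a margin of five cents per million tokens, required volume scales linearly with fixed costs, so a positive per-token margin alone says little about whether the business as a whole is profitable.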

Disputing the Declaration of Independence

https://www.historytoday.com/archive/feature/disputing-declaration-independence
1•pepys•2m ago•0 comments

Cosmic Fireworks by Infant Stars

https://www.space.com/astronomy/galaxies/infant-stars-celebrate-their-independence-with-cosmic-fi...
1•cybermango•5m ago•0 comments

Archivegenocide.com 64k videos show evidence of Israeli genocide

https://archivegenocide.com/
1•Dig1t•5m ago•0 comments

Show HN: Reading Assistant Physical Books Meta RayBans

1•roshangill•11m ago•0 comments

AI Trade Is Losing One of Its Key Signals

https://www.bloomberg.com/news/articles/2026-07-03/the-ai-trade-is-losing-one-of-its-key-signals-...
3•cybermango•16m ago•0 comments

Why Vancouver is always a stand-in for San Francisco in movies and TV shows

https://www.sfgate.com/sf-culture/article/vancouver-stand-in-movie-tv-sf-16613821.php
2•amichail•21m ago•0 comments

The Demoralization of the White-Collar Worker

https://nooneshappy.com/article/the-demoralization-of-the-white-collar-worker/
4•njrc•26m ago•0 comments

Illegible Benefits

https://aeon.co/essays/what-we-cant-measure-about-ai-yet
2•m-hodges•29m ago•0 comments

GLM5.2 on AMD MI355X at 2626 tok/s/node at over 2x lower cost than Blackwell

https://www.wafer.ai/blog/glm52-amd
2•latchkey•30m ago•0 comments

The Termi Protocol: Watch AI Coding Agents Build in 3D

https://termiprotocol.com/
1•jonbaer•31m ago•1 comment

Show HN: Durable AI agents without the workflow engine

https://www.noworkflows.dev/
3•iacguy•32m ago•0 comments

EVE Online's Carbon engine is now open source: Fenris Creations explains why

https://www.gamesindustry.biz/eve-onlines-carbon-engine-is-now-open-source-fenris-creations-expla...
4•Stevvo•33m ago•0 comments

Goodbye, Forever, Probably

https://whitep4nth3r.com/blog/goodbye-forever-probably/
13•backlit4034•37m ago•2 comments

Global Ocean Science Report: most complete picture of ocean science

https://www.ioc.unesco.org/en/articles/global-ocean-science-report-worlds-most-complete-picture-o...
3•bryanrasmussen•39m ago•0 comments

Labor Market Tightens Despite Tepid Job Growth as Labor Force Declines Further

https://wolfstreet.com/2026/07/02/labor-market-tightens-despite-tepid-job-growth-as-labor-force-d...
3•toomuchtodo•42m ago•1 comment

Bought an expired domain. Then I inherited their AWS Root account

https://easydns.com/blog/2026/07/03/bought-an-expired-startup-domain-a-few-days-later-i-inherited...
7•StuntPope•43m ago•1 comment

How to migrate from Proxmox VE 8 to 9: step-by-step guide (2026)

https://lucasaguiar.xyz/en/posts/migracao-proxmox-8-9-2026/
2•isfttr•44m ago•1 comment

Show HN: Theta-spec harness agnostic config surface

https://github.com/tamarillo-ai/theta-spec
3•ivanbelenky•44m ago•0 comments

sf house sept 26

5•jadenfix123•46m ago•0 comments

Software, from First Principles

https://fazamhd.com/mental-models/software/
3•faza•50m ago•0 comments

Show HN: Updated my landing page with Fable (retro pixel style)

https://www.tryguildly.com/
7•spiken23•50m ago•11 comments

Cadreen – memory, governance, self-healing, and execution as one system

4•ope_john•51m ago•0 comments

Best 3D Modeling Apps for iPad and Android (2026)

https://bambu3design.com/best-3d-modeling-apps-for-ipad-android-2026-create-3d-models-anywhere/
2•ehsanamel•53m ago•0 comments

You don't need Electron to build native apps in TypeScript [video]

https://www.youtube.com/watch?v=o5RDfAmzE7s
2•arbayi•54m ago•0 comments

AI First: How the Federal Government Is Prioritizing AI over People and Planet

https://stopgreedbuildgreen.climateandcommunity.org/posts/ai-first
23•eatox•58m ago•17 comments

Gaza's Children

https://gazaschildren.com/
13•abdelhousni•1h ago•3 comments

The Lost World (1925) [video]

https://archive.org/details/the.-lost.-world.-1925.1080p.-blu-ray.x-264-sadpanda
2•petethomas•1h ago•0 comments

New serious vulnerabilities spiked around release of Claude Mythos Preview

https://epoch.ai/data-insights/cve-severity-spike
3•cubefox•1h ago•0 comments

Show HN: Pulse v0.2.0

3•xerrs•1h ago•1 comment
