frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

The Unbearable Cheapness of Open Weight Models

https://jamesoclaire.com/2026/06/25/the-unbearable-cheapness-of-open-weight-models/
26•ddxv•4h ago

Comments

linzhangrun•1h ago
It would not be surprising if GPT and Claude get cheaper too as inference gets cheaper. Two years ago, o1 was the strongest model and cost much more than Fable, while being nowhere near as smart as a Qwen 3.6 35B that you can now run on a DGX Spark without much trouble.
ddxv•54m ago
True, outside of the dark tactics I imagined in the article, they will have to compete at lower costs. It's just that the current iteration does not feel cost competitive yet.
odie5533•23m ago
This is what concerns me about how AI giants are planning to make money. Their product has already been commoditized at prices which for them are still subsidized to grab market share. Unless the giants invent a technological leap, their prices are going to be dragged down by open weight models and I don't see how they'll turn a profit.
arikrahman•12m ago
With cache hit rates being effectively free, harnesses like Reasonix have let me do a month of work for less than 2 dollars. It's not even the subsidies making it cheap, American providers like Digital Ocean or Cloudflare host the same model with similar pricing.

Half-Life 2 in a Browser

https://hl2.slqnt.dev/
82•panza•1h ago•18 comments

Anthropic says Alibaba illicitly extracted Claude AI model capabilities

https://www.reuters.com/world/china/anthropic-says-alibaba-illicitly-extracted-claude-ai-model-ca...
314•htrp•11h ago•539 comments

OpenAI unveils its first custom chip, built by Broadcom

https://techcrunch.com/2026/06/24/openai-unveils-its-first-custom-chip-built-by-broadcom/
653•jamdesk•13h ago•370 comments

Cloudflare launched self-managed OAuth for all

https://blog.cloudflare.com/oauth-for-all/
113•terryds•4h ago•42 comments

Markdy: Like Mermaid Diagrams, but for Motion

https://markdy.com
27•surprisetalk•1d ago•2 comments

Blogging can just be stating the obvious

https://blog.jim-nielsen.com/2026/blogging-stating-the-obvious/
184•Curiositry•7h ago•64 comments

LuaJIT 3.0 proposed syntax extensions

https://github.com/LuaJIT/LuaJIT/issues/1475
131•phreddypharkus•6h ago•75 comments

Dostoyevsky isn't difficult

https://www.autodidacts.io/dostoyevsky-isnt-difficult/
126•surprisetalk•2d ago•104 comments

Zombie unicorns are haunting Silicon Valley

https://www.economist.com/business/2026/06/21/zombie-unicorns-are-haunting-silicon-valley
77•andsoitis•4h ago•29 comments

Medical students are using popular research tool to pump out misleading studies

https://www.science.org/content/article/medical-students-are-using-popular-research-tool-pump-out...
28•rndsignals•4h ago•10 comments

Qualcomm to Acquire Modular

https://www.reuters.com/business/qualcomm-buy-ai-startup-modular-2026-06-24/
187•timmyd•17h ago•56 comments

45°C cooling design cuts data center water use to near zero

https://blogs.nvidia.com/blog/liquid-cooling-ai-factories/
289•nitin_flanker•16h ago•183 comments

Mixing Visual and Textual Code

https://arxiv.org/abs/2603.15855
43•doppioandante•5h ago•11 comments

RubyLLM: A Ruby framework for all major AI providers

https://rubyllm.com/
377•doener•16h ago•60 comments

GLM-5.2 is a step change for open agents

https://www.interconnects.ai/p/glm-52-is-the-step-change-for-open
201•vantareed•2d ago•111 comments

The Unbearable Cheapness of Open Weight Models

https://jamesoclaire.com/2026/06/25/the-unbearable-cheapness-of-open-weight-models/
26•ddxv•4h ago•4 comments

PR spam today looks like email spam in the early 2000s

https://www.greptile.com/blog/prs-on-openclaw
208•dakshgupta•16h ago•121 comments

Computer use in Gemini 3.5 Flash

https://blog.google/innovation-and-ai/models-and-research/gemini-models/introducing-computer-use-...
206•swolpers•13h ago•129 comments

Show HN: Write SaaS apps where users control where their data is stored

https://github.com/wolfoo2931/linkedrecords/
37•WolfOliver•5d ago•17 comments

Matt's Script Archive: The Scripts That Reshaped the Web

https://tedium.co/2026/06/22/matts-script-archive-retrospective/
39•1317•2d ago•10 comments

The Xteink X4 E-Ink Reader

https://blog.omgmog.net/post/xteink-x4-e-ink-reader/
218•felixdoerp•14h ago•125 comments

Show HN: Brain Frog – Can you be random enough for 11 lines of JavaScript?

https://brainfrog.lol
27•AlexanderZ•5d ago•23 comments

Bohemia Interactive: Cold War Assault Remastered Source Code on GitHub

https://github.com/BohemiaInteractive/CWR
7•dewey•2d ago•1 comments

Ending respiratory infections

https://blog.interceptfund.com/p/ending-respiratory-infections
135•EthanFantl•5h ago•60 comments

Show HN: Nub – A Bun-like all-in-one toolkit for Node.js

https://github.com/nubjs/nub
230•colinmcd•16h ago•66 comments

15 sorting algorithms in 6 minutes (2013) [video]

https://www.youtube.com/watch?v=kPRA0W1kECg
21•akkartik•2d ago•2 comments

A Practical Guide to SSH Tunnels: Local and Remote Port Forwarding

https://labs.iximiuz.com/tutorials/ssh-tunnels
311•signa11•5d ago•60 comments

Crawling BitTorrent DHTs for Fun and Profit [pdf]

https://www.usenix.org/legacy/event/woot10/tech/full_papers/Wolchok.pdf
88•dgellow•3d ago•34 comments

Exploring the internal representations of Pangram 3.3.2

https://www.pangram.com/pangram-space
23•krackers•5h ago•5 comments

Krea 2: SOTA open-weights 12B image model

https://www.krea.ai/blog/krea-2-technical-report
369•mattnewton•1d ago•40 comments