GLM5.2 on AMD MI355X at 2626 tok/s/node at over 2x lower cost than Blackwell

https://www.wafer.ai/blog/glm52-amd
39•latchkey•2h ago

Comments

minraws•52m ago
Can you folks add performance per watt as a metric to these comparisons? I honestly want to understand where AMD fits in the stack in terms of actual performance per dollar. I have had talks with companies wanting to build data centers outside the US, and they find it hard to source anything Nvidia in sufficient capacity and scale.

If AMD is competitive on performance per watt and roughly reliable in terms of software support, that covers what most folks outside the US prioritize above all else, since outside of China and the US electricity tends to be at a relative premium.

Maybe if they make smaller data centers viable at the right price, AMD could be part of the stack outside the US wherever Nvidia is more limited in supply. Though I have genuinely no idea what sourcing an AMD GPU looks like.

I have never seen a company use AMD outside of wafer and a couple others mostly in US.

Genuinely intriguing or maybe not really (could be this stuff is common knowledge) and I am just stuck in my Nvidia bubble here.
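
For readers wondering what a performance-per-watt comparison would involve, here is a minimal sketch of the arithmetic. Only the 2626 tok/s figure comes from the post title; the node power draw and electricity price below are hypothetical placeholders, not measured or published values.

```python
# Sketch of a performance-per-watt comparison.
# ASSUMPTIONS: node power draw and electricity price are made-up placeholders.

def tokens_per_joule(tok_per_s: float, watts: float) -> float:
    """Throughput divided by power: tokens generated per joule of energy."""
    if watts <= 0:
        raise ValueError("watts must be positive")
    return tok_per_s / watts

def energy_cost_per_million_tokens(tok_per_s: float, watts: float,
                                   usd_per_kwh: float) -> float:
    """Electricity cost (USD) to generate one million tokens."""
    joules_per_token = watts / tok_per_s
    kwh_per_million_tokens = joules_per_token * 1e6 / 3.6e6  # 1 kWh = 3.6e6 J
    return kwh_per_million_tokens * usd_per_kwh

if __name__ == "__main__":
    node_tok_s = 2626.0     # aggregate figure from the post title
    node_watts = 10_000.0   # hypothetical whole-node power draw
    price_kwh = 0.25        # hypothetical electricity price in USD
    print(f"{tokens_per_joule(node_tok_s, node_watts):.3f} tokens/J")
    print(f"${energy_cost_per_million_tokens(node_tok_s, node_watts, price_kwh):.3f} per 1M tokens (energy only)")
```

Running the same two functions with measured wall power for each vendor's node would give the like-for-like number asked for here; energy is only one part of total cost.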

craftkiller•26m ago
> I have never seen a company use AMD

Meta is using AMD: https://www.amd.com/en/newsroom/press-releases/2026-2-24-amd...

And OpenAI: https://www.amd.com/en/newsroom/press-releases/2025-10-6-amd...

Twirrim•18m ago
> I have never seen a company use AMD outside of wafer and a couple others mostly in US.

There are a few using them, and even more starting to experiment with them. AMD has long been a source of disappointment around this side of things, so I'm hesitant to feel optimistic we'll finally get some competition. The market really needs viable competition to Nvidia, especially on performance/watt.

GZGavinZhao•18m ago
> roughly reliable in terms of software support

*chuckles*

I have no knowledge of the support for enterprise-grade hardware, but their consumer-grade hardware support is still quite atrocious. I believe in the AMD team, and since 2023 I've been watching them catch up with NVIDIA at an unprecedented speed thanks to AI, but no, I still wouldn't consider AMD's software support good, at least for consumer-level hardware running AI. The fact that the Vulkan backend of llama.cpp consistently outperforms the ROCm backend by a 5~10% margin on any model I run is just laughable (source: I run local LLMs and I always benchmark, but you can also find similar issues in llama.cpp).

oDot•52m ago
Do these providers have 80+% gross margins or is something eating into them? Maybe utilization?
technoabsurdist•18m ago
Hi, I work at Wafer. No, the margins are lower, averaging about 40%. Utilization is one of the highest-order bits in determining margins here, yes.
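
To illustrate why utilization dominates margins, here is a rough sketch with a fixed hourly node cost and revenue that scales with tokens served. The node cost and token price are hypothetical placeholders; only the 2626 tok/s peak and the ~40% margin figure come from the thread.

```python
# Sketch: gross margin as a function of utilization for a fixed-cost node.
# ASSUMPTIONS: node_cost_per_hour and usd_per_million_tokens are hypothetical.

def hourly_revenue(peak_tok_s: float, utilization: float,
                   usd_per_million_tokens: float) -> float:
    """Revenue per hour when the node serves `utilization` of its peak rate."""
    tokens_per_hour = peak_tok_s * utilization * 3600
    return tokens_per_hour / 1e6 * usd_per_million_tokens

def gross_margin(node_cost_per_hour: float, peak_tok_s: float,
                 utilization: float, usd_per_million_tokens: float) -> float:
    """(revenue - cost) / revenue; negative when the node loses money."""
    revenue = hourly_revenue(peak_tok_s, utilization, usd_per_million_tokens)
    if revenue <= 0:
        raise ValueError("revenue must be positive")
    return (revenue - node_cost_per_hour) / revenue

def utilization_for_margin(node_cost_per_hour: float, peak_tok_s: float,
                           usd_per_million_tokens: float,
                           target_margin: float) -> float:
    """Utilization needed to hit `target_margin` (0.0 means break-even)."""
    full_revenue = hourly_revenue(peak_tok_s, 1.0, usd_per_million_tokens)
    return node_cost_per_hour / ((1.0 - target_margin) * full_revenue)

if __name__ == "__main__":
    cost, peak, price = 20.0, 2626.0, 4.0  # cost and price are hypothetical
    for u in (0.5, 0.7, 0.9):
        print(f"utilization {u:.0%}: margin {gross_margin(cost, peak, u, price):.1%}")
    needed = utilization_for_margin(cost, peak, price, 0.40)
    print(f"utilization needed for a 40% margin: {needed:.1%}")
```

Because the cost term is fixed while revenue scales linearly with utilization, small drops in utilization cut margin disproportionately, which is consistent with the point made above.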
yieldcrv•37m ago
Agentic coding of drivers for different architectures is a massive unlock for the world.

So much compute is underutilized, waiting for a savant or company to prioritize an architecture, and now all the other engineers can tackle this at any time if they get inspired with the right prompts.

AussieWog93•20m ago
The 2600 tok/s is an "aggregate", not the actual throughput.
technoabsurdist•17m ago
Yes, it is 213 tok/s single-stream (so per user).
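
To make the aggregate versus single-stream distinction concrete, here is a back-of-the-envelope sketch using the two figures quoted in the thread. It assumes both numbers count the same kind of tokens and that no stream under load runs faster than the single-stream rate; per-stream speed usually drops as more requests are batched, so the result is a lower bound on concurrency, not a measurement.

```python
# Sketch: relating aggregate node throughput to per-user throughput.
# The two constants are quoted in the thread; the rest is illustrative.

AGGREGATE_TOK_S_PER_NODE = 2626.0  # summed over all concurrent requests
SINGLE_STREAM_TOK_S = 213.0        # what one user sees with no contention

def min_concurrent_streams(aggregate_tok_s: float, per_stream_tok_s: float) -> float:
    """Lower bound on concurrent streams needed to reach the aggregate rate,
    assuming no stream exceeds the single-stream rate."""
    if per_stream_tok_s <= 0:
        raise ValueError("per_stream_tok_s must be positive")
    return aggregate_tok_s / per_stream_tok_s

def seconds_to_generate(num_tokens: int, per_stream_tok_s: float) -> float:
    """Time one user waits for a response of `num_tokens` tokens."""
    return num_tokens / per_stream_tok_s

if __name__ == "__main__":
    streams = min_concurrent_streams(AGGREGATE_TOK_S_PER_NODE, SINGLE_STREAM_TOK_S)
    print(f"at least {streams:.1f} concurrent streams behind the aggregate figure")
    print(f"{seconds_to_generate(1000, SINGLE_STREAM_TOK_S):.1f} s for a 1000-token reply at the single-stream rate")
```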

Giant trees have no trouble pumping water to top branches

https://news.exeter.ac.uk/faculty-of-environment-science-and-economy/giant-trees-have-no-trouble-...
45•hhs•1h ago•20 comments

Odin, Wikipedia and Engagement Farming

https://katamari64.se/posts/2026/odin-wikipedia/
21•stock_toaster•46m ago•3 comments

Leanstral 1.5: Proof Abundance for All

https://mistral.ai/news/leanstral-1-5/
35•programLyrique•1h ago•4 comments

SearXNG: A free internet metasearch engine

https://github.com/searxng/searxng
110•theanonymousone•3h ago•26 comments

The circuit that lets your brain think and see

https://www.engineering.columbia.edu/about/news/circuit-lets-your-brain-think-and-see
16•hhs•1h ago•1 comment

Steam Controller Auto-Charge – pilot to magnetic charging puck using CV

https://github.com/FossPrime/Steam-Controller-Auto-Charge
31•zdw•1h ago•5 comments

Amsterdam invented the fire department

https://worksinprogress.co/issue/how-amsterdam-invented-the-fire-department/
25•zdw•1h ago•6 comments

Dispersion loss counteracts embedding condensation in small language models

https://chenliu-1996.github.io/projects/LM-Dispersion/
15•E-Reverance•1h ago•3 comments

Jamesob's guide to running SOTA LLMs locally

https://github.com/jamesob/local-llm
256•livestyle•9h ago•121 comments

Espionage Against the European Parliament

https://citizenlab.ca/research/member-of-committee-investigating-spyware-hacked-with-pegasus/
248•ledoge•3h ago•64 comments

Infracost (YC W21) Is Hiring a Marketing Lead to Shift FinOps Left

https://www.ycombinator.com/companies/infracost/jobs/YTJcFwr-marketing-lead
1•akh•3h ago

Applied Category Theory Course (2018)

https://math.ucr.edu/home/baez/act_course/index.html
38•measurablefunc•3h ago•5 comments

We put a Redis server inside our runtime

https://encore.dev/blog/redis-runtime
13•eandre•2d ago•5 comments

New serious vulnerabilities spiked around release of Claude Mythos Preview

https://epoch.ai/data-insights/cve-severity-spike
22•cubefox•2h ago•6 comments

FreeBSD ate my RAM

https://crocidb.com/post/freebsd-ate-my-ram/
80•theanonymousone•5h ago•32 comments

Africans Are Turning to Starlink

https://www.economist.com/middle-east-and-africa/2026/07/02/africans-are-turning-to-starlink
80•bookofjoe•3h ago•71 comments

International chess federation sanctions Kramnik

https://www.fide.com/fide-ethics-disciplinary-commission-issues-a-decision-in-case-involving-gm-v...
110•DarkContinent•7h ago•58 comments

Costco is the anti-Amazon

https://phenomenalworld.org/analysis/the-anti-amazon/
261•bookofjoe•8h ago•247 comments

Factories are just rooms

https://interconnected.org/home/2026/07/03/factories
180•arbesman•8h ago•73 comments

Hunting a 16-year-old SQLite WAL bug with TLA+

https://ubuntu.com/blog/hunting-a-16-year-old-sqlite-bug-with-tla-is-dqlite-affected
163•peterparker204•3d ago•12 comments

Software, from First Principles

https://fazamhd.com/mental-models/software/
17•faza•2h ago•6 comments

GitFut – Your GitHub stats turned into a World-Cup-style player card

https://gitfut.com
6•redbell•1h ago•4 comments

Show HN: Mcpsnoop – Wireshark for MCP (transparent proxy and live TUI)

https://github.com/kerlenton/mcpsnoop
45•kerlenton•7h ago•13 comments

Wordgard: In-browser rich-text editor from the creator of ProseMirror

https://wordgard.net/
255•indy•15h ago•90 comments

PostgreSQL and the OOM killer: Why we use strict memory overcommit

https://www.ubicloud.com/blog/postgresql-and-the-oom-killer-why-we-use-strict-memory-overcommit
150•furkansahin•11h ago•85 comments

I Wasn't Allowed Prompting ChatGPT During My Chalk Talk: This Is Discrimination (2025)

https://inpreparation.substack.com/p/opinion-i-was-not-allowed-to-type
135•theanonymousone•6h ago•71 comments

A peek into Reddit's anti-spam internals

https://lyra.horse/blog/2026/06/reddit-spam-internals/
155•OuterVale•6d ago•56 comments

Valve open-source the Steam Machine e-ink screen so you can make your own

https://www.gamingonlinux.com/2026/07/valve-open-source-the-steam-machine-e-ink-screen-so-you-can...
527•ahlCVA•11h ago•97 comments

Ask HN: Is anyone experimenting with different ways of using LLMs for coding?

123•yehiaabdelm•17h ago•149 comments