frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Taking on CUDA with ROCm: 'One Step After Another'

https://www.eetimes.com/taking-on-cuda-with-rocm-one-step-after-another/
22•mindcrime•1h ago

Comments

blovescoffee•48m ago
Naive question, could agents help speed up building code for ROCm parity with CUDA? Outside of code, what are the bottlenecks for reaching parity?
jiggawatts•35m ago
Lack of focus from AMD management. See the sibling comment: https://news.ycombinator.com/item?id=47745611

They just don't care enough to compete.

WorldPeas•35m ago
to be honest, outside of fullstack and basic MCU stuff, these agents aren't very good. Whenever a sufficiently interesting new model comes out I test it on a couple problems for android app development and OS porting for novel cpu targets and we still haven't gotten there yet. I'd be happy to see a day where it was possible however
superkuh•44m ago
AMD hasn't signaled in behavior or words that they're going to actually support ROCm on $specificdevice for more than 4-5 years after release. Sometimes it's as little as the high 3.x years for shrinks like the consumer AMD RX 580. And often the ROCm support for consumer devices isn't out until a year after release, further cutting into that window.

Meanwhile nvidia just dropped CUDA/driver support for 1xxx series cards from their most recent drivers this year.

For me ROCm's mayfly lifetime is a dealbreaker.

canpan•32m ago
I was thinking to get 2x r9700 for a home workstation (mostly inference). It is much cheaper than a similar nvidia build. But still not sure if good value or more trouble.
chao-•22m ago
Talking to friends who have fought more homelab battles than I ever will, my sense is that (1) AMD has done a better job with RDNA4 than the past generations, and (2) it seems very workload-dependent whether AMD consumer gear is "good value", "more trouble", or both at the same time.
cyberax•14m ago
I have this setup, with 2x 32Gb cards. It's perfect for my needs, and cheaper than anything comparable from NV.
stephlow•4m ago
I own a single R9700 for the same reason you mentioned, looking into getting a second one. Was a lot of fiddling to get working on arch but RDNA4 and ROCm have come a long way. Every once in a while arch package updates break things but that’s not exclusive to ROCm.

LLM’s run great on it, it’s happily running gemma4 31b at the moment and I’m quite impressed. For the amount of VRAM you get it’s hard to beat, apart from the Intel cards maybe. But the driver support doesn’t seem to be that great there either.

Had some trouble with running comfyui, but it’s not my main use case, so I did not spent a lot of time figuring that out yet

hotstickyballs•25m ago
Driver support eats directly into driver development
lrvick•20m ago
ROCm is open source and TheRock is community maintained, and in a minute the first Linux distro will have native in-tree builds. It will be supported for the foreseeable future due to AMDs open development approach.

It is Nvidia that has the track record of closed drivers and insisting on doing all software dev without community improvements to expected results.

mindcrime•20m ago
Last year, AMD ran a GitHub poll for ROCm complaints and received more than 1,000 responses. Many were around supporting older hardware, which is today supported either by AMD or by the community, and one year on, all 1,000 complaints have been addressed, Elangovan said. AMD has a team going through GitHub complaints, but Elangovan continues to encourage developers to reach out on X where he’s always happy to listen.

Seems like they're making some effort in that direction at least. If you have specific concerns, maybe try hitting up Anush Elangovan on Twitter?

shmerl•33m ago
Side question, but why not advance something like Rust GPU instead as a general approach to GPU programming? https://github.com/Rust-GPU/rust-gpu/

From all the existing examples, it really looks the most interesting.

I.e. what I'm surprised about is lack of backing for it from someone like AMD. It doesn't have to immediately replace ROCm, but AMD would benefit from it advancing and replacing the likes of CUDA.

MobiusHorizons•10m ago
From the readme:

> Note: This project is still heavily in development and is at an early stage.

> Compiling and running simple shaders works, and a significant portion of the core library also compiles.

> However, many things aren't implemented yet. That means that while being technically usable, this project is not yet production-ready.

Also projects like rust gpu are built on top of projects like cuda and ROCm they aren’t alternatives they are abstractions overtop

HarHarVeryFunny•9m ago
If you don't want/need to program at lowest level possible, then Pytorch seems the obvious option for AMD support, or maybe Mojo. The Triton compiler would be another option for kernel writing.
lrvick•27m ago
Just spent the last week or so porting TheRock to stagex in an effort to get ROCm built with a native musl/mimalloc toolchain and get it deterministic for high security/privacy workloads that cannot trust binaries only built with a single compiler.

It has been a bit of a nightmare and had to package like 30+ deps and their heavily customized LLVM, but got the runtime to build this morning finally.

Things are looking bright for high security workloads on AMD hardware due to them working fully in the open however much of a mess it may be.

alecco•11m ago
Apple got it right with unified memory with wide bus. That's why Mac Minis are flying for local models. But they are 10x less powerful in AI TOPS. And you can't upgrade the memory.

I really wish AMD and Intel boards get replaced by competent people. They could do it in very short time. Both have integrated GPUs with main memory. AMD and Intel have (or at least used to have) serious know-how in data buses and interconnects, respectively. But I don't see any of that happening.

ROCm? It can't even support decent Attention. It lacks a lot of features and NVIDIA is adding more each year. Soon they will reach escape velocity and nobody will catch them for a decade. smh

caycep•9m ago
Granted, I feel like NVIDIA GPU pricing is such that Mac minis will be way less than 10x cheaper if not already, so one might still get ahead purchasing a bulk order of Mac minis....
KennyBlanken•1m ago
A 5090 will cost you about the same amount of money as a Mac Studio M3 Ultra with eight times the RAM.

It's pretty insane how overpriced NVIDIA hardware is.

Surely there must be a way to make container secrets less dangerous?

https://dalmatian.life/2026/04/11/surely-there-must-be-a-way-to-make-container-secrets-less-dange...
1•birdculture•6m ago•0 comments

You can have an RSS dependent website in 2026

https://matduggan.com/you-can-absolutely-have-an-rss-dependent-website-in-2026/
1•OberstKrueger•8m ago•0 comments

The AI Industry's Most Expensive Mistake

https://www.thealgorithmicbridge.com/p/inside-the-ai-industrys-most-expensive
1•gmays•10m ago•0 comments

A Macroeconomic Perspective on Stock Market Valuation Ratios

https://researchdatabase.minneapolisfed.org/concern/publications/5x21tf60n
1•mooreds•19m ago•1 comments

2026 Is the New 2016

https://en.wikipedia.org/wiki/2026_is_the_new_2016
3•CHB0403085482•19m ago•0 comments

Why Trump Mishandled Iran

https://foreignpolicy.com/2026/04/08/iran-war-ceasefire-trump-united-states-israel/
1•mooreds•20m ago•1 comments

Any USB drive or cable you plug in might be a silent killer

https://www.askwoody.com/newsletter/free-edition-any-usb-drive-or-cable-you-plug-in-might-be-a-si...
3•devonnull•21m ago•0 comments

Made an eBPF syscall tracer with a live TUI

https://github.com/pandaadir05/snoop
1•pigeon1231•21m ago•1 comments

Old, Discontinued Fiats Are Outselling New Fiats

https://www.jalopnik.com/2144450/old-discontinued-fiat-500x-outselling-new-500e/
1•mooreds•22m ago•0 comments

I solved NP‑complete problems by turning them into planets

https://github.com/TheAnalyticalAbsurdist/vedic-planetary-transformers
1•AbsurdityBureau•26m ago•0 comments

Gliding on Snow: One Man's Dream

https://bd-numerique.museebombardier.com/en/
2•sxzygz•28m ago•0 comments

Give Them Two Choices

https://chadnauseam.com/coding/tips/give-them-two-choices
1•ChadNauseam•31m ago•0 comments

The AI Productivity Paradox: Why the AI Multiplier Is Less Than 2x

https://www.forrestthewoods.com/blog/the-ai-productivity-paradox/
1•forrestthewoods•32m ago•0 comments

Javier Milei's bribery scandal may have derailed Argentina's crypto investment

https://www.theguardian.com/world/2026/apr/12/crypto-javier-milei-bribery-argentina
1•kasperni•34m ago•0 comments

Language, Curiosity and Life – By Masato Hagiwara

https://masatohagiwara.net/lcl.html
1•jotaefea•42m ago•0 comments

New metal with triple copper's heat conduction challenges fundamental physics

https://www.scientificamerican.com/article/new-metal-with-triple-coppers-heat-conduction-challeng...
3•thunderbong•56m ago•1 comments

Harvesting easter eggs: An exploratory study of enjoying transnarrative media

https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0341588
1•PaulHoule•1h ago•0 comments

ORAC-NT MedChem Copilot that blocks synthetically infeasible molecules

https://github.com/Kretski/ORAC-NT
1•DREDREG•1h ago•0 comments

Show HN: Local LLM on a Pi 4 controlling hardware via tool calling

https://github.com/stfurkan/pi-llm
2•stfurkan•1h ago•3 comments

How to Split Mounjaro Pens

https://henryaj.substack.com/p/you-can-just-take-less-mounjaro
1•henryaj•1h ago•0 comments

Memoir to Secure Computing Technology Corp

https://cryptosmith.com/r/sctc/
1•relaunched•1h ago•1 comments

Postgres Advisory Locks: Deadlocks, Key Packing, and Fetch-Lock-Refetch

https://dteather.com/blogs/postgres-advisory-locks/
2•davidteather•1h ago•0 comments

Google Externalised the Cost of Renaming Gmail

https://nativerse-ventures.com/gmail-rename-cascade
8•Srinathprasanna•1h ago•1 comments

Show HN: A native C/GTK Linux tray internet monitor to log ISP dropouts

https://github.com/lsferreira42/internet-indicator
1•lsferreira42•1h ago•0 comments

Show HN: A social feed with no strangers

https://www.grateful.so/
2•rpatni•1h ago•0 comments

A protective case for your glasses

https://thangs.com/designer/Gazzaladra/3d-model/Glasses%20case-1511373
2•dsego•1h ago•0 comments

Taking on CUDA with ROCm: 'One Step After Another'

https://www.eetimes.com/taking-on-cuda-with-rocm-one-step-after-another/
22•mindcrime•1h ago•18 comments

Vegetative Patients May Be More Aware Than We Knew

https://www.nytimes.com/2026/04/09/magazine/vegetative-states-conscious-aware.html
7•bookofjoe•1h ago•1 comments

Should capitalism be a choice?

https://cyrusradfar.com/thoughts/should-capitalism-be-a-choice
6•cyrusradfar•1h ago•2 comments

Major history podcasts are being faked

https://old.reddit.com/r/podcasting/comments/1sjgscs/major_history_podcasts_are_being_faked/
1•softwaredoug•1h ago•0 comments