
Ask HN: Is anyone using AMD GPUs for their AI workloads?

4•technoabsurdist•6h ago
^ title. I've been renting MI300Xs because they're cheaper than H100s, and my experience has been generally OK (smoother than I expected, given how much people shit on AMD online). ROCm 6.x seems decent out of the box now, and I'll happily spend 30 more minutes setting up my GPU if it means paying 20% less. That being said, it's still annoying to run LLM inference on AMD hardware (e.g. you have to install vLLM from source), and some other small details still suck. As a small example, nvidia-smi gives you a nice, clear interface, while rocm-smi dumps 3 pages of output that's hard to navigate.

Would be curious to hear experiences from other folks experimenting with AI workloads.
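
One way to tame the rocm-smi output complaint above is to ask for JSON and collapse it to one line per device. The snippet below is only a rough sketch, not an official tool: it assumes `rocm-smi` is on the PATH and accepts the `--showuse`, `--showmemuse` and `--json` flags, and it assumes the JSON is a mapping from device names to field/value pairs. The helper names `summarize` and `read_rocm_smi` are made up for illustration, and no field names are hardcoded since they can differ between ROCm releases.

```python
import json
import subprocess


def summarize(smi_json: str) -> list:
    """Collapse rocm-smi JSON text into one compact line per device.

    Assumes the top level maps device names to dicts of fields;
    anything that is not a dict is skipped.
    """
    data = json.loads(smi_json)
    lines = []
    for device, fields in sorted(data.items()):
        if not isinstance(fields, dict):
            continue  # skip non-device entries, if any
        pairs = ", ".join(f"{key}={value}" for key, value in fields.items())
        lines.append(f"{device}: {pairs}")
    return lines


def read_rocm_smi() -> str:
    """Run rocm-smi and return its JSON output as text.

    Assumption: these flags exist on the installed rocm-smi version.
    """
    result = subprocess.run(
        ["rocm-smi", "--showuse", "--showmemuse", "--json"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout
```

On a machine with ROCm installed, `print("\n".join(summarize(read_rocm_smi())))` should give a short per-GPU summary instead of several pages of text.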

Comments

dlcarrier•4h ago
I'm using an MI25, flashed as a PRO WX 9100, which requires an older version of ROCm to work. That's expected, because my GPU is deprecated in newer versions of ROCm, but what irks me is that everything neural-network-related barely works. You need the exact version of every interpreter and library, which ends up working on some distributions but not others. I've noticed that when people program in compiled languages, they seem to make a concerted effort to do some kind of bounds testing, but anything in Python or Node.js seems to be released as soon as it kind-of-sort-of works, some of the time.
technoabsurdist•2h ago
Oh yeah, in my experience anything below ROCm 6.x really sucks.

I tried to run Qwen2.5-32B on ROCm 5.x and it was running at <15 tok/s lol.

Have you tried running any sort of LLM inference on your MI25, or what NN workloads are you running?
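
For anyone weighing a cheaper hourly rate against throughput like the tok/s figure above, the arithmetic is simple enough to sketch. This is a back-of-the-envelope helper, not a benchmark: the function name `cost_per_million_tokens` is made up for illustration, and the prices in the example are placeholders, not real MI300X or H100 rental rates.

```python
def cost_per_million_tokens(price_per_hour: float, tokens_per_second: float) -> float:
    """Return the rental cost of generating one million tokens.

    price_per_hour: hourly GPU rental price (any currency).
    tokens_per_second: sustained generation throughput.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    tokens_per_hour = tokens_per_second * 3600
    return price_per_hour / tokens_per_hour * 1_000_000


# Placeholder numbers only: GPU B is 20% cheaper per hour than GPU A.
gpu_a = cost_per_million_tokens(price_per_hour=2.50, tokens_per_second=40.0)
gpu_b = cost_per_million_tokens(price_per_hour=2.00, tokens_per_second=40.0)
print(f"GPU A: {gpu_a:.2f} per 1M tokens, GPU B: {gpu_b:.2f} per 1M tokens")
```

At equal throughput the 20% hourly discount carries straight through to cost per token. If the cheaper card is also slower, it only comes out ahead when its throughput stays above 80% of the other card's.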
