AMD Publishes Open-Source Driver for GPU Virtualization, Radeon "In the Roadmap"

https://www.phoronix.com/news/AMD-GIM-Open-Source

39•davidlt•2h ago

Comments

janpmz•1h ago

This article is almost unreadable for me. The ads change in size and make the text jump. I'm adding it to NotebookLM now.

Mountain_Skies•1h ago

The article is extremely light on details anyway. The most important thing in it is the link to the repo at https://github.com/amd/MxGPU-Virtualization

proxysna•1h ago

That's pretty sick. Nice to see such things trickle down to consumer GPU's.

seanhunter•1h ago

It blows my mind how reliably AMD shoots itself in the foot. What we want isn’t that hard:

1) Support your graphics cards on linux using kernel drivers that you upstream. All of them. Not just a handful - all the ones you sell from say 18 months ago till today.

2) Make GPU acceleration actually work out of the box for pytorch and tensorflow. Not some special fork, patched version that you “maintain” on your website, the tip of the main branch for both of those libraries should just compile out of the box and give people gpu-accelerated ML.

This is table stakes but it blows my mind that they keep making press releases and promises like this that things are on the roadmap without doing thing one and unfucking the basic dev experience so people can actually use their GPUs for real work.

How it actually is: 1) Some cards work with rocm, some cards work with one of the other variations of BS libraries they have come up with over the years. Some cards work with amdgpu but many only work with proprietary kernel drivers which means if you don’t use precisely one of the distributions and kernel versions that they maintain you are sool.

2) Nothing whatsoever builds out of the box and when you get it to build almost nothing runs gpu accelerated. For me, pytorch requires a special downgrade, a python downgrade and a switch to a fork that AMD supposedly maintain although it doesn’t compile for me and when I managed to beat it into a shape where it compiled it wouldn’t run GPU accelerated even though games use the GPU just fine. I have a GPU that is supposedly current, so they are actively selling it, but can I use it? Can I bollocks. Ollama won’t talk to my GPU even though it supposedly works with ROCm. It only works with ROCm with some graphics cards. Tensorflow similar story when I last tried it although admittedly I didn’t try as hard as pytorch.

Just make your shit work so that people can use it. It really shouldn’t be that hard. The dev experience with NVidia is a million times better.

faust201•1h ago

IIRC there was only one AMD employee that was working to integrate linux based things. Often, the response was - things are stuck in Intellectual property, or project managers etc. So even specs were not available.

logicchains•50m ago

SemiAnalysis had a good article on this recently, basically the reason AMD still sucks on the ML software side is that their compensation for devs is significantly worse than competitors like NVidia, Google and OpenAI, so most of the most competent devs go elsewhere.

bayindirh•44m ago

AMD has two driver teams at this point. One of Linux/Open Source, one for Catalyst/Closed source, and they are not allowed to interact.

Because, there are tons of IP and trade secrets involved in driver development and optimization. Sometimes game related, sometimes for patching a rogue application which developers can't or don't fix, etc. etc.

GPU drivers are ought to be easy, but in reality, they are not. The open source drivers are "vanilla" drivers without all these case-dependent patching and optimization. Actually, they really work well out of the box for normal desktop applications. I don't think there are any cards which do (or will) not work with the open source kernel drivers as long as you use a sufficiently recent version.

...and you mention ROCm.

I'm not sure how ROCm's intellectual underpinnings are but, claiming lack of effort is a bit unfair to AMD. Yes, software was never their strong suit, but they're way better when compared to 20 years earlier. They have a proper open source driver which works, and a whole fleet of open source ROCm packages, which is very rigorously CI/CD tested by their maintainers now.

Do not forget that some of the world's most powerful supercomputers run on Instinct cards, and AMD is getting tons of experience from these big players. If you think the underpinnings of GPGPU libraries are easy, I can only say that the reality is very different. The simple things people do with PyTorch and other very high level libraries pull enormous tricks under the hood, and you're really pushing the boundaries of the hardware performance and capability-wise.

NVIDIA is not selling a tray full of switches and GPUs and require OEMs to integrate it as-is for no reason. On the other hand, the same NVIDIA acts very slowly to enable an open source ecosystem.

So, yes, AMD is not in an ideal position right now, but calling them incompetent doesn't help either.

P.S.: The company which fought for a completely open source HDMI 2.1 capable display driver is AMD, not NVIDIA.

throwaway48476•26m ago

If AMD does deliver on client dGPU virtualization it would be amazing.

A generalist system prompt for software arch/design/code review/etc.

Cars and Key Fobs: Attacks on Car Remotes

Lustre v5.0.0 Released

Enumerating All Fractions

On loyalty to Your Employer

LogIT – FREE Expense tracker for your everyday needs

High-quality search engine without LLM-generated results?

Why Software Devs Keep Burning Out

Creating your own federated microblog

Ctrl-Z: Controlling AI Agents via Resampling

Get to know the AI behind every video call with lily

Cursor vs. Windsurf – Choose the Right AI Code Editor for Your Team

Our brains can communicate wordlessly, through our eyes

Does using Rust make your software safer?

Redmine 6.0.0 is now available

Realistic roles for hydrogen in the future energy transition

North American Aviation's 1965 Plan for Piloted Planetary Flybys in the 1970s

MarineTraffic: Global Ship Tracking Intelligence

After EU fines, Big Tech wants Trump to swoop in

Kubernetes v1.33: Octarine

Squiggle Orbs

ASML, creator of lithography machines, has a messy software stack

Vim Language, Motions, and Modes Explained

Ask HN: Where to find new businesses in USA?

Punch any college you hate

How Gen Z Became the Most Gullible Generation

Ask HN: Built my own license key system, now facing the pricing dilemma

Building Cuiz AI: How I Used AI to Create a Document-to-Quiz Generator

Lessons from building and maintaining distributed systems at scale

In 1859, a South African declared himself emperor of the United States