frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

What about OpenCL and CUDA C++ alternatives?

https://www.modular.com/blog/democratizing-ai-compute-part-5-what-about-cuda-c-alternatives
25•eatonphil•5d ago

Comments

20k•1d ago
I've been programming in OpenCL for 10+ years. This article is clearly AI generated, and its also completely wrong

OpenCL did not fragment into a mess of vendor specific extensions. I don't know where that even comes from, the extensions in the 1.1/1.2 days were very minimal. The story goes like this:

1. OpenCL 1.2 turned up. This was the first really viable version of OpenCL, although 1.1 was usable. It started to gain a fair bit of traction

2. As OpenCL increasingly started to eat at Nvidia's cuda dominance, Nvidia deliberately killed OpenCL support by neglecting it. There literally did 0 technical development on it for years, with long running critical bugs remaining unfixed

3. At some point, OpenCL started to gain prominence in machine vision stuff in the embedded world, especially for cars and that space. Nvidia's poor OpenCL support became a liability, and so they heavily invested in OpenCL, fixed all the old bugs, and implemented OpenCL 3.0 support

4. Apple dropped OpenCL support because their anti competitive market strategy is vendor lock in, so a cross platform cross vendor API is bad for them. There's some kind of long running legal clusterfuck going on between Apple, and Khronos, the details of which are not public

5. AMD's OpenCL support was absolutely great. A+_best in class, would recommend. It had a fair bit of buggyness/jankyness, but it was an order of magnitude better than Nvidia's support back when Nvidia was deliberately hamstringing it. Then they started developing ROCm

6. Around this time, Vulkan was standardised. Khronos had been considering merging OpenCL into vulkan to force vendors to support it properly, but this was killed. Instead, we got the absolutely insane OpenCL-specific SPIRV format that nobody has ever used for anything

7. With the advent of ROCm, AMD abandoned OpenCL. They swapped over their good OpenCL compiler, to a new compiler that is much shitter and still has way worse performance. OpenCL is now badly implemented on top of ROCm, but some of it is so buggy its clear that nobody has ever used it. They refuse to implement 3.0 support, even though Nvidia supports it

8. The tables have turned: AMDs GPU compute support sucks donkeyballs, and Nvidia is lightyears ahead of them

OpenCL is not a story of fragmentation. Its a story of how vendors deliberately independently set out to kill it, along with AMD's incompetence, and Khronos making several boneheaded decisions during its standardisation that lead to it being cut off at the knees

Despite this, OpenCL 1.2 is still by far the best cross platform cross vendor GPU Compute API, because it is the only one that exists. Vulkan still doesn't support everything you need. Its a nightmare all around

gspr•1d ago
This is a fantastic overview. I learned a lot, thanks!

As someone whose only experience with GPU programming (except very high-level stuff via Torch) has been OpenCL, I gotta say I found it really pleasant! I've always wondered why it didn't take off, and hate the current situation.

I looked with hope at Vulkan, but am saddened by what you write about it. Could you elaborate on what the state of Vulkan Compute is, and where it might go from here?

Reubend•1d ago
Thanks so much for explaining this. I was just about to comment "Isn't Vulkan the main OpenCL and CUDA alternative these days?" could you share a bit more on what Vulkan is missing relative to the other two?
melodyogonna•1d ago
From what I can see, your comment does not disapprove the main points of the article... Which are that OpenCL development moved very slowly (design by committee always do) and that vendors did not contribute with all their cards at the table.

While you may have adopted OpenCL for a long time, Chris Lattner created it (with his team, of course), I wouldn't dismiss his account of the early history offhand.

Your ePub Is fine

https://andreklein.net/your-epub-is-fine-kobo-disagrees-blame-adobe/
583•sohkamyung•10h ago•195 comments

Apple Foundation Models

https://platform.claude.com/docs/en/cli-sdks-libraries/libraries/apple-foundation-models
123•MehrdadKhnzd•4h ago•34 comments

Even more batteries included with Emacs

https://karthinks.com/software/even-more-batteries-included-with-emacs/
195•signa11•7h ago•38 comments

What the Fuck Happened to Nerds

https://mrmarket.lol/what-the-fuck-happened-to-nerds/
106•vrnvu•1h ago•56 comments

Curl will not accept vulnerability reports during July 2026

https://daniel.haxx.se/blog/2026/06/15/curl-summer-of-bliss/
385•secret-noun•3h ago•136 comments

Show HN: Kage – Shadow any website to a single binary for offline viewing

https://github.com/tamnd/kage
563•tamnd•16h ago•110 comments

There Is(Ǝ) – Such That (∋)

https://www.fractalkitty.com/there-is-3-such-that/
34•evakhoury•3d ago•13 comments

Bitsy

https://bitsy.org/
197•tosh•3d ago•6 comments

Firewood Splitting Simulator

https://screen.toys/firewood/
819•memalign•5d ago•242 comments

21 years and counting of 'eight fallacies of distributed computing' (2025)

https://blog.apnic.net/2025/12/08/21-years-and-counting-of-eight-fallacies-of-distributed-computing/
87•teleforce•9h ago•16 comments

Dalus (YC W25) Is Hiring a Senior Software Engineer in Germany

https://www.ycombinator.com/companies/dalus/jobs/5IDmKJt-senior-software-frontend-engineer-german...
1•sebastianvoelkl•2h ago

Rio de Janeiro's "homegrown" LLM appears to be a merge of an existing model

https://github.com/nex-agi/Nex-N2/issues/4
350•unrvl22•18h ago•188 comments

Why does paper fold so well?

https://www.bbc.co.uk/programmes/w3ct8k70
42•zeristor•1d ago•14 comments

Being an old school web-based sports sim dev in the era of vibe coded games

https://zengm.com/blog/2026/06/vibecoded-games/
6•YesBox•2d ago•2 comments

A short history of Cerro Torre, the most controversial mountain (2012)

https://www.markhorrell.com/blog/2012/a-short-history-of-cerro-torre/
43•joebig•4d ago•16 comments

Ask HN: What are you working on? (June 2026)

232•david927•17h ago•839 comments

Formal methods and the future of programming

https://blog.janestreet.com/formal-methods-at-jane-street-index/?from_theconsensus=1
267•eatonphil•21h ago•94 comments

Show HN: Trace – Offline Mac meeting transcripts you can flag mid-call

https://traceapp.info
163•AG342•1d ago•59 comments

Chaosnet (1981)

https://tumbleweed.nu/r/lm-3/uv/amber.html
87•RGBCube•14h ago•12 comments

Exploring building a tiny FUSE filesystem

https://www.shayon.dev/post/2026/161/building-a-tiny-fuse-filesystem/
9•shayonj•2d ago•2 comments

TorchCodec 0.14: HDR Video Decoding for CPU and CUDA, and Fast Wav Decoder

https://github.com/meta-pytorch/torchcodec/releases/tag/v0.14.0
47•scott_s•4d ago•5 comments

The only scalable delete in Postgres is DROP TABLE

https://planetscale.com/blog/the-only-scalable-delete
169•hollylawly•3d ago•58 comments

Windows 11 users are tired of MS account requirements creeping into everything

https://www.windowscentral.com/microsoft/windows-11/windows-11-users-are-tired-of-microsoft-accou...
336•josephcsible•11h ago•230 comments

Perlisisms (1982)

https://www.cs.yale.edu/homes/perlis-alan/quotes.html
116•tosh•18h ago•57 comments

Caddy compatibility for zeroserve: 3x throughput and 70% lower latency

https://su3.io/posts/zeroserve-caddy-compat
186•losfair•19h ago•53 comments

Show HN: Discover Wikipedia articles popular on Hacker News

https://www.orangecrumbs.com/
114•octopus143•15h ago•26 comments

Segmented type appreciation corner (2018)

https://aresluna.org/segmented-type/
73•unexpectedVCR•3d ago•16 comments

Write for One Person

https://wizardzines.com/comics/write-for-one-person/
220•evakhoury•2d ago•65 comments

How to earn a billion dollars

https://paulgraham.com/earn.html
631•kingstoned•21h ago•1639 comments

I indexed 669 GB of my GoPro videos using my M1 Max computer and local ML models

370•iliashad•18h ago•94 comments