frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication Through RL

https://github.com/deepreinforce-ai/CUDA-L2
35•dzign•1h ago

Comments

stonogo•41m ago
Am I reading this wrong, or does this only support FP16 inputs, and compares its performance against an FP32 solver?
bgwalter•38m ago
> To valid kernel correctness, we need to compare its output to a reference correct kernel with the same inputs.

No, you need a numerical proof, which you don't have.

krapht•26m ago
This is a standard which few kernels will ever meet. I'd say requiring a numerical proof is the same as requiring no proof at all - because it won't ever happen unless you're validating silicon or something equally expensive.
j2kun•25m ago
They claim the algorithm "discovered" the new techniques, but the methods described in section 5 do not seem all that novel to me. It smells like it could be "laundering" the literature [1] and reshuffling existing techniques. This is not inherently a bad thing, but I would hope that if it is borrowing existing techniques, the appropriate citation would eventually make it into this paper.

[1]: https://www.argmin.net/p/lore-laundering-machines

AlexCoventry•24m ago
In the future, we will all be Jürgen Schmidhuber. :-)
alyxya•6m ago
There generally aren't new techniques when optimizing something ubiquitous. Instead, there are a lot of ways to apply existing techniques to create new and better results. Most ideas are built on top of the same foundational principles.
alyxya•9m ago
The chart confused me because I expected to see performance numbers of CUDA-L2 compared to the others, but instead it shows a chart showing the speedup percentage of CUDA-L2 over the others. In some sense, the bar chart effectively inverts the performance of torch.matmul and cuBLAS with how much percentage it shows. 0% on the bar chart would only mean equal performance.

Plane crashed after 3D-printed part collapsed

https://www.bbc.com/news/articles/c1w932vqye0o
129•toss1•1h ago•87 comments

CUDA-L2: Surpassing cuBLAS Performance for Matrix Multiplication Through RL

https://github.com/deepreinforce-ai/CUDA-L2
35•dzign•1h ago•7 comments

Thoughts on Go vs. Rust vs. Zig

https://sinclairtarget.com/blog/2025/08/thoughts-on-go-vs.-rust-vs.-zig/
27•yurivish•35m ago•6 comments

Multivox: Volumetric Display

https://github.com/AncientJames/multivox
177•jk_tech•5h ago•21 comments

Transparent leadership beats servant leadership

https://entropicthoughts.com/transparent-leadership-beats-servant-leadership
322•ibobev•8h ago•155 comments

Why are 38 percent of Stanford students saying they're disabled?

https://reason.com/2025/12/04/why-are-38-percent-of-stanford-students-saying-theyre-disabled/
341•delichon•4h ago•512 comments

It’s time to free JavaScript (2024)

https://javascript.tm/letter
610•pavelai•13h ago•320 comments

Hammersmith Bridge – Where did 25,000 vehicles go?

https://nickmaini.substack.com/p/hammersmith-bridge
41•tobr•3h ago•31 comments

Django 6

https://docs.djangoproject.com/en/6.0/releases/6.0/
70•wilhelmklopp•1h ago•30 comments

PyTogether: Collaborative lightweight real-time Python IDE for teachers/learners

https://github.com/SJRiz/pytogether
43•indigodaddy•4h ago•3 comments

How elites could shape mass preferences as AI reduces persuasion costs

https://arxiv.org/abs/2512.04047
434•50kIters•13h ago•445 comments

I ignore the spotlight as a staff engineer

https://lalitm.com/software-engineering-outside-the-spotlight/
371•todsacerdoti•10h ago•169 comments

Show HN: Onlyrecipe 2.0 – I added all features HN requested – 4 years later

https://onlyrecipeapp.com/?url=https://www.allrecipes.com/turkish-pasta-recipe-8754903
93•AwkwardPanda•7h ago•82 comments

The RAM shortage comes for us all

https://www.jeffgeerling.com/blog/2025/ram-shortage-comes-us-all
253•speckx•2h ago•275 comments

Fighting the age-gated internet

https://www.wired.com/story/age-verification-is-sweeping-the-us-activists-are-fighting-back/
124•geox•8h ago•101 comments

Autism should not be treated as a single condition

https://www.economist.com/science-and-technology/2025/12/03/why-autism-should-not-be-treated-as-a...
150•bookofjoe•5h ago•206 comments

Converge (YC S23) is hiring a martech expert in NYC

https://www.runconverge.com/careers/technical-customer-success-manager
1•janhenr•5h ago

Feynman vs. Computer

https://entropicthoughts.com/feynman-vs-computer
50•cgdl•6h ago•19 comments

Who Hooked Up a Laptop to a 1930s Dance Hall Machine?

https://www.chrisbako.com/posts/2025-12-04-speelkok-museam
24•ChrisbyMe•3h ago•5 comments

Launch HN: Browser Buddy (YC W24) – A recommendation system for Internet writing

https://www.browserbuddy.com/
30•alien0006•5h ago•24 comments

Microsoft drops AI sales targets in half after salespeople miss their quotas

https://arstechnica.com/ai/2025/12/microsoft-slashes-ai-sales-growth-targets-as-customers-resist-...
313•OptionOfT•6h ago•229 comments

A Most Important Mustard

https://www.asimov.press/p/arabidopsis
11•surprisetalk•3d ago•0 comments

Functional Quadtrees

https://lbjgruppen.com/en/posts/functional-quadtree-clojure
104•lbj•8h ago•38 comments

PGlite – Embeddable Postgres

https://pglite.dev/
460•dsego•11h ago•99 comments

CJEU has made it effectively impossible to run a user-generated platform legally

https://www.techdirt.com/2025/12/04/eus-top-court-just-made-it-literally-impossible-to-run-a-user...
68•alsetmusic•2h ago•23 comments

Yawning abyss of the decimal labyrinth

https://oh4.co/site/numogrammaticism.html
11•austinallegro•1w ago•0 comments

Uncloud - Tool for deploying containerised apps across servers without k8s

https://uncloud.run/
331•rgun•16h ago•139 comments

A lost Amazon world just reappeared in Bolivia

https://www.frontiersin.org/news/2025/11/06/landscapes-that-remember-indigenous-peoples-thrived-a...
86•ashishgupta2209•3d ago•18 comments

RAM is so expensive, Samsung won't even sell it to Samsung

https://www.pcworld.com/article/2998935/ram-is-so-expensive-samsung-wont-even-sell-it-to-samsung....
329•sethops1•8h ago•307 comments

Tunnl.gg

https://tunnl.gg
145•klipitkas•12h ago•85 comments