Modular beat Nvidia's cuBLAS kernels on B200s in 170 LOC
https://twitter.com/AliesTaha/status/1970510268745896036
3 • pbd • 4mo ago
Comments
pbd • 4mo ago
kernel optimisation is paying $$$