We just dropped a new open source project: CUDA to cuTile transpiler for NVIDIA's CUDA 13.1
We built a transpiler that converts your CUDA kernels to cuTile automatically. It figures out what your kernel does (flash attention, matrix multiplication, RoPE) and writes the cuTile version
Zero AI involved! It's pure pattern matching and code analysis
rncode•12h ago
We built a transpiler that converts your CUDA kernels to cuTile automatically. It figures out what your kernel does (flash attention, matrix multiplication, RoPE) and writes the cuTile version
Zero AI involved! It's pure pattern matching and code analysis
Currently supports 18 kernel patterns: - Core: GEMM, Reduction, Scan, Stencil, Elementwise, FFT - ML/DL: Convolution (1D/2D/3D), Pooling, Normalization - LLM: Flash Attention, RoPE, KV Cache, Quantization (INT8/FP8) - Specialized: Sparse matrices, Histogram, Sorting
Contributions we need:More kernel pattern templates