AI-generated CUDA kernels silently break training and inference [R]

By laginimaineb

· r/MachineLearning · May 27, 2026

Researchers found AI-generated CUDA kernels can pass benchmark verification while breaking transformer training and inference workloads.

Categories: Research

Excerpt

Last month NVIDIA released [SOL-ExecBench](https://research.nvidia.com/benchmarks/sol-execbench), a new benchmark of 235 production CUDA kernels lifted from DeepSeek, Qwen, Gemma, and Kimi. We took several top-ranked AI-generated submissions and tried using them in production workloads. Many of them broke, sometimes in surprising ways. One of those kernels is the fused embedding-gradient + RMSNorm backward pass, which runs at the end of every transformer training step. We took the fastest submission on the benchmark for it, and dropped it into the training loop of a small transformer. The kernel had passed the benchmark's verifier with room to spare. But in our training run, the loss diverged and never recovered. We started debugging. Replace the dataset distribution with uniformly sampled tokens, the divergence vanishes. Swap SGD for AdamW, also vanishes. This is the worst kind of bug for research. Symptoms and masks both look exactly like "the idea didn't work". It's the type of bug that can make researchers spend a long time debugging without knowing what's at fault: the dataset? the research idea? the architecture? or the implementation itself? Turns out, the actual bug is that the embedding-gradient half of the kernel accumulates in bf16 instead of fp32. Embedding backward sums many small gradient contributions into each token's row of the embedding matrix. With uniform random tokens the contributions spread evenly and bf16 precision is enough. In real text, a handfu

Read at source: https://www.reddit.com/r/MachineLearning/comments/1tpaw6x/aigenerated_cuda_kernels_silently_break_training/

Discussions

reddit · 68 points · 8 comments
reddit · 76 points · 10 comments
reddit · 91 points · 10 comments
reddit · 97 points · 10 comments
reddit · 111 points · 10 comments
reddit · 117 points · 11 comments
reddit · 118 points · 11 comments
reddit · 125 points · 11 comments
reddit · 142 points · 11 comments
reddit · 145 points · 12 comments
reddit · 151 points · 12 comments
reddit · 150 points · 14 comments
reddit · 165 points · 15 comments
reddit · 169 points · 15 comments
reddit · 183 points · 16 comments
reddit · 185 points · 16 comments
reddit · 193 points · 17 comments
reddit · 195 points · 17 comments
reddit · 206 points · 17 comments
reddit · 206 points · 17 comments
reddit · 209 points · 18 comments
reddit · 214 points · 18 comments
reddit · 214 points · 19 comments
reddit · 213 points · 19 comments
reddit · 217 points · 19 comments
reddit · 216 points · 19 comments
reddit · 224 points · 19 comments
reddit · 221 points · 19 comments
reddit · 226 points · 19 comments
reddit · 227 points · 19 comments
reddit · 224 points · 19 comments
reddit · 226 points · 20 comments
reddit · 230 points · 21 comments
reddit · 227 points · 23 comments
reddit · 228 points · 23 comments