Hacker News
How to Beat Unsloth's CUDA Kernel Using Mojo–With Zero GPU Experience
(www.modular.com)
3 points by timmyd 24 days ago | 1 comment
timmyd 24 days ago
David Robertson took a quantization challenge designed for CUDA experts, solved it in Mojo with AI assistance, and ended up with a kernel 1.07x to 1.84x faster than the state-of-the-art C++/CUDA implementation.