Hacker News
From 800ms to ~25ms: harness-driven optimization of a CUDA matmul kernel (github.com)
3 points by icyace 5 hours ago | 1 comment
icyace 5 hours ago [dead]