Hacker News
Show HN: I built a 2nd-order PyTorch optimizer for LLMs that runs on 16GB GPUs
2 points by dnosoz 5 hours ago | 3 comments
dnosoz 5 hours ago
Author here. Happy to answer any deep-dive questions about the CUDA implementation or the Kronecker factorization math.
satvikpendem 5 hours ago
Your account is shadowbanned, by the way. I guess you've just been self-promoting too much.
lostmsu 4 hours ago
Does it actually improve time to target loss?