Hacker News
FairyFuse: Multiplication-Free LLM Inference on CPUs via Fused Ternary Kernels (arxiv.org)
17 points by PaulHoule 13 hours ago | 1 comment
Reubend 7 hours ago
Paper looks great. No GitHub link that I can find though. Maybe I'll take a crack at an implementation if I've got some extra free time.