Hacker News
new
top
best
ask
show
job
Show HN: 3.125-Bit LLM quantization bypassing tensor cores
(
blog.djellalmohamedaniss.workers.dev
)
3 points
by
dmaniss
2 hours ago
1 comment
dmaniss
2 hours ago
[flagged]