Hacker News
new
top
best
ask
show
job
Show HN: Turboquant.cpp – Quantize embeddings to 1-4 bits, no training (400 LoC)
(
github.com
)
2 points
by
andrewmikhail
5 hours ago
1 comment
andrewmikhail
5 hours ago
[flagged]