Hacker News
new
top
best
ask
show
job

Re-quantizing a local LLM 14x faster by skipping the tensors that didn't change(andreaborio.substack.com)

6 pointsby andreaborio6 hours ago1 comment

andreaborio6 hours ago
[dead]

Guidelines
FAQ
Lists
API
Security
Legal
Apply to YC
Contact

Search: