Hacker News

New KV cache compaction technique cuts LLM memory 50x without accuracy loss (venturebeat.com)

8 points by mellosouls 8 hours ago | 1 comment

androiddrew 4 hours ago
I hope this is real.
