Hacker News
Storage based KVCache for denser token factory (blogs.oracle.com)
1 point by baruch 5 hours ago | 1 comment
baruch 5 hours ago
It is possible to get more tokens out of the same hardware by using fast storage for the KVCache. This is especially useful for agentic workloads.