Speculative pre-positioning: off-path decode for stateful inference sessions (arxiv.org)
1 point by logotype 6 hours ago | 1 comment
logotype 5 hours ago
With native support for sessions in an inference engine, we can make use of idle GPUs... doing work!