Hacker News
new
top
best
ask
show
job
Reducing TTFT by CPUMaxxing Tokenization
(
www.crusoe.ai
)
4 points
by
AlonKejzman
13 hours ago
3 comments
13 hours ago
undefined
AlonKejzman
13 hours ago
I am one of the researchers who worked on this, would love to hear your opinions
h011yM011y
13 hours ago
Does it work on Qwen3.5?
AlonKejzman
13 hours ago
Of course! It actually works out of the box, due to its generic design