Hacker News
new
top
best
ask
show
job
Why LLM APIs Shouldn't Ship UTF-8", "Stop Wasting Bandwidth on LLM Text APIs
(
github.com
)
3 points
by
Zombwaffle
4 hours ago
3 comments
oofbey
3 hours ago
The README claims that it costs 50-100 bytes per token when rendered to UTF8 text and wrapped in JSON. Citation needed please? JSON can be inefficient if you have lots of keys or use pretty whitespace. But UTF8 is very efficient. I don’t see it.
Zombwaffle
4 hours ago
[flagged]
Zombwaffle
4 hours ago
[flagged]