Hacker News
new
top
best
ask
show
job
Tell HN: Google increased existing finetuned model latency by 5x
13 points
by
deaux
2 days ago
1 comment
jpau
6 hours ago
Hey we're also a Vertex tuning customer in a similar spot. We're seeing other capacity issues, although not a leap in latency. Can you DM me? I'd love to trade notes.
https://x.com/hellofromjames