"One thing I really suspect we'll see a lot more of is much more generous rate limits at 'off peak' times - likely to be early morning UTC - as there is no doubt a lot of "idle" compute sitting there"
I strongly suspect this will end up in the opposite happening - where peak tokens are far more "expensive" (whether that be thru usage limits of API costs) than off-peak.
PS: Anthropic have managed to improve reliability but are absolutely shredding opus tok/s at peak times. It absolutely crawls on the web (maybe 2-3 tok/s?) and I believe that on non-max plans it's also incredibly slow on claude code.
Also, my (extremely naive) understanding is that at the cutting edge, hardware is diverging for training vs inference. That might not be true for Anthropic though.
What I wish Anthropic would do is be a lot more explicit about what windows apply when. Surely they have the data to say "you get X usage from hours A to B, Y usage from B to C"
We all know these services see huge load spikes and sometimes service degradation when America wakes up, and I bet they'd appreciate it if as many "chug-and-plug" agent workflows moved to overnight hours as possible.
Plus we are technologists, we want to try out different stuff and compare.
But increasingly I'm using Claude for basically all real coding. I ask Gemini and Codex questions, but I'm honestly in awe at Opus' ridiculous capabilities.
Before that I would totally agree with you, it felt really endless
Regular price window around the world: https://www.worldtimebuddy.com/?qm=1&lid=5368361,5128581,316...
https://platform.claude.com/docs/en/about-claude/models/over...
From my understanding: Peak time (non-promo): UTC 12:00–18:00 / KST (UTC+9): 21:00–03:00 Off-peak time (promo): UTC 18:00–12:00 / KST (UTC+9): 03:00–21:00
I guess I’ll need to do more coding during the daytime.
codex is still in minority use but has taken many customers from them over a short period.
So us European folks get promotional rates during the morning and evening.
EDIT: Actually, because the promo ends at the end of March, it'll all be within DST shenanigans. So peak times are 12:00–18:00 London, 13:00–19:00 Berlin.
Perhaps an opportunity for them to improve workload scheduling orchestration, like submitting a job to a distributed computing cluster queue, to smooth demand and maximize utilization.
If they are doing it “right” I think any off peak usage should count 50% toward your weekly limits.
Edit: it does look like they are doing it the "right" way.
> No. The additional usage you get during off-peak hours doesn’t count toward any weekly usage limits on your plan.
I'm not sure how to square that with the quote you gave.
Thanks for clearing that up. It'll help me schedule stuff in the future.
If you are only using a session a day, you're wasting a session. :)
So they could “double” your usage by keeping it the same and then simply halving peak usage.
There is no way 5-11 AM PT is peak traffic
* This is the regular price window, the rest is the promo usage.
I dunno y’all; feels like free drug samples. Who would ever think of coding without it?