In the past month, OpenAI has released the following for Codex users:
- subagent support
- a better multi-agent interface (the Codex app)
- 40% faster inference
No joke, with the first two my productivity is already up like 3x. I am so stoked to try this out.
Whereas Claude Max 5x is enough that I don’t really run out with my usage patterns.
From that they realized they could just run API calls more like staff ones: fast, not at capacity, and leave the other billion people's calls on the remaining capacity.
https://thezvi.substack.com/i/185423735/choose-your-fighter
> Ohqay: Do you get faster speeds on your work account?
> roon: yea it’s super fast bc im sure we’re not running internal deployment at full load
Starting ChatGPT Plus web users off with the Pro model and later swapping it for the Standard model would meet the claims of model behavior consistency, while still qualifying as shenanigans.
In this particular case, I'm happy to report that the speedup is in time per token, so it's not a gimmick of outputting fewer tokens at lower reasoning effort. Model weights and quality remain the same.
Happy to retract if you can state [0] is false.
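For anyone who wants to sanity-check the time-per-token claim from the outside, here's a rough sketch using the OpenAI Python client (the model name and prompt are placeholders, not anything from this thread): time a few identical requests and divide by the completion token count the API reports. If a "speedup" came from emitting fewer tokens, the token counts drop; if it's genuinely per-token, the counts stay roughly stable while the ms/token figure falls.

```python
# Rough sketch, not an official benchmark. Assumes OPENAI_API_KEY is set.
import time
from openai import OpenAI

client = OpenAI()

MODEL = "gpt-5"  # placeholder; substitute whichever model you're testing
PROMPT = "Explain the CAP theorem in three paragraphs."

def one_run() -> tuple[int, float]:
    """Return (completion_tokens, elapsed_seconds) for a single request."""
    start = time.perf_counter()
    resp = client.chat.completions.create(
        model=MODEL,
        messages=[{"role": "user", "content": PROMPT}],
    )
    elapsed = time.perf_counter() - start
    return resp.usage.completion_tokens, elapsed

for tokens, elapsed in (one_run() for _ in range(5)):
    # Stable token counts + falling ms/token => a real per-token speedup,
    # not shorter outputs.
    print(f"{tokens} tokens in {elapsed:.1f}s -> {elapsed / tokens * 1000:.0f} ms/token")
```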
It's worth you guys doing, on your end, some analysis of why customers are getting worse results a week or two later, and putting out some guidelines about what context is poisonous and the like.
I.e.: "keep API model behavior constant" says nothing about the consumer ChatGPT web app, mobile apps, third-party integrations, etc.
Similarly, it might mean very specifically that a particular dated model snapshot remains constant, while the generic "-latest" (or whatever) model name auto-updates "for your convenience" to the new, faster behavior achieved through quantisation or reduced thinking time.
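For what that alias-vs-snapshot distinction looks like in practice, a minimal sketch with the OpenAI Python client (the specific model names are just examples of the two naming forms, not a recommendation of any version): pin the dated snapshot if you want behavior to stay fixed, and log the `model` field the response reports.

```python
from openai import OpenAI

client = OpenAI()

# Auto-updating alias: behavior can silently change when the alias is repointed.
alias_resp = client.chat.completions.create(
    model="chatgpt-4o-latest",
    messages=[{"role": "user", "content": "ping"}],
)

# Dated snapshot: pins one specific version of the model.
pinned_resp = client.chat.completions.create(
    model="gpt-4o-2024-08-06",
    messages=[{"role": "user", "content": "ping"}],
)

# Each response records which underlying model actually served the request,
# which is worth logging if you care about silent swaps.
print(alias_resp.model, pinned_resp.model)
```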
You might be telling the full, unvarnished truth, but after many similar claims from OpenAI that turned out to be only technically true, I remain sceptical.