4 pointsby hongbo_zhang3 hours ago2 comments
  • hongbo_zhang3 hours ago
    This is the benchmark between the latest models on a new programming language to avoid overfitting. Latest models are quite good over generalization to new languages, they can write tens of thousands of lines of code in one prompt that just works.
  • alontorres2 hours ago
    I do feel like the latest codex 5.2 and 5.3 have been really excellent in coding and have been giving opus a good fight. I still prefer Opus 4.6 as my daily driver but specifically for coding tasks I think codex 5.3 is the best, especially when considering value for money.
    • hongbo_zhang2 hours ago
      Another thing I like about codex 5.3 is that its CLI support queueing the message directly without using third party plugins. And it can run weeks without any issues, the CC used to have memory issues and stackoverflows.