More competition among model vendors is great for developers!
I wonder what their plan is moving forward; they have been releasing a ton of random features lately.
We don't plan on reporting SWE-bench Verified, for similar reasons to OpenAI: https://openai.com/index/why-we-no-longer-evaluate-swe-bench...