2 pointsby dcre4 hours ago1 comment
  • dcre4 hours ago
    A new Mythos checkpoint improves significantly on the previous one (and beats GPT-5.5-Cyber) on this benchmark.