6 pointsby gertlabs4 hours ago1 comment
  • gertlabs4 hours ago
    We're still running Grok 4.3 evals, since API access is now widely available. So far it looks like it's not the frontier model, but definitely worthy of mention. The field moves fast... The benchmarks and blog post will be updated within 24 hours to incorporate its full results.