We're still running Grok 4.3 evals now that API access is widely available. So far it doesn't look like the frontier model, but it's definitely worth a mention. The field moves fast... We'll update the benchmarks and blog post within 24 hours to incorporate its full results.
gertlabs•1h ago