Navigator achieves pareto-domination over Gemini 2.5, Claude 4.5, and OpenAI Operator
• 10%-20% accuracy gains across benchmarks • 2-3x faster • Uniformly preferred in head-to-head human-evals
Navigator is trained with mid-training, SFT, and RL — RL not only on simulated web envs, but also direct interactions with live websites!
dhruvbatra•1h ago
Navigator achieves pareto-domination over Gemini 2.5, Claude 4.5, and OpenAI Operator
(Gemini 3 computer-use hasn't been released yet, so no comparison possible)Navigator is trained with mid-training, SFT, and RL — RL not only on simulated web envs, but also direct interactions with live websites!