Agents start optimizing for passing the verification step, not necessarily for solving the task itself.
You sometimes see a similar effect in recommender systems: optimizing secondary metrics shifts system behavior.
So the system starts optimizing for passing the verifier, not necessarily for solving the task.
nareyko•1h ago
Agents start optimizing for passing the verification step, not necessarily for solving the task itself.
You sometimes see a similar effect in recommender systems: optimizing secondary metrics shifts system behavior.
chrisdudek•1h ago
nareyko•14m ago
So the system starts optimizing for passing the verifier, not necessarily for solving the task.