State tracking -> tool routing -> evaluation loops
What will be interesting is how reward functions are defined once these systems operate at larger scale.
nareyko•1h ago
State tracking -> tool routing -> evaluation loops
What will be interesting is how reward functions are defined once these systems operate at larger scale.