fp.
newest
Open in hackernews
PivotRL: High Accuracy Agentic Post-Training at Low Compute Cost
https://arxiv.org/abs/2603.21383
1
•
matt_d
•
1h ago