Reward models for LMs are fundamentally broken
https://twitter.com/vijaytarian/status/2069438063345115187
1 • panthertrax • 1h ago