Funny how there was a lot of concerns then about reward hacking, something I never hear anyone talk about with current AI
jhurliman•5mo ago
I think it just got folded under the umbrella concept of model alignment. And it moved from theoretical discussions to practical daily struggles with LLMs deleting failing unit tests
CGMthrowaway•5mo ago
jhurliman•5mo ago