Ask HN: Did Google know about RLHF (breakthrough) only after OpenAI shared it?
1•elarocks•1h ago
I still can't think of a reason why Google, Anthropic, and AWS did not know about, or invest in, RLHF before OpenAI shared its success implementing it at a scale that was viable. Would you say that if OpenAI had not shared its RLHF work, Google and Anthropic wouldn't be where they are today?
Comments
bigyabai•49m ago
RLHF is basically a fancy, overengineered GAN. Most of the industry could see that DPO was more efficient for fitting to human behavior.