frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Ask HN: Did Google know about RLHF(breakthru) only after OpenAI shared

1•elarocks•2mo ago
I am still unable to think of a reason why Google, Anthropic and AWS did not know about/invest in RLHF, before OpenAI shared their success around implementing and in scale that was viable. Would you say that if OpenAI had not shared about RLHF, Google and Anthropic wouldn't be where they are today ?

Comments

bigyabai•2mo ago
RLHF is basically a fancy, overengineered GAN. Most of the industry could see that DPO was more efficient for fitting to human behavior.