> Reinforcement learning is a technical subject—there are whole textbooks written about it.
and then linking to the still wip RLHF book instead of the book on RL: Sutton & Barto.
mnkv•1h ago
> Reinforcement learning is a technical subject—there are whole textbooks written about it.
and then linking to the still wip RLHF book instead of the book on RL: Sutton & Barto.