🎮 Reinforcement Learning
RL algorithms, reward modelling, RLHF, policy gradients, Q-learning and multi-agent RL
📚 Continue on Coursera
External links · Free to audit
1 / 3
View all →
RL algorithms, reward modelling, RLHF, policy gradients, Q-learning and multi-agent RL