Foundations

Reinforcement Learning

RL algorithms, reward modelling, RLHF, policy gradients, Q-learning and multi-agent RL

831 lessons
Skills in this topic
RL Foundations (beginner): Formalise a problem as an MDP
Policy Gradient Methods (intermediate): Implement REINFORCE from scratch
RLHF & Alignment (advanced): Describe the RLHF pipeline end-to-end
Continue on Coursera (external links, free to audit)
Introduction to Learning
📚 External: Coursera ↗
Self-paced
Overview of Advanced Methods of Reinforcement Learning in Finance
📚 External: Coursera ↗
Self-paced
Creating a Team Culture of Continuous Learning
📚 External: Coursera ↗
Self-paced
Generative AI Advance Fine-Tuning for LLMs
📚 External: Coursera ↗
Self-paced
Optimizing Diversity on Teams
📚 External: Coursera ↗
Self-paced
Aléatoire : une introduction aux probabilités - Partie 1
📚 External: Coursera ↗
Self-paced