Foundations

Reinforcement Learning

RL algorithms, reward modelling, RLHF, policy gradients, Q-learning and multi-agent RL

831 lessons
Skills in this topic
RL Foundations · beginner · Formalise a problem as an MDP
Policy Gradient Methods · intermediate · Implement REINFORCE from scratch
RLHF & Alignment · advanced · Describe the RLHF pipeline end-to-end
What is RLHF (Reinforcement Learning from Human Feedback) ? | The Secret Ingredient Behind ChatGPT
2:15
Reinforcement Learning ⚡ AI Lesson
VLR Software Training · Beginner · 7mo ago
What is Reinforcement Learning from Human Feedback (RLHF)
0:54
Data Science Made Easy · Beginner · 7mo ago
Reinforcement Learning from Human Feedback (RLHF): The Secret Behind Smarter AI Models
3:40
AI Study Hub · Beginner · 1y ago
🔥 How AI Really Learns: The Power of RLHF (Reinforcement Learning from Human Feedback)
1:00
Sadie Mir | AI Tools + Agents · Beginner · 1y ago
RLHF: What is it and how does it work? Reinforcement Learning from Human Feedback #ai #learnai
1:29
Harper Carroll AI · Beginner · 1y ago
What is RLHF (or reinforcement learning from human feedback)
0:31
Diansaurbytes 🦖 - Tech, Startups, AI · Beginner · 1y ago
RLHF in 60 Seconds #ReinforcementLearning #MachineLearning #AI
0:48
AI Beware · Beginner · 2y ago
How RLHF, Reinforcement Learning from Human Feedback, Works #ai#learnai#artificialintelligence#learn
0:58
Harper Carroll AI · Beginner · 2y ago
Reinforcement Learning Human Feedback (RLHF) #shorts #samaltman #ai #lexfridman
0:57
Money YCR · Beginner · 2y ago
#Shorts Reinforcement Learning from Human Feedback (RLHF)
0:59
Super Data Science: ML & AI Podcast with Jon Krohn · Beginner · 2y ago
What is Reinforcement Learning with Human Feedback (RLHF) ?
3:34
Data Science in your pocket · Beginner · 3y ago
Learn about Reinforcement Learning from Human Feedback - ChatGPT / RLHF HuggingFace Course
2:50
Discover AI · Beginner · 3y ago
📚 Continue on Coursera External links · Free to audit
Q Learning in Reinforcement Training Basics
📚 External: Coursera ↗
Self-paced
Marketing Design with Easil
📚 External: Coursera ↗
Self-paced
Reinforcement Learning in Finance
📚 External: Coursera ↗
Self-paced
Inspiring and Motivating Individuals
📚 External: Coursera ↗
Self-paced
Aléatoire : une introduction aux probabilités - Partie 1
📚 External: Coursera ↗
Self-paced
Algorithms, Data Collection, and Starting to Code
📚 External: Coursera ↗
Self-paced