Foundations

Reinforcement Learning

RL algorithms, reward modelling, RLHF, policy gradients, Q-learning and multi-agent RL

831

lessons

Skills in this topic

3 skills — Sign in to track your progress

View full skill map →

Formalise a problem as an MDP

Policy Gradient Methods

Implement REINFORCE from scratch

RLHF & Alignment

Describe the RLHF pipeline end-to-end

Videos 639 Reads 192

Level: All Beginner Intermediate Advanced

Any Length Short (<5m) Medium (5-20m) Long (>20m)

Newest Popular Oldest

What is RLHF (Reinforcement Learning from Human Feedback) ? | The Secret Ingredient Behind ChatGPT

Reinforcement Learning ⚡ AI Lesson

What is RLHF (Reinforcement Learning from Human Feedback) ? | The Secret Ingredient Behind ChatGPT

VLR Software Training Beginner 7mo ago

What is Reinforcement Learning from Human Feedback (RLHF)

Reinforcement Learning ⚡ AI Lesson

What is Reinforcement Learning from Human Feedback (RLHF)

Data Science Made Easy Beginner 7mo ago

Reinforcement Learning from Human Feedback (RLHF): The Secret Behind Smarter AI Models

Reinforcement Learning ⚡ AI Lesson

Reinforcement Learning from Human Feedback (RLHF): The Secret Behind Smarter AI Models

AI Study Hub Beginner 1y ago

🔥 How AI Really Learns: The Power of RLHF (Reinforcement Learning from Human Feedback)

Reinforcement Learning ⚡ AI Lesson

🔥 How AI Really Learns: The Power of RLHF (Reinforcement Learning from Human Feedback)

Sadie Mir | AI Tools + Agents Beginner 1y ago

RLHF: What is it and how does it work? Reinforcement Learning from Human Feedback #ai #learnai

Reinforcement Learning ⚡ AI Lesson

RLHF: What is it and how does it work? Reinforcement Learning from Human Feedback #ai #learnai

Harper Carroll AI Beginner 1y ago

What is RLHF (or reinforcement learning from human feedback)

Reinforcement Learning ⚡ AI Lesson

What is RLHF (or reinforcement learning from human feedback)

Diansaurbytes 🦖 - Tech, Startups, AI Beginner 1y ago

RLHF in 60 Seconds #ReinforcementLearning #MachineLearning #AI

Reinforcement Learning ⚡ AI Lesson

RLHF in 60 Seconds #ReinforcementLearning #MachineLearning #AI

AI Beware Beginner 2y ago

How RLHF, Reinforcement Learning from Human Feedback, Works #ai#learnai#artificialintelligence#learn

Reinforcement Learning ⚡ AI Lesson

How RLHF, Reinforcement Learning from Human Feedback, Works #ai#learnai#artificialintelligence#learn

Harper Carroll AI Beginner 2y ago

Reinforcement Learning Human Feedback (RLHF) #shorts #samaltman #ai #lexfridman

Reinforcement Learning ⚡ AI Lesson

Reinforcement Learning Human Feedback (RLHF) #shorts #samaltman #ai #lexfridman

Money YCR Beginner 2y ago

#Shorts Reinforcement Learning from Human Feedback (RLHF)

Reinforcement Learning ⚡ AI Lesson

#Shorts Reinforcement Learning from Human Feedback (RLHF)

Super Data Science: ML & AI Podcast with Jon Krohn Beginner 2y ago

What is Reinforcement Learning with Human Feedback (RLHF) ?

Reinforcement Learning ⚡ AI Lesson

What is Reinforcement Learning with Human Feedback (RLHF) ?

Data Science in your pocket Beginner 3y ago

Learn about Reinforcement Learning from Human Feedback - ChatGPT / RLHF HuggingFace Course

Reinforcement Learning ⚡ AI Lesson

Learn about Reinforcement Learning from Human Feedback - ChatGPT / RLHF HuggingFace Course

Discover AI Beginner 3y ago

📚 Continue on Coursera External links · Free to audit

View all →

The Science of the Solar System

📚 External: Coursera ↗

The Science of the Solar System

Opens on Coursera ↗

📚 External: Coursera ↗

Algorithms, Data Collection, and Starting to Code

Opens on Coursera ↗

📚 External: Coursera ↗

Optimize with GA & RL

Opens on Coursera ↗

📚 External: Coursera ↗

Introduction to C++ Programming and Unreal

Opens on Coursera ↗

Fundamentals of Reinforcement Learning

📚 External: Coursera ↗

Fundamentals of Reinforcement Learning

Opens on Coursera ↗

Cooking for Busy Healthy People

📚 External: Coursera ↗

Cooking for Busy Healthy People

Opens on Coursera ↗

Interacting with the System and Managing Memory

📚 External: Coursera ↗

Interacting with the System and Managing Memory

Opens on Coursera ↗

A Beginner's Guide to Investing

📚 External: Coursera ↗

A Beginner's Guide to Investing

Opens on Coursera ↗

Everyday Parenting: The ABCs of Child Rearing

📚 External: Coursera ↗

Everyday Parenting: The ABCs of Child Rearing

Opens on Coursera ↗

RStudio for Six Sigma - Process Capability

📚 External: Coursera ↗

RStudio for Six Sigma - Process Capability

Opens on Coursera ↗

A Complete Reinforcement Learning System (Capstone)

📚 External: Coursera ↗

A Complete Reinforcement Learning System (Capstone)

Opens on Coursera ↗

Decision Making and Reinforcement Learning

📚 External: Coursera ↗

Decision Making and Reinforcement Learning

Opens on Coursera ↗

📚 External: Coursera ↗

Aléatoire : une introduction aux probabilités - Partie 2

Opens on Coursera ↗

Sample-based Learning Methods

📚 External: Coursera ↗

Sample-based Learning Methods

Opens on Coursera ↗

📚 External: Coursera ↗

Designing Larger Python Programs for Data Science

Opens on Coursera ↗

Study Skills for University Success

📚 External: Coursera ↗

Study Skills for University Success

Opens on Coursera ↗

Understand and Apply Artificial Intelligence Fundamentals

📚 External: Coursera ↗

Understand and Apply Artificial Intelligence Fundamentals

Opens on Coursera ↗

Marketing Design with Easil

📚 External: Coursera ↗

Marketing Design with Easil

Opens on Coursera ↗