Sample-based Learning Methods

External: Coursera Courses ↗ · Coursera

Open Course on External: Coursera

Free to audit · Opens on External: Coursera

Sample-based Learning Methods

Coursera · Beginner ·🎮 Reinforcement Learning ·3mo ago
In this course, you will learn about several algorithms that can learn near optimal policies based on trial and error interaction with the environment---learning from the agent’s own experience. Learning from actual experience is striking because it requires no prior knowledge of the environment’s dynamics, yet can still attain optimal behavior. We will cover intuitively simple but powerful Monte Carlo methods, and temporal difference learning methods including Q-learning. We will wrap up this course investigating how we can get the best of both worlds: algorithms that can combine model-based planning (similar to dynamic programming) and temporal difference updates to radically accelerate learning. By the end of this course you will be able to: - Understand Temporal-Difference learning and Monte Carlo as two strategies for estimating value functions from sampled experience - Understand the importance of exploration, when using sampled experience rather than dynamic programming sweeps within a model - Understand the connections between Monte Carlo and Dynamic Programming and TD. - Implement and apply the TD algorithm, for estimating value functions - Implement and apply Expected Sarsa and Q-learning (two TD methods for control) - Understand the difference between on-policy and off-policy control - Understand planning with simulated experience (as opposed to classic planning strategies) - Implement a model-based approach to RL, called Dyna, which uses simulated experience - Conduct an empirical study to see the improvements in sample efficiency when using Dyna
Watch on External: Coursera ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Deep Q-Networks — When the Q-Table Won’t Fit
Learn to implement Deep Q-Networks in Python for reinforcement learning problems where the Q-table won't fit, and understand their benefits over traditional Q-learning
Medium · Python
Reward hacking in Reinforcement learning
Learn to identify and fix reward hacking in Reinforcement Learning, a crucial step in ensuring reliable AI decision-making
Medium · LLM
Learning by messing up: A beginner’s tour of Reinforcement Learning
Learn the basics of Reinforcement Learning, from agents and rewards to the Markov property and Gym environments, and start building your own RL projects
Medium · Deep Learning
Any suggestions on this RL Fortnite bot model?
Learn to implement a basic RL model for a Fortnite bot using action dictionaries and simulation, and understand how to map indices to readable strings for direction, healing, and firing actions
Reddit r/deeplearning
Up next
Synthetic Data Generation for SQL Commands: Step-by-Step Tutorial
UBIAI
Watch →