Notebooks for Reinforcement Learning: An Introduction second edition
Github
Repo
Method Summary Notebooks
Tabular Methods Overview
Chapter Overview Notebooks
Part 1: Tabular Methods
Chapter 1: Introduction
Chapter 2: Multi-armed Bandits
Chapter 3: Finite Markov Decision Processes
Chapter 4: Dynamic Programming
Chapter 5: Monte Carlo Methods
Chapter 6: Temporal Difference Learning
Chapter 7: n-Step Bootstrapping
Chapter 8: Planning & Learning with Tabular Methods
Part 2: Approximate Solution Methods
Chapter 9: On-policy Prediction with Approximation
Chapter 10: On-policy Control with Approximation
Chapter 11:Off-policy Methods with Approximation