Dissecting “Reinforcement Learning” by Richard S. Sutton with custom Python implementations, Episode V
Originally appeared here:
Introducing n-Step Temporal-Difference Methods
Go Here to Read this Fast! Introducing n-Step Temporal-Difference Methods