A Paper a Week - Week 18
Introduction For the eighteenth post in this series, I read “Policy Gradient Methods for Reinforcement Learning with Function Approximation” by Sutton et al. This paper presents the famous “policy gradient theorem” which is the basis for policy gradient methods in reinforcement learning. The authors show that the gradient of a...
[Read More]