On the theory of policy gradient methods: Optimality, approximation, and distribution shift
Authors
Alekh Agarwal, Sham M Kakade, Jason D Lee, Gaurav Mahajan
Publication date
2021
Journal
Journal of Machine Learning Research
Volume
22
Issue
98
Pages
1-76
Scholar articles
A Agarwal, SM Kakade, JD Lee, G Mahajan - Journal of Machine Learning Research, 2021
A Agarwal, SM Kakade, JD Lee, G Mahajan - Conference on Learning Theory, 2020
A Agarwal, SM Kakade, JD Lee, G Mahajan - 2020