Thomas, Valentin
11 publications
NeurIPS
2022
On the Role of Overparameterization in Off-Policy Temporal Difference Learning with Linear Function Approximation
ICML
2021
Beyond Variance Reduction: Understanding the True Impact of Baselines on Policy Optimization