Parmas, Paavo
8 publications
ICLR
2026
Does “Do Differentiable Simulators Give Better Policy Gradients?” Give Better Policy Gradients?
ICLR
2025
Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form
8 publications