Zha, Yuqi

1 publication

ICML 2025. Provable Policy Gradient for Robust Average-Reward MDPs Beyond Rectangularity. Qiuhao Wang, Yuqi Zha, Chin Pang Ho, Marek Petrik.