Pathak, Eshika

1 publications

TMLR 2026 Natural Policy Gradient for Average Reward Non-Stationary Reinforcement Learning Neharika Jali, Eshika Pathak, Pranay Sharma, Guannan Qu, Gauri Joshi