Kesari, Amit

1 publications

TMLR 2026 Mitigating Steady-State Bias in Off-Policy TD Learning via Distributional Correction Emani Naga Sai Venkata Sowmya, Amit Kesari, Ajin George Joseph