Ke, Yijing

1 publications

ICLR 2026 Frozen Policy Iteration: Computationally Efficient RL Under Linear $Q^{\pi}$ Realizability for Deterministic Dynamics Yijing Ke, Zihan Zhang, Ruosong Wang