Wu, Mian

1 publications

ICLR 2026 RLAC: Reinforcement Learning with Adversarial Critic for Free-Form Generation Tasks Mian Wu, Gavin Zhang, Sewon Min, Sergey Levine, Aviral Kumar