Lu, Brian

1 publications

ICLR 2026 Generalization of RLVR Using Causal Reasoning as a Testbed Brian Lu, Hongyu Zhao, Shuo Sun, Hao Peng, Rui Ding, Hongyuan Mei