Tan, Zheyue

1 publications

ICLR 2026 MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs Huining Yuan, Zelai Xu, Zheyue Tan, Xiangmin Yi, Mo Guang, Kaiwen Long, Haojia Hui, Boxun Li, Xinlei Chen, Bo Zhao, Xiao-Ping Zhang, Chao Yu, Yu Wang