Wang, Xihuai

4 publications

ICLR 2026 SRFT: A Single-Stage Method with Supervised and Reinforcement Fine-Tuning for Reasoning Yuqian Fu, Tinghong Chen, Jiajun Chai, Xihuai Wang, Songjun Tu, Guojun Yin, Wei Lin, Qichao Zhang, Yuanheng Zhu, Dongbin Zhao
NeurIPS 2024 ZSC-Eval: An Evaluation Toolkit and Benchmark for Multi-Agent Zero-Shot Coordination Xihuai Wang, Shao Zhang, Wenhao Zhang, Wentao Dong, Jingxiao Chen, Ying Wen, Weinan Zhang
ICLR 2023 Order Matters: Agent-by-Agent Policy Optimization Xihuai Wang, Zheng Tian, Ziyu Wan, Ying Wen, Jun Wang, Weinan Zhang
IJCAI 2021 Model-Based Multi-Agent Policy Optimization with Adaptive Opponent-Wise Rollouts Weinan Zhang, Xihuai Wang, Jian Shen, Ming Zhou