Feng, Xidong

18 publications

ICLR 2025 Efficient Reinforcement Learning with Large Language Model Priors Xue Yan, Yan Song, Xidong Feng, Mengyue Yang, Haifeng Zhang, Haitham Bou Ammar, Jun Wang
NeurIPS 2025 Generating Creative Chess Puzzles Xidong Feng, Vivek Veeriah, Marcus Chiam, Michael D Dennis, Federico Barbero, Johan Obando-Ceron, Jiaxin Shi, Satinder Singh, Shaobo Hou, Nenad Tomasev, Tom Zahavy
ICLRW 2025 Natural Language Reinforcement Learning Xidong Feng, Bo Liu, Ziyu Wan, Haotian Fu, Girish A. Koushik, Zhiyuan Hu, Mengyue Yang, Ying Wen, Jun Wang
ICML 2024 AlphaZero-like Tree-Search Can Guide Large Language Model Decoding and Training Ziyu Wan, Xidong Feng, Muning Wen, Stephen Marcus McAleer, Ying Wen, Weinan Zhang, Jun Wang
JMLR 2024 Heterogeneous-Agent Reinforcement Learning Yifan Zhong, Jakub Grudzien Kuba, Xidong Feng, Siyi Hu, Jiaming Ji, Yaodong Yang
NeurIPS 2024 Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in LLMs Zhiyuan Hu, Chumin Liu, Xidong Feng, Yilun Zhao, See-Kiong Ng, Anh Tuan Luu, Junxian He, Pang Wei Koh, Bryan Hooi
ICLRW 2024 Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models Zhiyuan Hu, Chumin Liu, Xidong Feng, Yilun Zhao, See-Kiong Ng, Anh Tuan Luu, Junxian He, Pang Wei Koh, Bryan Hooi
NeurIPSW 2023 AlphaZero-like Tree-Search Can Guide Large Language Model Decoding and Training Xidong Feng, Ziyu Wan, Muning Wen, Ying Wen, Weinan Zhang, Jun Wang
NeurIPS 2023 ChessGPT: Bridging Policy Learning and Language Modeling Xidong Feng, Yicheng Luo, Ziyan Wang, Hongrui Tang, Mengyue Yang, Kun Shao, David Mguni, Yali Du, Jun Wang
ICML 2023 MANSA: Learning Fast and Slow in Multi-Agent Systems David Henry Mguni, Haojun Chen, Taher Jafferjee, Jianhong Wang, Longfei Yue, Xidong Feng, Stephen Marcus McAleer, Feifei Tong, Jun Wang, Yaodong Yang
MLOSS 2023 TorchOpt: An Efficient Library for Differentiable Optimization Jie Ren, Xidong Feng, Bo Liu, Xuehai Pan, Yao Fu, Luo Mai, Yaodong Yang
NeurIPS 2022 A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning Bo Liu, Xidong Feng, Jie Ren, Luo Mai, Rui Zhu, Haifeng Zhang, Jun Wang, Yaodong Yang
NeurIPSW 2022 Contextual Transformer for Offline Meta Reinforcement Learning Runji Lin, Ye Li, Xidong Feng, Zhaowei Zhang, Xian Hong Wu Fung, Haifeng Zhang, Jun Wang, Yali Du, Yaodong Yang
NeurIPSW 2022 TorchOpt: An Efficient Library for Differentiable Optimization Jie Ren, Xidong Feng, Bo Liu, Xuehai Pan, Yao Fu, Luo Mai, Yaodong Yang
NeurIPS 2022 Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning Yuanpei Chen, Tianhao Wu, Shengjie Wang, Xidong Feng, Jiechuan Jiang, Zongqing Lu, Stephen McAleer, Hao Dong, Song-Chun Zhu, Yaodong Yang
NeurIPS 2021 Neural Auto-Curricula in Two-Player Zero-Sum Games Xidong Feng, Oliver Slumbers, Ziyu Wan, Bo Liu, Stephen McAleer, Ying Wen, Jun Wang, Yaodong Yang
AAAI 2021 Towards Effective Context for Meta-Reinforcement Learning: An Approach Based on Contrastive Learning Haotian Fu, Hongyao Tang, Jianye Hao, Chen Chen, Xidong Feng, Dong Li, Wulong Liu
AAAI 2020 MRI Reconstruction with Interpretable Pixel-Wise Operations Using Reinforcement Learning Wentian Li, Xidong Feng, Haotian An, Xiang Yao Ng, Yu-Jin Zhang