ML Anthology
Authors
Search
About
Feng, Xidong
18 publications
ICLR
2025
Efficient Reinforcement Learning with Large Language Model Priors
Xue Yan
,
Yan Song
,
Xidong Feng
,
Mengyue Yang
,
Haifeng Zhang
,
Haitham Bou Ammar
,
Jun Wang
NeurIPS
2025
Generating Creative Chess Puzzles
Xidong Feng
,
Vivek Veeriah
,
Marcus Chiam
,
Michael D Dennis
,
Federico Barbero
,
Johan Obando-Ceron
,
Jiaxin Shi
,
Satinder Singh
,
Shaobo Hou
,
Nenad Tomasev
,
Tom Zahavy
ICLRW
2025
Natural Language Reinforcement Learning
Xidong Feng
,
Bo Liu
,
Ziyu Wan
,
Haotian Fu
,
Girish A. Koushik
,
Zhiyuan Hu
,
Mengyue Yang
,
Ying Wen
,
Jun Wang
ICML
2024
AlphaZero-like Tree-Search Can Guide Large Language Model Decoding and Training
Ziyu Wan
,
Xidong Feng
,
Muning Wen
,
Stephen Marcus Mcaleer
,
Ying Wen
,
Weinan Zhang
,
Jun Wang
JMLR
2024
Heterogeneous-Agent Reinforcement Learning
Yifan Zhong
,
Jakub Grudzien Kuba
,
Xidong Feng
,
Siyi Hu
,
Jiaming Ji
,
Yaodong Yang
NeurIPS
2024
Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in LLMs
Zhiyuan Hu
,
Chumin Liu
,
Xidong Feng
,
Yilun Zhao
,
See-Kiong Ng
,
Anh Tuan Luu
,
Junxian He
,
Pang Wei Koh
,
Bryan Hooi
ICLRW
2024
Uncertainty of Thoughts: Uncertainty-Aware Planning Enhances Information Seeking in Large Language Models
Zhiyuan Hu
,
Chumin Liu
,
Xidong Feng
,
Yilun Zhao
,
See-Kiong Ng
,
Anh Tuan Luu
,
Junxian He
,
Pang Wei Koh
,
Bryan Hooi
NeurIPSW
2023
AlphaZero-like Tree-Search Can Guide Large Language Model Decoding and Training
Xidong Feng
,
Ziyu Wan
,
Muning Wen
,
Ying Wen
,
Weinan Zhang
,
Jun Wang
NeurIPS
2023
ChessGPT: Bridging Policy Learning and Language Modeling
Xidong Feng
,
Yicheng Luo
,
Ziyan Wang
,
Hongrui Tang
,
Mengyue Yang
,
Kun Shao
,
David Mguni
,
Yali Du
,
Jun Wang
ICML
2023
MANSA: Learning Fast and Slow in Multi-Agent Systems
David Henry Mguni
,
Haojun Chen
,
Taher Jafferjee
,
Jianhong Wang
,
Longfei Yue
,
Xidong Feng
,
Stephen Marcus Mcaleer
,
Feifei Tong
,
Jun Wang
,
Yaodong Yang
MLOSS
2023
TorchOpt: An Efficient Library for Differentiable Optimization
Jie Ren
,
Xidong Feng
,
Bo Liu
,
Xuehai Pan
,
Yao Fu
,
Luo Mai
,
Yaodong Yang
NeurIPS
2022
A Theoretical Understanding of Gradient Bias in Meta-Reinforcement Learning
Bo Liu
,
Xidong Feng
,
Jie Ren
,
Luo Mai
,
Rui Zhu
,
Haifeng Zhang
,
Jun Wang
,
Yaodong Yang
NeurIPSW
2022
Contextual Transformer for Offline Meta Reinforcement Learning
Runji Lin
,
Ye Li
,
Xidong Feng
,
Zhaowei Zhang
,
Xian Hong Wu Fung
,
Haifeng Zhang
,
Jun Wang
,
Yali Du
,
Yaodong Yang
NeurIPSW
2022
TorchOpt: An Efficient Library for Differentiable Optimization
Jie Ren
,
Xidong Feng
,
Bo Liu
,
Xuehai Pan
,
Yao Fu
,
Luo Mai
,
Yaodong Yang
NeurIPS
2022
Towards Human-Level Bimanual Dexterous Manipulation with Reinforcement Learning
Yuanpei Chen
,
Tianhao Wu
,
Shengjie Wang
,
Xidong Feng
,
Jiechuan Jiang
,
Zongqing Lu
,
Stephen McAleer
,
Hao Dong
,
Song-Chun Zhu
,
Yaodong Yang
NeurIPS
2021
Neural Auto-Curricula in Two-Player Zero-Sum Games
Xidong Feng
,
Oliver Slumbers
,
Ziyu Wan
,
Bo Liu
,
Stephen McAleer
,
Ying Wen
,
Jun Wang
,
Yaodong Yang
AAAI
2021
Towards Effective Context for Meta-Reinforcement Learning: An Approach Based on Contrastive Learning
Haotian Fu
,
Hongyao Tang
,
Jianye Hao
,
Chen Chen
,
Xidong Feng
,
Dong Li
,
Wulong Liu
AAAI
2020
MRI Reconstruction with Interpretable Pixel-Wise Operations Using Reinforcement Learning
Wentian Li
,
Xidong Feng
,
Haotian An
,
Xiang Yao Ng
,
Yu-Jin Zhang