Zhu, Yuanheng

9 publications

ICML 2025 Constrained Exploitability Descent: An Offline Reinforcement Learning Method for Finding Mixed-Strategy Nash Equilibrium Runyu Lu, Yuanheng Zhu, Dongbin Zhao
ICML 2025 DipLLM: Fine-Tuning LLM for Strategic Decision-Making in Diplomacy Kaixuan Xu, Jiajun Chai, Sicheng Li, Yuqian Fu, Yuanheng Zhu, Dongbin Zhao
ICLR 2025 Divergence-Regularized Discounted Aggregation: Equilibrium Finding in Multiplayer Partially Observable Stochastic Games Runyu Lu, Yuanheng Zhu, Dongbin Zhao
ICLR 2025 Empowering LLM Agents with Zero-Shot Optimal Decision-Making Through Q-Learning Jiajun Chai, Sicheng Li, Yuqian Fu, Dongbin Zhao, Yuanheng Zhu
NeurIPS 2025 Equilibrium Policy Generalization: A Reinforcement Learning Framework for Cross-Graph Zero-Shot Generalization in Pursuit-Evasion Games Runyu Lu, Peng Zhang, Ruochuan Shi, Yuanheng Zhu, Dongbin Zhao, Yang Liu, Dong Wang, Cesare Alippi
ICLR 2025 INS: Interaction-Aware Synthesis to Enhance Offline Multi-Agent Reinforcement Learning Yuqian Fu, Yuanheng Zhu, Jian Zhao, Jiajun Chai, Dongbin Zhao
NeurIPS 2025 Learning and Planning Multi-Agent Tasks via an MoE-Based World Model Zijie Zhao, Zhongyue Zhao, Kaixuan Xu, Yuqian Fu, Jiajun Chai, Yuanheng Zhu, Dongbin Zhao
NeurIPSW 2024 Empowering LLM Agents with Zero-Shot Optimal Decision-Making Through Q-Learning Jiajun Chai, Sicheng Li, Yuqian Fu, Dongbin Zhao, Yuanheng Zhu
NeurIPS 2024 Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement Zhi Wang, Li Zhang, Wenhao Wu, Yuanheng Zhu, Dongbin Zhao, Chunlin Chen