ML Anthology
Authors
Search
About
Zhu, Yuanheng
9 publications
ICML
2025
Constrained Exploitability Descent: An Offline Reinforcement Learning Method for Finding Mixed-Strategy Nash Equilibrium
Runyu Lu
,
Yuanheng Zhu
,
Dongbin Zhao
ICML
2025
DipLLM: Fine-Tuning LLM for Strategic Decision-Making in Diplomacy
Kaixuan Xu
,
Jiajun Chai
,
Sicheng Li
,
Yuqian Fu
,
Yuanheng Zhu
,
Dongbin Zhao
ICLR
2025
Divergence-Regularized Discounted Aggregation: Equilibrium Finding in Multiplayer Partially Observable Stochastic Games
Runyu Lu
,
Yuanheng Zhu
,
Dongbin Zhao
ICLR
2025
Empowering LLM Agents with Zero-Shot Optimal Decision-Making Through Q-Learning
Jiajun Chai
,
Sicheng Li
,
Yuqian Fu
,
Dongbin Zhao
,
Yuanheng Zhu
NeurIPS
2025
Equilibrium Policy Generalization: A Reinforcement Learning Framework for Cross-Graph Zero-Shot Generalization in Pursuit-Evasion Games
Runyu Lu
,
Peng Zhang
,
Ruochuan Shi
,
Yuanheng Zhu
,
Dongbin Zhao
,
Yang Liu
,
Dong Wang
,
Cesare Alippi
ICLR
2025
INS: Interaction-Aware Synthesis to Enhance Offline Multi-Agent Reinforcement Learning
Yuqian Fu
,
Yuanheng Zhu
,
Jian Zhao
,
Jiajun Chai
,
Dongbin Zhao
NeurIPS
2025
Learning and Planning Multi-Agent Tasks via an MoE-Based World Model
Zijie Zhao
,
Zhongyue Zhao
,
Kaixuan Xu
,
Yuqian Fu
,
Jiajun Chai
,
Yuanheng Zhu
,
Dongbin Zhao
NeurIPSW
2024
Empowering LLM Agents with Zero-Shot Optimal Decision-Making Through Q-Learning
Jiajun Chai
,
Sicheng Li
,
Yuqian Fu
,
Dongbin Zhao
,
Yuanheng Zhu
NeurIPS
2024
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model Disentanglement
Zhi Wang
,
Li Zhang
,
Wenhao Wu
,
Yuanheng Zhu
,
Dongbin Zhao
,
Chunlin Chen