Jiang, Yuhua

8 publications

ICLR 2026 GlobeDiff: State Diffusion Process for Partial Observability in Multi-Agent System Yiqin Yang, Xu Yang, Yuhua Jiang, Ni Mu, Hao Hu, Runpeng Xie, Ziyou Zhang, Siyuan Li, Yuan-Hua Ni, Qianchuan Zhao, Bo Xu
ICLR 2026 OPRIDE: Efficient Offline Preference-Based Reinforcement Learning via In-Dataset Exploration Yiqin Yang, Hao Hu, Yihuan Mao, Jin Zhang, Chengjie Wu, Yuhua Jiang, Xu Yang, Runpeng Xie, Yi Fan, Bo Liu, Yang Gao, Bo Xu, Chongjie Zhang
ICLR 2026 Risk-Sensitive Reinforcement Learning for Alleviating Exploration Dilemmas in Large Language Models Yuhua Jiang, Jiawei Huang, Yufeng Yuan, Xin Mao, Yu Yue, Qianchuan Zhao, Lin Yan
ICLR 2025 Episodic Novelty Through Temporal Distance Yuhua Jiang, Qihan Liu, Yiqin Yang, Xiaoteng Ma, Dianyu Zhong, Hao Hu, Jun Yang, Bin Liang, Bo Xu, Chongjie Zhang, Qianchuan Zhao
ICLR 2025 Fewer May Be Better: Enhancing Offline Reinforcement Learning with Reduced Dataset Yiqin Yang, Quanwei Wang, Chenghao Li, Hao Hu, Chengjie Wu, Yuhua Jiang, Dianyu Zhong, Ziyou Zhang, Qianchuan Zhao, Chongjie Zhang, Bo Xu
NeurIPSW 2024 Episodic Novelty Through Temporal Distance Yuhua Jiang, Qihan Liu, Yiqin Yang, Xiaoteng Ma, Dianyu Zhong, Bo Xu, Jun Yang, Bin Liang, Chongjie Zhang, Qianchuan Zhao
AAAI 2024 Learning Diverse Risk Preferences in Population-Based Self-Play Yuhua Jiang, Qihan Liu, Xiaoteng Ma, Chenghao Li, Yiqin Yang, Jun Yang, Bin Liang, Qianchuan Zhao
NeurIPS 2024 NeuralPlane: An Efficiently Parallelizable Platform for Fixed-Wing Aircraft Control with Reinforcement Learning Chuanyi Xue, Qihan Liu, Xiaoteng Ma, Yang Qi, Xinyao Qin, Yuhua Jiang, Ning Gui, Jinsheng Ren, Bin Liang, Jun Yang