Zhan, Xianyuan

48 publications

AAAI 2025 Are Expressive Models Truly Necessary for Offline RL? Guan Wang, Haoyi Niu, Jianxiong Li, Li Jiang, Jianming Hu, Xianyuan Zhan
ICLR 2025 Data Center Cooling System Optimization Using Offline Reinforcement Learning Xianyuan Zhan, Xiangyu Zhu, Peng Cheng, Xiao Hu, Ziteng He, Hanfei Geng, Jichao Leng, Huiwen Zheng, Chenhui Liu, Tianshun Hong, Yan Liang, Yunxin Liu, Feng Zhao
ICLR 2025 Diffusion-Based Planning for Autonomous Driving with Flexible Guidance Yinan Zheng, Ruiming Liang, Kexin Zheng, Jinliang Zheng, Liyuan Mao, Jianxiong Li, Weihao Gu, Rui Ai, Shengbo Eben Li, Xianyuan Zhan, Jingjing Liu
ICLRW 2025 Diffusion-Based Planning for Autonomous Driving with Flexible Guidance Yinan Zheng, Ruiming Liang, Kexin Zheng, Jinliang Zheng, Liyuan Mao, Jianxiong Li, Weihao Gu, Rui Ai, Shengbo Eben Li, Xianyuan Zhan, Jingjing Liu
ICML 2025 Efficient Robotic Policy Learning via Latent Space Backward Planning Dongxiu Liu, Haoyi Niu, Zhihao Wang, Jinliang Zheng, Yinan Zheng, Zhonghong Ou, Jianming Hu, Jianxiong Li, Xianyuan Zhan
ICLRW 2025 Efficient Robotic Policy Learning via Latent Space Backward Planning Dongxiu Liu, Haoyi Niu, Zhihao Wang, Jinliang Zheng, Yinan Zheng, Zhonghong Ou, Jianming Hu, Jianxiong Li, Xianyuan Zhan
NeurIPS 2025 Flow Matching-Based Autonomous Driving Planning with Advanced Interactive Behavior Modeling Tianyi Tan, Yinan Zheng, Ruiming Liang, Zexu Wang, Kexin Zheng, Jinliang Zheng, Jianxiong Li, Xianyuan Zhan, Jingjing Liu
ICLRW 2025 Pushing the Limit of Sample-Efficient Offline Reinforcement Learning Peng Cheng, Zhihao Wu, Jianxiong Li, Ziteng He, Haoran Xu, Wei Sun, Youfang Lin, Xianyuan Zhan
ICLR 2025 Skill Expansion and Composition in Parameter Space Tenglong Liu, Jianxiong Li, Yinan Zheng, Haoyi Niu, Yixing Lan, Xin Xu, Xianyuan Zhan
NeurIPS 2025 Towards Robust Zero-Shot Reinforcement Learning Kexin Zheng, Lauriane Teyssier, Yinan Zheng, Yu Luo, Xianyuan Zhan
NeurIPS 2025 Uni-RL: Unifying Online and Offline RL via Implicit Value Regularization Haoran Xu, Liyuan Mao, Hui Jin, Weinan Zhang, Xianyuan Zhan, Amy Zhang
CVPR 2025 Universal Actions for Enhanced Embodied Foundation Models Jinliang Zheng, Jianxiong Li, Dongxiu Liu, Yinan Zheng, Zhihao Wang, Zhonghong Ou, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan
ICLRW 2025 Universal Actions for Enhanced Embodied Foundation Models Jinliang Zheng, Jianxiong Li, Dongxiu Liu, Yinan Zheng, Zhihao Wang, Zhonghong Ou, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan
IJCAI 2024 A Comprehensive Survey of Cross-Domain Policy Transfer for Embodied Agents Haoyi Niu, Jianming Hu, Guyue Zhou, Xianyuan Zhan
NeurIPSW 2024 Are Expressive Models Truly Necessary for Offline RL? Guan Wang, Haoyi Niu, Jianxiong Li, Li Jiang, Jianming Hu, Xianyuan Zhan
ICML 2024 DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning Jianxiong Li, Jinliang Zheng, Yinan Zheng, Liyuan Mao, Xiao Hu, Sijie Cheng, Haoyi Niu, Jihao Liu, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan
ICMLW 2024 DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning Jianxiong Li, Jinliang Zheng, Yinan Zheng, Liyuan Mao, Xiao Hu, Sijie Cheng, Haoyi Niu, Jihao Liu, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Xianyuan Zhan
NeurIPS 2024 Diffusion-DICE: In-Sample Diffusion Guidance for Offline Reinforcement Learning Liyuan Mao, Haoran Xu, Xianyuan Zhan, Weinan Zhang, Amy Zhang
NeurIPS 2024 Instruction-Guided Visual Masking Jinliang Zheng, Jianxiong Li, Sijie Cheng, Yinan Zheng, Jiaming Li, Jihao Liu, Yu Liu, Jingjing Liu, Xianyuan Zhan
ICMLW 2024 Instruction-Guided Visual Masking Jinliang Zheng, Jianxiong Li, Sijie Cheng, Yinan Zheng, Jiaming Li, Jihao Liu, Yu Liu, Jingjing Liu, Xianyuan Zhan
ICLR 2024 ODICE: Revealing the Mystery of Distribution Correction Estimation via Orthogonal-Gradient Update Liyuan Mao, Haoran Xu, Weinan Zhang, Xianyuan Zhan
ICML 2024 OMPO: A Unified Framework for RL Under Policy and Dynamics Shifts Yu Luo, Tianying Ji, Fuchun Sun, Jianwei Zhang, Huazhe Xu, Xianyuan Zhan
ICML 2024 Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL Yu Luo, Tianying Ji, Fuchun Sun, Jianwei Zhang, Huazhe Xu, Xianyuan Zhan
ICLR 2024 OpenChat: Advancing Open-Source Language Models with Mixed-Quality Data Guan Wang, Sijie Cheng, Xianyuan Zhan, Xiangang Li, Sen Song, Yang Liu
ICLR 2024 Query-Policy Misalignment in Preference-Based Reinforcement Learning Xiao Hu, Jianxiong Li, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang
NeurIPSW 2024 Robo-MUTUAL: Robotic Multimodal Task Specification via Unimodal Learning Jianxiong Li, Zhihao Wang, Jinliang Zheng, Xiaoai Zhou, Guanming Wang, Guanglu Song, Yu Liu, Jingjing Liu, Ya-Qin Zhang, Junzhi Yu, Xianyuan Zhan
ICLR 2024 Safe Offline Reinforcement Learning with Feasibility-Guided Diffusion Model Yinan Zheng, Jianxiong Li, Dongjie Yu, Yujie Yang, Shengbo Eben Li, Xianyuan Zhan, Jingjing Liu
ICML 2024 Seizing Serendipity: Exploiting the Value of past Success in Off-Policy Actor-Critic Tianying Ji, Yu Luo, Fuchun Sun, Xianyuan Zhan, Jianwei Zhang, Huazhe Xu
NeurIPSW 2024 xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing Haoyi Niu, Qimao Chen, Tenglong Liu, Jianxiong Li, Guyue Zhou, Yi Zhang, Jianming Hu, Xianyuan Zhan
NeurIPS 2023 Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL Peng Cheng, Xianyuan Zhan, Zhihao Wu, Wenjia Zhang, Youfang Lin, Shou cheng Song, Han Wang, Li Jiang
ICMLW 2023 Look Beneath the Surface: Exploiting Fundamental Symmetry for Sample-Efficient Offline RL Peng Cheng, Xianyuan Zhan, Zhihao Wu, Wenjia Zhang, Youfang Lin, Shou cheng Song, Han Wang
ICLR 2023 Mind the Gap: Offline Policy Optimization for Imperfect Rewards Jianxiong Li, Xiao Hu, Haoran Xu, Jingjing Liu, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang
NeurIPS 2023 Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value Regularization Xiangsen Wang, Haoran Xu, Yinan Zheng, Xianyuan Zhan
ICLR 2023 Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization Haoran Xu, Li Jiang, Jianxiong Li, Zhuoran Yang, Zhaoran Wang, Victor Wai Kin Chan, Xianyuan Zhan
ICMLW 2023 Query-Policy Misalignment in Preference-Based Reinforcement Learning Xiao Hu, Jianxiong Li, Xianyuan Zhan, Qing-Shan Jia, Ya-Qin Zhang
ICLR 2023 When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning Jianxiong Li, Xianyuan Zhan, Haoran Xu, Xiangyu Zhu, Jingjing Liu, Ya-Qin Zhang
NeurIPS 2022 A Policy-Guided Imitation Approach for Offline Reinforcement Learning Haoran Xu, Li Jiang, Li Jianxiong, Xianyuan Zhan
ECCV 2022 Adversarial Contrastive Learning via Asymmetric InfoNCE Qiying Yu, Jieming Lou, Xianyuan Zhan, Qizhang Li, Wangmeng Zuo, Yang Liu, Jingjing Liu
AAAI 2022 Constraints Penalized Q-Learning for Safe Offline Reinforcement Learning Haoran Xu, Xianyuan Zhan, Xiangyu Zhu
AAAI 2022 DeepThermal: Combustion Optimization for Thermal Power Generating Units Using Offline Reinforcement Learning Xianyuan Zhan, Haoran Xu, Yue Zhang, Xiangyu Zhu, Honglei Yin, Yu Zheng
CoRL 2022 Discriminator-Guided Model-Based Offline Imitation Learning Wenjia Zhang, Haoran Xu, Haoyi Niu, Peng Cheng, Ming Li, Heming Zhang, Guyue Zhou, Xianyuan Zhan
ICML 2022 Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations Haoran Xu, Xianyuan Zhan, Honglei Yin, Huiling Qin
NeurIPSW 2022 Distance-Sensitive Offline Reinforcement Learning Jianxiong Li, Xianyuan Zhan, Haoran Xu, Xiangyu Zhu, Jingjing Liu, Ya-Qin Zhang
IJCAI 2022 Model-Based Offline Planning with Trajectory Pruning Xianyuan Zhan, Xiangyu Zhu, Haoran Xu
NeurIPSW 2022 Sparse Q-Learning: Offline Reinforcement Learning with Implicit Value Regularization Haoran Xu, Li Jiang, Jianxiong Li, Zhuoran Yang, Zhaoran Wang, Xianyuan Zhan
NeurIPS 2022 When to Trust Your Simulator: Dynamics-Aware Hybrid Offline-and-Online Reinforcement Learning Haoyi Niu, Shubham Sharma, Yiwen Qiu, Ming Li, Guyue Zhou, Jianming Hu, Xianyuan Zhan
NeurIPSW 2021 Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations Haoran Xu, Xianyuan Zhan, Honglei Yin, Huiling Qin
AAAI 2021 Robust Spatio-Temporal Purchase Prediction via Deep Meta Learning Huiling Qin, Songyu Ke, Xiaodu Yang, Haoran Xu, Xianyuan Zhan, Yu Zheng