Zhang, Chuheng

11 publications

ICML 2025 AdaptiveStep: Automatically Dividing Reasoning Step Through Model Confidence Yuliang Liu, Junjie Lu, Chaofeng Qu, Zhaoling Chen, Zefan Cai, Jason Klein Liu, Chonghan Liu, Yunhui Xia, Li Zhao, Jiang Bian, Chuheng Zhang, Wei Shen, Zhouhan Lin
ICML 2025 Policy Filtration for RLHF to Mitigate Noise in Reward Models Chuheng Zhang, Wei Shen, Li Zhao, Xuyun Zhang, Xiaolong Xu, Wanchun Dou, Jiang Bian
NeurIPS 2025 What Do Latent Action Models Actually Learn? Chuheng Zhang, Tim Pearce, Pushi Zhang, Kaixin Wang, Xiaoyu Chen, Wei Shen, Li Zhao, Jiang Bian
IJCAI 2024 Diversification of Adaptive Policy for Effective Offline Reinforcement Learning Yunseon Choi, Li Zhao, Chuheng Zhang, Lei Song, Jiang Bian, Kee-Eung Kim
ICLR 2024 Whittle Index with Multiple Actions and State Constraint for Inventory Management Chuheng Zhang, Xiangsen Wang, Wei Jiang, Xianliang Yang, Siwei Wang, Lei Song, Jiang Bian
AAAI 2023 RePreM: Representation Pre-Training with Masked Model for Reinforcement Learning Yuanying Cai, Chuheng Zhang, Wei Shen, Xuyun Zhang, Wenjie Ruan, Longbo Huang
ICML 2023 Robust Situational Reinforcement Learning in Face of Context Disturbances Jinpeng Zhang, Yufeng Zheng, Chuheng Zhang, Li Zhao, Lei Song, Yuan Zhou, Jiang Bian
IJCAI 2023 Towards Generalizable Reinforcement Learning for Trade Execution Chuheng Zhang, Yitong Duan, Xiaoyu Chen, Jianyu Chen, Jian Li, Li Zhao
AAAI 2021 Exploration by Maximizing Rényi Entropy for Reward-Free RL Framework Chuheng Zhang, Yuanying Cai, Longbo Huang, Jian Li
ICLR 2021 Return-Based Contrastive Representation Learning for Reinforcement Learning Guoqing Liu, Chuheng Zhang, Li Zhao, Tao Qin, Jinhua Zhu, Jian Li, Nenghai Yu, Tie-Yan Liu
AAAI 2020 Policy Search by Target Distribution Learning for Continuous Control Chuheng Zhang, Yuanqi Li, Jian Li